Related Work Across Scenarios
From XG Provenance Wiki
PLEASE DO NOT EDIT THIS PAGE: Instead, edit the individual scenario pages.
Contents
Related Work in the Three Flagship Scenarios
This section contains a summary of the related work cited in the original use cases that were used to create the three flagship scenarios.
News Aggregator
- Use_Case_Retweets
- An excellent discussion of the issues with retweeting and the introduction of retweet functionality can be read in a blog post by Evan Williams: Why Retweet works the way it does
- Use_Case_Provenance_in_Blogosphere
- The SIOC project has developed a vocabulary for representing posts. This vocabulary is often used together with FOAF (that represent information about the physical person related to a sioc:User, e.g. its name, lastname, phone, social network, etc.) and SKOS, used mainly to represent topics and taxonomy relationships between these topics.
- Use_Case_Creative_Commons
- Use_Case_Mapping_Digital_Rights
- The Open Digital Rights Language (ODRL): http://odrl.net/
- Creative Commons (CC): http://creativecommons.org/
- Digital Rights Management (DRM): http://en.wikipedia.org/wiki/Digital_rights_management
- liblicense: http://wiki.creativecommons.org/Liblicense
- Use_Case_Attribution_for_a_Versioned_Document
- None indicated
- Use_Case_Identifying_Attribution_And_Associations
- [Gil and Ratnakar ISWC02] describe an approach to enable users to express their assessment of complementary and contradictory sources of information. As the user considers information from different sources relevant to their purpose, they can view the ratings that other users assigned to the entities involved, and use those ratings to assess the information at hand. Sources were assigned a reliability rating, and individual sources could be selected to express the criteria used to accept or dismiss information. The user could also assign credibility ratings based on other information available.
Disease Outbreak
- From Use_Case_Evidence_for_Public_Policy
- Use case drawn from work reported by Peter Edwards and Lorna Philip, University of Aberdeen. (See http://wiki.esi.ac.uk/UseCasesForProvenanceWorkshop).
- From Use_Case_Provenance_of_Decision_Making_Emergency_Response
- None indicated
- From Use_Case_Provenance_for_IQ
- D. Stead, N. Paton, P. Missier, S. Embury, C. Hedeler, B. Jin, A. Brown, and A. Preece, "Information Quality in Proteomics," Briefings in Bioinformatics, vol. 9, 2008, pp. 174-188.
- P. Missier, S.M. Embury, R.M. Greenwood, A.D. Preece, and B. Jin, "Managing information quality in e-science: the Qurator workbench," SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, New York, NY, USA: ACM, 2007, pp. 1150-1152.
- From Domain_Specific_Provenance_2
- From Use_Case_Result_Differences
- An approach to addressing this use case is discussed in the paper Recording and Using Provenance in a Protein Compressibility Experiment
- From Closure_of_Experimental_Metadata
- None indicated
- From Use_Case_private_data_use
- Aldeco-Pérez, R. & Moreau, L. Provenance-based Auditing of Private Data Use International Academic Research Conference, Visions of Computer Science, 2008 [1]
Business Contract
- Use_Case_Fulfilling_Contractual_Obligations
- There is work across electronic records, e-notbooks, LIMS and asset management systems, workflow, e-Science, and semantic web commuities that address parts of this scenario. I've drawn from experience as part of the Collaborative Electronic Notebook Systems Association (censa.org, 1998-2008) where many requirements for documentation of scientific research and analyical sample processing in the Chemical and Pharmaceutical industries were discussed in the context of FDA regulatons, patent policies, and rules of legal evidence.
- Use_Case_Evidence_for_Engineering_Design
- This is (very loosely) based on discussion of provenance in engineering design by Alex Ball (see http://wiki.esi.ac.uk/UseCasesForProvenanceWorkshop).
- Toyota Recall
- Use_Case_Hidden_Bug
- None indicated
- Use_Case_Crosswalk_Maintenance
- Working examples by means of RDF Reification can be found here: DC-09 conference article
- Use_Case_Metadata_Merging
- Working examples by means of RDF Reification can be found here: DC-09 conference article
- Use_Case_Linked_Data_Timeliness
- [Hartig and Zhao SWPM09] describe an approach to develop a timeliness assessment method for Web data.
Related Work Compiled by the Group that Needs to be Incorporated into the Related Work Section of the Scenarios
Related Work from Other Use Cases
- Anonymous Information
- None indicated
- Information Quality Assessment for Linked Data
- Felix Naumann: Quality-Driven Query Answering for Integrated Information Systems. Springer Berlin / Heidelberg, 2002.
- Christian Bizer: Quality-Driven Information Filtering in the Context of Web-Based Information Systems. Thesis, Freie Universität Berlin, 2007.
- Tim Berners-Lee: Cleaning Up the User Interface, Section: The "Oh,yeah?"-Button, 1997.
- Olaf Hartig: Querying Trust in RDF Data with tSPARQL. In Proceedings of the 6th European Semantic Web Conference (ESWC), Heraklion, Greece, June 2009
- Olaf Hartig: Provenance Information in the Web of Data. In Proceedings of the Linked Data on the Web (LDOW) Workshop at WWW, Madrid, Spain, April 2009 Download PDF
- Simple Trustworthiness Assessment
- Hartig ESWC09 describes tSPARQL which is a trust-aware extension to the query language SPARQL. tSPARQL allows to describe trust requirements in SPARQL queries. Using tSPARQL an application can filter (intermediate) solutions for graph patterns in SPARQL queries based on the trustworthiness of the data from which the solutions originate. The tRDF4Jena library provides a query engine for tSPARQL.
- Ignoring Unreliable Data
- See WIQA framework on IQ in Linked Data main page.
- Answering user queries that require semantically annotated provenance
- We are aware of an early prototype where domain-specific provenance is added to OPM, and such semantics-augmented OPM is represented using RDF. This is described in a paper presented at the SWPM'09 workshop (ISWC'09): SWPM'09 paper
- Semantic extensions to OPM have also been recently proposed in this paper, presented at the 2009 All Hands Meeting, Oxford, UK.
- Additionally, [KDG+08] describes reasoning about semantic properties of datasets in the workflow as part of provenance records. [GGR+09] describes how this is done for the case of data collections.
- Using process provenance for assessing the quality of Information products
- D. Stead, N. Paton, P. Missier, S. Embury, C. Hedeler, B. Jin, A. Brown, and A. Preece, "Information Quality in Proteomics," Briefings in Bioinformatics, vol. 9, 2008, pp. 174-188.
- P. Missier, S.M. Embury, R.M. Greenwood, A.D. Preece, and B. Jin, "Managing information quality in e-science: the Qurator workbench," SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, New York, NY, USA: ACM, 2007, pp. 1150-1152.
- Provenance of Collections vs Objects in Cultural Heritage
- None indicated
- Provenance at different levels in Cultural Heritage
- None indicated
- Documenting axiom formulation
- The use case is described in terms of the use of semantic web ontologies and data, but its motivation comes from uses of ontologies for engineering problems. Consider for example a system developed to estimate the duration of carrying out specific engineering tasks, such as repairing a damaged road or leveling uneven terrain. Users invariably wanted explanations about where the answers came from in terms of the sources we consulted and the sources that we chose to pursue. They wanted to know whether well-known engineering manuals were consulted, which were given more weight, whether practical experience was considered to refine theoretical estimates, and what authoritative sources were consulted to decide among competing recommendations. In other words, the analysis process that knowledge engineers/developers perform is part of the rationale that needs to be captured in order to justify answers to user queries.
- [Gil EKAW 02] describes a tool that enables knowledge base developers to keep track of the knowledge sources and intermediate knowledge fragments that result in a formalized piece of knowledge. The resulting ontology is enhanced with pointers that capture the rationale of its design and development.
- Provenance for Environmental Marine Data
- J. Carroll, C. Bizer, P. Hayes, and P. Stickler. Named graphs, Provenance and Trust. In WWW, 2005.
- P. Pediaditis, G. Flouris, I. Fundulaki, and V. Christophides. On Explicit Provenance Management in RDF/S Graphs. In TAPP, 2009.
- G. Flouris, I. Fundulaki, P. Pediaditis, Y. Theoharis, and V. Christophides. Coloring RDF Triples to Capture Provenance. In ISWC, 2009.
- PSPARQL. psparql.inrialpes.fr.
- J. Perez, M. Arenas, and C. Gutierrez. nSPARQL: A Navigational Language for RDF. In ISWC, 2008.
- P. Buneman, J. Cheney, and S. Vansummeren. On the Expressiveness of Implicit Provenance in Query and Update Languages. In ICDT, 2007.
- T. J. Green, G. Karvounarakis, and V. Tannen. Provenance semirings. In PODS, 2007.
- Simon Schenk Steffen Staab. Networked Graphs: A Declarative Mechanism for SPARQL Rules, SPARQL Views and RDF Data Integration on the Web. In WWW 2008.
- Computer Assisted Research
- None indicated
- Handling Scientific Measurement Anomaly
- None indicated
- Human-Executed Processes
- None indicated
- Semantic disambiguation of data provider identity
- P. Bouquet, T. Palpanas, H. Stoermer, and M. Vignolo, "A Conceptual Model for a Web-scale Entity Name System," in ASWC, Shanghai, China, 2009
- http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
- A. Jaffri, H. Glaser, and I. Millard. Uri identity management for semantic web data integration and linkage. In 3rd International Workshop On Scalable Semantic Web Knowledge Base Systems. Springer, 2007
- P. Bouquet, H. Stoermer “OKKAM : Enabling an Entity Name System for the Semantic Web” in: Proceedings of the I-ESA2008 Workshop on Semantic Interoperability, 2008
Related Work from State-Of-The-Art Presentations
- Open Provenance Model (OPM)
- OPM Core specification v1.1
- The Open Provenance Vision (in chapter 5).
- A presentation on OPM
- Current XML schema
- Current OWL ontology
- Formalisation of OPM (with Natalia Kwasnikowska and Jan Van den Bussche)
- Early proposal to map Dublin Core to OPM (by Simon Miles)
- Early proposal for collections in OPM (by Luc Moreau, Paolo Missier, Paul Groth, Simon Miles)
- Provenance in databases
- Buneman, P. 2006. How to cite curated databases and how to make them citable. SSDBM 2006:195-203.
- Buneman, P., Cheney, J., Tan, W., and Vansummeren, S. 2008. Curated databases. PODS 2008: 1-12.
- Buneman, P. and Tan, W-C. Provenance in databases. SIGMOD 2007: 1171-1173.
- Susan B. Davidson, Juliana Freire: Provenance and scientific workflows: challenges and opportunities. SIGMOD 2008:1345-1350
- Named graphs
- Jeremy Carroll, Christian Bizer, Patrick Hayes, Patrick Stickler. Named Graphs. Journal of Web Semantics, Vol. 3, Issue 4, 2005.
- Christian Bizer, Richard Cyganiak. Quality-driven information filtering using the WIQA policy framework. Journal of Web Semantics, Vol. 7, Issue 1, 2009.
- SPARQL Query Language for RDF - W3C Recommendation, Section: RDF Dataset
- Christian Bizer, Richard Cyganiak. The TriG Syntax
- Dublin Core Metadata Initiative
- Proof Markup Language (PML)
- Paulo Pinheiro da Silva and Deborah L. McGuinness and Richard Fikes. A Proof Markup Language for Semantic Web Services. Information Systems. Volume 31, Issues 4-5, June-July 2006, Pages 381-395.
- PML applications:
- provenance for extracted information/knowledge: J. William Murdock, Deborah McGuinness, Paulo Pinheiro da Silva, Chris Welty, and David Ferrucci. Explaining Conclusions from Diverse Knowledge Sources. In Proceedings of the 5th International Semantic Web Conference (ISWC2006), Athens, GA, USA, p. 861-872, November 2006.
- PML-based trust computation: Ilya Zaihrayeu, Paulo Pinheiro da Silva and Deborah L. McGuinness. IWTrust: Improving User Trust in Answers from the Web. In Proceedings of 3rd International Conference on Trust Management (iTrust2005), Springer, Rocquencourt, France, pages 384-392, 2005.
- PML visualization and user interfaces: Nicholas Del Rio and Paulo Pinheiro da Silva. Probe-It! Visualization Support for Provenance. In Proceedings of the Third International Symposium on Visual Computing (ISVC 2007), Lake Tahoe, NV/CA, November 26-28, 2007.
- 'Provenance Vocabulary
- Olaf Hartig and Jun Zhao: Publishing and Consuming Provenance Metadata on the Web of Linked Data. In Proceedings of the 3rd International Provenance and Annotation Workshop (IPAW), Troy, New York, USA, June 2010.
- Olaf Hartig and Jun Zhao: Guide to the Provenance Vocabulary
- Provenir Ontology
- Provenance for multimedia data
- Daniel Oberle, S. Lamparter, S. Grimm, D. Vrandecic, Steffen Staab, and Aldo Gangemi. Towards Ontologies for Formalizing Modularization and Communication in Large Software Systems. Journal of Applied Ontology, 1(2):163–202, 2006.
- Aldo Gangemi, Peter Mika. Understanding the Semantic Web through Descriptions and Situations. In On The Move 2003 Conferences (OTM2003), pages 689-706, 2003.
- Presentation on PREMIS
Other topics discussed by the group:
- Security and digital signatures
- Social Web
- e-Government
- Policies, trust, and privacy
Relevant Technologies and Standards for the Working Group
- Creative Commons
- XML Signature Specification
- Open Id
- Dublin Core
- Changeset
- The Open Provenance Model (OPM)
- The Provenance Vocabulary
- Jeni Tennison's attempts to use the Provenance Vocabulary for marking up Government data: Establishing Trust by Describing Provenance
- Protocol for Web Description Resources (POWDER)
- Named Graphs
- The Semantic Web Publishing Vocabulary
- Web Of Trust RDF Ontology
- Vocabulary of Interlinked Datasets (voiD)
- Open Archives Initiative - Object Reuse and Exchange (OAI-ORE)
- Ontology Design Patterns, in particular Information Object, Identity of Resources on the Web (IRW) ontology, and Basic Plan ontology
- CIDOC Conceptual Reference Model (CRM) (heritage-oriented)
- PREMIS (meta-data standard used by the Library of Congress)
- Intelligence Community Standard (ICS) for Source Reference Citation Metadata
- DDMS (DoD Discovery Metadata Specification)
- Provenir: A Foundational Model of Provenance
- Provenance Management Framework
- Inference Web
- Proof Markup Language - Provenance Interlingua
- ISO/TS 8000-120:2009 part 120 is concerned with data quality and provenance
- ODRL
- OAuth - Open Authentication Protocol
- - OATH - Open Authentication System
- SAML - Security Assertion Markup Language
- IMI - Identity Metasystem Interoperability
- XACML - eXtensible Access Control Markup Language
- X509 - X.509/PKI - ITU-T Recommendation X.509 (2005) | ISO/IEC 9594-8:2005
- NSIT SP800-63 - Electronic Authentication Guideline
- UMA - User-Managed Identity
- FOAF+SSL - Secure social web authentication protocol
- WebFinger - Personal data discovery protocol
Survey Papers
- Luc Moreau, The Foundations of Provenance on the Web, 2009.
- Rajendra Bose and James Frew. Lineage Retrieval for Scientific Data Processing: A Survey. ACM Computing Surveys, Volume 37, Issue 1, 2005).
- Donovan Artz and Yolanda Gil. A Survey of Trust in Computer Science and the Semantic Web, Journal of Web Semantics, Volume 5, Issue 2, 2007.
- Yogesh L. Simmhan, Beth Plale, Dennis Gannon. A survey of data provenance in e-science. ACM SIGMOD Vol 34 , No 3, 2005. See also a longer version.
- Juliana Freire, David Koop, Emanuele Santos, Claudio Silva. Provenance for Computational Tasks: A Survey, Computing Science and Engineering, Vol 10, No 3, pp 11-21, 2008.
- Provenance in databases: Why, where and how, J. Cheney, L. Chiticariu and W.-C. Tan. Foundations and Trends in Databases, 1(4):379-474, 2009.
Online Paper Collections
- Mendeley Collection. This bibliography is publicly readable. If you would like to have write access, send email to Paolo Missier.