Warning:
This wiki has been archived and is now read-only.

W3C Provenance Incubator Group Wiki

From XG Provenance Wiki
Jump to: navigation, search

Welcome to the Provenance Incubator Group Wiki.


Mission and Charter

The mission of the Provenance Incubator Group, part of the Incubator Activity, was to provide a state-of-the art understanding and develop a roadmap in the area of provenance for Semantic Web technologies, development, and possible standardization. See the charter for more information. The group's activities were public, and recorded on the W3C Provenance Incubator Group wiki.

Final Report

Final Report, December 2010
Overview presentation
Follow on Provenance Working Group, April 2011

About Provenance

"At the toolbar (menu, whatever) associated with a document there is a button marked "Oh, yeah?". You press it when you lose that feeling of trust. It says to the Web, 'so how do I know I can trust this information?'. The software then goes directly or indirectly back to metainformation about the document, which suggests a number of reasons." Tim Berners-Lee, W3C Chair, Web Design Issues, September 1997
"Provenance is the number one issue we face when publishing government data as linked data for data.gov.uk" John Sheridan, UK National Archives, data.gov.uk, February 2010
"We need a paradigm that makes it simple [...] to perform and publish reproducible computational research. [...] A Reproducible Research Environment (RRE) [...] provides computational tools together with the ability to automatically track the provenance of data, analyses, and results and to package them (or pointers to persistent versions of them) for redistribution." Jill Mesirov, Chief Informatics Officer of the MIT/Harvard Broad Institute, in Science, January 2010
"The number of publications on provenance is [...] a total of 425 [...] The first publication dates back to 1986, [...] with about half the papers published in the last two years." Luc Moreau, University of Southampton, in The Foundations of Provenance on the Web, November, 2009
"The problem is - and this is true of books and every other medium - we don't know whether the information we find [on the Web] is accurate or not. We don't necessarily know what its provenance is. So we have to teach people how to assess what they've found. [...] there's so much juxtaposition of the good stuff and not-so-good stuff and flat-out-wrong stuff or deliberate misinformation or plain ignorance." Vinton Cerf, Internet pioneer, in Smithsonian's "40 Things you need to know about the next 40 years" issue, July, 2010
"In content, as creation becomes overabundant and as value shifts from creator to curator, it becomes all the more vital to properly cite and link to sources [...]. Good curation demands good provenance. [...] Provenance is no longer merely the nicety of artists, academics, and wine makers. It is an ethic we expect." Jeff Jarvis, media company consultant and associate professor at the City University of New York's Graduate School of Journalism, in The importance of provenance on his BuzzMachine blog, June, 2010


Provenance of a resource is a record that describes entities and processes involved in producing and delivering or otherwise influencing that resource. Provenance provides a critical foundation for assessing authenticity, enabling trust, and allowing reproducibility. Provenance assertions are a form of contextual metadata and can themselves become important records with their own provenance.


What is Provenance?

Group Reports

A summary of the group's findings and recommendations can be found in the Final Report and in this this slide presentation.


The first phase of the group's activities focused on requirements for provenance, categorizing and describing requirements based on use cases. This phase resulted in the following report:

Based on the use cases raised in this report we submitted a paper as a group effort to the RDF Next Steps workshop:

The group analyzed and compared current proposals for representing provenance on the Web:

The group also assembled a report on the state of the art in provenance, highlighting existing approaches and technology gaps:

The draft of the final report of the group contained a summary of all the group's findings, a roadmap, and recommendations:

The official W3C final report:

Timeline of Activities

See the overview of released reports for major products of the group's work.

See the group's planned timeline of activities.

Meetings and Discussions

Twitter and social media

The official tag for the group is #prov-xg. This should be used across social sites. You can find the latest tweets here.

Liaisons with Other Groups

If you would be interested in being a liaison with other groups please contact Yolanda Gil, the Provenance XG Group Chair.

Contact

Chair: Yolanda Gil, University of Southern California's Information Sciences Institute.