Difference between revisions of "ProvenanceAccessScenarioUsingPAQ"

From Provenance WG Wiki
Jump to: navigation, search
m
Line 39: Line 39:
  
 
* '''Enacting the "Oh yeah?" feature'''
 
* '''Enacting the "Oh yeah?" feature'''
** ''A user requests a web browser (W) used to obtain the web documents (D1-2) to enact the "Oh yeah?" feature on the whole documents ''
+
** ''A user requests a web browser (W) used to obtain the web documents (D1-2) to enact the "Oh yeah?" feature on the whole documents''
 +
*# The web browser examines the HTTP response header of the documents to identify '''Link:''' headers.
 +
*# It retrieves all those that contain the relations, ''provenance'' or ''anchor''.
 +
*# If multiple candidates are found, it gives preference to the first ''provenance'' relation header that has a '''''default''''' anchor (i.e. no anchor relation specified), followed by the first ''provenance'' relation header that has an ''anchor'' that matches the URL of the documents '''D1''' or '''D2''', and finally the first ''provenance'' relation header.
 +
*# The browser extracts the ''provenance-uri'' information, '''I_P'', from the chosen ''provenance'' relation header. It may also extract the ''context-uri'' information, '''I_C''', from the chosen headers if the ''anchor'' parameter or relation were present. Otherwise, it uses the URL to the documents '''D1''' and '''D2''' that was used to access them as the value of '''I_C'''.
 +
 
  
 
== Online Image Documents D3 and D4 ==
 
== Online Image Documents D3 and D4 ==

Revision as of 12:01, 18 August 2011

Walk through of how the Provenance Access Scenario is satisfied using the Provenance Access and Query (W3C Editor's Draft 17 August 2011) proposal.

Online HTML Documents D1 and D2

  • Obtaining the document D
    • HTML documents D1 and D2 are accessed using HTTP GET, and the servers return Link: headers as follows (using both alternate proposals for anchor as a parameter and as a relation)

Using anchor parameter

GET https://github.com/timrdf/vsr/raw/master/data/source/tim-lebo/london-crime/version/2011-Jul-07/manual/crime.html HTTP/1.1
...

HTTP/1.1 200 OK
...
Link: http://example.com/provenance/d1.prov; rel="provenance"; anchor="https://github.com/timrdf/vsr/raw/master/data/source/tim-lebo/london-crime/version/2011-Jul-07/manual/crime.html"

GET http://lists.w3.org/Archives/Public/public-prov-wg/2011Jul/att-0031/crime.html HTTP/1.1
...

HTTP/1.1 200 OK
...
Link: http://example.com/provenance/d2.prov; rel="provenance"; anchor="http://lists.w3.org/Archives/Public/public-prov-wg/2011Jul/att-0031/crime.html"

Using anchor relation

GET https://github.com/timrdf/vsr/raw/master/data/source/tim-lebo/london-crime/version/2011-Jul-07/manual/crime.html HTTP/1.1
...

HTTP/1.1 200 OK
...
Link: http://example.com/provenance/d1.prov; rel="provenance"; 
Link: https://github.com/timrdf/vsr/raw/master/data/source/tim-lebo/london-crime/version/2011-Jul-07/manual/crime.html; rel="anchor" 

GET http://lists.w3.org/Archives/Public/public-prov-wg/2011Jul/att-0031/crime.html HTTP/1.1
...

HTTP/1.1 200 OK
...
Link: http://example.com/provenance/d2.prov; rel="provenance"; 
Link: http://lists.w3.org/Archives/Public/public-prov-wg/2011Jul/att-0031/crime.html; rel="anchor" 
  • Enacting the "Oh yeah?" feature
    • A user requests a web browser (W) used to obtain the web documents (D1-2) to enact the "Oh yeah?" feature on the whole documents
    1. The web browser examines the HTTP response header of the documents to identify Link: headers.
    2. It retrieves all those that contain the relations, provenance or anchor.
    3. If multiple candidates are found, it gives preference to the first provenance relation header that has a default anchor (i.e. no anchor relation specified), followed by the first provenance relation header that has an anchor that matches the URL of the documents D1 or D2, and finally the first provenance relation header.
    4. The browser extracts the provenance-uri information, I_P, from the chosen provenance relation header. It may also extract the context-uri information, I_C', from the chosen headers if the anchor parameter or relation were present. Otherwise, it uses the URL to the documents D1 and D2 that was used to access them as the value of I_C.


Online Image Documents D3 and D4

Email attachments of HTML and Image Documents D5 and D6

HTML and Image Documents D7 and D8 in File System