Data Catalog Vocabulary/AtomStrawman

From W3C eGovernment Wiki
Jump to: navigation, search

Data Catalog Vocabulary (dcat) in Atom Strawman

This is intended as a strawman for exploring how Data Catalog Vocabulary (dcat) could be used from Atom. The rationale for using Atom is that it offers an established mechanism for making updates available, and synchronizing data services with data publishers. Atom is an IETF standard (RFC 4287), with a thriving community of practice around it.

The goal would be to document best practices for expressing dcat in Atom, to provide a low barrier entry point for data publishers and data consumers who are more familiar with a document-centric tool chain. In addition a GRDDL transform would be supplied for developers who are more familiar using an rdf (graph centric) tool chain.

This is a (perhaps flawed) work in progress...so please feel free to edit and annotate!

example feed from the national data catalog

<feed xmlns="http://www.w3.org/2005/Atom"
        xmlns:dcat="http://vocab.deri.ie/dcat#"
        xmlns:grddl="http://www.w3.org/2003/g/data-view#"
        grddl:transformation="http://www.w3.org/2010/dcat-atom.xslt">

 <title>National Data Catalog Dataset Feed</title>
 <link rel="self" href="http://nationaldatacatalog.com/feed/" />
 <updated>2010-04-01T21:12:00Z</updated>
 <author>
   <name>Sunlight Foundation</name>
 </author>
 <id>http://nationaldatacatalog.com/feed/</id>

 <entry>
   <title>311 Service Requests</title>
   <id>http://nationaldatacatalog.com/data/311-service-requests</id>
   <updated>2003-12-13T18:30:02Z</updated>
   <summary>311 Service Requests. The dataset contains locations and attributes of service requests received by theof thethrough the Mayor's Call Center (311),, electronic and US mail service or via other methods of communication.</summary>
   <dcat:keyword>call centers</dcat:keyword>
   <author>
       <name>Washington DC Data Warehouse Program</name>
   </author>
   <link rel="alternate" type="text/html" href="http://nationaldatacatalog.com/data/311-service-requests"/>
   <link rel="alternate" type="atom/application+xml" href="http://data.octo.dc.gov/feeds/src/src_current.xml" />
   <link rel="http://vocab.deri.ie/dcat#accessURL" type="application/zip" href="http://data.octo.dc.gov/feeds/src/src_current_plain.zip">
     <dcat:bytes>8783</dcat:bytes>
   </link>
   <link rel="http://vocab.deri.ie/dcat#accessURL" type="application/vnd.google-earth.kml+xml" href="http://data.octo.dc.gov/feeds/src/src_current.kml">
       <dcat:bytes>71606</dcat:bytes>
   </link>
   <link rel="http://vocab.deri.ie/dcat#dataDictionary" type="text/html" href="http://nationaldatacatalog.com/data/311-service-requests/docs"
/>
 </entry>

  <!-- etc -->

</feed>

corresponding rdf in turtle

<http://nationaldatacatalog.com/feed/> a dcat:Catalog ;
  dc:creator "Sunlight Foundation" ;
  dc:updated "2010-04-01T21:12:00Z" ;
  dcat:dataset <http://nationaldatacatalog.com/data/311-service-requests> .

<http://nationaldatacatalog.com/data/311-service-requests> a dcat:Dataset ;
  dc:description "311 Service Requests. The dataset contains locations and attributes of service requests received through the Mayor's Call Center (311),, electronic and US mail service or via other methods of communication." ;
  dcat:keyword "call centers" ;
  dct:updated "2003-12-13T18:30:02Z" ;
  dct:dataDictionary <http://nationaldatacatalog.com/data/311-service-requests/docs> ;
  dct:distribution <http://data.octo.dc.gov/feeds/src/src_current_plain.zip>, <http://data.octo.dc.gov/feeds/src/src_current.kml>, <http://data.octo.dc.gov/feeds/src/src_current.xml> .

<http://data.octo.dc.gov/feeds/src/src_current_plain.zip> a dcat:Download ;
  dcat:bytes "8783" ;
  dc:format "application/zip" .

<http://data.octo.dc.gov/feeds/src/src_current.kml> a dcat:Download ;
  dcat:bytes "71606" ;
  dc:format "application/vnd.google-earth.kml+xml" .

<http://data.octo.dc.gov/feeds/src/src_current.xml> a dcat:Feed ;
  dc:format "application/atom+xml" .