Difference between revisions of "WebSchemas/Datasets"

From W3C Wiki
Jump to: navigation, search
m (embedded data)
m
Line 35: Line 35:
 
<div itemscope="itemscope" itemtype="http://schema.org/Dataset">
 
<div itemscope="itemscope" itemtype="http://schema.org/Dataset">
  
     <a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-"><span itemprop="name">
+
     <span itemprop="name">
        <b>Seismic Hazard Zones</b>
+
        <a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">
     </span></a>
+
            <b>Seismic Hazard Zones</b>
 +
        </a>
 +
     </span>
 +
 
 +
    (<span itemprop="temporal">2011</span>)
  
 
     <div>
 
     <div>
Line 97: Line 101:
 
type: http://schema.org/dataset
 
type: http://schema.org/dataset
 
name: Seismic Hazard Zones
 
name: Seismic Hazard Zones
 +
temporal: 2011
 
url: http://www.datasf.org/story.php?title=seismic-hazard-zones-
 
url: http://www.datasf.org/story.php?title=seismic-hazard-zones-
 
description: The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation Division of Mines and Geology. Liquefaction is the transformation of a confined...
 
description: The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation Division of Mines and Geology. Liquefaction is the transformation of a confined...
Line 142: Line 147:
 
<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">
 
<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">
 
   
 
   
  <div property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></div>
+
    <span property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></span>
 +
 
 +
    (<span property="temporal">2011</span>)
  
 
     <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
 
     <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
Line 207: Line 214:
 
<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">
 
<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">
  
  <div property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></div>
+
    <span property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></span>
 +
 
 +
    (<span property="temporal">2011</span>)
  
 
     <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
 
     <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation

Revision as of 01:15, 7 December 2012


This is a WebSchemas proposal Datasets for schema.org. See Proposals listing for more. Status: Discussion




Overview

This page discusses a proposal extending schema.org for describing datasets and data catalogs. For additional information, see this demo page. Comments on this proposal are welcome.

Status

We have a detailed proposal here, and something in this direction is a strong candidate for official addition to schema.org.

The natural next steps before finalizing the addition are:

  • some indication from potential publishers that there is willingness to adopt, and that the design is a good fit to existing sites/content
  • some positive indication from relevant experts (eg. open gov data publishers; from W3C's Government Linked Data group or others) that the schema has had some careful review. Comments can be here in the Wiki, to the public-vocabs@w3.org list (see details at WebSchemas page), or passed along via mail to danbri@google.com.

Vocabulary

The Datasets extension introduces three new types, with associated properties:

  • Thing > CreativeWork > Dataset: a body of structured information describing some topic(s) of interest
    • catalog(DataCatalog): the data catalog which contains a dataset
    • distribution(DataDownload): a downloadable form of this dataset, at a specific location, in a specific format
    • keyword(Text): a keyword describing a dataset
    • spatial(Place): the range of spatial applicability of a dataset, e.g. for a dataset of New York weather, the state of New York
    • temporal(Date): the range of temporal applicability of a dataset, e.g. for a 2011 census dataset, the year 2011 (in ISO 8601 time interval format)
  • Thing > CreativeWork > DataCatalog: a collection of datasets
    • dataset(Dataset): a dataset contained in a catalog
  • Thing > CreativeWork > MediaObject > DataDownload: a dataset in downloadable form

Example

Microdata markup

<div itemscope="itemscope" itemtype="http://schema.org/Dataset">

    <span itemprop="name">
        <a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">
            <b>Seismic Hazard Zones</b>
        </a>
    </span>

    (<span itemprop="temporal">2011</span>)

    <div>
        <meta itemprop="url" content="http://www.datasf.org/story.php?title=seismic-hazard-zones-"/>
        <span itemprop="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
Division of Mines and Geology. Liquefaction is the transformation of a confined layer of sandy or silty water-satuated material into a liquid-like state
because of earthquake shaking. San Francisco Building Code Section1804.5 requires a geotechnical investigation in seismic hazard zones.</span>
    </div>

    <div itemprop="spatial" itemscope="itemscope" itemtype="http://schema.org/Country" itemid="http://dbpedia.org/resource/United_States"><i>Country:</i>
        <a href="http://en.wikipedia.org/wiki/United_States">
            <span itemprop="name">United States</span>
        </a>
    </div>

    <div itemprop="publisher" itemscope="itemscope" itemtype="http://schema.org/Organization"><i>Publisher:</i>
        <span itemprop="name">Department of Technology</span>
        <span itemprop="email">dot at example dot org</span>
    </div>

    <div><i>Topics:</i>
        <span itemprop="about" itemscope="itemscope" itemtype="http://schema.org/Thing" itemid="http://dbpedia.org/resource/Seismic_hazard">
            <a href="http://en.wikipedia.org/wiki/Seismic_hazard"><span itemprop="name">seismic hazard</span></a>
        </span>
    </div>

    <div><i>Keywords:</i>
        <span itemprop="keyword"><span itemscope="itemscope" itemtype="http://schema.org/Text">layers</span></span>,
        <span itemprop="keyword"><span itemscope="itemscope" itemtype="http://schema.org/Text">geography</span></span>,
        <span itemprop="keyword"><span itemscope="itemscope" itemtype="http://schema.org/Text">maps</span></span>,
        <span itemprop="keyword"><span itemscope="itemscope" itemtype="http://schema.org/Text">gis</span></span>
    </div>

    <div itemprop="license" itemscope="itemscope" itemtype="http://schema.org/Webpage"><i>License:</i>
        <a href="http://opendatacommons.org/licenses/pddl/1.0/">
            <span itemprop="name">ODC Public Domain Dedication and Licence (PDDL)</span>
        </a>
        <meta itemprop="url" content="http://opendatacommons.org/licenses/pddl/1.0/" />
    </div>

    <div itemprop="distribution" itemscope="itemscope" itemtype="http://schema.org/DataDownload"><i>Download:</i>
        <a href="http://data-gov.tw.rpi.edu/raw/1200/data-1200.nt.gz">
            <meta itemprop="encodingFormat" content="text/plain" />
            <meta itemprop="contentUrl" content="http://data-gov.tw.rpi.edu/raw/1200/data-1200.nt.gz" />
            <meta itemprop="inLanguage" content="en" />
            <span itemprop="description">compressed N-Triples dump</span>,
            <span itemprop="datePublished" content="2011-08-12">August 12, 2011</meta>
            <span itemprop="contentSize" content="13.9">(13.97MB)</span>
	</a>
    </div>

</div>

embedded data

Item 
type: http://schema.org/dataset	
name: Seismic Hazard Zones
temporal: 2011
url: http://www.datasf.org/story.php?title=seismic-hazard-zones-
description: The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation Division of Mines and Geology. Liquefaction is the transformation of a confined...
spatial: Item 1
publisher: Item 2
about: Item 3
keyword: layers
keyword: geography
keyword: maps
keyword: gis
license: Item 4
distribution: Item 5

Item 1
type: http://schema.org/country
name: United States

Item 2
type: http://schema.org/organization
name: Department of Technology
email: dot at example dot org

Item 3
type: http://schema.org/thing
name: seismic hazard

Item 4
type: http://schema.org/webpage
name: ODC Public Domain Dedication and Licence (PDDL)
url: http://opendatacommons.org/licenses/pddl/1.0/

Item 5
type: http://schema.org/datadownload
encodingformat: text/plain
contenturl: http://data-gov.tw.rpi.edu/raw/1200/data-1200.nt.gz
inlanguage: en
description: compressed N-Triples dump
datepublished: 2011-08-12
contentsize: 13.9

equivalent RDFa 1.1 markup

<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">
 
    <span property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></span>

    (<span property="temporal">2011</span>)

    <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
Division of Mines and Geology. Liquefaction is the transformation of a confined layer of sandy or silty water-saturated material into a liquid-like state
because of earthquake shaking. San Francisco Building Code Section1804.5 requires a geotechnical investigation in seismic hazard zones.</div>

    <div rel="spatial" resource="http://dbpedia.org/resource/United_States"><i>Country:</i>
        <a href="http://dbpedia.org/resource/United_States">
            <span about="http://dbpedia.org/resource/United_States" typeof="Country">
                <span property="name">United States</span>
            </span>
        </a>
    </div>

    <div rel="publisher"><i>Publisher:</i>
        <span typeof="Organization">
            <span property="name">Department of Technology</span>
	    (<span property="email">dot at example dot org</span>)
        </span>
    </div>

    <div><i>Topics:</i>
        <span rel="about" resource="http://dbpedia.org/resource/Seismic_hazard">
            <a href="http://en.wikipedia.org/wiki/Seismic_hazard">
                <span property="name">seismic hazard</span></a>
        </span>
    </div>

    <div><i>Keywords:</i>
        <span property="keyword">layers</span>,
        <span property="keyword">geography</span>,
        <span property="keyword">maps</span>,
        <span property="keyword">gis</span>
    </div>

    <div rel="license"><i>License:</i>
    	<a href="http://opendatacommons.org/licenses/pddl/1.0/">
	<span typeof="Webpage">
	    <span property="name">ODC Public Domain Dedication and Licence (PDDL)</span>
	    <meta property="url" content="http://opendatacommons.org/licenses/pddl/1.0/"/>
	</span>
	</a>
    </div>
    
    <div rel="distribution"><i>Download:</i>
        <a href="http://data-gov.tw.rpi.edu/raw/1200/data-1200.nt.gz">
        <span typeof="DataDownload">
            <meta property="encodingFormat" content="text/plain" />
            <meta property="contentUrl" content="http://data-gov.tw.rpi.edu/raw/1200/data-1200.nt.gz" />
            <meta property="inLanguage" content="en" />
            <span property="description">compressed N-Triples dump</span>,
            <meta property="datePublished" content="2011-08-12">August 12, 2011</meta>
            <meta property="contentSize" content="13.9">(13.97MB)</span>
	</span>
	</a>
    </div>

</div>

RDFa 1.1 Lite markup

<div vocab="http://schema.org/" prefix="dcat: http://www.w3.org/ns/dcat#"  about="http://logd.tw.rpi.edu/source/datasf-org/dataset/catalog/datasf.org/version/2011-Jun-07/thing_89" typeof="Dataset dcat:Dataset">

    <span property="name"><b><a href="http://www.datasf.org/story.php?title=seismic-hazard-zones-">Seismic Hazard Zones</a></b></span>

    (<span property="temporal">2011</span>)

    <div property="description">The dataset represents the Liquefaction and Landslide Zones as determined by the California Dept. of Conservation
Division of Mines and Geology. Liquefaction is the transformation of a confined layer of sandy or silty water-saturated material into a liquid-like state
because of earthquake shaking. San Francisco Building Code Section1804.5 requires a geotechnical investigation in seismic hazard zones.</div>

    <div><i>Keywords:</i>
        <span property="keyword">layers</span>,
        <span property="keyword">geography</span>,
        <span property="keyword">maps</span>,
        <span property="keyword">gis</span>
    </div>

</div>

Related vocabularies

Mappings

This table maps Datasets extension types and properties (including supporting schema.org vocabulary) to and from their approximate equivalents in DCAT, ADMS, and VoID. Note that ADMS refers to a vocabulary, RADion, designed to act as a common substrate for all such languages - a role that perhaps schema.org could fulfill. The 'official release' of RADion is on the European Commission's Joinup platform and a namespace has been reserved on w3.org however, for ease of access, an unofficial easy-access copy is also available directly in HTML and RDF (renders with XSLT).

Datasets extension DCAT ADMS VoID
ds:DataCatalog dcat:Catalog adms:SemanticAssetRepository
ds:DataDownload dcat:Download adms:SemanticAssetDistribution
ds:Dataset dcat:Dataset adms:SemanticAsset void:Dataset
ds:catalog dcterms:isPartOf
ds:dataset dcat:dataset dcterms:hasPart
ds:distribution dcat:distribution radion:distribution void:dataDump
ds:keyword dcat:keyword radion:keyword
ds:license dcterms:license dcterms:license
ds:spatial dcterms:spatial dcterms:spatial
ds:temporal dcterms:temporal
sdo:about dcat:theme dcterms:subject
sdo:contentSize dcat:size
sdo:contentUrl dcat:accessURL adms:accessURL
sdo:copyrightHolder
sdo:Country
sdo:dateModified dcterms:modified dcterms:modified
sdo:datePublished dcterms:issued dcterms:created
sdo:description dcterms:description dcterms:description
sdo:encodingFormat dcterms:format dcterms:format
sdo:inLanguage dcterms:language dcterms:language
sdo:name dcterms:title rdfs:label
sdo:Organization foaf:Organization
sdo:Person foaf:Person
sdo:publisher dcterms:publisher dcterms:publisher
sdo:Thing skos:Concept (recommends but does not require skos:Concept)
sdo:url foaf:homepage
sdo:version radion:version
dcat:CatalogRecord
dcat:dataDictionary
dcat:dataQuality
dcat:Distribution
dcat:Feed
dcat:granularity
dcat:record
dcat:themeTaxonomy
dcat:WebService
dcterms:accrualPeriodicity
dcterms:identifier
dcterms:references
foaf:primaryTopic
skos:Concept
skos:ConceptScheme
void:class
void:classes
void:classPartition
void:DatasetDescription
void:distinctObjects
void:distinctSubjects
void:documents
void:entities
void:exampleResource
void:feature
void:inDataset
void:linkPredicate
void:Linkset
void:objectsTarget
void:openSearchDescription
void:properties
void:property
void:propertyPartition
void:rootResource
void:sparqlEndpoint
void:subjectsTarget
void:subset
void:target
void:TechnicalFeature
void:triples
void:uriLookupEndpoint
void:uriRegexPattern
void:uriSpace
void:vocabulary

See also