Metadata Vocabulary for Tabular Data

Abstract

Validation, conversion, display and search of tabular data on the web requires additional metadata that describes how the data should be interpreted. This document defines a vocabulary for metadata that annotates tabular data. This can be used to provide metadata at various levels, from collections of data from CSV documents and how they relate to each other down to individual cells within a table.

name of a column as the basis for naming machine-readable fields in the target format, such as the name of the equivalent element or attribute in XML, property in JSON or property URI in RDF. Conversion specifications MAY use any of the properties defined in this specification to adjust the mapping of an annotated table into another format. Conversion specifications MAY define additional properties, not defined in this specification, which are specifically used when converting to the target format of the conversion. For example, a conversion to XML might specify a element-or-attribute property on columns that determines whether a particular column is represented through an element or an attribute in the data. Conversion specifications SHOULD specify format-specific properties specifying external processing steps to provide more control to people defining conversions. If these are specified, the conversion specification MUST specify at what point in the processing this external processing takes place, and what it takes place on. Examples might be: the URL of an XSLT file that is used to process XML after it is generated a string containing a SPARQL CONSTRUCT pattern that is executed on RDF after it is generated properties that contain definitions of Javascript callback functions that are used when processing particular columns or individual rows 3. Metadata Format

This section defines a set of properties and permitted values for annotating tabular data, and how these annotations should be interpreted by applications.

~~Issue 4 We intend to support metadata for packages. In this version of this specification, we are scoping to single metadata files defining single CSV files. 3.1 Syntax~~

A metadata document is a JSON document which holds an object at the top level. This object is a description object of either a table group or a single table. A description object is a JSON object that describes a component of ~~a table~~ the tabular data model (a table group, a table, a column, a row or a cell) and has one or more properties are mapped into properties on that component.

3.1 Property Syntax

There are different types of properties on description objects:

array properties

These hold an array of one or more objects, which are usually description objects .

For example, the resources property is an array property. A table group description might contain:

"resources": [{

  "@id": "https://example.org/countries.csv",
  "schema": "https://example.org/countries.json"
}, {
  "@id": "https://example.org/country_slice.csv",
  "schema": "https://example.org/country_slice.json"
}]

in which case the resources property has a value that is an array of two table description objects.

link properties

These hold one or more references to other resources by URL. Their values may be:

strings — � resolved as URLs against the base URL
arrays — � lists of strings which are resolved as URLs against the base URL

For example, the hasVersion dc:hasVersion property is a link property. A table description might contain:

Example 4


"hasVersion"

"dc:hasVersion"


:




"example-2014-01-03.csv"

in which case the hasVersion dc:hasVersion property on the table would have a single value, a link to example-2014-01-03.csv , or it. Alternatively, the metadata document might contain:

Example 5

"dc:hasVersion": [
  "example-2014-01-03.csv",
  "example-2014-01-17.csv",
  "example-2014-01-25.csv"


]

in which case the hasVersion dc:hasVersion property on the table would ~~have~~ be an array of three values, links to other versions of the table.

~~internal~~ URI template properties

A URI template property contains a [ URI-TEMPLATE ] which can be used to generate a URI. These URI templates are expanded in the context of each row by combining the template with a set of variables with values. The variables that are set are:

_row: _row is set to the row number of the row that is currently being processed
Issue 32
Where does row numbering begin?
column names: a variable is set for each column within the schema; the name of the variable is the percent-encoded name of the column and the value is the canonical representation of the value of the cell in that column in the row that is currently being processed

For example, the urlTemplate property holds a URI template that is used to generate a URL identifier for each row, which might look like:

Example 6


"urlTemplate"
:
"http://example.org/example.csv#row={_row}"

The identifiers that are generated for the rows would then look like http://example.org/example.csv#row=1,http://example.org/example.csv#row=2 and so on.

Alternatively, with the CSV and metadata in the section 1. Introduction , the urlTemplate might look like:

Example 7


"urlTemplate"
:
"http://example.org/tree/{on%2Dstreet}/{GID}"

This would generate URIs such as http://example.org/tree/ADDISON%20AV/1 and http://example.org/tree/EMERSON%20ST/2.

Once the URI has been generated, it is resolved against the location of the resource (eg the CSV file) to create an absolute URI. For example, given a urlTemplate within a schema such as:

"urlTemplate": "#row={_row}"

and given a CSV file at http://example.com/temp.csv, the URL for the first row will be http://example.com/temp.csv#row=1.

column reference properties

These hold one or more references to other column description objects. The referenced description object must have an @id property whose value looks like _: name . Internal property. Column reference properties can then reference ~~other~~ column description objects through values that are:

strings ~~— in the format _: name~~ � which MUST match the @id name on ~~another~~ a column description object within the metadata document
arrays — � lists of strings as above

For example, the primaryKey property is an ~~internal~~ column reference property on the schema. It has to hold references to columns defined elsewhere in the schema, and the descriptions of those columns must have @id name properties. It can hold a single reference, like this:

Example 8

"schema": {
  "columns": [{
    "name": "GID"
  }, ... ],
  "primaryKey": "GID"


}

or it can contain an array of references, like this:

Example 9

"schema": {
  "columns": [{
    "name": "givenName"
  }, {
    "name": "familyName"
  }, ... ],
  "primaryKey": [ "givenName", "familyName" ]


}

object properties

These hold one or more objects or references to objects by URL. Their values may be:

strings — � resolved as URLs against the base URL
objects — � interpreted as structured objects
arrays — � lists of strings and/or objects, interpreted as URLs or structured objects

Object properties are often used when the values can be or should be values within controlled vocabularies, or structured information which may be held elsewhere. For example, the creator dc:creator of a table is should be an object property. It could be provided as a URL that indicates the creator, like this:

Example 8 10


"creator"

"dc:creator"


:




"http://ons.gov.uk"

or a structured object, like this:

Example 11

"dc:creator": {
  "sch:name": "Office of National Statistics",
  "sch:url": "http://ons.gov.uk",
  "sch:email": "info@ons.gsi.gov.uk"


}

or an array of URLs, like this:

Example 12


"dc:creator"
:
[

"http://ons.gov.uk"
,
"https://www.gov.uk/government/organisations/department-for-transport"

]

or an array of structured objects:

Example 13

"dc:creator": [{
  "sch:name": "Office of National Statistics",
  "sch:url": "http://ons.gov.uk",
  "sch:email": "info@ons.gsi.gov.uk"
}, {
  "sch:name": "Department for Transport",
  "sch:url": "https://www.gov.uk/government/organisations/department-for-transport"

}]

or an array that mixes URLs and objects:

Example 14

"dc:creator": [{
  "sch:name": "Office of National Statistics",
  "sch:url": "http://ons.gov.uk",
  "sch:email": "info@ons.gsi.gov.uk"

},

"https://www.gov.uk/government/organisations/department-for-transport"

]

natural language properties

These hold natural language strings. Their values may be:

strings — � interpreted as natural language strings in the default language
arrays — � interpreted as alternative natural language strings in the default language
objects whose properties MUST be language codes as defined by [ ~~RFC3066~~ BCP47 ] and whose values are either strings or arrays, providing natural language strings in that language

Natural language properties are used for things like descriptions and titles. For example, the title property provides a natural language label for a column. If it's a plain string like this:

Example 10 15


"title"

:




"Project
title"

then that string is assumed to be in the language provided through the @language property of the nearest @context (or have no assumed language, if there is no such property). Multiple alternative values can be given in an array:

Example 16

"title": [
  "Project title",
  "Project"


]

It's also possible to provide multiple values in different languages, using an object structure. For example:

Example 17

"title": {
  "en": "Project title",
  "fr": "Titre du projet"


}

and within such an object, the values of the properties can themselves be arrays:

Example 18

"title": {
  "en": [ "Project title", "Project" ],
  "fr": "Titre du projet"


}

Issue 5 49

We invite comment on whether it would be useful to enable some markup in natural language strings, for example by stating that they are interpreted as HTML or Markdown.

atomic properties

These hold atomic values. Their values may be:

numbers — � interpreted as integers or doubles
booleans — � interpreted as booleans ( true or false )
strings — � interpreted as defined by the property
arrays — � lists of numbers, booleans or strings

Note

JSON does not have date or time types. Where a property takes a date as a value, this MUST be a string in the format YYYY-MM-DD.

3.2 Top-Level Properties

The top-level object (whether it is a table group description or a table description ) MAY have a @context property. This holds an object that provides metadata for interpreting other properties, namely:

@language: indicates the default language for the values of properties in the ~~description;~~ metadata document; if present, its value MUST be a language code [ ~~RFC3066~~ BCP47 ] which is the default language for the values of other properties in the metadata document

Note

Note that the @language property of the @context object, which gives the default language used within the metadata file, is distinct from the language property on a description object , which gives the language used in the data within ~~the table.~~ a group of tables, table or column.
@base: indicates the base URL against which other URLs within the description are resolved; if present, its value MUST be a URL which is resolved against the ~~base URL~~ location of the metadata document ~~(the location from which it was retrieved)~~ to provide the base URL for other URLs in the metadata document; if unspecified, the base URL used for interpreting relative URLs within the metadata document is the location of the metadata document itself

Note

Note that the @base property of the @context object provides the base URL used for URLs within the metadata document, not the URLs that appear within the ~~table.~~ group of tables or table it describes.

3.3 3.2.1 Common Properties Importing Metadata

The ~~properties listed here may be applied~~ top-level object (whether it is a table group description or a table description ) MAY also have an import property. This is a link property which references one or more other metadata files to ~~any structure within~~ be imported into the ~~tabular data model: tables, columns, rows or cells.~~ original metadata file.

~~Issue 6 We invite comment on whether there~~

If the import property contains an array, imports are ~~other standard~~ carried out in sequence: the first metadata ~~vocabularies that should be reused within this specification.~~ file referenced is imported into the original metadata file; the second is imported into the result and so on. If a referenced metadata file has already been imported (or was the original metadata file) it is ignored.

~~3.3.1 Dublin Core Terms~~

~~Descriptions MAY contain any properties defined by [ DC-TERMS ] to describe~~ If the ~~table. This specification does not define~~ top-level object of any ~~application behaviour associated with these properties being present, except that validation~~ of the metadata files ~~MUST check that, if they~~ are ~~present,~~ table descriptions , these are treated as if they ~~adhere to the syntax defined here. Property Type Details abstract natural language property~~ were table group descriptions containing a single table description (ie having a single accessRights resource ~~object~~ property ~~accrualMethod~~ whose value is the same as the original table description).

An imported description object ~~property accrualPeriodicity~~ B is imported into an original description object A by merging each property ~~accrualPolicy object~~ of B into A. If the property ~~alternative natural language~~ from B does not exist on A, it is simply added to A. If A does have the property, the way the values are merged depends on the type of the property, as follows:

If the property ~~audience object~~ is an array property ~~available atomic~~ , the way in which values are merged depends on the property; see the relevant property ~~dates~~ for this definition.
If the property is a link property , then if the property only accepts single values, the value from A overrides that from B, otherwise the result is an array of links: those from A followed by those from B that were not already a value in A.
If the ~~format YYYY-MM-DD bibliographicCitation natural language~~ property ~~conformsTo object~~ is a URI template property ~~contributor object~~ , the value from A overrides that from B.
If the property ~~coverage~~ is a column reference property , the value from A overrides that from B.
If the property is an object property ~~created atomic~~ , then if the property ~~dates in~~ only accepts single objects:
1. if the value of the ~~format YYYY-MM-DD creator object~~ property ~~date atomic~~ in A is a string or the value from B is a string then the value from A overrides that from B
2. otherwise (if both values as objects) the objects are merged as described here
If the property ~~dates~~ accepts arrays, the result is an array of objects or strings: those from A followed by those from B that were not already a value in A.
If the ~~format YYYY-MM-DD dateAccepted atomic~~ property is a natural language property , the result is an object whose properties are language codes and where the values of those properties are arrays. The suitable language code for the values is either explicit within the existing value or determined through the default language ~~dates~~ in the ~~format YYYY-MM-DD~~ metadata document; if it can't be determined the language code dateCopyrighted und should be used. The arrays should provide the values from A followed by those from B that were not already a value in A.
If the property is an atomic property ~~dates~~ , then if the property only accepts single values, the value from A overrides that from B; otherwise the result is an array of values: those from A followed by those from B that were not already a value in A.

If the type of the ~~format YYYY-MM-DD dateSubmitted atomic~~ property ~~dates~~ cannot be determined, because it is not defined in this specification (ie because it is an extension property), the type of the ~~format YYYY-MM-DD description natural language~~ property ~~educationLevel object~~ is determined based on its values in A and B, as follows, and merged accordingly:

If the value of the property ~~extent object~~ in A and the value of the property ~~format object~~ in B are both objects, they are treated as if the property ~~hasFormat~~ is an object property ~~hasPart link property hasVersion link property identifier atomic~~ that only accepts single objects.
If one of the values is an array and the other is an object, they are treated as if the property ~~a URL instructionalMethod~~ is an object property ~~isFormatOf link~~ that accepts arrays.
If the value of the property ~~isPartOf link~~ in A and the value of the property ~~isReferencedBy link~~ in B are atomic values, they are treated as if the property ~~isReplacedBy link~~ is an atomic property ~~isRequiredBy link~~ that only accepts single values.
If one of the values is an array and the other is an atomic value, they are treated as if the property ~~issued~~ is an atomic property that accepts arrays.

dates in the format YYYY-MM-DD 3.3 Common Properties

Descriptions of groups of tables, tables, schemas, columns, rows and cells MAY contain any properties whose names are either absolute URLs or prefixed names. For example, a table description may contain isVersionOf dc:description,dcat:keyword ~~link property~~ or language schema:copyrightHolder ~~atomic property~~ properties to provide a ~~language code~~ description, keywords or the name of the copyright holder, as defined by in Dublin Core Terms , DCAT or schema.org .

The same prefixes are pre-defined as for [ ~~RFC3066~~ rdfa-core ~~]; this is an inherited property license object property~~ ] within the RDFa 1.1 Initial Context and MUST NOT be overridden. Properties from other vocabularies MUST be defined using full URLs.

Note

Forbidding the declaration of new prefixes ensures consistent processing between JSON-LD-aware and non-JSON-LD-aware processors.

~~mediator~~

3.4 Table Groups

A table group description is a JSON object ~~property~~ that describes a group of tables.

3.4.1 Required Properties

medium resources ~~object~~

An array property ~~modified atomic property~~ of table descriptions ~~dates~~ for the tables in the ~~format YYYY-MM-DD provenance object property publisher object property references link property~~ group. When an array of table descriptions B is imported into an original array of table descriptions A, each table description within B is combined into the original array A by:

if there is a table description with the same relation @id ~~link property~~ in A, the table description from B is imported into the matching table description in A
otherwise, the table description from B is appended to the array of table descriptions A

3.4.2 Optional Properties

The description of a group of tables MAY also contain:

replaces schema ~~link~~

An object property ~~requires link property~~ that provides a single schema description as described in section 3.8 Schemas ~~rights~~ , for all the tables in the group. This may be provided as an embedded object ~~property~~ within the JSON metadata or as a URL reference to a separate JSON schema document.

rightsHolder table-direction ~~object~~

An atomic property that MUST have a single string value that is one of source link property "rtl", spatial "ltr" ~~object property~~ or subject "default". Indicates whether the tables in the group should be displayed with the first column on the right, on the left, or based on the first character in the table that has a specific direction. See section 4.1.1 Bidirectional Tables ~~object property~~ for more details.

~~tableOfContents~~

Issue 51 ~~natural language property~~

This should be a defined controlled vocabulary in JSON-LD, so that the values map on to URIs in the RDF version rather than strings. We invite comment on how to configure the JSON-LD context to enable these values to be interpreted in this way.

temporal dialect

An object property that provides a single dialect description . If provided,


title

dialect

provides hints to processors about how to parse the referenced files for to create tabular data models for the tables in the group. This may be provided as an embedded object or as a URL reference. See section 3.6 Dialect Descriptions ~~natural language property~~ for more details.

type templates ~~object~~

An array property ~~valid atomic property~~ of template specifications ~~dates in~~ that provide mechanisms to transform the ~~format YYYY-MM-DD~~ tabular data into other formats. See section ~~3.3.2~~ 3.7 ~~Links Description MAY include properties~~ Template Specifications for ~~registered link relations , prefixed by link: . This specification does not define any application behaviour associated with these properties being present, except that validation~~ more details. When an array of ~~metadata files MUST check that, if they are present, they have values that are URLs or arrays~~ template specifications B is imported into an original array of ~~URLs. The following properties are particularly relevant to tabular data:~~ template specifications A, each template specification within B is combined into the original array A by:

link:alternate link:canonical link:collection link:duplicate link:glossary link:help link:icon link:last link:latest-version link:next link:original link:predecessor-version link:prev or link:previous link:preview link:profile link:related link:search link:self if there is a template specification with the same link:start targetFormat and link:successor-version templateFormat in A, the template specification from B is imported into the matching template specification in A
~~link:terms-of-service~~ otherwise, the template specification from B is appended to the array of template specifications A

link:up @type

If included,


link:version-history

@type

MUST be set to


link:working-copy

"TableGroup"

. Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The description MAY contain any common properties as defined in section 3.3 Common Properties to provide extra metadata about the set of tables as a whole.

The description MAY contain any of the properties defined in section 2.2 Inherited Properties ~~link:working-copy-of~~ to describe cells within the tables.

~~Note~~ Issue 22

~~Unlike the Dublin Core terms, link relations are an ever-expanding list and there may eventually be clashes between link relation terms and those defined above. That's why~~ This issue relates to the ~~above list uses QNames for all link relations, so~~ use of type vs datatype as a column property. (This issue seems moot now that ~~they look like link: relation rather than plain relation .~~ neither are included.)

~~3.3.3 Other Properties~~

text-direction One of "rtl" or "ltr" (the default). Indicates whether the text within cells should be displayed by default as left-to-right or right-to-left text. See section 2.2.1 3.5 Bidirectional Tables for more details. 3.4 Tables

A table description is a JSON object that describes a table within a CSV file.

Issue 7 50

A CSV file might not be the same as the table that it contains. For example, a given CSV file might contain two tables (in different regions of the CSV file), or might contain a table that isn't positioned at the top left of the CSV file. We invite comment about whether we should assume that pre-processing is used to extract tables where there isn't a 1:1 correspondence between CSV file and table, or not.

3.4.1 3.5.1 Required Properties

@id: This link property gives the single URL of the CSV file that the table is held in, relative to the location of the metadata document.

3.4.2 3.5.2 Optional Properties

The description of a table MAY also contain:

schema: An object property that provides a single schema description as described in section ~~3.5~~ 3.8 Schemas . This may be provided as an embedded object within the JSON metadata or as a URL reference to a separate JSON schema document.
notes: An object property ~~, usually~~ that provides an ~~array,~~ array of ~~annotation~~ objects representing annotations. This specification does not place any constraints on the structure of these objects. ~~An annotation object~~

Note
The Web Annotation Working Group is ~~an object that holds general annotations about~~ developing a ~~particular column, row, cell or region~~ vocabulary for expressing annotations. In future versions of ~~the table. Each annotation object MUST have an @id property~~ this specification, we anticipate referencing that ~~references the relevant column, row, cell~~ vocabulary.
Issue 70
Should there be column or ~~region of the table using a fragment identifier. It MAY have any other common properties~~ level notes as ~~described in section 3.3 Common Properties~~ well?
.

Issue 9 71
~~We intend to add a small subset of properties that indicate how a CSV file should be parsed, specifically those that mirror the existing distinction between the media types for~~
The Annotation Model can indeed become very complex.
text/csv table-direction ~~and~~: As defined for table groups .
text/tab-separated-values , and the media type parameters that they allow, namely: templates: As defined for table groups .
separator dialect ~~to give the character used as the separator in the tabular data file~~: As defined for table groups .
encoding @type ~~to specify the encoding used in the file~~: If included, header @type MUST be set to ~~specify whether or not a header line is present We invite comment about whether these are the right properties~~ "Table". Publishers MAY include this to ~~specify.~~ provide additional information to JSON-LD based toolchains.

Issue 10

We invite comment on whether we should include properties that help in checking the integrity of the file: datapackage includes bytes and hash. We could reuse the Subresource Integrity work here.

The description MAY contain any ~~of the~~ common properties as defined in section 3.3 Common Properties to provide extra metadata about the table as a whole.

The description MAY contain any of the properties defined in section 2.2 Inherited Properties to describe cells within the table. ~~As well as links~~

3.6 Dialect Descriptions

Much of the tabular data that is published on the web is messy, and CSV parsers frequently need to ~~other related tables,~~ be configured in order to correctly read in CSV. A dialect description provides hints to parsers about how to parse the file linked to from the @id property. It can have any of the following ~~common properties are particularly suitable for tables:~~ properties, which relate to the flags described in Section 5 Parsing Tabular Data within [ tabular-data-model ]:

created encoding: An atomic property that sets the encoding flag to the single provided string value, which MUST be a defined [ encoding ].
creator lineTerminator: An atomic property that sets the line terminator flag to the single provided string value.
description quoteChar: An atomic property that sets the quote character flag to the single provided value, which MUST be a single character.
language doubleQuote: A single boolean atomic property that, if true, sets the escape character flag to ". If false, to \.
license skipRows: An atomic property that sets the skip rows flag to the single provided numeric value, which MUST be a non-negative integer.
modified commentPrefix: An atomic property that sets the comment prefix flag to the single provided value, which MUST be a single character string.
provenance header: A single boolean atomic property that, if true, sets the header row count flag to 1, and if false to 0, unless publisher headerRowCount is provided, in which case the value provided for the header property is ignored.
rights headerRowCount: An atomic property that sets the header row count flag to the single provided value, which MUST be a non-negative integer.
rightsHolder delimiter: An atomic property that sets the delimiter flag to the single provided value, which MUST be a single character string.
source skipColumns: An atomic property that sets the skip columns flag to the single provided numeric value, which MUST be a non-negative integer.
spatial headerColumnCount: An atomic property that sets the header column count flag to the single provided value, which MUST be non-negative integer.
subject skipBlankRows: An atomic property that sets the skip blank rows flag to the single provided boolean value.
temporal skipInitialSpace: A single boolean atomic property that, if true, sets the trim flag to "start". If false, to false. If the trim property is provided, the skipInitialSpace property is ignored.
trim: A single atomic property that, if the boolean true, sets the trim flag to true and if the boolean false to false. If the value provided is a string, sets the trim flag to the provided value, which MUST be one of "true","false","start" or "end".
@type: If included, @type MUST be set to "Dialect". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The default dialect description for CSV files is:

{

  "encoding": "utf-8",
  "lineTerminator": "\r\n",
  "quoteChar": "\"",
  "doubleQuote": true,
  "skipRows": 0,
  "header": true,
  "headerRowCount": 1,
  "delimiter": ",",
  "skipColumns": 0,
  "headerColumnCount": 0,
  "skipBlankRows": false,
  "skipInitialSpace": false,
  "trim": false
}

3.7 Template Specifications

A template specification is a definition of how tabular data can be transformed into another format. It has the following properties:

3.7.1 Required Properties

Template specifications MUST have the following properties:

targetFormat: A URL for the format that will be created through the transformation. If one has been defined, this should be a URL for a media type, in the form http://www.iana.org/assignments/media-types/ media-type such as http://www.iana.org/assignments/media-types/text/calendar. Otherwise, it can be any URL that describes the target format.
Note
The targetFormat URL is intended as an informative identifier for the target format, and applications MAY NOT access the URL.
templateFormat: A URL for the format that is used by the template. If one has been defined, this should be a URL for a media type, in the form http://www.iana.org/assignments/media-types/ media-type such as http://www.iana.org/assignments/media-types/application/javascript. Otherwise, it can be any URL that describes the template format.
Note
The templateFormat URL is intended as an informative identifier for the template format, and applications MAY NOT access the URL. The template formats that an application supports are implementation defined.

3.7.2 Optional Properties

Template specifications MAY have the following properties:

title: A natural language property that describes the format that will be generated from the transformation. This is useful if the target format is a generic format (such as application/json ) and the transformation is creating a specific profile of that format.
source: A single string atomic property that provides, if included, the format to which the tabular data should be transformed prior to the transformation using the template. If the value is "json", the tabular data should first be transformed first to JSON based on the simple mapping defined in Generating JSON from Tabular Data on the Web . If the value is "rdf", it should similarly first be transformed to XML based on the simple mapping defined in Generating RDF from Tabular Data on the Web . If the source property is missing or null then the source of the transformation is the annotated tabular data model.
@type: If included, @type MUST be set to "Template". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The template specification MAY contain any common properties as defined in section 3.3 Common Properties to provide extra metadata about the transformation.

3.7.3 Example

The following template specification will enable a processor that supports it to generate an iCalendar document using a Mustache template based on the JSON created from the simple mapping to JSON.

Example 19

{

  "title": "iCalendar",
  "targetFormat": "http://www.iana.org/assignments/media-types/text/calendar",
  "templateFormat": "https://mustache.github.io/",
  "source": "json"
}

3.5 3.8 Schemas

A schema is a definition of a tabular format that may be common to multiple tables. For example, multiple tables from different sources may have the same columns and be designed such that they can be aggregated together.

A schema description is a JSON object that encodes the information about a schema. All the properties of a schema description are optional.

columns

~~@type If included, @type MUST be set to "Schema" . Publishers MAY include this to provide additional information to JSON-LD based toolchains.~~

An array property of column descriptions as described in section ~~3.6~~ 3.9 Columns . These are matched to columns in ~~table~~ tables that use the schema by position: the first column description in the array applies to the first column in the table, the second to the second and so on.

The name properties of the column descriptions MUST be unique within a given table description.

~~rows~~

An When an array of ~~row~~ column descriptions ~~as described in section 3.7 Rows . These are matched to row by the value~~ B is imported into an original array of column descriptions A, each column description within B is combined into the ~~row in the row description. The values of~~ original array A by:

if there is a column description at the ~~row properties MUST be unique~~ same index within ~~a given table~~ A and that column description ~~(ie no row can have more than one description).~~ has the same cells An array of cell descriptions as described in section 3.8 Cells . These are matched to cell by name, the ~~value of~~ column description from B is imported into the ~~row and~~ matching column ~~properties~~ description in A
otherwise, the ~~cell description. The combination of values of the row and~~ column ~~properties MUST be unique within a given table~~ description ~~(ie no cell can have more than one description).~~ is ignored

primaryKey

~~An internal~~ A column reference property that holds either a single ~~references~~ reference to a column description object or an array of references.

Validators MUST check that each row has a unique combination of cells in the indicated columns. For example, if primaryKey is set to ["_:familyName", "_:givenName"] ["familyName", "givenName"] then every row must have a unique value for the combination of the familyName and givenName columns.

Issue 11 66

~~When referencing columns for a~~ Composite primary ~~key, it~~ keys and foreign key references.

foreignKeys

An array property of foreign key definitions that define how the values from specified columns within this table link to rows within this table or other tables. A foreign key definition is a ~~lot clearer to~~ JSON object with the properties:

columns

A column reference property that holds either a single reference ~~them by name rather than by number. For JSON-LD compatibility, we have~~ to ~~assign~~ a ~~blank node~~ column description object within this schema, or an array of references.

reference

An object with the properties:

resource: A link property holding a URL that is the identifier for a specific resource that is being referenced. If this is present then schema MUST NOT be present. The metadata document MUST contain a description of the resource.
schema: A link property holding a URL that is the identifier for a schema that is being referenced. If this is present then resource MUST NOT be present. The metadata document that forms the basis of processing MUST contain a description of a resource that uses the referenced schema, and there MUST NOT be more than one such resource.
columns: A column reference property that holds either a single reference to ~~each~~ a column ~~even though they each~~ description object within this schema, or an array of references.

Note

It is not required for the resource or schema referenced from a foreignKeys property to have a similarly defined name primaryKey.

When an array of foreign key definitions B is imported into an original array of foreign key definitions A, each foreign key definition within B which does not appear within A is appended to the original array A.

Issue 16

The cross reference between files should be limited to files from one publisher - else they are just web links with no guarantee of whether the target of the link exists which 'foreign key' might imply.

urlTemplate

A URI template property that ~~could~~ MAY be used ~~instead. We invite comment on how~~ to ~~make this easier~~ create a unique identifier for ~~people~~ each row when mapping data to ~~use while maintaining JSON-LD compatibility.~~ other formats.

@type

If included,


@type

MUST be set to


"Schema"

. Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The description MAY contain any ~~of the~~ common properties as defined in section 3.3 Common Properties to ~~describe~~ provide extra metadata about the ~~schema. As well~~ schema as ~~links to other related schemas,~~ a whole.

The description MAY contain any of the ~~following common~~ inherited properties ~~are particularly suitable~~ defined for ~~schemas:~~ cells in section 2.2 Inherited Properties .

created 3.8.1 Examples creator

3.8.1.1 Foreign Key Reference Between Resources

A list of countries is published at description http://example.org/countries.csv with the structure:

Example 20

countryCode,latitude,longitude,name

AD,42.546245,1.601554,Andorra
AE,23.424076,53.847818,"United Arab Emirates"
AF

,

license


33.93911

,


67.709953
,

Afghanistan

Another file contains information about the population in some countries each year, at modified http://example.com/country_slice.csv with the structure:

Example 21

countryRef,year,population

AF,1960,9616353
AF,1961,9799379
AF

,


1962

,


9989846

The following metadata for the group of tables links the two together by defining a publisher foreignKeys property:

Example 22

{
  "@context": "http://www.w3.org/ns/csvw",
  "resources": [{
    "@id": "https://example.org/countries.csv",
    "schema": {
      "columns": [{
        "name": "countryCode",
        "datatype": "string"
      }, {
        "name": "latitude",
        "datatype": "number"
      }, {
        "name": "longitude",
        "datatype": "number"
      }, {
        "name": "name",
        "datatype": "string"
      }],
      "urlTemplate": "http://example.org/countries.csv{#countryCode}",
      "primaryKey": "countryCode"
    }
  }, {
    "@id": "http://example.com/country_slice.csv",
    "schema": {
      "columns": [{
        "name": "countryRef",
        "datatype": "string"
      }, {
        "name": "year",
        "datatype": "gYear"
      }, {
        "name": "population",
        "datatype": "integer"
      }],
      "foreignKeys": [{
        "columns": "countryRef",
        "reference": {
          "resource": "http://example.org/countries.csv",
          "columns": "countryCode"
        }
      }]
    }
  }]


}

When the population data in rights country_slice.csv is processed (displayed or mapped into another format), a link can be made from the content of the countryRef column based on the urlTemplate for country.csv. For example, if the countryRef column (the value of columns in the foreignKeys object) in country_slice.csv contains the value UK then the processor will use that value to populate the countryCode variable (the value of reference.columns in the foreignKeys object) when interpreting the urlTemplate for country.csv, and create the URL http://example.org/countries.csv#UK. The processor does not need to retrieve http://example.org/countries.csv or check that the value UK appears within the countryCode column to create this link: it is created purely based on the urlTemplate in the description of the referenced resource.

3.8.1.2 Foreign Key Reference Between Schemas

When publishing information about public sector roles and salaries, as in Use Case 4 , the UK government requires departments to publish two files which are interlinked. The first lists senior grades (simplified here) eg at rightsHolder HEFCE_organogram_senior_data_31032011.csv:

Example 23

Post Unique Reference,              Name,Grade,             Job Title,Reports to Senior Post

                90115,        Steve Egan,SCS1A,Deputy Chief Executive,                 90334
                90250,     David Sweeney,SCS1A,              Director,                 90334
                90284,       Heather Fry,SCS1A,              Director,                 90334
90334,Sir
Alan
Langlands,
SCS4,
Chief
Executive,
xx

The second provides information about the number of junior positions that report to those individuals (simplified here) eg at HEFCE_organogram_junior_data_31032011.csv:

Example 24

Reporting Senior Post,Grade,Payscale Minimum (�),Payscale Maximum (�),Generic Job Title,Number of Posts in FTE,          Profession

                90284,    4,               17426,               20002,    Administrator,                     2,Operational Delivery
                90284,    5,               19546,               22478,    Administrator,                     1,Operational Delivery
                90115,    4,               17426,               20002,    Administrator,                  8.67,Operational Delivery
90115,
5,
19546,
22478,
Administrator,
0.5,Operational
Delivery

The schemas are reused by multiple departments and for multiple pairs of files. The schemas are therefore defined in separate files, and they need to define links between the schemas which are then picked up as applying between tables that use those schemas.

The metadata file for the particular publication of the files above is:

Example 25

{
  "@context": "http://www.w3.org/ns/csvw",
  "resources": [{
    "@id": "HEFCE_organogram_senior_data_31032011.csv",
    "schema": "http://example.org/schema/senior-roles.json"
  }, {
    "@id": "HEFCE_organogram_junior_data_31032011.csv",
    "schema": "http://example.org/schema/junior-roles.json"
  }]


}

The schema for the senior role CSV (at subject http://example.org/schema/senior-roles.json ) is as follows; it includes a foreign key reference to itself:

Example 26

{
  "@context": "http://www.w3.org/ns/csvw",
  "@id": "http://example.org/schema/senior-roles.json",
  "columns": [{
    "name": "ref",
    "title": "Post Unique Reference"
  }, {
    "name": "name",
    "title": "Name"
  }, {
    "name": "grade",
    "title": "Grade"
  }, {
    "name": "job",
    "title": "Job Title"
  }, {
    "name": "reportsTo",
    "title": "Reports to Senior Post"
  }],
  "primaryKey": "ref",
  "urlTemplate": "#post-{ref}",
  "foreignKeys": [{
    "columns": "reportsTo",
    "reference": {
      "schema": "http://example.org/schema/senior-roles.json",
      "columns": "ref"
    }
  }]


}

The schema for the junior role CSV (at title http://example.org/schema/junior-roles.json ) is as follows; it includes a foreign key reference to the senior roles schema:

Example 27

{
  "@context": "http://www.w3.org/ns/csvw",
  "@id": "http://example.org/schema/junior-roles.json",
  "columns": [{
    "name": "reportsTo",
    "title": "Reporting Senior Post"
  }, 
  ...
  ],
  "foreignKeys": [{
    "columns": "reportsTo",
    "reference": {
      "schema": "http://example.org/schema/senior-roles.json",
      "columns": "ref"
    }
  }]


}

~~The description MAY contain any~~ In the first line of HEFCE_organogram_junior_data_31032011.csv, the ~~inherited properties~~ reportsTo ( Reporting Senior Post ) column contains the value 90284. When creating a link from that column, the urlTemplate defined within the schema at http://example.org/schema/senior-roles.json is used to generate a URL by expanding the variable reference for ~~cells in section 2.1.2 Inherited Properties .~~ ref based on the value from the reportsTo column. This gives the relative URL #post-90284 which is then resolved against the base URL of the resource that uses the senior-roles.json schema within the original metadata file, namely HEFCE_organogram_senior_data_31032011.csv.

3.6 3.9 Columns

A column description is a simple JSON object that describes a single column. The description provides additional human-readable documentation for a column, as well as additional information that may be used to validate the cells within the column, create a user interface for data entry, or inform conversion into other formats.

Issue 64

Should there be a way to suppress columns?

3.6.1 3.9.1 Required Properties

name

An atomic property that gives a single canonical name for the column. This MUST be a string. Conversion specifications MUST use this property as the basis for the names of properties/elements/attributes in the results of conversions.

For ease of reference within URI template properties , column names SHOULD consist only of alphanumeric characters or underscores ( [a-zA-Z0-9_]+ ). Names beginning with _ are reserved by this specification and MUST NOT be used.

Issue 12 33

What do to with conversion if no column name is given?

Issue 53

We invite comment on what the syntactic limitations should be on column names to make them most useful when used as the basis of conversion into other formats, bearing in mind that different target languages such as JSON, RDF and XML have different syntactic limitations and common naming conventions.

During validation, if there is no title property and the column already has a title annotation then a validator MUST issue a warning if the existing title annotation does not match the name specified in the column description.

3.6.2 3.9.2 Optional Properties

title

A natural language property that provides possible alternative names for the column. The possible column titles are defined as:

if the value of title is a string, that string
if the value of title is an array, the strings in that array
if the value of title is an object, the string or strings that are the value of the property of that object whose name is the column language

where the column language is the value of the language property on the column description, or (if there is no such language), the value of the language property on the table description.

If the column already has a title annotation (because a header row has been included in the original CSV file) then a validator MUST issue a warning if the existing title annotation is not the same as any of the possible column titles .

~~Note~~ Issue 56

The facility to specify multiple potential titles for a column is important when the same column description is used for multiple CSVs, through a mechanism yet to be defined by this specification.

required

~~@type If included, @type MUST be set to "Column" . Publishers MAY include this to provide additional information to JSON-LD based toolchains.~~ A boolean atomic property taking a single value which indicates whether every cell within the column must have a non-null value.

row predicateUrl

The description MAY contain any of the inherited properties defined for cells in section 2.1.2 Inherited Properties . 3.7 Rows Rows can be described using row description objects . A row description object is a JSON object within a metadata file that includes properties that describe an individual row. 3.7.1 Required Properties The following properties MUST appear on a row description: ~~an integer; the number of the row~~ An atomic property that holds one or more URIs that MAY be used as URIs for predicates if the ~~description object describes~~ table is mapped to another format.

@type

~~3.7.2 Optional Properties~~

If included, @type MUST be set to "Row" "Column". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The description MAY contain any ~~of the inherited~~ common properties as defined ~~for cells~~ in section ~~2.1.2~~ 3.3 ~~Inherited~~ Common Properties ~~. 3.8 Cells Cells can be described using cell description objects . A cell description object is a JSON object within a~~ to provide extra metadata file that includes properties that describe an individual cell. 3.8.1 Required Properties The following properties MUST appear on a cell description: row an integer; the number of the row on which the cell appears column an integer; the number of about the column ~~on which the cell appears 3.8.2 Optional Properties @type If included, @type MUST be set to "Cell" . Publishers MAY include this to provide additional information to JSON-LD based toolchains.~~ as a whole, such as a full description.

The description MAY contain any of the inherited properties defined for cells in section ~~2.1.2~~ 2.2 Inherited Properties .

3.8.3 3.10 Inherited Properties

Cell descriptions may override inherited properties , as described in section ~~2.1~~ 2. Annotating Tables . It is good practice to define these properties on columns, so that all cells within a given column are handled in the same ~~way.~~ way, or on tables if appropriate. These properties are:

null: ~~The~~ An atomic property giving the string or strings used for null values. If not specified, the default for this is the empty string.
language: An atomic property giving a single string language code as defined by [ BCP47 ]. Indicates the language of the value within the cell.
text-direction: An atomic property that MUST have a single string value that is one of "rtl" or "ltr" (the default). Indicates whether the text within cells should be displayed by default as left-to-right or right-to-left text. See section 4.1.1 Bidirectional Tables for more details.
separator: ~~The~~ An atomic property that MUST have a single string value that is the character used to separate items in the string value of the cell. If null , or unspecified, the cell does not contain a list. Otherwise, application MUST split the string value of the cell on the specified separator character and parse each of the resulting strings separately. The cell's value will then be a list. Conversion specifications MUST use the separator to determine the conversion of a cell into the target format. See ~~3.8.5~~ 3.12 Parsing cells for more details.
default: An atomic property holding a single string that provides a default string value for the cell in cases where the original string value is a null value. This default value MAY be used when converting the table into other formats.
format: A An atomic property that contains a single string that is the definition of the format of the cell, used when parsing the cell as described in ~~3.8.5~~ 3.12 Parsing cells .
datatype: ~~The~~ An atomic property that contains a single string that is the main datatype of the values of the cell. If the cell contains a list (ie separator is specified and not null ) then this is the datatype of each value within the list. Conversion specifications MUST use the datatype of the value to determine the conversion of a cell into the target format. See ~~3.8.4~~ 3.11 Datatypes for more details.
length: ~~The~~ An atomic property that contains a single integer that is the exact length of the value of the cell. See section ~~3.8.4.1~~ 3.11.1 Length Constraints for details.
minLength: ~~The~~ An atomic property that contains a single integer that is the minimum length of the value of the cell. See section ~~3.8.4.1~~ 3.11.1 Length Constraints for details.
maxLength: ~~The~~ An atomic property that contains a single integer that is the maximum length of the value of the cell. See section ~~3.8.4.1~~ 3.11.1 Length Constraints for details.
minimum: ~~The~~ An atomic property that contains a single number that is the minimum value for the cell (inclusive); equivalent to minInclusive. See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.
maximum: ~~The~~ An atomic property that contains a single number that is the maximum value for the cell (inclusive); equivalent to maxInclusive. See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.
minInclusive: ~~The~~ An atomic property that contains a single number that is the minimum value for the cell (inclusive). See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.
maxInclusive: ~~The~~ An atomic property that contains a single number that is the maximum value for the cell (inclusive). See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.
minExclusive: ~~The~~ An atomic property that contains a single number that is the minimum value for the cell (exclusive). See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.
maxExclusive: ~~The~~ An atomic property that contains a single number that is the maximum value for the cell (exclusive). See section ~~3.8.4.2~~ 3.11.2 Value Constraints for details.

3.8.4 3.11 Datatypes

Cells within tables may be annotated with a datatype which indicates the type of the value obtained by parsing the value of the cell. The format expected in the cell is determined by the format annotation, if there is one, or uses a default format determined by the type.

The possible datatypes are:

the datatypes defined in [ xmlschema-2 ] with the exception of those that rely on XML mechanisms for definition, namely:
- anySimpleType
- string ; a sub-value of anySimpleType
- normalizedString ; a sub-value of string
- token ; a sub-value of normalizedString
- language ; a sub-value of token
- Name ; a sub-value of token
- NCName ; a sub-value of Name
- boolean ; a sub-value of anySimpleType
- decimal ; a sub-value of anySimpleType
- integer ; a sub-value of decimal
- nonPositiveInteger ; a sub-value of integer
- negativeInteger ; a sub-value of nonPositiveInteger
- long ; a sub-value of integer
- int ; a sub-value of long
- short ; a sub-value of int
- byte ; a sub-value of short
- nonNegativeInteger ; a sub-value of integer
- unsignedLong ; a sub-value of nonNegativeInteger
- unsignedInt ; a sub-value of unsignedLong
- unsignedShort ; a sub-value of unsignedInt
- unsignedByte ; a sub-value of unsignedShort
- positiveInteger ; a sub-value of nonNegativeInteger
- float ; a sub-value of anySimpleType
- double ; a sub-value of anySimpleType
- duration ; a sub-value of anySimpleType
- dateTime ; a sub-value of anySimpleType
- time ; a sub-value of anySimpleType
- date ; a sub-value of anySimpleType
- gYearMonth ; a sub-value of anySimpleType
- gYear ; a sub-value of anySimpleType
- gMonthDay ; a sub-value of anySimpleType
- gDay ; a sub-value of anySimpleType
- gMonth ; a sub-value of anySimpleType
- hexBinary ; a sub-value of anySimpleType
- base64Binary ; a sub-value of anySimpleType
- anyURI ; a sub-value of anySimpleType
the datatype number which is exactly equivalent to double
the datatype binary which is exactly equivalent to base64Binary
the datatype datetime which is exactly equivalent to dateTime
the datatype geopoint any which ~~indicates a comma-separated longitude and latitude (ie values that after stripping leading and trailing whitespace are in the format longitude \s*,\s* latitude ); a sub-value of~~ is exactly equivalent to anySimpleType Issue 13 In JSON Table Schema, geopoint permits values in JSON representations of points, namely { lon: longitude , lat: latitude } and [ longitude , latitude ] . We invite comment about whether these types are suitable for CSV files. If they are, we suggest that these additional formats for geopoint are supported through the format property.
the datatype any xml which ~~is exactly equivalent to anySimpleType~~ indicates the cell contains an XML fragment

~~Issue 14 The JSON Table Schema also includes object ,~~

the datatype array html ~~and geojson . We invite comment on whether we should we support~~ which indicates the ~~inclusion of JSON-based structures within CSV files. Issue 15 We invite comment on whether~~ cell contains an HTML fragment
the datatype any json ~~type is useful. Issue 16 We invite comment on whether there should be types for formats like XML, HTML and markdown~~ which ~~may appear within CSV cells.~~ indicates the cell contains serialized JSON

3.8.4.1 3.11.1 Length Constraints

The length, minLength and maxLength properties indicate the exact, minimum and maximum lengths of the values of cells.

Applications MUST raise an error if both length and minLength are specified and they do not have the same value. Similarly, applications MUST raise an error if both length and maxLength are specified and they do not have the same value. Applications MUST raise an error if length, maxLength or minLength are specified and the cell value is not a list (ie separator is not specified), a string or one of its subtypes, or a binary value.

The length of a value of a cell is determined as follows:

if the cell is null its length is zero
if the value is a list, its length is the number of items in the list
if the value is a string or one of its subtypes, its length is the number of characters in the value
if the value is of a binary type, its length is the number of bytes in the binary value

3.8.4.2 3.11.2 Value Constraints

The minimum, maximum, minInclusive, maxInclusive, minExclusive and maxExclusive properties indicate limits on the values of cells. These apply to numeric and date/time types. The minimum property is equivalent to the minInclusive property and the maximum property is equivalent to the maxInclusive property.

Validation against these properties is as defined in [ xmlschema-2 ].

3.8.5 3.12 Parsing cells

Unlike many other data formats, tabular data is designed to be read by humans. For that reason, it's common for data to be represented within tabular data in a human-readable way. The separator and format properties indicates the format used to represent data within the table. This is used:

by validators to check that the data in the table is in the expected format
by converters to parse the values before mapping them into values in the target of the conversion
when displaying data, to map it into formats that are meaningful for those viewing the data (as opposed to those publishing it)
when inputting data, to turn entered values into representations in a consistent format

The process of parsing the string value of a cell into a single value or a list of values is as follows:

Issue 61

What should be the mapping of an empty cell?

unless the datatype is string or anySimpleType or any, strip leading and trailing whitespace from the value
if the value is the same as the null value, then the value is null
if the separator property is not null, create a list of values by splitting the string at the character specified by the separator property
validate the value(s) against the format, if one is specified, as described below; raise an error if any of the values do not match the specified format
parse the value(s) using the format, as described below

3.8.5.1 3.12.1 Formats for strings

If the datatype is a string type, the format property provides a regular expression for the string values, in the syntax defined by [ ECMASCRIPT ].

Issue 17 55

We invite comment about which reference to use for regular expression syntax. Other possibilities are to use that defined by XML Schema or XPath.

3.8.5.2 3.12.2 Formats for numeric types

It is not uncommon for numbers within tabular data to be formatted for human consumption, which may involve using commas for decimal points, grouping digits in the number using commas, or adding currency symbols or percent signs to the number.

If the datatype is a numeric type, the format property indicates the expected format for that number. Validators MUST check that the numbers in the column adhere to the specified format. Converters MUST use the format property to parse the number when mapping it into a suitable type in the target language of the conversion.

When the datatype is a numeric type, the format property's value MUST be a number format as specified in [ xslt-21 ].

Issue 18 54

We invite comment on the best format to specify how to parse numbers.

Issue 65

3.8.5.3 3.12.3 Formats for booleans

Boolean values may be represented in many ways aside from the standard 1 and 0 or true and false.

If the datatype is boolean, the format property provides the true and false values expected, separated by |. For example if format is Y|N then cells must hold either Y or N with Y meaning true and N meaning false.

3.8.5.4 3.12.4 Formats for dates and times

Dates and times are commonly represented in tabular data in formats other than those defined in [ xmlschema-2 ].

If the datatype is a date or time type, the format property indicates the expected format for that date or time. Validators MUST check that the dates or times in the column adhere to the specified format. Converters MUST use the format property to parse the date or time when mapping it into a suitable type in the target language of the conversion.

When the datatype is a date or time type, the format property's value MUST be a date/time format as specified in [ xslt-21 ].

Issue 19 54

We invite comment on which format to use when parsing dates and times.

3.8.5.5 3.12.5 Formats for durations

Issue 20 54

We invite comment on whether there are standard formats to use when parsing durations.

D. JSON-LD Context

The JSON-LD context, located at http://www.w3.org/ns/csvw.jsonld is used with metadata documents.

{

  "@context": {
    "id": "@id",
    "type": "@type",
    "dc:title": {
      "@container": "@language"
    },
    "dc:description": {
      "@container": "@language"
    },
    "rdfs:comment": {
      "@container": "@language"
    },
    "rdfs:domain": {
      "@type": "@id"
    },
    "rdfs:label": {
      "@container": "@language"
    },
    "rdfs:range": {
      "@type": "@id"
    },
    "rdfs:subClassOf": {
      "@type": "@id"
    },
    "rdfs:subPropertyOf": {
      "@type": "@id"
    },
    "owl:equivalentClass": {
      "@type": "@vocab"
    },
    "owl:equivalentProperty": {
      "@type": "@vocab"
    },
    "owl:oneOf": {
      "@container": "@list",
      "@type": "@vocab"
    },
    "owl:imports": {
      "@type": "@id"
    },
    "owl:versionInfo": {
      "@type": "xsd:string",
      "@language": null
    },
    "owl:inverseOf": {
      "@type": "@vocab"
    },
    "owl:unionOf": {
      "@type": "@vocab",
      "@container": "@list"
    },
    "rdfs_classes": {
      "@reverse": "rdfs:isDefinedBy",
      "@type": "@id"
    },
    "rdfs_properties": {
      "@reverse": "rdfs:isDefinedBy",
      "@type": "@id"
    },
    "rdfs_datatypes": {
      "@reverse": "rdfs:isDefinedBy",
      "@type": "@id"
    },
    "rdfs_instances": {
      "@reverse": "rdfs:isDefinedBy",
      "@type": "@id"
    },
    "cc": "http://creativecommons.org/ns#",
    "csvw": "http://www.w3.org/ns/csvw#",
    "ctag": "http://commontag.org/ns#",
    "dc": "http://purl.org/dc/terms/",
    "dc11": "http://purl.org/dc/elements/1.1/",
    "dcat": "http://www.w3.org/ns/dcat#",
    "dcterms": "http://purl.org/dc/terms/",
    "earl": "http://www.w3.org/ns/earl#",
    "foaf": "http://xmlns.com/foaf/0.1/",
    "gr": "http://purl.org/goodrelations/v1#",
    "grddl": "http://www.w3.org/2003/g/data-view#",
    "ical": "http://www.w3.org/2002/12/cal/icaltzd#",
    "ma": "http://www.w3.org/ns/ma-ont#",
    "og": "http://ogp.me/ns#",
    "org": "http://www.w3.org/ns/org#",
    "owl": "http://www.w3.org/2002/07/owl#",
    "prov": "http://www.w3.org/ns/prov#",
    "qb": "http://purl.org/linked-data/cube#",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfa": "http://www.w3.org/ns/rdfa#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "rev": "http://purl.org/stuff/rev#",
    "rif": "http://www.w3.org/2007/rif#",
    "rr": "http://www.w3.org/ns/r2rml#",
    "schema": {
      "@id": "csvw:schema",
      "@type": "@id"
    },
    "sd": "http://www.w3.org/ns/sparql-service-description#",
    "sioc": "http://rdfs.org/sioc/ns#",
    "skos": "http://www.w3.org/2004/02/skos/core#",
    "skosxl": "http://www.w3.org/2008/05/skos-xl#",
    "v": "http://rdf.data-vocabulary.org/#",
    "vcard": "http://www.w3.org/2006/vcard/ns#",
    "void": "http://rdfs.org/ns/void#",
    "wdr": "http://www.w3.org/2007/05/powder#",
    "wrds": "http://www.w3.org/2007/05/powder-s#",
    "xhv": "http://www.w3.org/1999/xhtml/vocab#",
    "xml": "rdf:XMLLiteral",
    "xsd": "http://www.w3.org/2001/XMLSchema#",
    "any": "xsd:anySimpleType",
    "binary": "xsd:base64Binary",
    "datetime": "xsd:dateTime",
    "describedby": "wrds:describedby",
    "html": "rdf:HTML",
    "license": "xhv:license",
    "maximum": "csvw:maxInclusive",
    "minimum": "csvw:minInclusive",
    "number": "xsd:double",
    "role": "xhv:role",
    "Column": "csvw:Column",
    "Dialect": "csvw:Dialect",
    "Direction": "csvw:Direction",
    "Schema": "csvw:Schema",
    "Table": "csvw:Table",
    "TableGroup": "csvw:TableGroup",
    "Template": "csvw:Template",
    "columns": {
      "@id": "csvw:columns",
      "@type": "@id",
      "@container": "@list"
    },
    "commentPrefix": {
      "@id": "csvw:commentPrefix"
    },
    "datatype": {
      "@id": "csvw:datatype"
    },
    "default": {
      "@id": "csvw:default"
    },
    "delimiter": {
      "@id": "csvw:delimiter"
    },
    "dialect": {
      "@id": "csvw:dialect",
      "@type": "@id"
    },
    "doubleQuote": {
      "@id": "csvw:doubleQuote",
      "@type": "xsd:boolean"
    },
    "encoding": {
      "@id": "csvw:encoding"
    },
    "foreignKeys": {
      "@id": "csvw:foreignKeys"
    },
    "format": {
      "@id": "csvw:format"
    },
    "header": {
      "@id": "csvw:header",
      "@type": "xsd:boolean"
    },
    "headerColumnCount": {
      "@id": "csvw:headerColumnCount",
      "@type": "xsd:nonNegativeInteger"
    },
    "headerRowCount": {
      "@id": "csvw:headerRowCount",
      "@type": "xsd:nonNegativeInteger"
    },
    "language": {
      "@id": "csvw:language"
    },
    "length": {
      "@id": "csvw:length",
      "@type": "xsd:nonNegativeInteger"
    },
    "lineTerminator": {
      "@id": "csvw:lineTerminator"
    },
    "maxExclusive": {
      "@id": "csvw:maxExclusive"
    },
    "maxInclusive": {
      "@id": "csvw:maxInclusive"
    },
    "maxLength": {
      "@id": "csvw:maxLength",
      "@type": "xsd:nonNegativeInteger"
    },
    "minExclusive": {
      "@id": "csvw:minExclusive"
    },
    "minInclusive": {
      "@id": "csvw:minInclusive"
    },
    "minLength": {
      "@id": "csvw:minLength",
      "@type": "xsd:nonNegativeInteger"
    },
    "name": {
      "@id": "csvw:name"
    },
    "notes": {
      "@id": "csvw:notes"
    },
    "null": {
      "@id": "csvw:null"
    },
    "predicateUrl": {
      "@id": "csvw:predicateUrl",
      "@type": "xsd:anyURI"
    },
    "primaryKey": {
      "@id": "csvw:primaryKey"
    },
    "quoteChar": {
      "@id": "csvw:quoteChar"
    },
    "required": {
      "@id": "csvw:required",
      "@type": "xsd:boolean"
    },
    "resources": {
      "@id": "csvw:resources",
      "@type": "@id",
      "@container": "@set"
    },
    "row": {
      "@id": "csvw:row",
      "@container": "@set"
    },
    "separator": {
      "@id": "csvw:separator"
    },
    "skipBlankRows": {
      "@id": "csvw:skipBlankRows",
      "@type": "xsd:boolean"
    },
    "skipColumns": {
      "@id": "csvw:skipColumns",
      "@type": "xsd:nonNegativeInteger"
    },
    "skipInitialSpace": {
      "@id": "csvw:skipInitialSpace",
      "@type": "xsd:boolean"
    },
    "skipRows": {
      "@id": "csvw:skipRows",
      "@type": "xsd:nonNegativeInteger"
    },
    "source": {
      "@id": "csvw:source"
    },
    "table": {
      "@id": "csvw:table",
      "@type": "@id",
      "@container": "@set"
    },
    "table-direction": {
      "@id": "csvw:table-direction",
      "@type": "@vocab"
    },
    "targetFormat": {
      "@id": "csvw:targetFormat"
    },
    "templateFormat": {
      "@id": "csvw:templateFormat"
    },
    "templates": {
      "@id": "csvw:templates",
      "@type": "@id"
    },
    "text-direction": {
      "@id": "csvw:text-direction",
      "@type": "@vocab"
    },
    "title": {
      "@id": "csvw:title",
      "@container": "@language"
    },
    "trim": {
      "@id": "csvw:trim",
      "@type": "xsd:boolean"
    },
    "uriTemplate": {
      "@id": "csvw:uriTemplate"
    },
    "json": "csvw:json"
  },
  "@id": "http://www.w3.org/ns/csvw#",
  "@type": "owl:Ontology",
  "dc:title": {
    "en": "Metadata Vocabulary for Tabular Data"
  },
  "dc:description": {
    "en": "Validation, conversion, display and search of tabular data on the web\n    requires additional metadata that describes how the data should be\n    interpreted. This document defines a vocabulary for metadata that\n    annotates tabular data. This can be used to provide metadata at various\n    levels, from collections of data from CSV documents and how they relate\n    to each other down to individual cells within a table."
  },
  "rdfs_classes": [
    {
      "@id": "csvw:Column",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Column Description"
      },
      "rdfs:comment": {
        "en": "A Column Description describes a single column."
      }
    },
    {
      "@id": "csvw:Dialect",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Dialect Description"
      },
      "rdfs:comment": {
        "en": "A Dialect Description provides hints to parsers about how to parse a linked file."
      }
    },
    {
      "@id": "csvw:Direction",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Direction"
      },
      "rdfs:comment": {
        "en": "The class of table/text directions."
      }
    },
    {
      "@id": "csvw:Schema",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Schema"
      },
      "rdfs:comment": {
        "en": "A Schema is a definition of a tabular format that may be common to multiple tables."
      }
    },
    {
      "@id": "csvw:Table",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Table Description"
      },
      "rdfs:comment": {
        "en": "A table description is a JSON object that describes a table within a CSV file."
      }
    },
    {
      "@id": "csvw:TableGroup",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Table Group Description"
      },
      "rdfs:comment": {
        "en": "A Table Group Description describes a group of Tables."
      }
    },
    {
      "@id": "csvw:Template",
      "@type": "rdfs:Class",
      "rdfs:label": {
        "en": "Template Specification"
      },
      "rdfs:comment": {
        "en": "A Template Specification is a definition of how tabular data can be transformed into another format."
      }
    }
  ],
  "rdfs_properties": [
    {
      "@id": "csvw:columns",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "columns"
      },
      "rdfs:comment": {
        "en": "An array of Column Descriptions."
      },
      "rdfs:domain": "csvw:Schema",
      "rdfs:range": "csvw:Column"
    },
    {
      "@id": "csvw:commentPrefix",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "comment prefix"
      },
      "rdfs:comment": {
        "en": "A character that, when it appears at the beginning of a skipped row, indicates a comment that should be associated as a comment annotation to the table. The default is \"#\"."
      },
      "rdfs:domain": "csvw:Dialect"
    },
    {
      "@id": "csvw:datatype",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "datatype"
      },
      "rdfs:comment": {
        "en": "The main datatype of the values of the cell. If the cell contains a list (ie separator is specified and not null) then this is the datatype of each value within the list."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:default",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "default"
      },
      "rdfs:comment": {
        "en": "An atomic property holding a single string that provides a default string value for the cell in cases where the original string value is a null value. This default value may be used when converting the table into other formats."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:delimiter",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "delimiter"
      },
      "rdfs:comment": {
        "en": "The separator between cells. The default is \",\"."
      },
      "rdfs:domain": "csvw:Dialect"
    },
    {
      "@id": "csvw:dialect",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "dialect"
      },
      "rdfs:comment": {
        "en": "Provides hints to processors about how to parse the referenced files for to create tabular data models for an individual table, or all the tables in a group."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table"
        ]
      },
      "rdfs:range": "csvw:Dialect"
    },
    {
      "@id": "csvw:doubleQuote",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "double quote"
      },
      "rdfs:comment": {
        "en": "If true, sets the escape character flag to \". If false, to \\\\."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:encoding",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "encoding"
      },
      "rdfs:comment": {
        "en": "The character encoding for the file, one of the encodings listed in [encoding]. The default is utf-8."
      },
      "rdfs:domain": "csvw:Dialect"
    },
    {
      "@id": "csvw:foreignKeys",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "foreign keys"
      },
      "rdfs:comment": {
        "en": "An array of foreign key definitions that define how the values from specified columns within this table link to rows within this table or other tables."
      },
      "rdfs:domain": "csvw:Schema"
    },
    {
      "@id": "csvw:format",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "format"
      },
      "rdfs:comment": {
        "en": "A definition of the format of the cell, used when parsing the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:header",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "header"
      },
      "rdfs:comment": {
        "en": ""
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:headerColumnCount",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "header column count"
      },
      "rdfs:comment": {
        "en": "The number of header columns (following the skipped columns) in each row. The default is 0.\n"
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:headerRowCount",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "header row count"
      },
      "rdfs:comment": {
        "en": "The number of header rows (following the skipped rows) in the file. The default is 1."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:language",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "language"
      },
      "rdfs:comment": {
        "en": "A language code as defined by [BCP47]. Indicates the language of the value within the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:length",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "length"
      },
      "rdfs:comment": {
        "en": "The exact length of the value of the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      },
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:lineTerminator",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "line terminator"
      },
      "rdfs:comment": {
        "en": "The character that is used at the end of a row. The default is CRLF."
      },
      "rdfs:domain": "csvw:Dialect"
    },
    {
      "@id": "csvw:maxExclusive",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "max exclusive"
      },
      "rdfs:comment": {
        "en": "The maximum value for the cell (exclusive)."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:maxInclusive",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "max inclusive"
      },
      "rdfs:comment": {
        "en": "The maximum value for the cell (inclusive). "
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:maxLength",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "max length"
      },
      "rdfs:comment": {
        "en": "The maximum length of the value of the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      },
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:minExclusive",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "min exclusive"
      },
      "rdfs:comment": {
        "en": "The minimum value for the cell (exclusive)."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:minInclusive",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "min inclusive"
      },
      "rdfs:comment": {
        "en": "The minimum value for the cell (inclusive)."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:minLength",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "min length"
      },
      "rdfs:comment": {
        "en": "The minimum length of the value of the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      },
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:name",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "name"
      },
      "rdfs:comment": {
        "en": "An atomic property that gives a canonical name for the column. This must be a string. Conversion specifications must use this property as the basis for the names of properties/elements/attributes in the results of conversions."
      },
      "rdfs:domain": "csvw:Column"
    },
    {
      "@id": "csvw:notes",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "notes"
      },
      "rdfs:comment": {
        "en": "An array of objects representing annotations. This specification does not place any constraints on the structure of these objects."
      },
      "rdfs:domain": "csvw:Table"
    },
    {
      "@id": "csvw:null",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "null"
      },
      "rdfs:comment": {
        "en": "The string used for null values. If not specified, the default for this is the empty string."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:predicateUrl",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "predicate URL"
      },
      "rdfs:comment": {
        "en": "An atomic property that holds one or more URIs that may be used as URIs for predicates if the table is mapped to another format."
      },
      "rdfs:domain": "csvw:Column",
      "rdfs:range": "xsd:anyURI"
    },
    {
      "@id": "csvw:primaryKey",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "primary key"
      },
      "rdfs:comment": {
        "en": "A column reference property that holds either a single reference to a column description object or an array of references."
      },
      "rdfs:domain": "csvw:Schema"
    },
    {
      "@id": "csvw:quoteChar",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "quote char"
      },
      "rdfs:comment": {
        "en": "The character that is used around escaped cells."
      },
      "rdfs:domain": "csvw:Dialect"
    },
    {
      "@id": "csvw:required",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "required"
      },
      "rdfs:comment": {
        "en": "A boolean value which indicates whether every cell within the column must have a non-null value."
      },
      "rdfs:domain": "csvw:Column",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:resources",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "resources"
      },
      "rdfs:comment": {
        "en": "An array of table descriptions for the tables in the group."
      },
      "rdfs:domain": "csvw:TableGroup",
      "rdfs:range": "csvw:Table"
    },
    {
      "@id": "csvw:row",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "row"
      },
      "rdfs:comment": {
        "en": "Relates a Table to each Row output."
      },
      "rdfs:subPropertyOf": "rdfs:member",
      "rdfs:domain": "csvw:Table"
    },
    {
      "@id": "csvw:schema",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "schema"
      },
      "rdfs:comment": {
        "en": "An object property that provides a schema description for an individual table, or all the tables in a group."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table"
        ]
      },
      "rdfs:range": "csvw:Schema"
    },
    {
      "@id": "csvw:separator",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "separator"
      },
      "rdfs:comment": {
        "en": "The character used to separate items in the string value of the cell."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:skipBlankRows",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "skip blank rows"
      },
      "rdfs:comment": {
        "en": "Indicates whether to ignore wholly empty rows (ie rows in which all the cells are empty). The default is false."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:skipColumns",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "skip columns"
      },
      "rdfs:comment": {
        "en": "The number of columns to skip at the beginning of each row, before any header columns. The default is 0."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:skipInitialSpace",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "skip initial space"
      },
      "rdfs:comment": {
        "en": "If true, sets the trim flag to \"start\". If false, to false."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:skipRows",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "skip rows"
      },
      "rdfs:comment": {
        "en": "The number of rows to skip at the beginning of the file, before a header row or tabular data."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:nonNegativeInteger"
    },
    {
      "@id": "csvw:source",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "source"
      },
      "rdfs:comment": {
        "en": "The format to which the tabular data should be transformed prior to the transformation using the template. If the value is \"json\", the tabular data should first be transformed first to JSON based on the simple mapping defined in Generating JSON from Tabular Data on the Web. If the value is \"rdf\", it should similarly first be transformed to XML based on the simple mapping defined in Generating RDF from Tabular Data on the Web."
      },
      "rdfs:domain": "csvw:Template"
    },
    {
      "@id": "csvw:table",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "table"
      },
      "rdfs:comment": {
        "en": "Relates an Table group to annotated tables. (Note, this is different from csvw:resources, which relates metadata, rather than resulting annotated table descriptions."
      },
      "rdfs:subPropertyOf": "rdfs:member",
      "rdfs:domain": "csvw:TableGroup",
      "rdfs:range": "csvw:Table"
    },
    {
      "@id": "csvw:table-direction",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "table direction"
      },
      "rdfs:comment": {
        "en": "One of csvw:rtl csvw:ltr or csvw:default. Indicates whether the tables in the group should be displayed with the first column on the right, on the left, or based on the first character in the table that has a specific direction. "
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table"
        ]
      },
      "rdfs:range": "csvw:Direction"
    },
    {
      "@id": "csvw:targetFormat",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "target format"
      },
      "rdfs:comment": {
        "en": "A URL for the format that will be created through the transformation. If one has been defined, this should be a URL for a media type, in the form http://www.iana.org/assignments/media-types/media-type such as http://www.iana.org/assignments/media-types/text/calendar. Otherwise, it can be any URL that describes the target format."
      },
      "rdfs:domain": "csvw:Template"
    },
    {
      "@id": "csvw:templateFormat",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "template format"
      },
      "rdfs:comment": {
        "en": "A URL for the format that is used by the template. If one has been defined, this should be a URL for a media type, in the form http://www.iana.org/assignments/media-types/media-type such as http://www.iana.org/assignments/media-types/application/javascript. Otherwise, it can be any URL that describes the template format."
      },
      "rdfs:domain": "csvw:Template"
    },
    {
      "@id": "csvw:templates",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "templates"
      },
      "rdfs:comment": {
        "en": "An array of template specifications that provide mechanisms to transform the tabular data into other formats. "
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table"
        ]
      },
      "rdfs:range": "csvw:Template"
    },
    {
      "@id": "csvw:text-direction",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "text direction"
      },
      "rdfs:comment": {
        "en": "One of csvw:rtl or csvw:ltr. Indicates whether the text within cells should be displayed by default as left-to-right or right-to-left text. "
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:TableGroup",
          "csvw:Table",
          "csvw:Schema",
          "csvw:Column"
        ]
      },
      "rdfs:range": "csvw:Direction"
    },
    {
      "@id": "csvw:title",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "title"
      },
      "rdfs:comment": {
        "en": "For a Template: A natural language property that describes the format that will be generated from the transformation. This is useful if the target format is a generic format (such as application/json) and the transformation is creating a specific profile of that format.\n\nFor a Column: A natural language property that provides possible alternative names for the column."
      },
      "rdfs:domain": {
        "owl:unionOf": [
          "csvw:Template",
          "csvw:Column"
        ]
      }
    },
    {
      "@id": "csvw:trim",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "trim"
      },
      "rdfs:comment": {
        "en": "Indicates whether to trim whitespace around cells; may be true, false, start or end. The default is false."
      },
      "rdfs:domain": "csvw:Dialect",
      "rdfs:range": "xsd:boolean"
    },
    {
      "@id": "csvw:uriTemplate",
      "@type": "rdf:Property",
      "rdfs:label": {
        "en": "uri template"
      },
      "rdfs:comment": {
        "en": "A URI template property that may be used to create a unique identifier for each row when mapping data to other formats."
      },
      "rdfs:domain": "csvw:Schema"
    }
  ],
  "rdfs_datatypes": [
    {
      "@id": "csvw:json",
      "@type": "rdfs:Datatype",
      "rdfs:label": {
        "en": "json"
      },
      "rdfs:comment": {
        "en": "A literal containing JSON."
      },
      "rdfs:subClassOf": "rdfs:Literal"
    }
  ],
  "rdfs_instances": [
    {
      "@id": "csvw:ltr",
      "@type": "Direction",
      "rdfs:label": {
        "en": "left to right"
      },
      "rdfs:comment": {
        "en": "Indicates text should be processed left to right."
      }
    },
    {
      "@id": "csvw:rtl",
      "@type": "Direction",
      "rdfs:label": {
        "en": "right to left"
      },
      "rdfs:comment": {
        "en": "Indiects text should be processed right to left"
      }
    }
  ]
}

Abstract

Status of This Document

Table of Contents

1. Introduction

2. Processing Tables This section describes how particular types of applications should use the metadata supplied about a CSV file when they process that CSV file. 2.1 2. Annotating Tables

2.1.1 2.1 Direct Annotations

2.1.2 2.2 Inherited Properties

3.1 Property Syntax

3.2 Top-Level Properties

3.3 3.2.1 Common Properties Importing Metadata

dates in the format YYYY-MM-DD 3.3 Common Properties

3.4 Table Groups

3.4.1 Required Properties

3.4.2 Optional Properties

text-direction One of "rtl" or "ltr" (the default). Indicates whether the text within cells should be displayed by default as left-to-right or right-to-left text. See section 2.2.1 3.5 Bidirectional Tables for more details. 3.4 Tables

3.4.1 3.5.1 Required Properties

3.4.2 3.5.2 Optional Properties

3.6 Dialect Descriptions

3.7 Template Specifications

3.7.1 Required Properties

3.7.2 Optional Properties

3.7.3 Example

3.5 3.8 Schemas

created 3.8.1 Examples creator

3.8.1.1 Foreign Key Reference Between Resources

3.8.1.2 Foreign Key Reference Between Schemas

3.6 3.9 Columns

3.6.1 3.9.1 Required Properties

3.6.2 3.9.2 Optional Properties

3.8.3 3.10 Inherited Properties

3.8.4 3.11 Datatypes

3.8.4.1 3.11.1 Length Constraints

3.8.4.2 3.11.2 Value Constraints

3.8.5 3.12 Parsing cells

3.8.5.1 3.12.1 Formats for strings

3.8.5.2 3.12.2 Formats for numeric types

3.8.5.3 3.12.3 Formats for booleans

3.8.5.4 3.12.4 Formats for dates and times

3.8.5.5 3.12.5 Formats for durations

3.8.6 4. Additional Constraints Processing Tables

4.1 Displaying Tables

contain any 4.1.1 Bidirectional Tables

4.2 Validating Tables

4.3 Converting Tables

A. Acknowledgements

B. IANA Considerations

B.1 Registration of application/csvm+json

C. JSON-LD Context Security Considerations

D. JSON-LD Context

E. References

D.1 E.1 Normative references

E.2 Informative references

B.1 Registration of `application/csvm+json`