Metadata Vocabulary for Tabular Data

Abstract

Validation, conversion, display and search of tabular data on the web requires additional metadata that describes how the data should be interpreted. This document defines a vocabulary for metadata that annotates tabular data. This can be used to provide metadata at various levels, from collections of data from CSV documents and how they relate to each other down to individual cells within a table.

3. Metadata Format

This section defines a set of properties and permitted values for annotating tabular data, and how these annotations should be interpreted by applications.

Issue 4

We intend to support metadata for packages. In this version of this specification, we are scoping to single metadata files defining single CSV files.

3.1 Syntax

A metadata document is a JSON document which holds an object at the top level. This object is a description object of a table. A description object is a JSON object that describes a component of a table (a table, a column, a row or a cell) and has one or more properties are mapped into properties on that component. There are different types of properties on description objects:

link properties

These hold one or more references to other resources by URL. Their values may be:

strings — resolved as URLs against the base URL
arrays — lists of strings which are resolved as URLs against the base URL

For example, the hasVersion property is a link property. A table description might contain:

Example 4

"hasVersion": "example-2014-01-03.csv"

in which case the hasVersion property on the table would have a single value, a link to example-2014-01-03.csv, or it might contain:

Example 5

"hasVersion": [
  "example-2014-01-03.csv",
  "example-2014-01-17.csv",
  "example-2014-01-25.csv"
]

in which case the hasVersion property on the table would have three values, links to other versions of the table.

internal reference properties

These hold one or more references to other description objects. The referenced description object must have an @id property whose value looks like _:name. Internal reference properties can then reference other description objects through values that are:

strings — in the format _:name which MUST match the @id on another description object within the metadata document
arrays — lists of strings as above

For example, the primaryKey property is an internal reference property on the schema. It has to hold references to columns defined elsewhere in the schema, and the descriptions of those columns must have @id properties. It can hold a single reference, like this:

Example 6

"schema": {
  "columns": [{
    "@id": "_:GID",
    "name": "GID"
  }, ... ],
  "primaryKey": "_:GID"
}

or it can contain an array of references, like this:

Example 7

"schema": {
  "columns": [{
    "@id": "_:givenName",
    "name": "givenName"
  }, {
    "@id": "_:familyName",
    "name": "familyName"
  }, ... ],
  "primaryKey": [ "_:givenName", "_:familyName" ]
}

object properties

These hold one or more objects or references to objects by URL. Their values may be:

strings — resolved as URLs against the base URL
objects — interpreted as structured objects
arrays — lists of strings and/or objects

Object properties are often used when the values can be or should be values within controlled vocabularies, or structured information which may be held elsewhere. For example, the creator of a table is an object property. It could be provided as a URL that indicates the creator, like this:

Example 8

"creator": "http://ons.gov.uk"

or a structured object, like this:

Example 9

"creator": {
  "name": "Office of National Statistics",
  "url": "http://ons.gov.uk",
  "email": "info@ons.gsi.gov.uk"
}

natural language properties

These hold natural language strings. Their values may be:

strings — interpreted as natural language strings in the default language
arrays — interpreted as alternative natural language strings in the default language
objects whose properties MUST be language codes as defined by [RFC3066] and whose values are either strings or arrays, providing natural language strings in that language

Natural language properties are used for things like descriptions and titles. For example, the title property provides a natural language label for a column. If it's a plain string like this:

Example 10

"title": "Project title"

then that string is assumed to be in the language provided through the @language property of the nearest @context (or have no assumed language, if there is no such property). Multiple alternative values can be given in an array:

Example 11

"title": [
  "Project title",
  "Project"
]

It's also possible to provide multiple values in different languages, using an object structure. For example:

Example 12

"title": {
  "en": "Project title",
  "fr": "Titre du projet"
}

Issue 5

We invite comment on whether it would be useful to enable some markup in natural language strings, for example by stating that they are interpreted as HTML or Markdown.

atomic properties

These hold atomic values. Their values may be:

numbers — interpreted as integers or doubles
booleans — interpreted as booleans (true or false)
strings — interpreted as defined by the property
arrays — lists of numbers, booleans or strings

Note

JSON does not have date or time types. Where a property takes a date as a value, this MUST be a string in the format YYYY-MM-DD.

3.2 Top-Level Properties

The top-level object MAY have a @context property. This holds an object that provides metadata for interpreting other properties, namely:

@language: indicates the default language for the values of properties in the description; if present, its value MUST be a language code [RFC3066] which is the default language for the values of other properties in the metadata document

Note
Note that the @language property of the @context object, which gives the default language used within the metadata file, is distinct from the language property on a description object, which gives the language used in the data within the table.
@base: indicates the base URL against which other URLs within the description are resolved; if present, its value MUST be a URL which is resolved against the base URL of the metadata document (the location from which it was retrieved) to provide the base URL for other URLs in the metadata document

Note
Note that the @base property of the @context object provides the base URL used for URLs within the metadata document, not the URLs that appear within the table.

3.3 Common Properties

The properties listed here may be applied to any structure within the tabular data model: tables, columns, rows or cells.

Issue 6

We invite comment on whether there are other standard metadata vocabularies that should be reused within this specification.

3.3.1 Dublin Core Terms

Descriptions MAY contain any properties defined by [DC-TERMS] to describe the table. This specification does not define any application behaviour associated with these properties being present, except that validation of metadata files MUST check that, if they are present, they adhere to the syntax defined here.

Property	Type	Details
`abstract`	natural language property
`accessRights`	object property
`accrualMethod`	object property
`accrualPeriodicity`	object property
`accrualPolicy`	object property
`alternative`	natural language property
`audience`	object property
`available`	atomic property	dates in the format `YYYY-MM-DD`
`bibliographicCitation`	natural language property
`conformsTo`	object property
`contributor`	object property
`coverage`	object property
`created`	atomic property	dates in the format `YYYY-MM-DD`
`creator`	object property
`date`	atomic property	dates in the format `YYYY-MM-DD`
`dateAccepted`	atomic property	dates in the format `YYYY-MM-DD`
`dateCopyrighted`	atomic property	dates in the format `YYYY-MM-DD`
`dateSubmitted`	atomic property	dates in the format `YYYY-MM-DD`
`description`	natural language property
`educationLevel`	object property
`extent`	object property
`format`	object property
`hasFormat`	object property
`hasPart`	link property
`hasVersion`	link property
`identifier`	atomic property	a URL
`instructionalMethod`	object property
`isFormatOf`	link property
`isPartOf`	link property
`isReferencedBy`	link property
`isReplacedBy`	link property
`isRequiredBy`	link property
`issued`	atomic property	dates in the format `YYYY-MM-DD`
`isVersionOf`	link property
`language`	atomic property	a language code as defined by [RFC3066]; this is an inherited property
`license`	object property
`mediator`	object property
`medium`	object property
`modified`	atomic property	dates in the format `YYYY-MM-DD`
`provenance`	object property
`publisher`	object property
`references`	link property
`relation`	link property
`replaces`	link property
`requires`	link property
`rights`	object property
`rightsHolder`	object property
`source`	link property
`spatial`	object property
`subject`	object property
`tableOfContents`	natural language property
`temporal`	object property
`title`	natural language property
`type`	object property
`valid`	atomic property	dates in the format `YYYY-MM-DD`

3.3.2 Links

Description MAY include properties for registered link relations, prefixed by link:. This specification does not define any application behaviour associated with these properties being present, except that validation of metadata files MUST check that, if they are present, they have values that are URLs or arrays of URLs. The following properties are particularly relevant to tabular data:

link:alternate
link:canonical
link:collection
link:duplicate
link:glossary
link:help
link:icon
link:last
link:latest-version
link:next
link:original
link:predecessor-version
link:prev or link:previous
link:preview
link:profile
link:related
link:search
link:self
link:start
link:successor-version
link:terms-of-service
link:up
link:version-history
link:working-copy
link:working-copy-of

Note

Unlike the Dublin Core terms, link relations are an ever-expanding list and there may eventually be clashes between link relation terms and those defined above. That's why the above list uses QNames for all link relations, so that they look like link:relation rather than plain relation.

3.3.3 Other Properties

text-direction: One of "rtl" or "ltr" (the default). Indicates whether the text within cells should be displayed by default as left-to-right or right-to-left text. See section 2.2.1 Bidirectional Tables for more details.

3.4 Tables

A table description is a JSON object that describes a table within a CSV file.

Issue 7

A CSV file might not be the same as the table that it contains. For example, a given CSV file might contain two tables (in different regions of the CSV file), or might contain a table that isn't positioned at the top left of the CSV file. We invite comment about whether we should assume that pre-processing is used to extract tables where there isn't a 1:1 correspondence between CSV file and table, or not.

3.4.1 Required Properties

@id: This gives the URL of the CSV file that the table is held in, relative to the location of the metadata document.

3.4.2 Optional Properties

The description of a table MAY also contain:

@type: If included, @type MUST be set to "Table". Publishers MAY include this to provide additional information to JSON-LD based toolchains.
table-direction: One of "rtl", "ltr" or "default". Indicates whether the table should be displayed with the first column on the right, on the left, or based on the first character in the table that has a specific direction. See section 2.2.1 Bidirectional Tables for more details.

Issue 8
This should be a defined controlled vocabulary in JSON-LD, so that the values map on to URIs in the RDF version rather than strings. We invite comment on how to configure the JSON-LD context to enable these values to be interpreted in this way.
schema: An object property that provides a schema description as described in section 3.5 Schemas. This may be provided as an embedded object within the JSON metadata or as a URL reference to a separate JSON schema document.
notes: An object property, usually an array, of annotation objects. An annotation object is an object that holds general annotations about a particular column, row, cell or region of the table. Each annotation object MUST have an @id property that references the relevant column, row, cell or region of the table using a fragment identifier. It MAY have any other common properties as described in section 3.3 Common Properties.

Issue 9

We intend to add a small subset of properties that indicate how a CSV file should be parsed, specifically those that mirror the existing distinction between the media types for text/csv and text/tab-separated-values, and the media type parameters that they allow, namely:

separator to give the character used as the separator in the tabular data file
encoding to specify the encoding used in the file
header to specify whether or not a header line is present

We invite comment about whether these are the right properties to specify.

Issue 10

We invite comment on whether we should include properties that help in checking the integrity of the file: datapackage includes bytes and hash. We could reuse the Subresource Integrity work here.

The description MAY contain any of the properties defined in section 3.3 Common Properties to describe the table. As well as links to other related tables, the following common properties are particularly suitable for tables:

created
creator
description
language
license
modified
provenance
publisher
rights
rightsHolder
source
spatial
subject
temporal
title

3.5 Schemas

A schema is a definition of a tabular format that may be common to multiple tables. For example, multiple tables from different sources may have the same columns and be designed such that they can be aggregated together.

A schema description is a JSON object that encodes the information about a schema. All the properties of a schema description are optional.

@type

If included, @type MUST be set to "Schema". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

columns

An array of column descriptions as described in section 3.6 Columns. These are matched to columns in table that use the schema by position: the first column description in the array applies to the first column in the table, the second to the second and so on.

The name properties of the column descriptions MUST be unique within a given table description.

rows

An array of row descriptions as described in section 3.7 Rows. These are matched to row by the value of the row in the row description. The values of the row properties MUST be unique within a given table description (ie no row can have more than one description).

cells

An array of cell descriptions as described in section 3.8 Cells. These are matched to cell by the value of the row and column properties in the cell description. The combination of values of the row and column properties MUST be unique within a given table description (ie no cell can have more than one description).

primaryKey

An internal reference property that holds either a single references to a column description object or an array of references.

Validators MUST check that each row has a unique combination of cells in the indicated columns. For example, if primaryKey is set to ["_:familyName", "_:givenName"] then every row must have a unique value for the combination of the familyName and givenName columns.

Issue 11

When referencing columns for a primary key, it is a lot clearer to reference them by name rather than by number. For JSON-LD compatibility, we have to assign a blank node identifier to each column even though they each have a name property that could be used instead. We invite comment on how to make this easier for people to use while maintaining JSON-LD compatibility.

The description MAY contain any of the properties defined in section 3.3 Common Properties to describe the schema. As well as links to other related schemas, the following common properties are particularly suitable for schemas:

created
creator
description
license
modified
publisher
rights
rightsHolder
subject
title

The description MAY contain any of the inherited properties defined for cells in section 2.1.2 Inherited Properties.

3.6 Columns

A column description is a simple JSON object that describes a single column. The description provides additional human-readable documentation for a column, as well as additional information that may be used to validate the cells within the column, create a user interface for data entry, or inform conversion into other formats.

3.6.1 Required Properties

name

An atomic property that gives a canonical name for the column. This MUST be a string. Conversion specifications MUST use this property as the basis for the names of properties/elements/attributes in the results of conversions.

Issue 12

We invite comment on what the syntactic limitations should be on column names to make them most useful when used as the basis of conversion into other formats, bearing in mind that different target languages such as JSON, RDF and XML have different syntactic limitations and common naming conventions.

During validation, if there is no title property and the column already has a title annotation then a validator MUST issue a warning if the existing title annotation does not match the name specified in the column description.

3.6.2 Optional Properties

title

A natural language property that provides possible alternative names for the column. The possible column titles are defined as:

if the value of title is a string, that string
if the value of title is an array, the strings in that array
if the value of title is an object, the string or strings that are the value of the property of that object whose name is the column language

where the column language is the value of the language property on the column description, or (if there is no such language), the value of the language property on the table description.

If the column already has a title annotation (because a header row has been included in the original CSV file) then a validator MUST issue a warning if the existing title annotation is not the same as any of the possible column titles.

Note

The facility to specify multiple potential titles for a column is important when the same column description is used for multiple CSVs, through a mechanism yet to be defined by this specification.

@type

If included, @type MUST be set to "Column". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

required

A boolean value which indicates whether every cell within the column must have a non-null value.

The description MAY contain any of the inherited properties defined for cells in section 2.1.2 Inherited Properties.

3.7 Rows

Rows can be described using row description objects. A row description object is a JSON object within a metadata file that includes properties that describe an individual row.

3.7.1 Required Properties

The following properties MUST appear on a row description:

row: an integer; the number of the row the description object describes

3.7.2 Optional Properties

@type: If included, @type MUST be set to "Row". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The description MAY contain any of the inherited properties defined for cells in section 2.1.2 Inherited Properties.

3.8 Cells

Cells can be described using cell description objects. A cell description object is a JSON object within a metadata file that includes properties that describe an individual cell.

3.8.1 Required Properties

The following properties MUST appear on a cell description:

row: an integer; the number of the row on which the cell appears
column: an integer; the number of the column on which the cell appears

3.8.2 Optional Properties

@type: If included, @type MUST be set to "Cell". Publishers MAY include this to provide additional information to JSON-LD based toolchains.

The description MAY contain any of the inherited properties defined for cells in section 2.1.2 Inherited Properties.

3.8.3 Inherited Properties

Cell descriptions may override inherited properties, as described in section 2.1 Annotating Tables. It is good practice to define these properties on columns, so that all cells within a given column are handled in the same way. These properties are:

null: The string used for null values. If not specified, the default for this is the empty string.
separator: The character used to separate items in the string value of the cell. If null, the cell does not contain a list. Otherwise, application MUST split the string value of the cell on the specified separator character and parse each of the resulting strings separately. The cell's value will then be a list. Conversion specifications MUST use the separator to determine the conversion of a cell into the target format. See 3.8.5 Parsing cells for more details.
format: A definition of the format of the cell, used when parsing the cell as described in 3.8.5 Parsing cells.
datatype: The main datatype of the values of the cell. If the cell contains a list (ie separator is not null) then this is the datatype of each value within the list. Conversion specifications MUST use the datatype of the value to determine the conversion of a cell into the target format. See 3.8.4 Datatypes for more details.
length: The exact length of the value of the cell. See section 3.8.4.1 Length Constraints for details.
minLength: The minimum length of the value of the cell. See section 3.8.4.1 Length Constraints for details.
maxLength: The maximum length of the value of the cell. See section 3.8.4.1 Length Constraints for details.
minimum: The minimum value for the cell (inclusive); equivalent to minInclusive. See section 3.8.4.2 Value Constraints for details.
maximum: The maximum value for the cell (inclusive); equivalent to maxInclusive. See section 3.8.4.2 Value Constraints for details.
minInclusive: The minimum value for the cell (inclusive). See section 3.8.4.2 Value Constraints for details.
maxInclusive: The maximum value for the cell (inclusive). See section 3.8.4.2 Value Constraints for details.
minExclusive: The minimum value for the cell (exclusive). See section 3.8.4.2 Value Constraints for details.
maxExclusive: The maximum value for the cell (exclusive). See section 3.8.4.2 Value Constraints for details.

3.8.4 Datatypes

Cells within tables may be annotated with a datatype which indicates the type of the value obtained by parsing the value of the cell. The format expected in the cell is determined by the format annotation, if there is one, or uses a default format determined by the type.

The possible datatypes are:

the datatypes defined in [xmlschema-2] with the exception of those that rely on XML mechanisms for definition, namely:
- anySimpleType
- string; a sub-value of anySimpleType
- normalizedString; a sub-value of string
- token; a sub-value of normalizedString
- language; a sub-value of token
- Name; a sub-value of token
- NCName; a sub-value of Name
- boolean; a sub-value of anySimpleType
- decimal; a sub-value of anySimpleType
- integer; a sub-value of decimal
- nonPositiveInteger; a sub-value of integer
- negativeInteger; a sub-value of nonPositiveInteger
- long; a sub-value of integer
- int; a sub-value of long
- short; a sub-value of int
- byte; a sub-value of short
- nonNegativeInteger; a sub-value of integer
- unsignedLong; a sub-value of nonNegativeInteger
- unsignedInt; a sub-value of unsignedLong
- unsignedShort; a sub-value of unsignedInt
- unsignedByte; a sub-value of unsignedShort
- positiveInteger; a sub-value of nonNegativeInteger
- float; a sub-value of anySimpleType
- double; a sub-value of anySimpleType
- duration; a sub-value of anySimpleType
- dateTime; a sub-value of anySimpleType
- time; a sub-value of anySimpleType
- date; a sub-value of anySimpleType
- gYearMonth; a sub-value of anySimpleType
- gYear; a sub-value of anySimpleType
- gMonthDay; a sub-value of anySimpleType
- gDay; a sub-value of anySimpleType
- gMonth; a sub-value of anySimpleType
- hexBinary; a sub-value of anySimpleType
- base64Binary; a sub-value of anySimpleType
- anyURI; a sub-value of anySimpleType
the datatype number which is exactly equivalent to double
the datatype binary which is exactly equivalent to base64Binary
the datatype datetime which is exactly equivalent to dateTime
the datatype geopoint which indicates a comma-separated longitude and latitude (ie values that after stripping leading and trailing whitespace are in the format longitude\s*,\s*latitude); a sub-value of anySimpleType

Issue 13
In JSON Table Schema, geopoint permits values in JSON representations of points, namely { lon: longitude, lat: latitude} and [longitude, latitude]. We invite comment about whether these types are suitable for CSV files. If they are, we suggest that these additional formats for geopoint are supported through the format property.
the datatype any which is exactly equivalent to anySimpleType

Issue 14

The JSON Table Schema also includes object, array and geojson. We invite comment on whether we should we support the inclusion of JSON-based structures within CSV files.

Issue 15

We invite comment on whether the any type is useful.

Issue 16

We invite comment on whether there should be types for formats like XML, HTML and markdown which may appear within CSV cells.

3.8.4.1 Length Constraints

The length, minLength and maxLength properties indicate the exact, minimum and maximum lengths of the values of cells.

Applications MUST raise an error if both length and minLength are specified and they do not have the same value. Similarly, applications MUST raise an error if both length and maxLength are specified and they do not have the same value. Applications MUST raise an error if length, maxLength or minLength are specified and the cell value is not a list (ie separator is not specified), a string or one of its subtypes, or a binary value.

The length of a value of a cell is determined as follows:

if the cell is null its length is zero
if the value is a list, its length is the number of items in the list
if the value is a string or one of its subtypes, its length is the number of characters in the value
if the value is of a binary type, its length is the number of bytes in the binary value

3.8.4.2 Value Constraints

The minimum, maximum, minInclusive, maxInclusive, minExclusive and maxExclusive properties indicate limits on the values of cells. These apply to numeric and date/time types. The minimum property is equivalent to the minInclusive property and the maximum property is equivalent to the maxInclusive property.

Validation against these properties is as defined in [xmlschema-2].

3.8.5 Parsing cells

Unlike many other data formats, tabular data is designed to be read by humans. For that reason, it's common for data to be represented within tabular data in a human-readable way. The separator and format properties indicates the format used to represent data within the table. This is used:

by validators to check that the data in the table is in the expected format
by converters to parse the values before mapping them into values in the target of the conversion
when displaying data, to map it into formats that are meaningful for those viewing the data (as opposed to those publishing it)
when inputting data, to turn entered values into representations in a consistent format

The process of parsing the string value of a cell into a single value or a list of values is as follows:

unless the datatype is string or anySimpleType or any, strip leading and trailing whitespace from the value
if the value is the same as the null value, then the value is null
if the separator property is not null, create a list of values by splitting the string at the character specified by the separator property
validate the value(s) against the format, if one is specified, as described below; raise an error if any of the values do not match the specified format
parse the value(s) using the format, as described below

3.8.5.1 Formats for strings

If the datatype is a string type, the format property provides a regular expression for the string values, in the syntax defined by [ECMASCRIPT].

Issue 17

We invite comment about which reference to use for regular expression syntax. Other possibilities are to use that defined by XML Schema or XPath.

3.8.5.2 Formats for numeric types

It is not uncommon for numbers within tabular data to be formatted for human consumption, which may involve using commas for decimal points, grouping digits in the number using commas, or adding currency symbols or percent signs to the number.

If the datatype is a numeric type, the format property indicates the expected format for that number. Validators MUST check that the numbers in the column adhere to the specified format. Converters MUST use the format property to parse the number when mapping it into a suitable type in the target language of the conversion.

When the datatype is a numeric type, the format property's value MUST be a number format as specified in [xslt-21].

Issue 18

We invite comment on the best format to specify how to parse numbers.

3.8.5.3 Formats for booleans

Boolean values may be represented in many ways aside from the standard 1 and 0 or true and false.

If the datatype is boolean, the format property provides the true and false values expected, separated by |. For example if format is Y|N then cells must hold either Y or N with Y meaning true and N meaning false.

3.8.5.4 Formats for dates and times

Dates and times are commonly represented in tabular data in formats other than those defined in [xmlschema-2].

If the datatype is a date or time type, the format property indicates the expected format for that date or time. Validators MUST check that the dates or times in the column adhere to the specified format. Converters MUST use the format property to parse the date or time when mapping it into a suitable type in the target language of the conversion.

When the datatype is a date or time type, the format property's value MUST be a date/time format as specified in [xslt-21].

Issue 19

We invite comment on which format to use when parsing dates and times.

3.8.5.5 Formats for durations

Issue 20

We invite comment on whether there are standard formats to use when parsing durations.

3.8.6 Additional Constraints

A set of constraints can be associated with a cell. These constraints can be used to validate data against a JSON Table Schema. The constraints might be used by consumers to validate, for example, the contents of a data package, or as a means to validate data being collected or updated via a data entry interface.

A constraints descriptor is a JSON hash. It MAY contain any of the following keys.

minLength – An integer that specifies the minimum number of characters for a value
maxLength – An integer that specifies the maximum number of characters for a value
unique – A boolean. If true, then all values for that cell MUST be unique within the data file in which it is found. This defines a unique key for a row although a row could potentially have several such keys.
pattern – A regular expression that can be used to test cell values. If the regular expression matches then the value is valid. Values will be treated as a string of characters. It is recommended that values of this cell conform to the standard XML Schema regular expression syntax. See also this reference.
minimum – specifies a minimum value for a cell. This is different to minLength which checks number of characters. A minimum value constraint checks whether a cell value is greater than or equal to the specified value. The range checking depends on the type of the cell. E.g. an integer cell may have a minimum value of 100; a date cell might have a minimum date. If a minimum value constraint is specified then the cell descriptor MUST contain a type key
maximum – as above, but specifies a maximum value for a cell.

A constraints descriptor may contain multiple constraints, in which case a consumer MUST apply all the constraints when determining if a cell value is valid.

A data file, e.g. an entry in a data package, is considered to be valid if all of its cells are valid according to their declared type and constraints.

Abstract

Status of This Document

Table of Contents

1. Introduction

2. Processing Tables

2.1 Annotating Tables

2.1.1 Direct Annotations

2.1.2 Inherited Properties

2.2 Displaying Tables

2.2.1 Bidirectional Tables

2.3 Validating Tables

2.4 Converting Tables

3. Metadata Format

3.1 Syntax

3.2 Top-Level Properties

3.3 Common Properties

3.3.1 Dublin Core Terms

3.3.2 Links

3.3.3 Other Properties

3.4 Tables

3.4.1 Required Properties

3.4.2 Optional Properties

3.5 Schemas

3.6 Columns

3.6.1 Required Properties

3.6.2 Optional Properties

3.7 Rows

3.7.1 Required Properties

3.7.2 Optional Properties

3.8 Cells

3.8.1 Required Properties

3.8.2 Optional Properties

3.8.3 Inherited Properties

3.8.4 Datatypes

3.8.4.1 Length Constraints

3.8.4.2 Value Constraints

3.8.5 Parsing cells

3.8.5.1 Formats for strings

3.8.5.2 Formats for numeric types

3.8.5.3 Formats for booleans

3.8.5.4 Formats for dates and times

3.8.5.5 Formats for durations

3.8.6 Additional Constraints

A. Acknowledgements

B. IANA Considerations

B.1 Registration of application/csvm+json

C. JSON-LD Context

D. References

D.1 Normative references

B.1 Registration of `application/csvm+json`