ISSUE-239: machine-readable standardized data formats - serialization data formats - dataset formats

Laufer

machine-readable standardized data formats - serialization data formats - dataset formats

State:
CLOSED
Product:
Best practices document(s)
Raised by:
Carlos Laufer
Opened on:
2016-02-17
Description:
In Best Practice 14, "Use machine-readable standardized data formats", the term data format is used to define the serialization format of a dataset distribution.

The example uses GTFS (https://developers.google.com/transit/gtfs/reference), a standard way of distributing timetables. We have here two standards: GTFS (structure and serialization) and CSV (serialization). GTFS is distributed as a set of CSV files embedded in a single .zip style file.

The previous BP examples use timetables but it is not explicit if it was a GTFS feed. It could be any format and it seems that it is a single file containing all the information, distributed in different formats as csv, json, ttl, etc. But GTFS is a standard way of defining more that the serialization format (a set of csv files). It defines the structure and the meaning of data (a set of specific named files and a vocabulary).

Serialization standardized data formats has a semantic related to how a machine understand the meta-model of the different ways of distributing data, the data itself is inside this pack. This data could use a standard: a vocabulary or a more complex structure of distribution, as GTFS, for example, and so on.

I think this difference should be clear in the document. Maybe it will be interesting to have a BP talking about things like GTFS. I cannot see a BP that talks about this: using standards for publishing datasets for specific domains or applications.






Related Actions Items:
No related actions
Related emails:
  1. ISSUE-239: machine-readable standardized data formats - serialization data formats - dataset formats (from bfl@cin.ufpe.br on 2016-03-10)
  2. ISSUE-242: APIs on the Web (for publishing Data on the Web) Best Practices (from bfl@cin.ufpe.br on 2016-03-10)
  3. dwbp-ISSUE-239 (Laufer): machine-readable standardized data formats - serialization data formats - dataset formats [Best practices document(s)] (from sysbot+tracker@w3.org on 2016-02-17)

Related notes:

No additional notes.

Display change log ATOM feed


Chair, Staff Contact
Tracker: documentation, (configuration for this group), originally developed by Dean Jackson, is developed and maintained by the Systems Team <w3t-sys@w3.org>.
$Id: 239.html,v 1.1 2017/02/13 15:26:32 ted Exp $