Linked Data Glossary

Abstract

This document is a glossary of terms defined and used to describe Linked Data, and its associated vocabularies and Best Practices. This document published by the W3C Government Linked Data Working Group as a Working Group Note, is intended to help information management professionals, Web developers, scientists and the general public better understand publishing structured data using Linked Data Principles.

Status of This Document

This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

This document was published by the Government Linked Data Working Group as a Working Group Note. If you wish to make comments regarding this document, please send them to public-gld-comments@w3.org (subscribe, archives). All comments are welcome.

Publication as a Working Group Note does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

1. 5 Star Linked Open Data

5 Star Linked Open Data refers to an incremental framework for deploying data. Tim Berners-Lee, the inventor of the Web and initiator of the Linked Data project, suggested a 5 star deployment scheme for Linked Open Data. The 5 Star Linked Data system is cumulative. Each additional star presumes the data meets the criteria of the previous step(s). 5 Star Linked Open Data includes an Open License (expression of rights) and assumes publications on the public Web.

Organizations may elect to publish 5 Star Linked Data, without the word "open", implying that the data does not include an Open License (expression of rights) and does not imply publication on the public Web.

☆ Publish data on the Web in any format (e.g., PDF, JPEG) accompanied by an explicit Open License (expression of rights).

☆☆ Publish structured data on the Web in a machine-readable format (e.g., XML).

☆☆☆ Publish structured data on the Web in a documented, non-proprietary data format (e.g., CSV, KML).

☆☆☆☆ Publish structured data on the Web as RDF (eg Turtle, RDFa, JSON-LD, SPARQL)

☆☆☆☆☆ In your RDF, have the identifiers be links (URLs) to useful data sources.

52. Linked Data Principles

API

Linked Data

Use URIs to name things;

Use HTTP URIs so that things can be referred to and looked up ("dereferenced") by people and user agents;

When someone looks up a URI, provide useful information, using the open Web standards such as RDF, SPARQL;

Include links to other related things using their URIs when publishing on the Web.

58. Machine Readable Data

Data formats that may be readily parsed by computer programs without access to proprietary libraries. For example, CSV, TSV and RDF formats are machine readable, but PDF and Microsoft Excel are not. Creating and publishing data following Linked Data principles helps search engines and humans to find, access and re-use data. Once information is found, computer programs can re-use data without the need for custom scripts to manipulate the content.

Publishing machine readable data using Linked Data principles provides a human and machine readable version. For example, Wikipedia includes a Web page about the color Red. DBpedia, the database containing structured content contained in Wikipedia, allows a Linked Data client to look up "Red" [http://wikipedia.org/wiki/Red] by changing "wiki" to "data" and appending the appropriate file extension.

$ curl -L http://dbpedia.org/data/Red.ttl

72. Persistent Identifier Scheme

A persistent identifier scheme is a mechanmism for resolution of virtual resources. Persistent Uniform Resource Locator (PURLs) implement one form of persistent identifier for virtual resources. PURLs are valid URLs and their components must map to the URL specification. The scheme part tells a computer program, such as a Web browser, which protocol to use when resolving the address. The scheme used for PURLs is generally HTTP. Other persistent identifier schemes include Digital Object Identifiers (DOIs), Life Sciences Identifiers (LSIDs) and INFO URIs. All persistent identification schemes provide unique identifiers for (possibly changing) virtual resources, but not all schemes provide curation opportunities.

114. Uniform Resource Identifier

A global identifier standardized by joint action of the World Wide Web Consortium and Internet Engineering Task Force. A Uniform Resource Identifier (URI) may or may not be resolvable on the Web. URIs play a key role in enabling Linked Data. URIs can be used to uniquely identify virtually anything including a physical building or more abstract concepts such as colors. See also Internationalized Resource Identifier (IRI) and Uniform Resource Locator (URL). See also Uniform Resource Identifier (URI): Generic Syntax [RFC3986] and http://www.w3.org/DesignIssues/Architecture.html.

URIs have been known by many names: Web addresses, Universal Document Identifiers, Universal Resource Identifiers. If you are interested in the history of the many names, read Tim Berners-Lee's design document Web Architecture from 50,000 feet. See also Uniform Resource Identifier (URI): Generic Syntax [RFC3986].

A. Acknowledgments

The editors are grateful to David Wood for contributing the initial glossary terms from Linking Government Data, (Springer 2011). The editors wish to also thank members of the Government Linked Data Working Group with special thanks to the reviewers and contributors: Thomas Baker, Hadley Beeman, Richard Cyganiak, Michael Hausenblas, Sandro Hawke, Benedikt Kaempgen, James McKinney, Marios Meimaris, Jindrich Mynarz and Dave Reynolds who diligently iterated the W3C Linked Data Glossary in order to create a foundation of terms upon which to discuss and better describe the Web of Data. Thank you!

B. References

B.1 Normative references

[OWL2]: OWL 2 Web Ontology Language Document Overview, W3C OWL Working Group, 27 October 2009. W3C Recommendation. URL: http://www.w3.org/TR/owl2-overview/
[RDF]: RDF/XML Syntax Specification (Revised), Dave Beckett (eds), 10 February 2004, W3C Recommendation. URL: http://www.w3.org/TR/REC-rdf-syntax/
[RDF-CONCEPTS]: Resource Description Framework (RDF): Concepts and Abstract Syntax, Graham Klyne, Jeremy J. Carroll, 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/
[RDF11-CONCEPTS]: RDF 1.1: Concepts and Abstract Syntax, Richard Cyganiak, David Wood, 15 January 2013. W3C Recommendation. URL: http://www.w3.org/TR/rdf11-concepts
[RDFS]: RDF Vocabulary Description Language 1.0: RDF Schema,ed. Dan Brickley, R.V. Guha, 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-schema/
[RDFa-PRIMER]: RDFa Primer, Ben Adida, Ivan Herman, Manu Sporny, 07 June 2012. W3C Note. URL: http://www.w3.org/TR/2012/NOTE-rdfa-primer-20120607/
[RFC2616]: Hypertext Transfer Protocol -- HTTP/1.1, R. Fielding; et al. June 1999. Internet RFC 2616. URL: http://www.w3.org/Protocols/rfc2616/rfc2616.html.
[RFC3986]: Uniform Resource Identifier (URI): Generic Syntax, Berners-Lee, et al. January 2005. Internet RFC 3986. URL: http://tools.ietf.org/html/rfc3986.
[RFC4627]: The application/json Media Type for JavaScript Object Notation (JSON), D. Crockford, July 2006. Network Working Group. URL: http://www.ietf.org/rfc/rfc4627.txt
[SKOS-REFERENCE]: SKOS: Simple Knowledge Organization System Reference, Sean Bechhofer, Alistair Miles (eds), 18 August 2009, W3C Recommendation. URL: http://www.w3.org/TR/2009/REC-skos-reference-20090818/
[SPARQL-11]: SPARQL 1.1 Overview,The W3C SPARQL Working Group, 21 March 20113. W3C Recommendation. URL: http://www.w3.org/TR/sparql11-overview/
[TURTLE-TR]: Turtle: Terse RDF Triple Language,Eric Prud'hommeaux, Gavin Carothers, 19 February 2013. W3C Candidate Recommendation. URL: http://www.w3.org/TR/turtle/
[XHTML1]: XHTML 1.0 The Extensible HyperText Markup Language (Second Edition), Steven Pemberton, Daniel Auster, et al., 26 January 2000. W3C Recommendation. URL: http://www.w3.org/TR/xhtml1/
[XML]: Extensible Markup Language (XML) 1.0 (Fifth Edition), Tim Bray, Jean Paoli, C.M. Sperberg-McQueen, Eve Maler, François Yergeau, 26 November 2008. W3C Recommendation. URL: http://www.w3.org/TR/REC-xml/
[XMLS-SCHEMA0]: XML Schema Part 0: Primer Second Edition, David C. Fallside, Priscilla Walmsley (eds), 28 October 2004, W3C Recommendation. URL: http://www.w3.org/TR/xmlschema-0/

B.2 Informative references

[COOL-SWURIS]: Cool URIs for the Semantic Web, L. Sauermann and R. Cyganiak, W3C Interest Group Note 03 December 2008. URL: http://www.w3.org/TR/cooluris/
[HOWTO-LODP]: Linked Data: Evolving the Web into a Global Data Space. 2011, Chris Bizer, Tom Health URL: http://linkeddatabook.com/editions/1.0/
[JSON-LD]: JSON-LD Syntax 1.0, Many Sporny, Gregg Kellogg, Markus Lanthaler (eds), 12 July 2012, W3C Working Draft. URL: http://www.w3.org/TR/json-ld-syntax/
[LD-FOR-DEVELOPERS]: Linked Data: Structured Data on the Web. David Wood, Marsh Zaidman, Luke Ruth, with Michael Hausenblas; 2013 URL: http://www.manning.com/dwood/
[LDP-ONE]: Linked Data Platform 1.0. Steve Speicher, John Arwe. 07 March 2013. W3C Working Draft, Linked Data Platform Working Group. URL: http://www.w3.org/TR/ldp/
[N3]: Notation3 (N3): A readable RDF syntax, Tim Berners-Lee, Dan Connolly, 28 March 2011. W3C Team Submission. URL: http://www.w3.org/TeamSubmission/n3/
[VOID-GUIDE]: Describing Linked Datasets with the VoID Vocabulary, K. Alexander, R. Cyganiak, M. Hausenblas, and J. Zhao, W3C Interest Group Note 03 March 2011. URL: http://www.w3.org/TR/void/

Linked Data Glossary

W3C Working Group Note 27 June 2013

Abstract

Status of This Document

Table of Contents

Scope

1. 5 Star Linked Open Data

2. Apache License

3. API

4. CC-BY-SA License

5. Closed World

6. Connection

7. Conneg

8. Content Negotiation

9. Controlled Vocabulary

10. Comma Separated Values (CSV)

11. Creative Commons Licenses

12. CURIEs

13. cURL

14. Data Cloud

15. Data Hub, The

16. Data Market

17. Data Modeling

18. Dataset, RDF

19. Data Warehouse

20. DBpedia

21. Dereferenceable URIs

22. Description Logic

23. DCAT

24. DCMI

25. Directed Graph

26. Document Type Definition

27. Domain Name System (DNS)

28. Dublin Core Metadata Element Set

29. Dublin Core Metadata Initiative

30. Dublin Core Metadata Terms

31. Entity

32. ETL

33. FOAF

34. Fragment Identifier

35. Free/Libre/Open Source Software

36. Friend of a Friend

37. Government Open Data

38. Graph

39. HyperText Markup Language

40. HyperText Transfer Protocol

41. HTTP URIs

42. Inference

43. International Organization for Standards (ISO)

44. Internet Engineering Task Force (IETF)

45. Internationalized Resource Identifier

46. JSON

47. JSON-LD

48. Linked Data

49. Linked Data API

50. Linked Data client

51. Linked Data Platform

52. Linked Data Principles

53. Linked Open Data

54. Linked Open Data Cloud

55. Linked Open Data Cloud diagram

56. Linking Open Data Project

57. Linkset

58. Machine Readable Data

59. Message

60. Metadata

61. Metadata Object Description Schema

62. Modeling Process

63. N3

64. Namespace IRI

65. Natural Keys

66. Neutral URI

67. N-Triples

68. Object

69. Ontology

70. Open Government Data

71. ORG Ontology

72. Persistent Identifier Scheme

73. Persistent Uniform Resource Locator

74. Predicate