W3C

Turtle - Terse RDF Triple Language

Editor's Draft January 2010

This version:
http://www.w3.org/2010/01/Turtle/
Previous version:
http://www.w3.org/TeamSubmission/2008/SUBM-turtle-20080114/
Latest version:
http://www.w3.org/TeamSubmission/turtle/
Authors:
David Beckett
Tim Berners-Lee W3C
Eric Prud'hommeaux W3C

Abstract

The Resource Description Framework (RDF) is a general-purpose language for representing information in the Web.

This document defines a textual syntax for RDF called Turtle that allows RDF graphs to be completely written in a compact and natural text form, with abbreviations for common usage patterns and datatypes. Turtle provides levels of compatibility with the existing N-Triples and Notation 3 formats as well as the triple pattern syntax of the SPARQL W3C Recommendation.

This document specifies a language that is in common usage under the name "Turtle". It is intended to be compatible with, and a subset of, Notation 3.

Status of this Document

This is a proposed replacement for the W3C Turtle Submission. It has not been endorsed by any formal W3C process or by the members.

The W3C Turtle Submission is at the head of Dave Beckett's Turtle revision chain, and has been serving as a specification since Jan 2004. While there is apparent interop between Turtle parsers (advice to the contrary welcome), more formality may encourage use in e.g. MPEG formats.

The following proposal is intended to address pfps's call for a parsing semantics. It also aligns the Turtle grammar with the SPARQL Grammar where appropriate. Similar grammars have been tested by DanC in his n3 and ntriples parsers. and ericP in his SWObjects Turtle parser which uses the turtleS Yacker grammar. Note the nonexhaustive differences between SPARQL and Turtle.

A set of named tests have been integrated into the document.

After extensive discussion of a registration request, the media type remains text/turtle.


Table of Contents


1. Introduction

This document defines Turtle, the Terse RDF Triple Language, a concrete syntax for RDF as defined in the RDF Concepts and Abstract Syntax ([RDF-CONCEPTS]) W3C Recommendation. Turtle is an extension of N-Triples ([N-TRIPLES]) carefully taking the most useful and appropriate things added from Notation 3 ([NOTATION3]) while staying within the RDF model.

The recommended XML syntax for RDF, RDF/XML ([RDF-XML]) has certain restrictions imposed by XML and the use of XML Namespaces that prevent it encoding all RDF graphs (some predicate URIs are forbidden and XML 1.0 forbids encoding some Unicode codepoints). These restrictions do not apply to Turtle.

Turtle is intended to be compatible with, and a subset of, Notation 3 (see Turtle compared to Notation 3), and is generally usable in systems that support N3.

The Turtle grammar for triples is a subset of the SPARQL Protocol And RDF Query Language (SPARQL) [SPARQLQ] grammar for TriplesBlock. The two grammars share production and terminal names where possible.

2. An Introduction to Turtle (Informative)

This section is informative. The Turtle Syntax and Turtle Grammar sections formally define the language.

A Turtle document allows writing down an RDF graph in a compact textual form. It consists of a sequence of directives, triple-generating statements or blank lines. Comments may be given after a # and continue to the end of the line.

Simple triples are a sequence of (subject, predicate, object) terms, separated by whitespace and terminated by '.' after each triple. This corresponds to N-Triples ([N-TRIPLES]).

There are three types of RDF Term: RDF URI References (URIs for short), literals and blank nodes.

2.1. RDF Terms

URIs are written enclosed in '<' and '>' and may be absolute RDF URI References or relative to the current base URI (described below).

# this is not a complete turtle document
<http://example.org/path/>
<http://example.org/path/#fragment>
</path>
<#fragment>
<>

URIs may also be abbreviated by using Turtle's @prefix directive that allows declaring a short prefix name for a long prefix of repeated URIs. This is useful for many RDF vocabularies that are all defined in nearby namespace URIs, possibly using XML's namespace mechanism that works in a similar fashion.

Once a prefix such as @prefix foo: <http://example.org/ns#> is defined, any mention of a URI later in the document may use a qualified name that starts foo: to stand for the longer URI. So for example, the qualified name foo:bar is a shorthand for the URI http://example.org/ns#bar.

# this is a complete turtle document
@prefix foo: <http://example.org/ns#> .
@prefix : <http://other.example.org/ns#> .
foo:bar foo: : .
:bar : foo:bar .

Literals are written either using double-quotes when they do not contain linebreaks like "simple literal" or """long literal""" when they may contain linebreaks.

# this is not a complete turtle document
"a string"
"""a string"""
"""a string
with newlines
"""

Literals may be given either a language suffix or a datatype URI but not both. Languages are indicated by appending the simple literal with @ and the language tag. Datatype URIs similarly append ^^ followed by any legal URI form (full or qualified) as described above to give the datatype URI.

# this is not a complete turtle document
"chat"
"chat"@en
"chat"@fr
"foo"^^<http://example.org/my/datatype>
"""10"""^^xsd:decimal

Blank nodes are written as _:BLANK_NODE_LABEL to provide a blank node either from the given BLANK_NODE_LABEL. A generated blank node may also be made with [] which is useful to provide the subject of RDF triples for each pair from the predicateObjectList or the root of the collection.

# this is not a complete turtle document
_:me
_:a1234

Literals and URIs may also contain escapes to encode surrounding syntax, non-printable characters and to encode Unicode characters by codepoint number (although they may also be given directly, encoded as UTF-8). The character escapes are:

See the String escapes section for full details.

2.2. Abbreviating URIs

The current base URI may be altered in a Turtle document using the @base directive. It allows further abbreviation of URIs but is usually for simplifying the URIs in the data, where the prefix directives are for vocabularies that describe the data.

Whenever this directive appears, it defines the base URI for which all relative URIs are resolved against. That includes URIs, qualified names, prefix directives as well as later base directives.

# this is a complete turtle document
# In-scope base URI is the document URI at this point
<a1> <b1> <c1> .
@base <http://example.org/ns/> .
# In-scope base URI is http://example.org/ns/ at this point
<a2> <http://example.org/ns/b2> <c2> .
@base <foo/> .
# In-scope base URI is http://example.org/ns/foo/ at this point
<a3> <b3> <c3> .
@prefix : <bar#> .
:a4 :b4 :c4 .
@prefix : <http://example.org/ns2#> .
:a5 :b5 :c5 .

The token a is equivalent to the URI <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>

# this is a complete turtle document
@prefix doc: <http://example.org/#ns> .
<http://example.org/path> a doc:Document .

2.3. Abbreviating common datatypes

Decimal integers may be written directly and correspond to the XML Schema Datatype xsd:integer. in both syntax and datatype URI.

# this is not a complete turtle document
-5
0
1
10
+1
# some long form examples
"-5"^^xsd:integer
"10"^^<http://www.w3.org/2001/XMLSchema#integer>

Decimal floating point double/fixed precision numbers may be written directly and correspond to the XML Schema Datatype xsd:double in both syntax and datatype URI.

# this is not a complete turtle document
1.3e2
10e0
-12.5e10
# some long form examples
"1.3e2"^^xsd:double
"-12.5e10"^^<http://www.w3.org/2001/XMLSchema#double>

Decimal floating point arbitrary precision numbers may be written directly and correspond to the XML Schema Datatype xsd:decimal. in both syntax and datatype URI.

# this is not a complete turtle document
0.0
1.0
1.234567890123456789
-5.0
# some long form examples
"0.0"^^xsd:decimal
"-5.0"^^<http://www.w3.org/2001/XMLSchema#decimal>

Boolean may be written directly as true or false and correspond to the the XML Schema Datatype xsd:boolean in both syntax and datatype URI.

# this is not a complete turtle document
true
false
# same in long form
"true"^^xsd:boolean
"false"^^<http://www.w3.org/2001/XMLSchema#boolean>

2.4. Abbreviating groups of triples

The , symbol may be used to repeat the subject and predicate of triples that only differ in the object RDF term.

# this is not a complete turtle document
:a :b :c ,
      :d .
# the last triple is :a :b :d .

The ; symbol may be used to repeat the subject of of triples that vary only in predicate and object RDF terms.

# this is not a complete turtle document
:a :b :c ;
   :d :e .
# the last triple is :a :d :e .

2.5. Abbreviating RDF Collections

An RDF Collection may be abbreviated using a sequence of RDF Terms enclosed in ( ) brackets. Whitespace may be used to separate them, as usual. This format provides a blank node at the start of RDF Collection which may be used in further abbreviations.

# this is a complete turtle document
@prefix : <http://example.org/foo> .
# the value of this triple is the RDF collection blank node
:subject :predicate ( :a : b : c ) .

# an empty collection value - rdf:nil
:subject :predicate2 () .

See section Collections for the details on the long form of the generated triples.

3 Syntax for IRIs, Literals and Blank Nodes

Turtle is a language for an RDF graph, a set of RDF triples. An RDF graph is composed of URI references (now interpreted as IRIs), literals and blank nodes.

The Turtle syntax for IRIs is identical to that of SPARQL Query, including the use of prefix and base directives, thought these are spelled "@prefix" and "@base" respectively in Turtle. Per RFC3986 section 5.1.1, the parsing begins with a context-defined In-Scope Base URI. Each @base directive sets a new In-Scope Base URI, relative to the previous one. @prefix directives map a local name to an IRI, also resolved against the current In-Scope Base URI. Subsequent @prefix may re-map the same local name.

Turtle IRI syntax, including relative IRI resolution, is defined by SPARQL Query section 4.1.1 (noting the different spellings of the PREFIX and BASE keywords).

Example (test-30.ttl) with document base URI http://www.w3.org/2001/sw/DataAccess/df1/tests/

# In-scope base URI is http://www.w3.org/2001/sw/DataAccess/df1/tests/ at this point
<test-00.ttl> <test-01.ttl> <test-02.ttl> .
@base <http://example.org/ns/> .
# In-scope base URI is http://example.org/ns/ at this point
<a2> <http://example.org/ns/b2> <c2> .
@base <foo/> .
# In-scope base URI is http://example.org/ns/foo/ at this point
<a3> <b3> <c3> .
@prefix : <bar#> .
:a4 :b4 :c4 .
@prefix : <http://example.org/ns2#> .
:a5 :b5 :c5 .

encodes the following N-Triples (test-30.out):

<http://www.w3.org/2001/sw/DataAccess/df1/tests/test-00.ttl> <http://www.w3.org/2001/sw/DataAccess/df1/tests/test-01.ttl> <http://www.w3.org/2001/sw/DataAccess/df1/tests/test-02.ttl> .
<http://example.org/ns/a2> <http://example.org/ns/b2> <http://example.org/ns/c2> .
<http://example.org/ns/foo/a3> <http://example.org/ns/foo/b3> <http://example.org/ns/foo/c3> .
<http://example.org/ns/foo/bar#a4> <http://example.org/ns/foo/bar#b4> <http://example.org/ns/foo/bar#c4> .
<http://example.org/ns2#a5> <http://example.org/ns2#b5> <http://example.org/ns2#c5> .

The Turtle syntax for literals and blank nodes are defined by SPARQL Query section 4.1.2 and SPARQL Query section 4.1.4 respectively.

4. Turtle Grammar

A Turtle document is a Unicode[UNICODE] character string encoded in UTF-8. Unicode codepoints only in the range U+0 to U+10FFFF inclusive are allowed.

4.1 White Space

White space (production ws) is used to separate two tokens which would otherwise be (mis-)recognized as one token.

White space is significant in tokens IRI_REF and string.

4.2 Comments

Comments in Turtle take the form of '#', outside an IRI_REF or strings, and continue to the end of line (marked by characters U+000D or U+000A) or end of file if there is no end of line after the comment marker. Comments are treated as white space.

4.3. String Escapes

Turtle strings and URIs can use \-escape sequences to represent Unicode code points.

The following table describes all the escapes allowed inside a string or IRI_REF:

Escape Unicode code point
'\u' hex hex hex hex A Unicode codepoint in the range U+0 to U+FFFF inclusive corresponding to the encoded hexadecimal value.
'\U' hex hex hex hex hex hex hex hex A Unicode codepoint in the range U+10000 to U+10FFFF inclusive corresponding to the encoded hexadecimal value.
'\t' U+0009
'\n' U+000A
'\r' U+000D
'\"'
(inside string)
U+0022
'\>'
(inside IRI_REF only)
U+003E
'\\' U+005C

where HEX is a hexadecimal character

HEX ::= [0-9] | [A-F] | [a-f]

4.4 Grammar

The EBNF used here is defined in XML 1.0 (Third Edition) [NOTATION]. Production labels consisting of a number and a final 's', e.g. [60s], reference to the production with that number in the SPARQL Query Language for RDF grammar [SPARQLQ].

Turtle - Terse RDF Triple Language EBNF
[1]    turtleDoc    ::=    (statement)*
[2]    statement    ::=    directive "."
| triples "."
[3]    directive    ::=    prefixID
| base
[4]    prefixID    ::=    PREFIX PNAME_NS IRI_REF
[5]    base    ::=    BASE IRI_REF
[6]    triples    ::=    subject predicateObjectList
[7]    predicateObjectList    ::=    verb objectList ( ";" verb objectList )* (";")?
[8]    objectList    ::=    object ( "," object )*
[9]    verb    ::=    predicate
| "a"
[10]    subject    ::=    IRIref
| blank
[11]    predicate    ::=    IRIref
[12]    object    ::=    IRIref
| blank
| literal
[13]    literal    ::=    RDFLiteral
| NumericLiteral
| BooleanLiteral
[14]    blank    ::=    BlankNode
| blankNodePropertyList
| collection
[15]    blankNodePropertyList    ::=    "[" predicateObjectList "]"
[16]    collection    ::=    "(" object* ")"
[60s]    RDFLiteral    ::=    String ( LANGTAG | ( "^^" IRIref ) )?
[61s]    NumericLiteral    ::=    NumericLiteralUnsigned
| NumericLiteralPositive
| NumericLiteralNegative
[62s]    NumericLiteralUnsigned    ::=    INTEGER
| DECIMAL
| DOUBLE
[63s]    NumericLiteralPositive    ::=    INTEGER_POSITIVE
| DECIMAL_POSITIVE
| DOUBLE_POSITIVE
[64s]    NumericLiteralNegative    ::=    INTEGER_NEGATIVE
| DECIMAL_NEGATIVE
| DOUBLE_NEGATIVE
[65s]    BooleanLiteral    ::=    "true"
| "false"
[66s]    String    ::=    STRING_LITERAL1
| STRING_LITERAL2
| STRING_LITERAL_LONG1
| STRING_LITERAL_LONG2
[67s]    IRIref    ::=    IRI_REF
| PrefixedName
[68s]    PrefixedName    ::=    PNAME_LN
| PNAME_NS
[69s]    BlankNode    ::=    BLANK_NODE_LABEL
| ANON
[17]    <BASE>    ::=    "@base"
[18]    <PREFIX>    ::=    "@prefix"
[70s]    <IRI_REF>    ::=    "<" ( [^<>\"{}|^`\\] - [#0000- ] )* ">"
[71s]    <PNAME_NS>    ::=    (PN_PREFIX)? ":"
[72s]    <PNAME_LN>    ::=    PNAME_NS PN_LOCAL
[73s]    <BLANK_NODE_LABEL>    ::=    "_:" PN_LOCAL
[74s]    <VAR1>    ::=    "?" VARNAME
[75s]    <VAR2>    ::=    "$" VARNAME
[76s]    <LANGTAG>    ::=    BASE
| PREFIX
| "@" [a-zA-Z]+ ( "-" [a-zA-Z0-9]+ )*
[77s]    <INTEGER>    ::=    [0-9]+
[78s]    <DECIMAL>    ::=    [0-9]+ "." [0-9]*
| "." [0-9]+
[79s]    <DOUBLE>    ::=    [0-9]+ "." [0-9]* EXPONENT
| "." ( [0-9] )+ EXPONENT
| ( [0-9] )+ EXPONENT
[80s]    <INTEGER_POSITIVE>    ::=    "+" INTEGER
[81s]    <DECIMAL_POSITIVE>    ::=    "+" DECIMAL
[82s]    <DOUBLE_POSITIVE>    ::=    "+" DOUBLE
[83s]    <INTEGER_NEGATIVE>    ::=    "-" INTEGER
[84s]    <DECIMAL_NEGATIVE>    ::=    "-" DECIMAL
[85s]    <DOUBLE_NEGATIVE>    ::=    "-" DOUBLE
[86s]    <EXPONENT>    ::=    [eE] [+-]? [0-9]+
[87s]    <STRING_LITERAL1>    ::=    "'" ( ( [^'\\\n\r] ) | ECHAR )* "'"
[88s]    <STRING_LITERAL2>    ::=    '"' ( ( [^\"\\\n\r] ) | ECHAR )* '"'
[89s]    <STRING_LITERAL_LONG1>    ::=    "'''" ( ( "'" | "''" )? ( [^'\\] | ECHAR ) )* "'''"
[90s]    <STRING_LITERAL_LONG2>    ::=    '"""' ( ( '"' | '""' )? ( [^\"\\] | ECHAR ) )* '"""'
[91s]    <ECHAR>    ::=    "\\" [tbnrf\\\"']
[92s]    <NIL>    ::=    "(" (WS)* ")"
[93s]    <WS>    ::=    " "
| "\t"
| "\r"
| "\n"
[94s]    <ANON>    ::=    "[" (WS)* "]"
[95s]    <PN_CHARS_BASE>    ::=    [A-Z]
| [a-z]
| [#00C0-#00D6]
| [#00D8-#00F6]
| [#00F8-#02FF]
| [#0370-#037D]
| [#037F-#1FFF]
| [#200C-#200D]
| [#2070-#218F]
| [#2C00-#2FEF]
| [#3001-#D7FF]
| [#F900-#FDCF]
| [#FDF0-#FFFD]
| [#10000-#EFFFF]
[96s]    <PN_CHARS_U>    ::=    PN_CHARS_BASE
| "_"
[97s]    <VARNAME>    ::=    ( PN_CHARS_U | [0-9] ) ( PN_CHARS_U | [0-9] | #00B7 | [#0300-#036F] | [#203F-#2040] )*
[98s]    <PN_CHARS>    ::=    PN_CHARS_U
| "-"
| [0-9]
| #00B7
| [#0300-#036F]
| [#203F-#2040]
[99s]    <PN_PREFIX>    ::=    PN_CHARS_BASE ( ( PN_CHARS | "." )* PN_CHARS )?
[100s]    <PN_LOCAL>    ::=    ( PN_CHARS_U | [0-9] ) ( ( PN_CHARS | "." )* PN_CHARS )?
[-]    PASSED TOKENS    ::=    [ \t\r\n]+
| "#" [^\r\n]*

5 Parsing

The RDF Concepts and Abstract Syntax ([RDF-CONCEPTS]) specification defines three types of RDF Term: RDF URI References (here called IRIs), literals and blank nodes. Literals are composed of a lexical form and an optional language tag or datatype IRI. An extra type, prefix, is used during parsing to map string identifiers to namespace IRIs. This section maps a string conforming to the grammar in section 4.4 to a set of triples by mapping this strings matching productions and lexical tokens to these RDF terms or their components (e.g. language tags, lexical forms of literals). Some productions change the parser state (base or prefix declarations).

5.1 Parser State

Parsing Turtle requires a state of four items:

5.2 RDF Term Constructors

This table maps productions and lexical tokens to RDF terms or components of RDF terms listed in section 5:

production type procedure
IRI_REF IRI The characters between "<" and ">" are unescaped¹ to form the unicode string of the IRI. Relative IRI resolution is performed per SPARQL Query section 4.1.1.
PNAME_NS prefix The potentially empty unicode string matching the first argument of the rule is a key into the namespaces map.
PNAME_LN IRI A prefix is identified by the first argument, PNAME_NS. The namespaces map has a corresponding namespace. The unicode string of the IRI is formed by concatenating this namespace and the second argument, PN_LOCAL. Relative IRI resolution is performed per SPARQL Query section 4.1.1.
STRING_LITERAL1 lexical formThe characters between the outermost "'"s are unescaped¹ to form the unicode string of a lexical form.
STRING_LITERAL2 lexical formThe characters between the outermost '"'s are unescaped¹ to form the unicode string of a lexical form.
STRING_LITERAL_LONG1 lexical formThe characters between the outermost "'''"s are unescaped¹ to form the unicode string of a lexical form.
STRING_LITERAL_LONG2 lexical formThe characters between the outermost '"""'s are unescaped¹ to form the unicode string of a lexical form.
LANGTAG language tagThe characters following the "@" form the unicode string of the language tag.
RDFLiteral literal The literal has a lexical form of the first rule argument (String) and either a language tag of LANGTAG or a datatype URI of URIref, depending on which rule matched the input.
INTEGER literal The literal has a lexical form of the input string, and a datatype of xsd:integer.
DECIMAL literal The literal has a lexical form of the input string, and a datatype of xsd:decimal.
DOUBLE literal The literal has a lexical form of the input string, and a datatype of xsd:double.
BooleanLiteral literal The literal has a lexical form of the "true" or "false", depending on which matched the input, and a datatype of xsd:boolean.
BLANK_NODE_LABEL blank node The string matching the second argument, PN_LOCAL, is a key in bnodeLabels. If there is no corresponding blank node in the map, one is allocated.
ANON blank node A blank node is generated.
blankNodePropertyList blank node A blank node is generated. Note the rules for blankNodePropertyList in the next section.
collection blank node A blank node is generated. Note the rules for collection in the next section.

¹ Section 3.3 defines an mapping from escaped unicode strings to unicode strings. The following lexical tokens are unescaped to produce unicode strings: IRI_REF, STRING_LITERAL1, STRING_LITERAL2, STRING_LITERAL_LONG1 and STRING_LITERAL_LONG2.

5.3 RDF Triples Constructors

A Turtle document defines an RDF graph composed of set of RDF triples. Each object N in the document produces an RDF triple: curSubject curPredicate N .

Beginning the blankNodePropertyList production records the curSubject and curPredicate, and sets curSubject to a novel blank node B. Finishing the blankNodePropertyList production restores curSubject and curPredicate. The node produced by matching blankNodePropertyList is the blank node B.

Beginning the collection production records the curSubject and curPredicate, sets curSubject to a novel blank node Bhead and sets curSubject and curPredicate to Bhead and rdf:first respectively. Each object O in collection allocates a novel blank node Bn, creates an additional triple curSubject rdf:rest Bn . and sets curSubject to Bn. Finishing the collection production creates an additional triple curSubject rdf:rest rdf:nil . and restores curSubject and curPredicate The node produced by matching collection is the blank node Bhead.

5.4 Parsing Example (Informative)

The following informative example shows the semantic actions performed when parsing this Turtle document with an LALR1 parser:

@prefix ericFoaf: <http://www.w3.org/People/Eric/ericP-foaf.rdf#> .
@prefix : <http://xmlns.com/foaf/0.1/> .
ericFoaf:ericP :givenName "Eric" ;
               :knows <http://norman.walsh.name/knows/who/dan-brickley> ,
                      [ :mbox <mailto:timbl@w3.org> ] ,
                      <http://getopenid.com/amyvdh> .

6. Examples (Informative)

This example is a Turtle translation of example 7 in the RDF/XML Syntax specification (example1.ttl):

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix dc: <http://purl.org/dc/elements/1.1/> .
@prefix ex: <http://example.org/stuff/1.0/> .

<http://www.w3.org/TR/rdf-syntax-grammar>
  dc:title "RDF/XML Syntax Specification (Revised)" ;
  ex:editor [
    ex:fullname "Dave Beckett";
    ex:homePage <http://purl.org/net/dajobe/>
  ] .

An example of an RDF collection of two literals.

@prefix : <http://example.org/stuff/1.0/> .
:a :b ( "apple" "banana" ) .

which is short for (example2.ttl):

@prefix : <http://example.org/stuff/1.0/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
:a :b
  [ rdf:first "apple";
    rdf:rest [ rdf:first "banana";
               rdf:rest rdf:nil ]
  ] .

An example of two identical triples containing literal objects containing newlines, written in plain and long literal forms. Assumes that line feeds in this document are #xA. (example3.ttl):

@prefix : <http://example.org/stuff/1.0/> .

:a :b "The first line\nThe second line\n  more" .

:a :b """The first line
The second line
  more""" .

As indicated by the grammar, a collection can be either a subject or an object. This subject or object will be the novel blank node for the first object, if the collection has one or more objects, or rdf:nil if the collection is empty.

For example,

(1 2.0 3E1) :p "w" .

is syntactic sugar for (noting that the blank nodes b0, b1 and b2 do not occur anywhere else in the RDF graph):

    _:b0  rdf:first  1 ;
          rdf:rest   _:b1 .
    _:b1  rdf:first  2.0 ;
          rdf:rest   _:b2 .
    _:b2  rdf:first  3E1 ;
          rdf:rest   rdf:nil .
    _:b0  :p         "w" . 

RDF collections can be nested and can involve other syntactic forms:

(1 [:p :q] ( 2 ) ) .

is syntactic sugar for:

    _:b0  rdf:first  1 ;
          rdf:rest   _:b1 .
    _:b1  rdf:first  _:b2 .
    _:b2  :p         :q .
    _:b1  rdf:rest   _:b3 .
    _:b3  rdf:first  _:b4 .
    _:b4  rdf:first  2 ;
          rdf:rest   rdf:nil .
    _:b3  rdf:rest   rdf:nil .

7. Identifiers for the Turtle Language

The URI that identifies the Turtle language is:
http://www.w3.org/2008/turtle#turtle

The XML (Namespace name, Local name) pair that identifies the Turtle language is:
Namespace: http://www.w3.org/2008/turtle#
Local name: turtle
The suggested namespace prefix is ttl (informative) which would make this ttl:turtle as an XML QName.

8. Conformance

Systems conforming to Turtle MUST pass all the following test cases:

  1. The N-Triples tests in the RDF Test Cases W3C Recommendation.
  2. The Turtle Test Suite (tests.zip md5sum 361f0b8b5e3a36d0ecd60be7965009df )

    Passing these tests means:

    1. All the test-n.ttl tests MUST generate equivalent RDF triples to those given in the corresponding test-n.out N-Triples file.
    2. All the bad-n.ttl tests MUST NOT generate RDF triples.

9. Media Type and Content Encoding

The media type of Turtle is text/turtle. The content encoding of Turtle content is always UTF-8. Charset parameters on the mime type are required until such time as the text/ media type tree permits UTF-8 to be sent without a charset parameter. See B. Internet Media Type, File Extension and Macintosh File Type for the media type registration form.

10. Turtle compared to N-Triples (Informative)

Turtle adds the following syntax to N-Triples:

  1. Whitespace restrictions removed
  2. Text content-encoding changed from ASCII to UTF-8
  3. Three additional string syntaxes: STRING_LITERAL2, STRING_LITERAL_LONG1, STRING_LITERAL_LONG2
  4. @base directive for setting a base IRI
  5. @prefix directive for assigning namespace prefixes
  6. Prefixed names
  7. Object lists separated by ,
  8. Predicate object lists separated by ;
  9. Unlabled blank nodes indicated by []
  10. rdf:type shorthand a
  11. RDF collection constructor bound by ()s
  12. Decimal integer literals of type xsd:integer
  13. Decimal double literals of type xsd:double
  14. Decimal arbitrary length literals of type xsd:decimal
  15. Boolean literals of type xsd:boolean

11. Turtle compared to Notation 3 (Informative)

Notation 3 (N3) triples are a superset of RDF triples. In particular, N3 formulae (graphs) may be the subject or object of N3 triples. For example here, the formula with _:Bob a foaf:Person is the object of another arc:

_:Bob ex:said { _:Bob a foaf:Person } .

Following is a partial list of syntactic features in N3 which are not in Turtle:

  1. { ... }
  2. is of
  3. paths like :a.:b.:c and :a^:b^:c
  4. @keywords
  5. => implies
  6. = equivalence
  7. @forAll
  8. @forSome
  9. <=

12. Turtle compared to SPARQL (Informative)

The SPARQL Query Language for RDF (SPARQL) [SPARQLQ] uses a Turtle/N3 style syntax for its TriplesBlock production. This production differs from the Turtle langage in that:

  1. SPARQL permits RDF Literals as the subject of RDF triples (per editor's draft)
  2. SPARQL permits variables (?name or $name) in any part of the triple of the form
  3. Turtle allows prefix and base declarations anywhere outside of a triple. In SPARQL, they are only allowed in the Prologue (at the start of the SPARQL query).

For further information see the Syntax for IRIs and SPARQL Grammar sections of the SPARQL query document [SPARQLQ].

A. References

A.1 Normative

[NOTATION]
Notation section in Extensible Markup Language (XML) 1.0 (Third Edition), T. Bray, J. Paoli, C.m. Sperberg-McQueen, E. Maler, F. Yergeau editors, W3C Recommendation, 04 February 2004. This version of XML 1.0 is http://www.w3.org/TR/2004/REC-xml-20040204/. The latest version of the Extensible Markup Language (XML) 1.0 is at http://www.w3.org/TR/REC-xml/.
[N-TRIPLES]
N-Triples section in RDF Test Cases, J. Grant and D. Beckett, Editors, W3C Recommendation, 10 February 2004. This version of the RDF Test Cases is http://www.w3.org/TR/2004/REC-rdf-testcases-20040210/. The latest version of the RDF Test Cases is at http://www.w3.org/TR/rdf-testcases/.
[UNICODE]
The Unicode Standard Version 3.0, Addison Wesley, Reading MA, 2000, ISBN: 0-201-61633-5. This document is http://www.unicode.org/unicode/standard/standard.html.
[CHARMOD]
Character Model for the World Wide Web 1.0: Fundamentals, M. J. Dürst, F. Yergeau, R. Ishida, M. Wolf, T. Texin editors, W3C Recommendation, 15 February 2005. This version of Character Model for the WWW 1.0: Fundamentals is http://www.w3.org/TR/2005/REC-charmod-20050215/ The latest version of Character Model for the WWW: Fundamentals 1.0 is at http://www.w3.org/TR/charmod/.
[RDF-CONCEPTS]
Resource Description Framework (RDF): Concepts and Abstract Syntax, G. Klyne, J.J. Carroll editors, W3C Recommendation, 10 February 2004. This version of RDF Concepts and Abstract Syntax is http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/. The latest version of RDF Concepts and Abstract Syntax is http://www.w3.org/TR/rdf-concepts/.
[RDF-XML]
RDF/XML Syntax Specification (Revised), D. Beckett editor, W3C Recommendation, 10 February 2004. This version of RDF/XML is http://www.w3.org/TR/2004/REC-rdf-syntax-grammar-20040210/. The latest version of RDF/XML is http://www.w3.org/TR/rdf-syntax-grammar/.
[RFC3023]
RFC 3023 UTF-8, XML Media Types, M. Murata et al January 2001
[RFC3629]
RFC 3629 UTF-8, a transformation format of ISO 10646, F. Yergeau November 2003
[RFC3986]
RFC 3986 Uniform Resource Identifier (URI): Generic Syntax, T. Berners-Lee, R. Fielding, L. Masinter January 2005
[RFC3987]
RFC 3987, "Internationalized Resource Identifiers (IRIs)", M. Dürst , M. Suignard
[UNISEC]
Unicode Security Considerations, Mark Davis, Michel Suignard
[UNICODE]
The Unicode Standard, Version 4. ISBN 0-321-18578-1, as updated from time to time by the publication of new versions. The latest version of Unicode and additional information on versions of the standard and of the Unicode Character Database is available at http://www.unicode.org/unicode/standard/versions/.

A.2 Informative

Previous version
http://www.dajobe.org/2004/01/turtle/2007-09-11/
[NOTATION3]
Notation 3, Tim Berners-Lee, World Wide Web Consortium
[MSWM]
Modernising Semantic Web Markup, Dave Beckett and presentation given at XML Europe 2004, Amsterdam, 20 April 2004
[SPARQLQ]
SPARQL Query Language for RDF, E. Prud'hommeaux, A. Seaborne, Editors. World Wide Web Consortium. W3C Recommendation, 15 January 2008. This version is http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/. The latest version of SPARQL Query Language for RDF is available at http://www.w3.org/TR/rdf-sparql-query/.

B. Internet Media Type, File Extension and Macintosh File Type (Normative)

Contact:
Eric Prud'hommeaux
See also:
How to Register a Media Type for a W3C Specification
Internet Media Type registration, consistency of use
TAG Finding 3 June 2002 (Revised 4 September 2002)

The Internet Media Type / MIME Type for Turtle is "text/turtle".

It is recommended that Turtle files have the extension ".ttl" (all lowercase) on all platforms.

It is recommended that Turtle files stored on Macintosh HFS file systems be given a file type of "TEXT".

This information that follows has been submitted to the IESG for review, approval, and registration with IANA.

Type name:
text
Subtype name:
turtle
Required parameters:
None
Optional parameters:
charset — this parameter is required when transferring non-ASCII data. If present, the value of charset is always UTF-8.
Encoding considerations:
The syntax of Turtle is expressed over code points in Unicode [UNICODE]. The encoding is always UTF-8 [RFC3629].
Unicode code points may also be expressed using an \uXXXX (U+0 to U+FFFF) or \UXXXXXXXX syntax (for U+10000 onwards) where X is a hexadecimal digit [0-9A-F]
Security considerations:
Turtle is a general-purpose assertion language; applications may evaluate given data to infer more assertions or to dereference URIs, invoking the security considerations of the scheme for that URI. Note in particular, the privacy issues in [RFC3023] section 10 for HTTP URIs. Data obtained from an inaccurate or malicious data source may lead to inaccurate or misleading conclusions, as well as the dereferencing of unintended URIs. Care must be taken to align the trust in consulted resources with the sensitivity of the intended use of the data; inferences of potential medical treatments would likely require different trust than inferences for trip planning.
Turtle is used to express arbitrary application data; security considerations will vary by domain of use. Security tools and protocols applicable to text (e.g. PGP encryption, MD5 sum validation, password-protected compression) may also be used on Turtle documents. Security/privacy protocols must be imposed which reflect the sensitivity of the embedded information.
Turtle can express data which is presented to the user, for example, RDF Schema labels. Application rendering strings retrieved from untrusted Turtle documents must ensure that malignant strings may not be used to mislead the reader. The security considerations in the media type registration for XML ([RFC3023] section 10) provide additional guidance around the expression of arbitrary data and markup.
Turtle uses IRIs as term identifiers. Applications interpreting data expressed in Turtle should address the security issues of Internationalized Resource Identifiers (IRIs) [RFC3987] Section 8, as well as Uniform Resource Identifier (URI): Generic Syntax [RFC3986] Section 7.
Multiple IRIs may have the same appearance. Characters in different scripts may look similar (a Cyrillic "о" may appear similar to a Latin "o"). A character followed by combining characters may have the same visual representation as another character (LATIN SMALL LETTER E followed by COMBINING ACUTE ACCENT has the same visual representation as LATIN SMALL LETTER E WITH ACUTE). Any person or application that is writing or interpreting data in Turtle must take care to use the IRI that matches the intended semantics, and avoid IRIs that make look similar. Further information about matching of similar characters can be found in Unicode Security Considerations [UNISEC] and Internationalized Resource Identifiers (IRIs) [RFC3987] Section 8.
Interoperability considerations:
There are no known interoperability issues.
Published specification:
This specification.
Applications which use this media type:
No widely deployed applications are known to use this media type. It may be used by some web services and clients consuming their data.
Additional information:
Magic number(s):
Turtle documents may have the strings '@prefix' or '@base' (case dependent) near the beginning of the document.
File extension(s):
".ttl"
Base URI:
The Turtle '@base <IRIref>' term can change the current base URI for relative IRIrefs in the query language that are used sequentially later in the document.
Macintosh file type code(s):
"TEXT"
Person & email address to contact for further information:
Eric Prud'hommeaux <eric@w3.org>
Intended usage:
COMMON
Restrictions on usage:
None
Author/Change controller:
The Turtle specification is the product of David Beckett and Tim Berners-Lee. A W3C Working Group may assume maintenance of this document; W3C reserves change control over this specifications.

C. Acknowledgements (Informative)

This work was described in the paper New Syntaxes for RDF which discusses other RDF syntaxes and the background to the Turtle (Submitted to WWW2004, referred to as N-Triples Plus there).

This work was started during the Semantic Web Advanced Development Europe (SWAD-Europe) project funded by the EU IST-7 programme IST-2001-34732 (2002-2004) and further development supported by the Institute for Learning and Research Technology at the University of Bristol, UK (2002-Sep 2005).

D. Changes (Informative)

Changes since the last publication of this document W3C Turtle Submission 2008-01-14 . See the Previous changelog for further information


$Id: Overview.html,v 1.13 2011/03/17 16:41:14 eric Exp $

Valid XHTML + RDFa