Copyright © 2010-2024 World Wide Web Consortium. W3C® liability, trademark and permissive document license rules apply.
This document defines a textual syntax for RDF called TriG that allows an RDF dataset to be completely written in a compact and natural text form, with abbreviations for common usage patterns and datatypes. TriG is an extension of the Turtle [RDF12-TURTLE] format.
RDF 1.2 TriG shares triple terms with [RDF12-TURTLE] as a fourth kind of RDF term which can be used as the object of another triple, making it possible to make statements about other statements. RDF 1.2 TriG also adds shares directional language-tagged strings with [RDF12-TURTLE].
In addition, RDF 1.2 TriG shares the reifying triples and annotation syntax extensions with [RDF12-TURTLE] which allows triple terms to also be asserted.
This section describes the status of this document at the time of its publication. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at https://www.w3.org/TR/.
This document is part of the RDF 1.2 document suite. TriG is intended the meet the charter requirement of the RDF Working Group to define an RDF syntax for multiple graphs. TriG is an extension of the Turtle syntax for RDF [RDF12-TURTLE]. The current document is based on the original proposal by Chris Bizer and Richard Cyganiak.
This document was published by the RDF-star Working Group as a Working Draft using the Recommendation track.
Publication as a Working Draft does not imply endorsement by W3C and its Members.
This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress. Future updates to this specification may incorporate new features.
This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 03 November 2023 W3C Process Document.
This document defines TriG, a concrete syntax for RDF as defined in the RDF Concepts and Abstract Syntax document [RDF12-CONCEPTS]. TriG is an extension of Turtle [RDF12-TURTLE], extended to support representing a complete RDF dataset.
This section is non-normative.
A TriG document allows writing down an RDF dataset in a compact textual form.
It consists of a sequence of directives, triple statements,
graph statements which contain triple-generating statements and optional blank lines.
Comments may be given after a #
that is not part of another
lexical token and continue to the end of the line.
Graph statements are a pair of an
IRI or
blank node
and a group of triple statements
surrounded by braces ({
}
).
The IRI or
blank node
of the graph statement may be used in another graph statement
which implies taking the union of the tripes generated
by each graph statement.
An IRI or
blank node
used as a graph label may also reoccur
as part of any triple statement.
Optionally a graph statement may not not be labeled with an IRI.
Such a graph statement corresponds to the Default Graph of an RDF dataset.
The construction of an RDF dataset from a TriG document is defined in 4. TriG Grammar and 5. Parsing.
As TriG is an extention of the Turtle language it allows for any constructs from the Turtle language. Simple Triples, Predicate Lists, and Object Lists can all be used either inside a graph statement, or on their own as in a TriG document. When outside a graph statement, the triples are considered to be part of the default graph of the RDF dataset.
A graph statement pairs an
IRIs or
blank node with an
RDF graph.
The triple statements that make up the graph are enclosed in
{
}
.
In a TriG document a graph IRI or blank node may be used as label for more than one graph statements. The graph label of a graph statement may be omitted. In this case the graph is considered the default graph of the RDF dataset.
A RDF dataset might contain only a single graph.
# This document encodes one graph.
PREFIX ex: <http://www.example.org/vocabulary#>
PREFIX : <http://www.example.org/exampleDocument#>
:G1 { :Monica a ex:Person ;
ex:name "Monica Murphy" ;
ex:homepage <http://www.monicamurphy.org> ;
ex:email <mailto:monica@monicamurphy.org> ;
ex:hasSkill ex:Management ,
ex:Programming .
}
An RDF dataset may contain a default graph and/or zero or more named graphs.
# This document contains a default graph and two named graphs.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dc: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
# default graph
{
<http://example.org/bob> dc:publisher "Bob" .
<http://example.org/alice> dc:publisher "Alice" .
}
<http://example.org/bob> {
_:a foaf:name "Bob" .
_:a foaf:mbox <mailto:bob@oldcorp.example.org> .
_:a foaf:knows _:b .
}
<http://example.org/alice> {
_:b foaf:name "Alice" .
_:b foaf:mbox <mailto:alice@work.example.org> .
}
TriG provides various alternative ways to write graphs and triples, giving the data writer choices for clarity:
# This document contains a same data as the previous example.
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX dc: <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
# default graph - no {} used.
<http://example.org/bob> dc:publisher "Bob" .
<http://example.org/alice> dc:publisher "Alice" .
# GRAPH keyword to highlight a named graph
# Abbreviation of triples using ;
GRAPH <http://example.org/bob>
{
[] foaf:name "Bob" ;
foaf:mbox <mailto:bob@oldcorp.example.org> ;
foaf:knows _:b .
}
GRAPH <http://example.org/alice>
{
_:b foaf:name "Alice" ;
foaf:mbox <mailto:alice@work.example.org>
}
TriG shares the same syntax for representing triple terms as Turtle, including Reifying Triples and the Annotation Syntax.
PREFIX : <http://www.example.org/>
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
_:e38 :familyName "Smith" .
_:anno rdf:reifies <<( _:e38 :jobTitle "Designer" )>> .
_:anno :accordingTo _:e22 .
All other terms and directives come from Turtle.
Blank nodes sharing the same identifier in differently labeled graph statements are considered to be the same blank node.
As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.
The key words MUST and SHOULD in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.
This specification defines conformance criteria for:
A conforming TriG document is a Unicode string that conforms to the grammar
and additional constraints defined in 4. TriG Grammar,
starting with the trigDoc
production.
A TriG document serializes an RDF dataset.
A conforming TriG parser is a system capable of reading TriG documents on behalf of an application. It makes the serialized RDF dataset, as defined in 5. Parsing, available to the application, usually through some form of API.
The IRI that identifies the TriG language is:
http://www.w3.org/ns/formats/TriG
This specification does not define how TriG parsers handle non-conforming input documents.
The media type of TriG is application/trig
.
The content encoding of TriG content is always UTF-8.
A TriG document is a Unicode [UNICODE] character string encoded in UTF-8. Unicode characters only in the range U+0000 to U+10FFFF inclusive are allowed.
White space (production WS
)
is used to separate two terminals which would otherwise be (mis-)recognized as one terminal.
Rule names below in capitals indicate where white space is significant;
these form a possible choice of terminals for constructing a TriG parser.
White space is significant in the production String
.
Comments in TriG start with a #
outside an
IRIREF
or String
,
and continue to the end of line (marked by
LF
, or
CR
),
or end of file if there is no end of line after the comment marker.
Comments are treated as white space.
Relative IRI references are resolved with base IRIs as per Uniform Resource Identifier (URI): Generic Syntax [RFC3986] using only the basic algorithm in section 5.2. Neither Syntax-Based Normalization nor Scheme-Based Normalization (described in sections 6.2.2 and 6.2.3 of RFC3986) are performed. Characters additionally allowed in IRI references are treated in the same way that unreserved characters are treated in URI references, per section 6.5 of Internationalized Resource Identifiers (IRIs) [RFC3987].
The @base
or BASE
directive
defines the Base IRI used to resolve relative IRI references
per [RFC3986] section 5.1.1, "Base URI Embedded in Content".
Section 5.1.2, "Base URI from the Encapsulating Entity"
defines how the In-Scope Base IRI may come from an encapsulating document,
such as a SOAP envelope with an xml:base
directive or a MIME multipart document with a
Content-Location
header.
The "Retrieval URI" identified in 5.1.3, Base "URI from the Retrieval URI",
is the URL from which a particular TriG document was retrieved.
If none of the above specifies the Base URI, the default
Base URI (section 5.1.4, "Default Base URI") is used.
Each @base
or BASE
directive sets a new In-Scope Base URI,
relative to the previous one.
There are three forms of escapes used in TriG documents:
numeric escape sequences represent Unicode code points:
Escape sequence | Unicode code point |
---|---|
\u hex
hex
hex
hex |
A Unicode code point
in the range U+0000 to U+FFFF inclusive
corresponding to the value encoded by the four hexadecimal digits interpreted
from most significant to least significant digit. |
\U hex
hex
hex
hex
hex
hex
hex
hex |
A Unicode code point
in the range U+0000 to U+0010FFFF inclusive
corresponding to the value encoded by the eight hexadecimal digits
interpreted from most significant to least significant digit. |
where hex
is a hexadecimal character
HEX ::= [0-9] | [A-F] | [a-f]
string escape sequences represent the characters traditionally escaped in string literals:
Escape sequence | Unicode code point |
---|---|
\t |
U+0009 |
\b |
U+0008 |
\n |
U+000A |
\r |
U+000D |
\f |
U+000C |
\" |
U+0022 |
\' |
U+0027 |
\\ |
U+005C |
reserved character escape sequences consist of
a \
followed by one of ~.-!$&'()*+,;=/?#@%_
and represent the character to the right of the \
.
numeric escapes |
string escapes |
reserved character escapes |
|
---|---|---|---|
IRIs,
used as RDF terms
or as in PREFIX
or BASE declarations |
yes | no | no |
local names | no | no | yes |
Strings | yes | yes | no |
%-encoded sequences are in the
character range for IRIs
and are explicitly allowed in local names.
These appear as a '%' followed by two hex characters and represent that same sequence
of three characters.
These sequences are not decoded during processing.
A term written as <http://a.example/%66oo-bar>
in TriG designates the IRI http://a.example/%66oo-bar
and not IRI http://a.example/foo-bar
.
A term written as ex:%66oo-bar
with a prefix
PREFIX ex: <http://a.example/>
also designates
the IRI http://a.example/%66oo-bar
.
The EBNF used here is defined in XML 1.0 [EBNF-NOTATION].
Notes:
@base
',
'@prefix
',
'a
',
'true
',
'false
')
are case-sensitive.
Keywords in quotation marks
("BASE
",
"PREFIX
",
"GRAPH
")
are case-insensitive.
UCHAR
and ECHAR
are case sensitive.
trigDoc
.
@prefix
'
and '@base
'
match the pattern for LANG_DIR
,
though neither prefix
nor base
are registered language subtags.
This specification does not define whether a quoted literal
followed by either of these tokens (e.g., "A"@base
)
is in the Turtle language.
[1] | trigDoc |
::= | ( directive | block) * |
[2] | block |
::= | triplesOrGraph | wrappedGraph | triples2 | ( "GRAPH " labelOrSubject wrappedGraph) |
[3] | triplesOrGraph |
::= | ( labelOrSubject ( wrappedGraph | ( predicateObjectList '. ') ) ) | ( reifiedTriple predicateObjectList? '. ') |
[4] | triples2 |
::= | ( blankNodePropertyList predicateObjectList? '. ') | ( collection predicateObjectList '. ') |
[5] | wrappedGraph |
::= | '{ ' triplesBlock? '} ' |
[6] | triplesBlock |
::= | triples ( '. ' triplesBlock? ) ? |
[7] | labelOrSubject |
::= | iri | BlankNode |
[8] | directive |
::= | prefixID | base | sparqlPrefix | sparqlBase |
[9] | prefixID |
::= | '@prefix ' PNAME_NS IRIREF '. ' |
[10] | base |
::= | '@base ' IRIREF '. ' |
[11] | sparqlPrefix |
::= | "PREFIX " PNAME_NS IRIREF |
[12] | sparqlBase |
::= | "BASE " IRIREF |
[13] | triples |
::= | ( subject predicateObjectList) | ( blankNodePropertyList predicateObjectList? ) | ( reifiedTriple predicateObjectList? ) |
[14] | predicateObjectList |
::= | verb objectList ( '; ' ( verb objectList) ? ) * |
[15] | objectList |
::= | object annotation ( ', ' object annotation) * |
[16] | verb |
::= | predicate | 'a ' |
[17] | subject |
::= | iri | BlankNode | collection |
[18] | predicate |
::= | iri |
[19] | object |
::= | iri | BlankNode | collection | blankNodePropertyList | literal | tripleTerm | reifiedTriple |
[20] | literal |
::= | RDFLiteral | NumericLiteral | BooleanLiteral |
[21] | blankNodePropertyList |
::= | '[ ' predicateObjectList '] ' |
[22] | collection |
::= | '( ' object* ') ' |
[23] | NumericLiteral |
::= | INTEGER | DECIMAL | DOUBLE |
[24] | RDFLiteral |
::= | String ( LANG_DIR | ( '^^ ' iri) ) ? |
[25] | BooleanLiteral |
::= | 'true ' | 'false ' |
[26] | String |
::= | STRING_LITERAL_QUOTE | STRING_LITERAL_SINGLE_QUOTE | STRING_LITERAL_LONG_SINGLE_QUOTE | STRING_LITERAL_LONG_QUOTE |
[27] | iri |
::= | IRIREF | PrefixedName |
[28] | PrefixedName |
::= | PNAME_LN | PNAME_NS |
[29] | BlankNode |
::= | BLANK_NODE_LABEL | ANON |
[30] | reifier |
::= | '~ ' ( iri | BlankNode) ? |
[31] | reifiedTriple |
::= | '<< ' ( subject | reifiedTriple) verb object reifier? '>> ' |
[32] | tripleTerm |
::= | '<<( ' ttSubject verb ttObject ')>> ' |
[33] | ttSubject |
::= | iri | BlankNode |
[34] | ttObject |
::= | iri | BlankNode | literal | tripleTerm |
[35] | annotation |
::= | ( reifier | annotationBlock) * |
[36] | annotationBlock |
::= | '{| ' predicateObjectList '|} ' |
[38] | IRIREF |
::= | '< ' ( [ ^ #x00 - #x20 <>"{}|^`\ ] | UCHAR) * '> '
/* #x00=NULL #x01-#x1F=control codes #x20=space */ |
[39] | PNAME_NS |
::= | PN_PREFIX? ': ' |
[40] | PNAME_LN |
::= | PNAME_NS PN_LOCAL |
[41] | BLANK_NODE_LABEL |
::= | '_: ' ( PN_CHARS_U | [ 0-9 ] ) ( ( PN_CHARS | '. ') * PN_CHARS) ? |
[42] | LANG_DIR |
::= | '@ ' [ a-zA-Z ] + ( '- ' [ a-zA-Z0-9 ] + ) * ( '-- ' [ a-zA-Z ] + ) ? |
[43] | INTEGER |
::= | [ +- ] ? [ 0-9 ] + |
[44] | DECIMAL |
::= | [ +- ] ? ( [ 0-9 ] * '. ' [ 0-9 ] + ) |
[45] | DOUBLE |
::= | [ +- ] ? ( ( [ 0-9 ] + '. ' [ 0-9 ] * EXPONENT) | ( '. ' [ 0-9 ] + EXPONENT) | ( [ 0-9 ] + EXPONENT) ) |
[46] | EXPONENT |
::= | [ eE ] [ +- ] ? [ 0-9 ] + |
[47] | STRING_LITERAL_QUOTE |
::= | '" ' ( [ ^ #x22 #x5C #x0A #x0D ] | ECHAR | UCHAR) * '" ' |
[48] | STRING_LITERAL_SINGLE_QUOTE |
::= | "' " ( [ ^ #x27 #x5C #x0A #x0D ] | ECHAR | UCHAR) * "' " |
[49] | STRING_LITERAL_LONG_SINGLE_QUOTE |
::= | "''' " ( ( "' " | "'' ") ? ( [ ^'\ ] | ECHAR | UCHAR) ) * "''' " |
[50] | STRING_LITERAL_LONG_QUOTE |
::= | '""" ' ( ( '" ' | '"" ') ? ( [ ^"\ ] | ECHAR | UCHAR) ) * '""" ' |
[51] | UCHAR |
::= | ( '\u ' HEX HEX HEX HEX) | ( '\U ' HEX HEX HEX HEX HEX HEX HEX HEX) |
[52] | ECHAR |
::= | '\ ' [ tbnrf\"' ] |
[53] | WS |
::= | #x20 | #x09 | #x0D | #x0A |
[54] | ANON |
::= | '[ ' WS* '] ' |
[55] | PN_CHARS_BASE |
::= | [ A-Z ] |
| |
[ a-z ] |
||
| |
[ #xC0 - #xD6 ] |
||
| |
[ #xD8 - #xF6 ] |
||
| |
[ #xF8 - #x02FF ] |
||
| |
[ #x0370 - #x037D ] |
||
| |
[ #x037F - #x1FFF ] |
||
| |
[ #x200C - #x200D ] |
||
| |
[ #x2070 - #x218F ] |
||
| |
[ #x2C00 - #x2FEF ] |
||
| |
[ #x3001 - #xD7FF ] |
||
| |
[ #xF900 - #xFDCF ] |
||
| |
[ #xFDF0 - #xFFFD ] |
||
| |
[ #x00010000 - #x000EFFFF ] |
||
[56] | PN_CHARS_U |
::= | PN_CHARS_BASE | '_ ' |
[57] | PN_CHARS |
::= | PN_CHARS_U | '- ' | [ 0-9 ] | #xB7 | [ #x0300 - #x036F ] | [ #x203F - #x2040 ] |
[58] | PN_PREFIX |
::= | PN_CHARS_BASE ( ( PN_CHARS | '. ') * PN_CHARS) ? |
[59] | PN_LOCAL |
::= | ( PN_CHARS_U | ': ' | [ 0-9 ] | PLX) ( ( PN_CHARS | '. ' | ': ' | PLX) * ( PN_CHARS | ': ' | PLX) ) ? |
[60] | PLX |
::= | PERCENT | PN_LOCAL_ESC |
[61] | PERCENT |
::= | '% ' HEX HEX |
[62] | HEX |
::= | [ 0-9 ] | [ A-F ] | [ a-f ] |
[63] | PN_LOCAL_ESC |
::= | '\ ' ( '_ ' | '~ ' | '. ' | '- ' | "! " | '$ ' | '& ' | "' " | '( ' | ') ' | '* ' | '+ ' | ', ' | '; ' | '= ' | '/ ' | '? ' | '# ' | '@ ' | '% ') |
This document uses some specific terminal literal strings [EBNF-NOTATION]. To clarify the Unicode code points used for these terminal literal strings, the following table describes specific characters and sequences used throughout this document.
Code | Glyph | Description |
---|---|---|
U+000A |
LF |
Line feed |
U+000D |
CR |
Carriage return |
U+0022 |
" |
Quotation mark |
U+0023 |
# |
Number sign |
U+0027 |
' |
Apostrophe |
U+002D |
- |
Hyphen |
U+003B |
: |
Colon |
U+0040 |
@ |
At sign |
U+005C |
\ |
Backslash |
U+005F |
_ |
Underscore |
U+0061 |
a |
Latin small letter E |
U+007B |
{ |
Left curly bracket |
U+007D |
} |
Right curly bracket |
Other short terminal literal strings are composed of specific sequences of Unicode characters:
space
U+0020
"""
U+0022
'''
U+0027
--
-
charactersThe RDF Concepts and Abstract Syntax [RDF12-CONCEPTS]
specification defines four types of RDF term:
IRIs,
literals,
blank nodes, and
triple terms.
Literals are composed of a lexical form
and an optional language tag [BCP47]
– possibly including a base direction –
or an optional datatype IRI.
An extra type, prefix
, is used during parsing to map string identifiers to namespace IRIs.
This section maps a string conforming to the grammar in 4.5 Grammar
to a set of triples by mapping strings matching productions and lexical tokens to RDF terms
or their components (e.g., language tags, lexical forms of literals).
Grammar productions change the parser state and emit triples.
Parsing TriG requires a state of nine items:
base
production is reached,
the second rule argument, IRIREF
, is the base URI used for relative IRI reference resolution.PNAME_NS
and IRIREF
)
in the prefixID production
assign a namespace name (IRIREF
) for the prefix (PNAME_NS
).
Outside of a prefixID
production, any PNAME_NS
is substituted with the namespace.
Note that the prefix may be an empty string,
per the PNAME_NS
production: PN_PREFIX? ":"
.subject
,
blankNodePropertyList
,
collection
,
tripleTerm
, and
annotationBlock
productions.verb
production.
If token matched was a
, curPredicate
is bound to the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#type
.object
and
ttObject
productions.reifier
and
annotationBlock
productions.Term Constructors can create a stack of these values indicated by using language such as "records the curSubject and curPredicate."
This table maps productions and lexical tokens to RDF terms or components of RDF terms listed in 5. Parsing:
production | type | procedure |
---|---|---|
IRIREF | IRI | The characters between "<" and ">" are taken, after the numeric escape sequences are processed, to form the Unicode string of the IRI. Relative IRI reference resolution is performed per 4.3 IRI References. |
PNAME_NS | prefix | When used in a prefixID or sparqlPrefix production, the prefix is the potentially empty Unicode string matching the first argument of the rule is a key into the namespaces map. |
IRI | When used in a PrefixedName production, the iri is the value in the namespaces map corresponding to the first argument of the rule. | |
PNAME_LN | IRI | A potentially empty prefix is identified by the first sequence, PNAME_NS . The namespaces map MUST have a corresponding namespace . The Unicode string of the IRI is formed by unescaping the reserved characters in the second argument, PN_LOCAL , and concatenating this onto the namespace . |
STRING_LITERAL_SINGLE_QUOTE | lexical form | The characters between the outermost ' s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form. |
STRING_LITERAL_QUOTE | lexical form | The characters between the outermost " s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form. |
STRING_LITERAL_LONG_SINGLE_QUOTE | lexical form | The characters between the outermost ''' s are taken, with numeric and string escape sequences unescaped, to form the unicode string of a lexical form. |
STRING_LITERAL_LONG_QUOTE | lexical form | The characters between the outermost """ s are taken, with numeric and string escape sequences unescaped, to form the Unicode string of a lexical form. |
LANG_DIR | language tag | The characters following the @ form the language tag and optionally the base direction, if the matched characters include -- . |
RDFLiteral | literal | The literal has a lexical form of the first rule argument, String . If the '^^' iri rule is matched, the datatype IRI is derived from the iri , and the literal has no language tag. If the LANG_DIR rule is matched, the language tag and base direction are taken from LANG_DIR . If there is no base direction, the datatype is rdf:langString . If there is a base direction, the datatype is rdf:dirLangString . If neither matched, the datatype is xsd:string , and the literal has no language tag. |
INTEGER | literal | The literal has a lexical form of the input string, and a datatype of xsd:integer . |
DECIMAL | literal | The literal has a lexical form of the input string, and a datatype of xsd:decimal . |
DOUBLE | literal | The literal has a lexical form of the input string, and a datatype of xsd:double . |
BooleanLiteral | literal | The literal has a lexical form of the true or false , depending on which matched the input, and a datatype of xsd:boolean . |
BLANK_NODE_LABEL | blank node | The string matching the second argument, PN_LOCAL , is a key in bnodeLabels. If there is no corresponding blank node in the map, one is allocated. |
ANON | blank node | A blank node is generated. |
blankNodePropertyList | blank node | A blank node is generated. Note the rules for blankNodePropertyList in the next section. |
collection | blank node | For non-empty lists, a blank node is generated. Note the rules for collection in the next section. |
IRI | For empty lists, the resulting IRI is rdf:nil . Note the rules for collection in the next section. | |
reifier | IRI | blank node |
The curReifier is taken from term, which is taken from the matched
iri production
or BlankNode production, if any.
If no such production is matched, term is taken
from a fresh RDF blank node.
|
tripleTerm | triple term |
The triple term
is composed of the terms constructed from
the ttSubject ,
predicate , and
ttObject productions.
|
reifiedTriple | IRI | blank node |
The term is taken from the matched reifier , if any,
or from a fresh RDF blank node.
|
annotationBlock | IRI | blank node | The term is taken from a previously matched reifier, if any, or from a fresh RDF blank node. |
A TriG document defines an RDF Dataset composed of one default graph and zero or more named graphs. Each graph is composed of a set of RDF triples.
The state curGraph is initially unset. It records the label of the graph for triples produced during parsing. If undefined, the default graph is used.
The rule
labelOrSubject
sets both curGraph
and curSubject
(only one of these will be used).
The following grammar production clauses set curGraph to be undefined, indicating the default graph:
The grammar production
labelOrSubject predicateObjectList '.'
unsets curGraph
before handling the predicateObjectList
production
in rule triplesOrGraph
.
Each RDF triple produced is added to curGraph, or the default graph if curGraph is not set at that point in the parsing process.
The subject
production sets the curSubject.
The verb
production sets the curPredicate.
Triples are produced at the following points in the parsing process and each RDF triple produced is added to the graph identified by curGraph.
Each object N in the document produces an RDF triple: curSubject curPredicate N.
Beginning the reifier
production,
the curReifier is taken from the reifier
term constructor.
Then yield the the RDF triple curReifier rdf:reifies
curTripleTerm.
Beginning the reifiedTriple
production
records the curTripleTerm.
A new tripleTerm
instance curTripleTerm
is created using the
subject
,
verb
, and
object
productions.
Finishing the reifiedTriple
production,
if the curReifier is not set, it is assigned a fresh RDF blank node;
it next yields the RDF triple curReifier rdf:reifies
curTripleTerm,
and then restores the recorded value of the curTripleTerm.
The node produced by matching reifiedTriple
is the the curReifier.
Beginning the annotation
production
records the curSubject and curPredicate.
A new tripleTerm
instance curTripleTerm
is created using the curSubject curPredicate curObject,
and the value of the curReifier is cleared.
Finishing the annotation
production
restores the recorded values of the curSubject and curPredicate.
Beginning the annotationBlock
production records the curTripleTerm.
If the curReifier is not set, then it is assigned a fresh RDF blank node
and the production yields the RDF triple curReifier rdf:reifies
curTripleTerm.
The curSubject is taken from the curReifier
Finishing the annotationBlock
production
clears the value of the curReifier
and restores the curTripleTerm.
If the curReifier was already set, the
reifying triple curReifier rdf:reifies
curTripleTerm
was emitted in 5.3.4 Reifiers.
Beginning the blankNodePropertyList
production records the curSubject and curPredicate, and sets curSubject to a novel blank node B.
Finishing the blankNodePropertyList
production restores curSubject and curPredicate.
The node produced by matching blankNodePropertyList
is the blank node B.
Beginning the collection
production records the curSubject and curPredicate.
Each object
in the collection
production has a curSubject set to a novel blank node B and a curPredicate set to rdf:first
.
For each object objectn after the first produces a triple:objectn-1 rdf:rest
objectn .
Finishing the collection
production creates an additional triple curSubject rdf:rest rdf:nil
. and restores curSubject and curPredicate
The node produced by matching collection
is the first blank node B for non-empty lists and rdf:nil
for empty lists.
This section is non-normative.
The TriG format is used to express arbitrary application data, which may include the expression of personally identifiable information (PII) or other information which could be considered sensitive. Authors publishing such information are advised to carefully consider the needs and use of publishing such information, as well as the applicable regulations for the regions where the data is expected to be consumed and potentially revealed (e.g., GDPR, CCPA, others), particularly whether authorization measures are needed for access to the data.
This section is non-normative.
The STRING_LITERAL_SINGLE_QUOTE
,
STRING_LITERAL_QUOTE
,
STRING_LITERAL_LONG_SINGLE_QUOTE
, and
STRING_LITERAL_LONG_QUOTE
,
productions allows the use of unescaped control characters.
Although this specification does not directly expose this content to an end user,
it might be presented through a user agent, which may cause the presented text to
be obfuscated due to presentation of such characters.
TriG is a general-purpose assertion language; applications may evaluate given data to infer more assertions or to dereference IRIs, invoking the security considerations of the scheme for that IRI. Note in particular, the privacy issues in [RFC3023] section 10 for HTTP IRIs. Data obtained from an inaccurate or malicious data source may lead to inaccurate or misleading conclusions, as well as the dereferencing of unintended IRIs. Care must be taken to align the trust in consulted resources with the sensitivity of the intended use of the data; inferences of potential medical treatments would likely require different trust than inferences for trip planning.
The TriG language is used to express arbitrary application data; security considerations will vary by domain of use. Security tools and protocols applicable to text (for example, PGP encryption, checksum validation, password-protected compression) may also be used on TriG documents. Security/privacy protocols must be imposed which reflect the sensitivity of the embedded information.
TriG can express data which is presented to the user, such as RDF Schema labels. Applications rendering strings retrieved from untrusted TriG documents, or using unescaped characters, SHOULD use warnings and other appropriate means to limit the possibility that malignant strings might be used to mislead the reader. The security considerations in the media type registration for XML ([RFC3023] section 10) provide additional guidance around the expression of arbitrary data and markup.
TriG uses IRIs as term identifiers. Applications interpreting data expressed in TriG SHOULD address the security issues of Internationalized Resource Identifiers (IRIs) [RFC3987] Section 8, as well as Uniform Resource Identifier (URI): Generic Syntax [RFC3986] Section 7.
Multiple IRIs may have the same appearance.
Characters in different scripts may look similar (for instance,
a Cyrillic "о" (code point U+043E
) may appear similar to a Latin "o" (code point U+006F
)).
A character followed by combining characters may have the same visual representation
as another character (for example, LATIN SMALL LETTER "E" (code point U+0065
)
followed by COMBINING ACUTE ACCENT (code point U+0301
)
has the same visual representation as LATIN SMALL LETTER "E" WITH ACUTE
(U+00E9
)).
Any person or application that is writing or interpreting data in TriG
must take care to use the IRI that matches the intended semantics,
and avoid IRIs that may look similar.
Further information about matching visually similar characters can be found
in Unicode Security Considerations [UNICODE-SECURITY] and
Internationalized Resource Identifiers (IRIs) [RFC3987] Section 8.
The Internet Media Type (formerly known as MIME Type) for TriG is "application/trig".
It is recommended that TriG files have the extension ".trig" (all lowercase) on all platforms.
It is recommended that TriG files stored on Macintosh HFS file systems be given a file type of "TEXT".
This information that follows will be submitted to the IESG for review, approval, and registration with IANA.
This section is non-normative.
This section is non-normative.
The editors gratefully acknowledge the work of Chris Bizer and Richard Cyganiak in creating the original TriG specification. Valuable contributions to this version were made by Gregg Kellogg, Eric Prud'hommeaux and Sandro Hawke.
The document was improved through the review process by the wider community.
This section is non-normative.
In addition to the editors, the following people have contributed to this specification: Peter F. Patel-Schneider, Pierre-Antoine Champin, and Ted Thibodeau Jr
Members of the RDF-star Working Group Group included Achille Zappa, Adrian Gschwend, Alan Snyder, Amin Anjomshoaa, Andy Seaborne, Antoine Zimmermann, Dan Brickley, Dave Raggett, Dominik Tomaszuk, Dörthe Arndt, Enrico Franconi, Erich Bremer, Fabien Gandon, Felix Sasaki, Gregg Kellogg, Gregory Williams, Jean-Yves Rossi, Jose Emilio Labra Gayo, Julián Arenas-Guerrero, Kurt Cagle, Niklas Lindström, Olaf Hartig, Ora Lassila, Pasquale Lisena, Peter Patel-Schneider, Pierre-Antoine Champin, Raphaël Troncy, Richard Lea, Ruben Taelman, Rémi Ceres, Souripriya Das, Ted Thibodeau Jr, Thomas Lörtsch, Thomas Pellissier Tanon, Timothée Haudebourg, and Vladimir Alexiev.
Recognize members of the Task Force? Not an easy to find list of contributors.
This section is non-normative.
This section describes the main differences from the RDF 1.1 Recommendation.
LANGTAG
terminal production to
LANG_DIR
to include
an optional base direction.NIL
terminal production from the grammar, which was unused.Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in:
Referenced in: