RDF 1.1 Concepts and Abstract Syntax

Abstract

The Resource Description Framework (RDF) is a framework for representing information in the Web.

RDF 1.1 Concepts and Abstract Syntax defines an abstract syntax (a data model) which serves to link all RDF-based languages and specifications. The abstract syntax has two key data structures: RDF graphs are sets of subject-predicate-object triples, where the elements may be IRIs, blank nodes, or datatyped literals. They are used to express descriptions of resources. RDF datasets are used to organize collections of RDF graphs, and comprise a default graph and zero or more named graphs. This document also introduces key concepts and terminology, and discusses datatyping and the handling of fragment identifiers in IRIs within RDF graphs.

1. Introduction

This section is non-normative.

The Resource Description Framework (RDF) is a framework for representing information in the Web.

This document defines an abstract syntax (a data model) which serves to link all RDF-based languages and specifications, including:

the formal model-theoretic semantics for RDF and RDFS [RDF11-MT],
serialization syntaxes for storing and exchanging RDF (e.g., Turtle [TURTLE-CR] and RDF/XML [RDF-SYNTAX-GRAMMAR]),
the SPARQL Query Language [RDF-SPARQL-QUERY],
the RDF Vocabulary Description Language (RDFS) [RDF-SCHEMA].

1.1 Graph-based Data Model

This section is non-normative.

The core structure of the abstract syntax is a set of triples, each consisting of a subject, a predicate and an object. A set of such triples is called an RDF graph. An RDF graph can be visualized as a node and directed-arc diagram, in which each triple is represented as a node-arc-node link.

Fig. 1 An RDF graph with two nodes (Subject and Object) and a triple connecting them (Predicate)

There can be three kinds of nodes in an RDF graph: IRIs, literals, and blank nodes.

1.2 Resources and Statements

Any IRI or literal denotes something in the world (the "universe of discourse"). These things are called resources. Anything can be a resource, including physical things, documents, abstract concepts, numbers and strings; the term is synonymous with “entity”. The resource denoted by an IRI is called its referent, and the resource denoted by a literal is called its literal value. Literals have datatypes that define the range of possible values, such as strings, numbers, and dates. A special kind of literal, the language-tagged string, denotes a plain-text string in a natural language.

Asserting an RDF triple says that some relationship, indicated by the predicate, holds between the resources denoted by the subject and object. This statement corresponding to an RDF triple is known as an RDF statement. The predicate itself is an IRI and denotes a property, that is, a resource that can be thought of as a binary relation. (Relations that involve more than two entities can only be indirectly expressed in RDF [SWBP-N-ARYRELATIONS].)

Unlike IRIs and literals, blank nodes do not denote specific resources. Statements involving blank nodes say that something with the given relationships exists, without explicitly naming it.

1.3 The Referent of an IRI

The resource denoted by an IRI is also called its referent. For some IRIs with particular meanings, such as those identifying XSD datatypes, the referent is fixed by this specification. For all other IRIs, what exactly is denoted by any given IRI is not defined by this specification. Other specifications may fix IRI referents, or apply other constraints on what may be the referent of any IRI.

Guidelines for determining the referent of an IRI are provided in other documents, like Architecture of the World Wide Web, Volume One [WEBARCH] and Cool URIs for the Semantic Web [COOLURIS]. A very brief, informal and partial account follows:

IRIs have global scope: Two different appearances of an IRI denote the same resource.
By social convention, the IRI owner [WEBARCH] gets to say what the intended (or usual) referent of an IRI is. Applications and users need not abide by this intended denotation, but there may be a loss of interoperability with other applications and users if they do not do so.
The IRI owner can establish the intended referent by means of a specification or other document that explains what is denoted. For example, the Organization Ontology document [vocab-org] specifies the intended referents of various IRIs that start with http://www.w3.org/ns/org#.
A good way of communicating the intended referent is to set up the IRI so that it dereferences [WEBARCH] to such a document.
Such a document can, in fact, be an RDF document that describes the denoted resource by means of RDF statements.

Perhaps the most important characteristic of IRIs in web architecture is that they can be dereferenced, and hence serve as starting points for interactions with a remote server. This specification is not concerned with such interactions. It does not define an interaction model. It only treats IRIs as globally unique identifiers in a graph data model that describes resources. However, those interactions are critical to the concept of Linked Data [LINKED-DATA], which makes use of the RDF data model and serialization formats.

1.4 RDF Vocabularies and Namespace IRIs

An RDF vocabulary is a collection of IRIs intended for use in RDF graphs. For example, the IRIs documented in [RDF-SCHEMA] are the RDF Schema vocabulary. RDF Schema can itself be used to define and document additional RDF vocabularies. Some such vocabularies are mentioned in the Primer [RDF-PRIMER].

The IRIs in an RDF vocabulary often begin with a common substring known as a namespace IRI. Some namespace IRIs are associated by convention with a short name known as a namespace prefix. Some examples:

Some example namespace prefixes and IRIs
Namespace prefix	Namespace IRI	RDF vocabulary
rdf	`http://www.w3.org/1999/02/22-rdf-syntax-ns#`	The RDF built-in vocabulary [RDF-SCHEMA]
rdfs	`http://www.w3.org/2000/01/rdf-schema#`	The RDF Schema vocabulary [RDF-SCHEMA]
xsd	`http://www.w3.org/2001/XMLSchema#`	The RDF-compatible XSD types

In some serialization formats it is common to abbreviate IRIs that start with namespace IRIs by using a namespace prefix in order to assist readability. For example, the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral would be abbreviated as rdf:XMLLiteral. Note however that these abbreviations are not valid IRIs, and must not be used in contexts where IRIs are expected. Namespace IRIs and namespace prefixes are not a formal part of the RDF data model. They are merely a syntactic convenience for abbreviating IRIs.
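As an informal illustration of this abbreviation mechanism, the following Python sketch expands a prefixed name into a full IRI using the prefixes from the table above. The names `PREFIXES` and `expand` are hypothetical and chosen for this example only; they are not defined by RDF or by any concrete syntax.

```python
# Namespace prefixes from the table above, mapped to their namespace IRIs.
PREFIXES = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "rdfs": "http://www.w3.org/2000/01/rdf-schema#",
    "xsd": "http://www.w3.org/2001/XMLSchema#",
}


def expand(prefixed_name, prefixes=PREFIXES):
    """Expand an abbreviation such as 'rdf:XMLLiteral' into a full IRI."""
    prefix, separator, local_name = prefixed_name.partition(":")
    if not separator or prefix not in prefixes:
        raise ValueError(f"unknown or missing namespace prefix in {prefixed_name!r}")
    # The full IRI is the namespace IRI followed by the local part.
    return prefixes[prefix] + local_name


print(expand("rdf:XMLLiteral"))
```

Only the expanded string is an IRI in the sense of the RDF data model; the abbreviated form exists solely in the concrete syntax.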

The term “namespace” on its own does not have a well-defined meaning in the context of RDF, but is sometimes informally used to mean “namespace IRI” or “RDF vocabulary”.

1.5 RDF and Change Over Time

The RDF data model is atemporal: It does not deal with time, and does not have a built-in notion of temporal validity of information. RDF graphs are static snapshots of information.

However, RDF graphs can express information about events and about temporal aspects of other entities, given appropriate vocabulary terms.

Since RDF graphs are defined as mathematical sets, adding or removing triples from an RDF graph yields a different RDF graph.
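The set-based view can be sketched in Python with immutable sets. This is an illustration only: terms are written as plain strings for brevity, and the `http://example.org/` IRIs are made up for the example.

```python
# An RDF graph modelled as an immutable set of (subject, predicate, object) tuples.
g1 = frozenset({
    ("http://example.org/alice", "http://example.org/knows", "http://example.org/bob"),
})

# "Adding" a triple builds a new set; g1 itself is unchanged.
g2 = g1 | {
    ("http://example.org/bob", "http://example.org/knows", "http://example.org/carol"),
}

# Adding a triple that is already present yields an equal graph,
# because sets ignore duplicates.
g3 = g1 | set(g1)

print(g1 == g2, g1 == g3, len(g1), len(g2))
```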

We informally use the term RDF source to refer to a persistent yet mutable source or container of RDF graphs. An RDF source is a resource that may be said to have a state that can change over time. A snapshot of the state can be expressed as an RDF graph. For example, any web document that has an RDF-bearing representation may be considered an RDF source. Like all resources, RDF sources may be named with IRIs and therefore described in other RDF graphs.

Intuitively speaking, changes in the universe of discourse can be reflected in the following ways:

An IRI, once minted, should never change its intended referent. (See URI persistence [WEBARCH].)
Literals, by design, are constants and never change their value.
Some properties may change over time. A relationship that holds between two resources at one time may not hold at another time.
RDF sources may change their state over time. That is, they may provide different RDF graphs at different times.
Some RDF sources may, however, be immutable snapshots of another RDF source, archiving its state at some point in time.

1.6 Working with Multiple RDF Graphs

As RDF graphs are sets of triples, they can be combined easily, supporting the use of data from multiple sources. Nevertheless, it is sometimes desirable to work with multiple RDF graphs while keeping their contents separate. RDF datasets support this requirement.

An RDF dataset is a collection of RDF graphs. All but one of these graphs have an associated IRI or blank node. They are called named graphs, and the IRI or blank node is called the graph name. The remaining graph does not have an associated IRI, and is called the default graph of the RDF dataset.
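One possible in-memory shape for such a dataset is sketched below in Python. The class name `Dataset`, its attributes, and the example IRIs are assumptions made for this illustration; graph names are plain strings here, although they could equally be blank nodes.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """One default graph plus zero or more named graphs, keyed by graph name."""
    default_graph: set = field(default_factory=set)
    named_graphs: dict = field(default_factory=dict)

    def graph(self, name=None):
        """Return the default graph when name is None, else the named graph."""
        if name is None:
            return self.default_graph
        # Create an empty named graph on first use.
        return self.named_graphs.setdefault(name, set())


ds = Dataset()
ds.graph().add(
    ("http://example.org/g1", "http://example.org/source", "http://example.org/feed")
)
ds.graph("http://example.org/g1").add(
    ("http://example.org/alice", "http://example.org/knows", "http://example.org/bob")
)
print(len(ds.default_graph), list(ds.named_graphs))
```

Keeping each graph in its own set is what keeps the contents separate: a triple added to one graph does not appear in any other.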

There are many possible uses for RDF datasets. One such use is to hold snapshots of multiple RDF sources.

1.7 Equivalence, Entailment and Inconsistency

An RDF triple encodes a statement—a simple logical expression, or claim about the world. An RDF graph is the conjunction (logical AND) of its triples. The precise details of this meaning of RDF triples and graphs are the subject of the RDF Semantics specification [RDF-MT], which yields the following relationships between RDF graphs:

Entailment: An RDF graph A entails another RDF graph B if every possible arrangement of the world that makes A true also makes B true. When A entails B, if the truth of A is presumed or demonstrated then the truth of B is established.
Equivalence: Two RDF graphs A and B are equivalent if they make the same claim about the world. A is equivalent to B if and only if A entails B and B entails A.
Inconsistency: An RDF graph is inconsistent if it contains an internal contradiction. There is no possible arrangement of the world that would make the expression true.

An entailment regime [RDF-MT] is a specification that defines precise conditions that make these relationships hold. RDF itself recognizes only some basic cases of entailment, equivalence and inconsistency. Other specifications, such as RDF Schema [RDF-SCHEMA] and OWL 2 [OWL2-OVERVIEW], add more powerful entailment regimes, as do some domain-specific vocabularies.

This specification does not constrain how implementations use the logical relationships defined by entailment regimes. Implementations may or may not detect inconsistencies, and may make all, some or no entailed information available to users.

1.8 RDF Documents and Syntaxes

An RDF document is a document that encodes an RDF graph or RDF dataset in a concrete RDF syntax, such as Turtle [TURTLE-CR], RDFa [RDFA-PRIMER], JSON-LD [JSON-LD], RDF/XML [RDF-SYNTAX-GRAMMAR], or N-Triples [N-TRIPLES]. RDF documents enable the exchange of RDF graphs and RDF datasets between systems.

A concrete RDF syntax may offer many different ways to encode the same RDF graph or RDF dataset, for example through the use of namespace prefixes, relative IRIs, blank node identifiers, and different ordering of statements. While these aspects can have great effect on the convenience of working with the RDF document, they are not significant for its meaning.

3. RDF Graphs

An RDF graph is a set of RDF triples.

3.1 Triples

An RDF triple consists of three components:

the subject, which is an IRI or a blank node
the predicate, which is an IRI
the object, which is an IRI, a literal or a blank node

An RDF triple is conventionally written in the order subject, predicate, object.

The set of nodes of an RDF graph is the set of subjects and objects of triples in the graph. It is possible for a predicate IRI to also occur as a node in the same graph.

IRIs, literals and blank nodes are collectively known as RDF terms.

Note

IRIs, literals and blank nodes are distinct and distinguishable. For example, http://example.org/ as a string literal is not equal to http://example.org/ as an IRI, nor to a blank node with the blank node identifier http://example.org/.
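The definitions in this section can be sketched in Python as follows. The class names, the `nodes` helper, the blank node label field, and the `http://example.org/` IRIs are assumptions made for illustration; the abstract syntax does not prescribe any particular representation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IRI:
    value: str


@dataclass(frozen=True)
class BlankNode:
    label: str  # a local identifier, used here only to tell blank nodes apart


@dataclass(frozen=True)
class Literal:
    lexical_form: str
    datatype: IRI = IRI("http://www.w3.org/2001/XMLSchema#string")


def nodes(graph):
    """Return the set of subjects and objects of the triples in the graph."""
    return {s for s, _, _ in graph} | {o for _, _, o in graph}


knows = IRI("http://example.org/knows")
label = IRI("http://www.w3.org/2000/01/rdf-schema#label")
graph = {
    (IRI("http://example.org/alice"), knows, BlankNode("b0")),
    (knows, label, Literal("knows")),  # the predicate IRI also occurs as a node
}

# Terms of different kinds never compare equal, even with the same string.
print(IRI("http://example.org/") == Literal("http://example.org/"))
print(knows in nodes(graph), label in nodes(graph))
```

Using a separate class per kind of term is one simple way to keep IRIs, literals and blank nodes distinguishable.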

3.2 IRIs

An IRI (Internationalized Resource Identifier) within an RDF graph is a Unicode string [UNICODE] that conforms to the syntax defined in RFC 3987 [RFC3987].

IRIs in the RDF abstract syntax MUST be absolute, and MAY contain a fragment identifier.

IRI equality: Two IRIs are equal if and only if they are equivalent under Simple String Comparison according to section 5.1 of [RFC3987]. Further normalization MUST NOT be performed when comparing IRIs for equality.
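A minimal Python sketch of this comparison follows; the function name `iri_equal` is hypothetical. Python compares strings code point by code point, which is assumed here to be an adequate stand-in for Simple String Comparison.

```python
def iri_equal(a, b):
    """Equal only if the two IRIs are identical, character by character."""
    return a == b  # no case folding, no percent-decoding, no other normalization


print(iri_equal("http://example.org/a", "http://example.org/a"))
print(iri_equal("http://example.org/a", "http://EXAMPLE.org/a"))
print(iri_equal("http://example.org/%7Euser", "http://example.org/~user"))
```

The second and third pairs would be considered equivalent by many URI-handling libraries, but they are different IRIs in an RDF graph.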

Note

URIs and IRIs: IRIs are a generalization of URIs [RFC3986] that permits a much wider range of Unicode characters. Every absolute URI and URL is an IRI, but not every IRI is a URI. When IRIs are used in operations that are only defined for URIs, they must first be converted according to the mapping defined in section 3.1 of [RFC3987]. A notable example is retrieval over the HTTP protocol. The mapping involves UTF-8 encoding of non-ASCII characters, %-encoding of octets not allowed in URIs, and Punycode-encoding of domain names.
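The three steps of the mapping can be approximated with the Python standard library. This is a rough sketch, not a complete implementation of section 3.1 of [RFC3987]: the function name `iri_to_uri` is hypothetical, Python's built-in `idna` codec is used for the domain name, and user information and port numbers are assumed to be absent and would be dropped.

```python
from urllib.parse import quote, urlsplit, urlunsplit

# Characters left untouched: URI reserved characters and existing %-escapes.
SAFE = "/:@!$&'()*+,;=%~-._"


def iri_to_uri(iri):
    """Approximate IRI-to-URI conversion for simple http(s) IRIs."""
    parts = urlsplit(iri)
    # Punycode-encode the domain name, label by label.
    host = parts.hostname.encode("idna").decode("ascii") if parts.hostname else ""
    # UTF-8 encode non-ASCII characters and %-encode the resulting octets.
    path = quote(parts.path, safe=SAFE)
    query = quote(parts.query, safe=SAFE + "?")
    fragment = quote(parts.fragment, safe=SAFE + "?")
    return urlunsplit((parts.scheme, host, path, query, fragment))


# "b\u00fccher" is "bücher" and "caf\u00e9" is "café".
print(iri_to_uri("http://b\u00fccher.example/caf\u00e9"))
```

The converted form is only needed for operations defined on URIs; within an RDF graph the original IRI is the term.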

Relative IRIs: Some concrete RDF syntaxes permit relative IRIs as a convenient shorthand that allows authoring of documents independently from their final publishing location. Relative IRIs must be resolved against a base IRI to make them absolute. Therefore, the RDF graph serialized in such syntaxes is well-defined only if a base IRI can be established [RFC3986].
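Resolution against a base IRI can be tried out with `urllib.parse.urljoin` from the Python standard library, which implements reference resolution for URI strings. The base IRI and the relative references below are invented for the example, and the wrapper name `resolve` is hypothetical.

```python
from urllib.parse import urljoin


def resolve(base_iri, reference):
    """Resolve a possibly relative reference against a base IRI."""
    return urljoin(base_iri, reference)


base = "http://example.org/data/doc.ttl"
print(resolve(base, "#me"))              # fragment-only reference
print(resolve(base, "other"))            # sibling document
print(resolve(base, "../vocab#Person"))  # parent directory
```

The same relative reference yields a different absolute IRI for each base, which is why the graph is only well-defined once a base IRI has been established.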

IRI normalization: Interoperability problems can be avoided by minting only IRIs that are normalized according to Section 5 of [RFC3987]. Non-normalized forms that are best avoided include:

Uppercase characters in scheme names and domain names
Percent-encoding of characters where it is not required by IRI syntax
Explicitly stated HTTP default port (http://example.com:80/); http://example.com/ is preferable
Completely empty path in HTTP IRIs (http://example.com); http://example.com/ is preferable
“/./” or “/../” in the path component of an IRI
Lowercase hexadecimal letters within percent-encoding triplets (“%3F” is preferable over “%3f”)
Punycode-encoding of Internationalized Domain Names in IRIs
IRIs that are not in Unicode Normalization Form C [NFC]
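Several of the forms listed above can be detected mechanically. The following Python sketch is a simple heuristic check, not a normalizer and not a full implementation of Section 5 of [RFC3987]; the function name `normalization_warnings` and the warning texts are hypothetical, and it does not check whether a percent-encoding was actually required. It needs Python 3.8 or later for `unicodedata.is_normalized`.

```python
import re
import unicodedata
from urllib.parse import urlsplit


def normalization_warnings(iri):
    """Return warnings for some of the non-normalized forms listed above."""
    found = []
    parts = urlsplit(iri)
    raw_scheme = iri.split(":", 1)[0]  # urlsplit lower-cases the scheme itself
    if raw_scheme != raw_scheme.lower() or parts.netloc != parts.netloc.lower():
        found.append("uppercase characters in scheme or domain name")
    if parts.scheme == "http" and parts.port == 80:
        found.append("explicit default HTTP port")
    if parts.scheme in ("http", "https") and parts.path == "":
        found.append("empty path")
    if "/./" in parts.path or "/../" in parts.path:
        found.append("dot segment in path")
    if re.search(r"%(?:[0-9A-Fa-f][a-f]|[a-f][0-9A-Fa-f])", iri):
        found.append("lowercase hexadecimal in percent-encoding")
    if any(label.startswith("xn--") for label in (parts.hostname or "").split(".")):
        found.append("Punycode-encoded domain name")
    if not unicodedata.is_normalized("NFC", iri):
        found.append("not in Unicode Normalization Form C")
    return found


print(normalization_warnings("http://example.com/"))
print(normalization_warnings("HTTP://Example.COM"))
```

A check of this kind is best applied when minting IRIs; IRIs that already occur in a graph are compared as they are.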

3.3 Literals

Literals are used for values such as strings, numbers and dates.

A literal in an RDF graph consists of two or three elements:

a lexical form, being a Unicode [UNICODE] string, which SHOULD be in Normal Form C [NFC],
a datatype IRI, being an IRI that determines how the lexical form maps to a literal value.

A literal is a language-tagged string if and only if its datatype IRI is http://www.w3.org/1999/02/22-rdf-syntax-ns#langString, and only in this case the third element is present:

a non-empty language tag as defined by [BCP47]. The language tag MUST be well-formed according to section 2.2.9 of [BCP47]. Lexical representations of language tags MAY be converted to lower case. The value space of language tags is always in lower case.
A badly formed language tag MUST be treated as a syntax error.

Note

Implementors might wish to note that language tags conform to the regular expression '@' [a-zA-Z]{1,8} ('-' [a-zA-Z0-9]{1,8})* before normalizing to lowercase.
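The regular expression in the note above translates directly into Python. In this sketch the leading '@' is left out, on the assumption that it belongs to the concrete syntax rather than to the tag itself, and the function name `normalize_language_tag` is hypothetical. The pattern is a coarse check and does not by itself establish well-formedness according to [BCP47].

```python
import re

# The pattern from the note, without the '@' delimiter.
LANGUAGE_TAG = re.compile(r"[a-zA-Z]{1,8}(-[a-zA-Z0-9]{1,8})*")


def normalize_language_tag(tag):
    """Check the tag against the pattern and return its lower-case form."""
    if not LANGUAGE_TAG.fullmatch(tag):
        raise ValueError(f"badly formed language tag: {tag!r}")
    return tag.lower()


print(normalize_language_tag("en-US"))
```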

Multiple literals may have the same lexical form.

Concrete syntaxes MAY support simple literals, consisting of only a lexical form without any datatype IRI or language tag. Simple literals only exist in concrete syntaxes, and are treated as syntactic sugar for abstract syntax literals with the datatype IRI http://www.w3.org/2001/XMLSchema#string.

Literal term equality: Two literals are term-equal (the same RDF literal) if and only if the two lexical forms, the two datatype IRIs, and the two language tags (if any) compare equal, character by character.

Two literals can have the same value without being the same RDF term. For example:

		"1"^^xsd:integer
		"01"^^xsd:integer

denote the same value, but are not the same literal RDF terms and are not term-equal.
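The difference between term equality and value equality can be sketched in Python. The `Literal` class and the two helper functions are hypothetical names, and Python's `int` is used as a stand-in for the xsd:integer lexical-to-value mapping.

```python
from dataclasses import dataclass
from typing import Optional

XSD_INTEGER = "http://www.w3.org/2001/XMLSchema#integer"


@dataclass(frozen=True)
class Literal:
    lexical_form: str
    datatype: str
    language: Optional[str] = None


def term_equal(a, b):
    """Compare lexical form, datatype IRI and language tag, field by field."""
    return a == b  # the generated dataclass equality compares all three fields


def integer_value(literal):
    """Map the lexical form of an integer literal to a Python int."""
    return int(literal.lexical_form)


one = Literal("1", XSD_INTEGER)
zero_one = Literal("01", XSD_INTEGER)
print(term_equal(one, zero_one))                      # different RDF terms
print(integer_value(one) == integer_value(zero_one))  # same value
```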

The literal value associated with a literal is:

If the literal is a language-tagged string, then the literal value is a pair consisting of its lexical form and its language tag, in that order.
If the literal's datatype IRI is not recognized by an implementation, then the literal value is not defined by this specification.
Let d be the referent of the datatype IRI in the set of recognized datatype IRIs. If the literal's lexical form is in the lexical space of d, then the literal value is the result of applying the lexical-to-value mapping of d to the lexical form.
Otherwise, the literal is ill-typed, and no literal value can be associated with the literal. Such a case produces a semantic inconsistency but is not syntactically ill-formed and implementations MUST accept ill-typed literals and produce RDF graphs from them. Implementations MAY produce warnings when encountering ill-typed literals.
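The four cases above can be sketched as a single Python function. Everything named here is an assumption made for illustration: the sentinels `UNDEFINED` and `ILL_TYPED`, the `RECOGNIZED` table, and the choice of three recognized datatypes. The xsd:string mapping is simplified to the identity function.

```python
import re

XSD = "http://www.w3.org/2001/XMLSchema#"
RDF_LANG_STRING = "http://www.w3.org/1999/02/22-rdf-syntax-ns#langString"

# Hypothetical sentinels for the two cases in which no value is produced.
UNDEFINED = object()  # datatype IRI not recognized
ILL_TYPED = object()  # lexical form outside the lexical space


def _boolean(lexical_form):
    table = {"true": True, "false": False, "1": True, "0": False}
    if lexical_form not in table:
        raise ValueError(lexical_form)
    return table[lexical_form]


def _integer(lexical_form):
    if not re.fullmatch(r"[+-]?[0-9]+", lexical_form):
        raise ValueError(lexical_form)
    return int(lexical_form)


# Recognized datatype IRIs and their lexical-to-value mappings.
RECOGNIZED = {
    XSD + "string": lambda s: s,
    XSD + "boolean": _boolean,
    XSD + "integer": _integer,
}


def literal_value(lexical_form, datatype_iri, language_tag=None):
    if datatype_iri == RDF_LANG_STRING:
        return (lexical_form, language_tag)  # pair of lexical form and tag
    mapping = RECOGNIZED.get(datatype_iri)
    if mapping is None:
        return UNDEFINED
    try:
        return mapping(lexical_form)
    except ValueError:
        return ILL_TYPED  # reported as a result; the literal is still accepted


print(literal_value("042", XSD + "integer"))
print(literal_value("maybe", XSD + "boolean") is ILL_TYPED)
```

Returning a sentinel instead of raising an error reflects the requirement that ill-typed literals are accepted into the graph; a warning could be issued at the same point.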

3.4 Blank Nodes

Blank nodes are disjoint from IRIs and literals. Otherwise, the set of possible blank nodes is arbitrary. RDF makes no reference to any internal structure of blank nodes.

Note

Blank node identifiers are local identifiers that are used in some concrete RDF syntaxes or RDF store implementations. They are always locally scoped to the file or RDF store, and are not persistent or portable identifiers for blank nodes. Blank node identifiers are not part of the RDF abstract syntax, but are entirely dependent on the concrete syntax or implementation. The syntactic restrictions on blank node identifiers, if any, therefore also depend on the concrete RDF syntax or implementation. Implementations that handle blank node identifiers in concrete syntaxes need to be careful not to create the same blank node from multiple occurrences of the same blank node identifier except in situations where this is supported by the syntax.
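One way to respect this scoping in a parser is to keep a separate identifier table per document, as in the following Python sketch. The class names `BlankNode` and `DocumentScope` are hypothetical, and blank nodes are modelled as plain objects whose only property is their identity.

```python
class BlankNode:
    """A blank node with no internal structure; equality is object identity."""
    __slots__ = ()


class DocumentScope:
    """Maps blank node identifiers to blank nodes within one document."""

    def __init__(self):
        self._by_identifier = {}

    def blank_node(self, identifier):
        # The same identifier in the same document yields the same blank node.
        if identifier not in self._by_identifier:
            self._by_identifier[identifier] = BlankNode()
        return self._by_identifier[identifier]


doc1, doc2 = DocumentScope(), DocumentScope()
print(doc1.blank_node("b1") is doc1.blank_node("b1"))  # same document
print(doc1.blank_node("b1") is doc2.blank_node("b1"))  # different documents
```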

3.5 Replacing Blank Nodes with IRIs

Blank nodes do not have identifiers in the RDF abstract syntax. The blank node identifiers introduced by some concrete syntaxes have only local scope and are purely an artifact of the serialization.

In situations where stronger identification is needed, systems MAY systematically replace some or all of the blank nodes in an RDF graph with IRIs. Systems wishing to do this SHOULD mint a new, globally unique IRI (a Skolem IRI) for each blank node so replaced.

This transformation does not appreciably change the meaning of an RDF graph, provided that the Skolem IRIs do not occur anywhere else. It does however permit the possibility of other graphs subsequently using the Skolem IRIs, which is not possible for blank nodes.

Systems may wish to mint Skolem IRIs in such a way that they can recognize the IRIs as having been introduced solely to replace blank nodes. This allows a system to map IRIs back to blank nodes if needed.

Systems that want Skolem IRIs to be recognizable outside of the system boundaries SHOULD use a well-known IRI [RFC5785] with the registered name genid. This is an IRI that uses the HTTP or HTTPS scheme, or another scheme that has been specified to use well-known IRIs; and whose path component starts with /.well-known/genid/.

For example, the authority responsible for the domain example.com could mint the following recognizable Skolem IRI:

http://example.com/.well-known/genid/d26a2d0e98334696f4ad70a677abc1f6

Note

RFC 5785 [RFC5785] only specifies well-known URIs, not IRIs. For the purpose of this document, a well-known IRI is any IRI that results in a well-known URI after IRI-to-URI mapping [RFC3987].
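The replacement described in this section can be sketched in Python as follows. This is an illustration under stated assumptions: IRIs are plain strings, blank nodes are instances of a hypothetical `BlankNode` class, random UUIDs are used as the unique part of each IRI, and the authority defaults to the example.com domain used above. The function names `skolemize` and `is_skolem_iri` are not defined by this specification.

```python
import uuid
from urllib.parse import urlsplit


class BlankNode:
    """A blank node; equality is object identity."""


def skolemize(graph, authority="example.com"):
    """Replace every blank node in the graph with a fresh Skolem IRI."""
    mapping = {}

    def replace(term):
        if isinstance(term, BlankNode):
            if term not in mapping:
                mapping[term] = (
                    f"http://{authority}/.well-known/genid/{uuid.uuid4().hex}"
                )
            return mapping[term]
        return term

    # Predicates are always IRIs, so only subjects and objects are replaced.
    return {(replace(s), p, replace(o)) for s, p, o in graph}, mapping


def is_skolem_iri(term):
    """Recognize IRIs whose path starts with /.well-known/genid/."""
    return isinstance(term, str) and urlsplit(term).path.startswith(
        "/.well-known/genid/"
    )


b = BlankNode()
graph = {
    (b, "http://example.org/p", "http://example.org/o"),
    ("http://example.org/s", "http://example.org/p", b),
}
skolemized, mapping = skolemize(graph)
print(all(is_skolem_iri(iri) for iri in mapping.values()))
```

Keeping the returned mapping allows the system to map the Skolem IRIs back to blank nodes if needed.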

3.6 Graph Comparison

Two RDF graphs G and G' are isomorphic (that is, they have an identical form) if there is a bijection M between the sets of nodes of the two graphs, such that:

M maps blank nodes to blank nodes.
M(lit)=lit for all RDF literals lit which are nodes of G.
M(uri)=uri for all IRIs uri which are nodes of G.
The triple ( s, p, o ) is in G if and only if the triple ( M(s), p, M(o) ) is in G'.

With this definition, M shows how each blank node in G can be replaced with a new blank node to give G'. Graph isomorphism is needed to support the RDF Test Cases [RDF-TESTCASES] specification.
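The definition can be turned into a brute-force check that tries every bijection between the blank nodes of the two graphs. The Python sketch below is meant only to make the definition concrete: it takes time factorial in the number of blank nodes, and practical implementations use more refined algorithms. IRIs and literals are modelled as plain strings, and the `BlankNode` class and function names are hypothetical.

```python
from itertools import permutations


class BlankNode:
    """A blank node; equality is object identity."""


def blank_nodes(graph):
    """Return the blank nodes occurring as subject or object in the graph."""
    terms = {t for s, _, o in graph for t in (s, o)}
    return [t for t in terms if isinstance(t, BlankNode)]


def isomorphic(g1, g2):
    """Search for a bijection M satisfying the four conditions above."""
    b1, b2 = blank_nodes(g1), blank_nodes(g2)
    if len(g1) != len(g2) or len(b1) != len(b2):
        return False
    for candidate in permutations(b2):
        m = dict(zip(b1, candidate))  # blank nodes map to blank nodes
        # IRIs and literals are not in m, so they map to themselves.
        mapped = {(m.get(s, s), p, m.get(o, o)) for s, p, o in g1}
        if mapped == g2:
            return True
    return False


a, b, c, d = BlankNode(), BlankNode(), BlankNode(), BlankNode()
p, o = "http://example.org/p", "http://example.org/o"
print(isomorphic({(a, p, b), (b, p, o)}, {(c, p, d), (d, p, o)}))
print(isomorphic({(a, p, b), (b, p, o)}, {(c, p, d), (c, p, o)}))
```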

5. Datatypes

Datatypes are used with RDF literals to represent values such as strings, numbers and dates. The datatype abstraction used in RDF is compatible with XML Schema [XMLSCHEMA11-2]. Any datatype definition that conforms to this abstraction MAY be used in RDF, even if not defined in terms of XML Schema. RDF re-uses many of the XML Schema built-in datatypes, and provides two additional built-in datatypes, rdf:HTML and rdf:XMLLiteral. The list of datatypes supported by an implementation is determined by its recognized datatype IRIs.

A datatype consists of a lexical space, a value space and a lexical-to-value mapping, and is denoted by one or more IRIs.

The lexical space of a datatype is a set of Unicode [UNICODE] strings.

The lexical-to-value mapping of a datatype is a set of pairs whose first element belongs to the lexical space, and the second element belongs to the value space of the datatype. Each member of the lexical space is paired with exactly one value, and is a lexical representation of that value. The mapping can be seen as a function from the lexical space to the value space.

Note

Language-tagged strings have the datatype IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#langString. No datatype is formally defined for this IRI because the definition of datatypes does not accommodate language tags in the lexical space. The value space associated with this datatype IRI is the set of all pairs of strings and language tags.

For example, the XML Schema datatype xsd:boolean, where each member of the value space has two lexical representations, is defined as follows:

Lexical space: {“true”, “false”, “1”, “0”}
Value space: {true, false}
Lexical-to-value mapping: { <“true”, true>, <“false”, false>, <“1”, true>, <“0”, false> }
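Because this datatype is finite, its three components can be written out in full. The following Python sketch represents the lexical-to-value mapping as a dictionary and derives the two spaces from it; the variable names are chosen for this example only.

```python
# Lexical-to-value mapping of xsd:boolean: each lexical form has exactly one value.
XSD_BOOLEAN = {"true": True, "false": False, "1": True, "0": False}

lexical_space = set(XSD_BOOLEAN)         # the four lexical forms
value_space = set(XSD_BOOLEAN.values())  # the two values

# Group the lexical representations by the value they denote.
representations = {
    value: sorted(form for form, v in XSD_BOOLEAN.items() if v is value)
    for value in value_space
}
print(representations[True], representations[False])
```

A dictionary models the mapping as a function from the lexical space to the value space: every key has one value, while a value may have several keys.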

The literals that can be defined using this datatype are:

This table lists the literals of type xsd:boolean.
Literal	Value
<“`true`”, `xsd:boolean`>	true
<“`false`”, `xsd:boolean`>	false
<“`1`”, `xsd:boolean`>	true
<“`0`”, `xsd:boolean`>	false

5.1 The XML Schema Built-in Datatypes

IRIs of the form http://www.w3.org/2001/XMLSchema#xxx, where xxx is the name of a datatype, denote the built-in datatypes defined in XML Schema 1.1 Part 2: Datatypes [XMLSCHEMA11-2]. The XML Schema built-in types listed in the following table are the RDF-compatible XSD types. Their use is RECOMMENDED.

Readers might note that the xsd:hexBinary and xsd:base64Binary datatypes are the only safe datatypes for transferring binary information.

A list of the RDF-compatible XSD types, with short descriptions
	Datatype	Value space (informative)
Core types	`xsd:string`	Character strings (but not all Unicode character strings)
	`xsd:boolean`	true, false
	`xsd:decimal`	Arbitrary-precision decimal numbers
	`xsd:integer`	Arbitrary-size integer numbers
IEEE floating-point numbers	`xsd:double`	64-bit floating point numbers incl. ±Inf, ±0, NaN
IEEE floating-point numbers	`xsd:float`	32-bit floating point numbers incl. ±Inf, ±0, NaN
Time and date	`xsd:date`	Dates (yyyy-mm-dd) with or without timezone
	`xsd:time`	Times (hh:mm:ss.sss…) with or without timezone
	`xsd:dateTime`	Date and time with or without timezone
	`xsd:dateTimeStamp`	Date and time with required timezone
Recurring and partial dates	`xsd:gYear`	Gregorian calendar year
	`xsd:gMonth`	Gregorian calendar month
	`xsd:gDay`	Gregorian calendar day of the month
	`xsd:gYearMonth`	Gregorian calendar year and month
	`xsd:gMonthDay`	Gregorian calendar month and day
	`xsd:duration`	Duration of time
	`xsd:yearMonthDuration`	Duration of time (months and years only)
	`xsd:dayTimeDuration`	Duration of time (days, hours, minutes, seconds only)
Limited-range integer numbers	`xsd:byte`	-128…+127 (8 bit)
	`xsd:short`	-32768…+32767 (16 bit)
	`xsd:int`	-2147483648…+2147483647 (32 bit)
	`xsd:long`	-9223372036854775808…+9223372036854775807 (64 bit)
	`xsd:unsignedByte`	0…255 (8 bit)
	`xsd:unsignedShort`	0…65535 (16 bit)
	`xsd:unsignedInt`	0…4294967295 (32 bit)
	`xsd:unsignedLong`	0…18446744073709551615 (64 bit)
	`xsd:positiveInteger`	Integer numbers >0
	`xsd:nonNegativeInteger`	Integer numbers ≥0
	`xsd:negativeInteger`	Integer numbers <0
	`xsd:nonPositiveInteger`	Integer numbers ≤0
Encoded binary data	`xsd:hexBinary`	Hex-encoded binary data
Encoded binary data	`xsd:base64Binary`	Base64-encoded binary data
Miscellaneous XSD types	`xsd:anyURI`	Absolute or relative URIs and IRIs
	`xsd:language`	Language tags per [BCP47]
	`xsd:normalizedString`	Whitespace-normalized strings
	`xsd:token`	Tokenized strings
	`xsd:NMTOKEN`	XML NMTOKENs
	`xsd:Name`	XML Names
	`xsd:NCName`	XML NCNames
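The bounds given for the limited-range integer types in the table follow from their bit widths. The Python sketch below recomputes them and checks whether an integer falls within the range of a given type; the helper names and the `RANGES` table are hypothetical, and prefixed names are used as dictionary keys purely for readability.

```python
def signed_range(bits):
    """Smallest and largest value of a signed integer of the given width."""
    return (-(2 ** (bits - 1)), 2 ** (bits - 1) - 1)


def unsigned_range(bits):
    """Smallest and largest value of an unsigned integer of the given width."""
    return (0, 2 ** bits - 1)


RANGES = {
    "xsd:byte": signed_range(8),
    "xsd:short": signed_range(16),
    "xsd:int": signed_range(32),
    "xsd:long": signed_range(64),
    "xsd:unsignedByte": unsigned_range(8),
    "xsd:unsignedShort": unsigned_range(16),
    "xsd:unsignedInt": unsigned_range(32),
    "xsd:unsignedLong": unsigned_range(64),
}


def in_value_space(datatype, value):
    """Check whether an integer lies within the range of the datatype."""
    low, high = RANGES[datatype]
    return low <= value <= high


print(RANGES["xsd:byte"], in_value_space("xsd:byte", 128))
```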

The other built-in XML Schema datatypes are unsuitable for various reasons, and SHOULD NOT be used.

Note

xsd:QName and xsd:ENTITY require an enclosing XML document context.
xsd:ID and xsd:IDREF are for cross references within an XML document.
xsd:NOTATION is not intended for direct use.
xsd:IDREFS, xsd:ENTITIES and xsd:NMTOKENS are sequence-valued datatypes which do not fit the RDF datatype model.

5.2 The `rdf:HTML` Datatype

RDF provides for HTML content as a possible literal value. This allows markup in literal values. Such content is indicated in an RDF graph using a literal whose datatype is a special built-in datatype rdf:HTML. This datatype is defined as follows:

An IRI denoting this datatype

is http://www.w3.org/1999/02/22-rdf-syntax-ns#HTML.

The lexical space

is the set of Unicode [UNICODE] strings.

The value space

is a set of DOM DocumentFragment nodes [DOM4]. Two DocumentFragment nodes A and B are considered equal if and only if the DOM method A.isEqualNode(B) [DOM4] returns true.

The lexical-to-value mapping

Each member of the lexical space is associated with the result of applying the following algorithm:

Let domnodes be the list of DOM nodes [DOM4] that result from applying the HTML fragment parsing algorithm [HTML5] to the input string, without a context element.
Let domfrag be a DOM DocumentFragment [DOM4] whose childNodes attribute is equal to domnodes.
Return domfrag.normalize()

Note

Any language annotation (lang="…") or XML namespaces (xmlns) desired in the HTML content must be included explicitly in the HTML literal. Relative URLs in attributes such as href do not have a well-defined base URL and are best avoided. RDF applications may use additional equivalence relations, such as that which relates an xsd:string with an rdf:HTML literal corresponding to a single text node of the same string.

5.3 The `rdf:XMLLiteral` Datatype

RDF provides for XML content as a possible literal value. Such content is indicated in an RDF graph using a literal whose datatype is a special built-in datatype rdf:XMLLiteral, which is defined as follows:

An IRI denoting this datatype

is http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral.

The lexical space

is the set of all strings which are well-balanced, self-contained XML content [XML10]; and for which embedding between an arbitrary XML start tag and an end tag yields a document conforming to XML Namespaces [XML-NAMES].

The value space

is a set of DOM DocumentFragment nodes [DOM4]. Two DocumentFragment nodes A and B are considered equal if and only if the DOM method A.isEqualNode(B) returns true.

The lexical-to-value mapping

Each member of the lexical space is associated with the result of applying the following algorithm:

Let domfrag be a DOM DocumentFragment node [DOM4] corresponding to the input string.
Return domfrag.normalize()

The canonical mapping

defines a canonical lexical form [XMLSCHEMA11-2] for each member of the value space. The rdf:XMLLiteral canonical mapping is the exclusive XML canonicalization method (with comments, with empty InclusiveNamespaces PrefixList) [XML-EXC-C14N].

Note

Any XML namespace declarations (xmlns), language annotation (xml:lang) or base URI declarations (xml:base) desired in the XML content must be included explicitly in the XML literal. Note that some concrete RDF syntaxes may define mechanisms for inheriting them from the context (e.g., rdf:parseType="Literal" in RDF/XML [RDF-SYNTAX-GRAMMAR]).

5.4 Datatype IRIs

Datatypes are identified by IRIs. If D is a set of IRIs which are used to refer to datatypes, then the elements of D are called recognized datatype IRIs. Recognized IRIs have fixed referents, which MUST satisfy these conditions:

If the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#XMLLiteral is recognized then it refers to the datatype rdf:XMLLiteral;
If the IRI http://www.w3.org/1999/02/22-rdf-syntax-ns#HTML is recognized then it refers to the datatype rdf:HTML;
If any IRI of the form http://www.w3.org/2001/XMLSchema#xxx is recognized then it refers to the RDF-compatible XSD type named xsd:xxx, for every XSD type listed in section 5.1.

Semantic extensions of RDF MAY recognize other datatype IRIs and require them to refer to a fixed datatype.

RDF processors are not required to recognize datatype IRIs. Any literal typed with an unrecognized IRI is treated just like an unknown IRI, i.e. as referring to an unknown thing. Applications MAY give a warning message if they are unable to determine the referent of an IRI used in a typed literal, but they SHOULD NOT reject such RDF as either a syntactic or semantic error.

Other specifications MAY impose additional constraints on datatype IRIs, for example, require support for certain datatypes.

Note

The Web Ontology Language [OWL2-OVERVIEW] offers facilities for formally defining custom datatypes that can be used with RDF. Furthermore, a practice for identifying user-defined simple XML Schema datatypes is suggested in [SWBP-XSCH-DATATYPES]. RDF implementations are not required to support either of these facilities.

B. Change Log

This section is non-normative.

B.1 Changes from 15 January 2013 WD to this version

This section is non-normative.

This section lists changes from the 15 January 2013 Working Draft (WD) to this Editor's Draft of RDF 1.1 Concepts and Abstract Syntax.

2013-07-15: Editorial change to the first paragraph of Section 1.7 in an attempt to clarify the relationship to RDF-MT following the review by Markus Lanthaler and suggested changes by Peter Patel-Schneider
2013-07-03: Editorial changes in response to a review by Markus Lanthaler and a related request
2013-06-27: Added informative section on generalized RDF triples, graphs, and datasets.
2013-06-27: Added caution on the use of graph names as blank nodes.
2013-06-19: Noted that RDF Dataset graph names may be blank nodes (ACTION-274, resolution)
2013-06-19: Changes in response to a review by Peter Patel-Schneider
2013-06-05: Minor change to note to specify the value space and lexical space of language tags (ACTION-265, resolution)
2013-05-08: Minor change to note that a badly formed language tag is a syntax error (ACTION-262)
2013-05-08: Migrated language related to datatype maps to recognized datatype IRIs (ISSUE-118)
2013-05-08: Editorial changes in response to a discussion of literal equality
2013-05-08: Editorial changes in response to a review by Sandro Hawke
2013-05-07: Revised the definition of blank nodes (ISSUE-107)
2013-05-07: Defined the consequence of a literal being ill-typed (ISSUE-109)
2013-05-07: Clarified the existence of null control characters in xsd:strings (ISSUE-126)
2013-05-07: Added a definition of RDF Dataset isomorphism (ISSUE-111)
2013-05-07: Addressed content negotiation as it relates to graphs and datasets (ISSUE-105)

B.2 Changes from 05 June 2012 WD to this version

This section lists changes from the 05 June 2012 Working Draft (WD) to this Editor's Draft of RDF 1.1 Concepts and Abstract Syntax.

2013-01-14: Editorial changes in response to reviews from Antoine Zimmermann and Peter Patel-Schneider
2012-11-21: Replaced the placeholder term “g-box” with “RDF source” (ISSUE-110)
2012-11-21: Removed various Notes (as listed here), and refactored others (ISSUE-104)
2012-11-17: Many changes to Introduction, including mostly new subsections on Working with Multiple RDF Graphs and G-Boxes, Equivalence, Entailment and Inconsistencies, and RDF Documents and Syntaxes
2012-11-17: Reverted section on Blank Nodes to earlier state
2012-11-17: Changes, mostly but not exclusively editorial, to section on Fragment Identifiers
2012-11-13: Remove the notion of other specs conforming to this spec from the Conformance section. This spec simply provides definitions that other specs can use.
2012-11-09: Updated the section on RDF datasets to reflect various WG resolutions around named graphs
2012-11-09: Re-wrote the section on Blank Nodes, including a definition of “fresh blank nodes” and an extended Note on standardizing apart blank node IDs
2012-11-09: Moved all informative material about changes between RDF 2004 and RDF 1.1 to a new appendix
2012-11-07: Add new informative section on Change Over Time
2012-11-07: New abstract, based on comments from Dan Connolly
2012-11-06: Tweak definition of literals to avoid apparent contradiction (ISSUE-94)
2012-11-06: Add a note on the use of OWL2 custom datatypes and simple user-defined XML Schema datatypes (ISSUE-96)
2012-11-06: Add a note on empty named graphs (ISSUE-22)
2012-11-06: Modify the Note on relative IRIs to stress their usefulness and to clarify the role of RFC 3986 in the resolution process
2012-11-06: Informatively explain that IRIs in this spec are treated only as nodes in a graph data model, and no interaction model is implied
2012-08-09: Clarify that all datatypes are optional, but RDF-conformant specifications MAY require specific datatype maps

B.3 Changes from FPWD to 05 June 2012 WD

This section lists changes from the First Public Working Draft (FPWD) to the 05 June 2012 Working Draft (WD) of RDF 1.1 Concepts and Abstract Syntax.

2012-05-31: Update Acknowledgements for RDF 1.1; added RDFa 1.1 markup
2012-05-24: Moved the multigraph section to an earlier position and renamed it to “RDF Datasets”
2012-05-17: Changed normative reference for DOM in rdf:XMLLiteral from [DOM3CORE] to [DOM4], as DOM4 is needed anyway for rdf:HTML
2012-05-17: Added rdf:HTML datatype (ISSUE-63)
2012-05-17: Added xsd:duration to list of RDF-compatible XSD types (ISSUE-88)
2012-05-14: Replaced the example graph diagram in Section 1.1 with a re-drawn SVG version, with support from Dominik Tomaszuk
2012-05-10: New Conformance section to explain that this specification is not implemented directly, but through other specifications that use our definitions
2012-05-10: Simplified rdf:XMLLiteral's new value space slightly after feedback from Ivan Herman and Arnaud Le Hors.
2012-05-10: Added an informative subsection on RDF vocabularies and namespace IRIs.
2012-05-09: Removed an example from the conformance section that didn't make sense any more with the modified rdf:XMLLiteral. Added some new issue boxes.
2012-05-09: rdf:XMLLiteral no longer requires lexical forms to be canonicalized, and the value space is now defined in terms of [DOM-LEVEL-3-CORE] (ISSUE-13)
2012-05-09: Removed Section 3 RDF Vocabulary IRI and Namespace; its contents will be folded into the RDF Schema document
2012-05-02: Renamed “graph equivalence” to “graph isomorphism” (ISSUE-86)
2012-05-02: Updated [XMLSCHEMA11-1] and [XMLSCHEMA11-2] references to the new REC versions
2012-05-02: Added the new XSD 1.1 datatypes xsd:dayTimeDuration, xsd:yearMonthDuration and xsd:dateTimeStamp to the list of RDF-compatible XSD types (ISSUE-66)
2012-04-26: Remove normative definition of “property” as it disagreed with RDF Semantics; small editorial changes.
2011-11-21: Updated XHTML 1.0 reference to XHTML 1.1
2011-11-20: Added table of RDF-compatible XSD types, and definition of datatype map, both adapted from previous content in [RDF-MT]
2011-11-18: Replaced informative Introduction and RDF Concepts sections with a new extended introduction. Folded some content from RDF Concepts into the later normative sections, mostly as examples and notes.
2011-11-10: Changed XSD references to version 1.1
2011-11-10: Replaced the section on fragment identifiers with an updated account that follows RFC 3986
2011-11-09: Updated the two sections on literals to reflect the ISSUE-71 resolution that literals with a language tag now have the datatype IRI rdf:langString. Formally introduced the term “language-tagged string”.
2011-11-09: Add a note that explains that #x0-#x1F are no longer allowed in simple literals

B.4 Changes from RDF 2004 to FPWD

This section lists changes from the 2004 Recommendation of RDF Concepts and Abstract Syntax to the First Public Working Draft (FPWD) of RDF 1.1 Concepts and Abstract Syntax.

2011-08-13: Updated Turtle reference to Turtle FPWD
2011-07-21: Condensed the 2004 acknowledgements
2011-07-21: Updated the two sections on literals to reflect the ISSUE-12 resolution that simple literals are no longer part of the abstract syntax. Formally introduced the terms “language-tagged literal”, “simple literal”.
2011-07-21: Updated the introduction, and removed many mentions of RDF/XML. Changed the normative reference for the terms in the RDF namespace from the RDF/XML spec to the RDF Schema spec. Removed any mention of the 1999 version of RDF.
2011-07-21: Replaced RFC 2279 reference (UTF-8) with RFC 3629
2011-07-20: Removed informative sections “Motivations and Goals” (see RDF 2004 version) and “RDF Expression of Simple Facts” (see RDF 2004 version)
2011-06-01: Replaced the URI References section with new section on IRIs, and changed “RDF URI Reference” to “IRI” throughout the document.
2011-06-01: Changed language tag definition to require well-formedness according to BCP47; added a note that this invalidates some RDF
2011-05-25: Added boxes for known WG issues throughout the document
2011-05-25: Deleted “Structure of this Document” section; it added no value beyond the TOC
2011-05-25: Implemented resolution of ISSUE-40: Skolemization advice in the RDF document by adding a section on Replacing Blank Nodes with IRIs
2011-05-25: rdf:XMLLiteral is disjoint from any datatype not explicitly related to it, per erratum [concept-xmlliteral]
2011-05-25: Added Conformance section with RFC2119 reference
2011-05-25: Updated all W3C references to latest editions, and Unicode from v3 to v4
2011-05-24: Converted to ReSpec, changed metadata to reflect RDF 1.1

C. References

C.1 Normative references

[BCP47]: A. Phillips; M. Davis. Tags for Identifying Languages. September 2009. IETF Best Current Practice. URL: http://tools.ietf.org/html/bcp47
[DOM4]: Anne van Kesteren; Aryeh Gregor; Lachlan Hunt; Ms2ger. DOM4. 6 December 2012. W3C Working Draft. URL: http://www.w3.org/TR/dom/
[HTML5]: Robin Berjon et al. HTML5. 17 December 2012. W3C Candidate Recommendation. URL: http://www.w3.org/TR/html5/
[NFC]: M. Davis; Ken Whistler. TR15, Unicode Normalization Forms. 17 September 2010. URL: http://www.unicode.org/reports/tr15/
[RDF11-MT]: Patrick J. Hayes; Peter F. Patel-Schneider. RDF 1.1 Semantics. 23 July 2013. W3C Last Call Working Draft. URL: http://www.w3.org/TR/2013/WD-rdf11-mt-20130723/. The latest edition is available at http://www.w3.org/TR/rdf11-mt/
[RFC2119]: S. Bradner. Key words for use in RFCs to Indicate Requirement Levels. March 1997. Internet RFC 2119. URL: http://www.ietf.org/rfc/rfc2119.txt
[RFC3987]: M. Dürst; M. Suignard. Internationalized Resource Identifiers (IRIs). January 2005. RFC. URL: http://www.ietf.org/rfc/rfc3987.txt
[UNICODE]: The Unicode Standard. URL: http://www.unicode.org/versions/latest/
[XML-EXC-C14N]: John Boyer; Donald Eastlake; Joseph Reagle. Exclusive XML Canonicalization Version 1.0. 18 July 2002. W3C Recommendation. URL: http://www.w3.org/TR/xml-exc-c14n
[XML-NAMES]: Tim Bray; Dave Hollander; Andrew Layman; Richard Tobin; Henry Thompson et al. Namespaces in XML 1.0 (Third Edition). 8 December 2009. W3C Recommendation. URL: http://www.w3.org/TR/xml-names
[XML10]: Tim Bray; Jean Paoli; Michael Sperberg-McQueen; Eve Maler; François Yergeau et al. Extensible Markup Language (XML) 1.0 (Fifth Edition). 26 November 2008. W3C Recommendation. URL: http://www.w3.org/TR/xml
[XMLSCHEMA11-2]: David Peterson; Sandy Gao; Ashok Malhotra; Michael Sperberg-McQueen; Henry Thompson; Paul V. Biron et al. W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes. 5 April 2012. W3C Recommendation. URL: http://www.w3.org/TR/xmlschema11-2/

C.2 Informative references

[COOLURIS]: Leo Sauermann; Richard Cyganiak. Cool URIs for the Semantic Web. 3 December 2008. W3C Note. URL: http://www.w3.org/TR/cooluris
[HTML-RDFA]: Manu Sporny et al. HTML+RDFa 1.1. 25 May 2011. W3C Working Draft. URL: http://www.w3.org/TR/rdfa-in-html/
[JSON-LD]: Manu Sporny; Gregg Kellogg; Markus Lanthaler. JSON-LD 1.0. 11 April 2013. W3C Working Draft. URL: http://www.w3.org/TR/json-ld/
[LINKED-DATA]: Tim Berners-Lee. Linked Data Design Issues. 27 July 2006. W3C-Internal Document. URL: http://www.w3.org/DesignIssues/LinkedData.html
[N-TRIPLES]: Gavin Carothers. N-Triples. 9 April 2013. W3C Working Group Note. URL: http://www.w3.org/TR/2013/NOTE-n-triples-20130409/. The latest edition is available at http://www.w3.org/TR/n-triples/
[OWL2-OVERVIEW]: W3C OWL Working Group. OWL 2 Web Ontology Language Document Overview (Second Edition). 11 December 2012. W3C Recommendation. URL: http://www.w3.org/TR/owl2-overview/
[RDF-MT]: Patrick Hayes. RDF Semantics. 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-mt/
[RDF-PRIMER]: Frank Manola; Eric Miller. RDF Primer. 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-primer/
[RDF-SCHEMA]: Dan Brickley; Ramanathan Guha. RDF Vocabulary Description Language 1.0: RDF Schema. 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-schema
[RDF-SPARQL-QUERY]: Eric Prud'hommeaux; Andy Seaborne. SPARQL Query Language for RDF. 15 January 2008. W3C Recommendation. URL: http://www.w3.org/TR/rdf-sparql-query/
[RDF-SYNTAX-GRAMMAR]: Dave Beckett. RDF/XML Syntax Specification (Revised). 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-syntax-grammar
[RDF-TESTCASES]: jan grant; Dave Beckett. RDF Test Cases. 10 February 2004. W3C Recommendation. URL: http://www.w3.org/TR/rdf-testcases
[RDFA-PRIMER]: Ben Adida; Ivan Herman; Manu Sporny; Mark Birbeck. RDFa 1.1 Primer. 7 June 2012. W3C Note. URL: http://www.w3.org/TR/rdfa-primer/
[RFC3986]: T. Berners-Lee; R. Fielding; L. Masinter. Uniform Resource Identifier (URI): Generic Syntax (RFC 3986). January 2005. RFC. URL: http://www.ietf.org/rfc/rfc3986.txt
[RFC5785]: Mark Nottingham; Eran Hammer-Lahav. Defining Well-Known Uniform Resource Identifiers (URIs) (RFC 5785). April 2010. RFC. URL: http://www.rfc-editor.org/rfc/rfc5785.txt
[SWBP-N-ARYRELATIONS]: Natasha Noy; Alan Rector. Defining N-ary Relations on the Semantic Web. 12 April 2006. W3C Note. URL: http://www.w3.org/TR/swbp-n-aryRelations
[SWBP-XSCH-DATATYPES]: Jeremy Carroll; Jeff Pan. XML Schema Datatypes in RDF and OWL. 14 March 2006. W3C Note. URL: http://www.w3.org/TR/swbp-xsch-datatypes
[TURTLE-CR]: Eric Prud'hommeaux; Gavin Carothers. Turtle; Terse RDF Triple Language. 19 February 2013. W3C Candidate Recommendation. URL: http://www.w3.org/TR/2013/CR-turtle-20130219/. The latest edition is available at http://www.w3.org/TR/turtle/
[WEBARCH]: Ian Jacobs; Norman Walsh. Architecture of the World Wide Web, Volume One. 15 December 2004. W3C Recommendation. URL: http://www.w3.org/TR/webarch/
[XML-ID]: Jonathan Marsh; Daniel Veillard; Norman Walsh. xml:id Version 1.0. 9 September 2005. W3C Recommendation. URL: http://www.w3.org/TR/xml-id/
[XMLSCHEMA11-1]: Sandy Gao; Michael Sperberg-McQueen; Henry Thompson; Noah Mendelsohn; David Beech; Murray Maloney. W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures. 5 April 2012. W3C Recommendation. URL: http://www.w3.org/TR/xmlschema11-1/
[vocab-org]: Dave Reynolds. The Organization Ontology. 25 June 2013. W3C Candidate Recommendation. URL: http://www.w3.org/TR/vocab-org/

RDF 1.1 Concepts and Abstract Syntax

W3C Last Call Working Draft 23 July 2013
