W3C

R2RML: RDB to RDF Mapping Language

W3C Working Draft 20 September 2011 Editor's draft

This version:
http://www.w3.org/TR/2011/WD-r2rml-20110920/
Revision:
$Id: Overview.html,v 1.232 2012/09/17 16:30:01 rcygania2 Exp $
Latest editor's draft:
http://www.w3.org/2001/sw/rdb2rdf/r2rml/
Latest published version:
http://www.w3.org/TR/r2rml/
Previous version:
http://www.w3.org/TR/2011/WD-r2rml-20110324/
Editors:
Souripriya Das, Oracle
Seema Sundara, Oracle
Richard Cyganiak, DERI, National University of Ireland, Galway

Abstract

This document describes R2RML, a language for expressing customized mappings from relational databases to RDF datasets. Such mappings provide the ability to view existing relational data in the RDF data model, expressed in a structure and target vocabulary of the mapping author's choice. R2RML mappings are themselves RDF graphs and written down in Turtle syntax. R2RML enables different types of mapping implementations. Processors could, for example, offer a virtual SPARQL endpoint over the mapped relational data, or generate RDF dumps, or offer a Linked Data interface.

Status of this Document

This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

This document is a Last Call Working Draft of the "R2RML: RDB to RDF Mapping Language". Publication as a Last Call Working Draft indicates that the RDB2RDF Working Group believes it has addressed all substantive issues and that the document is stable. The Working Group expects to advance this specification to Recommendation Status. The editor's draft of this document is prepared by members of the RDB2RDF Working Group; it is not an official W3C document. See http://www.w3.org/TR/r2rml/ for the latest published version of this document.

Comments on this document should be sent to public-rdb2rdf-comments@w3.org, a mailing list with a public archive. Comments on this working draft are due on or before 1 November 2011.

Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

This document was produced by the W3C RDB2RDF Working Group. R2RML has changed significantly since the previous Working Draft. Much of the document has been rewritten, many terms were renamed and other design details have changed. New language features include a detailed account of the conversion of SQL datatypes to RDF. The triples in the output dataset are now more accurately specified. One major open question remains, and the working group seeks feedback on it: should R2RML processors be required to support the Turtle syntax? Apart from this, the working group anticipates no major further changes.

This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

Table of Contents


1 Introduction

This specification describes R2RML, a language for expressing customized mappings from relational databases to RDF datasets. Such mappings provide the ability to view existing relational data in the RDF data model, expressed in a structure and target vocabulary of the mapping author's choice.

This specification has a companion that defines a direct mapping from relational databases to RDF [ DM ]. In the direct mapping of a database, the structure of the resulting RDF graph directly reflects the structure of the database, the target RDF vocabulary directly reflects the names of database schema elements, and neither structure nor target vocabulary can be changed. With R2RML on the other hand, a mapping author can define highly customized views over the relational data.

Every R2RML mapping is tailored to a specific database schema and target vocabulary. The input to an R2RML mapping is a relational database that conforms to that schema. The output is an RDF dataset [ SPARQL ], as defined in SPARQL, that uses predicates and types from the target vocabulary. The mapping is conceptual; R2RML processors are free to materialize the output data, or to offer virtual access through an interface that queries the underlying database, or to offer any other means of providing access to the output RDF dataset.

R2RML mappings are themselves expressed as RDF graphs and written down in Turtle syntax [ TURTLE ].

The intended audience of this specification is implementors of software that generates or processes R2RML mapping documents, as well as mapping authors looking for a reference to the R2RML language constructs. The document uses concepts from RDF Concepts and Abstract Syntax [ RDF ] and from the SQL language specifications [ SQL1 ][ SQL2 ]. A reader's familiarity with the contents of these documents, as well as with the Turtle syntax, is assumed.

The R2RML language is designed to meet the use cases and requirements identified in Use Cases and Requirements for Mapping Relational Databases to RDF [ UCNR ].

1.1 Document Conventions

In this document, examples assume the following namespace prefix bindings unless otherwise stated:

Prefix IRI
rr: http://www.w3.org/ns/r2rml#
rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
rdfs: http://www.w3.org/2000/01/rdf-schema#
xsd: http://www.w3.org/2001/XMLSchema#
ex: http://example.com/ns#

Throughout the document, boxes containing Turtle markup and SQL data will appear. These boxes are color-coded. Gray boxes contain RDFS definitions of R2RML vocabulary terms:

# This box contains RDFS definitions of R2RML vocabulary terms

Yellow boxes contain example fragments of R2RML mappings in Turtle syntax:

# This box contains example R2RML mappings

Blue tables contain example input into an R2RML mapping:

EXAMPLE
ID INTEGER PRIMARY KEY DESC VARCHAR(100)
1 This is an example input table.
2 The table name is EXAMPLE.
3 It has six rows.
4 It has two columns, ID and DESC.
5 ID is the table's primary key and of type INTEGER.
6 DESC is of type VARCHAR(100)

Green boxes contain example output:

# This box contains example output RDF triples or fragments

2 R2RML Overview and Example (Informative)

This section gives a brief overview of the R2RML mapping language, followed by a simple example relational database with an R2RML mapping document and its output RDF. Further R2RML examples can be found in the R2RML and Direct Mapping Test Cases [ TC ].

An R2RML mapping refers to logical tables to retrieve data from the input database . A logical table can be one of the following:

  1. A base table,
  2. a view, or
  3. a valid SQL query (called an “ R2RML view ” because it emulates a SQL view without modifying the database).

Each logical table is mapped to RDF using a triples map . The triples map is a rule that maps each row in the logical table to a number of RDF triples . The rule has two main parts:

  1. A subject map that generates the subject of all RDF triples that will be generated from a logical table row. The subjects often are IRIs that are generated from the primary key column(s) of the table.
  2. Multiple predicate-object maps that in turn consist of predicate maps and object maps (or referencing object maps ).

Triples are produced by combining the subject map with a predicate map and object map, and applying these three to each logical table row . For example, the complete rule for generating a set of triples might be:

By default, all RDF triples are in the default graph of the output dataset . A triples map can contain graph maps that place some or all of the triples into named graphs instead.

UML overview diagram of R2RML

Figure 1: An overview of R2RML

2.1 Example Input Database

The following example database consists of two tables, EMP and DEPT , with one row each:

EMP
EMPNO INTEGER PRIMARY KEY ENAME VARCHAR(100) JOB VARCHAR(20) DEPTNO INTEGER REFERENCES DEPT (DEPTNO)
7369 SMITH CLERK 10
DEPT
DEPTNO INTEGER PRIMARY KEY DNAME VARCHAR(30) LOC VARCHAR(100)
10 APPSERVER NEW YORK

2.2 Desired RDF Output

The desired RDF triples to be produced from this database are as follows:

<http://data.example.com/employee/7369> rdf:type ex:Employee.
<http://data.example.com/employee/7369> ex:name "SMITH".
<http://data.example.com/employee/7369> ex:department <http://data.example.com/department/10>.
<http://data.example.com/department/10> rdf:type ex:Department.
<http://data.example.com/department/10> ex:name "APPSERVER".
<http://data.example.com/department/10> ex:location "NEW YORK".
<http://data.example.com/department/10> ex:staff 1.

Note in particular:

2.3 Example: Mapping a Simple Table

The following partial R2RML mapping document will produce the desired triples from the EMP table (except the ex:department triple, which will be added later):

@prefix rr: <http://www.w3.org/ns/r2rml#>.
@prefix ex: <http://example.com/ns#>.

<#TriplesMap1>
    rr:logicalTable [ rr:tableName "EMP" ];
    rr:subjectMap [
        rr:template "http://data.example.com/employee/{EMPNO}";
        rr:class ex:Employee;
    ];
    rr:predicateObjectMap [
        rr:predicate ex:name;
        rr:objectMap [ rr:column "ENAME" ];
    ].

<http://data.example.com/employee/7369> rdf:type ex:Employee.
<http://data.example.com/employee/7369> ex:name "SMITH".

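The way <#TriplesMap1> turns one EMP row into triples can be sketched in Python. This is only an illustration, not part of R2RML: the function names expand_template and apply_triples_map are hypothetical, the template parser ignores escaped braces, and percent-encoding with urllib is a simplification of the IRI-safe encoding that R2RML prescribes.

```python
import re
from urllib.parse import quote

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
EX = "http://example.com/ns#"

def expand_template(template, row):
    """Replace each {COLUMN} placeholder with the percent-encoded column value.

    Simplification: escaped braces are not handled.
    """
    return re.sub(
        r"\{([^}]+)\}",
        lambda m: quote(str(row[m.group(1)]), safe=""),
        template,
    )

def apply_triples_map(row):
    """Mimic <#TriplesMap1>: one subject, an rdf:type triple, one ex:name triple."""
    subject = expand_template("http://data.example.com/employee/{EMPNO}", row)
    return [
        (subject, RDF_TYPE, EX + "Employee"),
        (subject, EX + "name", row["ENAME"]),
    ]

# The single EMP row from the example input database.
emp_row = {"EMPNO": 7369, "ENAME": "SMITH", "JOB": "CLERK", "DEPTNO": 10}
triples = apply_triples_map(emp_row)
for triple in triples:
    print(triple)
```
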
2.4 Example: Computing a Property with an R2RML View

Next, the DEPT table needs to be mapped. Instead of using the table directly as the basis for that mapping, an “ R2RML view ” will be defined based on a SQL query. This allows computation of the staff number. (Alternatively, one could define this view directly in the database.)

<#DeptTableView> rr:sqlQuery """
SELECT DEPTNO,
       DNAME,
       LOC,
       (SELECT COUNT(*) FROM EMP WHERE EMP.DEPTNO=DEPT.DEPTNO) AS STAFF
FROM DEPT;
""".

The definition of a triples map that generates the desired DEPT triples based on this R2RML view follows.

<#TriplesMap2>
    rr:logicalTable <#DeptTableView>;
    rr:subjectMap [
        rr:template "http://data.example.com/department/{DEPTNO}";
        rr:class ex:Department;
    ];
    rr:predicateObjectMap [
        rr:predicate ex:name;
        rr:objectMap [ rr:column "DNAME" ];
    ];
    rr:predicateObjectMap [
        rr:predicate ex:location;
        rr:objectMap [ rr:column "LOC" ];
    ];
    rr:predicateObjectMap [
        rr:predicate ex:staff;
        rr:objectMap [ rr:column "STAFF" ];
    ].

<http://data.example.com/department/10> rdf:type ex:Department.
<http://data.example.com/department/10> ex:name "APPSERVER".
<http://data.example.com/department/10> ex:location "NEW YORK".
<http://data.example.com/department/10> ex:staff 1.
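
To see what the R2RML view query returns, it can be run against a throwaway copy of the example database. The sketch below uses Python's sqlite3 module with an in-memory database as a stand-in for the SQL connection; SQLite is an assumption made for illustration and is not a Core SQL 2008 database.

```python
import sqlite3

# In-memory stand-in for the input database of Section 2.1.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE DEPT (DEPTNO INTEGER PRIMARY KEY, DNAME VARCHAR(30), LOC VARCHAR(100));
CREATE TABLE EMP (EMPNO INTEGER PRIMARY KEY, ENAME VARCHAR(100), JOB VARCHAR(20),
                  DEPTNO INTEGER REFERENCES DEPT (DEPTNO));
INSERT INTO DEPT VALUES (10, 'APPSERVER', 'NEW YORK');
INSERT INTO EMP VALUES (7369, 'SMITH', 'CLERK', 10);
""")

# The SQL query of <#DeptTableView>, including its optional trailing semicolon.
view_query = """
SELECT DEPTNO,
       DNAME,
       LOC,
       (SELECT COUNT(*) FROM EMP WHERE EMP.DEPTNO=DEPT.DEPTNO) AS STAFF
FROM DEPT;
"""

view_rows = conn.execute(view_query).fetchall()
print(view_rows)
```

Each returned row carries the computed STAFF value next to the DEPT columns, which is what the ex:staff predicate-object map reads.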

2.5 Example: Linking Two Tables

To complete the mapping document, the ex:department triples need to be generated. Their subjects come from the first triples map ( <#TriplesMap1> ), the objects come from the second triples map ( <#TriplesMap2> ).

This can be achieved by adding another rr:predicateObjectMap to <#TriplesMap1> . This one uses the other triples map, <#TriplesMap2> , as a parent triples map :

<#TriplesMap1>
    rr:predicateObjectMap [
        rr:predicate ex:department;
        rr:objectMap [
            rr:parentTriplesMap <#TriplesMap2>;
            rr:joinCondition [
                rr:child "DEPTNO";
                rr:parent "DEPTNO";
            ];
        ];
].

This performs a join between the EMP table and the R2RML view, on the DEPTNO columns. The objects will be generated from the subject map of the parent triples map, yielding the desired triple:

<http://data.example.com/employee/7369> ex:department <http://data.example.com/department/10>.

This completes the R2RML mapping document. An R2RML processor will generate the triples listed above from this mapping document.
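
The join described above can be sketched as a nested loop over child and parent rows. This is an illustrative Python sketch only; the function name department_links is hypothetical, and a real processor would normally push the join down into SQL rather than loop in application code.

```python
def department_links(child_rows, parent_rows):
    """Pair each EMP row with the matching view rows and emit ex:department triples."""
    triples = []
    for child in child_rows:
        for parent in parent_rows:
            # Join condition: rr:child "DEPTNO" equals rr:parent "DEPTNO".
            if child["DEPTNO"] == parent["DEPTNO"]:
                triples.append((
                    # Subject comes from the child triples map's subject template.
                    f"http://data.example.com/employee/{child['EMPNO']}",
                    "http://example.com/ns#department",
                    # Object comes from the parent triples map's subject template.
                    f"http://data.example.com/department/{parent['DEPTNO']}",
                ))
    return triples

emp_rows = [{"EMPNO": 7369, "ENAME": "SMITH", "JOB": "CLERK", "DEPTNO": 10}]
dept_view_rows = [{"DEPTNO": 10, "DNAME": "APPSERVER", "LOC": "NEW YORK", "STAFF": 1}]
links = department_links(emp_rows, dept_view_rows)
print(links)
```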

2.6 Example: Many-to-Many Tables

The following example will assume that a many-to-many relationship exists between the extended versions of the EMP table and the DEPT table shown below. This many-to-many relationship is captured by the content of the EMP2DEPT table. The database consisting of the EMP, DEPT, and EMP2DEPT tables is shown below:

EMP
EMPNO INTEGER PRIMARY KEY ENAME VARCHAR(100) JOB VARCHAR(20)
7369 SMITH CLERK
7369 SMITH NIGHTGUARD
7400 JONES ENGINEER
DEPT
DEPTNO INTEGER PRIMARY KEY DNAME VARCHAR(30) LOC VARCHAR(100)
10 APPSERVER NEW YORK
20 RESEARCH BOSTON
EMP2DEPT PRIMARY KEY (EMPNO, DEPTNO)
EMPNO INTEGER REFERENCES EMP (EMPNO) DEPTNO INTEGER REFERENCES DEPT (DEPTNO)
7369 10
7369 20
7400 10
<http://data.example.com/employee=7369/department=10> 
    ex:employee   <http://data.example.com/employee/7369> ;
    ex:department <http://data.example.com/department/10> .
<http://data.example.com/employee=7369/department=20> 
    ex:employee <http://data.example.com/employee/7369> ;
    ex:department <http://data.example.com/department/20> .
<http://data.example.com/employee=7400/department=10> 
    ex:employee <http://data.example.com/employee/7400> ;
    ex:department <http://data.example.com/department/10> .

The following R2RML mapping will produce the desired triples listed above:

<#TriplesMap3>
    rr:logicalTable [ rr:tableName "EMP2DEPT" ];
    rr:subjectMap [ rr:template "http://data.example.com/employee={EMPNO}/department={DEPTNO}" ];
    rr:predicateObjectMap [
        rr:predicate ex:employee;
        rr:objectMap [ rr:template "http://data.example.com/employee/{EMPNO}" ];
    ];
    rr:predicateObjectMap [
        rr:predicate ex:department;
        rr:objectMap [ rr:template "http://data.example.com/department/{DEPTNO}" ];
    ].

However, if one does not require that the subjects in the desired output uniquely identify the rows in the EMP2DEPT table , the desired output may look as follows:

<http://data.example.com/employee/7369> 
    ex:department <http://data.example.com/department/10> ;
    ex:department <http://data.example.com/department/20> .
<http://data.example.com/employee/7400> 
    ex:department <http://data.example.com/department/10>.

The following R2RML mapping will produce the desired triples:

<#TriplesMap3>
    rr:logicalTable [ rr:tableName "EMP2DEPT" ];
    rr:subjectMap [
        rr:template "http://data.example.com/employee/{EMPNO}";
    ];
    rr:predicateObjectMap [
        rr:predicate ex:department;
        rr:objectMap [ rr:template "http://data.example.com/department/{DEPTNO}" ];
    ].

2.7 Example: Translating database type codes to IRIs

Sometimes, database columns contain codes that need to be translated into IRIs, but a direct syntactic translation using string templates is not possible. For example, consider a JOB column in the EMP table with the following possible values, and IRIs corresponding to those database values in the RDF output:

Value Corresponding RDF IRI
CLERK http://data.example.com/roles/general-office
NIGHTGUARD http://data.example.com/roles/security
ENGINEER http://data.example.com/roles/engineering

The IRIs are not found in the original database and therefore the mapping from database codes to IRIs has to be specified in the R2RML mapping. Such translations can be achieved using an “ R2RML view ”. The view is defined based on a SQL query that computes the IRI based on the database value. SQL's CASE statement is convenient for this purpose. (Alternatively, one could define this view directly in the database.)

<#TriplesMap1>
    rr:logicalTable [ rr:sqlQuery """
        SELECT EMP.*, (CASE JOB
            WHEN 'CLERK' THEN 'general-office'
            WHEN 'NIGHTGUARD' THEN 'security'
            WHEN 'ENGINEER' THEN 'engineering'
        END) ROLE FROM EMP
        """ ];
    rr:subjectMap [
        rr:template "http://data.example.com/employee/{EMPNO}";
    ];
    rr:predicateObjectMap [
        rr:predicate ex:role;
        rr:objectMap [ rr:template "http://data.example.com/roles/{ROLE}" ];
    ].

With the example input database , this mapping would yield the following triple:


<http://data.example.com/employee/7369> ex:role <http://data.example.com/roles/general-office>.
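
The CASE-based translation can be tried out in the same way. The following Python sketch runs the view query against an in-memory SQLite database (an assumption made for illustration; SQLite stands in for the input database) and expands the role template by plain string substitution.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE EMP (EMPNO INTEGER PRIMARY KEY, ENAME VARCHAR(100), JOB VARCHAR(20),
                  DEPTNO INTEGER);
INSERT INTO EMP VALUES (7369, 'SMITH', 'CLERK', 10);
""")

# The SQL query from the logical table of the mapping above.
role_query = """
SELECT EMP.*, (CASE JOB
    WHEN 'CLERK' THEN 'general-office'
    WHEN 'NIGHTGUARD' THEN 'security'
    WHEN 'ENGINEER' THEN 'engineering'
END) ROLE FROM EMP
"""

role_triples = []
for row in conn.execute(role_query):
    empno, role = row[0], row[-1]  # first column is EMPNO, last is the computed ROLE
    role_triples.append((
        f"http://data.example.com/employee/{empno}",
        "http://example.com/ns#role",
        f"http://data.example.com/roles/{role}",
    ))
print(role_triples)
```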

3 Conformance

As well as sections marked as non-normative in the section heading, all diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.

The key words must , must not , required , should , should not , recommended , may , and optional in this specification are to be interpreted as described in RFC 2119 [ RFC2119 ].

This specification describes conformance criteria for:

A collection of test cases for R2RML processors and R2RML data validators is available in the R2RML and Direct Mapping Test Cases [ TC ].

This specification defines R2RML for databases that conform to Core SQL 2008 , as defined in ISO/IEC 9075-1:2008 [ SQL1 ] and ISO/IEC 9075-2:2008 [ SQL2 ]. Processors and mappings may have to deviate from the R2RML specification in order to support databases that do not conform to this version of SQL.

Where SQL queries are embedded into R2RML mappings, SQL version identifiers can be used to indicate the specific version of SQL that is being used.

4 R2RML Processors and Mapping Documents

An R2RML mapping defines a mapping from a relational database to RDF. It is a structure that consists of one or more triples maps .

The input to an R2RML mapping is called the input database .

An R2RML processor is a system that, given an R2RML mapping and an input database , provides access to the output dataset .

There are no constraints on the method of access to the output dataset provided by a conforming R2RML processor. An R2RML processor MAY materialize the output dataset into a file, or offer virtual access through an interface that queries the input database, or offer any other means of providing access to the output dataset.

An R2RML processor also has access to an execution environment consisting of:

The SQL connection is used by the R2RML processor to evaluate SQL queries against the input database. It MUST be established with sufficient privileges for read access to all base tables and views that are referenced in the R2RML mapping. It MUST be configured with a default catalog and default schema that will be used when tables and views are accessed without an explicit catalog or schema reference.

How the SQL connection is established, or how users are authenticated against the database, is outside of the scope of this document.

The base IRI MUST be a valid IRI . It SHOULD NOT contain question mark (“ ? ”) or hash (“ # ”) characters and SHOULD end in a slash (“ / ”) character.

To obtain an absolute IRI from a relative IRI, the term generation rules of R2RML use simple string concatenation, rather than the more complex algorithm for resolution of relative URIs defined in Section 5.2 of [RFC3986]. This ensures that the original database value can be reconstructed from the generated absolute IRI. Both algorithms are equivalent if all of the following are true:

  1. The base IRI does not contain question marks or hashes,
  2. the base IRI ends in a slash,
  3. the relative IRI does not start with a slash, and
  4. the relative IRI does not contain any “ . ” or “ .. ” path segments.
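
The difference between the two algorithms can be illustrated with a short Python sketch. The function name r2rml_absolute_iri and the base IRI are hypothetical; urllib.parse.urljoin is used here as a readily available implementation of RFC 3986 reference resolution.

```python
from urllib.parse import urljoin

def r2rml_absolute_iri(base_iri, relative_iri):
    """R2RML-style resolution: plain string concatenation."""
    return base_iri + relative_iri

base = "http://example.com/base/"

# All four conditions hold: both algorithms agree.
plain_concat = r2rml_absolute_iri(base, "employee/7369")
plain_rfc = urljoin(base, "employee/7369")

# The relative IRI contains a ".." segment: the results differ, and only the
# concatenated form still contains the original database value verbatim.
dotted_concat = r2rml_absolute_iri(base, "../7369")
dotted_rfc = urljoin(base, "../7369")

print(plain_concat, plain_rfc)
print(dotted_concat, dotted_rfc)
```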

An R2RML data validator is a system that takes as its input an R2RML mapping , a base IRI , and a SQL connection to an input database , and checks for the presence of data errors . When checking the input database, a data validator MUST report any data errors that are raised in the process of generating the output dataset.

An R2RML processor MAY include an R2RML data validator , but this is not required.

4.1 Mapping Graphs and the R2RML Vocabulary

An R2RML mapping is represented as an RDF graph . In other words, RDF is used not just as the target data model of the mapping, but also as a formalism for representing the R2RML mapping itself.

An RDF graph that represents an R2RML mapping is called an R2RML mapping graph .

The R2RML vocabulary is the set of IRIs defined in this specification that start with the rr: namespace IRI:



http://www.w3.org/ns/r2rml#


An R2RML mapping graph :

The R2RML vocabulary also includes the following R2RML classes, which represent various R2RML mapping constructs. Using these classes is OPTIONAL in a mapping graph. The applicable class of a resource can always be inferred from its properties.

The members of these classes are collectively called mapping components .

Many of these classes differ only in capitalization from properties in the R2RML vocabulary .

Explicit typing of the resources in a mapping graph with R2RML classes is OPTIONAL and has no effect on the behaviour of an R2RML processor. The mapping component represented by any given resource in a mapping graph is defined by the presence or absence of certain properties, as defined throughout this specification. A resource SHOULD NOT be typed as an R2RML class if it does not meet the definition of the class.

4.2 RDF-based Turtle Syntax; Media Type

ISSUE-57: Should R2RML require a specific syntax? The working group has proposed two alternate proposals for this issue:

  1. The R2RML mapping document specifies both the vocabulary and the syntax. The R2RML document MUST be a Turtle document and R2RML processors MUST support Turtle to be able to read such documents. Conformance criteria requires support of R2RML vocabulary written in Turtle.
  2. The R2RML mapping document specifies only the vocabulary. There is no syntax specified. There can be an accompanying "R2RML mapping document in Turtle" that specifies the Turtle syntax. Conformance criteria for the R2RML mapping document requires supporting the vocabulary in any language (Turtle, N-Triple, RDF/XML etc.). Additionally, if an implementation supports the Turtle syntax, it can claim conformance to the "R2RML mapping document in Turtle".

The advantage of the first approach is that it promotes interoperability between different producers and consumers of R2RML files by requiring all to support at least one shared syntax. Without such a shared syntax, an R2RML file created in one tool may be rejected by another tool because both assume different RDF syntaxes. R2RML examples found in educational material may not work in actual implementations due to different syntaxes. This is seen as an impediment to the uptake of R2RML.

The second approach distinguishes between the R2RML vocabulary and the syntax and wants to keep them separate. The advantage of the second approach is that the R2RML mapping document remains independent of any exchange format. This gives flexibility as different syntax flavours of R2RML could be easily defined. It is in the spirit of RDF as an abstract format. Users may have to convert between different RDF syntaxes in order to use R2RML files, but such conversion is not difficult and therefore not seen as an impediment. Thereby, it allows conformance with the R2RML mapping document using any of the standard exchange formats.

There is consensus that Turtle should be used for the examples in this document, as well as for the test cases. The working group seeks comments and opinions on this question and encourages reports to the public-rdb2rdf-comments mailing list.

An R2RML mapping document is any document written in the Turtle [ TURTLE ] RDF syntax that encodes an R2RML mapping graph .

The media type for R2RML mapping documents is the same as for Turtle documents in general: text/turtle. The content encoding of Turtle content is always UTF-8 and the charset parameter on the media type SHOULD always be used: text/turtle;charset=utf-8. The file extension .ttl SHOULD be used.

A conforming R2RML processor SHOULD accept R2RML mapping documents in Turtle syntax. It MAY accept R2RML mapping graphs encoded in other RDF syntaxes.

It is common to use document-local IRIs in mapping documents by defining the default prefix in the beginning of the document, and using it for creating IRIs for mapping components such as triples maps:

@prefix : <#>.
…
:EmpQuery rr:sqlQuery """SELECT * FROM EMP WHERE …""".
…
:EmpTriples rr:logicalTable :EmpQuery.

4.3 Data Errors

A data error is a condition of the data in the input database that would lead to the generation of an invalid RDF term. The following conditions give rise to data errors:

  1. A term map with term type rr:IRI results in the generation of an invalid IRI.
  2. A term map whose natural RDF datatype is overridden with a specified datatype produces an ill-typed literal (see datatype-override RDF literal).

When providing access to the output dataset , an R2RML processor MUST abort any operation that requires inspecting or returning an RDF term whose generation would give rise to a data error , and report an error to the agent invoking the operation. A conforming R2RML processor MAY , however, allow other operations that do not require inspecting or returning these RDF terms , and thus MAY provide partial access to an output dataset that contains data errors. Nevertheless, an R2RML processor SHOULD report data errors as early as possible.

The presence of data errors does not make an R2RML mapping non-conforming.

Data errors cannot generally be detected by analyzing the table schema of the database, but only by scanning the data in the tables. For large and rapidly changing databases, this can be impractical. Therefore, R2RML processors are allowed to answer queries that do not “touch” a data error, and the behavior of such operations is well-defined. For the same reason, the conformance of R2RML mappings is defined without regard for the presence of data errors.

R2RML data validators can be used to explicitly scan a database for data errors.
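
As a rough illustration of the second kind of data error, the following Python sketch scans column values that a mapping overrides to xsd:integer and collects those that would yield ill-typed literals. All names (DataError, integer_literal, scan_for_data_errors) are hypothetical, and the regular expression covers only the xsd:integer lexical form; it is not a complete validator.

```python
import re

XSD_INTEGER_IRI = "http://www.w3.org/2001/XMLSchema#integer"
# Lexical form of xsd:integer: optional sign followed by one or more digits.
XSD_INTEGER_LEXICAL = re.compile(r"[+-]?[0-9]+")

class DataError(Exception):
    """Raised when generating an RDF term would produce an invalid term."""

def integer_literal(value):
    """Build an (lexical form, datatype IRI) pair, or raise DataError if ill-typed."""
    lexical = str(value)
    if XSD_INTEGER_LEXICAL.fullmatch(lexical) is None:
        raise DataError(f"ill-typed xsd:integer literal: {lexical!r}")
    return (lexical, XSD_INTEGER_IRI)

def scan_for_data_errors(values):
    """Validator-style scan: try every value and collect the error messages."""
    errors = []
    for value in values:
        try:
            integer_literal(value)
        except DataError as exc:
            errors.append(str(exc))
    return errors

found = scan_for_data_errors(["10", "ten", "-3"])
print(found)
```

A processor following the rules above would raise the error only when an operation actually touches the offending term, whereas a validator would scan all values up front as this sketch does.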

4.4 Default Mappings

An R2RML processor MAY include an R2RML default mapping generator . This is a facility that introspects the schema of the input database and generates an R2RML mapping , possibly in the form of an R2RML mapping document , intended for further customization by a mapping author. Such a mapping is known as a default mapping .

The default mapping SHOULD be such that its output is the Direct Graph [ DM ] corresponding to the input database .

Duplicate row preservation : For tables without a primary key, the Direct Graph requires that a fresh blank node is created for each row. This ensures that duplicate rows in such tables are preserved. This requirement is relaxed for R2RML default mappings : They MAY reuse the same blank node for multiple duplicate rows. This behaviour does not preserve duplicate rows. R2RML default mapping generators that provide default mappings based on the Direct Graph MUST document whether the generated default mapping preserves duplicate rows or not.

5 Defining Logical Tables

Diagram: The properties of logical tables

Figure 2: The properties of logical tables

A logical table is a tabular SQL query result that is to be mapped to RDF triples. A logical table is either a SQL base table or view, or an R2RML view.

Every logical table has an effective SQL query that, if executed over the SQL connection , produces as its result the contents of the logical table.

A logical table row is a row in a logical table .

A column name is the name of a column of a logical table . A column name MUST be a valid SQL identifier . Column names do not include any qualifying table, view or schema names.

A SQL identifier is the name of a SQL object, such as a column, table, view, schema, or catalog. A SQL identifier MUST match the <identifier> production in [ SQL2 ]. When comparing identifiers for equality, the comparison rules of [ SQL2 ] MUST be used.

An informative summary of SQL identifier syntax rules:
  1. SQL identifiers can be delimited identifiers (with double quotes), or regular identifiers.
  2. Regular identifiers must start with a Unicode character from any of the following character classes: upper-case letter, lower-case letter, title-case letter, modifier letter, other letter, or letter number. Subsequent characters may be any of these, or a nonspacing mark, spacing combining mark, decimal number, connector punctuation, and formatting code.
  3. Regular identifiers are case-insensitive.
  4. Delimited identifiers can contain any character.
  5. A double-quote character inside a delimited identifier is escaped by appending a second double-quote character.
  6. Delimited identifiers are case-sensitive.
  7. deptno and "deptno" are not equivalent (delimited identifiers that are not in all-upper-case are not equivalent to any undelimited identifiers).
  8. DEPTNO and "DEPTNO" are equivalent (all-upper-case delimited and undelimited identifiers are equivalent).
  9. Five examples of valid column names: deptno , dept_no , "dept_no" , "Department Number" , "Identifier ""with quotes""" .
Note that Turtle string syntax requires escaping of double quotes with a backslash, so the identifiers from the list above might be written like this if occurring inside an R2RML mapping document:
[] rr:column "deptno".
[] rr:column "dept_no".
[] rr:column "\"dept_no\"".
[] rr:column "\"Department Number\"".
[] rr:column "\"Identifier \"\"with quotes\"\"\"".
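
The escaping step can be sketched in Python. The helper names turtle_string and column_statement are hypothetical, and only backslashes and double quotes are escaped; a general Turtle serializer would also have to handle line breaks and other special characters.

```python
def turtle_string(value):
    """Wrap a value in double quotes, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return '"' + escaped + '"'

def column_statement(identifier):
    """Render an rr:column statement for a SQL identifier."""
    return f"[] rr:column {turtle_string(identifier)}."

# The five valid column names from the list above, as SQL identifiers.
identifiers = [
    'deptno',
    'dept_no',
    '"dept_no"',
    '"Department Number"',
    '"Identifier ""with quotes"""',
]
statements = [column_statement(i) for i in identifiers]
for statement in statements:
    print(statement)
```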

These rules are for Core SQL 2008 . See Section 3, Conformance regarding databases that do not conform to this version of SQL.

5.1 Base Tables and SQL Views ( rr:tableName )

A SQL base table or view is a logical table containing SQL data from a base table or view in the input database. A SQL base table or view is represented by a resource that has exactly one rr:tableName property.

The value of rr:tableName specifies the table or view name of the base table or view. Its value MUST be a valid schema-qualified name that names an existing base table or view in the input database .

A schema-qualified name is a sequence of one, two or three valid SQL identifiers, separated by the dot character (“.”). The three identifiers name, respectively, a catalog, a schema, and a table or view. If no catalog or schema is specified, then the default catalog and default schema of the SQL connection are assumed.

The effective SQL query of a SQL base table or view is:

SELECT * FROM {table}

with {table} replaced with the table or view name .

The following example shows a logical table specified using a schema-qualified table name.

[] rr:tableName "SCOTT.DEPT".

The following example shows a logical table specified using an unqualified table name. The SQL connection's default schema will be used.

[] rr:tableName "DEPT".

5.2 R2RML Views ( rr:sqlQuery , rr:sqlVersion )

An R2RML view is a logical table whose contents are the result of executing a SQL query against the input database . It is represented by a resource that has exactly one rr:sqlQuery property, whose value MUST be a literal with a lexical form that is a valid SQL query .

R2RML mappings sometimes require data transformation, computation, or filtering before generating triples from the database. This can be achieved by defining a SQL view in the input database and referring to it with rr:tableName . However, this approach may sometimes not be practical for lack of database privileges or other reasons. R2RML views achieve the same effect without requiring changes to the input database.

Note that unlike “real” SQL views, an R2RML view cannot be used as an input table in further SQL queries.

A SQL query is a SELECT query in the SQL language that can be executed over the input database . The string MUST conform to the production <direct select statement: multiple rows> in [ SQL2 ] with an OPTIONAL trailing semicolon character and OPTIONAL surrounding white space (excluding comments) as defined in [ TURTLE ]. It MUST be valid to execute over the SQL connection . The result of the query execution MUST NOT have duplicate column names. Any columns in the SELECT list derived by projecting an expression SHOULD be named, because otherwise they cannot be reliably referenced in the rest of the mapping.

Database objects referenced in the SQL query MAY be qualified with a catalog or schema name. For any database objects referenced without an explicit catalog name or schema name, the default catalog and default schema of the SQL connection are assumed.

For example, the following SELECT query is not a valid R2RML SQL query because the result contains a duplicate column name DEPTNO :


SELECT EMP.DEPTNO, 1 AS DEPTNO FROM EMP;
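
A processor might check for duplicate result column names before using a query. The following is a minimal sketch in Python, assuming an in-memory SQLite database as a stand-in for the input database. The function name `duplicate_column_names` and the case-insensitive comparison are illustrative assumptions and not part of R2RML.

```python
import sqlite3
from collections import Counter


def duplicate_column_names(conn, query):
    """Return the result column names that occur more than once.

    Assumption: names are compared case-insensitively, which only
    approximates the identifier comparison rules of SQL.
    """
    # Drop the optional trailing semicolon and surrounding white space.
    cursor = conn.execute(query.strip().rstrip(";"))
    names = [column[0].upper() for column in cursor.description]
    counts = Counter(names)
    return sorted(name for name, count in counts.items() if count > 1)


# Hypothetical stand-in for the EMP table of the input database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE EMP (EMPNO INTEGER, DEPTNO INTEGER)")

print(duplicate_column_names(conn, "SELECT EMP.DEPTNO, 1 AS DEPTNO FROM EMP;"))
print(duplicate_column_names(conn, "SELECT EMPNO, DEPTNO FROM EMP"))
```

A non-empty result indicates that the query is not a valid R2RML SQL query under the rule above.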

As a further example, the following SELECT query SHOULD NOT be used, because it contains an unnamed column derived from a COUNT expression:

SELECT DEPTNO, COUNT(EMPNO) FROM EMP GROUP BY DEPTNO;

An R2RML view MAY have one or more SQL version identifiers . They MUST be valid IRIs and are represented as values of the rr:sqlVersion property. The following SQL version identifier indicates that the SQL query conforms to Core SQL 2008:

http://www.w3.org/ns/r2rml#SQL2008

The absence of a SQL version identifier indicates that no claim to Core SQL 2008 conformance is made.

No further identifiers besides rr:SQL2008 are defined in this specification. The RDB2RDF Working Group intends to maintain a non-normative list of identifiers for other SQL versions [ SQLIRIS ].

The effective SQL query of an R2RML view is the value of its rr:sqlQuery property.
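
The two definitions of the effective SQL query (for SQL base tables or views, and for R2RML views) can be sketched together. This is only an illustration in Python: the dictionary representation of a logical table and the function name `effective_sql_query` are assumptions made for this example.

```python
def effective_sql_query(logical_table):
    """Return the effective SQL query of a logical table.

    `logical_table` is a hypothetical dict holding either a "tableName"
    entry (SQL base table or view) or a "sqlQuery" entry (R2RML view).
    """
    has_table = "tableName" in logical_table
    has_query = "sqlQuery" in logical_table
    if has_table == has_query:
        raise ValueError("expected exactly one of tableName or sqlQuery")
    if has_table:
        # SQL base table or view: SELECT * FROM {table}
        return "SELECT * FROM " + logical_table["tableName"]
    # R2RML view: the query string itself is the effective SQL query.
    return logical_table["sqlQuery"]


print(effective_sql_query({"tableName": "SCOTT.DEPT"}))
print(effective_sql_query({"sqlQuery": "SELECT DEPTNO, DNAME FROM SCOTT.DEPT"}))
```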

The following example shows a logical table specified as an R2RML view conforming to Core SQL 2008.

[] rr:sqlQuery """
        Select ('Department' || DEPTNO) AS DEPTID
             , DEPTNO
             , DNAME
             , LOC
          from SCOTT.DEPT
    """;
    rr:sqlVersion rr:SQL2008.

6 Mapping Logical Tables to RDF with Triples Maps

Figure 3: The properties of triples maps

A triples map specifies a rule for translating each row of a logical table to zero or more RDF triples .

The RDF triples generated from one row in the logical table all share the same subject.

A triples map is represented by a resource that references the following other resources:

The referenced columns of all term maps of a triples map (subject map, predicate maps, object maps, graph maps) MUST be column names that exist in the term map's logical table . Furthermore, the columns carrying these names in the logical table MUST be of a SQL datatype for which conversion to string is defined. Conversion to string is undefined in SQL 2008 for row types, array types, user-defined datatypes that do not have a user-defined string CAST , and a few other exotic types.

The following example shows a triples map including its logical table, subject map, and two predicate-object maps.

[]
    rr:logicalTable [ rr:tableName "DEPT" ];
    rr:subjectMap [ rr:template "http://data.example.com/department/{DEPTNO}" ];
    rr:predicateObjectMap [
        rr:predicate ex:name;
        rr:objectMap [ rr:column "DNAME" ];
    ];
    rr:predicateObjectMap [
        rr:predicate ex:location;
        rr:objectMap [ rr:column "LOC" ];
    ].

The logical table may also be specified directly on the same resource, without introducing an intermediate resource:

[]
    rr:tableName "DEPT";
    rr:subjectMap [ rr:template "http://data.example.com/department/{DEPTNO}" ];
    # …
    .

6.1 Creating Resources with Subject Maps

A subject map is a term map . It specifies a rule for generating the subjects of the RDF triples generated by a triples map .

6.2 Typing Resources ( rr:class )

A subject map MAY have one or more class IRIs . They are represented by the rr:class property. The values of the rr:class property MUST be IRIs . For each RDF term generated by the subject map, RDF triples with predicate rdf:type and the class IRI as object will be generated.

This property is merely a shortcut for specifying an rr:predicateObjectMap with predicate rdf:type and the rr:class IRI as a constant object. Mappings where the class IRI is not constant, but needs to be computed based on the contents of the database, can be achieved by defining a predicate-object map with predicate rdf:type and a non-constant object map .

In the following example, the generated subject will be asserted as an instance of the ex:Employee class.

[] rr:template "http://data.example.com/employee/{EMPNO}"; 
   rr:class ex:Employee.

Using the example EMP table , the following RDF triple will be generated:

<http://data.example.com/employee/7369> rdf:type ex:Employee.

6.3 Creating Properties and Values with Predicate-Object Maps

A predicate-object map is a function that creates one or more predicate-object pairs for each logical table row of a logical table . It is used in conjunction with a subject map to generate RDF triples in a triples map .

A predicate-object map is represented by a resource that references the following other resources:

A predicate map is a term map .

An object map is a term map .

7 Creating RDF Terms with Term Maps

Figure 4: The properties of term maps

An RDF term is either an IRI , or a blank node , or a literal .

A term map is a function that generates an RDF term from a logical table row . The result of that function is known as the term map's generated RDF term .

Term maps are used to generate the subjects, predicates and objects of the RDF triples that are generated by a triples map . Consequently, there are several kinds of term maps , depending on where in the mapping they occur: subject maps , predicate maps , object maps and graph maps .

A term map MUST be exactly one of the following:

The referenced columns of a term map are the set of column names referenced in the term map and depend on the type of term map.

7.1 Constant RDF Terms ( rr:constant )

A constant-valued term map is a term map that ignores the logical table row and always generates the same RDF term. A constant-valued term map is represented by a resource that has exactly one rr:constant property.

The constant value of a constant-valued term map is the RDF term that is the value of its rr:constant property.

If the constant-valued term map is a subject map , predicate map or graph map , then its constant value MUST be an IRI .

If the constant-valued term map is an object map , then its constant value MUST be an IRI or literal .

The referenced columns of a constant-valued term map is the empty set.

Constant-valued term maps can be expressed more concisely using the constant shortcut properties rr:subject , rr:predicate , rr:object and rr:graph . Occurrences of these properties MUST be treated exactly as if the following triples were present in the mapping graph instead:

Triple involving constant shortcut property | Replacement triples
?x rr:subject ?y. | ?x rr:subjectMap [ rr:constant ?y ].
?x rr:predicate ?y. | ?x rr:predicateMap [ rr:constant ?y ].
?x rr:object ?y. | ?x rr:objectMap [ rr:constant ?y ].
?x rr:graph ?y. | ?x rr:graphMap [ rr:constant ?y ].
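
The replacement rule in the table above can be sketched as a rewrite over the triples of a mapping graph. This is a minimal illustration in Python, assuming triples are plain `(subject, predicate, object)` tuples of strings. The function name `expand_shortcuts` and the blank node labels of the form `_:constN` are hypothetical, and a real implementation would have to make sure that fresh labels do not collide with existing blank nodes.

```python
import itertools

RR = "http://www.w3.org/ns/r2rml#"

# Constant shortcut property -> property that takes a full term map.
SHORTCUTS = {
    RR + "subject": RR + "subjectMap",
    RR + "predicate": RR + "predicateMap",
    RR + "object": RR + "objectMap",
    RR + "graph": RR + "graphMap",
}


def expand_shortcuts(triples):
    """Replace each constant shortcut triple by its two replacement triples."""
    counter = itertools.count()
    expanded = []
    for s, p, o in triples:
        if p in SHORTCUTS:
            # Fresh blank node standing for the constant-valued term map.
            term_map = "_:const%d" % next(counter)
            expanded.append((s, SHORTCUTS[p], term_map))
            expanded.append((term_map, RR + "constant", o))
        else:
            expanded.append((s, p, o))
    return expanded


for triple in expand_shortcuts([("_:pom", RR + "predicate", "rdf:type")]):
    print(triple)
```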

The following example shows a predicate-object map that uses a constant-valued term map both for its predicate and for its object.

[] rr:predicateMap [ rr:constant rdf:type ];
   rr:objectMap [ rr:constant ex:Employee ].

If added to a triples map , this predicate-object map would add the following triple to all resources ?x generated by the triples map:

?x rdf:type ex:Employee.

The following example uses constant shortcut properties and is equivalent to the example above:

[] rr:predicate rdf:type;
   rr:object ex:Employee.

7.2 From a Column ( rr:column )

A column-valued term map is a term map that is represented by a resource that has exactly one rr:column property.

The value of the rr:column property MUST be a valid column name . The column value of the term map is the data value of that column in a given logical table row .

The referenced columns of a column-valued term map is the singleton set containing the value of the term map's rr:column property.

The following example defines an object map that generates literals from the DNAME column of some logical table.

[] rr:objectMap [ rr:column "DNAME" ].

Using the sample row from the DEPT table as a logical table row, the column value of the object map would be “ APPSERVER ”.

7.3 From a Template ( rr:template )

A template-valued term map is a term map that is represented by a resource that has exactly one rr:template property. The value of the rr:template property MUST be a valid string template .

A string template is a format string that can be used to build strings from multiple components. It can reference column names by enclosing them in curly braces (“ { ” and “ } ”). The following syntax rules apply to valid string templates:

The template value of the term map for a given logical table row is determined as follows:

  1. Let result be the template string
  2. For each pair of unescaped curly braces in result :
    1. Let value be the data value of the column whose name is enclosed in the curly braces
    2. If value is NULL , then return NULL
    3. Let value be the natural RDF lexical form corresponding to value
    4. If the term type is rr:IRI , then replace the pair of curly braces with an IRI-safe version of value ; otherwise, replace the pair of curly braces with value
  3. Return result

The IRI-safe version of a string is obtained by applying the following transformation to any character that is not in the iunreserved production in [ RFC3987 ]:

  1. Convert the character to a sequence of one or more octets using UTF-8 [ RFC3629 ]
  2. Percent-encode each octet [ RFC3986 ]

The following table shows examples of strings and their IRI-safe versions:

String | IRI-safe version
42 | 42
Hello World! | Hello%20World%21
2011-08-23T22:17:00Z | 2011-08-23T22%3A17%3A00Z
~A_17.1-2 | ~A_17.1-2
葉篤正 | 葉篤正
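
The IRI-safe transformation can be sketched as follows. This is an illustrative Python version and not a normative implementation: the names `is_iunreserved` and `iri_safe` are assumptions, and the character ranges are transcribed from the iunreserved and ucschar productions of [ RFC3987 ].

```python
import string

# iunreserved = ALPHA / DIGIT / "-" / "." / "_" / "~" / ucschar
_ASCII_UNRESERVED = set(string.ascii_letters + string.digits + "-._~")

# ucschar ranges of RFC 3987: three BMP ranges, planes 1 to 13, and plane 14
# starting at E1000.
_UCSCHAR_RANGES = (
    [(0xA0, 0xD7FF), (0xF900, 0xFDCF), (0xFDF0, 0xFFEF)]
    + [(plane << 16, (plane << 16) + 0xFFFD) for plane in range(1, 14)]
    + [(0xE1000, 0xEFFFD)]
)


def is_iunreserved(ch):
    """True if the character matches the iunreserved production."""
    if ch in _ASCII_UNRESERVED:
        return True
    code = ord(ch)
    return any(low <= code <= high for low, high in _UCSCHAR_RANGES)


def iri_safe(text):
    """Percent-encode the UTF-8 octets of every character not in iunreserved."""
    parts = []
    for ch in text:
        if is_iunreserved(ch):
            parts.append(ch)
        else:
            parts.append("".join("%%%02X" % octet for octet in ch.encode("utf-8")))
    return "".join(parts)


for sample in ["42", "Hello World!", "2011-08-23T22:17:00Z", "~A_17.1-2", "葉篤正"]:
    print(sample, "->", iri_safe(sample))
```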

R2RML always performs percent-encoding when IRIs are generated from string templates. If IRIs need to be generated without percent-encoding, then rr:column should be used instead of rr:template , with an R2RML view that performs the string concatenation.

In the case of string templates that generate IRIs, any single character that is legal in an IRI, but percent-encoded in the IRI-safe version of a data value, is a safe separator . This includes in particular the eleven sub-delim characters defined in [ RFC3987 ]: !$&'()*+,;=

The referenced columns of a template-valued term map is the set of column names enclosed in unescaped curly braces in the template string .

The following example defines a subject map that generates IRIs from the DEPTNO column of a logical table.

[] rr:subjectMap [ rr:template "http://data.example.com/department/{DEPTNO}" ].

Using the sample row from the DEPT table as a logical table row, the template value of the subject map would be:

http://data.example.com/department/10

The following example shows how an IRI-safe template value is created:

[] rr:subjectMap [ rr:template "http://data.example.com/site/{LOC}" ].

Using the sample row from the DEPT table as a logical table row, the template value of the subject map would be:

http://data.example.com/site/NEW%20YORK

The space character is not in the iunreserved set, and therefore percent-encoding is applied to the character, yielding “ %20 ”.

The following example shows the use of backslash escapes in string templates. The template will generate a fancy title such as

{{{ \o/ Hello World! \o/ }}}

from a string “ Hello World! ” in the TITLE column. By default, rr:template generates IRIs. Since the intention here is to create a literal instead, the term type has to be set.

[] rr:objectMap [
    rr:template "\\{\\{\\{ \\\\o/ {TITLE} \\\\o/ \\}\\}\\}";
    rr:termType rr:Literal;
].

Note that because backslashes need to be escaped by a second backslash in the Turtle syntax [ TURTLE ], a double backslash is needed to escape each curly brace, and to get one literal backslash in the output one needs to write four backslashes in the template.
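
The computation of a template value, including backslash escapes, can be sketched as follows. This Python illustration makes several simplifying assumptions that are not part of R2RML: a logical table row is a dict from column names to values, `str()` stands in for the natural RDF lexical form, `urllib.parse.quote` is used as an ASCII-only approximation of the IRI-safe version (it also percent-encodes non-ASCII characters, which the real rule does not), and the template is assumed to be well-formed. The function name `template_value` is hypothetical.

```python
from urllib.parse import quote


def template_value(template, row, term_type="IRI"):
    """Expand a string template against one row; return None if a column is NULL.

    `template` is the template string after Turtle unescaping, so a
    backslash escape appears here as a single backslash.
    """
    out = []
    i = 0
    while i < len(template):
        ch = template[i]
        if ch == "\\" and i + 1 < len(template):
            # Escaped character: copy it literally.
            out.append(template[i + 1])
            i += 2
        elif ch == "{":
            # Collect the column name up to the matching unescaped "}".
            end = i + 1
            name = []
            while template[end] != "}":
                if template[end] == "\\":
                    end += 1
                name.append(template[end])
                end += 1
            value = row["".join(name)]
            if value is None:
                return None
            text = str(value)
            out.append(quote(text, safe="-._~") if term_type == "IRI" else text)
            i = end + 1
        else:
            out.append(ch)
            i += 1
    return "".join(out)


print(template_value("http://data.example.com/site/{LOC}", {"LOC": "NEW YORK"}))
print(template_value(r"\{\{\{ \\o/ {TITLE} \\o/ \}\}\}",
                     {"TITLE": "Hello World!"}, term_type="Literal"))
```

The raw string in the second call corresponds to the template after the Turtle parser has already removed one level of backslash escaping.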

7.4 IRIs, Literal, Blank Nodes ( rr:termType )

The term type of a column-valued term map or template-valued term map determines the kind of generated RDF term ( IRIs , blank nodes or literals ).

If the term map has an optional rr:termType property, then its term type is the value of that property. The value MUST be an IRI and MUST be one of the following options:

If the term map does not have a rr:termType property, then its term type is:

Term maps with term type rr:IRI cause data errors if the value is not a valid IRI (see generated RDF term for details). Data values from the input database may require percent-encoding before they can be used in IRIs. Template-valued term maps are a convenient way of percent-encoding data values.

Constant-valued term maps are not considered as having a term type , and specifying rr:termType on these term maps has no effect. The type of the generated RDF term is determined directly by the value of rr:constant : If it is an IRI, then an IRI will be generated; if it is a literal, a literal will be generated.

7.5 Language Tags ( rr:language )

A term map with a term type of rr:Literal MAY have a specified language tag . It is represented by the rr:language property on a term map. If present, its value MUST be a valid language tag .

A specified language tag causes generated literals to be language-tagged plain literals. In the following example, plain literals with language tag “ en-us ” (U.S. English) will be generated for the data values in the DNAME column.

[] rr:objectMap [ rr:column "DNAME"; rr:language "en-us" ].

7.6 Typed Literals ( rr:datatype )

A datatypeable term map is a term map with a term type of rr:Literal that does not have a specified language tag .

Datatypeable term maps may generate typed literals . The datatype of these literals can be automatically determined based on the SQL datatype of the underlying logical table column (producing a natural RDF literal ), or it can be explicitly overridden using rr:datatype (producing a datatype-override RDF literal ).

A datatypeable term map MAY have an rr:datatype property. Its value MUST be an IRI . This IRI is the specified datatype of the term map.

A term map MUST NOT have more than one rr:datatype value.

A term map that is not a datatypeable term map MUST NOT have an rr:datatype property.

The implicit SQL datatype of a datatypeable term map is CHARACTER VARYING if the term map is a template-valued term map ; otherwise, it is the SQL datatype of the respective column in the logical table row .

See generated RDF term for further details on generating literals from term maps.

One cannot explicitly state that a plain literal without language tag should be generated. They are the default for string columns. To generate one from a non-string column, a template-valued term map with a template such as "{MY_COLUMN}" and a term type of rr:Literal can be used.

The following example shows an object map that overrides the default datatype of the logical table with an explicitly specified xsd:positiveInteger type. A datatype-override RDF literal of that datatype will be generated from whatever is in the EMPNO column.

[] rr:objectMap [ rr:column "EMPNO"; rr:datatype xsd:positiveInteger ].

7.7 Inverse Expressions ( rr:inverseExpression )

An inverse expression is a string template associated with a column-valued term map or template-valued term map . It is represented by the value of the rr:inverseExpression property. This property is OPTIONAL and there MUST NOT be more than one for a term map.

Inverse expressions are useful for optimizing term maps that reference derived columns in R2RML views . An inverse expression specifies an expression that allows “reversing” of a generated RDF term and the construction of a SQL query that efficiently retrieves the logical table row from which the term was generated. In particular, it allows the use of indexes on the underlying relational tables.

Every pair of unescaped curly braces in the inverse expression is a column reference in an inverse expression . The string between the braces MUST be a valid column name .

An inverse expression MUST satisfy the following condition:

For example, for the DEPTID column in the logical table used for mapping the DEPT table in this example mapping , an inverse expression could be defined as follows:

[] rr:column "DEPTID";
   rr:inverseExpression "{DEPTNO} = SUBSTRING({DEPTID}, CHARACTER_LENGTH('Department')+1)".

This facilitates the use of an existing index on the DEPTNO column of the DEPT table .

A quoted and escaped data value is any SQL string that matches the <literal> or <null specification> productions of [ SQL2 ]. This string can be used in a SQL query to specify a SQL data value. Examples:

8 Foreign Key Relationships among Logical Tables ( rr:parentTriplesMap , rr:joinCondition , rr:child and rr:parent )

Figure 5: The properties of referencing object maps

A referencing object map allows using the subjects of another triples map as the objects generated by a predicate-object map . Since both triples maps may be based on different logical tables , this may require a join between the logical tables. This is not restricted to 1:1 joins.

A referencing object map is represented by a resource that:

A join condition is represented by a resource that has exactly one value for each of the following two properties:

The child query of a referencing object map is the effective SQL query of the logical table of the triples map containing the referencing object map.

The parent query of a referencing object map is the effective SQL query of the logical table of its parent triples map .

If the child query and parent query of a referencing object map are not identical, then the referencing object map MUST have at least one join condition .

The joint SQL query of a referencing object map is:

The joint SQL query is used when generating RDF triples from referencing object maps .

The following example shows a referencing object map as part of a predicate-object map :

[] rr:predicateObjectMap [
    rr:predicate ex:department;
    rr:objectMap [
        rr:parentTriplesMap <#TriplesMap2>;
        rr:joinCondition [
            rr:child "DEPTNO";
            rr:parent "DEPTNO";
        ];
    ];
].

If the logical table of the surrounding triples map is EMP , and the logical table of <#TriplesMap2> is DEPT , this would result in a join between these two tables with the condition

EMP.DEPTNO = DEPT.DEPTNO

and the objects of the triples would be generated using the subject map of <#TriplesMap2> .

Given the two example tables , and subject maps as defined in the example mapping , this would result in a triple:

<http://data.example.com/employee/7369> ex:department <http://data.example.com/department/10>.

The following example shows a referencing object map that does not have a join condition . It creates two kinds of resources from the DEPT table : departments and sites.

<#DeptTriplesMap>
    rr:logicalTable [ rr:tableName "DEPT" ];
    rr:subjectMap [
        rr:template "department/{DEPTNO}";
        rr:class ex:Department;
    ];
    rr:predicateObjectMap [
        rr:predicate ex:location;
        rr:objectMap [ rr:parentTriplesMap <#SiteTriplesMap> ];
    ].
<#SiteTriplesMap>
    rr:logicalTable [ rr:tableName "DEPT" ];
    rr:subjectMap [
        rr:template "site/{LOC}";
        rr:class ex:Site;
    ];
    rr:predicateObjectMap [
        rr:predicate ex:siteName;
        rr:objectMap [ rr:column "LOC" ];
    ].

An ex:Site resource is created for each distinct value in the LOC column, using the <#SiteTriplesMap> . Departments and sites are linked by ex:location triples, and the objects of these triples are specified using a referencing object map that references the sites triples map. No join condition is needed as both triples maps use the same logical table (the base table DEPT ). Given the example table, this mapping would result in four triples (assuming an appropriate base IRI ):

<http://data.example.com/department/10> rdf:type ex:Department.
<http://data.example.com/department/10> ex:location <http://data.example.com/site/NEW%20YORK>.
<http://data.example.com/site/NEW%20YORK> rdf:type ex:Site.
<http://data.example.com/site/NEW%20YORK> ex:siteName "NEW YORK".

9 Assigning Triples to Named Graphs

Figure 6: The properties of graph maps

Each triple generated from an R2RML mapping is placed into one or more graphs of the output dataset . Possible target graphs are the unnamed default graph , and the IRI -named named graphs .

Any subject map or predicate-object map MAY have one or more associated graph maps . They are specified in one of two ways:

  1. using the rr:graphMap property, whose value MUST be a graph map ,
  2. using the constant shortcut property rr:graph .

Graph maps are themselves term maps . When RDF triples are generated , the set of target graphs is determined by taking into account any graph maps associated with the subject map or predicate-object map.

If a graph map generates the special IRI rr:defaultGraph , then the target graph is the default graph of the output dataset .

In the following subject map example, all generated RDF triples will be stored in the named graph ex:DepartmentGraph .

[] rr:subjectMap [
    rr:template "http://data.example.com/department/{DEPTNO}";
    rr:graphMap [ rr:constant ex:DepartmentGraph ];
].

This is equivalent to the following example, which uses a constant shortcut property :

[] rr:subjectMap [
    rr:template "http://data.example.com/department/{DEPTNO}";
    rr:graph ex:DepartmentGraph;
].

In the following example, RDF triples are placed into named graphs according to the job title of employees:

[] rr:subjectMap [
    rr:template "http://data.example.com/employee/{EMPNO}";
    rr:graphMap [ rr:template "http://data.example.com/jobgraph/{JOB}" ];
].

The triples generated from the EMP table would be placed in the named graph with the following IRI:

<http://data.example.com/jobgraph/CLERK>

9.1 Scope of Blank Nodes

Blank nodes in the output dataset are scoped to a single RDF graph . If the same blank node identifier occurs in multiple RDF triples that are in the same graph, then the triples will share the same single blank node. If, however, the same blank node identifier occurs in multiple graphs, then a distinct blank node is created for each graph. An R2RML-generated blank node can never be shared by two triples in two different graphs.

This implies that triples generated from a single logical table row will have different subjects if the subjects are blank nodes and the triples are placed into different graphs.
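
One way to picture this scoping rule is a lookup keyed on both the target graph and the blank node identifier. The following Python sketch is only an illustration; the class `BlankNode`, the function `blank_node_for` and the use of a dict as cache are assumptions made for this example.

```python
class BlankNode:
    """Opaque blank node; only its identity matters in this sketch."""


def blank_node_for(cache, graph, identifier):
    """Return the blank node for an identifier, scoped to one graph."""
    key = (graph, identifier)
    if key not in cache:
        cache[key] = BlankNode()
    return cache[key]


cache = {}
a = blank_node_for(cache, "http://data.example.com/jobgraph/CLERK", "emp7369")
b = blank_node_for(cache, "http://data.example.com/jobgraph/CLERK", "emp7369")
c = blank_node_for(cache, None, "emp7369")  # None stands for the default graph
print(a is b, a is c)
```

The same identifier yields one shared node within a graph and a distinct node in every other graph.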

10 Datatype Conversions

This section defines mappings from SQL data values to RDF literals .

10.1 Introduction (Informative)

This section defines the following mappings from SQL data values:

  1. The natural RDF literal is a mapping to literals . It is used in R2RML and in the Direct Mapping of Relational Data to RDF [ DM ] as the default mapping when literals are created. It maps SQL datatypes to corresponding XML Schema datatypes [ XMLSCHEMA2 ] and loosely follows ISO/IEC 9075-14:2008 [ SQL14 ].
  2. The natural RDF lexical form is similar, but produces only the lexical form of the typed literal and recommends that implementations perform XSD canonicalization. It is used in R2RML when non-string columns are used in a string context, for example when a TIMESTAMP is used in an IRI template .
  3. The canonical RDF lexical form is again similar, but requires XSD canonicalization. It is used in the Direct Mapping when IRIs are generated.
  4. The datatype-override RDF literal is a mapping that constructs typed literals by using the natural RDF lexical form and applying a specified datatype IRI . The mapping author is responsible for ensuring that the generated lexical form is valid for the datatype. It is used in R2RML when the target datatype of a literal-generating term map is overridden using rr:datatype .

The mappings cover all predefined Core SQL 2008 datatypes except INTERVAL . The natural mappings may be extended with custom handling for other types , such as vendor-specific SQL datatypes. In the absence of such extensions, the natural mappings fall back on a simple cast to string for all unsupported SQL datatypes.

The mappings are referenced in the R2RML term generation rules .

An informative summary of XSD lexical forms is provided to aid implementers.

10.2 Natural Mapping of SQL Values

The natural RDF literal corresponding to a SQL data value is the result of applying the following steps:

  1. Let dt be the SQL datatype of the SQL data value.
  2. If dt is a character string type (in Core SQL 2008: CHARACTER , CHARACTER VARYING , CHARACTER LARGE OBJECT , NATIONAL CHARACTER , NATIONAL CHARACTER VARYING , NATIONAL CHARACTER LARGE OBJECT ), then the result is a plain literal without language tag whose lexical form is the SQL data value.
  3. Otherwise, if dt is listed in the table below : The result is a typed literal whose datatype IRI is the IRI indicated in the RDF datatype column in the same row as dt . The lexical form may be any lexical form that represents the same value as the SQL data value, according to the definition of the RDF datatype. If there are multiple lexical forms available that represent the same value (e.g., 1 , +1 , 1.0 and 1.0E0 ), then the choice is implementation-dependent. However, the choice MUST be made so that given a target RDF datatype and value, the same lexical form is chosen consistently (e.g., INTEGER 5 and BIGINT 5 must be mapped to the same lexical form, as both are mapped to the RDF datatype xsd:integer and are equal values; mapping one to 5 and the other to +5 would be an error). The canonical lexical representation [ XMLSCHEMA2 ] MAY be chosen. (See also: Summary of XSD Lexical Forms )
  4. Otherwise, the result is a plain literal without language tag whose lexical form is the SQL data value cast to string .

SQL datatype | RDF datatype | Lexical transformation (informative)
BINARY , BINARY VARYING , BINARY LARGE OBJECT | xsd:hexBinary | xsd:hexBinary lexical mapping
NUMERIC , DECIMAL | xsd:decimal | none required
SMALLINT , INTEGER , BIGINT | xsd:integer | none required
FLOAT , REAL , DOUBLE PRECISION | xsd:double | none required
BOOLEAN | xsd:boolean | ensure lowercase ( true , false )
DATE | xsd:date | none required
TIME | xsd:time | none required
TIMESTAMP | xsd:dateTime | replace space character with “ T ”
INTERVAL | undefined | undefined
Any types not appearing in the table, including all character string types and vendor-specific types, will default to producing RDF plain literals by using conversion to string .

R2RML extensions that handle vendor-specific or user-defined datatypes beyond those of Core SQL 2008 are expected to behave as if the table above contained additional rows that associate the SQL datatypes with appropriate RDF-compatible datatypes (e.g., the XML Schema built-in types [ XMLSCHEMA2 ]), and appropriate lexical transformations where required. Note however that future versions of R2RML may also normatively add additional rows to this table.

The translation of INTERVAL is left undefined due to the complexity of the translation. [ SQL14 ] describes a translation of INTERVAL to xdt:yearMonthDuration and xdt:dayTimeDuration .

In [ SQL2 ], the precision of many SQL datatypes is not fixed, but left implementation-defined. Therefore, the mapping to XML Schema datatypes must rely on arbitrary-precision types such as xsd:decimal , xsd:integer and xsd:dateTime . Implementers of the mapping may wish to set upper limits for the supported precision of these XSD types. The XML Schema specification allows such partial implementations of infinite datatypes [ XMLSCHEMA2 ], and defines specific minimum requirements.

The natural RDF datatype corresponding to a SQL datatype is the value of the RDF datatype column in the row corresponding to the SQL datatype in the table shows examples above .

The natural RDF lexical form corresponding to a SQL data value is the lexical form of its corresponding natural RDF literal , with the additional constraint that the canonical lexical representation [XMLSCHEMA2] SHOULD be chosen.

The canonical RDF lexical form corresponding to a SQL data value is the lexical form of its corresponding natural RDF literal , with the additional constraint that the canonical lexical representation [ XMLSCHEMA2 ] MUST be chosen.

Cast to string is an implementation-dependent function that maps SQL data values to equivalent Unicode strings. It is undefined for the following kinds of SQL datatypes: collection types, row types, user-defined types without a user-defined string CAST , reference types whose referenced type does not have a user-defined string CAST , and binary types.

Cast to string is a fallback that handles vendor-specific and user-defined datatypes not supported by the R2RML processor. It can be implemented in a number of ways, including explicit SQL casts (“ CAST( value AS VARCHAR( n )) ”, where n is an arbitrarily large integer), implicit SQL casts (concatenation with the empty string), or by employing a database access API that presents return values as strings.

10.3 Datatype-override Mapping of SQL Values

The datatype-override RDF literal corresponding to a SQL data value v and a datatype IRI dt is a typed literal whose lexical form is the natural RDF lexical form corresponding to v , and whose datatype IRI is dt . If the typed literal is ill-typed , then a data error is raised.

A typed literal is ill-typed in R2RML if its datatype IRI denotes a validatable RDF datatype and its lexical form is not in the lexical space of the RDF datatype identified by its datatype IRI . (See also: Summary of XSD Lexical Forms )

The set of validatable RDF datatypes includes all datatypes in the RDF datatype column of the table of natural datatype mappings , as defined in [ XMLSCHEMA2 ]. This set MAY include implementation-defined additional RDF datatypes.

For example, "X"^^xsd:boolean is ill-typed because xsd:boolean is a validatable RDF datatype in R2RML, and “ X ” is not in the lexical space of xsd:boolean [ XMLSCHEMA2 ].
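The ill-typed check described above can be sketched in Python as follows. This is a minimal illustration rather than a normative definition: the table covers only three of the validatable datatypes, whitespace handling is ignored, and the names `LEXICAL_SPACE` and `is_ill_typed` are hypothetical.

```python
import re

XSD = "http://www.w3.org/2001/XMLSchema#"

# Lexical-space patterns for a few validatable datatypes.
# Assumption: only these three datatypes are validatable in this sketch.
LEXICAL_SPACE = {
    XSD + "boolean": re.compile(r"true|false|1|0"),
    XSD + "integer": re.compile(r"[+-]?\d+"),
    XSD + "decimal": re.compile(r"[+-]?(\d+(\.\d*)?|\.\d+)"),
}


def is_ill_typed(lexical_form: str, datatype_iri: str) -> bool:
    """Return True if the datatype is validatable and the lexical form
    lies outside its lexical space."""
    pattern = LEXICAL_SPACE.get(datatype_iri)
    if pattern is None:
        # Datatypes that are not validatable can never yield an ill-typed literal.
        return False
    return pattern.fullmatch(lexical_form) is None
```

With this sketch, `is_ill_typed("X", XSD + "boolean")` is true, matching the example above, while a literal with an unknown datatype IRI is never reported as ill-typed.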

10.4 Non-String Columns in String Contexts

A data value of a non-character-string SQL datatype can typically be represented in multiple different string forms. For example, the DOUBLE value 1 can be represented as 1 , +1 , 1.0 and 1.0E0 . This can cause interoperability issues when such values are used in string contexts, for example when using them to generate IRIs . Two IRIs that are character-for-character equivalent, except one contains 1 where the other contains 1.0 , will not “link up” in an RDF graph: they are two different nodes.

To reduce portability issues arising from such conversions, this specification recommends that implementations convert non-string data values to a canonical form (see natural RDF lexical form ). However, this is not a strict requirement. Therefore, when portability between R2RML implementations is a concern, mapping authors SHOULD NOT use non-character-string columns in contexts where strings are produced:

In these contexts, if portability is to be maximized, then mapping authors SHOULD use an R2RML view instead and explicitly convert the non-string column to a string column using an SQL expression.

Note that this is not a problem when natural RDF literals are generated from such columns, because the resulting literal has a corresponding non-string XSD datatype, and equivalences between different lexical forms within these datatypes are well-defined.
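The following Python fragment illustrates the issue with the DOUBLE value 1. It is only a sketch: the IRI prefix `BASE` is a hypothetical example, and Python's `float` stands in for the value space of xsd:double.

```python
BASE = "http://example.com/item/"  # hypothetical IRI prefix

# String forms of the DOUBLE value 1 mentioned in the text.
forms = ["1", "+1", "1.0", "1.0E0"]

# Used in a string context, every form yields a different IRI.
iris = {BASE + form for form in forms}

# Interpreted as doubles, all forms denote the same value.
values = {float(form) for form in forms}
```

Here `iris` contains four distinct IRIs, whereas `values` contains a single number, which is why typed literals do not suffer from the problem.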

10.5 Summary of XSD Lexical Forms (Informative)

The natural mappings make reference to various XSD datatypes and require that SQL data values be converted to strings that are appropriate as lexical forms for these datatypes. This subsection gives examples of these lexical forms in order to aid implementers of the mappings. This subsection is non-normative; the normative definitions of the lexical spaces as well as the canonical lexical mappings are found in W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes [ XMLSCHEMA2 ].

A general approach that may be used for implementing the natural mappings is as follows:

  1. Identify the SQL datatype of the input SQL data value.
  2. Look up its corresponding natural RDF datatype .
  3. Apply cast to string to the SQL data value.
  4. Ensure that the resulting string is in the lexical space of the target RDF datatype; that is, it must be in a form such as those listed in either column of the table below. This may require some transformations of the string, in particular for xsd:hexBinary , xsd:dateTime and xsd:boolean .
  5. If the goal is to obtain a canonical lexical representation, then further string transformations may be required to obtain a form such as those listed in the Canonical lexical forms column of the table below.
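The general approach above can be sketched in Python. This is a simplified illustration under stated assumptions: the input is assumed to be the result of cast to string, datatypes are identified by prefixed names, only a handful of datatypes are handled, and for xsd:dateTime only the space-to-“T” replacement is performed (no time zone or fractional-second normalization). The function name `natural_lexical_form` is hypothetical.

```python
from decimal import Decimal


def natural_lexical_form(cast_string: str, rdf_datatype: str) -> str:
    """Turn a cast-to-string result into a lexical form of the target datatype."""
    if rdf_datatype == "xsd:boolean":
        # SQL yields TRUE/FALSE; XSD requires lowercase.
        lowered = cast_string.strip().lower()
        return {"true": "true", "1": "true", "false": "false", "0": "false"}[lowered]
    if rdf_datatype == "xsd:integer":
        # Drops a leading plus sign and leading zeros.
        return str(int(cast_string))
    if rdf_datatype == "xsd:decimal":
        number = Decimal(cast_string)
        if number == 0:
            return "0"
        text = format(number, "f")
        if "." in text:
            # Remove trailing zeros, and the point if nothing follows it.
            text = text.rstrip("0").rstrip(".")
        return text
    if rdf_datatype == "xsd:dateTime":
        # Replace the space between date and time with "T".
        return cast_string.replace(" ", "T", 1)
    if rdf_datatype == "xsd:hexBinary":
        return cast_string.upper()
    # Other datatypes are passed through unchanged in this sketch.
    return cast_string
```

For example, `natural_lexical_form("42.0", "xsd:decimal")` gives `42` and `natural_lexical_form("2011-08-23 22:17:00", "xsd:dateTime")` gives `2011-08-23T22:17:00`.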

| RDF datatype | Non-canonical lexical forms | Canonical lexical forms | Comments |
|---|---|---|---|
| xsd:hexBinary | 5232524d4c | 5232524D4C | Convert from SQL by applying xsd:hexBinary lexical mapping. |
| xsd:decimal | .224 | 0.224 | |
| | +001 | 1 | |
| | 42.0 | 42 | |
| | -5.9000 | -5.9 | |
| xsd:integer | -05 | -5 | |
| | +333 | 333 | |
| | 00 | 0 | |
| xsd:double | -5.90 | -5.9E0 | Also supports INF , -INF , NaN and -0.0E0 , but these do not appear in standard SQL. |
| | +0.00014770215000 | 1.4770215E-4 | |
| | +01E+3 | 1.0E3 | |
| | 100.0 | 1.0E2 | |
| | 0 | 0.0E0 | |
| xsd:boolean | 1 | true | Must be lowercase. |
| | 0 | false | |
| xsd:date | | 2011-08-23 | Dates in SQL don't have timezone offsets. They are optional in XSD. |
| xsd:time | 22:17:34.885+00:00 | 22:17:34.885Z | May or may not have timezone offset. |
| | 22:17:34.000 | 22:17:34 | |
| | 22:17:34.1+01:00 | 22:17:34.1+01:00 | |
| xsd:dateTime | 2011-08-23T22:17:00.000+00:00 | 2011-08-23T22:17:00Z | May or may not have timezone offset. Convert from SQL by replacing space with “ T ”. |

10.2 Conversion to string

Conversion to string is the process of transforming a SQL data value to a Unicode string. Its result MUST be the same as evaluating the following SQL expression, as defined in [SQL2]:

CAST( value AS CHARACTER VARYING( max ))

where value is the quoted and escaped form of the SQL data value, and max is the implementation-dependent maximum length of a variable-length character string.

An informative summary of the rules for casting standard SQL 2008 datatypes to string follows. The right column of the table contains a regular expression that is matched by the string-converted form of all SQL data values of the types in the left column:

Type                               Pattern
DECIMAL , NUMERIC                  -?(\.\d+|\d+(\.\d+)?)
SMALLINT , INTEGER , BIGINT        -?\d+
FLOAT , REAL , DOUBLE PRECISION    0E0|-?[1-9]\.\d+
DATE                               \d\d\d\d-\d\d-\d\d
TIME                               \d\d:\d\d:\d\d(.\d+)?([+-]\d\d:\d\d)?
TIMESTAMP                          \d\d\d\d-\d\d-\d\d \d\d:\d\d:\d\d(.\d+)?([+-]\d\d:\d\d)?
BOOLEAN                            TRUE|FALSE

The result of conversion to string is always the shortest possible string that, if interpreted as a SQL literal of the original SQL datatype, has the same value as the original SQL data value. For example, converting the DECIMAL value 1 to string yields 1 , not the longer equal-valued strings 01 or 1.0 .

| SQL datatype | Conversion to string example | Typed literal example |
|---|---|---|
| DECIMAL | 2000000000005.9 | "2000000000005.9"^^xsd:decimal |
| DECIMAL | 2000000000000 | "2000000000000"^^xsd:decimal |
| INTEGER | -1 | "-1"^^xsd:integer |
| REAL | 5.0E-1 | "5.0E-1"^^xsd:double |
| REAL | 0E0 | "0E0"^^xsd:double |
| DATE | DATE 2011-08-23 | "2011-08-23"^^xsd:date |
| TIME | TIME 22:17:00 | "22:17:00"^^xsd:time |
| TIME | TIME 22:17:00.0000 | "22:17:00.0000"^^xsd:time |
| TIME | TIME 22:17:00+01:00 | "22:17:00+01:00"^^xsd:time |
| TIMESTAMP | TIMESTAMP 2011-08-23 22:17:00 | "2011-08-23T22:17:00"^^xsd:dateTime |

10.3 Conversion to xsd:boolean

Conversion to boolean is the process of transforming a SQL data value of datatype BOOLEAN to a string that is compatible with the xsd:boolean datatype. It consists of the following steps:

  1. Apply conversion to string to the SQL data value.
  2. Convert the resulting string to lowercase.

Example: The result of converting a BOOLEAN SQL data value to string is either TRUE or FALSE . The resulting typed literal is either "true"^^xsd:boolean or "false"^^xsd:boolean .

10.4 Conversion to Datetime

Conversion to datetime is the process of transforming a SQL data value of datatype DATE , TIME or TIMESTAMP to a string that is compatible with the corresponding XSD datatype. It consists of the following steps:

  1. Apply conversion to string to the SQL data value.
  2. Remove any initial string “ DATE ”, “ TIME ” or “ TIMESTAMP ” and any leading spaces from the resulting string.
  3. If the SQL data value is of datatype TIMESTAMP , then replace the 11th character of the string (a space) with an upper-case “ T ”.

Any fractional seconds and/or time zone interval present after conversion to string is included in the resulting string. Examples for conversion to datetime can be found in the table above.

10.5 Conversion to xsd:base64Binary

Base64 encoding is the process of transforming a binary SQL data value to a string that is compatible with the xsd:base64Binary datatype, by applying base64 encoding as restricted for xsd:base64Binary [XMLSCHEMA2] on the binary value.

11 The Output Dataset

The output dataset of an R2RML mapping is an RDF dataset that contains the generated RDF triples for each of the triples maps of the R2RML mapping. The output dataset MUST NOT contain any other RDF triples or named graphs besides these. However, R2RML processors MAY provide access to datasets that contain additional triples or graphs beyond those in the output dataset, such as inferred triples or provenance information.

If a table or column is not explicitly referenced in a triples map , then no RDF triples will be generated for that table or column.

Conforming R2RML processors MAY rename blank nodes when providing access to the output dataset . This means that client applications may see actual blank node identifiers that differ from those produced by the R2RML mapping . Client applications SHOULD NOT rely on the specific text of the blank node identifier for any purpose.

RDF syntaxes and RDF APIs generally represent blank nodes with blank node identifiers . But the characters allowed in blank node identifiers differ between syntaxes, and not all characters occurring in the values produced by a term map may be allowed, so a bijective mapping function from values to valid blank node identifiers may be required. The details of this mapping function are implementation-dependent, and R2RML processors may have to use different functions for different output syntaxes or access interfaces. Strings matching the regular expression [a-zA-Z_][a-zA-Z_0-9-]* are valid blank node identifiers in all W3C-recommended RDF syntaxes (as of this document's publication).
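One possible mapping function of this kind can be sketched in Python. It is an illustration only, since the details are implementation-dependent: the sketch hex-encodes the UTF-8 bytes of the value and prefixes the letter “b”, which always yields a string matching the regular expression above. The function names are hypothetical.

```python
import re

# The pattern quoted in the text for identifiers valid in all W3C-recommended syntaxes.
SAFE_ID = re.compile(r"[a-zA-Z_][a-zA-Z_0-9-]*")


def blank_node_id(value: str) -> str:
    """Map an arbitrary string to a blank node identifier (injective)."""
    return "b" + value.encode("utf-8").hex()


def value_from_id(identifier: str) -> str:
    """Invert blank_node_id, recovering the original value."""
    return bytes.fromhex(identifier[1:]).decode("utf-8")
```

Because the encoding is reversible, two different values never share an identifier, and equal values always receive the same one.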

RDF datasets may contain empty named graphs . R2RML cannot generate such output datasets.

11.1 The Generated RDF Triples of a Triples Map

This subsection describes the process of generating RDF triples from a triples map . This process adds RDF triples to the output dataset . Each generated triple is placed into one or more particular graphs of the output dataset.

The generated RDF triples are determined by the following algorithm. R2RML processors MAY use other means than implementing this algorithm to compute the generated RDF triples, as long as the result is the same.

  1. Let sm be the subject map of the triples map
  2. Let rows be the result of evaluating the effective SQL query of the triples map 's logical table using the SQL connection
  3. Let classes be the class IRIs of sm
  4. Let sgm be the set of graph maps of sm
  5. For each logical table row row in rows , apply the following steps:
    1. Let subject be the generated RDF term that results from applying sm to row
    2. Let subject_graphs be the set of the generated RDF terms that result from applying each term map in sgm to row
    3. For each class in classes , add triples to the output dataset as follows:

      Subject: subject
      Predicate: rdf:type
      Object: class
      Target graphs: If sgm is empty: rr:defaultGraph ; otherwise: subject_graphs

    4. For each predicate-object map of the triples map , apply the following steps:
      1. Let predicates be the set of generated RDF terms that result from applying each of the predicate-object map's predicate maps to row
      2. Let objects be the set of generated RDF terms that result from applying each of the predicate-object map's object maps (but not referencing object maps ) to row
      3. Let pogm be the set of graph maps of the predicate-object map
      4. Let predicate-object_graphs be the set of generated RDF terms that result from applying each graph map in pogm to row
      5. For each possible combination < predicate , object > where predicate is a member of predicates and object is a member of objects , add triples to the output dataset as follows:

        Subject: subject
        Predicate: predicate
        Object: object
        Target graphs: If sgm and pogm are empty: rr:defaultGraph ; otherwise: union of subject_graphs and predicate-object_graphs

  6. For each referencing object map of a predicate-object map of the triples map , apply the following steps:
    1. Let psm be the subject map of the parent triples map of the referencing object map
    2. Let pogm be the set of graph maps of the predicate-object map
    3. Let n be the number of columns in the logical table of the triples map
    4. Let rows be the result of evaluating the joint SQL query of the referencing object map
    5. For each row in rows , apply the following steps:
      1. Let child_row be the logical table row derived by taking the first n columns of row
      2. Let parent_row be the logical table row derived by taking all but the first n columns of row
      3. Let subject be the generated RDF term that results from applying sm to child_row
      4. Let predicates be the set of generated RDF terms that result from applying each of the predicate-object map's predicate maps to child_row
      5. Let object be the generated RDF term that results from applying psm to parent_row
      6. Let subject_graphs be the set of generated RDF terms that result from applying each graph map of sgm to child_row
      7. Let predicate-object_graphs be the set of generated RDF terms that result from applying each graph map in pogm to child_row
      8. For each predicate in predicates , add triples to the output dataset as follows:

        Subject: subject
        Predicate: predicate
        Object: object
        Target graphs: If neither sgm nor pogm has any graph maps : rr:defaultGraph ; otherwise: union of subject_graphs and predicate-object_graphs
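The per-row part of the algorithm for predicate-object maps can be sketched in Python as follows. This is a simplified model, not a reference implementation: term maps and graph maps are represented as plain callables that take a row and return a term or `None`, and the function names are hypothetical. The helper `split_joint_row` shows how a row of the joint SQL query is divided for referencing object maps.

```python
from itertools import product

DEFAULT_GRAPH = "rr:defaultGraph"


def apply_all(term_maps, row):
    """Apply each term map to the row, dropping maps that generate no term."""
    return {term for term in (m(row) for m in term_maps) if term is not None}


def predicate_object_triples(row, subject_map, predicate_maps, object_maps, sgm, pogm):
    """Return (subject, predicate, object, target_graphs) tuples for one row."""
    subject = subject_map(row)
    predicates = apply_all(predicate_maps, row)
    objects = apply_all(object_maps, row)
    if not sgm and not pogm:
        # No graph maps at all: triples go to the default graph.
        target_graphs = frozenset({DEFAULT_GRAPH})
    else:
        target_graphs = frozenset(apply_all(sgm, row) | apply_all(pogm, row))
    if subject is None:
        return set()
    # One triple per <predicate, object> combination.
    return {(subject, p, o, target_graphs) for p, o in product(predicates, objects)}


def split_joint_row(row, n):
    """Split a joint-query row into child_row (first n columns) and parent_row."""
    return row[:n], row[n:]
```

Note that when graph maps exist but generate no term for a row, the set of target graphs is empty and the triple is placed in no graph.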

“Add triples to the output dataset” is a process that takes the following inputs:

Execute the following steps:

  1. If Subject , Predicate or Object is empty , then abort these steps.
  2. Otherwise, generate an RDF triple < Subject , Predicate , Object >
  3. If the set of target graphs includes rr:defaultGraph , add the triple to the default graph of the output dataset .
  4. For each IRI in the set of target graphs that is not equal to rr:defaultGraph , add the triple to a named graph of that name in the output dataset . If the output dataset does not contain a named graph with that IRI, create it first.
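The steps above can be sketched in Python. The sketch makes several assumptions that are not part of the specification: the output dataset is modelled as a set for the default graph plus a dictionary of named graphs, an empty input is represented by `None`, and the class and function names are hypothetical.

```python
DEFAULT_GRAPH = "rr:defaultGraph"


class OutputDataset:
    """Minimal model of an RDF dataset: one default graph and named graphs."""

    def __init__(self):
        self.default_graph = set()
        self.named_graphs = {}


def add_triple(dataset, subject, predicate, obj, target_graphs):
    """Add one triple to every target graph of the dataset."""
    # Step 1: abort if any component is empty (None in this sketch).
    if subject is None or predicate is None or obj is None:
        return
    # Step 2: generate the triple.
    triple = (subject, predicate, obj)
    for graph in target_graphs:
        if graph == DEFAULT_GRAPH:
            # Step 3: the default graph.
            dataset.default_graph.add(triple)
        else:
            # Step 4: create the named graph on first use.
            dataset.named_graphs.setdefault(graph, set()).add(triple)
```

Since graphs are modelled as sets, adding an equal triple twice leaves the graph unchanged, and a named graph only comes into existence once a triple is placed into it.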

RDF graphs cannot contain duplicate RDF triples . Placing multiple equal triples into the same graph has the same effect as placing the triple into the graph only once. Also note the scope of blank nodes .

11.2 The Generated RDF Term of a Term Map

A term map is a function that generates an RDF term from a logical table row . The result of that function can be:

The generated RDF term of a term map for a given logical table row is determined as follows:

The term generation rules , applied to a value , are as follows:

  1. If the value is NULL , then no RDF term is generated.
  2. Otherwise, if the term map 's term type is rr:IRI :
    1. Let value be the natural RDF lexical form corresponding to value .
    2. If value is a valid absolute IRI [RFC3987], then return an IRI generated from value .
    3. Otherwise, prepend value with the base IRI . If the result is a valid absolute IRI [RFC3987], then return an IRI generated from the result.
    4. Otherwise, raise a data error .
  3. Otherwise, if the term type is rr:BlankNode :
    1. Return a blank node that is unique to the natural RDF lexical form corresponding to value . (Note: On Blank Node Identifiers , Scope of Blank Nodes )
  4. Otherwise, if the term type is rr:Literal :
    1. If the term map has a specified language tag , then return a plain literal with that language tag and with the natural RDF lexical form corresponding to value .
    2. Otherwise, if the term map has a non-empty specified datatype that is different from the natural RDF datatype corresponding to the term map's implicit SQL datatype , then return the datatype-override RDF literal corresponding to value and the specified datatype.
    3. Otherwise, return the natural RDF literal corresponding to value .
The algorithm uses simple string concatenation for obtaining an absolute IRI from a relative IRI, rather than the more complex algorithm defined in RFC 3986. This ensures that the original database value can be reconstructed from the generated IRI.
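The IRI branch of the term generation rules, including the simple string concatenation with the base IRI, can be sketched in Python. The sketch is deliberately rough: `ABSOLUTE_IRI` is a crude stand-in for the RFC 3987 grammar (a scheme, a colon, and no whitespace), the input is assumed to already be a string in natural RDF lexical form, and the names `generate_iri` and `DataError` are hypothetical.

```python
import re

# Crude approximation of "valid absolute IRI"; NOT the full RFC 3987 grammar.
ABSOLUTE_IRI = re.compile(r"[A-Za-z][A-Za-z0-9+.-]*:\S+")


class DataError(Exception):
    """Raised when no valid IRI can be generated from a value."""


def generate_iri(value, base_iri):
    """Generate an IRI from a value, or None if the value is NULL."""
    if value is None:
        return None
    if ABSOLUTE_IRI.fullmatch(value):
        return value
    # Plain concatenation rather than reference resolution, so the
    # original value remains recoverable from the generated IRI.
    candidate = base_iri + value
    if ABSOLUTE_IRI.fullmatch(candidate):
        return candidate
    raise DataError("cannot generate an IRI from %r" % value)
```

For instance, with the hypothetical base IRI `http://data.example.com/`, the value `emp/7369` yields `http://data.example.com/emp/7369`, and stripping the base IRI prefix recovers the original value.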

A. RDF Terminology (Informative)

This appendix lists some terms normatively defined in other specifications.

The following terms are defined in RDF Concepts and Abstract Syntax [ RDF ] and used in R2RML:

The following terms are defined in SPARQL Query Language for RDF [ SPARQL ] and used in R2RML:

B. Index of R2RML Vocabulary Terms (Informative)

This appendix lists all the classes, properties and other terms defined by this specification within the R2RML vocabulary .

An RDFS representation of the vocabulary is available from the namespace IRI .

B.1 Classes

The following table lists all R2RML classes .

The third column contains minimum conditions that a resource has to fulfil in order to be considered member of the class. Where multiple conditions are listed, all must be fulfilled.

Class Represents Minimum conditions
rr:BaseTableOrView SQL base table or view Having an rr:tableName property
rr:GraphMap graph map Being an rr:TermMap
Being value of an rr:graphMap property
rr:Join join condition Having an rr:parent property
Having an rr:child property
rr:LogicalTable logical table Being one of its subclasses, rr:BaseTableOrView or rr:R2RMLView
rr:ObjectMap object map Being an rr:TermMap
Being value of an rr:objectMap property
rr:PredicateMap predicate map Being an rr:TermMap
Being value of an rr:predicateMap property
rr:PredicateObjectMap predicate-object map Having at least one of rr:predicate and rr:predicateMap
Having at least one of rr:object and rr:objectMap
rr:R2RMLView R2RML view Having an rr:sqlQuery property
rr:RefObjectMap referencing object map Having an rr:parentTriplesMap property
rr:SubjectMap subject map Being an rr:TermMap
Being value of an rr:subjectMap property
rr:TermMap term map Having exactly one of rr:constant , rr:column , rr:template
rr:TriplesMap triples map Having an rr:logicalTable property
Having exactly one of rr:subject and rr:subjectMap

As noted earlier , a single node in an R2RML mapping graph may represent multiple mapping components and thus be typed as several of these classes. However, the following classes are disjoint:

B.2 Properties

The following table lists all properties in the R2RML vocabulary .

The cardinality column indicates how often this property occurs within its context. Note that additional constraints not stated in this table might apply. The actual cardinality of some properties may depend on the presence or absence of other properties, and their values. Properties where this applies are indicated by an exclamation mark.

Property Represents Context Cardinality
rr:child child column join condition 1
rr:class class IRI subject map 0…∞
rr:column column name column-valued term map 1
rr:datatype specified datatype term map 0…1 !
rr:constant constant value constant-valued term map 1
rr:graph constant shortcut property subject map , predicate-object map 0…∞
rr:graphMap graph map subject map , predicate-object map 0…∞
rr:inverseExpression inverse expression term map 0…1 !
rr:joinCondition join condition referencing object map 0…∞
rr:language specified language tag term map 0…1 !
rr:logicalTable logical table triples map 1
rr:object constant shortcut property predicate-object map 1…∞
rr:objectMap object map , referencing object map predicate-object map 1
rr:parent parent column join condition 1
rr:parentTriplesMap parent triples map referencing object map 1
rr:predicate constant shortcut property predicate-object map 1…∞
rr:predicateMap predicate map predicate-object map 1
rr:predicateObjectMap predicate-object map triples map 0…∞
rr:sqlQuery SQL query R2RML view 1
rr:sqlVersion SQL version identifier R2RML view 0…∞
rr:subject constant shortcut property triples map 0…1
rr:subjectMap subject map triples map 1
rr:tableName table or view name SQL base table or view 1
rr:template string template template-valued term map 1
rr:termType term type term map 0…1 !

B.3 Other Terms

Term Denotes Used with property
rr:defaultGraph default graph rr:graph
rr:SQL2008 Core SQL 2008 rr:sqlVersion
rr:IRI IRI rr:termType
rr:BlankNode blank node rr:termType
rr:Literal literal rr:termType

C. References

C.1 Normative References

[DM]
A Direct Mapping of Relational Data to RDF , Alexandre Bertails, Marcelo Arenas, Eric Prud'hommeaux, Juan Sequeda, Editors. World Wide Web Consortium, 27 September 2012. This version is http://www.w3.org/TR/2012/REC-rdb-direct-mapping-20120927/. The latest version is http://www.w3.org/TR/rdb-direct-mapping/.
[RDF]
Resource Description Framework (RDF): Concepts and Abstract Syntax , Graham Klyne, Jeremy J. Carroll, Editors. World Wide Web Consortium, 10 February 2004. This version is http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/. The latest version is http://www.w3.org/TR/rdf-concepts/.
[RFC2119]
Key words for use in RFCs to Indicate Requirement Levels , S. Bradner, March 1997. Internet RFC 2119, http://tools.ietf.org/html/rfc2119.
[RFC3629]
UTF-8, a transformation format of ISO 10646 , F. Yergeau. November 2003. Internet RFC 3629, http://tools.ietf.org/html/rfc3629.
[RFC3986]
Uniform Resource Identifier (URI): Generic Syntax , T. Berners-Lee, R. Fielding, L. Masinter. January 2005. Internet RFC 3986, http://tools.ietf.org/html/rfc3986.
[RFC3987]
Internationalized Resource Identifiers (IRIs) , M. Duerst, M. Suignard. January 2005. Internet RFC 3987, http://tools.ietf.org/html/rfc3987.
[SPARQL]
SPARQL Query Language for RDF , Eric Prud'hommeaux, Andy Seaborne, Editors. World Wide Web Consortium, 15 January 2008. This version is http://www.w3.org/TR/2008/REC-rdf-sparql-query-20080115/. The latest version is http://www.w3.org/TR/rdf-sparql-query/.
[SQL1]
ISO/IEC 9075-1:2008 SQL - Part 1: Framework (SQL/Framework) . International Organization for Standardization, 27 January 2009.
[SQL2]
ISO/IEC 9075-2:2008 SQL - Part 2: Foundation (SQL/Foundation) . International Organization for Standardization, 27 January 2009.
[TURTLE]
Turtle - Terse RDF Triple Language , Eric Prud'hommeaux, Gavin Carothers. World Wide Web Consortium, 10 July 2012. This version is http://www.w3.org/TR/2012/WD-turtle-20120710/. The latest version is http://www.w3.org/TR/turtle/. This document is work in progress.
[XMLSCHEMA2]
W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes , David Peterson, Shudi Gao, Ashok Malhotra, C. M. Sperberg-McQueen, Henry S. Thompson. World Wide Web Consortium, 5 April 2012. This version is http://www.w3.org/TR/2012/REC-xmlschema11-2-20120405/. The latest version is http://www.w3.org/TR/xmlschema11-2/.

C.2 Other References

[SQL14]
ISO/IEC 9075-14:2008 SQL - Part 14: XML-Related Specifications (SQL/XML) . International Organization for Standardization, 27 January 2009.
[SQLIRIS]
SQL Version IRIs , Editors of the W3C RDB2RDF Working Group. Semantic Web Standards wiki. The latest version is http://www.w3.org/2001/sw/wiki/RDB2RDF/SQL_Version_IRIs. This is a public wiki page.
[TC]
R2RML and Direct Mapping Test Cases , Boris Villazón-Terrazas, Michael Hausenblas, Alexander de Leon, Editors. World Wide Web Consortium, 14 August 2012. This version is http://www.w3.org/TR/2012/NOTE-rdb2rdf-test-cases-20120814/. The latest version is http://www.w3.org/TR/rdb2rdf-test-cases/.
[UCNR]
Use Cases and Requirements for Mapping Relational Databases to RDF , Eric Prud'hommeaux, Michael Hausenblas, Editors. World Wide Web Consortium, 8 June 2010. This version is http://www.w3.org/TR/2010/WD-rdb2rdf-ucr-20100608/. The latest version is http://www.w3.org/TR/rdb2rdf-ucr/. This document is work in progress.

D. Acknowledgements (Informative)

The Editors would like to give special thanks to the following contributors: David McNeil greatly improved the quality of the specification with detailed reviews and comments. Nuno Lopes and Eric Prud'hommeaux contributed to the design of the mapping from SQL data values to RDF literals. Eric also worked on the mechanism for SQL compatibility. Boris Villazón-Terrazas drew the diagrams throughout the text, and kept them up-to-date throughout many iterations.

In addition, the Editors gratefully acknowledge contributions from: Marcelo Arenas, Sören Auer, Samir Batla, Alexander de Leon, Orri Erling, Lee Feigenbaum, Enrico Franconi, Howard Greenblatt, Wolfgang Halb, Harry Halpin, Michael Hausenblas, Patrick Hayes, Ivan Herman, Nophadol Jekjantuk, Li Ma, Nan Ma, Ashok Malhotra, Ivan Mikhailov, Percy Enrique Rivera Salas, Juan Sequeda, Ben Szekely, Ted Thibodeau, and Edward Thomas.