A Direct Mapping of Relational Data to RDF

1 Introduction

Relational databases proliferate both because of their efficiency and their precise definitions, allowing for tools like SQL [SQLFN] to manipulate and examine the contents predictably and efficiently. Resource Description Framework (RDF) [RDF-concepts] is a data format based on a web-scalable architecture for identification and interpretation of terms. This document defines a mapping from relational representation to an RDF representation.

Strategies for mapping relational data to RDF abound. The direct mapping defines a simple transformation, providing a basis for defining and comparing more intricate transformations. This document includes an informal and a formal description of the transformation.

The Direct Mapping is intended to provide a default behavior for R2RML: RDB to RDF Mapping Language. It can also be used to materialize RDF graphs or define virtual graphs, which can be queried by SPARQL or traversed by an RDF graph API.

2 Direct Mapping Description (Informative)

The direct mapping defines an RDF Graph [RDF-concepts] representation of the data in a relational database. The direct mapping takes as input a relational database (data and schema), and generates an RDF graph that is called the direct graph. The algorithms in this document compose a graph of relative IRIs which must be resolved against a base IRI [RFC3987] to form an RDF graph.

Foreign keys in relational databases establish a reference from any row in a table to exactly one row in a (potentially different) table. The direct graph conveys these references, as well as each value in the row.
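As a minimal sketch of resolving relative IRIs against a base IRI, the following Python uses the standard library function urllib.parse.urljoin. That function applies RFC 3986 reference resolution, which is assumed here to be adequate for the ASCII-only IRIs in this document's examples. The base IRI is the one used in the examples of this document; the helper name resolve is hypothetical.

```python
from urllib.parse import urljoin

# Base IRI borrowed from this document's examples.
BASE_IRI = "http://foo.example/DB/"


def resolve(relative_iri: str, base: str = BASE_IRI) -> str:
    """Resolve a relative IRI from the direct graph against a base IRI."""
    return urljoin(base, relative_iri)


if __name__ == "__main__":
    # A table IRI, a row IRI and a column (property) IRI.
    for relative in ("People", "People/ID-7", "People#fname"):
        print(relative, "->", resolve(relative))
```

Because the base IRI ends with a slash, relative IRIs are appended beneath it rather than replacing its last path segment.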

2.1 Direct Mapping Example

The concepts in direct mapping can be introduced with an example RDF graph produced by a relational database. Following is SQL (DDL) to create a simple example with two tables with single-column primary keys and one foreign key reference between them:

CREATE TABLE "Addresses" (
	"ID" INT, PRIMARY KEY("ID"), 
	"city" CHAR(10), 
	"state" CHAR(2)

)
CREATE TABLE "People" (
	"ID" INT, PRIMARY KEY("ID"), 
	"fname" CHAR(10), 
	"addr" INT, 
	FOREIGN KEY("addr") REFERENCES "Addresses"("ID")

)

INSERT INTO "Addresses" ("ID", "city", "state") VALUES (18, 'Cambridge', 'MA')
INSERT INTO "People" ("ID", "fname", "addr") VALUES (7, 'Bob', 18)
INSERT INTO "People" ("ID", "fname", "addr") VALUES (8, 'Sue', NULL)

HTML tables will be used in this document to convey SQL tables. The primary key of these tables will be marked with the PK class to convey an SQL primary key such as ID in CREATE TABLE "Addresses" ("ID" INT, ... PRIMARY KEY("ID")). Foreign keys will be illustrated with a notation like "→ Addresses(ID)" to convey an SQL foreign key such as CREATE TABLE "People" (... "addr" INT, FOREIGN KEY("addr") REFERENCES "Addresses"("ID")).

People
PK		→ Addresses(ID)
ID	fname	addr
7	Bob	18
8	Sue	NULL

Addresses
PK
ID	city	state
18	Cambridge	MA

Given a base IRI http://foo.example/DB/, the direct mapping of this database produces a direct graph:

@base <http://foo.example/DB/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
<People/ID-7> rdf:type <People> .
<People/ID-7> <People#ID> 7 .
<People/ID-7> <People#fname> "Bob" .
<People/ID-7> <People#addr> 18 .
<People/ID-7> <People#ref-addr> <Addresses/ID-18> .
<People/ID-8> rdf:type <People> .
<People/ID-8> <People#ID> 8 .
<People/ID-8> <People#fname> "Sue" .
<Addresses/ID-18> rdf:type <Addresses> .
<Addresses/ID-18> <Addresses#ID> 18 .
<Addresses/ID-18> <Addresses#city> "Cambridge" .
<Addresses/ID-18> <Addresses#state> "MA" .

In this expression, each row, e.g. (7, "Bob", 18), produces a set of triples with a common subject. The subject is an IRI formed from the concatenation of the base IRI, table name (People), primary key column name (ID) and primary key value (7). The predicate for each column is an IRI formed from the concatenation of the base IRI, table name and the column name. The values are RDF literals formed from the lexical form of the column value. Each foreign key produces a triple with a predicate composed from the table name and the foreign key column names (e.g. <People#ref-addr>). The object of these triples is the row identifier (<Addresses/ID-18>) for the referenced row. Note that these reference row identifiers must coincide with the subject used for the triples generated from the referenced row. The direct mapping does not generate triples for NULL values.

2.2 Foreign keys referencing candidate keys

More complex schemas include composite foreign keys. In this example, the columns deptName and deptCity in the People table reference name and city in the Department table:


CREATE TABLE "Addresses" (
	"ID" INT, 
	"city" CHAR(10), 
	"state" CHAR(2), 
	PRIMARY KEY("ID")

)
CREATE TABLE "Department" (
	"ID" INT, 
	"name" CHAR(10), 
	"city" CHAR(10), 
	"manager" INT, 
	PRIMARY KEY("ID"), 
	UNIQUE ("name", "city")

)
CREATE TABLE "People" (
	"ID" INT, 
	"fname" CHAR(10), 
	"addr" INT, 
	"deptName" CHAR(10), 
	"deptCity" CHAR(10), 
	PRIMARY KEY("ID"), 
	FOREIGN KEY("addr") REFERENCES "Addresses"("ID"), 
	FOREIGN KEY("deptName", "deptCity") REFERENCES "Department"("name", "city") 

)
ALTER TABLE "Department" ADD FOREIGN KEY("manager") REFERENCES "People"("ID")

Following is an instance of this schema:

People
PK		→ Addresses(ID)	→ Department(name, city)
ID	fname	addr	deptName	deptCity
7	Bob	18	accounting	Cambridge
8	Sue	NULL	NULL	NULL

Addresses
PK
ID	city	state
18	Cambridge	MA

Department
PK	Unique Key		→ People(ID)
ID	name	city	manager
23	accounting	Cambridge	8

Per the People table's compound foreign key to Department:

The row in People with deptName="accounting" and deptCity="Cambridge" references a row in Department with a primary key of ID=23.
The predicate for this key is formed from "deptName" and "deptCity", reflecting the order of the column names in the foreign key.
The object of the above predicate is formed from the base IRI, the table name "Department" and the primary key value "ID-23".

In this example, the direct mapping generates the following triples:

@base <http://foo.example/DB/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
<People/ID-7> rdf:type <People> .
<People/ID-7> <People#ID> 7 .
<People/ID-7> <People#fname> "Bob" .
<People/ID-7> <People#addr> 18 .
<People/ID-7> <People#ref-addr> <Addresses/ID-18> .
<People/ID-7> <People#deptName> "accounting" .
<People/ID-7> <People#deptCity> "Cambridge" .
<People/ID-7> <People#ref-deptName.deptCity> <Department/ID-23> .
<People/ID-8> rdf:type <People> .
<People/ID-8> <People#ID> 8 .
<People/ID-8> <People#fname> "Sue" .
<Addresses/ID-18> rdf:type <Addresses> .
<Addresses/ID-18> <Addresses#ID> 18 .
<Addresses/ID-18> <Addresses#city> "Cambridge" .
<Addresses/ID-18> <Addresses#state> "MA" .
<Department/ID-23> rdf:type <Department> .
<Department/ID-23> <Department#ID> 23 .
<Department/ID-23> <Department#name> "accounting" .
<Department/ID-23> <Department#city> "Cambridge" .
<Department/ID-23> <Department#manager> 8 .
<Department/ID-23> <Department#ref-manager> <People/ID-8> .

The triples above that involve deptName, deptCity and Department are generated by considering the new elements in the augmented database. Note:

Although deptName is an attribute of table People that is part of a foreign key, the Literal Triple <People/ID-7> <People#deptName> "accounting" is still generated, because the direct mapping generates a literal triple for every column with a non-NULL value.
The Reference Triple <People/ID-7> <People#ref-deptName.deptCity> <Department/ID-23> is generated by considering a foreign key referencing a candidate key (different from the primary key).

2.3 Multi-column primary keys

Primary keys may also be composite. If, in the above example, the primary key for Department were (name, city) instead of ID, the identifier for the only row in this table would be <Department/name-accounting.city-Cambridge>. The triples involving <Department/ID-23> would be substituted with the following triples:

<People/ID-7> <People#ref-deptName.deptCity> <Department/name-accounting.city-Cambridge> .
<Department/name-accounting.city-Cambridge> rdf:type <Department> .
<Department/name-accounting.city-Cambridge> <Department#ID> 23 .
<Department/name-accounting.city-Cambridge> <Department#name> "accounting" .
<Department/name-accounting.city-Cambridge> <Department#city> "Cambridge" .

2.4 Empty (non-existent) primary keys

If there is no primary key, each row implies a set of triples with a shared subject, but that subject is a blank node. A Tweets table can be added to the above example to keep track of employees' tweets in Twitter:


CREATE TABLE "Tweets" (
	"tweeter" INT,
	"when" TIMESTAMP,
	"text" CHAR(140),
	FOREIGN KEY("tweeter") REFERENCES "People"("ID")

)

The following is an instance of table Tweets :

Tweets
→ People(ID)
tweeter	when	text
7	2010-08-30T01:33	I really like lolcats.
7	2010-08-30T09:01	I take it back.

Given that table Tweets does not have a primary key, each row in this table is identified by a Blank Node. In fact, when translating the above table the direct mapping generates the following triples:

@base <http://foo.example/DB/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
_:a rdf:type <Tweets> .
_:a <Tweets#tweeter> 7 .
_:a <Tweets#ref-tweeter> <People/ID-7> .
_:a <Tweets#when> "2010-08-30T01:33"^^xsd:dateTime .
_:a <Tweets#text> "I really like lolcats." .
_:b rdf:type <Tweets> .
_:b <Tweets#tweeter> 7 .
_:b <Tweets#ref-tweeter> <People/ID-7> .
_:b <Tweets#when> "2010-08-30T09:01"^^xsd:dateTime .
_:b <Tweets#text> "I take it back." .


2.5 Referencing tables with empty primary keys

Rows in tables with no primary key may still be referenced by foreign keys. (Relational database theory tells us that these rows must be unique as foreign keys reference candidate keys and candidate keys are unique across all the rows in a table.) References to rows in tables with no primary key are expressed as RDF triples with blank nodes for objects, where that blank node is the same node used for the subject in the referenced row.

This example includes several foreign keys with mutual column names. For clarity, here is the DDL defining these keys:

CREATE TABLE "Projects" (
	"lead" INT,
		FOREIGN KEY ("lead") REFERENCES "People"("ID"),
		"name" VARCHAR(50), 
		UNIQUE ("lead", "name"), 
		"deptName" VARCHAR(50), 
		"deptCity" VARCHAR(50),
		UNIQUE ("name", "deptName", "deptCity"),
		FOREIGN KEY ("deptName", "deptCity") REFERENCES "Department"("name", "city")

)
CREATE TABLE "TaskAssignments" (
	"worker" INT,
		FOREIGN KEY ("worker") REFERENCES "People"("ID"),
		"project" VARCHAR(50), 
		PRIMARY KEY ("worker", "project"), 
		"deptName" VARCHAR(50), 
		"deptCity" VARCHAR(50),
		FOREIGN KEY ("worker") REFERENCES "People"("ID"),
		FOREIGN KEY ("project", "deptName", "deptCity") REFERENCES "Projects"("name", "deptName", "deptCity"),
		FOREIGN KEY ("deptName", "deptCity") REFERENCES "Department"("name", "city")

)

The following is an instance of the preceding schema:

Projects
Unique key
	Unique key
→ People(ID)		→ Department(name, city)
lead	name	deptName	deptCity
8	pencil survey	accounting	Cambridge
8	eraser survey	accounting	Cambridge

TaskAssignments
PK
	→ Projects(name, deptName, deptCity)
→ People(ID)		→ Department(name, city)
worker	project	deptName	deptCity
7	pencil survey	accounting	Cambridge

In this case, the direct mapping generates the following triples from the preceding tables:

@base <http://foo.example/DB/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .

_:c rdf:type <Projects> .
_:c <Projects#lead> 8 .
_:c <Projects#ref-lead> <People/ID-8> .
_:c <Projects#name> "pencil survey" .
_:c <Projects#deptName> "accounting" .
_:c <Projects#deptCity> "Cambridge" .
_:c <Projects#ref-deptName.deptCity> <Department/ID-23> .
_:d rdf:type <Projects> .
_:d <Projects#lead> 8 .
_:d <Projects#ref-lead> <People/ID-8> .
_:d <Projects#name> "eraser survey" .
_:d <Projects#deptName> "accounting" .
_:d <Projects#deptCity> "Cambridge" .
_:d <Projects#ref-deptName.deptCity> <Department/ID-23> .
<TaskAssignments/worker-7.project-pencil+survey> rdf:type <TaskAssignments> .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#worker> 7 .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#ref-worker> <People/ID-7> .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#project> "pencil survey" .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#deptName> "accounting" .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#deptCity> "Cambridge" .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#ref-deptName.deptCity> <Department/ID-23> .
<TaskAssignments/worker-7.project-pencil+survey> <TaskAssignments#ref-project.deptName.deptCity> _:c .

The absence of a primary key forces the generation of blank nodes, but does not change the structure of the direct graph or names of the predicates in that graph.
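The reuse of a blank node as the object of a reference can be sketched as a lookup keyed by the referenced unique columns. The following Python is only an illustration under stated assumptions: rows are plain dictionaries, None stands for NULL, and the function names (blank_nodes, index_by_key, referenced_node) and the blank node labels are hypothetical.

```python
from itertools import count

# Rows of Projects, a table with no primary key (data from the example above).
PROJECTS = [
    {"lead": 8, "name": "pencil survey", "deptName": "accounting", "deptCity": "Cambridge"},
    {"lead": 8, "name": "eraser survey", "deptName": "accounting", "deptCity": "Cambridge"},
]

# The single row of TaskAssignments.
TASK = {"worker": 7, "project": "pencil survey", "deptName": "accounting", "deptCity": "Cambridge"}


def blank_nodes(rows):
    """Assign one fresh blank node label per row (labels are arbitrary)."""
    ids = count()
    return ["_:p{}".format(next(ids)) for _ in rows]


def index_by_key(rows, nodes, key_columns):
    """Map the values of a unique column list to the node of the row holding them."""
    return {tuple(row[c] for c in key_columns): node for row, node in zip(rows, nodes)}


def referenced_node(row, fk_columns, index):
    """Return the node of the referenced row, or None if any key value is NULL."""
    values = tuple(row[c] for c in fk_columns)
    if any(v is None for v in values):
        return None
    return index.get(values)


if __name__ == "__main__":
    nodes = blank_nodes(PROJECTS)
    index = index_by_key(PROJECTS, nodes, ["name", "deptName", "deptCity"])
    # The object of the reference triple is the same node used as the subject
    # of the "pencil survey" row.
    print(referenced_node(TASK, ["project", "deptName", "deptCity"], index))
```

The point of the index is that subject and object share one node object per row, so the reference stays intact even though the node has no IRI.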

3 Direct Graph Definition

The Direct Graph is a formula for creating an RDF graph from the rows of each table and view in a database schema. A base IRI defines a web space for the IRIs in this graph; for the purposes of this specification, all IRIs are generated by appending to a base. Terms enclosed in <> are defined in the SQL specification [SQLFN].

An SQL table has a set of uniquely-named columns and a set of foreign keys, each mapping a <column name list> to a <unique column list> (a list of columns in some table).

SQL table and column identifiers compose RDF IRIs in the direct graph. These identifiers are separated by the punctuation characters '#', '.', '/' and '-'. All SQL identifiers are escaped following URL-encoding HTML form data except that only the above punctuation and the characters not permitted in RDF IRIs are escaped.

Definition percent-encode: (a subset of HTML5 form dataset encoding):

Replace each PERCENT SIGN character ('%', U+0025) with the string "%25".
For table names, replace each NUMBER SIGN character ('#', U+0023) with the string "%23".
For table names, replace each SOLIDUS character ('/', U+002f) with the string "%2f".
For attribute names, replace each HYPHEN-MINUS character ('-', U+002d) with the string "%2D".
For attribute values, replace each FULL STOP character ('.', U+002e) with the string "%2E".
Replace each SPACE character (U+0020) with the PLUS SIGN character (+, U+002B).
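The replacement steps above can be sketched in Python as follows. This is an illustration rather than a complete implementation: it assumes HYPHEN-MINUS is escaped with its standard percent-encoding "%2D", it does not escape other characters that are not permitted in RDF IRIs, and the function name and the kind labels ("table", "attribute_name", "attribute_value") are hypothetical.

```python
def percent_encode(text: str, kind: str) -> str:
    """Escape an SQL identifier or value for use inside a direct-graph IRI.

    kind selects which punctuation is escaped:
    "table", "attribute_name" or "attribute_value".
    """
    if kind not in ("table", "attribute_name", "attribute_value"):
        raise ValueError("unknown kind: {!r}".format(kind))
    # Escape '%' first so that the escapes added below are not re-escaped.
    out = text.replace("%", "%25")
    if kind == "table":
        out = out.replace("#", "%23").replace("/", "%2f")
    elif kind == "attribute_name":
        out = out.replace("-", "%2D")
    else:
        out = out.replace(".", "%2E")
    # Spaces become '+', as in HTML form encoding.
    return out.replace(" ", "+")


if __name__ == "__main__":
    print(percent_encode("pencil survey", "attribute_value"))
    print(percent_encode("dept-name", "attribute_name"))
```

Escaping the percent sign before anything else is the one ordering constraint; the remaining replacements are independent of each other.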

There is either a blank node or an IRI assigned to each row in a table:

Definition row node:

If the table has a primary key, the row node is a relative IRI obtained by concatenating:
- the percent-encoded form of the table name,
- the SOLIDUS character '/',
- for each column in the primary key, in order:
  - the percent-encoded form of the column name,
  - a HYPHEN-MINUS character '-',
  - the percent-encoded lexical form of the canonical RDF literal representation of the column value as defined in R2RML section 10.2 Natural Mapping of SQL Values,
  - if it is not the last column in the primary key, a FULL STOP character '.'
If the table has no primary key, the row node is a fresh blank node that is unique to this row.
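A rough Python sketch of the row node construction follows. It makes several simplifying assumptions that are not part of this document: str() stands in for the canonical lexical form defined by R2RML, rows are dictionaries, an empty primary key list means the table has no primary key, and the helper names and blank node labels are hypothetical. The escaping helper repeats only the substitutions listed for percent-encode.

```python
import itertools
from typing import Dict, Sequence

# Escapes for the punctuation that separates parts of a row IRI.
ESCAPES = {"#": "%23", "/": "%2f", "-": "%2D", ".": "%2E"}

_blank_ids = itertools.count()


def escape(text: str, chars: str) -> str:
    """Percent-encode '%', the given punctuation characters, and spaces."""
    out = text.replace("%", "%25")
    for ch in chars:
        out = out.replace(ch, ESCAPES[ch])
    return out.replace(" ", "+")


def row_node(table: str, primary_key: Sequence[str], row: Dict[str, object]) -> str:
    """Return a relative IRI for a row, or a fresh blank node label."""
    if not primary_key:
        # No primary key: every call yields a new, unique blank node.
        return "_:b{}".format(next(_blank_ids))
    parts = [
        escape(column, "-") + "-" + escape(str(row[column]), ".")
        for column in primary_key
    ]
    return escape(table, "#/") + "/" + ".".join(parts)


if __name__ == "__main__":
    print(row_node("People", ["ID"], {"ID": 7, "fname": "Bob", "addr": 18}))
    print(row_node("Tweets", [], {"tweeter": 7}))
```

For a composite key the column-value pairs are joined with a full stop, in primary key order.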

A table forms a table IRI:

Definition table IRI: the relative IRI consisting of the percent-encoded form of the table name.

A column in a table forms a literal property IRI:

Definition literal property IRI: the concatenation of:

the percent-encoded form of the table name,
the hash character '#',
the percent-encoded form of the column name.

A foreign key in a table forms a reference property IRI:

Definition reference property IRI: the concatenation of:

the percent-encoded form of the table name,
the string '#ref-',
for each column in the foreign key, in order:
- the percent-encoded form of the column name,
- if it is not the last column in the foreign key, a FULL STOP character '.'
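The table IRI, literal property IRI and reference property IRI can be sketched together in Python. The function names below are hypothetical, and the two small escaping helpers cover only the substitutions listed for percent-encode (with '-' assumed to be escaped as "%2D").

```python
from typing import Sequence


def _escape_table(name: str) -> str:
    """Escape a table name: '%', '#', '/' and spaces."""
    out = name.replace("%", "%25").replace("#", "%23").replace("/", "%2f")
    return out.replace(" ", "+")


def _escape_column(name: str) -> str:
    """Escape a column name: '%', '-' and spaces."""
    return name.replace("%", "%25").replace("-", "%2D").replace(" ", "+")


def table_iri(table: str) -> str:
    return _escape_table(table)


def literal_property_iri(table: str, column: str) -> str:
    return _escape_table(table) + "#" + _escape_column(column)


def reference_property_iri(table: str, columns: Sequence[str]) -> str:
    # Column names are joined with a full stop, in foreign key order.
    return _escape_table(table) + "#ref-" + ".".join(_escape_column(c) for c in columns)


if __name__ == "__main__":
    print(table_iri("People"))
    print(literal_property_iri("People", "fname"))
    print(reference_property_iri("People", ["deptName", "deptCity"]))
```

Keeping the "ref-" prefix separate from the literal property IRI lets a single-column foreign key produce both a literal triple and a reference triple without a predicate clash.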

Any input database with a given schema has a direct graph defined as:

Definition direct graph: the union of the table graphs for each table in a database schema.

Definition table graph: the union of the row graphs for each row in a table.

Definition row graph: an RDF graph consisting of the following triples:

the row type triple.
a reference triple for each <column name list> in a table's foreign keys where none of the column values is NULL.
a literal triple for each column in a table where the column value is non-NULL.

Definition row type triple: an RDF triple with:

subject: the row node for the row.
predicate: the RDF IRI rdf:type.
object: the table IRI for the table name.

Definition literal triple: an RDF triple with:

subject: the row node for the row.
predicate: the literal property IRI for the column.
object: the R2RML natural RDF literal representation of the column value as defined in R2RML section 10.2 Natural Mapping of SQL Values.

Definition reference triple: an RDF triple with:

subject: the row node for the row.
predicate: the reference property IRI for the columns.
object: the row node for the referenced row.
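Putting the definitions together, the following Python sketch computes a direct graph for the two-table example of section 2.1. It is an illustration under stated assumptions, not a reference implementation: the schema is a hypothetical in-memory dictionary, None stands for NULL, identifiers are assumed to need no percent-encoding, literals are left as Python values instead of typed RDF literals, and triples are plain tuples of strings.

```python
from itertools import count

# Hypothetical in-memory form of the example database from section 2.1.
DB = {
    "Addresses": {
        "pk": ["ID"],
        "fks": [],
        "rows": [{"ID": 18, "city": "Cambridge", "state": "MA"}],
    },
    "People": {
        "pk": ["ID"],
        # (foreign key columns, referenced table, referenced columns)
        "fks": [(["addr"], "Addresses", ["ID"])],
        "rows": [
            {"ID": 7, "fname": "Bob", "addr": 18},
            {"ID": 8, "fname": "Sue", "addr": None},
        ],
    },
}


def direct_graph(db):
    """Return the direct graph as a set of (subject, predicate, object) tuples."""
    fresh = count()
    # First pass: assign a row node to every row so references can reuse it.
    nodes = {}
    for name, table in db.items():
        for i, row in enumerate(table["rows"]):
            if table["pk"]:
                key = ".".join("{}-{}".format(c, row[c]) for c in table["pk"])
                nodes[(name, i)] = "<{}/{}>".format(name, key)
            else:
                nodes[(name, i)] = "_:b{}".format(next(fresh))

    triples = set()
    for name, table in db.items():
        for i, row in enumerate(table["rows"]):
            subject = nodes[(name, i)]
            # Row type triple.
            triples.add((subject, "rdf:type", "<{}>".format(name)))
            # Literal triples: one per non-NULL column.
            for column, value in row.items():
                if value is not None:
                    triples.add((subject, "<{}#{}>".format(name, column), value))
            # Reference triples: one per foreign key with no NULL value.
            for columns, ref_table, ref_columns in table["fks"]:
                values = [row[c] for c in columns]
                if any(v is None for v in values):
                    continue
                predicate = "<{}#ref-{}>".format(name, ".".join(columns))
                for j, target in enumerate(db[ref_table]["rows"]):
                    if [target[c] for c in ref_columns] == values:
                        triples.add((subject, predicate, nodes[(ref_table, j)]))
    return triples


if __name__ == "__main__":
    for triple in sorted(direct_graph(DB), key=str):
        print(triple)
```

Row nodes are assigned in a first pass so that a reference triple can point at the same node that serves as the subject of the referenced row, including when that node is a blank node.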

4 References

SPARQL: SPARQL Query Language for RDF, Eric Prud'hommeaux and Andy Seaborne 2008. (See http://www.w3.org/TR/rdf-sparql-query/.)
SQLFW: ISO/IEC 9075-1:2008 SQL – Part 1: Framework (SQL/Framework) International Organization for Standardization, 27 January 2009.
SQLFN: ISO/IEC 9075-2:2008 SQL – Part 2: Foundation (SQL/Foundation) International Organization for Standardization, 27 January 2009.
RDF-concepts: Resource Description Framework (RDF): Concepts and Abstract Syntax, G. Klyne, J. J. Carroll, Editors, W3C Recommendation, 10 February 2004 (See http://www.w3.org/TR/2004/REC-rdf-concepts-20040210/.)
ReuseableIDs: Reusable Identifiers in the RDB2RDF mapping language, Michael Hausenblas and Themis Palpanas, 2009. (See http://esw.w3.org/topic/Rdb2RdfXG/ReusableIdentifier.)
URI: RFC3986 - Uniform Resource Identifier (URI): Generic Syntax (See http://tools.ietf.org/html/rfc3986.)
RFC3987: RFC3987 - Internationalized Resource Identifier (IRIs) (See http://tools.ietf.org/html/rfc3987.)
SQL2SW: Translating SQL Applications to the Semantic Web. Syed Hamid Tirmizi, Juan Sequeda and Daniel Miranker. 2008 (See http://www.springerlink.com/content/mv58805364k31734/.)
DMSurvey: Survey of directly mapping SQL databases to the Semantic Web. Juan Sequeda, Syed Hamid Tirmizi, Oscar Corcho, Daniel P. Miranker. 2011 (See http://journals.cambridge.org/abstract_S0269888911000208.)

A Direct Mapping Algebra (Informative)

A.1 Notations

The RDB and RDF data models make use of the commonly defined Abstract Data Types Set, List and MultiSet, used here as type constructors. For example, Set(A) denotes the type for the sets of elements of type A. We assume that they come with their common operations, such as the function size : Set → Int.

The definitions follow a type-as-specification approach, thus the models are based on dependent types. For example, { s:Set(A) | size(s) ≤ 1 } is a type denoting the sets for elements of type A, such that those sets have at most one element.

The denotational RDF semantics makes use of the set-builder notation for building the RDF sets.

Each production below is given in two alternative syntaxes.

A.2 Relational Data Model

A.2.1 RDB Abstract Data Type (Normative)

[1]	`Database`	::=	`Set(Table) Set( Table )`
[1]	`Database`	::=	`{ Table }`
A relational database is a set of tables.
[2]	`Table`	::=	`(TableName, Header, List(CandidateKey), Set(ForeignKey), Body) ( TableName , Set(( ColumnName , Datatype )), Set( CandidateKey ), Set( PrimaryKey ) \| size() ≤ 1, Set( ForeignKey ), Body )`
[2]	`Table`	::=	`( TableName , { ColumnName → Datatype }, { CandidateKey }, PrimaryKey ?, { ForeignKey }, Body )`
A relation has a name uniquely defining this table in the database; an associative array mapping each column to a ~~header describing the domain of the data;~~ SQL datatype; a potentially empty list of candidate keys, possibly including a primary key; a potentially empty set of foreign keys; a body containing the rows of data.
[3]	`Header Body`	::=	`Set((ColumnName, Datatype)) A header is an associative array MultiSet( Row mapping each column to a SQL datatype. )`
~~[4]~~ [3]	`Body`	::=	`MultiSet(Row) [ Row ]`
A body is a set of ~~(potentially duplicate)~~ potentially duplicate rows.
[4]	`Row`	::=	`Set((ColumnName, CellValue))`
[4]	`Row`	::=	`{ ColumnName → CellValue }`
A row is an associative array mapping each column in the row to a cell value.
[5]	`CellValue`	::=	`Value \| NULL`
[5]	`CellValue`	::=	`Value \| Null`
A cell value is either a lexical value or NULL, denoting the absence of a value.
[6]	`ForeignKey`	::=	`(List(ColumnName), Table, CandidateKey)`
[6]	`ForeignKey`	::=	`{ [ ColumnName ] → ( Table , [ ColumnName ] ) }`
A foreign key constrains the values of a <column name list> to be equivalent (by the SQL `=` operator) to the values of a <unique column list> in some row of the referenced table.
[7]	`PrimaryKey`	::=	`CandidateKey`
A primary key is a candidate key with the additional constraint that none of the columns can have a NULL value.
[8]	`CandidateKey`	::=	`List(ColumnName)`
[8]	`CandidateKey`	::=	`[ ColumnName ]`
A candidate key is an SQL <unique column list> in some table. It requires that no two rows in the table have values for the <unique column list> that are all equivalent (by the SQL `=` operator).
[9]	`Datatype`	::=	`Int \| Float \| Date \| …`
[9]	`Datatype`	::=	`{ INT \| FLOAT \| DATE \| TIME \| TIMESTAMP \| CHAR \| VARCHAR \| STRING }`
A datatype is a common SQL datatype.
[10]	`TableName`	::=	`String`
A table name is a string.
[11]	`ColumnName`	::=	`String`
A column name is a string.

3.2.2 A.2.2 RDB accessor functions (Normative)

[12]	`tablename`	:	`Table → TableName`
Given a table, tablename returns its name.
[13]	`header`	:	`Table → Set((ColumnName, Datatype))`
Given a table, header returns its header.
[14]	`candidateKeys`	:	`Table → List(CandidateKey)`
Given a table, candidateKeys returns the list of candidate keys.
[15]	`primaryKey`	:	`Table → { s:Set(CandidateKey) \| size(s) ≤ 1 }`
Given a table, primaryKey returns a set containing the primary key if it exists, otherwise it returns an empty set.
[16]	`foreignKeys`	:	`Table → Set(ForeignKey)`
Given a table, foreignKeys returns the set of foreign keys.
[17]	`unary`	:	`ForeignKey → Boolean`
Given a foreign key, unary tells if this is a unary foreign key, meaning it has exactly one column.
[18]	`lexicals`	:	`Table → Set({ c:ColumnName \| ! unary(c) })`
Given a table, lexicals returns the set of columns that do not constitute a unary foreign key.
[19]	`body`	:	`Table → Body`
Given a table, body returns its body.
[20]	`datatype`	:	`{ h:Set((ColumnName, Datatype)) } → { c:ColumnName \| ∃ d, (c,d) ∈ h } → { d:Datatype \| (c,d) ∈ h }`
Given a header and a column in this header, datatype returns the datatype associated with this column.
[21]	`table`	:	`{ r:Row } → { t:Table \| r ∈ t }`
Given a row, table returns the table to which this row belongs.
[22]	`value`	:	`{ r:Row } → { a:ColumnName \| a ∈ r } → CellValue`
Given a row and a column in this row, value returns the cell value (can be NULL) for this column.
[23]	`dereference`	:	`{ r:Row } → { fk:ForeignKey \| fk ∈ foreignKeys(table(r)) } → { targetRow:Row \| let (columnNames, targetTable, ck) = fk in targetRow ∈ body(targetTable) and ∀ c_i^fk ∈ columnNames, ∀ c_j^ck ∈ ck, ∀ (c_k^r, v_k^r) ∈ r, ∀ (c_l^target, v_l^target) ∈ targetRow, i = j → c_i^fk = c_k^r → c_j^ck = c_l^target → v_k^r = v_l^target }`
Given a row and a foreign key from the table containing this row, dereference returns the row which is referenced by this foreign key, i.e. the row for which the values of the foreign key's <unique column list> are all equivalent (by the SQL `=` operator) to the values of the foreign key's <column name list> in the given row of the referring table.
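The abstract data type and accessors above can be sketched in Python. This is only an illustrative model, not part of the specification: the class and field names, the use of `None` for NULL, the choice to return `None` when a foreign key value is NULL or unmatched, and the sample rows are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# A row maps each column name to a cell value; None stands in for NULL (assumption).
Row = Dict[str, Optional[str]]


@dataclass
class ForeignKey:
    columns: List[str]          # <column name list> in the referring table
    target: "Table"             # referenced table
    target_columns: List[str]   # <unique column list> in the referenced table


@dataclass
class Table:
    name: str
    header: Dict[str, str]                                   # column name -> SQL datatype
    candidate_keys: List[List[str]] = field(default_factory=list)
    primary_key: Optional[List[str]] = None
    foreign_keys: List[ForeignKey] = field(default_factory=list)
    body: List[Row] = field(default_factory=list)            # a list keeps duplicate rows


def unary(fk: ForeignKey) -> bool:
    """A foreign key is unary when it has exactly one column."""
    return len(fk.columns) == 1


def dereference(row: Row, fk: ForeignKey) -> Optional[Row]:
    """Return the referenced row; None if a key value is NULL or nothing matches."""
    values = [row[c] for c in fk.columns]
    if any(v is None for v in values):
        return None
    for target_row in fk.target.body:
        if [target_row[c] for c in fk.target_columns] == values:
            return target_row
    return None


# Hypothetical sample data shaped like the two-table example of Section 2.1.
addresses = Table("Addresses", {"ID": "INT", "city": "CHAR", "state": "CHAR"},
                  candidate_keys=[["ID"]], primary_key=["ID"],
                  body=[{"ID": "18", "city": "Cambridge", "state": "MA"}])
people = Table("People", {"ID": "INT", "fname": "CHAR", "addr": "INT"},
               candidate_keys=[["ID"]], primary_key=["ID"],
               foreign_keys=[ForeignKey(["addr"], addresses, ["ID"])],
               body=[{"ID": "7", "fname": "Bob", "addr": "18"},
                     {"ID": "8", "fname": "Sue", "addr": None}])
```

Keeping the body as a list rather than a set is what allows duplicate rows, which matters for tables without a primary key.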

3.3 A.3 RDF Data Model (Non-normative)

Per RDF Concepts and Abstract Syntax, an RDF graph is a set of triples of a subject, predicate and object. The subject may be an IRI or a blank node, the predicate must be an IRI and the object may be an IRI, blank node, or an RDF literal.

This section recapitulates for convenience the formal definition of RDF.

[24]	`Graph`	::=	`Set(Triple)`
[24]	`Graph`	::=	`{ Triple }`
An RDF graph is a set of RDF triples.
[25]	`Triple`	::=	`(Subject, Predicate, Object)`
An RDF triple is composed of a subject, predicate and object.
[26]	`Subject`	::=	`IRI \| BlankNode`
A subject is either an IRI or a blank node.
[27]	`Predicate`	::=	`IRI`
A predicate is always an IRI.
[28]	`Object`	::=	`IRI \| BlankNode \| Literal`
An object is either an IRI, a blank node, or a literal.
[29]	`BlankNode`	::=	`RDF blank node`
A blank node is an arbitrary term used only to establish graph connectivity.
[30]	`Literal`	::=	`PlainLiteral \| TypedLiteral`
A literal is either a plain literal or a typed literal.
[31]	`PlainLiteral`	::=	`lexicalForm \| (lexicalForm, languageTag)`
[31]	`PlainLiteral`	::=	`( lexicalForm ) \| ( lexicalForm , languageTag )`
A plain literal has a lexical form and an optional language tag.
[32]	`TypedLiteral`	::=	`(lexicalForm, IRI)`
A typed literal is composed of a lexical form and a datatype IRI.
[33]	`IRI`	::=	`RDF URI-reference as subsequently restricted by SPARQL`
An IRI is an RDF URI reference as subsequently restricted by SPARQL.
[34]	`lexicalForm`	::=	`a Unicode String`
A lexical form is a Unicode string representing a value.

3.4 A.4 Denotational semantics (Normative)

In this model, databases are inhabitants of RDB and they are denoted by mathematical objects living in the RDF domain. This denotational semantics is what we call the Direct Mapping.

The url-encoding function renders strings in a form suitable to insert into IRIs. Data values are expressed in the XML Schema canonical form before url-encoding.

[35]	`ue`	:	`String → String`
[35]	`UE(s)`	=	s percent-encoded.
Replace each PERCENT SIGN character ('%', U+0025) with the string "%25".
For table names, replace each NUMBER SIGN character ('#', U+0023) with the string "%23".
For table names, replace each SOLIDUS character ('/', U+002F) with the string "%2f".
For attribute names, replace each HYPHEN-MINUS character ('-', U+002D) with the string "%2D".
For attribute values, replace each FULL STOP character ('.', U+002E) with the string "%2E".
Replace each SPACE character (U+0020) with the PLUS SIGN character ('+', U+002B).
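The replacement rules can be sketched as a small Python function. The `kind` parameter and its three labels are hypothetical names introduced here to select the context-specific rules. The sketch encodes HYPHEN-MINUS (U+002D) as "%2D", which is the percent-encoded form of that character.

```python
def ue(s: str, kind: str) -> str:
    """Percent-encode s for use inside a direct-mapping IRI.

    kind is one of 'table', 'attribute' or 'value' (labels assumed for this sketch).
    """
    # The percent sign must be escaped first so later escapes are not re-encoded.
    out = s.replace("%", "%25")
    if kind == "table":
        out = out.replace("#", "%23").replace("/", "%2f")
    elif kind == "attribute":
        out = out.replace("-", "%2D")
    elif kind == "value":
        out = out.replace(".", "%2E")
    # Spaces become plus signs in every context.
    return out.replace(" ", "+")
```

Escaping the percent sign before anything else is the one ordering constraint: doing it later would turn an already produced "%23" into "%2523".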
	`⟦ , ⟧ _canon`	:	`( Row , Column ) → String`
[36]	`⟦r, c⟧ _canon`	=	`let v = value(r, c) in let d = header(table(r)) in canonical RDF literal(v, d)`
[36]	`canon(A)`	=	`canonical RDF literal(A)`
The lexical form of the canonical RDF literal representation of the column value, as defined in R2RML section 10.2 Natural Mapping of SQL Values.

Most of the functions defining the Direct Mapping are higher-order functions parameterized by a function φ(r), also written row_node(r), which maps any row to a unique IRI or Blank Node.

[37]	`φ`	:	`∀ db:Database, ∀ r:Row, r ∈ db → if primaryKey(table(r)) ≠ ∅ then ue(tablename(table(r))) + '/' + ue(c₀) + '-' + ue(canon(r, c₀)) + '.' + ⋯ + '.' + ue(c_{n-1}) + '-' + ue(canon(r, c_{n-1})) else a BlankNode unique to r`
[37]	`row_node`	=	`if (pk(R) ≠ ∅) then IRI(UE(R.name) + "/" + (join('.', UE(A.name) + "-" + UE(canon(A))) ∣ A ∈ As)) else a BlankNode unique to r`
If the table has a primary key, the row node is a relative IRI obtained by concatenating: the percent-encoded form of the table name; the SOLIDUS character '/'; and, for each column in the primary key, in order: the percent-encoded form of the column name, a HYPHEN-MINUS character '-', the percent-encoded lexical form of the canonical RDF literal representation of the column value as defined in R2RML section 10.2 Natural Mapping of SQL Values, and, if it is not the last column in the primary key, a FULL STOP character '.'. If the table has no primary key, the row node is a fresh blank node that is unique to this row.
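As a rough illustration of the row node construction, the following Python sketch builds the relative IRI from a primary key or falls back to a fresh blank node label. The `ue` helper here is a simplified stand-in that escapes every separator character in every context, the `_:b<n>` label format is an assumption, and column values are assumed to be already in canonical lexical form.

```python
import itertools
from typing import Dict, List, Optional

_blank_ids = itertools.count()


def ue(s: str) -> str:
    # Simplified stand-in for the url-encoding function (assumption):
    # escapes '%' first, then the characters used as separators.
    for ch, esc in (("%", "%25"), ("/", "%2f"), ("-", "%2D"), (".", "%2E"), (" ", "+")):
        s = s.replace(ch, esc)
    return s


def row_node(table_name: str, primary_key: Optional[List[str]], row: Dict[str, str]) -> str:
    if primary_key:
        # One "column-value" pair per primary key column, joined by FULL STOP.
        parts = [ue(c) + "-" + ue(row[c]) for c in primary_key]
        return ue(table_name) + "/" + ".".join(parts)
    # No primary key: a fresh blank node label on every call.
    return "_:b" + str(next(_blank_ids))
```

Because a blank node label is minted on each call, a caller that needs the same node for the same row more than once would have to cache the result per row.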
	`⟦ ⟧ _tableIRI`	:	`TableName → IRI`
[38]	`⟦t⟧ _tableIRI`	=	`ue(tablename(t))`
[38]	`table_IRI(R)`	=	`IRI(UE(R.name))`
The relative IRI consisting of the percent-encoded form of the table name.
	`⟦ , ⟧ _litcol`	:	`( Row , Column ) → IRI`
[39]	`⟦r, c⟧ _litcol`	=	`ue(tablename(table(r))) + '#' + ue(c)`
[39]	`literal_property_IRI(R, A)`	=	`IRI(UE(R.name) + "#" + UE(A.name))`
The concatenation of: the percent-encoded form of the table name, the hash character '#', the percent-encoded form of the column name.
	`⟦ , ⟧ _refcol`	:	`( Row , ForeignKey ) → IRI`
[40]	`⟦r, fk⟧ _refcol`	=	`let (from, reftable, to) = fk in ue(tablename(table(r))) + '#ref-' + ue(from₀) + '.' + ⋯ + '.' + ue(from_{n-1})`
[40]	`reference_property_IRI(R, As)`	=	`IRI(UE(R.name) + "#ref-" + join('.', UE(A.name)) ∣ A ∈ As)`
The concatenation of: the percent-encoded form of the table name, the string '#ref-', and, for each column in the foreign key, in order: the percent-encoded form of the column name and, if it is not the last column in the foreign key, a FULL STOP character '.'.
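The table IRI and the two property IRIs are plain string concatenations, sketched below in Python. The sketch uses the '#ref-' separator given in the prose description, and its `ue` helper is a simplified stand-in that escapes all separator characters regardless of context; the `Projects` table and its columns are hypothetical.

```python
from typing import List


def ue(s: str) -> str:
    # Simplified stand-in for the url-encoding function (assumption).
    for ch, esc in (("%", "%25"), ("#", "%23"), ("/", "%2f"),
                    ("-", "%2D"), (".", "%2E"), (" ", "+")):
        s = s.replace(ch, esc)
    return s


def table_iri(table_name: str) -> str:
    return ue(table_name)


def literal_property_iri(table_name: str, column: str) -> str:
    return ue(table_name) + "#" + ue(column)


def reference_property_iri(table_name: str, columns: List[str]) -> str:
    # Column names of the foreign key, in order, joined by FULL STOP.
    return ue(table_name) + "#ref-" + ".".join(ue(c) for c in columns)
```

For instance, `reference_property_iri("People", ["addr"])` yields `People#ref-addr`.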

The Direct Mapping is defined by induction on the structure of RDB. Thus it is defined for any relational database. The entry point for the Direct Mapping is the function ⟦ ⟧ ^φ _database, also written direct_graph.

	`⟦ ⟧ ^φ _database`	:	`Database → Graph`
[41]	`⟦db⟧ ^φ _database`	=	`{ triple \| triple ∈ ⟦t⟧ ^φ _table \| t ∈ db }`
[41]	`direct_graph()`	=	`{ table_graph(R) ∣ R ∈ DB }`
The union of the table graphs for each table in a database schema.
	`⟦ ⟧ ^φ _table`	:	`Table → Set(Triple)`
[42]	`⟦t⟧ ^φ _table`	=	`{ triple \| triple ∈ ⟦r⟧ ^φ _row \| r ∈ body(t) }`
[42]	`table_graph(R)`	=	`{ row_graph(T, R) ∣ T ∈ R.Body }`
The union of the row graphs for each row in a table.
	`noNULLs`	:	`Row → ForeignKey → Boolean`
[43]	`noNULLs(r, fk)`	=	`let (columnNames, _, _) = fk in ∀ c ∈ columnNames, value(r, c) ≠ NULL`
[43]	`noNULLs(T, As)`	=	`∄(T(A) = Null ∣ A ∈ As)`
	`⟦ ⟧ ^φ _row`	:	`Row → Set(Triple)`
[44]	`⟦r⟧ ^φ _row`	=	`let s = φ(r) in { ⟦r⟧ ^φ _type } ⋃ { ⟦r, c⟧ ^φ _lex \| value(r, c) ≠ NULL \| c ∈ lexicals(r) } ⋃ { ⟦r, fk⟧ ^φ _ref \| noNULLs(r, fk) \| fk ∈ foreignKeys(table(r)) }`
[44]	`row_graph(T, R)`	=	`{ type_triple(R) } ∪ { literal_triple(R, A) ∣ A ≠ Null ∧ [A] ∉ R.ForeignKeys(T) } ∪ { reference_triple(As, T) ∣ noNULLs(T, As) ∧ As ≠ R.PrimaryKey ∣ As ∈ R.ForeignKeys(T) }`
An RDF graph consisting of the following triples: the row type triple; a reference triple for each <column name list> in a table's foreign keys where none of the column values is NULL; a literal triple for each column in a table where the column value is non-NULL.
	`⟦ ⟧ _type`	:	`( Row ) → Triple`
[45]	`⟦r⟧ _type`	=	`let s = φ(r) in let t = table(r) in let o = ⟦t⟧ _tableIRI in { (s, rdf:type, o) }`
[45]	`type_triple(R)`	=	`triple(row_node(R), rdf:type, table_IRI(R))`
An RDF triple with: subject: the row node for the row. predicate: the RDF IRI `rdf:type`. object: the table IRI for the table name.
	`⟦ , ⟧ _lex`	:	`( Row , Column ) → Triple`
[46]	`⟦r, c⟧ _lex`	=	`let s = φ(r) in let p = ⟦r, c⟧ _litcol in let v = value(r, c) in let d = header(table(r)) in let o = natural RDF literal(v, d) in { (s, p, o) }`
[46]	`literal_triple(R, A)`	=	`triple(row_node(R), literal_property_IRI(R, A), natural RDF literal(A))`
An RDF triple with: subject: the row node for the row. predicate: the literal property IRI for the column. object: the R2RML natural RDF literal representation of the column value as defined in R2RML section 10.2 Natural Mapping of SQL Values.
	`⟦ , ⟧ _ref`	:	`( Row , ForeignKey ) → Triple`
[47]	`⟦r, fk⟧ _ref`	=	`let s = φ(r) in let targetSpec = dereference(r, fk) in let p = ⟦r, fk⟧ _refcol in let o = φ(row(targetSpec)) in (s, p, o)`
[47]	`reference_triple(R, As)`	=	`triple(row_node(R), reference_property_IRI(R, As), row_node(row referenced by (R, As)))`
An RDF triple with: subject: the row node for the row. predicate: the reference property IRI for the columns. object: the row node for the referenced row.
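Putting the definitions of this section together, the following Python sketch computes a direct graph for two small tables. It is a simplification under stated assumptions: the table layout in `TABLES` and the sample rows are hypothetical, every table has a primary key, percent-encoding and datatype handling are omitted (literals are shown as quoted strings), and literal triples are skipped for columns forming a unary foreign key, following the `lexicals` accessor.

```python
RDF_TYPE = "rdf:type"

# Hypothetical sample data shaped like the Direct Mapping Example.
# Each foreign key is (columns, target table, target columns).
TABLES = {
    "Addresses": {"pk": ["ID"], "fks": [],
                  "rows": [{"ID": "18", "city": "Cambridge", "state": "MA"}]},
    "People": {"pk": ["ID"], "fks": [(["addr"], "Addresses", ["ID"])],
               "rows": [{"ID": "7", "fname": "Bob", "addr": "18"},
                        {"ID": "8", "fname": "Sue", "addr": None}]},
}


def row_node(name, row):
    # Row IRI from the primary key; blank nodes are left out of this sketch.
    return name + "/" + ".".join(f"{c}-{row[c]}" for c in TABLES[name]["pk"])


def row_graph(name, row):
    table = TABLES[name]
    s = row_node(name, row)
    triples = {(s, RDF_TYPE, name)}                      # row type triple
    unary_fk_cols = {cols[0] for cols, _, _ in table["fks"] if len(cols) == 1}
    for col, value in row.items():                       # literal triples
        if value is not None and col not in unary_fk_cols:
            triples.add((s, f"{name}#{col}", f'"{value}"'))
    for cols, target, target_cols in table["fks"]:       # reference triples
        if all(row[c] is not None for c in cols):        # noNULLs
            for trow in TABLES[target]["rows"]:
                if [trow[c] for c in target_cols] == [row[c] for c in cols]:
                    triples.add((s, f"{name}#ref-" + ".".join(cols),
                                 row_node(target, trow)))
    return triples


def direct_graph():
    # Union of the row graphs of every row of every table.
    return set().union(*(row_graph(n, r)
                         for n, t in TABLES.items() for r in t["rows"]))
```

The row for Sue has a NULL `addr`, so it contributes a type triple and two literal triples but no reference triple.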

4 B Direct Mapping as Rules (Informative)

In this section, we formally present the Direct Mapping as rules in Datalog syntax, inspired by previous approaches [SQL2SW] [DMSurvey]. The left hand side of each rule is the RDF Triple output. The right hand side of each rule consists of a sequence of predicates from the relational database and built-in predicates. The built-in predicates are divided into four groups. The first group contains some built-in predicates for dealing with repeated rows in a table without a primary key.

card(r, l, k): Given a table name r without a primary key and the list l of values [v₁, ..., v_n] for a row of table r, it returns in k the multiplicity of l in r (that is, k is the number of times row l appears in r)
n ≤ m: This is the usual order on positive integer values (given positive integers n and m, it holds if n is smaller than or equal to m)

The second group contains a predicate to deal with null values.

nonNull(v): Given a value v, it holds if v is not null

The third group of built-in predicates is used to generate IRIs for identifying tables and the columns in a table, and to generate IRIs or blank nodes for identifying each row in a table.

generateTableIRI(r, i): Given a table name r, it generates the table IRI i of r
generateLiteralPropertyIRI(r, a, i): Given a table name r and an attribute name a, it generates the literal property IRI i for a
generateReferencePropertyIRI(r, l, i): Given a table name r and a non-empty list of columns l, it generates the reference property IRI i for l
generateRowIRI(r, l₁, l₂, i): Given a table name r, a non-empty list l₁ of columns and a non-empty list l₂ of values (for the columns in l₁), it generates the row node (or Row IRI) i for the given row
generateRowBlankNode(r, l, n, i): Given a table name r without a primary key, a list l of values for a row of table r and a positive integer n, it generates the row node i for the n-th occurrence of row l in r (which is a Blank Node in this case). It is assumed that n is smaller than or equal to the multiplicity of l in r (that is, if card(r, l, k) holds, then 1 ≤ n ≤ k)

Finally, the fourth group of built-in predicates is used to generate typed literals.

generateTypedLiteral(u, a, r, v): Given a value u, an attribute name a and a table name r, it generates an R2RML natural RDF literal v representation of the column value u, given the type of a in r and as defined in R2RML section 10.2 Natural Mapping of SQL Values .
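The interplay of card and generateRowBlankNode for repeated rows can be sketched in Python as follows. The function names, the `_:<table>_<i>_<n>` label format and the sample rows are assumptions made for the illustration; any scheme that gives each occurrence its own blank node would do.

```python
from collections import Counter
from typing import List, Tuple

RowValues = Tuple[object, ...]


def card(rows: List[RowValues], l: RowValues) -> int:
    """Multiplicity of the row l in the table body rows."""
    return Counter(rows)[l]


def generate_row_blank_nodes(table_name: str, rows: List[RowValues]):
    """Return (row, n, blank node) for each n-th occurrence, 1 <= n <= card."""
    nodes = []
    for i, (l, k) in enumerate(Counter(rows).items()):
        for n in range(1, k + 1):
            nodes.append((l, n, f"_:{table_name}_{i}_{n}"))
    return nodes


# Hypothetical body of a table without a primary key, with one repeated row.
tweets = [("7", "2010-10-29", "Hello"),
          ("7", "2010-10-29", "Hello"),
          ("8", "2010-10-30", "Hi")]
```

Here `card(tweets, tweets[0])` is 2, so the repeated row receives two distinct blank nodes, one per occurrence.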

Throughout the section, boxes containing Direct Mapping rules and examples will appear. These boxes are color-coded. Yellow boxes contain Direct Mapping rules:


This box contains a Direct Mapping rule

Green boxes contain examples of applying the previous Direct Mapping rule:


This box contains examples of applying a Direct Mapping rule

Consider again the example from Section Direct Mapping Example. It should be noticed that in the rules presented in this section, a formula of the form Addresses(X, Y, Z) indicates that the variables X, Y and Z are used to store the values of a row in the three columns of the table Addresses (according to the order specified in the schema of the table, that is, X, Y and Z store the values of ID, city and state, respectively). In particular, uppercase letters like X, Y, Z, S, P and O are used to denote variables. Moreover, double quotes are used in the rules to refer to the string with the name of a table or a column. For example, a formula of the form generateRowIRI("Addresses", ["ID"], [X], S) is used to generate the row node (or Row IRI) for the row of table "Addresses" whose value in the primary key "ID" is the value stored in the variable X. The value of this Row IRI is stored in the variable S.

4.1 B.1 Generating Table Row Type Triples

4.1.1 B.1.1 Table has a primary key

Assume that r is a table with columns a₁, ..., a_m and such that [a_p₁, ..., a_{p_n}] is the primary key of r, where 1 ≤ n ≤ m and 1 ≤ p₁ < ... < p_n ≤ m. Then the following is the direct mapping rule to generate row type triples from r:

Triple(S, "rdf:type", O) ← r(X₁, ..., X_m), generateRowIRI("r", ["a_p₁", ..., "a_{p_n}"], [X_p₁, ..., X_{p_n}], S), generateTableIRI("r", O)

For example, table Addresses in the Direct Mapping Example has columns ID, city and state, and it has column ID as its primary key. Then the following is the direct mapping rule to generate row type triples from Addresses:

Triple(S, "rdf:type", O) ← Addresses(X₁, X₂, X₃), generateRowIRI("Addresses", ["ID"], [X₁], S), generateTableIRI("Addresses", O)

As a second example, consider table Department from the example in Section Foreign keys referencing candidate keys, which has columns ID, name, city and manager, and assume that (name, city) is the multi-column primary key of this table (instead of ID). Then the following is the direct mapping rule to generate row type triples from Department:

Triple(S, "rdf:type", O) ← Department(X₁, X₂, X₃, X₄), generateRowIRI("Department", ["name","city"], [X₂, X₃], S), 
                            generateTableIRI("Department", O)
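Read operationally, the rule for Department is a comprehension over the table's rows. The Python sketch below evaluates it for a single hypothetical row; the helper names mirror the built-in predicates but are plain functions here, and percent-encoding is omitted for brevity.

```python
def generate_row_iri(table, columns, values):
    # Row IRI: table name, '/', then "column-value" pairs joined by '.'.
    return table + "/" + ".".join(f"{c}-{v}" for c, v in zip(columns, values))


def generate_table_iri(table):
    return table


# Hypothetical row with columns (ID, name, city, manager).
departments = [("23", "accounting", "Cambridge", "8")]

# Triple(S, "rdf:type", O) <- Department(X1, X2, X3, X4), generateRowIRI(...), generateTableIRI(...)
type_triples = [
    (generate_row_iri("Department", ["name", "city"], [x2, x3]),
     "rdf:type",
     generate_table_iri("Department"))
    for (x1, x2, x3, x4) in departments
]
```

Only the primary key columns name and city take part in the subject IRI; ID and manager are bound but unused in this rule.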

4.1.2 B.1.2 Table does not have a primary key

Assume that r is a table with columns a₁, ..., a_m and such that r does not have a primary key. Then the following is the direct mapping rule to generate row type triples from r:

Triple(S, "rdf:type", O) ← r(X₁, ..., X_m), card("r", [X₁, ..., X_m], U), V ≤ U, generateRowBlankNode("r", [X₁, ..., X_m], V, S),
                            generateTableIRI("r", O)

For example, table Tweets from Section Empty (non-existent) primary keys has columns tweeter, when and text, and it does not have a primary key. Then the following is the direct mapping rule to generate row type triples from Tweets:

Triple(S, "rdf:type", O) ← Tweets(X₁, X₂, X₃), card("Tweets", [X₁, X₂, X₃], U), V ≤ U, generateRowBlankNode("Tweets", [X₁, X₂, X₃], V, S),
                            generateTableIRI("Tweets", O)

4.2 B.2 Generating Literal Triples

4.2.1 B.2.1 Table has a primary key

Assume that r is a table with columns a₁, ..., a_m and such that [a_p₁, ..., a_{p_n}] is the primary key of r, where 1 ≤ n ≤ m and 1 ≤ p₁ < ... < p_n ≤ m. Then for every a_j (1 ≤ j ≤ m), the direct mapping includes the following rule for r and a_j to generate literal triples:


Triple(S, P, V) ← r(X₁, ..., X_m), nonNull(X_j), generateRowIRI("r", ["a_p₁", ..., "a_{p_n}"], [X_p₁, ..., X_{p_n}], S),
                   generateLiteralPropertyIRI("r", "a_j", P), generateTypedLiteral(X_j, "a_j", "r", V)

For example, table Addresses in the Direct Mapping Example has columns ID, city and state, and it has column ID as its primary key. Then the following are the direct mapping rules to generate literal triples from Addresses:

Triple(S, P, V) ← Addresses(X₁, X₂, X₃), nonNull(X₁), generateRowIRI("Addresses", ["ID"], [X₁], S),
                   generateLiteralPropertyIRI("Addresses", "ID", P), generateTypedLiteral(X₁, "ID", "Addresses", V)
Triple(S, P, V) ← Addresses(X₁, X₂, X₃), nonNull(X₂), generateRowIRI("Addresses", ["ID"], [X₁], S), 
                   generateLiteralPropertyIRI("Addresses", "city", P), generateTypedLiteral(X₂, "city", "Addresses", V)
Triple(S, P, V) ← Addresses(X₁, X₂, X₃), nonNull(X₃), generateRowIRI("Addresses", ["ID"], [X₁], S),
                   generateLiteralPropertyIRI("Addresses", "state", P), generateTypedLiteral(X₃, "state", "Addresses", V)
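The three Addresses rules differ only in the column they pick, so they can be evaluated with one loop. The Python sketch below is an illustration under assumptions: the sample rows are hypothetical, `None` stands for NULL, percent-encoding is omitted, and the datatype table is a deliberately partial stand-in for the R2RML natural mapping (character strings are emitted as plain literals).

```python
# Partial, illustrative SQL-to-XSD table (assumption, not the full R2RML mapping).
XSD_BY_SQL_TYPE = {"INT": "xsd:integer", "DATE": "xsd:date"}


def generate_typed_literal(value, sql_type):
    datatype = XSD_BY_SQL_TYPE.get(sql_type)
    return f'"{value}"^^{datatype}' if datatype else f'"{value}"'


columns = [("ID", "INT"), ("city", "CHAR"), ("state", "CHAR")]
addresses = [(18, "Cambridge", "MA"), (19, "Boston", None)]   # hypothetical rows

literal_triples = []
for row in addresses:
    s = f"Addresses/ID-{row[0]}"                  # generateRowIRI on the primary key
    for (name, sql_type), x in zip(columns, row):
        if x is not None:                         # nonNull(X_j)
            literal_triples.append(
                (s, f"Addresses#{name}", generate_typed_literal(x, sql_type)))
```

The second row has a NULL state, so the nonNull test suppresses its `Addresses#state` triple while the other two columns still produce triples.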

As a second example, consider again table Department from the example in Section Foreign keys referencing candidate keys, which has columns ID, name, city and manager, and assume that (name, city) is the multi-column primary key of this table (instead of ID). Then the following are the direct mapping rules to generate literal triples from Department:

Triple(S, P, V) ← Department(X₁, X₂, X₃, X₄), nonNull(X₁), generateRowIRI("Department", ["name", "city"], [X₂, X₃], S), 
                   generateLiteralPropertyIRI("Department", "ID", P), generateTypedLiteral(X₁, "ID", "Department", V)
Triple(S, P, V) ← Department(X₁, X₂, X₃, X₄), nonNull(X₂), generateRowIRI("Department", ["name", "city"], [X₂, X₃], S), 
                   generateLiteralPropertyIRI("Department", "name", P), generateTypedLiteral(X₂, "name", "Department", V)
Triple(S, P, V) ← Department(X₁, X₂, X₃, X₄), nonNull(X₃), generateRowIRI("Department", ["name", "city"], [X₂, X₃], S), 
                   generateLiteralPropertyIRI("Department", "city", P), generateTypedLiteral(X₃, "city", "Department", V)
Triple(S, P, V) ← Department(X₁, X₂, X₃, X₄), nonNull(X₄), generateRowIRI("Department", ["name", "city"], [X₂, X₃], S), 
                   generateLiteralPropertyIRI("Department", "manager", P), generateTypedLiteral(X₄, "manager", "Department", V)

It is important to notice that a rule is generated for every column, including column manager, even though this column is the only constituent of a foreign key that references a primary key: FOREIGN KEY(manager) REFERENCES People(ID).

4.2.2 B.2.2 Table does not have a primary key

Assume that r is a table with columns a₁, ..., a_m and such that r does not have a primary key. Then for every a_j (1 ≤ j ≤ m), the direct mapping includes the following rule for r and a_j to generate literal triples:


Triple(S, P, V) ← r(X₁, ..., X_m), nonNull(X_j), card("r", [X₁, ..., X_m], U), N ≤ U, generateRowBlankNode("r", [X₁, ..., X_m], N, S), 
                   generateLiteralPropertyIRI("r", "a_j", P), generateTypedLiteral(X_j, "a_j", "r", V)

For example, table Tweets from Section Empty (non-existent) primary keys has columns tweeter, when and text, and it does not have a primary key. Then the following are the direct mapping rules to generate literal triples from Tweets:

Triple(S, P, V) ← Tweets(X₁, X₂, X₃), nonNull(X₁), card("Tweets", [X₁, X₂, X₃], U), N ≤ U, generateRowBlankNode("Tweets", [X₁, X₂, X₃], N, S), 
                   generateLiteralPropertyIRI("Tweets", "tweeter", P), generateTypedLiteral(X₁, "tweeter", "Tweets", V)
Triple(S, P, V) ← Tweets(X₁, X₂, X₃), nonNull(X₂), card("Tweets", [X₁, X₂, X₃], U), N ≤ U, generateRowBlankNode("Tweets", [X₁, X₂, X₃], N, S), 
                   generateLiteralPropertyIRI("Tweets", "when", P), generateTypedLiteral(X₂, "when", "Tweets", V)
Triple(S, P, V) ← Tweets(X₁, X₂, X₃), nonNull(X₃), card("Tweets", [X₁, X₂, X₃], U), N ≤ U, generateRowBlankNode("Tweets", [X₁, X₂, X₃], N, S), 
                   generateLiteralPropertyIRI("Tweets", "text", P), generateTypedLiteral(X₃, "text", "Tweets", V)

It is important to notice that a rule is generated for every column, including column tweeter, even though this column is the only constituent of a foreign key that references a primary key: FOREIGN KEY(tweeter) REFERENCES People(ID).

4.3 B.3 Generating Reference Triples

For each foreign key from a table r ₁ to a table r ₂, one of the following four cases is applied.

4.3.1 B.3.1 Table r ₁ has a primary key and table r ₂ has a primary key

Assume that:

r₁ is a table with columns a₁, ..., a_i and such that [a_p₁, ..., a_{p_j}] is the primary key of r₁, where 1 ≤ j ≤ i and 1 ≤ p₁ < ... < p_j ≤ i
r₂ is a table with columns c₁, ..., c_k and such that [c_q₁, ..., c_{q_m}] is the primary key of r₂, where 1 ≤ m ≤ k and 1 ≤ q₁ < ... < q_m ≤ k
the foreign key indicates that the columns a_s₁, ..., a_{s_n} of r₁ reference the columns c_t₁, ..., c_{t_n} of r₂, where (1) 1 ≤ s₁, ..., s_n ≤ i, (2) 1 ≤ t₁, ..., t_n ≤ k, and (3) n ≥ 1

Then the direct mapping includes the following rule for r₁ and r₂ to generate reference triples:

Triple(S, P, O) ← r₁(X₁, ..., X_i), generateRowIRI("r₁", ["a_p₁", ..., "a_{p_j}"], [X_p₁, ..., X_{p_j}], S), 
                   r₂(Y₁, ..., Y_k), generateRowIRI("r₂", ["c_q₁", ..., "c_{q_m}"], [Y_q₁, ..., Y_{q_m}], O), 
                   nonNull(X_s₁), ..., nonNull(X_{s_n}), X_s₁ = Y_t₁, ...,  X_{s_n} = Y_{t_n},  generateReferencePropertyIRI("r₁", ["a_s₁", ..., "a_{s_n}"], P)

For example, table Addresses in the Direct Mapping Example has columns ID, city and state, where column ID is the primary key. Table People in this example has columns ID, fname and addr, where column ID is the primary key, and it has a foreign key in the column addr that references the column ID in the table Addresses. In this case, the following is the direct mapping rule to generate reference triples:

Triple(S, P, O) ← People(X₁, X₂, X₃), generateRowIRI("People", ["ID"], [X₁], S),  
                   Addresses(Y₁, Y₂, Y₃), generateRowIRI("Addresses", ["ID"], [Y₁], O),  
                   nonNull(X₃), X₃ = Y₁,  generateReferencePropertyIRI("People", ["addr"], P)
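The rule above is in effect a join between People and Addresses on addr = ID, restricted to non-NULL addr values. A minimal Python sketch, assuming hypothetical sample rows and omitting percent-encoding:

```python
# Hypothetical rows: People(ID, fname, addr) and Addresses(ID, city, state).
people = [(7, "Bob", 18), (8, "Sue", None)]
addresses = [(18, "Cambridge", "MA")]

# Triple(S, P, O) <- People(X1, X2, X3), Addresses(Y1, Y2, Y3), nonNull(X3), X3 = Y1, ...
reference_triples = [
    (f"People/ID-{x1}", "People#ref-addr", f"Addresses/ID-{y1}")
    for (x1, x2, x3) in people
    for (y1, y2, y3) in addresses
    if x3 is not None and x3 == y1
]
```

The row for Sue has a NULL addr and therefore yields no reference triple.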

4.3.2 B.3.2 Table r ₁ has a primary key and table r ₂ does not have a primary key

Assume that:

r₁ is a table with columns a₁, ..., a_i and such that [a_p₁, ..., a_{p_j}] is the primary key of r₁, where 1 ≤ j ≤ i and 1 ≤ p₁ < ... < p_j ≤ i
r₂ is a table with columns c₁, ..., c_k, and it does not have a primary key
the foreign key indicates that the columns a_s₁, ..., a_{s_n} of r₁ reference the columns c_t₁, ..., c_{t_n} of r₂, where (1) 1 ≤ s₁, ..., s_n ≤ i, (2) 1 ≤ t₁, ..., t_n ≤ k, and (3) n ≥ 1

Then the direct mapping includes the following rule for r₁ and r₂ to generate reference triples:

Triple(S, P, O) ← r₁(X₁, ..., X_i), generateRowIRI("r₁", ["a_p₁", ..., "a_{p_j}"], [X_p₁, ..., X_{p_j}], S),
                   r₂(Y₁, ..., Y_k), card("r₂", [Y₁, ..., Y_k], U), V ≤ U, generateRowBlankNode("r₂", [Y₁, ..., Y_k], V, O), 
                   nonNull(X_s₁), ..., nonNull(X_{s_n}), X_s₁ = Y_t₁, ...,  X_{s_n} = Y_{t_n},  generateReferencePropertyIRI("r₁", ["a_s₁", ..., "a_{s_n}"], P)

For example, assume that table Addresses in the Direct Mapping Example has columns ID, city and state, and that column ID is a candidate key (instead of a primary key), so that table Addresses does not have a primary key. Moreover, assume that table People in this example has columns ID, fname and addr, it has column ID as its primary key, and it has a foreign key in the column addr to the candidate key ID in the table Addresses. In this case, the following is the direct mapping rule to generate reference triples:

Triple(S, P, O) ← People(X₁, X₂, X₃), generateRowIRI("People", ["ID"], [X₁], S), 
                   Addresses(Y₁, Y₂, Y₃), card("Addresses", [Y₁, Y₂, Y₃], U), V ≤ U, generateRowBlankNode("Addresses", [Y₁, Y₂, Y₃], V, O), 
                   nonNull(X₃), X₃ = Y₁,  generateReferencePropertyIRI("People", ["addr"], P)

4.3.3 B.3.3 Table r ₁ does not have primary key and table r ₂ has a primary key

Assume that:

r₁ is a table with columns a₁, ..., a_i, and it does not have a primary key
r₂ is a table with columns c₁, ..., c_k and such that [c_q₁, ..., c_{q_m}] is the primary key of r₂, where 1 ≤ m ≤ k and 1 ≤ q₁ < ... < q_m ≤ k
the foreign key indicates that the columns a_s₁, ..., a_{s_n} of r₁ reference the columns c_t₁, ..., c_{t_n} of r₂, where (1) 1 ≤ s₁, ..., s_n ≤ i, (2) 1 ≤ t₁, ..., t_n ≤ k, and (3) n ≥ 1

Then the direct mapping includes the following rule for r₁ and r₂ to generate reference triples:

Triple(S, P, O) ← r₁(X₁, ..., X_i), card("r₁", [X₁, ..., X_i], U), V ≤ U, generateRowBlankNode("r₁", [X₁, ..., X_i], V, S), 
                   r₂(Y₁, ..., Y_k), generateRowIRI("r₂", ["c_q₁", ..., "c_{q_m}"], [Y_q₁, ..., Y_{q_m}], O), 
                   nonNull(X_s₁), ..., nonNull(X_{s_n}), X_s₁ = Y_t₁, ...,  X_{s_n} = Y_{t_n},  generateReferencePropertyIRI("r₁", ["a_s₁", ..., "a_{s_n}"], P)

For example, table People in the Direct Mapping Example has columns ID, fname and addr, and it has column ID as its primary key, while table Tweets from Section Empty (non-existent) primary keys has columns tweeter, when and text, does not have a primary key, and has a foreign key in column tweeter that references column ID in table People. In this case, the following is the direct mapping rule to generate Reference Triples:

Triple(S, P, O) ← Tweets(X₁, X₂, X₃), card("Tweets", [X₁, X₂, X₃], U), V ≤ U, generateRowBlankNode("Tweets", [X₁, X₂, X₃], V, S), 
                   People(Y₁, Y₂, Y₃), generateRowIRI("People", ["ID"], [Y₁], O), 
                   nonNull(X₁), X₁ = Y₁, generateReferencePropertyIRI("Tweets", ["tweeter"], P)
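As a rough illustration of how this rule behaves when Tweets contains repeated rows, the sketch below evaluates it in Python. The blank node labels, the IRI shapes and the sample rows are hypothetical and chosen only for this example.

```python
from collections import Counter


def tweets_reference_triples(tweets, people):
    prop = "<Tweets#ref-tweeter>"  # assumed shape for the reference property IRI
    triples = set()
    # Counter gives card("Tweets", row, U) for each distinct row.
    for n, (row, u) in enumerate(Counter(tweets).items(), start=1):
        x1 = row[0]
        if x1 is None:  # nonNull(X1)
            continue
        for y1, _y2, _y3 in people:
            if x1 == y1:  # X1 = Y1
                o = f"<People/ID={y1}>"  # assumed row IRI shape
                # One subject blank node per copy of the row (V <= U).
                for v in range(1, u + 1):
                    triples.add((f"_:Tweets_{n}_{v}", prop, o))
    return triples


# Illustrative sample rows; the first two Tweets rows are exact duplicates.
tweets = [
    (7, "2010-10-12", "Hello"),
    (7, "2010-10-12", "Hello"),
    (None, "2010-10-13", "anon"),
]
people = [(7, "Bob", 18)]
triples = tweets_reference_triples(tweets, people)
```

The two identical Tweets rows produce two distinct subject blank nodes that both point to the same People row IRI, which is the purpose of the card and V ≤ U atoms in the rule.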

B.3.4 Table r₁ does not have a primary key and table r₂ does not have a primary key

Assume that:

r₁ is a table with columns a₁, ..., a_i, and it does not have a primary key
r₂ is a table with columns c₁, ..., c_k, and it does not have a primary key
the foreign key indicates that the columns a_s₁, ..., a_{s_n} of r₁ reference the columns c_t₁, ..., c_{t_n} of r₂, where (1) 1 ≤ s₁, ..., s_n ≤ i, (2) 1 ≤ t₁, ..., t_n ≤ k, and (3) n ≥ 1

Then the direct mapping includes the following rule for r₁ and r₂ to generate Reference Triples:

Triple(S, P, O) ← r₁(X₁, ..., X_i), card("r₁", [X₁, ..., X_i], U₁), V₁ ≤ U₁, generateRowBlankNode("r₁", [X₁, ..., X_i], V₁, S), 
                   r₂(Y₁, ..., Y_k), card("r₂", [Y₁, ..., Y_k], U₂), V₂ ≤ U₂, generateRowBlankNode("r₂", [Y₁, ..., Y_k], V₂, O), 
                   nonNull(X_s₁), ..., nonNull(X_{s_n}), X_s₁ = Y_t₁, ...,  X_{s_n} = Y_{t_n},  generateReferencePropertyIRI("r₁", ["a_s₁", ..., "a_{s_n}"], P)

For example, assume that table People in the Direct Mapping Example has columns ID, fname and addr, and that column ID is a candidate key (instead of a primary key), so that People does not have a primary key. Moreover, assume that table Tweets from Section Empty (non-existent) primary keys has columns tweeter, when and text, does not have a primary key, and has a foreign key in column tweeter that references candidate key ID in table People. In this case, the following is the direct mapping rule to generate Reference Triples:


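Triple(S, P, O) ← Tweets(X₁, X₂, X₃), card("Tweets", [X₁, X₂, X₃], U₁), V₁ ≤ U₁, generateRowBlankNode("Tweets", [X₁, X₂, X₃], V₁, S),
                   People(Y₁, Y₂, Y₃), card("People", [Y₁, Y₂, Y₃], U₂), V₂ ≤ U₂, generateRowBlankNode("People", [Y₁, Y₂, Y₃], V₂, O),
                   nonNull(X₁), X₁ = Y₁, generateReferencePropertyIRI("Tweets", ["tweeter"], P)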
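The general rule for two tables without primary keys can be sketched as a small generic Python function, shown here applied to the Tweets and People example. This is an illustrative reading under stated assumptions: the function and parameter names, the blank node label format, the `<Table#ref-col;col>` property IRI shape and the sample rows are all hypothetical.

```python
from collections import Counter


def blank_nodes(table, rows):
    # One label per copy of each distinct row: card(table, row, U), V <= U.
    return {
        row: [f"_:{table}_{n}_{v}" for v in range(1, u + 1)]
        for n, (row, u) in enumerate(Counter(rows).items(), start=1)
    }


def reference_triples(r1, rows1, fk_names, fk_idx, r2, rows2, ref_idx):
    # fk_idx: positions of the referencing columns a_s1..a_sn in r1 rows.
    # ref_idx: positions of the referenced columns c_t1..c_tn in r2 rows.
    prop = f"<{r1}#ref-{';'.join(fk_names)}>"  # assumed property IRI shape
    subjects = blank_nodes(r1, rows1)
    objects = blank_nodes(r2, rows2)
    triples = set()
    for x, s_nodes in subjects.items():
        fk = [x[i] for i in fk_idx]
        if any(v is None for v in fk):  # nonNull(X_s1), ..., nonNull(X_sn)
            continue
        for y, o_nodes in objects.items():
            if fk == [y[j] for j in ref_idx]:  # X_s1 = Y_t1, ..., X_sn = Y_tn
                triples.update((s, prop, o) for s in s_nodes for o in o_nodes)
    return triples


# Illustrative sample rows; the first two Tweets rows are exact duplicates.
tweets = [
    (7, "2010-10-12", "Hello"),
    (7, "2010-10-12", "Hello"),
    (None, "2010-10-13", "anon"),
]
people = [(7, "Bob", 18), (8, "Sue", None)]
triples = reference_triples("Tweets", tweets, ["tweeter"], [0], "People", people, [0])
```

Both the subject and the object are blank nodes here. Since ID is a candidate key of People, each People row occurs once, so the number of triples equals the number of Tweets rows (counting duplicates) with a non-null tweeter value.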

A Direct Mapping of Relational Data to RDF

W3C Editor's Draft 25 January 2012

Abstract

Status of this Document

Table of Contents

1 Introduction

2 Direct Mapping Description (Informative)

2.1 Direct Mapping Example

2.2 Foreign keys referencing candidate keys

2.3 Multi-column primary keys

2.4 Empty (non-existent) primary keys

2.5 Referencing tables with empty primary keys

3 Direct Graph Definition

4 References

Appendices

A Direct Mapping Algebra (Informative)

A.1 Notations

A.2 Relational Data Model

A.2.1 RDB Abstract Data Type (Normative)

A.2.2 RDB accessor functions (Normative)

A.3 RDF Data Model (Non-normative)

A.4 Denotational semantics (Normative)

B Direct Mapping as Rules (Informative)

B.1 Generating Table Row Type Triples

B.1.1 Table has a primary key

B.1.2 Table does not have a primary key

B.2 Generating Literal Triples

B.2.1 Table has a primary key

B.2.2 Table does not have a primary key

B.3 Generating Reference Triples

B.3.1 Table r₁ has a primary key and table r₂ has a primary key

B.3.2 Table r₁ has a primary key and table r₂ does not have a primary key

B.3.3 Table r₁ does not have a primary key and table r₂ has a primary key

B.3.4 Table r₁ does not have a primary key and table r₂ does not have a primary key
