SPARQL Access to SQL Databases

Eric Prud'hommeaux
Sanitation Engineer, W3C
SPARQL, HCLS, RDB2RDF Working Groups

Lee Feigenbaum
VP Technology & Standards, Cambridge Semantics
Co-Chair, W3C SPARQL Working Group

Last modified: 2012-05-05
This work is licensed under a Creative Commons Attribution-Share Alike 3.0 Unported License, with attribution to W3C and Cambridge Semantics.
This talk: <http://w3.org/brief/MjQx>

Introductions/agenda bashing

Value proposition for SQL as RDF
RDBs as RDF
Brief intro to SPARQL
Web of Data
1st evaluation of value proposition
Mapping to common terms
R2RML
SemWeb rules
SPARQL for real
Mapping SPARQL queries to SQL
2nd evaluation of value proposition
More philosophizing

What's in it for us?

most of the world's data
is in documents

machine-useful data
is in RDBs

today's questions
link RDBs

Value proposition for SQL as RDF

Break plumbing barriers between data.
Unambiguously communicate intended interpretation.
Minimize effort for integraion.
Scale knowledge repositories to web size.

Mashups become queries.

SERVICE <http://uu3.org:8888/7tm_receptors>
  {
    ?iuphar  iface:family      ?family .
    ?iuphar  iface:code	       ?code .
    ?iuphar  iface:iupharName  ?iupharNm .
    ?human   iface:iuphar      ?iuphar .
    ?human   iface:geneName    "GABBR1" .
    ?human   iface:entrezGene  ?humanEntrez .
  }
SERVICE <http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&should-sponge=&>

  {
     _:gene  dbp:entrezgene    ?humanEntrez ;
             rdfs:label        ?label ;
    FILTER (lang(?label) = "en")
  }
SERVICE <http://hcls.deri.org/atag/data/gabab_example.html>
  {
     ?topic  rdfs:label        ?label .
     ?post   sioc:topic        ?topic
  }

Data Browsers/Integrators

RDBs as RDF

Goal: put a relational database on the semantic web.

Encourage data sharing.
Simplify plumbing.
Enable structured access to conventional portals:

RDF Data Model

RDF is a data model of graphs of subject, predicate, object triples.
Resources are represented with URIs, which can be abbreviated as prefixed names
Objects can be literals: strings, integers, booleans, etc.
Turtle: a bit of syntax
- URIs: <http://example.com/resource> or prefix:name
- Literals: "plain string" "13.4"^^xsd:float or "string with language"@en
- Triple: pref:subject other:predicate "object" .
- More shortcuts & abbreviations as we go.

Database to RDF

The Direct Mapping represents a relational database as an RDF graph.

Database to RDF

Node names from tuple IDs.
http://someheathprovider.example/Consultation/C1
Arc names from attributes.
Referential Integrity.
Typed values.

Database to RDF

Node names from tuple IDs.
Arc names from attributes.
provider:C1 role:patient provider:P1 .
Referential Integrity.
Typed values.

Database to RDF

Node names from tuple IDs.
Arc names from attributes.
Referential Integrity.
Conveys the relational graph.
Typed values.

Database to RDF

Node names from tuple IDs.
Arc names from attributes.
Referential Integrity.
Typed values.
e.g. ISO 8601 datetimes
provider:P1 role:birthdate "1946-11-03"^^xsd:date .

Scripting

java
python
perl...

Direct Mapping

Employee
ID	fname	addr
7	Bob	18
8	Sue	NULL

Address
ID	city	state
18	Cambridge	MA

Input:

n-ary primary keys
n-ary foreign keys
NULL-y

Output:

decomposable row identifers
decomposable column identifers
minimally escaped
<http://伝言.example/?employee=أكرم,task=R&D>

<Employee/ID-7> rdf:type <Employee> .
<Employee/ID-7> <Employee#ID> 7 .
<Employee/ID-7> <Employee#fname> "Bob" .
<Employee/ID-7> <Employee#addr> 18 .
<Employee/ID-7> <Employee#ref-addr> <Address/ID-18> .
<Employee/ID-8> rdf:type <Employee> .
<Employee/ID-8> <Employee#ID> 8 .
<Employee/ID-8> <Employee#fname> "Sue" .

<Address/ID-18> rdf:type <Address> .
<Address/ID-18> <Address#ID> 18 .
<Address/ID-18> <Address#city> "Cambridge" .
<Address/ID-18> <Address#state> "MA" .

Direct Graph

+ Exposes relational graph.
± Reflects SQL schema.
− Doesn't use popular RDF schema;
e.g. <Employee#fname> vs. foaf:name.
− No control of graph structure.

<Employee/ID-7> rdf:type <Employee> .
<Employee/ID-7> <Employee#ID> 7 .
<Employee/ID-7> <Employee#fname> "Bob" .
<Employee/ID-7> <Employee#addr> 18 .
<Employee/ID-7> <Employee#ref-addr> <Address/ID-18> .
<Employee/ID-8> rdf:type <Employee> .
<Employee/ID-8> <Employee#ID> 8 .
<Employee/ID-8> <Employee#fname> "Sue" .

<Address/ID-18> rdf:type <Address> .
<Address/ID-18> <Address#ID> 18 .
<Address/ID-18> <Address#city> "Cambridge" .
<Address/ID-18> <Address#state> "MA" .

Brief intro to SPARQL

<Employee/ID-7> rdf:type <Employee> .
<Employee/ID-7> <Employee#ID> 7 .
<Employee/ID-7> <Employee#fname> "Bob" .
<Employee/ID-7> <Employee#addr> 18 .
<Employee/ID-7> <Employee#ref-addr> <Address/ID-18> .
<Employee/ID-8> rdf:type <Employee> .
<Employee/ID-8> <Employee#ID> 8 .
<Employee/ID-8> <Employee#fname> "Sue" .

<Address/ID-18> rdf:type <Address> .
<Address/ID-18> <Address#ID> 18 .
<Address/ID-18> <Address#city> "Cambridge" .
<Address/ID-18> <Address#state> "MA" .