
RDF and non-RDF Databases on the Semantic Web

W3C Semantic Web Projects.


Eric Prud'hommeaux, Sanitation Engineer.
Last modified: 2009/01/21 14:33:31
This work is licensed under a Creative Commons Attribution 3.0 License, with attribution to W3C.


Use Case - Clinical Studies

SemWeb Windfalls

Standardized Reasoning

Drug Mapping

X rdfs:subClassOf set1 .
set1 a owl:Restriction .
set1 owl:onProperty may_treat .
set1 owl:someValuesFrom :C0028754 .
=> X a weightLossDrug .
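
The rule above can be read as one forward-chaining step over a set of triples. A minimal sketch in Python, assuming triples are plain string tuples; the names `ex:DrugA`, `ex:DrugB`, `_:set1`, `_:set2`, `:OtherCondition` and the function `infer_weight_loss_drugs` are illustrative and do not come from the slides:

```python
# Triples as (subject, predicate, object) string tuples.
# ex:DrugA, ex:DrugB, _:set2 and :OtherCondition are made-up illustration data.
TRIPLES = {
    ("ex:DrugA", "rdfs:subClassOf", "_:set1"),
    ("_:set1", "rdf:type", "owl:Restriction"),
    ("_:set1", "owl:onProperty", "may_treat"),
    ("_:set1", "owl:someValuesFrom", ":C0028754"),
    ("ex:DrugB", "rdfs:subClassOf", "_:set2"),
    ("_:set2", "rdf:type", "owl:Restriction"),
    ("_:set2", "owl:onProperty", "may_treat"),
    ("_:set2", "owl:someValuesFrom", ":OtherCondition"),
}


def infer_weight_loss_drugs(triples, condition=":C0028754"):
    """Return the inferred (X, rdf:type, weightLossDrug) triples."""
    # Restrictions of the form: may_treat some <condition>.
    restrictions = {
        s for (s, p, o) in triples
        if p == "rdf:type" and o == "owl:Restriction"
        and (s, "owl:onProperty", "may_treat") in triples
        and (s, "owl:someValuesFrom", condition) in triples
    }
    # Every subclass of such a restriction is classified as a weight loss drug.
    return {
        (s, "rdf:type", "weightLossDrug")
        for (s, p, o) in triples
        if p == "rdfs:subClassOf" and o in restrictions
    }
```

A standard OWL reasoner would reach the same conclusion from the class definitions alone; the sketch only hard-codes this one rule to show what is being matched.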

Merged Data Ideal

Many DBs contributing
to a network of data.

Databases as RDF

Query Interface

Triples Transformation

Warning: vapid example:

Triples Transformation

(Virtual) Transformation Pipe

Query the World

Configured query services
federating and distributing
to many DB Interfaces.

Translation layer

CONSTRUCT Expressivity

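   PREFIX mydb: <http://cityhospital.example/dbs>
   PREFIX foaf: <http://xmlns.com/foaf/0.1/>
   # the default prefix ":" names the target vocabulary; its declaration is not shown here
CONSTRUCT { ?o a          :PatientObservation .
            ?o :patient   ?p .
            ?o :doctor    ?d .
            ?d foaf:name  ?dName }

    WHERE { ?o mydb:patient ?p .
            ?o mydb:doctor  ?d .
            ?d mydb:name    ?dName }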
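
What the CONSTRUCT above does can be sketched as two steps: match the WHERE patterns to get variable bindings, then stamp each binding into the template. A minimal Python sketch under stated assumptions: the source rows (`obs1`, `pat7`, `doc3`, `Dr. Example`) and the helper names `match_where` and `construct` are hypothetical, and the join is hand-written for these three patterns rather than being a general SPARQL engine:

```python
# Source triples as they might appear in the mydb: view of the database
# (all identifiers here are made-up illustration data).
SOURCE = [
    ("obs1", "mydb:patient", "pat7"),
    ("obs1", "mydb:doctor", "doc3"),
    ("doc3", "mydb:name", "Dr. Example"),
]

# The CONSTRUCT template, with variables written as "?name".
TEMPLATE = [
    ("?o", "rdf:type", ":PatientObservation"),
    ("?o", ":patient", "?p"),
    ("?o", ":doctor", "?d"),
    ("?d", "foaf:name", "?dName"),
]


def match_where(triples):
    """Yield bindings for the three WHERE patterns, joined on ?o and ?d."""
    for (o, p1, pat) in triples:
        if p1 != "mydb:patient":
            continue
        for (o2, p2, d) in triples:
            if p2 != "mydb:doctor" or o2 != o:
                continue
            for (d2, p3, name) in triples:
                if p3 == "mydb:name" and d2 == d:
                    yield {"o": o, "p": pat, "d": d, "dName": name}


def construct(triples):
    """Instantiate TEMPLATE once per binding and return the new triples."""
    out = set()
    for binding in match_where(triples):
        for pattern in TEMPLATE:
            out.add(tuple(
                binding[term[1:]] if term.startswith("?") else term
                for term in pattern
            ))
    return out
```

Calling `construct(SOURCE)` yields four triples in the target vocabulary, one per template line, for the single observation in the sample data.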

Diabetic Patient Data

SDTM Query

PREFIX sdtm: <http://www.sdtm.org/vocabulary#>
PREFIX spl: <http://www.hl7.org/v3ballot/xml/infrastructure/vocabulary/vocabulary#>

SELECT ?patient ?dob ?sex ?takes ?indicDate # ?indicEnd ?contra
 WHERE {
  ?patient a sdtm:Patient ;
           sdtm:middleName ?middleName ;
           sdtm:dateTimeOfBirth ?dob ;
           sdtm:sex ?sex .

  ?ip      sdtm:subject ?patient ;
           sdtm:standardizedMedicationName ?takes ;
           # nothing by ingredient in sdtm/caBIG/...
           # sdtm:startDateTimeOfMedication ?indicDate ;
           # ; sdtm:endDateTimeOfMedication ?indicEnd
           spl:activeIngredient ?ic .
  ?ic      spl:classCode 6809 .

  OPTIONAL {
    ?cp    sdtm:subject ?patient ;
           sdtm:standardizedMedicationName ?contra ;
           # sdtm:startDateTimeOfMedication ?contraDate ;
           # ; sdtm:endDateTimeOfMedication ?contraEnd
           spl:activeIngredient ?cc .
    ?cc    spl:classCode 11289 .
  }
  FILTER (!BOUND(?cp))
} LIMIT 30
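
The OPTIONAL block followed by FILTER (!BOUND(?cp)) is the usual SPARQL 1.0 idiom for negation: a patient is kept only when no matching ?cp exists. A minimal Python sketch of that selection logic, assuming a flat list of medication records; the patients and medication names are made-up, the function name `select_patients` is hypothetical, and only the class codes 6809 and 11289 come from the query:

```python
# Each record: (patient, medication name, ingredient class code).
# Patients and medication names are illustration data only.
MEDICATIONS = [
    ("patientA", "medX", 6809),
    ("patientB", "medX", 6809),
    ("patientB", "medY", 11289),
    ("patientC", "medY", 11289),
]


def select_patients(meds, wanted=6809, excluded=11289, limit=30):
    """Return (patient, medication) rows for patients on `wanted` with no `excluded` record."""
    # Patients for whom the OPTIONAL pattern would bind ?cp.
    has_excluded = {patient for (patient, _name, code) in meds if code == excluded}
    rows = [
        (patient, name)
        for (patient, name, code) in meds
        if code == wanted and patient not in has_excluded  # FILTER (!BOUND(?cp))
    ]
    return rows[:limit]  # LIMIT 30
```

With the sample records, only `patientA` survives: `patientB` takes both classes and `patientC` never takes the wanted one.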

Pipeline

  cat sdtm.rq | \
  SWtransformer -q - hl7-sdtm.rq | \
  SWtransformer -q - db-hl7.rq -s http://hospital.example/DB/ | \
  mysql -u root DiabeticPatientsDataSet --table

HL7 Query


    

Conspicuously Absent

Performance

Performance windfalls are hard to achieve.

Kudos

Thanks to Lilly and Lincoln Labs for funding this work.