Cover page images (keys)

State of the Semantic Web

Tampere, 4 April, 2007

Ivan Herman, W3C

What will I talk about?

 

Let us look at some results first!

The basics: RDF(S)

The basics: RDF(S) (cont.)

Ontologies: OWL

Ontologies

Vocabularies

A mix of vocabularies/ontologies (from life sciences)…

diagram showing a large number of HC related ontologies bound via a RFD-like graph

Ontologies, Vocabularies

Querying RDF: SPARQL

SPARQL as the only interface to RDF data?

SELECT ?translator ?translationTitle ?originalTitle ?originalDate
FROM <http://…/TR_and_Translations.rdf>
WHERE {
   ?trans rdf:type trans:Translation;
		  trans:translationFrom ?orig;
		  trans:translator      [ contact:fullName ?translator ];
		  dc:language           "fr";
		  dc:title              ?translationTitle.
   ?orig  rdf:type rec:REC;
		  dc:date               ?originalDate;
		  dc:title              ?originalTitle.
}
ORDER BY ?translator ?originalDate

A word of warning on SPARQL…

Of course, not everything is so rosy…

How to get RDF data?

Data may be around already…

Data may be extracted (a.k.a. “scraped”)

Getting structured data to RDF: GRDDL

Getting structured data to RDF: RDFa

GRDDL & RDFa example: Ivan’ home page…

screen dump of Ivan's home page at W3C

…marked up with GRDDL headers…

source highlighting the GRDDL headers

…and hCard microformat tags…

source highlighting the GRDDL headers

…yielding; …

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:dataview="http://www.w3.org/2003/g/data-view#"
         xml:base="http://www.w3.org/People/Ivan/">
   <c:Vcalendar xmlns:r="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
                xmlns:c="http://www.w3.org/2002/12/cal/icaltzd#"
                xmlns:h="http://www.w3.org/1999/xhtml">
      <c:prodid>-//connolly.w3.org//palmagent 0.6 (BETA)//EN</c:prodid>
      <c:version>2.0</c:version>
      <c:component>
         <c:Vevent r:about="#ac06">
            <summary xmlns="http://www.w3.org/2002/12/cal/icaltzd#" xml:lang="en">W3C@10, 
				W3C AC Meeting and W3C Team day</summary>
            <dtstart xmlns="http://www.w3.org/2002/12/cal/icaltzd#"
                     r:datatype="http://www.w3.org/2001/XMLSchema#date">2006-11-28</dtstart>
            <dtend xmlns="http://www.w3.org/2002/12/cal/icaltzd#"
                   r:datatype="http://www.w3.org/2001/XMLSchema#date">2006-12-03</dtend>
            <url xmlns="http://www.w3.org/2002/12/cal/icaltzd#"
                 r:resource="http://www.w3.org/Member/Meeting/2006ac/November/"/>
            <location xmlns="http://www.w3.org/2002/12/cal/icaltzd#" xml:lang="en">Tokyo, Japan</location>
            <geo xmlns="http://www.w3.org/2002/12/cal/icaltzd#" r:parseType="Resource">
               <r:first r:datatype="http://www.w3.org/2001/XMLSchema#double">35.670685</r:first>
               <r:rest r:parseType="Resource">
                  <r:first r:datatype="http://www.w3.org/2001/XMLSchema#double">139.770813</r:first>
                  <r:rest r:resource="http://www.w3.org/1999/02/22-rdf-syntax-ns#nil"/>
               </r:rest>
            </geo>
        </c:Vevent>
      </c:component>
      …

(see the full file if interested…)

…marked up with RDFa tags…

source highlighting the RDFa tags

…yielding; …

<rdf:RDF xmlns:foaf="http://xmlns.com/foaf/0.1/" 
		                      xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"  
		                      xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" >
  <foaf:Person rdf:about="http://www.w3.org/People/Ivan/#me">
    <foaf:mbox rdf:resource="mailto:ivan@w3.org"/>
    <foaf:workInfoHomepage rdf:resource="http://www.w3.org/Consortium/Offices"/>
    <foaf:workInfoHomepage rdf:resource="http://www.iw3c2.org"/>
    <foaf:workInfoHomepage rdf:resource="http://www.w3.org/2001/sw"/>
    <foaf:name>Ivan Herman</foaf:name>
    <foaf:workplaceHomepage rdf:resource="http://www.w3.org"/>
    <foaf:schoolHomepage rdf:resource="http://www.elte.hu/"/>
      …

(see the full file if interested…)

SPARQL-ing such data

SELECT DISTINCT ?name ?home
          ?orgRole ?orgName ?orgHome
# Get RDFa from my home page:
FROM <http://www.w3.org/People/Ivan/>
# GRDDL-ing http://www.w3.org/Member/Mail:
FROM <http://www.w3.org/Member/Mail/>
WHERE {
?foafPerson  foaf:mbox ?mail;
             foaf:homepage ?home.
             ?individual  contact:mailbox ?mail;  
             contact:fullName ?name.
?orgUnit ?orgRole ?individual;  
          org:name ?orgName; 
          contact:homePage ?orgHome.
}

Linking to SQL

SPARQL as a unifying point?

diagram showing a sparql that can be connected to an rdf datafile, a document via grddl, and to a database via an sparql/sql bridge

Missing features, functionalities…

Rules

Some typical use cases

In an ideal World…

diagram showing star-like format of ellipses representing rule systems, all with dual arrows connected to a box stating 'full RIF format'

In the real World…

RIF “core”: only partial interchange

diagram showing star-like format of ellipses each with a yellow box core, and a separate core box in the middle to which all connect

RIF “variants”

like the core diagram, but each core is surrounded by different sized rectangles

Possible variants: F-logic, production rules, fuzzy logic systems, …; none of these have been finalized yet

Role of variants

core figure
core figures plus a cloud of rule systems core figures plus a cloud of rule systems clustered around variants with exchange arrows core figures plus a cloud of rule systems clustered around variants with exchange arrows

“Light” ontologies

Revisions of RDF and OWL?

Revision of the RDF model?

These are just ideas floating around…

Revision of OWL? (OWL 1.1)

Other items…

(Need a new PhD topic? wink smiley)

A major problem: messaging

RDF RDF/XML!

RDF RDF/XML! (cont.)

RDF is not that complex…

Semantic Web Ontologies on the Web!

Web 2.0 and Semantic Web are not antagonistic…

SW Ontologies some central, big ontology!

Remember?

diagram showing a large number of HC related ontologies bound via a RFD-like graph

Semantic Web an academic research only!

May start with small communities

Some RDF deployment areas

Library metadata Defense Life sciences
Problem to solve? single-domain integration yes, serious data integration needs yes, connections among genetics, proteomics, clinical trials, regulatory,…
Willingness to adopt? yes: OCLC push and Dublin Core Initiative(*) yes: funded early DAML (OWL) work yes: intellectual level high, much modeling done already.
Motivation light strong very strong
Links to other library data phone calls records, etc chemistry, regulatory, medical, etc

(*) note that the Dublin Core Initiative’s work go way beyond digital libraries these days

Some RDF deployment areas (cont)

The “corporate” landscape is moving

Data integration

Life Sciences (cont.)

Left side: data silos, each its own representation on a screen, with scientist interpreting; right side: same silos, converted to rdf and co, scientist doing data right away.

General approach

  1. Map the various data onto RDF
    • assign URI-s to your data
    • “mapping” may mean on-the-fly SPARQL to SQL conversion, “scraping”, etc
  2. Merge the resulting RDF graphs (with a possible help of ontologies, rules, etc, to combine the terms)
  3. Start making queries on the whole!

Example: antibodies demo

Antibodies' demo screen dump

There has been lots of R&D

MuseoSuomi Application dump Traditional Chinese medicine example dump

Example: ontology controlled annotation

Pfizer's application

Portals

Vodafone screen dump

Improved Search via Ontology: GoPubMed

GoPubMed Application dump

Baby CareLink

Baby care link application dump

Other Application Areas Come to the Fore

 

Thank you for your attention!

These slides are publicly available on:

http://www.w3.org/2007/Talks/0403-Tampere-IH/

in XHTML and PDF formats; the XHTML version has active links that you can follow