Summary
  - World Wide Web Consortium
  - The Semantic Web 
    
      - Motivation
- Standards
- Applications
- Possibilities
 
 
W3C's Mission: Leading the Web to its Full Potential
Founded by Web inventor Tim Berners-Lee in
1994:
  - Providing the Vision to Lead.
- Engineering the Standards that Make the Web Work.
  
    
      |  | 
          From a Web of Documents ...... toward a Web of data and programs that is: 
            
              Meaning-fulon Everythingfor Everyone... and Interoperable, Safe, Evolvable | 
  
(Flyer: US, A4
/ History)
 
Web of Meaning
People and communities have data stores and applications to
share
  - Vision 
    
      - Expand the Web to include more machine-understandable resources
- Enable global interoperability between resources you
        know should interoperaperable as well as those you
        don't yet know should be interoperable
 
Key Web technologies:
  - Web Services: Web of Programs 
    
      - Standards for interactions between programs, linked on the Web
- Easier to Expose and Use services (and data they
        provide)
 
  - Semantic Web: Web of Data 
    
      - Standards for things, relationships and descriptions, linked on the
        Web
- Easier to Understand, Search for, Share,
        Re-Use, Aggregate, Extend information
 
 
What Dogs Understand
Analogy:
What We Say to Dogs
  - cartoon ...
 
- "Stay out of the garbage! Understand, Ginger? Stay out of the
    garbage!"
What Dogs Understand
  - "... blah blah blah blah GINGER blah blah blah
    blah ..."
 
What Computers Understand on Current Web
  
    
      |  | " ... blah blah
        blah <a
        href=http://www.xwz.com/foo.html> link</a>
        blah blah blah . . . ."* | 
  
  - Computers must "understand" more: 
    
      - Not human concept of "understanding"
- Just useful machine processing 
        
      
 
 
Toward Processable Search Semantics
Google:

  - Web links are machine processable
- Minimal semantics assumed: "This" refers to "that" = popularity
- Amazing results from minimal semantics
What if Web pages had more machine processable semantics?
(See Article by Bijan Parsia)
 
Ways to Enable Machine Processing
Smarter Machines
  - Teach computers to infer the meaning of Web content 
    
      - Natural language, image recognition, etc.
 
  - ... this is the Artificial Intelligence approach 
    
      - WARNING: Hard problem!
- This is not what the Semantic Web
        is about
 
Smarter Data
  - Make content easier for machines to find, access and process 
    
      - Express data and meaning in standard machine-readable format
- Support decentralized definition and management
 
  - ... this is what the Semantic Web
    is about
 
Semantic Web Standards
  
    
      |  | Machine-processable, global Web standards: 
          Assigning unambiguous names (URI)Expressing data, including metadata (RDF)Capturing ontologies (OWL) 
          Query, rules, logic, proofs, trust (in progress) Accessible wherever you want it to be ... ... on the Web. | 
  
 
Uniform Resource Identifiers (URI)
  - URIs have two
    different uses: 
    
      - Unambiguous name for something
- Location of a document
 
  - URIs can be used to identify definitions for concepts 
    
      - Especially useful for ontologies & metadata
 
 
Resource Description Framework (RDF)
  - W3C standard (Primer) 
    
      - XML syntax
- Statements about things (concepts, objects, etc.)
- Metadata, plus more
 
  - Descriptive statements expressed as triples: 
    (Subject, Predicate, Object) or (Subject, Property,
    Value) 

  - Most useful, Web-wise, when these are URIs, e.g. for this presentation: 
    
    
  
 
RDF: Flexible Representation
Linking table and tree data ...

 
Web Ontology Language (OWL) +
  - W3C standard (Guide, Overview) 
    
      - Defining concepts & relationships within area of knowledge
 
  - Like DBMS data dictionary or schema, but .. 
    
      - Global, standard syntax based on RDF
- Can define more complex relationships
- Using URIs as keys
- On the Web
 
  - Includes standard vocabulary for describing properties and classes.: 
    
      - Datatypes (e.g., integer, date, string ..)
- Relations (e.g. subclass of, property of ..)
- Cardinality (e.g. exactly one ..)
- Characteristics of properties (e.g. symmetry)
- ... and more
 
 
Standards En Route
  - Logic, proof, trust, etc.
 
Current Web
  - Minimal machine-processable information -- dumb links

 
Semantic Web - Enhancement of Current Web
  - More machine-processable information - applications connected by
    concepts
 
 
___
[ellipses = resources; ellipse/line
color = one data source; x: = one ontology]
 
Semantic Web Applications
  - Pioneer domains 
    
      - Enterprise integration
- Data management
- Life sciences
- Personal and consumer information
 
  - Growing Industry Update 
    
      - Vodafone, IBM, Oracle, Adobe, Nokia, HP, Fujitsu, NEC, many
      others
 
 
Enterprise Integration Today

 
Enterprise Integration on the "RDF Bus"

 
Clients on the "RDF Bus"
New data applications can be built on top of RDF bus, for example:

 
Applications Connected by Concepts

 
Real World: Vodafone
  
    
      |  | 
          Vodafone Live Mobile PortalRDF-based integration of products 
            
              simple descriptive metadata schemeeasy integration of multiple, diverse vendorsflexible maintenance and extensionSearch results (e.g. ringtones, games, pictures, etc.) 
            
              improvement: page views per download decreased 50%increased revenue: for ringtones, up 20% in 2 months* 
                
                  Ringtones market will surpass $500
                    million in retail sales in 2005! | 
  
 
Real World: IBM
  
  
    
      |  | 
          Multiple effort in
            SemanticsDeveloping user-centered tools 
            
              Exploit power of Semantic Web technologies, but hide the
                underlying complexityExtraction, storage, query, editing, annotation,
                visualizationFocus on Life
            Sciences - Potential for transforming the scientific research
            process 
            
              Common representation (RDF), unique identifiers (LSID),
                provenance, collaboration | 
  
 
Real World: Oracle
  
  
    
      |  | 
          Oracle
            10.2 will include an RDF data model to store RDF
          statementsJava Ntriple2NDM converter for loading existing RDF dataAn RDF_MATCH function which can be used in SQL to find graph
            patterns in RDF (similar to SPARQL)Interest in supporting life
            sciences, and other, requirements | 
  
... from S. Stephens, Oracle, June
2005
 
Challenges Ahead
  - Providing information in Semantic-Web-friendly ways 
    
      - Exposing existing data stores as RDF
- Automated and human-assisted tools to create RDF stores,
        ontologies, mappings
- "Chicken and egg problem" en route to "network effect"
 
- Addressing operational and social needs 
    
      - "Worldwide" Web
- Quality, provenance, versioning, explanation, safety
- Privacy, access control, authorization
- Trust
 
  - Making data accessible to people and programs 
    
      - Usable interfaces to masses of semantic data 
        
          - Search, filtering, aggregation, processing, graphics
 
- Access via multiple modes, multiple devices
 
 
Bottom-Up Evolution for SemWeb Global Data Interoperability
  - Explore existing tools (see extra slides)
  - Pick a few community applications and databases
  - Develop key community ontologies
  - Develop shared ontologies (broadly used, linking)
  - Exponential grass-roots growth? 
    
      - More machine accessible data
- Directed and serendipitous discovery
- Better tools
- Killer apps
- Network effects
 
 
Thank You
We are happy to help ...

_
Acknowledgements: Tim Berners-Lee, David
Booth, Eric Miller
Resource slides follow (W3C, Web
Services and Semantic Web tools and applications)
 
International Web Standards Organization
W3C's expanding international basis of
operations (map)
 
W3C's Members Power Our Work
~ 370 Members (png, map history)
  - World's leading technology organizations 
    
      - "Third-class companies make products; second-class companies
        develop technology; first-class companies set standards." -
        saying in China
 
  - Commercial, academic, research institutions, government agencies,
    non-profits
  - Developing Web products, leveraging the Web, for your work, conducting
    Web research, developing specifications built upon W3C's work
 
Values of W3C Membership
  - Leadership 
    
      - Introduce ideas through submissions & workshops
- Influence standards through Working Group participation, review,
        implementation
- Provide strategic direction through seat on Advisory Committee
 
  - Early insight into market trends 
    
      - Access Member & Team technical experts
- Track emerging technologies & markets through
        Member-confidential access
- Implement standards & plan for procurement of new technologies
        ahead of market
 
  - Promoting image as innovator 
    
      - Participate in international media activities, press releases,
        testimonials
- Display organization and logo on W3C site (250K visits/day)
- Display W3C Member logo on organization's page
- Ensure & enjoy benefits of royalty-free implementation of Web
        standards
 
 
W3C Brings the World Together to Engineer the Web
  - Accountable to the Global Public 
    
      - Public lists, some public Interest Groups
- Groups must address all submitted issues
 
 
Developing Standards at W3C
  - Clear and effective Process, which encourages: 
    
      - Member-neutrality
- Consistent architecture 
        
      
- Coordination, consensus and
        interoperability 
        
          - No "rubber stamping"
- Must engineer dependencies (inside and outside of W3C)
 
 
 
Views of the Process: Life of
Working Group, Recommendation Track
 
Some Web Services and Semantic Web References
Standards, specifications, profiles:
Integration of Web Services and Semantic Web
Semantic Web Research and Development Groups
 
Some Web Services Tools
* Most products not yet using new W3C standard versions of SOAP (1.2),
WSDL (2.0), etc., but they will in the future.
 
Some Semantic Web Tools
RDF Environments (interpreters, etc)
  - Jena (from HP;
    for Java, includes OWL reasoning)
- RDFLib (for Python)
- Redland
    (in C, with interfaces to Tcl, Java, PHP, Perl, …)
- Sesame (RDF Schema based repository and querying
    facility)
- cwm (general-purpose data processor for the semantic
  web)
- Protege
    (ontology editor)
- Access to Relational Databases (white paper)
- Annotea
    (annotation of Web documents)
- Longwell (browse and search RDF datasets)
- Semantics tools (from IBM; for storage, manipulation,
    query, and inference of ontologies)
SemWeb Graphical Editors
 
Some Web Service & Semantic Web Applications
Rapid growth of both ...
  - Compilations of Web Service apps: 
    
  
  - Compilations of Semantic Web apps: 
    
  
 
Challenge of Global Data Interoperability
  
    
      |  | 
          Increasingly specialized communities... 
            
              ... yet, important problems require multi-disciplinary and
                multi-organizational approach 
          Massive quantities of data ... 
            
              ... yet, not readily understandable nor accessible outside
                of community 
          Already using the Web ... 
            
              ... yet, mainly as a Web of documents, not as a Web of
                machine-processable data | 
  
 
Today's Web: Searching
Current Situation:
  - Search Web for information on "Love waves"
Issue:
  - Some useful, some not-so-useful hits: 
    
      - "Love Wave Research at Brown"
- "Courtney Love Waves the Red Flag"
- " 'Buy Faith, Hope Love: Waves and Reflections', at Amazon and
      SAVE"
- etc. ...
 
 
Today's Web : Sharing and Aggregating
Current Situation: Earthquake Monitoring:
  - Web site A, B, C ...: fault, earthquake and other geo information
- Programs G, H, I ... : mapping, propagation, site response ...
- Email communications with X, Y, Z ... : gov't emergency authorities
- Want to aggregate data to (for R&D or realtime alerts): 
    
      - compare network locations and compute new ones
 
      - examine correlations of geophysical observations
- interface with authorities re: events and risks
 
Issues:
  - Different databases, attributes, locations, services
- Different jargon, meanings
- Not easy to find, access, process resources
- Must build specialized systems to important, convert, update .. 
    
  
 
What Is the Semantic Web?
"The Semantic Web is an
... extension of the current web in
which
... information is
given
well-definedmeaning,
... better enabling computers and people to work in
cooperation."
___
The Semantic Web 
Tim Berners-Lee, James Hendler and Ora Lassila
Scientific American, May 2001