Semantic data extractor - QA @ W3C

Quick Introduction

This tool, driven by an XSLT stylesheet, tries to extract information from a semantically rich HTML document. It only uses information made available through proper use of the semantics defined in HTML.

The aim is to show that semantically rich HTML adds value to your code: it allows better use of CSS and makes your HTML intelligible to a wider range of user agents (especially search engine bots).

As an aside, it can give user agent developers clues about hooks that could be interesting to add to their products.

On-line service

A demonstration of the service is available online; it relies on both the W3C XSLT Servlet and the online Tidy service.

More Semantics?

If you have suggestions for improving this XSLT, please send patches to public-qa-dev@w3.org.

Valid XHTML 1.0! Created Date: 2006-11-15 by Dominique Hazaël-Massieux
Last modified $Date: 2011/06/30 10:35:53 $ by $Author: dom $
Semantic data extraction for "<xsl:value-of select="/html:html/html:head/html:title"/>"

Extracted data

Sorry, this tool is meant to be run on (X)HTML documents only.

Generic metadata

Title
Author
Description
Contact information
Language code
Explicit language annotations within the document
HTML Profile
If that profile is GRDDL-enabled, you can see the RDF/XML extracted from it
Embedded RDFa data
The document uses RDFa to embed additional data; see the RDF/XML extracted from it
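
As a rough illustration of how the generic metadata listed above can be read from ordinary HTML markup, here is a minimal Python sketch. It is not the tool's XSLT: it assumes the conventional hooks (the title element, meta elements named "author" and "description", a link with rev="made" for contact information, and the lang attribute on the root element), and the names MetadataExtractor, extract_metadata and SAMPLE are hypothetical.

```python
from html.parser import HTMLParser


class MetadataExtractor(HTMLParser):
    """Collect generic metadata from conventional HTML hooks (illustrative only)."""

    def __init__(self):
        super().__init__()
        self.metadata = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "html":
            # Language code declared on the root element.
            lang = attrs.get("lang") or attrs.get("xml:lang")
            if lang:
                self.metadata["language"] = lang
        elif tag == "title":
            self._in_title = True
            self.metadata["title"] = ""
        elif tag == "meta":
            name = (attrs.get("name") or "").lower()
            if name in ("author", "description") and attrs.get("content"):
                self.metadata[name] = attrs["content"]
        elif tag == "link":
            # Assumption: rev="made" points at the author's contact address.
            if "made" in (attrs.get("rev") or "").lower().split() and attrs.get("href"):
                self.metadata["contact"] = attrs["href"]

    def handle_data(self, data):
        if self._in_title:
            self.metadata["title"] += data

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self.metadata["title"] = self.metadata["title"].strip()


def extract_metadata(html_text):
    """Return a dict of the metadata found in html_text."""
    parser = MetadataExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.metadata


# Hypothetical sample page used only for demonstration.
SAMPLE = """<html lang="en">
<head>
<title>Example page</title>
<meta name="author" content="Jane Doe">
<meta name="description" content="A short demo.">
<link rev="made" href="mailto:jane@example.org">
</head>
<body><p>Hello</p></body>
</html>"""

if __name__ == "__main__":
    for key, value in extract_metadata(SAMPLE).items():
        print(f"{key}: {value}")
```

A page that omits one of these hooks simply yields no entry for it, which is the point the tool makes: the data is only available when the markup carries it.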

Related resources

Translations
Alternate formats
Starting page
Next page
Previous page
Table of contents
Index
Glossary
Copyright
"Chapters"
Sections
Subsections
Appendix
Help
Bookmarkable points
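
The related resources above correspond to the link types defined in HTML 4.01 (start, next, prev, contents, and so on). The following Python sketch shows one way such links could be collected. It is an illustration rather than the tool's stylesheet, and it assumes that an "alternate" link carrying hreflang denotes a translation while any other "alternate" link denotes another format. LINK_TYPES, LinkExtractor, extract_links and SAMPLE are hypothetical names.

```python
from html.parser import HTMLParser

# HTML 4.01 link types mapped to the labels used in the list above.
LINK_TYPES = {
    "start": "Starting page",
    "next": "Next page",
    "prev": "Previous page",
    "contents": "Table of contents",
    "index": "Index",
    "glossary": "Glossary",
    "copyright": "Copyright",
    "chapter": "Chapters",
    "section": "Sections",
    "subsection": "Subsections",
    "appendix": "Appendix",
    "help": "Help",
    "bookmark": "Bookmarkable points",
}


class LinkExtractor(HTMLParser):
    """Collect (label, href) pairs from link and a elements with a rel attribute."""

    def __init__(self):
        super().__init__()
        self.resources = []

    def handle_starttag(self, tag, attrs):
        if tag not in ("link", "a"):
            return
        attrs = dict(attrs)
        href = attrs.get("href")
        if not href:
            return
        # rel may hold several space-separated link types.
        for rel in (attrs.get("rel") or "").lower().split():
            if rel == "alternate":
                # Assumption: hreflang marks a translation, otherwise another format.
                label = "Translations" if attrs.get("hreflang") else "Alternate formats"
            else:
                label = LINK_TYPES.get(rel)
            if label:
                self.resources.append((label, href))


def extract_links(html_text):
    """Return the related resources found in html_text, in document order."""
    parser = LinkExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.resources


# Hypothetical sample markup used only for demonstration.
SAMPLE = """<head>
<link rel="next" href="page2.html">
<link rel="alternate" hreflang="fr" href="index.fr.html">
<link rel="alternate" type="application/pdf" href="index.pdf">
<link rel="stylesheet" href="style.css">
</head>"""

if __name__ == "__main__":
    for label, href in extract_links(SAMPLE):
        print(f"{label}: {href}")
```

Link types outside the table, such as stylesheet, are ignored in this sketch.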

Defined terms

The following terms are defined in the given HTML page:

Abbreviations and Acronyms

The following abbreviations and/or acronyms are used in the given HTML page:

standing for
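
To make the idea concrete, the sketch below pairs each abbr or acronym element with the expansion given in its title attribute. It is a hedged Python illustration, not the tool's XSLT; AbbreviationExtractor, extract_abbreviations and SAMPLE are hypothetical names, and elements without a title attribute are skipped because they carry no expansion.

```python
from html.parser import HTMLParser


class AbbreviationExtractor(HTMLParser):
    """Map the text of abbr/acronym elements to the expansion in their title."""

    def __init__(self):
        super().__init__()
        self.abbreviations = {}
        self._title = None   # expansion of the element currently open, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in ("abbr", "acronym"):
            self._title = dict(attrs).get("title")
            self._text = []

    def handle_data(self, data):
        if self._title is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in ("abbr", "acronym") and self._title is not None:
            self.abbreviations["".join(self._text).strip()] = self._title
            self._title = None


def extract_abbreviations(html_text):
    """Return a dict of abbreviation -> expansion found in html_text."""
    parser = AbbreviationExtractor()
    parser.feed(html_text)
    parser.close()
    return parser.abbreviations


# Hypothetical sample markup used only for demonstration.
SAMPLE = (
    '<p><abbr title="World Wide Web Consortium">W3C</abbr> publishes '
    '<acronym title="Extensible Stylesheet Language Transformations">XSLT</acronym>. '
    "<abbr>QA</abbr> has no expansion here.</p>"
)

if __name__ == "__main__":
    for short, expansion in extract_abbreviations(SAMPLE).items():
        print(f"{short} standing for {expansion}")
```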

Citations and quotes

The following quotes and citations appear in this page:

References were found to the following sources:

Outline of the document

No top-level heading (h1) found, no outline extracted.
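
A document outline of this kind can be derived from the heading elements h1 to h6. The Python sketch below is a simplified illustration under that assumption, not the tool's actual algorithm: it indents each heading by its level and, mirroring the message above, returns nothing when no h1 is present. HeadingCollector and outline are hypothetical names.

```python
import re
from html.parser import HTMLParser


class HeadingCollector(HTMLParser):
    """Collect (level, text) pairs for h1..h6 elements in document order."""

    def __init__(self):
        super().__init__()
        self.headings = []
        self._level = None   # level of the heading currently open, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        if re.fullmatch(r"h[1-6]", tag):
            self._level = int(tag[1])
            self._text = []

    def handle_data(self, data):
        if self._level is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._level is not None and tag == f"h{self._level}":
            # Collapse internal whitespace in the heading text.
            text = " ".join("".join(self._text).split())
            self.headings.append((self._level, text))
            self._level = None


def outline(html_text):
    """Return indented outline lines, or None if the page has no h1."""
    collector = HeadingCollector()
    collector.feed(html_text)
    collector.close()
    if not any(level == 1 for level, _ in collector.headings):
        return None
    return ["  " * (level - 1) + text for level, text in collector.headings]


if __name__ == "__main__":
    page = "<h1>Guide</h1><h2>Setup</h2><h3>Linux</h3><h2>Usage</h2>"
    for line in outline(page):
        print(line)
```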
