RDFa Distiller

What is it?

RDFa is a specification for attributes to be used with XHTML to express structured data. The rendered, hypertext data of XHTML is reused by the RDFa markup, so that publishers don't need to repeat significant data in the document content. The underlying abstract representation is RDF, which lets publishers build their own vocabulary, extend others, and evolve their vocabulary with maximal interoperability over time. This service generates RDF from XHTML files using the RDFa attributes, based on a Python library called “pyRdfa”. To learn more about RDFa, please consult the RDFa Syntax Document

Using the form below, one can generate the RDF content in various serialization formats. The core Python package is also available for download. The package relies on the RDFLib package, otherwise it needs only the standard Python distribution (has been tested on version 2.4). The package includes a possible CGI interface script to start a service like this one.

Warning: the underlying code is subject to change. Indeed, it tries to follow the various discussions of the RDFa task force to serve as one of the several ‘test beds’ for the specification work.

The Distiller Form

Distil RDFa Information

Help: To distil the RDFa content of an XHTML file, give the address (URI) of an XHTML file and choose the output format of the result.

If you use Firefox or Opera, then you can also drag these bookmarklets to your browser bar and use them to distill the current page: “RDFa it (RDF/XML)!”, “RDFa it (Turtle)!”, “RDFa it (N triples)!”. Alternatively, you can also use the fixed URI:

http://www.w3.org/2007/08/pyRdfa/extract?uri=referer

to generate the RDF content using, say, a button, without specifying the URI of the page.


Ivan Herman, (ivan@w3.org)
Last revised: $Date: 2008/05/06 11:55:08 $ (see in RDF)

Valid XHTML + RDFa