This service implements the W3C IG Note on converting microdata to RDF. The software is still in beta (or even alpha…), though.
If you intend to use this service regularly on large scale, consider downloading the package and use it locally. Storing a (conceptually) “cached” version of the generated RDF, instead of referring to the live service, might also be an alternative to consider in trying to avoid overloading this server…
Microdata is a specification for attributes to be used with HTML5 to express structured data. A separate Semantic Web Interest Group Note defines a mapping from HTML5+Microdata to RDF. pyMicrodata is a distiller that generates the RDF triples from HTML5 file annotated by Microdata in various RDF serialization formats. It can either be used directly from a command line or via a CGI service. To learn more about Microdata, please consult the HTML5 Microdata Document, as well as the RDF Conversion algorithm. See also below for the possibilities to download the package.
As installed in this service is a server-side implementation of the conversion. This also means that pages that generate their (X)HTML content dynamically (e.g., using AJAX) will not be properly processed by this distiller.
format; values:
turtle, xml, json, nt;
default: turtle) vocab_expansion;
values: true, false; default: false )vocab
attribute, i.e., to retrieve the corresponding RDF file and follow the possible
subclass and subproperty relationships. See the
RDFa 1.1. Core document for further details. This options makes this possibility available for
the microdata as well, where the vocabulary is defined by the itemtype. This behavior is
non standard and therefore optional.vocab_cache;
values: true, false; default: true)If you use Firefox, Safari, Chrome, or Opera, you can also drag the following bookmarklets to your browser bar and use them to distill the current page: “Microdata it (Turtle)!”, “Microdata it (RDF/XML)!”, “Microdata it (N triples)!”.
When using the distiller URI directly, the option names for the default options can be ommited. Some examples:
http://www.example.com/md.html, serialized in Turtle:http://www.w3.org/2012/pyMicrodata/extract?uri=http://www.example.com/md.htmlhttp://www.example.com/md.html, serialized in RDF/XML:http://www.w3.org/2012/pyMicrodata/extract?format=xml&uri=http://www.example.com/md.htmlhttp://www.w3.org/2012/pyMicrodata/extract?uri=refererThe underlying package, called pyMicrodata, implemented as a Python package, is available for download from GitHub. The package is based on the standard Python 2.x distribution where 'x' is higher or equal than 5. (It has been tested on version 2.7.2, which is the highest, and probably the last stable release in Python 2.x). The module does not run on the Python 3.x family.
To install the package, download the distribution file (it is a compressed tar
file) and either move the pyMicrodata directory to your PYTHONPATH or modify your
PYTHONPATH to to include that directory. Alternatively, the included setup.py can also be used to install the
library into the system-dependent areas.
This software is available for use under the W3C® SOFTWARE NOTICE AND LICENSE