Vocabulary Search on the Semantic Web for RDFa Default Profiles—Yahoo! version

$Date: 2011/06/08 21:48:38 $

The tables in this file show the result of the Yahoo! crawl for RDFa Default Profile data. See a separate blog on more details on this dataset. Please, see the generic overview for a description of the methodology on how the crawl results were used. The rules used to shape the final result is also available for download.

The dump is based on a generic crawl, with a size of around 12B pages, with 431M documents using RDF (excluding trivial RDFa markup, i.e., pages containing triples in the xhtml namespace only). See a separate blog on more details on this dataset. (I.e., in contrast to Sindice, this result is not based on a general Semantic Web crawl). The top 100 entries of the crawl is available for download.

Final results of the Yahoo! crawl

The table below shows the result of applying the rules on the results of the Yahoo! crawl’s top 100 entries.

Vocabulary URIEffective Second Level Domains
1.http://purl.org/dc/terms/344545
2.http://ogp.me/ns#177761
3.http://creativecommons.org/ns#37890
4.http://rdf.data-vocabulary.org/#6083
5.http://xmlns.com/foaf/0.1/2545
6.http://rdfs.org/sioc/ns#1633
7.http://www.w3.org/2006/vcard/ns#1349
8.http://purl.org/goodrelations/v1#488
9.http://purl.org/stuff/rev#369
10.http://commontag.org/ns#272
11.http://gree.jp/69
12.http://www.w3.org/2002/12/cal/icaltzd#62
13.http://www.abmeta.org/ns#59
14.http://data-vocabulary.org/Product/49
15.http://vocab.org/frbr/core#36
16.http://usefulinc.com/ns/doap#35
17.http://www.w3.org/2002/12/cal#25
18.http://purl.org/dc/22
19.http://www.w3.org/2002/12/cal/ical#21
20.http://openelectiondata.org/0.1/20
21.http://purl.org/amicroformat/17
22.http://purl.org/vocab/bio/0.1/17
23.http://purl.org/ontology/bibo/14
24.http://open.vocab.org/terms/13
25.http://www.w3.org/2000/10/swap/pim/contact#13
26.http://purl.org/ontology/mo/13
27.http://www.w3.org/2002/12/cal/12
28.http://https://opengraphprotocol.org/schema/11
29.http://data.semanticweb.org/ns/swc/ontology#10
30.http://mostplays.com/8
31.http://www.w3.org/2006/vcard/8
32.http://www.w3.org/ns/auth/cert#8
33.http://xmlns.xfy.com/blogtool/samples/8
34.http://www.w3.org/ns/auth/rsa#8
35.http://developers.facebook.com/7
36.http://purl.org/dc/elements/1.1/7
37.http://dbpedia.org/ontology/7
38.http://dbpedia.org/property/7
39.http://www.facebook.com/6
40.http://purl.org/dc/dcam5
41.http://moat-project.org/ns#5
42.http://www.openarchives.org/ore/terms/5
43.http://www.facebook.com/2009/5
44.http://www.geonames.org/ontology#5
45.http://www.loc.gov/loc.terms/relators/4
46.http://www.w3.org/2001/04/roadmap/org#4
47.http://www.w3.org/2003/01/geo/wgs84_pos#4
48.http://purl.org/NET/c4dm/event.owl4
49.http://rdfs.org/sioc/services#4
50.http://umbel.org/umbel/sc/4
51.http://ns.aksw.org/update/4