Vocabulary Search on the Semantic Web for RDFa Default Profiles—Sindice version

$Date: 2011/06/08 21:50:13 $

The tables in this file show the result of the Sindice crawl for RDFa Default Profile data. Please see the generic overview for a description of the methodology. The rules used to shape the final result are also available for download.

The dump is based on a crawl of 10B triples on the Semantic Web, concentrating on property and class URI-s. The processing rules are applied as part of the dump, and the top 100 entries, ordered by the number of effective second level domains, are produced by Sindice.
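The ordering step can be illustrated with a minimal sketch. It assumes the per-vocabulary counts are already available as a dictionary; the function name `top_entries` is hypothetical and this is not the code used by Sindice. The sample counts are those of the first four rows of the final results table.

```python
from typing import Dict, List, Tuple


def top_entries(esld_counts: Dict[str, int], n: int = 100) -> List[Tuple[str, int]]:
    """Return the n vocabularies with the highest counts, largest first."""
    # sorted() is stable, so vocabularies with equal counts keep their input order.
    ranked = sorted(esld_counts.items(), key=lambda item: item[1], reverse=True)
    return ranked[:n]


# Effective second level domain counts of the first four rows of the final table.
sample = {
    "http://xmlns.com/foaf/0.1/": 3630,
    "http://purl.org/dc/terms/": 32848,
    "http://purl.org/dc/elements/1.1/": 4485,
    "http://ogp.me/ns#": 18954,
}

for rank, (uri, count) in enumerate(top_entries(sample, n=3), start=1):
    print(rank, uri, count)
```

How ties are broken in the real data is not stated; the sketch simply keeps the input order for equal counts.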

For completeness, this file includes two tables: the top 100 entries of the Sindice crawl with all the data provided, and a shortened version that contains only the effective second level domains, with the important URI-s fully spelled out. (Warning: The Sindice team acknowledges that the dump from which this data has been extracted is polluted by a small fraction (1-3%) of XML files erroneously read as RDF. While this leads to some spurious entries, there is no indication that this alters the final results. A full web refetching/reindexing of the data is being worked on to eliminate this issue, and new data will be provided as soon as this is completed.)

Final results of the Sindice crawl

The table below shows the top 100 results of the Sindice crawl (after the application of the processing rules), with some cleanups on the URI-s and with only the Effective Second Level Domain counts shown.

Rank  Vocabulary URI  Effective Second Level Domains
1.  http://purl.org/dc/terms/  32848
2.  http://ogp.me/ns#  18954
3.  http://purl.org/dc/elements/1.1/  4485
4.  http://xmlns.com/foaf/0.1/  3630
5.  http://rdfs.org/sioc/ns#  1305
6.  http://rdf.data-vocabulary.org/#  845
7.  http://www.w3.org/2003/01/geo/wgs84_pos#  829
8.  http://creativecommons.org/ns#  743
9.  http://purl.org/vocab/bio/0.1/  571
10.  http://www.w3.org/2006/vcard/ns#  559
11.  http://purl.org/goodrelations/v1#  390
12.  http://www.w3.org/2000/10/swap/pim/contact#  352
13.  http://wellformedweb.org/CommentAPI/  238
14.  http://rdfs.org/sioc/types#MicroblogPost  184
15.  http://commontag.org/ns#  168
16.  http://usefulinc.com/ns/doap#  128
17.  http://xmlns.com/wot/0.1/  119
18.  developers.facebook.com/schema/admins  102
19.  developers.facebook.com/schema/app_id  101
20.  http://purl.org/vocab/relationship/  91
21.  http://purl.org/stuff/rev#  73
22.  xmlns.com/wordnet/1.6/Airport  62
23.  http://purl.org/ontology/bibo/  54
24.  http://www.w3.org/2002/12/cal/icaltzd#  50
25.  rdfs.org/sioc/types#Comment  48
26.  http://purl.org/dc/dcmitype/  47
27.  http://rdfs.org/sioc/types#BlogPost  46
28.  http://www.geonames.org/ontology#  43
29.  http://www.w3.org/ns/auth/rsa#  43
30.  http://www.w3.org/ns/auth/cert#  43
31.  http://data.semanticweb.org/ns/swc/ontology#  41
32.  http://dbpedia.org/ontology/  34
33.  open.vocab.org/terms/  30
34.  http://rdfs.org/sioc/types#Microblog  29
35.  http://www.w3.org/2002/12/cal/ical#  29
36.  smob.me/ns#Hub  28
37.  http://online-presence.net/opo/ns#  28
38.  http://dbpedia.org/property/  28
39.  developers.facebook.com/schema/page_id  27
40.  http://purl.org/ontology/mo/  26
41.  http://purl.org/ontology/wo/  25
42.  http://moat-project.org/ns#  23
43.  http://purl.org/vocab/vann/  22
44.  http://purl.org/net/vocab/2004/07/visit#  22
45.  http://ramonantonio.net/doac/0.1/#  21
46.  http://holygoat.co.uk/owl/redwood/0.1/tags/  21
47.  http://purl.org/net/provenance/ns#  17
48.  http://abmeta.org/  17
49.  http://www.w3.org/2002/12/cal#  16
50.  skype.com/  15
51.  http://purl.org/NET/scovo#  15
52.  ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ315005-tax  15
53.  http://trust.mindswap.org/ont/trust.owl#  14
54.  http://purl.org/NET/c4dm/event.owl  14
55.  rdfs.org/sioc/types#BoardPost  13
56.  http://www.w3.org/2006/time#  13
57.  http://www.openarchives.org/ore/terms/  13
58.  http://purl.org/vocab/frbr/core#  13
59.  w3.org/2004/03/trix/rdfg-1/Graph  12
60.  purl.org/dc/dcam  12
61.  xs:string  11
62.  rdfs.org/sioc/types#Weblog  10
63.  http://sw.deri.org/2007/07/sitemapextension/  10
64.  http://purl.org/net/pingback/  10
65.  w3.org/2001/04/roadmap/org#  9
66.  redfoot.net/2005/session#hexdigest  9
67.  purl.org/net/provenance/types#QueryResult  9
68.  purl.org/net/provenance/types#DataCreatingService  9
69.  ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ317003-tax  9
70.  xri://$xrd*($v*2.0)XRD  8
71.  xri://$xrd*($v*2.0)Service  8
72.  xri://$xrdsXRDS  8
73.  purl.org/net/schemas/quaffing/drankBeerWith  8
74.  xsd:string  7
75.  xs:boolean  7
76.  xmlns.com/wordnet/1.6/Project  7
77.  xmlns.com/wordnet/1.6/Person  7
78.  xmlns.com/wordnet/1.6/Beer  7
79.  purl.org/vocab/psychometric-profile/  7
80.  purl.org/stuff/pets/  7
81.  mozilla.org/xblimplementation  7
82.  mozilla.org/xblconstructor  7
83.  mozilla.org/xblbindings  7
84.  mozilla.org/xblbinding  7
85.  http://sw.deri.org/2005/08/conf/cfp#  7
86.  daml.ri.cmu.edu/ont/USRegionState.daml#USState  7
87.  xmlns.com/wordnet/1.6/Document  6
88.  umbel.org/umbel#  6
89.  s.opencalais.com/1/type/sys/InstanceInfo  6
90.  s.opencalais.com/1/type/sys/DocInfoMeta  6
91.  s.opencalais.com/1/type/sys/DocInfo  6
92.  s.opencalais.com/1/type/lid/DefaultLangId  6
93.  skype.com  6
94.  simplemachines.org/xml-feed  6
95.  rdfs.org/sioc/types#WikiArticle  6
96.  rdfs.org/sioc/types#Wiki  6
97.  rdfs.org/sioc/types#MessageBoard  6
98.  purl.org/net/schemas/quaffing/owesBeerTo  6
99.  purl.org/atom-blog/ns#draft  6
100.  http://schemas.talis.com/2005/dir/schema#  6

Original results of the Sindice crawl

The table below shows the top 100 results of the Sindice crawl (after the application of the processing rules) but with all the data provided by the original crawl, namely, for each vocabulary URI:

  1. (Partial) URI of the vocabulary
  2. Number of Triples that use the vocabulary
  3. Number of Graphs that use the vocabulary
  4. Number of Domains that use the vocabulary
  5. Number of effective Second level domains that use the vocabulary (i.e., http://a.b.c and http://q.b.c are considered as identical)
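The last measure in the list above can be sketched as follows. This is a naive approximation that keeps only the last two labels of the host name; how Sindice actually determines the effective second level domain is not described here, and a production version would more likely consult the Public Suffix List (so that hosts under suffixes such as co.uk are handled correctly). The function names and the sample document URLs are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def effective_second_level_domain(url: str) -> str:
    """Naive approximation: keep the last two labels of the host name."""
    host = urlsplit(url).hostname or ""
    labels = host.split(".")
    return ".".join(labels[-2:])


def count_eslds(usages):
    """Count distinct effective second level domains per vocabulary.

    usages: iterable of (vocabulary URI, URL of a document using it).
    """
    seen = defaultdict(set)
    for vocabulary, document_url in usages:
        seen[vocabulary].add(effective_second_level_domain(document_url))
    return {vocabulary: len(domains) for vocabulary, domains in seen.items()}


# http://a.b.c and http://q.b.c collapse to the same domain, b.c.
usages = [
    ("http://xmlns.com/foaf/0.1/", "http://a.b.c/page1"),
    ("http://xmlns.com/foaf/0.1/", "http://q.b.c/page2"),
    ("http://xmlns.com/foaf/0.1/", "http://example.org/"),
]
print(count_eslds(usages))
```

In this sample the three documents are spread over two effective second level domains, so the vocabulary is counted twice rather than three times.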

By default, the table is sorted by its last column, in decreasing order. Clicking on a column header reorders the table by that column, first in increasing order and, on a second click, in decreasing order.

Vocabulary URI  Triples  Graphs  Domain  Effective Second Level Domains
purl.org/dc/terms/1.1422424E864194097267637232848
ogp.me/ns#1.22796352E8260616002679818954
purl.org/dc/elements/1.1/1.62947024E81362355227613754485
http://xmlns.com/foaf/0.1/2.90304947E92240583728258933630
http://rdfs.org/sioc/ns#1.9601212E770921126941305
http://rdf.data-vocabulary.org/#1.2954988E8114959051347845
http://www.w3.org/2003/01/geo/wgs84_pos#1.9475524E78633246114139829
creativecommons.org/ns#3624811.010220351111743
http://purl.org/vocab/bio/0.1/451523.027927042065571
www.w3.org/2006/vcard/ns#1.0840275E7270186346334559
http://purl.org/goodrelations/v1#1.72295168E85955196399390
http://www.w3.org/2000/10/swap/pim/contact#1568602.04045727533352
http://wellformedweb.org/CommentAPI/55675.01948243238
rdfs.org/sioc/types#MicroblogPost1228729.080805415184
http://commontag.org/ns#7119623.090480271168
http://usefulinc.com/ns/doap#64103.09240174128
http://xmlns.com/wot/0.1/1423.0239129119
developers.facebook.com/schema/admins364365.0364194139102
developers.facebook.com/schema/app_id1114637.01114020131101
http://purl.org/vocab/relationship/80511.035659491
http://purl.org/stuff/rev#4517259.07912077773
xmlns.com/wordnet/1.6/Airport127.01066562
http://purl.org/ontology/bibo/388256.02797806354
http://www.w3.org/2002/12/cal/icaltzd#35742.036665050
rdfs.org/sioc/types#Comment195948.0477705848
purl.org/dc/dcmitype/292262.0621775447
rdfs.org/sioc/types#BlogPost5524.031646346
www.geonames.org/ontology#6.9954656E7139620685443
http://www.w3.org/ns/auth/rsa#540.01384743
http://www.w3.org/ns/auth/cert#430.01384743
http://data.semanticweb.org/ns/swc/ontology#84636.048384441
http://dbpedia.org/ontology/1.3783624E714167903534
open.vocab.org/terms/66838.0484203230
rdfs.org/sioc/types#Microblog383.03832929
http://www.w3.org/2002/12/cal/ical#29274.021643029
smob.me/ns#Hub382.03822828
http://online-presence.net/opo/ns#1040.03782828
http://dbpedia.org/property/8.561048E7167421612928
developers.facebook.com/schema/page_id11651.0113273727
http://purl.org/ontology/mo/994543.02758282626
http://purl.org/ontology/wo/303964.0145613025
moat-project.org/ns#4270.07452623
http://purl.org/vocab/vann/3745.03702422
http://purl.org/net/vocab/2004/07/visit#844.0412222
http://ramonantonio.net/doac/0.1/#2457.0772121
http://holygoat.co.uk/owl/redwood/0.1/tags/207650.0118752121
http://purl.org/net/provenance/ns#1.678109E710484781717
http://abmeta.org/6484.023771717
http://www.w3.org/2002/12/cal#1168.0941616
skype.com/35.0311515
http://purl.org/NET/scovo#7355.03866115
ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ315005-tax23.0181515
http://trust.mindswap.org/ont/trust.owl#632.0191414
http://purl.org/NET/c4dm/event.owl9404.010071414
rdfs.org/sioc/types#BoardPost1485.011181313
http://www.w3.org/2006/time#649.0661513
http://www.openarchives.org/ore/terms/7211708.07530721513
http://purl.org/vocab/frbr/core#856117.0595031413
w3.org/2004/03/trix/rdfg-1/Graph2097031.010485881212
purl.org/dc/dcam18801.082891312
xs:string1345.0971111
rdfs.org/sioc/types#Weblog1439.013571210
http://sw.deri.org/2007/07/sitemapextension/1306.0101010
http://purl.org/net/pingback/22.0141010
w3.org/2001/04/roadmap/org#3249.051109
redfoot.net/2005/session#hexdigest17.01799
purl.org/net/provenance/types#QueryResult1048484.0104845499
purl.org/net/provenance/types#DataCreatingService2096828.0104842399
ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ317003-tax17.01199
xri://$xrd*($v*2.0)XRD27.02188
xri://$xrd*($v*2.0)Service25.02188
xri://$xrdsXRDS21.02188
purl.org/net/schemas/quaffing/drankBeerWith67.0888
xsd:string64.01177
xs:boolean52.02277
xmlns.com/wordnet/1.6/Project18.01077
xmlns.com/wordnet/1.6/Person6317.040187
xmlns.com/wordnet/1.6/Beer13.01277
purl.org/vocab/psychometric-profile/48.01277
purl.org/stuff/pets/84.0977
mozilla.org/xblimplementation12.0977
mozilla.org/xblconstructor12.0977
mozilla.org/xblbindings9.0977
mozilla.org/xblbinding16.0977
http://sw.deri.org/2005/08/conf/cfp#310.040107
daml.ri.cmu.edu/ont/USRegionState.daml#USState136.01077
xmlns.com/wordnet/1.6/Document84.07366
umbel.org/umbel#22266.015676106
s.opencalais.com/1/type/sys/InstanceInfo871.03066
s.opencalais.com/1/type/sys/DocInfoMeta30.03066
s.opencalais.com/1/type/sys/DocInfo30.03066
s.opencalais.com/1/type/lid/DefaultLangId30.03066
skype.com12.01166
simplemachines.org/xml-feed16.01666
rdfs.org/sioc/types#WikiArticle35836.025466
rdfs.org/sioc/types#Wiki1182.0523106
rdfs.org/sioc/types#MessageBoard214.021466
purl.org/net/schemas/quaffing/owesBeerTo18.0666
purl.org/atom-blog/ns#draft79.01066
http://schemas.talis.com/2005/dir/schema#373.035476
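
The Triples column above is printed in floating-point notation, for example 1.1422424E8 for purl.org/dc/terms/ and 451523.0 for http://purl.org/vocab/bio/0.1/. A minimal sketch for turning such values into plain integers is shown below; the helper name `triple_count` is hypothetical. The larger values appear to carry only eight or nine significant digits, so the converted counts should be treated as approximate.

```python
from decimal import Decimal


def triple_count(text: str) -> int:
    """Convert a count such as '1.1422424E8' or '451523.0' to an integer."""
    value = Decimal(text)
    # Reject anything that is not a whole number instead of silently truncating.
    if value != value.to_integral_value():
        raise ValueError(f"not a whole number: {text!r}")
    return int(value)


print(triple_count("1.1422424E8"))  # Triples value listed for purl.org/dc/terms/
print(triple_count("451523.0"))     # Triples value listed for http://purl.org/vocab/bio/0.1/
```

Decimal is used here rather than float so that the digits are read exactly as printed.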