WARNING: this is still a draft, and some details are still in discussion.
The table below contains the top 100 results of the complete search and processing for Default Profile Vocabularies as performed by Sindice. See the separate section below that explains the methodology leading to this table.
The content of each column is:
For the 2nd level domain count, http://a.b.c and http://q.b.c are considered as identical.

The domains’ data is important for the purpose of a default profile: if a vocabulary is used by a large number of triples that all originate at one or a few domains only, that indicates that the vocabulary is used only in a few, albeit possibly very large, datasets. Although these datasets may be very important resources, this would not warrant making the vocabulary part of a generic default profile. To be precise about the content of the table, it contains the top 100 entries in decreasing order of the 2nd level domain value. The full search results are also available in CSV format (beware, the data set is over 140M, containing more than 1,800,000 entries).
By default, the table is sorted by its last column, in decreasing order. Clicking on a column header reorders the table by that column, first in increasing order and, on a second click, in decreasing order.
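As an illustration of the 2nd level domain counting and ranking described above, here is a minimal Python sketch. It is not the processing used by Sindice: the function names, the `(vocabulary, source URL)` input shape, and the sample data are assumptions made for this example, and the domain collapsing is deliberately naive (it keeps the last two host labels and ignores multi-part suffixes such as `co.uk`).

```python
from collections import defaultdict
from urllib.parse import urlsplit


def second_level_domain(url: str) -> str:
    """Collapse a URL's host to its last two labels (naive; ignores suffixes like co.uk)."""
    host = urlsplit(url).hostname or ""
    labels = host.lower().split(".")
    return ".".join(labels[-2:])


def rank_by_second_level_domains(usages, top=100):
    """Rank vocabularies by the number of distinct 2nd level domains using them.

    `usages` is an iterable of (vocabulary_uri, source_url) pairs; this input
    shape is hypothetical and chosen only for the example.
    """
    domains = defaultdict(set)
    for vocabulary, source in usages:
        domains[vocabulary].add(second_level_domain(source))
    counts = {vocabulary: len(found) for vocabulary, found in domains.items()}
    # Decreasing order of the 2nd level domain count, keeping the first `top` entries.
    return sorted(counts.items(), key=lambda item: item[1], reverse=True)[:top]


# Made-up sample data: a.b.c and q.b.c collapse to the same 2nd level domain.
sample = [
    ("xmlns.com/foaf/0.1/", "http://a.b.c/people"),
    ("xmlns.com/foaf/0.1/", "http://q.b.c/people"),
    ("xmlns.com/foaf/0.1/", "http://example.org/me"),
    ("ogp.me/ns#", "http://a.b.c/page"),
]
ranking = rank_by_second_level_domains(sample)
```

In this sample the first vocabulary is seen on three hosts but only two 2nd level domains, which is the distinction the last two columns of the table are meant to capture.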
Vocabulary URI | Triples | Graphs | Domains | 2nd Level Domains |
---|---|---|---|---|
purl.org/dc/terms/ | 3.81844672E8 | 113308089 | 3395120 | 232691 |
w3.org/2006/vcard/ns# | 2.15687398E9 | 52579593 | 585781 | 166510 |
xmlns.com/foaf/0.1/ | 2.73884851E9 | 41645800 | 2939451 | 83707 |
ogp.me/ns# | 1.62337136E8 | 24162238 | 27579 | 19424 |
purl.org/dc/elements/1.1/ | 1.72281808E8 | 13324030 | 2370883 | 18223 |
w3.org/2003/01/geo/wgs84_pos# | 4.6242568E7 | 10215846 | 105668 | 7989 |
creativecommons.org/ns# | 7827782.0 | 1651178 | 8204 | 5450 |
w3.org/2002/12/cal/icaltzd# | 3.6544712E7 | 2157748 | 6085 | 3895 |
purl.org/stuff/rev# | 1.8225048E7 | 2372861 | 4664 | 2451 |
w3.org/2000/10/swap/pim/contact# | 2723938.0 | 68249 | 28878 | 1350 |
rdfs.org/sioc/ns# | 1.7927096E7 | 649744 | 2676 | 1313 |
rdf.data-vocabulary.org/# | 1.30246464E8 | 10054031 | 1728 | 1030 |
purl.org/vocab/bio/0.1/ | 788445.0 | 238116 | 35921 | 510 |
purl.org/goodrelations/v1# | 1.57271296E8 | 4953695 | 403 | 374 |
wellformedweb.org/CommentAPI/ | 44282.0 | 1494 | 226 | 223 |
ramonantonio.net/doac/0.1/# | 7257748.0 | 911699 | 308 | 190 |
commontag.org/ns# | 5257518.0 | 65337 | 269 | 180 |
rdfs.org/sioc/types#MicroblogPost | 1144228.0 | 68690 | 373 | 171 |
usefulinc.com/ns/doap# | 67050.0 | 7861 | 163 | 116 |
xmlns.com/wot/0.1/ | 1480.0 | 228 | 124 | 110 |
developers.facebook.com/schema/admins | 822554.0 | 317936 | 150 | 109 |
developers.facebook.com/schema/app_id | 2800303.0 | 918280 | 135 | 106 |
purl.org/vocab/relationship/ | 74791.0 | 3080 | 87 | 80 |
purl.org/stuff/rev#Review | 54401.0 | 8332 | 78 | 71 |
purl.org/stuff/rev#hasReview | 27019.0 | 2276 | 61 | 59 |
purl.org/ontology/bibo/ | 700982.0 | 237603 | 71 | 56 |
xmlns.com/wordnet/1.6/Airport | 159.0 | 84 | 60 | 54 |
purl.org/stuff/rev#text | 38572.0 | 6952 | 57 | 54 |
rdfs.org/sioc/types#Comment | 253000.0 | 38216 | 66 | 52 |
purl.org/dc/dcmitype/ | 290338.0 | 43730 | 61 | 52 |
purl.org/stuff/rev#rating | 54185.0 | 9126 | 56 | 51 |
rdfs.org/sioc/types#BlogPost | 24667.0 | 5174 | 64 | 48 |
w3.org/ns/auth/rsa# | 1386.0 | 223 | 50 | 43 |
data.semanticweb.org/ns/swc/ontology# | 77595.0 | 4124 | 49 | 42 |
geonames.org/ontology# | 7.7840128E7 | 11134802 | 53 | 41 |
w3.org/ns/auth/cert# | 972.0 | 210 | 46 | 40 |
developers.facebook.com/schema/page_id | 33067.0 | 12072 | 45 | 36 |
w3.org/2002/12/cal/ical# | 29016.0 | 1879 | 35 | 32 |
open.vocab.org/terms/ | 121108.0 | 37316 | 34 | 32 |
purl.org/stuff/rev#reviewer | 19146.0 | 1043 | 32 | 30 |
http://dbpedia.org/ontology/ | 1.3169637E7 | 1119693 | 35 | 30 |
rdfs.org/sioc/types#Microblog | 575.0 | 323 | 30 | 29 |
purl.org/ontology/wo/ | 438756.0 | 18907 | 42 | 29 |
smob.me/ns#Hub | 498.0 | 335 | 28 | 28 |
http://dbpedia.org/property/ | 9.3231264E7 | 11782346 | 33 | 28 |
online-presence.net/opo/ns# | 1665.0 | 306 | 27 | 27 |
purl.org/ontology/mo/ | 1214051.0 | 234770 | 28 | 26 |
purl.org/net/vocab/2004/07/visit# | 757.0 | 38 | 20 | 20 |
moat-project.org/ns# | 6395.0 | 595 | 22 | 20 |
purl.org/vocab/vann/ | 3859.0 | 293 | 18 | 18 |
purl.org/stuff/rev#title | 1435.0 | 170 | 20 | 18 |
w3.org/2002/12/cal# | 1205.0 | 78 | 16 | 16 |
skype.com/ | 49.0 | 28 | 15 | 13 |
purl.org/vocab/frbr/core# | 806407.0 | 50886 | 14 | 13 |
purl.org/net/provenance/ns# | 1.3858524E7 | 760903 | 13 | 13 |
openarchives.org/ore/terms/ | 7141080.0 | 643265 | 15 | 13 |
_:node0 | 126.0 | 108 | 13 | 13 |
ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ315005-tax | 27.0 | 18 | 14 | 13 |
abmeta.org/ns#tags | 2494.0 | 432 | 13 | 13 |
trust.mindswap.org/ont/trust.owl# | 605.0 | 18 | 13 | 12 |
rdfs.org/sioc/types#BoardPost | 2536.0 | 975 | 13 | 12 |
purl.org/stuff/rev#type | 173.0 | 20 | 12 | 12 |
purl.org/NET/scovo# | 3426.0 | 333 | 56 | 12 |
purl.org/NET/c4dm/event.owl | 10470.0 | 867 | 12 | 12 |
purl.org/dc/dcam | 27946.0 | 7073 | 13 | 12 |
holygoat.co.uk/owl/redwood/0.1/tags/taggedWithTag | 9408.0 | 705 | 12 | 12 |
abmeta.org/ns#Book | 335.0 | 174 | 12 | 12 |
w3.org/2004/03/trix/rdfg-1/Graph | 3217495.0 | 761231 | 11 | 11 |
abmeta.org/ns#link | 78.0 | 50 | 11 | 11 |
abmeta.org/ns#isbn | 127.0 | 80 | 11 | 11 |
abmeta.org/ns#description | 1100.0 | 573 | 11 | 11 |
xs:string | 1514.0 | 71 | 10 | 10 |
w3.org/2006/time# | 603.0 | 46 | 12 | 10 |
holygoat.co.uk/owl/redwood/0.1/tags/taggedBy | 4794.0 | 2027 | 10 | 10 |
holygoat.co.uk/owl/redwood/0.1/tags/Tag | 27806.0 | 6072 | 10 | 10 |
purl.org/net/pingback/ | 20.0 | 9 | 9 | 9 |
holygoat.co.uk/owl/redwood/0.1/tags/taggedResource | 1336.0 | 191 | 10 | 9 |
abmeta.org/ns#year | 111.0 | 64 | 9 | 9 |
xri://$xrd*($v*2.0)Service | 27.0 | 18 | 8 | 8 |
umbel.org/umbel# | 44058.0 | 12909 | 11 | 8 |
redfoot.net/2005/session#hexdigest | 16.0 | 14 | 8 | 8 |
rdfs.org/sioc/types#Weblog | 3303.0 | 1200 | 10 | 8 |
mozilla.org/xblbindings | 9.0 | 9 | 8 | 8 |
xri://$xrdsXRDS | 34.0 | 17 | 7 | 7 |
xmlns.com/wordnet/1.6/Project | 16.0 | 8 | 7 | 7 |
xmlns.com/wordnet/1.6/Person | 4908.0 | 349 | 8 | 7 |
sw.deri.org/2005/08/conf/cfp# | 258.0 | 32 | 10 | 7 |
purl.org/net/provenance/types#QueryResult | 2261588.0 | 892893 | 7 | 7 |
holygoat.co.uk/owl/redwood/0.1/tags/name | 2474.0 | 457 | 7 | 7 |
ebusiness-unibw.org/ontologies/eclass/5.1.4/#C_AKJ317003-tax | 12.0 | 7 | 7 | 7 |
xs:boolean | 48.0 | 19 | 6 | 6 |
xmlns.com/wordnet/1.6/Document | 97.0 | 66 | 6 | 6 |
s.opencalais.com/1/type/lid/DefaultLangId | 127.0 | 25 | 6 | 6 |
skype.com | 14.0 | 11 | 7 | 6 |
rdfs.org/sioc/types#WikiArticle | 30965.0 | 212 | 6 | 6 |
rdfs.org/sioc/types#Wiki | 1642.0 | 445 | 10 | 6 |
rdfs.org/sioc/types#MessageBoard | 226.0 | 152 | 8 | 6 |
purl.org/vocab/psychometric-profile/ | 46.0 | 11 | 7 | 6 |
purl.org/net/schemas/quaffing/drankBeerWith | 55.0 | 7 | 7 | 6 |
purl.org/net/provenance/types#DataCreatingService | 3180869.0 | 821348 | 6 | 6 |
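The Triples column above is printed as floating point text (for example `3.81844672E8` or `7827782.0`). The following Python sketch shows one way to read a row of the table into integers and to relate the triple count to the number of 2nd level domains. The function and key names are assumptions made for this example, and values given in scientific notation may already be rounded in the table, so the resulting integers should be treated as approximate.

```python
def parse_row(line: str) -> dict:
    """Parse one pipe-separated table row into a dict with integer counts."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    uri, triples, graphs, domains, second_level = cells
    return {
        "vocabulary": uri,
        # The Triples column uses float notation, so go through float() first.
        "triples": int(float(triples)),
        "graphs": int(graphs),
        "domains": int(domains),
        "second_level_domains": int(second_level),
    }


def triples_per_second_level_domain(row: dict) -> float:
    """Average number of triples per 2nd level domain; large values suggest
    a vocabulary concentrated in a few big datasets."""
    return row["triples"] / row["second_level_domains"]


# Two rows copied from the table above.
geonames = parse_row("geonames.org/ontology# | 7.7840128E7 | 11134802 | 53 | 41 |")
dcterms = parse_row("purl.org/dc/terms/ | 3.81844672E8 | 113308089 | 3395120 | 232691 |")

concentrated = (
    triples_per_second_level_domain(geonames)
    > triples_per_second_level_domain(dcterms)
)
```

For these two rows, `geonames.org/ontology#` has far more triples per 2nd level domain than `purl.org/dc/terms/`, which illustrates why a high triple count alone does not indicate broad usage.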
The fundamental approach is to search the Semantic Web for vocabulary usage and process the results in order to derive possible vocabularies that are suitable for an RDFa Default Profile. The detailed steps are as follows.
The most complex and possibly controversial step is 2.2 above. Here are the categories of vocabularies that were removed from the result set: