HCLSIG/LLD/DatasetDescription
Published Vocabularies:
- VoID - Vocabulary of Interlinked Datasets - http://www.w3.org/TR/void/
- OMV - Ontology Metadata Vocabulary - http://omv2.sourceforge.net/
- SCOVO - Statistical Core Vocabulary - http://sw.joanneum.at/scovo/schema.html
- DMOP - Data Mining OPtimization Ontology - http://www.dmo-foundry.org/
- DCAT - Data Catalog Vocabulary - http://data.fundacionctic.org/vocab/catalog/dcat.html
- Data Cube Vocab - http://www.w3.org/TR/vocab-data-cube/
- Quantifying RDF Datasets - http://dl.dropbox.com/u/21690634/Quantifying%20RDF%20data%20sets.pdf
- schema.org - http://www.w3.org/wiki/WebSchemas/Datasets
Descriptors
- label coverage [
- language tags used [list]
- vocabularies used [list]
- links to [list]
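As a rough illustration, two of the descriptors above (language tags used, vocabularies used) could be collected in one pass over the triples. This is only a sketch under stated assumptions: the `Literal` tuple, the `namespace_of` heuristic and the `describe` function are hypothetical names introduced here, and "vocabulary" is taken to mean the namespace of each predicate and of each rdf:type object, which this page does not define. Label coverage and links are left out because their definitions are not given.

```python
from collections import namedtuple

# Hypothetical minimal literal type: lexical value, optional datatype, optional language tag.
Literal = namedtuple("Literal", ["value", "datatype", "lang"], defaults=(None, None))

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def namespace_of(uri):
    """Cut a URI after its last '#' or '/' (a heuristic, not a standard rule)."""
    cut = max(uri.rfind("#"), uri.rfind("/"))
    return uri[: cut + 1] if cut >= 0 else uri


def describe(triples):
    """Collect language tags and vocabulary namespaces from (s, p, o) tuples."""
    language_tags = set()
    vocabularies = set()
    for s, p, o in triples:
        # Assumption: every predicate contributes its namespace as a vocabulary.
        vocabularies.add(namespace_of(p))
        if isinstance(o, Literal):
            if o.lang:
                # Language tags are case-insensitive, so normalise to lower case.
                language_tags.add(o.lang.lower())
        elif p == RDF_TYPE:
            # Assumption: class URIs also count towards the vocabularies used.
            vocabularies.add(namespace_of(o))
    return {
        "language tags used": sorted(language_tags),
        "vocabularies used": sorted(vocabularies),
    }
```

Objects that are not `Literal` instances are treated as URIs (plain strings), so the same representation can be reused when counting the measures.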
Measures
- Number of triples (Nt)
- Number of literals (Nl)
- Number of object URIs (No)
- Number of distinct literals (type removed) (Ndl)
- Number of distinct objects (Ndo)
- Number of distinct subjects (Nds)
- Number of distinct URIs (Nu)
- Number of typed instances (Ni)
- Number of instances of type t (Nit)
- Number of distinct classes (Nc)
- Number of distinct predicates (Ndp)
- “Literalness” = Nl / Nt
- “Literal uniqueness” = Ndl / Nl
- “Object uniqueness” = Ndo / No
- “Structure” = 1 - (Ni + Nl) / Nt
- “Subject coverage” = Nds / Nu
- “Object coverage” = Ndo / Nu
- “Type frequency of class t” = {Nit / Ni, ...}
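The counts and ratios above could be computed along the following lines. This is a minimal sketch, not a reference implementation, and several definitions are assumptions because the list does not spell them out: Ni is taken as the number of rdf:type triples (so that the type frequencies sum to 1), Ndo as the number of distinct object URIs, Nu as the number of distinct URIs in subject or object position, and "type removed" as comparing literals by lexical value only. The `Literal` tuple and `dataset_measures` function are hypothetical names.

```python
from collections import Counter, namedtuple

# Hypothetical minimal literal type: lexical value, optional datatype, optional language tag.
Literal = namedtuple("Literal", ["value", "datatype", "lang"], defaults=(None, None))

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def ratio(numerator, denominator):
    """Divide, returning 0.0 for an empty denominator instead of raising."""
    return numerator / denominator if denominator else 0.0


def dataset_measures(triples):
    """Compute the counts and ratios for an iterable of (s, p, o) tuples."""
    triples = list(triples)
    literals = [o for _, _, o in triples if isinstance(o, Literal)]
    object_uris = [o for _, _, o in triples if not isinstance(o, Literal)]
    subjects = {s for s, _, _ in triples}
    # Assumption: Nit counts rdf:type triples per class; Ni is their total.
    per_class = Counter(
        o for _, p, o in triples if p == RDF_TYPE and not isinstance(o, Literal)
    )

    m = {
        "Nt": len(triples),
        "Nl": len(literals),
        "No": len(object_uris),
        # "Type removed": compare lexical values only, ignoring datatype and language.
        "Ndl": len({lit.value for lit in literals}),
        "Ndo": len(set(object_uris)),
        "Nds": len(subjects),
        # Assumption: Nu covers subject and object positions, not predicates.
        "Nu": len(subjects | set(object_uris)),
        "Ni": sum(per_class.values()),
        "Nit": dict(per_class),
        "Nc": len(per_class),
        "Ndp": len({p for _, p, _ in triples}),
    }
    m["literalness"] = ratio(m["Nl"], m["Nt"])
    m["literal uniqueness"] = ratio(m["Ndl"], m["Nl"])
    m["object uniqueness"] = ratio(m["Ndo"], m["No"])
    m["structure"] = 1 - ratio(m["Ni"] + m["Nl"], m["Nt"])
    m["subject coverage"] = ratio(m["Nds"], m["Nu"])
    m["object coverage"] = ratio(m["Ndo"], m["Nu"])
    m["type frequency"] = {t: ratio(n, m["Ni"]) for t, n in per_class.items()}
    return m
```

For large datasets the same counts would more likely come from SPARQL aggregate queries against the endpoint; the in-memory version is only meant to make the definitions concrete.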