Copyright ©2009 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply.
SKOS—Simple Knowledge Organization System—provides a model for expressing the basic structure and content of concept schemes such as thesauri, classification schemes, subject heading lists, taxonomies, folksonomies, and other similar types of controlled vocabulary. As an application of the Resource Description Framework (RDF), SKOS allows concepts to be composed and published on the World Wide Web, linked with data on the Web and integrated into other concept schemes.
This document is a user guide for those who would like to represent their concept scheme using SKOS.
In basic SKOS, conceptual resources (concepts) are identified with URIs, labeled with strings in one or more natural languages, documented with various types of note, semantically related to each other in informal hierarchies and association networks, and aggregated into concept schemes.
In advanced SKOS, conceptual resources can be mapped across concept schemes and grouped into labeled or ordered collections. Relationships can be specified between concept labels. Finally, the SKOS vocabulary itself can be extended to suit the needs of particular communities of practice or combined with other modeling vocabularies.
This document is a companion to the SKOS Reference, which provides the normative reference on SKOS.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.
This document is a Working Draft published by the Semantic Web Deployment Working Group, part of the W3C Semantic Web Activity. This draft is an update to the previous version of 17 March 2009. A differences document details the changes. This is a companion document to the SKOS Simple Knowledge Organization System Reference W3C Proposed Recommendation dated 15 June 2009.
The Working Group solicits review and feedback on this document and the associated SKOS Reference Proposed Recommendation. All comments are welcome and may be sent to public-swd-wg@w3.org; please include the text "SKOS comment" in the subject line. All messages received at this address are viewable in a public archive.
The group does not expect this document to become a W3C Recommendation. The Working Group intends to advance the associated SKOS Reference to W3C Recommendation after further review and comment. The Working Group expects to revise this Primer while the SKOS Reference is undergoing review and eventually publish the Primer as a Working Group Note.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
The Simple Knowledge Organization System (SKOS) is an RDF vocabulary for representing semi-formal knowledge organization systems (KOSs), such as thesauri, taxonomies, classification schemes and subject heading lists. Because SKOS is based on the Resource Description Framework (RDF) [RDF-PRIMER] these representations are machine-readable and can be exchanged between software applications and published on the World Wide Web.
SKOS has been designed to provide a low-cost migration path for porting existing organization systems to the Semantic Web. SKOS also provides a lightweight, intuitive conceptual modeling language for developing and sharing new KOSs. It can be used on its own, or in combination with more formal languages such as the Web Ontology Language (OWL) [OWL]. SKOS can also be seen as a bridging technology, providing the missing link between the rigorous logical formalism of ontology languages such as OWL and the chaotic, informal and weakly-structured world of Web-based collaboration tools, as exemplified by social tagging applications.
The aim of SKOS is not to replace original conceptual vocabularies in their initial context of use, but to allow them to be ported to a shared space, based on a simplified model, enabling wider re-use and better interoperability.
This document is intended to help users who have a basic understanding of RDF to represent and publish their concept schemes as SKOS data. The Primer aims to provide introductory examples and guidance in the use of the SKOS vocabulary.
For a systematic account of all SKOS vocabulary elements, including their reference semantics, the reader should consult the normative SKOS Reference [SKOS-REFERENCE]. This can be done, at the level of classes and properties, by clicking on their occurrences in the text (e.g. skos:Concept). For an overview of the use cases for SKOS and the elicited requirements that guided its design, the reader should consult the SKOS Use Cases and Requirements [SKOS-UCR].
This Primer, together with the SKOS Reference [SKOS-REFERENCE], replaces the earlier SKOS Core Guide [SWBP-SKOS-CORE-GUIDE] and the SKOS Core Vocabulary Specification [SWBP-SKOS-CORE-SPEC], which are now deprecated.
The essential features of the SKOS model are explained in Section 2. Here, the reader is presented with the set of vocabulary elements that are most commonly used for representing KOSs. In Section 3, the reader is shown how to add value to these representations, either by linking them together or by relating them to other kinds of Semantic Web resources. It is expected that many SKOS applications will employ some of the features presented in Section 3. Section 4 is focused on more advanced representation needs, which are likely to be required for a limited number of SKOS applications. Section 5 discusses the use of SKOS in conjunction with other modeling approaches, specifically OWL.
Most of the examples in this guide are given as a serialization of RDF graphs using the Turtle syntax for RDF [TURTLE]. Examples serialized as Turtle appear in code lines such as:
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>. @prefix ex: <http://www.example.com/>. ex:aResource ex:aProperty ex:anotherResource; ex:anotherProperty "An RDF Literal"@en.
The above is equivalent to the following expression, in the RDF/XML reference syntax [RDF/XML-SYNTAX]:
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:ex="http://www.example.com/"> <rdf:Description rdf:about="http://www.example.com/aResource"> <ex:aProperty rdf:resource="http://www.example.com/anotherResource"/> <ex:anotherProperty xml:lang="en">An RDF Literal</ex:anotherProperty> </rdf:Description> </rdf:RDF>
For the sake of brevity a number of namespace declarations are omitted from the examples. This applies to standard namespaces (SKOS, RDF/RDFS [RDF-PRIMER], OWL [OWL] and Dublin Core [DC]) but also to the ones that are coined for the examples. Generally, these namespaces could be declared as in the following code:
@prefix skos: <http://www.w3.org/2004/02/skos/core#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix owl: <http://www.w3.org/2002/07/owl#> . @prefix dct: <http://purl.org/dc/terms/> . @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix ex: <http://www.example.com/> . @prefix ex1: <http://www.example.com/1/> . @prefix ex2: <http://www.example.com/2/> .
This section introduces the core of the SKOS model, namely the features that are needed to represent most KOSs, as observed in the majority of use cases [SKOS-UCR].
In basic SKOS, conceptual resources (concepts) can be identified with URIs, labeled with lexical strings in one or more natural languages, documented with various types of note, semantically related to each other in informal hierarchies and association networks and aggregated into concept schemes.
The fundamental element of the SKOS vocabulary is the concept. Concepts are the units of thought [WillpowerGlossary]—ideas, meanings, or (categories of) objects and events—which underlie many knowledge organization systems [SKOS-UCR]. As such, concepts exist in the mind as abstract entities which are independent of the terms used to label them.
SKOS introduces the class skos:Concept
,
which allows implementors to assert that a given resource is a concept. This
is done in two steps:
rdf:type
, that
the resource identified by this URI is of type skos:Concept
.For example:
<http://www.example.com/animals> rdf:type skos:Concept.
This can also be represented in Turtle more compactly using the namespace
prefix ex
defined above:
ex:animals rdf:type skos:Concept.
Using SKOS to publish concept schemes makes it easy to reference the concepts in resource descriptions on the Semantic Web. Implementors are encouraged to use HTTP URIs when minting concept URIs since they are resolvable to representations that can be accessed using standard Web technologies. For more information about URIs on the Semantic Web, see Cool URIs for the Semantic Web [COOLURIS] and Best Practice Recipes for Publishing RDF Vocabularies [RECIPES].
The first characterization of concepts are the expressions that are used
to refer to them in natural language: their labels. SKOS provides
three properties to attach labels to conceptual resources: skos:prefLabel
,
skos:altLabel
and skos:hiddenLabel
.
Each property implies a specific status for the label it introduces, ranging
from a strong, univocal denotation relationship, to a string to aid in
lookup. These properties are formally defined as being pairwise disjoint.
This means, for example, that it is an error if a concept has a same literal
both as its preferred label and as an alternative label.
As specified in Section 5 of the
SKOS Reference, skos:prefLabel
,
skos:altLabel
and skos:hiddenLabel
provide simple
labels. They are all sub-properties of rdfs:label
, and
are used to link a skos:Concept
to an RDF plain
literal, which is a character string (e.g. "love"
) combined
with an optional language tag (e.g. "en-US"
) [RDF-CONCEPTS].
The skos:prefLabel
property makes it possible to assign a preferred lexical label to a resource.
Terms used as descriptors in indexing systems [WillpowerGlossary] will for instance be
represented using this property, as in the following example:
ex:animals rdf:type skos:Concept; skos:prefLabel "animals".
RDF plain literals are formally defined as character strings with optional language tags. SKOS thereby enables a simple form of multilingual labeling. This is done by using the language tag of a lexical label to restrict its scope to a particular language. The following example illustrates how a concept is given one preferred label in English and another in French:
ex:animals rdf:type skos:Concept; skos:prefLabel "animals"@en; skos:prefLabel "animaux"@fr.
Note that the notion of preferred label implies that a resource can only have one such label per language tag, as explained in Section 5 of the SKOS Reference [SKOS-REFERENCE].
Following common practice in KOS design, the preferred label of a concept may also be used to unambiguously represent this concept within a KOS and its applications. So even though the SKOS data model does not formally enforce it, it is recommended that no two concepts in the same KOS be given the same preferred lexical label for any given language tag.
The skos:altLabel
property makes it possible to assign an alternative lexical label to a
concept. This is especially helpful when assigning labels beyond the one that
is preferred for the concept, for instance when synonyms need to be
represented:
ex:animals rdf:type skos:Concept; skos:prefLabel "animals"@en; skos:altLabel "creatures"@en; skos:prefLabel "animaux"@fr; skos:altLabel "créatures"@fr.
Note that representation of synonyms for preferred labels is not the only
use for skos:altLabel
. Near-synonyms, abbreviations and
acronyms can be represented the same way:
ex:fao rdf:type skos:Concept;
skos:prefLabel "Food and Agriculture Organization"@en;
skos:altLabel "FAO"@en.
Note on upward posting: It is also possible
to use skos:altLabel
to represent cases of upward
posting [ISO-2788]. That is, when a
concept aggregates more specialized notions that are not explicitly
introduced as concepts in the considered KOS:
ex:rocks rdf:type skos:Concept; skos:prefLabel "rocks"@en; skos:altLabel "basalt"@en; skos:altLabel "granite"@en; skos:altLabel "slate"@en.
However, and even though SKOS is not intended as a guide for
KOS design replacing existing standards [ISO-2788, BS8723-2], the reader should be aware that this
does not correspond to a recommended practice. A more appropriate KOS for
this domain would introduce a skos:Concept
for each kind of rock
considered (basalt, granite and slate) and assert it as a narrower concept of
ex:rock
.
A hidden lexical label, represented by means of the skos:hiddenLabel
property, is a lexical label for a resource, where a KOS designer would like
that character string to be accessible to applications performing text-based
indexing and search operations, but would not like that
label to be visible otherwise. Hidden labels may for instance be used to
include misspelled variants of other lexical labels. For example:
ex:animals rdf:type skos:Concept; skos:prefLabel "animaux"@fr; skos:altLabel "bêtes"@fr; skos:hiddenLabel "betes"@fr.
In KOSs semantic relations play a crucial role for defining concepts. The meaning of a concept is defined not just by the natural-language words in its labels but also by its links to other concepts in the vocabulary. Mirroring the fundamental categories of relations that are used in vocabularies such as thesauri [ISO2788], SKOS supplies three standard properties:
skos:broader
and skos:narrower
enable the representation of hierarchical links, such as the relationship
between one genre and its more specific species, or,
depending on interpretations, the relationship between one whole
and its parts;skos:related
enables the representation of associative (non-hierarchical) links, such
as the relationship between one type of event and a category of
entities which typically participate in it. Another use for
skos:related
is between two categories where neither is more
general or more specific. Note that skos:related
enables the
representation of associative (non-hierarchical) links, which can also be
used to represent part-whole links that are not meant as hierarchical
relationships.To assert that one concept is broader in meaning (i.e. more general) than
another, the skos:broader
property is used. The skos:narrower
property is used to assert the inverse, namely when one concept is narrower
in meaning (i.e. more specific) than another. For example:
ex:animals rdf:type skos:Concept; skos:prefLabel "animals"@en; skos:narrower ex:mammals. ex:mammals rdf:type skos:Concept; skos:prefLabel "mammals"@en; skos:broader ex:animals.
As is often the case in KOS, a SKOS concept can be attached to several
broader concepts at the same time. For example, a concept ex:dog
could have both ex:mammals
and
ex:domesticatedAnimals
as broader concepts.
Note on skos:broader
direction:
for historic reasons, the name of the skos:broader
property (the
word "broader") does not provide an explicit indication of its direction. The
word "broader" should read here as "has broader concept"; the subject of a
skos:broader
statement is the more specific concept involved in
the assertion and its object is the more generic one.
Note on implicit
skos:broader
/skos:narrower
statements: the
properties skos:broader
and skos:narrower
are each
other's inverse. Whenever a concept X is broader than another concept Y, then
Y is a narrower concept of X according to the SKOS data model [SKOS-REFERENCE]. This can be useful for
making SKOS representations more efficient by limiting the information they
contain. In the above example, for instance, the statement ex:mammals
skos:broader ex:animals
can be left out if, before using the concept
scheme data, an OWL reasoner is used to infer it from the statement
ex:animals skos:narrower ex:mammals
.
In many cases, hierarchical relations in a concept scheme can be
considered as transitive
[OWL]. If ex:animals
is broader than ex:mammals
, which is itself broader than
ex:cats
, it makes sense to assert that ex:animals
is broader than ex:cats
. However, there are "dirtier"
hierarchies, especially in KOSs different from standard well-designed
thesauri, where such a feature would not be judged appropriate. Consider for
instance a case where ex2:vehicles
is said to be broader than
ex2:cars
, which is itself asserted to be broader than
ex2:wheels
. It may be problematic if "wheels" is automatically
inferred to be a narrower concept with respect to "vehicles". SKOS
anticipates such problems by not defining
skos:broader
and skos:narrower
as generally
transitive properties. The reader interested in representing "transitive
hierarchies" is encouraged to read Section
4.5, which presents a way to do this while retaining compatibility with
the semantics of skos:broader
defined in this section.
Note on not transitive vs. intransitive: the
SKOS model does not state that skos:broader
and
skos:narrower
are transitive. Yet this does not imply that these
properties are intransitive. Consider a concept cats
which is narrower than a concept mammals
, itself narrower than
animals
: one can assert that cats
is narrower than
animals
as well, while staying compatible with the SKOS model.
Not specifying skos:broader
as transitive implies that no new
skos:broader
statement can be inferred between
cats
and animals
by applying SKOS axioms. This does
not prevent the publishers of a SKOS concept scheme from
asserting hierarchical statements that reflect a locally transitive
behaviour.
Similarly, SKOS does not assume that hierarchical relations are by default
irreflexive. In many thesaurus guidelines, it is prohibited to have a concept
broader than itself. However, in specific cases beyond classical thesauri,
some reflexive skos:broader
statements may occur. Consider the
conversion of an existing RDFS/OWL ontology into a SKOS concept scheme. In
such a case, it is legitimate that every rdfs:subClassOf
statement will be re-interpreted as a skos:broader
statement.
However, rdfs:subClassOf
is a reflexive property, which means
that for every class C
, the statement C rdfs:subClassOf
C
is true
[OWL]. In this case every concept
would therefore have itself among its broader concepts.
Not covered in basic SKOS is the distinction between types of hierarchical relation: for example, instance-class and part-whole relationships. The interested reader is referred to Section 4.7, which describes how to create specializations of semantic relations to deal with this issue.
To assert an associative relationship between two concepts, skos:related
can be used:
ex:birds rdf:type skos:Concept; skos:prefLabel "birds"@en; skos:related ex:ornithology. ex:ornithology rdf:type skos:Concept; skos:prefLabel "ornithology"@en.
As described in the SKOS Reference [SKOS-REFERENCE], the
skos:related
property is symmetric
[OWL]. From the above RDF graph,
for instance, it follows that ex:ornithology
is the subject of a
skos:related
statement that has ex:birds
as an
object.
Note on (non-)transitivity of
skos:related
: The reader should be aware that in the
SKOS data model skos:related
is not defined as a transitive
property. A transitive skos:related
could have unwanted
consequences, as in the following example:
ex:renaissance skos:related ex:humanism. ex:humanism skos:related ex:philosophicalAnthropology. ex:philosophicalAnthropology skos:related ex:philosophyOfMind. ex:philosophyOfMind skos:related ex:cognitiveScience.
Should skos:related
be transitive,
ex:renaissance
would be then directly related to
ex:cognitiveScence
. While every individual statement makes
sense, the inferred statement may not fit what the designer of the KOS
originally intended.
Note on mixing hierarchy with association:
The transitive closure of skos:broader
is disjoint from
skos:related
. If resources A and B are related via
skos:related
, there must not be a chain of
skos:broader
relationships from A to B. The same holds of
skos:narrower
.
Semantic relationships are crucial to the definition of concepts, as many KOS guidelines emphasize it. However, next to these structured characterizations, concepts sometimes have to be further defined using human-readable ("informal") documentation, such as scope notes or definitions.
SKOS provides a skos:note
property
for general documentation purposes. Inspired by existing KOS guidelines, such
as [ISO2788] or [BS8723-2], this property is further specialized
into skos:scopeNote
, skos:definition
,
skos:example
, and skos:historyNote
to fit more
specific types of documentation.
skos:scopeNote
supplies some, possibly partial, information about the intended meaning of a
concept, especially as an indication of how the use of a concept is limited
in indexing practice. The following example is adapted from [ISO2788]:
ex:microwaveFrequencies skos:scopeNote "Used for frequencies between 1GHz to 300Ghz"@en.
skos:definition
supplies a complete explanation of the intended meaning of a concept. The
following example is adapted from [ISO2788]:
ex:documentation skos:definition "the process of storing and retrieving information in all fields of knowledge"@en.
skos:example
supplies an example of the use of a concept:
ex:organizationsOfScienceAndCulture skos:example "academies of science, general museums, world fairs"@en.
skos:historyNote
describes significant changes to the meaning or the form of a concept:
ex:childAbuse skos:historyNote "estab. 1975; heading was: Cruelty to children [1952-1975]"@en.
In addition to these notes that are intended for users of a concept
scheme, SKOS includes two specializations of skos:note
that are
useful for KOS managers or editors: skos:editorialNote
and
skos:changeNote
.
skos:editorialNote
supplies information that is an aid to administrative housekeeping, such as
reminders of editorial work still to be done, or warnings in the event that
future editorial changes might be made:
ex:doubleclick skos:editorialNote "Review this term after company merger
complete"@en.
ex:folksonomy skos:editorialNote "Check spelling with Thomas Vander Wal"@en.
skos:changeNote
documents fine-grained changes to a concept, for the purposes of
administration and maintenance:
ex:tomato skos:changeNote
"Moved from under 'fruits' to under 'vegetables' by Horace Gray"@en.
It is important to notice that the hierarchical link between
skos:note
and its different specializations allows all the
documentation associated to a concept to be retrieved in a straightforward
way. Every skos:definition
is a skos:note
, every
skos:scopeNote
is a skos:note
, and so on.
As illustrated above, SKOS documentation properties can be simply used with RDF plain literals. Section 4.2 will show that there are other possible patterns, as the range of these properties is not be restricted to literals. One important feature of simple literals, however, is the ability to use language tags, as done for labeling properties. Documentation may thus be provided in multiple languages:
ex:pineapples rdf:type skos:Concept; skos:prefLabel "pineapples"@en; skos:prefLabel "ananas"@fr; skos:definition "The fruit of plants of the family Bromeliaceae"@en; skos:definition "Le fruit d'une plante herbacée de la famille des broméliacées"@fr.
Before concluding this section, it is important to note that other,
non-SKOS properties could be used to document concepts. The dct:creator
property
from Dublin Core [DC] can for instance be used
to point to a person that created the concept:
ex:madagascarFishEagle dct:creator [ foaf:name "John Smith" ].
Concepts can be created and used as stand-alone entities. However,
especially in indexing practice, concepts usually come in carefully compiled
vocabularies, such as thesauri or classification schemes. SKOS offers the
means of representing such KOSs using the skos:ConceptScheme
class.
The following example shows how to define a concept scheme resource
(representing a thesaurus) and to describe that resource using the dct:title
and dct:creator
properties
from Dublin Core [DC]:
ex:animalThesaurus rdf:type skos:ConceptScheme; dct:title "Simple animal thesaurus"; dct:creator ex:antoineIsaac.
Once the concept scheme resource has been created, it can be linked with
the concepts it contains using the skos:inScheme
property:
ex:mammals rdf:type skos:Concept; skos:inScheme ex:animalThesaurus. ex:cows rdf:type skos:Concept; skos:broader ex:mammals; skos:inScheme ex:animalThesaurus. ex:fish rdf:type skos:Concept; skos:inScheme ex:animalThesaurus.
In order to provide an efficient access to the entry points of
broader/narrower concept hierarchies, SKOS defines a skos:hasTopConcept
property. This property allows one to link a concept scheme to the (possibly
many) most general concepts it contains, as in the (continued) animal
thesaurus example:
ex:animalThesaurus rdf:type skos:ConceptScheme; skos:hasTopConcept ex:mammals; skos:hasTopConcept ex:fish.
Concept schemes are designed to represent traditional vocabularies, and designers are encouraged to follow existing KOS guidelines (e.g., [ISO2788] or [BS8723-2]) when compiling a SKOS concept scheme. For example, as described in Section 2.2, it is recommended that no two concepts have the same preferred lexical label in a given language when they belong to the same concept scheme.
The reader should however be aware that there are some subtle differences
between SKOS concept schemes and "traditional" KOSs, mainly due to the
Semantic Web context for SKOS. Section 4.6 of the
SKOS Reference [SKOS-REFERENCE] gives an account of these
differences. One important feature of SKOS is that it is possible for the
same concept to be linked to several concept schemes, using the
skos:inScheme
property. This will be discussed in the next
section.
Finally, it is important to notice that the SKOS vocabulary only offers
limited support for containment of KOS information in a concept scheme.
skos:inScheme
and skos:hasTopConcept
link concept
schemes and concepts. Yet, there is no mechanism in SKOS to record that a
specific statement concerning these concepts, e.g. a
skos:broader
assertion, pertains to a specific concept scheme,
whereas a KOS is usually seen as consisting of both its concepts and the
links that define them. The interested reader is referred to Section 5.3 for a discussion on this topic.
Representing a KOS with SKOS not only serves as a publication mechanism, but also allows it to participate in a network of concept schemes. On the Semantic Web the true potential of data is unleashed when it is interlinked. As concepts from different concept schemes are connected together they begin to form a distributed, heterogeneous global concept scheme. A web of concept schemes can serve as the foundation for new applications that allow meaningful navigation between KOSs. This section introduces the SKOS features that enable the interlinking of concept schemes and explains how to relate conceptual resources to other resources on the Semantic Web.
Every SKOS concept is assigned a URI [COOLURIS], which makes it possible to unambiguously reference a concept in any SKOS application. This can be especially useful for establishing semantic relations between pre-existing concepts. Such mappings are crucial for applications such as information retrieval tools that use several KOSs at the same time, where these KOSs have overlapping scopes and need to be semantically reconciled; examples can be found in the SKOS Use cases and Requirements document [SKOS-UCR].
A crucial feature of mapping is the possibility to state that two concepts from different schemes have comparable meanings, and to specify how these meanings compare, even though they come from different contexts and possibly follow different modeling principles [BS8723-4]. Conceptual mappings are expected to be a key advantage of making KOSs available on the Semantic Web using SKOS.
SKOS provides several properties that map concepts between different
concept schemes. This can be done by asserting that two concepts have a
similar meaning, using the skos:exactMatch
and skos:closeMatch
properties. Two concepts from different concept schemes can also be mapped
using properties that parallel the semantic relations introduced in Section 2.3: skos:broadMatch
,
skos:narrowMatch
and skos:relatedMatch
.
Consider the following example, where two concept schemes represent different views on animals:
ex1:referenceAnimalScheme rdf:type skos:ConceptScheme; dct:title "Extensive list of animals"@en. ex1:animal rdf:type skos:Concept; skos:prefLabel "animal"@en; skos:inScheme ex1:referenceAnimalScheme. ex1:platypus rdf:type skos:Concept; skos:prefLabel "platypus"@en; skos:inScheme ex1:referenceAnimalScheme. ex2:eggSellerScheme rdf:type skos:ConceptScheme; dct:title "Obsessed egg-seller's vocabulary"@en. ex2:eggLayingAnimals rdf:type skos:Concept; skos:prefLabel "animals that lay eggs"@en; skos:inScheme ex2:eggSellerScheme. ex2:animals rdf:type skos:Concept; skos:prefLabel "animals"@en; skos:inScheme ex2:eggSellerScheme. ex2:eggs rdf:type skos:Concept; skos:prefLabel "eggs"@en; skos:inScheme ex2:eggSellerScheme.
It is possible to map the concepts in
ex1:referenceAnimalScheme
to the concepts in
ex2:eggSellerScheme
by using the mapping assertions below:
ex1:platypus skos:broadMatch ex2:eggLayingAnimals. ex1:platypus skos:relatedMatch ex2:eggs. ex1:animal skos:exactMatch ex2:animals.
A skos:closeMatch
assertion indicates that two concepts are
sufficiently similar that they can be used interchangeably in applications
that consider the two concept schemes they belong to. However,
skos:closeMatch
is not defined as transitive, which prevents
such similarity assessments to propagate beyond these two schemes. If a
concept ex1:A
is a close match for another concept
ex2:B
which is itself a close match for ex3:C
, it
does not follow from the SKOS data model that
ex1:A
is a close match for ex3:C
.
skos:exactMatch
also indicates semantic similarity—it is a
sub-property of skos:closeMatch
. However, it denotes an even
higher degree of closeness: the two concepts have equivalent meaning, and the
link can be exploited across a wider range of applications and schemes.
skos:exactMatch
is indeed transitive: if a concept
ex1:A
is an exact match for another concept ex2:B
which is itself an exact match for ex3:C
, it
does follow from the SKOS data model that ex1:A
is an exact match for ex3:C
.
Note on skos:exactMatch
vs.
owl:sameAs
: SKOS provides skos:exactMatch
to map concepts with equivalent meaning, and intentionally does not use
owl:sameAs
from
the OWL ontology language [OWL].
When two resources are linked with owl:sameAs
they are
considered to be the same resource, and triples involving these resources are
merged. This does not fit what is needed in most SKOS applications. In the
above example, ex1:animal
is said to be equivalent to
ex2:animals
. If this equivalence relation were represented using
owl:sameAs
, the following statements would hold for
ex:animal
:
ex1:animal rdf:type skos:Concept; skos:prefLabel "animal"@en; skos:inScheme ex1:referenceAnimalScheme. skos:prefLabel "animals"@en; skos:inScheme ex2:eggSellerScheme.
This would make ex:animal
inconsistent, as a
concept cannot possess two different preferred labels in a same language. Had
the concepts been assigned other information, such as semantic relationships
to other concepts, or notes, these would be merged as well, causing these
concepts to acquire new meanings.
By convention, mapping properties are used to represent links that have the same intended meaning as the "standard" semantic properties, but with a different application scope. One might say that mapping relationships are less inherent to the meaning of the concepts they involve. From the point of view of the original designer of a mapped KOS, they might even sometimes be wrong.
Mapping properties are expected to be useful in specific applications that use multiple, conceptually overlapping KOSs. By convention, mapping relationships are expected to be asserted between concepts that belong to different concept schemes.
The reader should be aware that according to the SKOS data model, the
mapping properties that "mirror" a given semantic relation property are also
sub-properties of it in the RDFS sense. For instance,
skos:broadMatch
is a sub-property of skos:broader
.
Consequently, every assertion of skos:broadMatch
between two
concepts leads by inference to asserting a skos:broader
between
these concepts.
Linking concepts by means of mappings is not the only way to interlink concept schemes. The use of URIs on the Semantic Web allows resources to be shared and reused in a distributed fashion. As a result it is possible for a SKOS concept to participate in several concept schemes at the same time. For example, a SKOS publisher can choose to locally extend an existing concept scheme by declaring any new concepts that may be needed and simply linking to concepts that have already been defined in the existing scheme.
Extension of a KOS can be especially useful when its designers (or third-party KOS publishers) want to achieve a better coverage of a domain or sub-domain, while following the principles that guided the design of the existing KOS—e.g., by re-using some of its concepts. Explicit KOS extension and re-use can also be used as a modularization mechanism, when a family of articulated KOSs (for instance microthesauri that belong to an overarching vocabulary) is designed to cover several domains and its designers want to allow specific applications to operate on given subsets of concepts.
A new concept scheme can re-use existing concepts using the skos:inScheme
property. Consider the example below, where a first concept scheme for
animals defines a concept for "cats":
ex1:referenceAnimalScheme rdf:type skos:ConceptScheme; dct:title "Reference list of animals"@en. ex1:cats rdf:type skos:Concept; skos:prefLabel "cats"@en; skos:inScheme ex1:referenceAnimalScheme.
The creator of another concept scheme devoted to cat descriptions can
freely include the reference ex1:cats
concept in her scheme, and
then reference it as follows:
ex2:catScheme rdf:type skos:ConceptScheme; dct:title "The Complete Cat Thesaurus"@en. ex1:cats skos:inScheme ex2:catScheme. ex2:abyssinian rdf:type skos:Concept; skos:prefLabel "Abyssinian Cats"@en; skos:broader ex1:cats; skos:inScheme ex2:catScheme. ex2:siamese rdf:type skos:Concept; skos:prefLabel "Siamese Cats"@en; skos:broader ex1:cats; skos:inScheme ex2:catScheme.
Note that the information source defining the new concept scheme does not
replicate information about the ex1:cats
concept, such as its
preferred label. Assuming ex1:cats
has been published, a
Semantic Web application is able to retrieve the information for this concept
by simply resolving the concept's URI
(http://www.example.com/1/cats
).
Note on owl:imports
and re-using
KOSs: The owl:imports
property provides a mechanism for importing the assertions of one OWL
ontology into another. owl:imports
may be used with SKOS
vocabularies to provide a special case of re-use/extension where a concept
scheme "imports" another concept scheme as a whole. To continue the example
above, this is achieved by including the following statement in the source
defining ex2:catScheme
:
ex2:catScheme owl:imports ex1:referenceAnimalScheme.
Using owl:imports
in this way has some
ramifications. First, the domain and range of owl:imports
is
owl:Ontology
, while skos:ConceptScheme
is defined
as an owl:Class
. Thus asserting that a concept scheme imports
another via owl:imports
leads to the consequence that the
instances of skos:conceptScheme
involved in the import are also
inferred to be instances of owl:Ontology
. This in turn results
in an OWL Full ontology (due to the dual use of a URI as a class and
ontology, see Section 4.2 of
the OWL Semantics document [OWL-SEMANTICS]).
Second, under the OWL Full semantics (see Section 5.3 of
the OWL Semantics [OWL-SEMANTICS]), the intended interpretation
of owl:imports
is that the RDF graph retrieved from the imported
URI is added to the importing graph. Users should be aware of this, and any
alternative interpretations should be avoided. In particular, there is no
logical dependency between skos:inScheme
and
owl:imports
: the use of owl:imports
will not result
in the presence of any skos:inScheme
statements other than the
ones already asserted in the imported graph. If we consider the example
above, owl:imports
has been used to state that one concept
scheme logically imports another. But even though
ex1:referenceAnimalScheme
contains the triple
ex1:Elephant skos:inScheme ex1:referenceAnimalsScheme.
the triple
ex1:Elephant skos:inScheme ex2:catScheme.
should not be inferred to be present in the
graph defining ex2:catScheme
.
If an application is concerned with practical provenance or ownership information, additional steps may be required in order to maintain the provenance or assert the authority of imported triples, as mentioned in Section 5.3.
Though formally not belonging to the features defining a KOS, the link between a concept and the resources which are about this concept is fundamental in many KOS applications, such as document indexing and document retrieval. This becomes even more important in a Semantic Web context, where there is a crucial need to annotate documents with conceptual units which define their subject.
While the SKOS vocabulary itself does not include a mechanism for
associating an arbitrary resource with a skos:Concept
,
implementors can turn to other vocabularies. Dublin Core, for instance,
provides a dct:subject
property
[DC]:
ex1:platypus rdf:type skos:Concept; skos:prefLabel "platypus"@en. <http://en.wikipedia.org/wiki/Platypus> rdf:type foaf:Document; dct:subject ex1:platypus.
Note that a same resource can have several subjects, and hence be involved
in several dct:subject
statements. These subjects can clearly
come from different concept schemes, resulting for instance from a
distributed annotation process.
Beyond the above mentioned features, SKOS proposes a number of vocabulary elements or guidelines that deal with more advanced representation needs, making SKOS compatible with a broad range of KOS modeling approaches. These are especially designed to meet requirements which were raised in the SKOS Use Cases and Requirements [SKOS-UCR], but which were only present in a smaller number of use cases:
This section concludes with a general note on the extensibility of the SKOS model, paving the way for even more specialized refinements of the vocabulary presented in this Primer.
SKOS makes it possible to define meaningful groupings or "collections" of concepts. Such groupings are normally rendered in thesauri as in the following example:
milk <milk by source animal> cow milk goat milk buffalo milk
These collections can be used to represent "arrays" in thesaurus terminology, in which the term "milk by source animal" is a "node label" [WillpowerGlossary]. There is consensus that a node label does not represent a label for a concept in its own right. Therefore, specific entities have to be introduced to represent them.
To correctly model such concept collection structures, SKOS introduces a
skos:Collection
class. Instances of this class group specific concepts by means of the
skos:member
property, as in the following example:
ex:milk rdf:type skos:Concept; skos:prefLabel "milk"@en. ex:cowMilk rdf:type skos:Concept; skos:prefLabel "cow milk"@en; skos:broader ex:milk. ex:goatMilk rdf:type skos:Concept; skos:prefLabel "goat milk"@en; skos:broader ex:milk. ex:buffaloMilk rdf:type skos:Concept; skos:prefLabel "buffalo milk"@en; skos:broader ex:milk. _:b0 rdf:type skos:Collection; skos:prefLabel "milk by source animal"@en; skos:member ex:cowMilk; skos:member ex:goatMilk; skos:member ex:buffaloMilk.
Note that in the example above the collection was defined as a blank node,
i.e. no defined URI was allocated. URIs may be allocated to collections, but
usually this is not necessary. Also, skos:prefLabel
has been
used to assign a lexical label to the Collection, as this property (as other
SKOS labeling properties) can be used with non-conceptual resources.
Sometimes it is important to capture the order of concepts in a
collection, such as when concepts are listed in alphabetical or chronological
order. To define an ordered collection of concepts the skos:OrderedCollection
class is used, together with the skos:memberList
property. This property links an instance of
skos:OrderedCollection
to a (possibly blank) node of type
rdf:List
, following the pattern that enables the definition of
RDF
collections [RDF-PRIMER]. For
example:
ex:infants rdf:type skos:Concept; skos:prefLabel "infants"@en. ex:children rdf:type skos:Concept; skos:prefLabel "children"@en. ex:adults rdf:type skos:Concept; skos:prefLabel "adults"@en. _:b0 rdf:type skos:OrderedCollection; skos:prefLabel "people by age"@en; skos:memberList _:b1. _:b1 rdf:first ex:infants; rdf:rest _:b2. _:b2 rdf:first ex:children; rdf:rest _:b3. _:b3 rdf:first ex:adults; rdf:rest rdf:nil.
Note that, according to the SKOS data model, collections are disjoint from
concepts. It is therefore impossible to use SKOS semantic relations (see Section 2.3) to have a collection directly fit into a SKOS
semantic network. In other words, grouping concepts into collections does not
replace assertions about the concepts' place in a concept scheme. For
instance, in the above "milk" example, all source-defined milks must be
explicitly attached to a more generic ex:milk
using the
skos:broader
property:
ex:cowMilk skos:broader ex:milk. ex:goatMilk skos:broader ex:milk. ex:buffaloMilk skos:broader ex:milk.
A systematic (hierarchical) display can then be generated including the
concept grouping "milk by source animal", as presented in the example
introducing this sub-section. The skos:broader
hierarchy and the
collection membership information can be used for this, but the process still
requires a dedicated algorithm, the implementation of which is left to
specific applications.
One may wonder whether using collections is desirable, as they add
complexity to the representations applications have to manipulate. In fact,
for some cases, e.g. when KOSs are mainly intended as navigation hierarchies,
it seems more intuitive to represent "node labels" or "guide terms" as
instances of skos:Concept
, and to use normal semantic
relationships for linking them to other concepts. Take the following variant
of the "milk" example:
ex3:milkBySourceAnimal rdf:type skos:Concept; skos:prefLabel "milk by source animal"@en; skos:broader ex3:milk; skos:narrower ex3:cowMilk; skos:narrower ex3:goatMilk; skos:narrower ex3:buffaloMilk.
The choice between the two representation options remains open, depending on the application at hand. Readers should however be aware that not using collections, even if more intuitive, may result in a harmful loss of semantic accuracy. For many description applications, for instance, "node labels" are entities of really specific nature, and must not be used as object indices alongside "normal" concepts. Representing them as mere concepts is therefore clearly not a best practice.
As shown in Section 2.4, SKOS allows concepts to be annotated by attaching various notes to them. It is worth noticing that the SKOS Reference does not restrict the range of resources that assertions can use in the object position. This leads to different usage patterns, three of which are explained—and recommended—in this document.
Documentation as an RDF literalHere, documentation statements have simple RDF literals as objects, as illustrated by all examples of Section 2.4. This is the simplest way to document concepts, and it is expected to fit most common applications.
Documentation as a Related Resource DescriptionIn this second pattern, the object of a documentation statement consists of a general non-literal RDF node—that is, a resource node (possibly blank) that can be the subject of further RDF statements [RDF-PRIMER]. This is especially useful to represent with RDF more information about the documentation itself, such as its creator or creation date. This is typically done using the RDF rdf:value utility property, as in the following example, which uses a blank node:
ex:tomato skos:changeNote [ rdf:value "Moved from under 'fruits' to under 'vegetables'"@en; dct:creator ex:HoraceGray; dct:date "1999-01-23" ]. ex:HoraceGray rdf:type foaf:Person; foaf:name "Horace Gray".
Documentation as a Document Reference
A third option consists of introducing, as the object of a documentation statement, the URI of a document, for instance a Web page. Note that this pattern, closely related to the previous one, also allows the definition of further metadata for this document using RDF:
ex:zoology skos:definition ex:zoology.txt. ex:zoology.txt dct:creator ex:JohnSmith.
Some applications require the creation of explicit links between the
labels associated to concepts. For example, consider the relationship between
a preferred label for a concept "Corporation" and its abbreviation "Corp."
coined as an alternative label, or a translation link between two labels in
different languages: "Cow"@en
and "Vache"@fr
. The
use of SKOS lexical labeling properties, e.g. skos:prefLabel
, is
restricted to RDF literals. Therefore these labels cannot be the subject of
an RDF statement, and a direct relationship cannot be asserted between
them.
To solve this representation issue, the SKOS vocabulary has been appended
with an optional extension for labels, SKOS-XL [SKOS-REFERENCE]. This extension introduces
a skosxl:Label
class that allows labels to be treated as first-order RDF resources. Each
instance of this class shall first be attached to a single RDF literal via
the skosxl:literalForm
property. Consider the example where the
concept "Food and Agriculture Organization" is labeled by both the official
name and the acronym of the institution. The two labels can be represented in
the following way:
ex:FAOlabel1 rdf:type skosxl:Label; skosxl:literalForm "Food and Agriculture Organization"@en. ex:FAOlabel2 rdf:type skosxl:Label; skosxl:literalForm "FAO"@en.
skosxl:Label
instances can then be related to concepts using
properties (skosxl:prefLabel
,
skosxl:altLabel
,
skosxl:hiddenLabel
)
that mirror the standard literal-based labeling
constructs. Finally, these instances can be linked together by skosxl:labelRelation
statements:
ex:FAO rdf:type skos:Concept; skosxl:prefLabel ex:FAOlabel1; skosxl:altLabel ex:FAOlabel2. ex:FAOlabel2 skosxl:labelRelation ex:FAOlabel1.
Such a solution is however not complete: an "acronym-sensitive"
application would miss the actual information that the two labels are indeed
in an acronymy relationship. Such an application would also miss the
direction of the link. SKOS-XL users are therefore encouraged to
specialize skosxl:labelRelation
so as to fit their
application-specific requirements, as in the following:
ex:isAcronymOf rdfs:subPropertyOf skosxl:labelRelation. ex:FAOlabel2 ex:isAcronymOf ex:FAOlabel1.
Note that the SKOS-XL data model ensures that using such a pattern remains
compatible with the standard SKOS labeling practice. If an instance of
skosxl:Label
is attached to a concept by, say, a
skosxl:altLabel
statement, it follows from the SKOS-XL data
model that the literal form of the skosxl:Label
instance is
related to this concept by a standard skos:altLabel
statement.
In the above example, ex:FAO
therefore has
"FAO"@en"
as alternative (literal) label.
Indexing practices involving thesauri and other KOSs often include the notion of coordination. Coordination is an activity in which concepts from a KOS are combined. In general there are two kinds of coordination: pre-coordination and post-coordination [WillpowerGlossary]. The key distinction between the two hinges on when the actual coordination occurs in relation to an information retrieval event.
Pre-coordination is done prior to information retrieval, by a KOS maintainer, or by an indexer who is using a KOS—for example, if an indexer takes two existing concepts from a concept scheme, such as "Bicycles" and "Repairing", and explicitly combines them with a given syntax such as "Bicycles--Repairing" to index a particular document.
Post-coordination on the other hand is performed as part of an information retrieval task—for example, if a given document is indexed with two distinct concepts "Bicycles" and "Repairing" and a user decides to perform a search for all documents that are indexed with "Bicycles" and "Repairing".
Post-coordination as an information retrieval activity lends itself to indirect representation as a SPARQL query to access RDF data [SPARQL]. For example, given two distinct concepts:
ex:bicycles skos:prefLabel "Bicycles"@en. ex:repairing skos:prefLabel "Repairing"@en.
one could construct a SPARQL query to return only the documents that are indexed with both concepts
SELECT ?document WHERE { ?document dct:subject ex:bicycles. ?document dct:subject ex:repairing. }
However the SKOS vocabulary itself does not provide any mechanism for
expressing that a given concept consists of a pre-coordination of other
concepts. Of course it is perfectly feasible to extend SKOS to establish a pattern for
representing coordinated concepts. For example it has been suggested
that a new property such as ex:coordinationOf
could be
established:
ex:coordinationOf a rdf:Property; rdfs:domain skos:Concept; rdfs:range rdf:List.
which could then be used in assertions such as:
ex:bicyclesRepairing a skos:Concept; ex:coordinationOf (ex:bicycles ex:repairing); skos:prefLabel "Bicycles--Repairing"@en.
It has also been suggested that OWL itself could be used to coordinate concepts:
ex:bicyclesRepairing a skos:Concept; owl:intersectionOf (ex:bicycles ex:repairing); skos:prefLabel "Bicycles--Repairing"@en.
However, established patterns for pre-coordinations of this kind have not
yet emerged in the SKOS community. ex:coordinationOf
(or some
equivalent extension), and the ramifications of using
SKOS with OWL have not been explored fully enough yet to warrant
inclusion in the SKOS vocabulary. Rather than commit to a design pattern that
has not been proven useful, the Semantic Web Deployment Group decided to
postpone the issue of coordination, to
allow extension patterns to organically emerge as SKOS is deployed. The hope
is that as successful patterns are established, they can be published on the
Web as an extension vocabulary to SKOS
and documented as a W3C Note or some equivalent.
As described in Section 2.3.1, the properties
used to represent KOS hierarchies, skos:broader
and
skos:narrower
, are not defined as transitive. As shown in Fig.
4.5.1 (i) & (ii), this means that their semantics do not
support inferences of the type: if "animals" is broader than "mammals"
and "mammals" is broader than "cats", then "animals" is broader than
"cats".
For the applications that require such semantics—for instance to perform
query expansion—SKOS features two specific properties, skos:broaderTransitive
and skos:narrowerTransitive
.
These are defined as transitive super-properties of skos:broader
and skos:narrower
[SKOS-REFERENCE]. This pattern enables,
using a Semantic Web inferencing tool, access to the "transitive closure" of
a hierarchy expressed with skos:broader
and
skos:narrower
.
Consider the example of Fig. 4.5.1 (i):
ex:animals skos:prefLabel "animals"@en. ex:mammals skos:prefLabel "mammals"@en; skos:broader ex:animals. ex:cats skos:prefLabel "cats"@en; skos:broader ex:mammals.
When reading the above triples, a reasoner makes use the definition of
skos:broaderTransitive
as a super-property of
skos:broader
to infer the following statements:
ex:cats skos:broaderTransitive ex:mammals. ex:mammals skos:broaderTransitive ex:animals.
The transitivity of skos:broaderTransitive
then causes the
desired statement to be inferred:
ex:cats skos:broaderTransitive ex:animals.
These two steps are showed in the following figure:
The use of the skos:broaderTransitive
super-property allows
communities of practice to exploit transitive interpretations of hierarchical
networks as they see fit, while not interfering with the semantics of
skos:broader
, which cannot enforce such transitivity.
Intuitively, one can interpret skos:broader
statements as
explicitly asserted direct parent links, while
skos:broaderTransitive
is used to reflect more general (and
possibly indirect) ancestor relationships.
Note on supposed "transitiveness
inheritance": the super-property link between
skos:broader
and skos:broaderTransitive
may look
counter-intuitive at first glance. Here, a non-transitive property is defined
as a child of a transitive one, while not inheriting its transitiveness. This
is however fully compliant with RDFS/OWL semantics for
rdfs:subPropertyOf
[OWL]: a property P is a sub-property of Q if
and only if every time P holds between two resources, then Q also holds
between them. This does not enforce any transitiveness inheritance: on the
contrary, the sets of all couples of resources related by P (its
graph), as a subset of Q's, is likely to miss some of the couples
that make Q transitive.
Some KOSs, for example classification systems such as the Universal Decimal Classification [UDC], use notations as the primary means of access to the concepts they contain. Notations are symbols which are not normally recognizable as words or sequences of words in any natural language and are thus usable independently of natural-language contexts. They are typically composed of digits, complemented with punctuation signs and other characters, as in the following UDC example:
512 Algebra 512.6 Special branches of algebra
SKOS allows notations to be represented in two ways, depending on the
priorities of the concept scheme publisher. The first, preferred technique is
to use the skos:notation
property. This property allows a concept to be attached to an RDF typed
literal—a literal with an explicit datatype [RDF-PRIMER]. The datatype of the literal
specifies a syntax encoding scheme, which fits the usage of notations in the
concerned KOS. The value of the literal is the notation itself (in this case
the classification code itself):
ex:udc512 skos:prefLabel "Algebra" ; skos:notation "512"^^ex:myUDCNotationDatatype .
Section 6.5.1 of the SKOS Reference gives more detail on how to handle datatypes [SKOS-REFERENCE]. This approach can be especially useful if a KOS publisher wants to provide users with processing rules that are specific to the KOS's notation scheme. For instance, many classification systems have specific syntax rules which allow complex notations to be decomposed, leading to the linking of the corresponding concept to other, simpler concepts. Also, this pattern can help creators of SKOS tools and KOS publishers who want to have notations displayed in a dedicated way.
However, the management of such datatypes can be cumbersome. Further, the
previous pattern is not really needed when publishers consider the notations
themselves to be simple language-independent labels. In such cases, it is
possible to use one SKOS labeling property, for instance
skos:prefLabel
, in combination with private use language tags
(or subtags) as defined by RFC 4646 [RFC4646]. This pattern was first proposed for a
list of coded countries [COUNTRYCODES-SKOS] from which the following
example is adapted:
iso3166:FR skos:prefLabel "France"@en ; skos:prefLabel "FRA"@en-x-notation-threeletter ; skos:prefLabel "250"@x-notation-numerical.
Note that it is unlikely that notations represented in such a manner will benefit from notation-specific mechanisms (such as display procedures) in SKOS tools. This would indeed require different actors to agree on the use of shared subtags, which conflicts with these being defined as "private". By default, users should expect these notations to be treated, in accordance with the SKOS model, as mere labels.
SKOS is intended to serve as a common denominator between different modeling approaches. As such it is hoped that the current vocabulary specification will allow many existing KOSs to be ported to the Semantic Web. However, the great variety of KOS models makes it impossible to capture every detail of these models while still retaining the first "S" ("simple") in "SKOS".
Applications that require finer granularity will greatly benefit from SKOS being a Semantic Web vocabulary. SKOS can indeed be seamlessly extended to suit the specific needs of a particular KOS community while retaining compatibility with applications that are based on the core SKOS features.
This can mostly be done by specializing existing SKOS constructs
into more specific ones. Users can create their own properties and classes
and attach them to the standard SKOS vocabulary elements by using the
rdfs:subPropertyOf
and rdfs:subClassOf
properties
from the RDF Schema vocabulary [RDF-PRIMER].
The example in Section 4.3
illustrates how skosxl:labelRelation
can be specialized into a
semantically richer property devoted to acronym link representation. Other
uses are possible, such as creating different "flavors" of the properties
skos:broader
and skos:narrower
. Thesaurus standards
indeed identify a small number of kinds of hierarchical relation, such as
generic, part-whole, or instance-class [ISO2788]. The SKOS approach allows an application
designer to create new properties to capture this distinction, and to declare
them as sub-properties of skos:broader
:
ex:broaderGeneric rdfs:subPropertyOf skos:broader. ex:broaderPartitive rdfs:subPropertyOf skos:broader. ex:broaderInstantive rdfs:subPropertyOf skos:broader.
Every ex:broaderPartitive
statement between two concepts, for
instance, can be formally interpreted by a proper Semantic Web reasoning
engine. This interpretation will yield the inference of a
skos:broader
statement between these concepts—a piece of
information which may then be exploited by basic SKOS tools.
Note on tampering with the SKOS vocabulary itself: In general, it is best to avoid stating triples where a URI from the SKOS vocabulary is in the subject position. By doing so, one may alter the SKOS data model and introduce unwanted side effects. This may then compromise the interoperability of vocabularies. If one wants to adapt the behavior of the "built-in" vocabulary to specific cases, one should first consider introducing one's own constructs as sub-classes or sub-properties.
Of course the creators of extensions to SKOS are encouraged to publish them, e.g., using the SKOS public mailing list (public-esw-thes@w3.org). Such extensions might correspond to shared concerns and thus be re-usable across different applications. In turn, re-use is likely to bring community feedback, helping to enhance the quality of published extensions.
As seen above, SKOS is an RDF/OWL vocabulary which can be seamlessly extended to fit specific requirements. Likewise, SKOS features can also be used on the Semantic Web as a complement to other modeling vocabularies. This section gives examples of re-using SKOS labeling properties to describe resources that are not necessarily SKOS concepts. It then deals with the specific problem of articulating SKOS concepts with classes as defined by the ontology language OWL.
Note: this section deals with the issues arising when an application requires SKOS features to be used in coordination with other modeling approaches. Users not having such a requirement may skip it.
It is possible to use SKOS labeling properties to label resources that are
not of type skos:Concept
. Consider these triples that describe
Tim Berners-Lee:
<http://www.w3.org/People/Berners-Lee/card#i> rdf:type foaf:Person;
foaf:name "Timothy Berners-Lee";
rdfs:label "TBL";
skos:prefLabel "Tim Berners-Lee"@en.
An application that wishes to display a label for this resource is able to
identify "Tim Berners-Lee" as the preferred label instead of having to choose
between the equally compatible labels rdfs:label
"TBL"
or the foaf:name
"Timothy
Berners-Lee"—these labels are compatible because foaf:name
is
a sub-property of rdfs:label
.
Another example are human-readable labels on classes, properties and
individuals in OWL ontologies, which are normally expressed using
rdfs:label
alone. Consider the following triples that describe
humans:
ex:Human rdf:type owl:Class; rdfs:label "human"@en; rdfs:label "man"@en.
An application would have difficulty determining the correct label to
display to the user since both labels have the same weight. The semantics of
skos:prefLabel
allow implementors to explicitly define the
preferred label for a given resource. In general the ability to reuse
vocabulary elements from SKOS and other RDF vocabularies as needed is what
gives RDF much of its expressive power.
The SKOS
Reference defines skos:Concept
as an OWL class [SKOS-REFERENCE]:
skos:Concept rdf:type owl:Class.
Thus, instances of skos:Concept
(e.g.
ex:Painting
in an art vocabulary) are in OWL terms
individuals.
ex:Painting rdf:type skos:Concept.
This raises the question whether a SKOS concept instance such as
ex:Painting
can be treated as a class in its own right. For
example, can users define properties of ex:painting
such as
ex:support
:
ex:support rdf:type owl:DatatypeProperty. ex:support rdfs:domain ex:Painting.
One might ask the question: why would someone want to do this? Well,
conceptually a class such as skos:Concept
can be seen as a
metaclass: its instances are the concepts occurring in a vocabulary. So, it
is conceivable that SKOS users want to specify class-level characteristics of
SKOS concepts, for example that paintings have supports or that cheese has a
country of origin.
It should be pointed out that SKOS does not take a stance with respect to the flavor of OWL—OWL Full or OWL-DL [OWL-REFERENCE]—to be used together with SKOS. OWL Full users will be able to handle the situation above by treating instances of SKOS concepts explicitly as classes, e.g. by adding statements of the form:
ex:Painting rdf:type owl:Class.
This is possible because OWL Full does not require the sets of classes and individuals to be disjoint. People who wish to use the DL flavor of OWL cannot use this metamodeling mechanism, as the disjointness condition between classes and individuals must hold for any OWL-DL ontology. The OWL-DL users interested in linking OWL classes to SKOS concepts have to keep these formally distinct. They can nevertheless use dedicated OWL annotation properties to bridge them, provided they can create and use their own extension for SKOS, as in:
ex:PaintingClass rdf:type owl:Class. ex:PaintingConcept rdf:type skos:Concept. ex:PaintingClass ex:correspondingConcept ex:PaintingConcept.
Note that at the time of writing, the recently started OWL Working Group [OWL-WG] had been chartered to handle (some forms of) metamodeling within a description-logic framework. This might allow OWL-DL users to opt for patterns that are easier to exploit.
Summarizing, the relationship between SKOS concepts and OWL classes/individuals is as follows:
In a context of networked KOSs, some applications may require the provenance or ownership of SKOS statements to be tracked, for instance for trust purposes. A specific issue is how to establish explicit links between a concept scheme and every piece of information that is stated in the original KOS it represents, including for instance semantic relationships between concepts.
Such functionality, albeit identified as a candidate requirement [SKOS-UCR], is currently outside the scope of SKOS. In RDF, statements comes as context-free triples, which makes it difficult to represent containment and provenance.
However, solutions for such problems have been proposed, such as named
graphs [NAMED-GRAPHS], and the use
of RDF
Datasets in SPARQL [SPARQL]. A SKOS
concept scheme can be related to an RDF Dataset, or even asserted to be such
a Dataset, which enables the creation of SPARQL queries dealing with some
form of provenance or containment. Continuing the example of Section 3.2, and assuming that
ex1:referenceAnimalScheme
and ex2:catScheme
have
been managed as appropriate RDF Datasets (here, named graphs), the query
SELECT ?x ?y WHERE { GRAPH ex2:catScheme { ?x skos:broader ?y } }
may return (ex2:abyssinian, ex1:cat)
as a result, while this
tuple would not appear among the results of
SELECT ?x ? WHERE { GRAPH ex1:referenceAnimalScheme { ?x skos:broader ?y } }
Readers should nevertheless be aware that these mechanisms have not been widely used at the time of writing, and that different standard practices could emerge in the future.
The authors would like to thank Alistair Miles and Dan Brickley for the SKOS Core Guide (which this Primer is largely based on); as well as Tom Baker, Guus Schreiber and Sean Bechhofer who contributed significant portions of this text. Semantic Web Deployment Group members Tom Baker, Margherita Sini, Quentin Reul also provided extensive reviews during the publication process.
This document is the result of extended discussions within the Semantic Web Deployment Group as a whole. Working Group members not already mentioned include (in alphabetical order): Ben Adida, Diego Berrueta, Jeremy Carroll, Michael Hausenblas, Elisa Kendall, Vit Novacek, Jon Phipps, Clay Redding, Daniel Rubin, Manu Sporny, and Ralph Swick.
Public comments (especially via the public-esw-thes@w3.org mailing list) from the following individuals provided invaluable guidance, suggestions and corrections: Mark van Assem, Stephen Bounds, Dan Brickley, Johan De Smedt, Stella Dextre-Clarke, Alasdair Gray, Andrew Houghton, Simon Jupp, Carl Mattocks, Emma McCulloch, Mikael Nilsson, Alan Ruttenberg, Aida Slavic, Simon Spero, Doug Tudhope, Bernard Vatant, Jakob Voss, Leonard Will, Sue Ellen Wright.
SKOS owes much to decades of efforts in the KOS community, in the form of applications, guidelines and standard formats. The compatibility between the SKOS model and two such efforts, ISO 2788 specifications for monolingual thesauri [ISO-2788] and ISO 5964 specifications for multilingual thesauri [ISO-5964] was specifically raised as a candidate requirement in the SKOS Use Case and Requirements [SKOS-UCR].
SKOS does not itself specify rules on how to create concept schemes, however its data model reflects some KOS construction principles. The design of its vocabulary has also been especially influenced by standard thesaurus guidelines, as these are among the most mature proposals in the KOS field. In particular, there are many common points between SKOS and ISO 2788/5964. The following table summarizes the parallels and highlights ways in which the design of SKOS varies from ISO recommendations. It is hoped that this will help future efforts to port thesauri that follow the ISO guidelines into SKOS.
The reader should be aware that this comparison must not by any means be interpreted as a limitation of the scope of SKOS to standard thesauri. As already said in this document, SKOS can be used—possibly with appropriate extensions—for other types of KOS, or thesauri that do not follow the ISO guidelines.
KOS design aspect | ISO 2788/5964 | SKOS |
concepts vs. terms | In ISO standards, thesauri are indexing languages which consist of
terms.
ISO 2788 discusses extensively the crafting of terms, focusing for
instance on their form. For example, explicit qualifiers are used to
distinguish homographs, e.g. |
Concepts are the central modeling primitive of SKOS. Terms
in ISO standards correspond to labels of SKOS concepts.
SKOS, as a simple publishing vehicle, does not propose rules on label design. Further, since SKOS uses simple literals to represent labels, it is not possible to express term-forming mechanisms such as qualification formally and explicitly. For this, and for other cases of attaching information to labels and not to the concept they express, the SKOS-XL extension must be used (see Section 4.3). |
intra-KOS semantic relationships — equivalence | Terms can be semantically equivalent. They are then distinguished
between preferred and non-preferred, using the
USE and UF (used for) relations.
It is assumed that a non-preferred term can only point to one equivalent preferred term, the latter being the main entry point for the concept they both express. |
Equivalent terms are represented as labels attached to a same
concept. By default, there is no direct relationship between these
labels. As in ISO 2788, preferred labels are distinct from
non-preferred (alternative) ones. However, SKOS further
allows to distinguish hidden labels.
A concept can have only one preferred label (per language). Inside a same concept scheme, different concepts can however share a preferred label, though this is not recommended. |
intra-KOS semantic relationships — other links | Beyond the equivalence relations USE and
UF , three types of link are used to semantically relate
terms. BT (broader term) and NT (narrower
term) express that a term's meaning is more general than another's.
RT (related term) is used when a (non-hierarchical)
associative link holds between meanings, which can be useful for
applications which exploit the thesaurus.
ISO 2788 separates three kinds of The validity of logical tests in well-formed thesauri leads to transitive interpretations of the hierarchy, for which a term can reasonably admit all its ancestors as superordinates. |
skos:broader , skos:narrower and
skos:related mirror BT , NT and
RT at the level of concepts.
However as SKOS has a wider scope in terms of KOS types, it does
not make any recommendation as precise as in ISO 2788 on what is a
valid hierarchy. It is mostly up to the KOS publishers to ensure that
the links in their schemes will not conflict with what is observed in
general KOS practice—of which thesauri are only part. SKOS instead
focuses on separating explicitly asserted "parent-child" links
( SKOS also allows for specializing semantic relationships (see Section 4.7). It does not, however, propose a standard set of such specializations. Rather, it is expected that these will come from other standards and guidelines, such as ISO 2788 itself. |
syntactical composition of terms | ISO 2788 features equivalence relations that link terms to
combinations of other terms (USE + , UF + ),
as in coal mining USE coal + mining . |
By default, SKOS does not feature one-to-many concept-to-concept or
concept-to-label links. Extensions might be however devised to
address this shortcoming, e.g. by specializing
skos:Concept or skosxl:Label . |
node labels | Thesaurus arrays play an important role regarding the rendering of term hierarchy in a systematic display. They are for example the main vehicle for faceted organization of thesauri. | SKOS allows the representation of groupings of concepts. But it focuses on the conceptual level, and no construct is given that biases towards a specific display strategy. As a result, collections in SKOS are not explicitly related to one "parent" concept. This link must be (re-)created via a specific display algorithm, or by using an ad-hoc extension. |
documentation notes | ISO 2788 proposes to attach scope notes and definitions to terms
using the SN abbreviation. |
SKOS has more types of note for concepts: scope notes, definition, history note, etc. These properties can be further extended to match specific requirements. |
notations | ISO guidelines target standard thesauri. As a result, they do not address the issue of notations as used in other types of KOS. | There are two ways to attach represent notations: either via the
skos:notation property, or by using simple labeling
properties combined with specific language tags (see Section 4.6). |
concept schemes | In ISO 2788, there is no explicit rendering of thesauri themselves, as terms are only considered in the context of one indexing vocabulary. | SKOS is influenced by the possibility of having several KOSs
co-exist. A ConceptScheme class is proposed to represent
them explicitly and to attach descriptive metadata to them, even
though SKOS itself does not feature specific constructs for this. The
link between a KOS and its concepts is explicit, and a same concept
can belong to several KOSs. |
top concepts | In a thesaurus display, the TT abbreviation can be
used to refer to the topmost term of the hierarchy to which displayed
terms belong. |
skos:hasTopConcept is used to relate a concept scheme
to the concepts that constitute entry points in its hierarchy. |
language management | In ISO 2788 terms should come from a same language.
ISO 5964 proposes to have several languages co-exist in a same thesaurus. The terms from each language form however quite independent parts of the thesaurus, only related to each other by translation links. |
From a model perspective, concepts are language-independent : a concept can have labels in different languages. Labels can indeed be declared as language-specific, using RDF literal language tags. Several languages may therefore be tightly integrated in a same concept scheme. |
inter-KOS mapping relationships | Semantic mapping relations are only considered by ISO 5964 in the
context of multilingual thesauri, as a further characterization for
the translation. The types discussed are:
Note that ISO 5964 addresses many issues that are outside the scope of SKOS, such as transferring hierarchical and associative relations from one language to the other, or coining new terms in a language when a semantic equivalent cannot be found for terms in other languages. |
SKOS mapping relations mirror relatively well ISO 5964 types. For
example, skos:closeMatch and
skos:exactMatch separate cases where semantic
equivalence is not exact but can be accepted for a given application,
from cases where equivalence is perfectly valid from a semantic
perspecitve
For an individual multilingual KOS, however, equivalence links in
ISO 5964 may be represented in SKOS by attaching equivalent terms as
labels of a same concept. This fits the approach of ISO 5964, which
only makes it necessary to link preferred terms: such links can be
transfered at the level of the concepts these terms express. Yet ISO
5964 also allows to relate non-preferred terms (e.g.,
Single-to-multiple translations cannot be represented in SKOS. As for syntactic combination of terms within one thesaurus, extensions to the standard model are required. Note finally that ISO 5964 discusses extensively the display of multilingual thesauri. SKOS does not address this. But as for simple thesauri, ISO 5964 displays can be implemented on top of SKOS data—except in the case of the single-to-multiple mappings mentioned above. |