Shapes Constraint Language (SHACL)

Abstract

This document defines the SHACL Shapes Constraint Language, a language for validating RDF graphs against a set of conditions. These conditions are provided as shapes and other constructs expressed in the form of an RDF graph. RDF graphs that are used in this manner are called "shapes graphs" in SHACL and the RDF graphs that are validated against a shapes graph are called "data graphs". As SHACL shape graphs are used to validate that data graphs satisfy a set of conditions they can also be viewed as a description of the data graphs that do satisfy these conditions. Such descriptions may be used for a variety of purposed beside validation, including user interface building, code generation and data integration.

1. Introduction

This document specifies SHACL (Shapes Constraint Language), a language for describing and validating RDF graphs. This section introduces SHACL with an overview of the key terminology and an example to illustrate basic concepts.

1.1 Terminology

Throughout this document, the following terminology is used.

Basic RDF Terminology

This document uses the terms RDF graph, RDF triple, IRI, literal, blank node, node of an RDF graph, RDF term, and subject, predicate, and object of RDF triples as defined in RDF 1.1 Concepts and Abstract Syntax [rdf11-concepts]. SHACL can be used with RDF graphs that are obtained by any means, e.g. from the file system, HTTP requests, or RDF datasets. SHACL makes no assumptions about whether a graph contains triples that are entailed from the graph under any RDF entailment regime.

Property Values and Paths

The values of (or for) a property p for a node n in an RDF graph are the objects of the triples in the graph that have n as subject and p as predicate. A property path is a possible route in a graph between two graph nodes. SHACL supports a subset of the property path syntax from SPARQL 1.1, including inverse paths and sequences.

SHACL Subclass, SHACL superclass

A node Sub in an RDF graph is a SHACL subclass of another node Super in the graph if there is a sequence of triples in the graph each with predicate rdfs:subClassOf such that the subject of the first triple is Sub, the object of the last triple is Super, and the object of each triple except the last is the subject of the next. If Sub is a SHACL subclass of Super in an RDF graph then Super is a SHACL superclass of Sub in the graph.

SHACL Type

The SHACL types of a node in an RDF graph are its values for rdf:type in the graph as well as the SHACL superclasses of these values in the graph.

SHACL Class

Nodes in an RDF graph that might be subclasses, superclasses, or types of nodes in the graph are often referred to as SHACL class. SHACL makes no assumption whether a SHACL class has any particular value for rdf:type in the graph.

SHACL Instance

A node in an RDF graph is a SHACL instance of a SHACL class in the graph if one of its SHACL types is the given class.

Data Graph, Shapes Graph, Validation, Report, Result, Violation, Failure

SHACL defines what it means for an RDF graph, referred to as the data graph, to validate against an RDF graph containing shapes, referred to as the shapes graph. The result of validation is a validation report including validation results such as informational results, warnings and violations. Validation may also result in a failure, which is reported by a SHACL validation engine to indicate that a request could not be handled. Failures are not represented as part of the validation report, but through implementation-specific channels. Validation of a shapes graph against a data graph involves validating each shape in the shapes graph against the data graph. A node in a data graph is said to validate against a shape if validation of that node against the shape neither produces any validation results that are violations nor results in a failure.

Constraint

A constraint is a node in the shapes graph that determines how to validate focus nodes based on the values of properties and other characteristics of the node. Constraints can, for example, require that a focus node be an IRI or that a focus node has a particular value for a property and also a minimum number of values for the property. Constraints that are about a particular property or path and its values for the focus node are called property constraints. Constraints that are about the focus node itself are called focus node constraints. Constraints can also have non-validating properties (such as names and default values) that do not lead to validation results.

Constraint Component, Parameter

A constraint component represents a part of a constraint that is determined by the values of one or more properties. These properties are called parameters. For example, sh:minCount is a parameter for the component that checks whether the focus node has at least a minimum number of values for a particular property. Validating a node against a constraint involves validating the node against each of its components.

Shape

A shape is represented by a node in a shapes graph that is typically a SHACL instance of sh:Shape. A shape provides a collection of targets, filters, constraints and parameters of constraint components that specify how a data graph is validated against the shape. Shapes can also provide non-validating information, such as labels and comments.

Target

A target is a triple or a node in the shapes graph that specifies which nodes in a data graph are validated against a shape. Validating a shape in a shapes graph involves validating the target nodes for all targets of the shape. SHACL provides several different kinds of targets, most notably the SHACL instances of a given class, specifically enumerated nodes, or all objects or subjects of a given predicate.

Focus Node

A node in the data graph that is validated against a shape is called a focus node.

Filter

A filter is a shape in the shapes graph that limits the nodes that are validated against the constraints of another shape. Only those nodes that validate against all the filters of a shape are validated against its constraints.

1.2 Document Conventions

Within this document, the following namespace prefix bindings are used:

Prefix	Namespace
`rdf:`	`http://www.w3.org/1999/02/22-rdf-syntax-ns#`
`rdfs:`	`http://www.w3.org/2000/01/rdf-schema#`
`sh:`	`http://www.w3.org/ns/shacl#`
`xsd:`	`http://www.w3.org/2001/XMLSchema#`
`ex:`	`http://example.com/ns#`

Note that the URI of the graph defining the SHACL vocabulary itself is equivalent to the namespace above, i.e. it includes the #. References to the SHACL vocabulary, e.g. via owl:imports SHOULD include the #.

Throughout the document, color-coded boxes containing RDF graphs in Turtle will appear. These fragments of Turtle documents use the prefix bindings given above.

# This box represents an input shapes graph

# Triples that can be omitted are marked as grey e.g.
<s> <p> <o> .

# This box represents an input data graph.
# When highlighting is used in the examples:

# Elements highlighted in blue are focus nodes that are
# selected by some target of a shape under discussion
# and validate against the shape's filters, if any.
ex:Bob a ex:Person .

# Elements highlighted in red are focus nodes that fail validation
ex:Alice a ex:Person .

# This box represents an output results graph

SHACL Definitions appear in blue boxes:

SPARQL or TEXTUAL DEFINITIONS

# This box contains SPARQL or textual definitions.

1.3 SHACL Example

The following example data graph contains eight total nodes, out of which three nodes are SHACL instances of the class ex:Person.

ex:Alice
	a ex:Person ;
	ex:child ex:Calvin ;
	ex:ssn "987-65-432A" .
  
ex:Bob
	a ex:Person ;
	ex:child ex:Calvin ;
	ex:ssn "123-45-6789" ;
	ex:ssn "124-35-6789" .
  
ex:Calvin
	a ex:Person ;
	ex:school ex:TrinityAnglicanSchool .

SHACL can be used to define the following example constraints:

A SHACL instance of ex:Person may have at most one value for the property ex:ssn, and this value must be a literal with the datatype xsd:string that matches a specified regular expression.
A SHACL instance of ex:Person may have unlimited values for the property ex:child, and these values must be IRIs and they must be SHACL instances of ex:Person.
A person's parents are represented using the property ex:child in the inverse direction. A SHACL instance of ex:Person may have at most 2 parents, i.e. may be the object of at most two triples where the predicate is ex:child.
A SHACL instance of ex:Person may not have values for any other property apart from ex:ssn, ex:child and rdf:type.

The constraints above can be represented using the following shapes graph:

ex:PersonShape
	a sh:Shape ;
	sh:targetClass ex:Person ;    # Applies to all persons
	sh:property [
		sh:predicate ex:ssn ;     # Constrains the values of the ex:ssn property
		sh:maxCount 1 ;
		sh:datatype xsd:string ;
		sh:pattern "^\\d{3}-\\d{2}-\\d{4}$" ;
	] ;
	sh:property [
		sh:predicate ex:child ;
		sh:class ex:Person ;
		sh:nodeKind sh:IRI ;
	] ;
	sh:property [
		rdfs:comment "A person's parents are represented via ex:child used in the inverse direction." ;
		sh:path [ sh:inversePath ex:child ] ;
		sh:name "parent" ;
		sh:maxCount 2 ;
	] ;
	sh:closed true ;
	sh:ignoredProperties ( rdf:type ) .

We can use the shape definition above to skim through some of the key terminology used by SHACL. The focus nodes for the shape ex:PersonShape are all SHACL instances of the class ex:Person. These focus nodes are the targets of the shape and are defined using the property sh:targetClass. The shape has three property constraints, linked to the shape using the property sh:property, one of which uses a path expression. The shape furthermore defines a constraint on the focus nodes themselves using the parameters sh:closed and sh:ignoredProperties.

Some of the property constraints specify multiple constraint components in order to restrict multiple aspects of the property values. For example, in the property constraint for ex:ssn, three constraint components are used. These constraint components are identified by their parameters sh:datatype, sh:pattern and sh:maxCount. For each focus node the property values of ex:ssn will be validated against all three components. The constraint on the inverse property values of sh:child has only one constraint component identified by the sh:maxCount parameter. Note that this constraint uses the non-validating property sh:name to suggest a human-readable name for the property when used in the inverse direction.

SHACL validation based on the provided data graph and shapes graph would produce the following validation results:

[	a sh:ValidationResult ;
	sh:sourceConstraintComponent sh:RegexConstraintComponent ;
	sh:sourceShape ex:PersonShape ;
	sh:focusNode ex:Alice ;
	sh:path ex:ssn ;
	sh:value "987-65-432A" ;
	sh:severity sh:Violation ;
] ;
[	a sh:ValidationResult ;
	sh:sourceConstraintComponent sh:MaxCountConstraintComponent ;
	sh:sourceShape ex:PersonShape ;
	sh:focusNode ex:Bob ;
	sh:path ex:ssn ;
	sh:severity sh:Violation ;
] ;
[	a sh:ValidationResult ;
	sh:sourceConstraintComponent sh:ClosedConstraintComponent ;
	sh:sourceShape ex:PersonShape ;
	sh:focusNode ex:Calvin ;
	sh:path ex:school ;
	sh:value ex:TrinityAnglicanSchool ;
	sh:severity sh:Violation ;
] .

The first validation result is produced because ex:Alice has a value for ex:ssn that does not match the regular expression specified by the property sh:regex. The second validation result is produced because ex:Bob has more than the permitted number of values for the property ex:ssn as specified by the sh:maxCount of 1. The third validation result is produced because the shape ex:PersonShape has the the property sh:closed set to true but ex:Calvin uses the property ex:school which is neither one of the predicates from any of the property constraints at the shape, nor one of the properties listed using sh:ignoredProperties.

1.4 Relationship between SHACL and RDFS inferencing

SHACL uses the RDF and RDFS vocabularies, but full RDFS inferencing is not required. However, SHACL processors MUST identify SHACL instances of a class both in the data graph and the shapes graph without mutating either graph during the validation process. Furthermore, SHACL processors may operate on RDF graphs that include entailments - either pre-computed before being submitted to a SHACL processor or performed on the fly as part of SHACL processing. To support processing of entailments, SHACL includes the property sh:entailment to indicate what inferencing is required by a given shapes graph. SHACL implementations may, but are not required to, support entailment regimes.

1.5 Relationship between SHACL and SPARQL

This specification uses parts of SPARQL 1.1 in the normative definition of the semantics of the SHACL Core constraints and targets. However, SPARQL is not required for the implementation of the SHACL Core language.

SPARQL variables using the $ marker represent external values that must be pre-bound or substituted in the SPARQL query before execution.

In some places, the specification assumes that the provided SPARQL engines are preserving the identity of blank nodes, so that repeated invocations of queries consistently identify and communicate the same blank nodes.

The definition of some constraints requires or is simplified through access to the shapes graph during query execution. SHACL validation engines MAY pre-bind the variable $shapesGraph to provide access to the shapes graph. Access to the shapes graph is not a requirement for supporting the SHACL Core language. The variable $shapesGraph can also be used in user-defined SPARQL constraints and SPARQL-based constraint components. However, such constraints may not be interoperable across different SHACL validation engines or not applicable to remote RDF datasets.

Some SHACL constraints are defined with the use of the sh:hasShape function. SHACL additionally introduces mechanisms to define constraints, targets, derived values and new functions in SPARQL. Implementations that cover only the the SHACL Core features are not required to implement these mechanisms or the sh:hasShape function.

The button below can be used to show or hide the SPARQL definitions.

1.6 Conformance

As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.

The key words MAY, MUST, MUST NOT, and SHOULD are to be interpreted as described in [RFC2119].

TODO: We still need to mark non-normative sections.

2. Shapes

Shapes are SHACL instances of sh:Shape and define constraints that a set of focus nodes can be validated against. The set of focus nodes for a shape may be defined explicitly in a shape using targets and filters. The focus nodes may also be determined as part of the validation of constraints that include references to shapes using properties such as sh:shape and sh:or. Shapes can also provide non-validating information, such as labels and names. The following figure depicts a workflow of the targeting and filtering process. SHACL validation engines MAY alter the order of the depicted steps as long as the returned validation results are correct.

Fig. 1 Illustration of the targeting and filtering process

2.1 Targets

Targets specify which nodes in the data graph are validated against a shape. While targets define the starting points of the validation process, some shapes may validate a) less target nodes when a shapes defines filters or b) additional nodes against different shapes, for example referenced via sh:shape and sh:or.

The Core of SHACL includes four Core target types: node targets, class-based targets, subjects-of targets, and objects-of targets.

The SHACL language additionally defines an advanced general target mechanism based on SPARQL.

When multiple targets are provided in a shape, the target of a shape is the union of all nodes produced by these individual targets. Nodes specified by targets are not required to exist in the data graph.

2.1.1 Node targets (sh:targetNode)

A node target is defined with the sh:targetNode predicate. The values of sh:targetNode can be IRIs or literals. Each value of a node target defines a node to validate in the data graph.

With the example data below, only ex:Alice is the target of the provided shape:

ex:PersonShape
	a sh:Shape ;
	sh:targetNode ex:Alice .

ex:Alice a ex:Person .
ex:Bob a ex:Person .

The following SPARQL query specifies the semantics of node targets. The variable $targetNode is assumed to be pre-bound to the given value of sh:targetNode.

SPARQL DEFINITION

SELECT DISTINCT ?this
WHERE {
	BIND ($targetNode AS ?this)
}

2.1.2 Class-based Targets (sh:targetClass)

A class target is defined with the sh:targetClass predicate. Each value of sh:targetClass must be an IRI that is expected to be a SHACL class. For every value c of a class target, all SHACL instances of c in the data graph are validated against the subject of the sh:targetClass triple.

ex:PersonShape
	a sh:Shape ;
	sh:targetClass ex:Person .

ex:Alice a ex:Person .
ex:Bob a ex:Person .
ex:NewYork a ex:Place .

In this example, only ex:Alice and ex:Bob are validated. Note that, according to the SHACL instance definition, all the rdfs:subClassOf declarations needed to walk the class hierarchy must exist in the data graph. However, the ex:Person a rdfs:Class triple is not required to exist in either graphs.

In the following example, the selected target node is only ex:Who.

ex:Doctor rdfs:subClassOf ex:Person .
ex:Who a ex:Doctor .
ex:House a ex:Nephrologist .

The following SPARQL query specifies the semantics of class targets. The variable $targetClass is assumed to be pre-bound to the given value of sh:targetClass.

SPARQL DEFINITION

SELECT DISTINCT ?this
WHERE {
	?this rdf:type/rdfs:subClassOf* $targetClass
}

2.1.2.1 Implicit Class Targets

When, in the shapes graph, a shape is a SHACL instance of both sh:Shape and rdfs:Class then the shape is a class target of itself.

ex:Person
	a rdfs:Class, sh:Shape .

ex:Alice a ex:Person .

In this example, only ex:Alice is validated, because it is a SHACL instance of ex:Person which is both a class and a shape in the shapes graph.

2.1.3 Subjects-of targets (sh:targetSubjectsOf)

A subjects-of target is defined with the predicate sh:targetSubjectsOf, the values of which must be IRIs. For every value p of such a target, the validated nodes are defined as the set of subjects in the data graph that appear in a triple with p as a predicate.

In other words, this type of target can be used to state that a shape applies to all nodes in the data graph that have at least one value of the given property.

ex:TargetSubjectsOfExampleShape
	a sh:Shape ;
	sh:targetSubjectsOf ex:knows .

ex:Alice ex:knows ex:Bob .
ex:Bob ex:livesIn ex:NewYork .

In the example above, only ex:Alice is validated against the given shape, because it is the subject of a triple that has ex:knows as its predicate.

The following SPARQL query specifies the semantics of subjects-of targets. The variable $targetSubjectsOf is assumed to be pre-bound to the given value of sh:targetSubjectsOf.

SPARQL DEFINITION

SELECT DISTINCT ?this
WHERE {
	?this $targetSubjectsOf ?any .
}

2.1.4 Objects-of targets (sh:targetObjectsOf)

An objects-of target is defined with the predicate sh:targetObjectsOf, the values of which must be IRIs. For every value p of such a target, the validated nodes are defined as the set of objects in the data graph that appear in a triple with p as a predicate.

ex:TargetObjectsOfExampleShape
	a sh:Shape ;
	sh:targetObjectsOf ex:knows .

ex:Alice ex:knows ex:Bob .
ex:Bob ex:livesIn ex:NewYork .

In the example above, only ex:Bob is validated against the given shape, because it is the object of a triple that has ex:knows as its predicate.

The following SPARQL query specifies the semantics of objects-of targets. The variable $targetObjectsOf is assumed to be pre-bound to the given value of sh:targetObjectsOf.

SPARQL DEFINITION

SELECT DISTINCT ?this
WHERE {
	?any $targetObjectsOf ?this .
}

2.2 Filter Shapes

A filter is a shape in a shapes graph that can be used to limit the nodes that are validated against a given constraint or shape. Only those nodes that validate against all the filters are validated. A filter is specified with the sh:filterShape predicate.

The following example states that the sh:minCount constraint on ex:email is filtered to include only SHACL instances of ex:Person that are ex:members of ex:W3c.

ex:ExampleFilteredShape
	a sh:Shape ;
	sh:targetClass ex:Person ;
	sh:filterShape [
		a sh:Shape ; # Optional triple
		sh:property [
			sh:predicate ex:member ;
			sh:hasValue ex:W3c ;
		]
	] ;
	sh:property [
		sh:predicate ex:email ;
		sh:minCount 1 ;
	] .

ex:Alice a ex:Person ;
	ex:member ex:W3c ;
	ex:email <mailto:alice@example.org> .
ex:John a ex:Person ;
	ex:member ex:W3c .
ex:Bob a ex:Person ;
	ex:member ex:Acme .

[  a sh:ValidationResult ;
	sh:severity sh:Violation ;
	sh:focusNode ex:John ;
	sh:path ex:email ;
	sh:message "sh:minCount for ex:email is '1'." ;
	sh:sourceConstraintComponent sh:MinCountConstraintComponent ;
	sh:sourceShape ex:ExampleFilteredShape ; 
] .

The following example shows a sh:filterShape that is defined on a specific property constraint, instead of the whole shape. In this scenario, the sh:minCount constraint is only applied to persons that are also member of ex:W3c.

ex:FilteredExampleShape
	a sh:Shape ;
	sh:targetClass ex:Person ;
	sh:property [
		sh:predicate ex:email ;
		sh:minCount 1 ;
		sh:filterShape [
			sh:property [
				sh:predicate ex:member ;
				sh:hasValue ex:W3c ;
			]
		] ;
	] .

Filter shapes MUST be evaluated before validating the associated shapes or constraints. This includes scenarios such as sh:shape where a shape is explicitly referenced by another constraint. However, during the validation of a shape referenced via sh:shape, the declared targets of these shapes are not used to limit the set of focus nodes.

2.3 Constraints

Constraints are defined within a shape and sh:Constraint is the SHACL superclass of all constraint types. The SHACL Core language defines two types of constraints: a) constraints about a particular property or path and its values for the focus node (property constraints) and b) constraints about the focus node itself (focus node constraints).

Additional types of constraints can be added using the extension mechanism, as SPARQL-based constraints or SPARQL-based constraint components.

Constraints may contain non-validating properties (such as sh:description) or parameters of constraint components (e.g. sh:minCount). Constraint components define one or more parameter properties and validation instructions (such as those implemented as SPARQL queries) that can be used to perform the validation for the given focus node and parameter values. The catalog of constraint components in the Core of SHACL is defined in section 4.

2.3.1 Property Constraints (sh:predicate or sh:path)

Property constraints specify conditions that must be met with respect to nodes that can be reached from the focus node either by directly following a given property (specified using sh:predicate) or a given property path (specified using sh:path). Property constraints are linked from a shape with the property sh:property. The values of sh:property must be IRIs or blank nodes that are the subject of precisely one triple with either predicate sh:predicate or sh:path. The values of sh:predicate must be IRIs. The values of sh:path must be well-formed property paths following the SHACL property path syntax rules.

The following example illustrates the two syntax variations of property constraints.

ex:ExampleShapeWithPropertyConstraints
	a sh:Shape ;
	sh:property [
		sh:predicate ex:email ;
		sh:name "e-mail" ;
		sh:description "We need at least one email value" ;
		sh:minCount 1 ;
	] ;
    sh:property [
		sh:path (ex:knows ex:email) ;
		sh:name "Friend's e-mail" ;
		sh:description "We need at least one email for everyone you know" ;
		sh:minCount 1 ;
	] .

sh:PropertyConstraint is the class of property constraints. A SHACL validation engine treats all values of sh:property as property constraints. Thus, the values of sh:property do not require the rdf:type sh:PropertyConstraint triple.

2.3.2 Focus Node Constraints

Focus node constraints define constraints about the focus node itself. Since focus node constraints operate directly on the input focus nodes they impose some limitations in comparison to property constraints. In particular, constraints that operate on value sets, such as sh:hasValue and sh:equals, are not applicable to focus node constraints. Focus node constraints are attached directly to Shapes and sh:shape can be used to group focus node constraints.

ex:ExampleFocusNodeConstraint
	a sh:Shape ;
	sh:targetClass ex:Person ;
	sh:stem "https://www.w3.org/People/" .

sh:Shape is defined as rdfs:subClassOf sh:Constraint.

2.3.3 Multiple Parameters

Parameters of constraint components that only declare one parameter, such as sh:class, may be used multiple times within the same constraint node. The interpretation of such declarations is conjunction, i.e. all constraints must validate. Constraints that declare more than one parameters, such as sh:pattern, are not allowed to be declared more than once in the same constraint. In the following example this technique is used to restrict the values of a property to be SHACL instances of both ex:Customer and ex:MalePerson.

ex:ShapeWithTwoClasses
	a sh:Shape ;
	sh:property [
		sh:predicate ex:someProperty ;
		sh:class ex:Customer ;
		sh:class ex:MalePerson ;
	] .

2.3.4 Property Paths

SHACL includes an RDF vocabulary to represent property paths that can be mapped into a subset of SPARQL 1.1 property paths. In particular, SHACL supports the following SPARQL 1.1 property path constructs: PredicatePath, InversePath, SequencePath, AlternativePath, ZeroOrMorePath, OneOrMorePath and ZeroOrOnePath. A valid SHACL property path p is represented by an IRI or a blank node that can be correctly traversed recursively using the following rules.

If the graph contains a triple of the form p sh:inversePath elt then the path becomes an InversePath element of elt and elt must be a valid SHACL property path. Corresponding rules apply for sh:zeroOrMorePath (ZeroOrMorePath), sh:zeroOrOnePath (ZeroOrOnePath) and sh:oneOrMorePath (OneOrMorePath).
If the graph contains a triple of the form p rdf:first elt, the path must be a well-formed RDF list with a at least two members. Each member must be valid SHACL property path that is converted into a sequence of either SequencePath or AlternativePath elements.
If the graph contains a triple of the form p sh:alternativePath elt, the value of path must be a well-formed RDF list with a at least two members. All members must be valid SHACL property paths that become a series of AlternativePath elements.
If p is an IRI then it is turned into a PredicatePath with value p.

A SHACL property path is invalid if:

p is a blank node that is not handled by any of the above rules.
During the path traversal the same node is reached more than once.
There is a value for p with a predicate different from: sh:inversePath, sh:alternativePath, sh:zeroOrMorePath, sh:zeroOrOnePath, sh:oneOrMorePath or rdf:first.
There are two or more triples with p as a subject.

The following example illustrates some valid SHACL property paths, together with their SPARQL 1.1 equivalents.

# ^ex:parent
[ sh:inversePath ex:parent ] .

# ex:parent/ex:firstName
( ex:parent ex:firstName ) .
					
# rdf:type/rdfs:subClassOf*
( rdf:type [ sh:zeroOrMorePath rdfs:subClassOf ] ) .

# ex:father|ex:mother
[ sh:alternativePath ( ex:father ex:mother  ) ] .

Severity	Description
`sh:Info`	An informative message, not a violation.
`sh:Warning`	A non-critical constraint violation indicating a warning.
`sh:Violation`	A constraint violation that should be fixed.

4. Core Constraint Components

This sections defines the built-in SHACL constraint components that MUST be supported by all SHACL validation engines. Each constraint component is identified by an IRI that is referenced in the validation results via sh:sourceConstraintComponent.

The choice of constraint components that are defined by the SHACL Core was made based on the requirements collected by the [shacl-ucr] document. Special attention was paid to the balance between trying to cover as many common use cases as possible and keeping the size of the Core language manageable. Not all use cases (such as describing constraints on members of an rdf:List) can be expressed by the Core language alone. Instead, SHACL provides an extension mechanism, described in the second part of this specification. It is expected that additional reusable libraries of constraint components will be maintained by third parties.

All constraint components can be used both in property constraints and focus node constraints. However, some components might always fail in a particular constraint type. For example, sh:closed does not make sense in property constraints or sh:hasValue in focus node constraints.

The textual description of each component refers to the concept of value nodes which is defined as follows, including rules for the creation of validation results:

For property constraints that have a sh:predicate the value nodes are the objects of the triples that have the focus node as subject and the given property as predicate. Each produced validation result must have the focus node as its sh:focusNode, the sh:predicate as its sh:path and, if applicable, the respective violating value node as its sh:value.
For property constraints that have a sh:path the value nodes are the objects in the data graph that can be reached by following the given property path starting with the focus node as subject based on the evaluation rules defined by SPARQL 1.1. Each produced validation result must have the focus node as its sh:focusNode, a deep copy of sh:path as its sh:path and, if applicable, the respective violating value node as its sh:value.
For focus node constraints the value nodes are the individual focus nodes, forming a set of exactly one node.

4.1 SPARQL definitions for built-in constraint components

The SPARQL definitions in this section represent potential validators. Many constraint components are represented by a SPARQL ASK query. These queries are interpreted against each value node, bound to the variable $value. If a value node fails the ASK query, a validation result is produced based on the rules outlined in the section on ASK-based validators. Constraint components that are described using a SELECT query are interpreted based on the rules outlined in the section on SELECT-based validators. In particular, for sh:PropertyConstraints, the variable $PATH is substituted with a path expression based on the values of either sh:predicate or sh:path in the constraint. All SPARQL queries also assume the variable bindings and result variable mapping rules detailed in the section on SPARQL-based Constraints. In a nutshell, the variable $this represents the currently validated focus node.

Note that the parameter tables in each of the following sections have a column called Value Type which indicates the expected type of the parameter values for documentation purposes, without enforcing any formal restrictions.

Some SPARQL definitions in this section also assume the existence of a built-in SPARQL function sh:hasShape.

4.2 Value Type Constraint Components

The constraint components in this section have in common that they define restrictions on the type of the nodes. Note that it is possible to represent multiple value type options using sh:or.

4.2.1 sh:class

The property sh:class can be used to verify that each value node is a SHACL instance of a given type.

Constraint Component: sh:ClassConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:class`	`rdfs:Resource`	Type of all values

TEXTUAL DEFINITION

A validation result must be produced for each value node that is either a literal, or a non-literal that is not a SHACL instance of the given class in the data graph.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	$value rdf:type/rdfs:subClassOf* $class .
}

ex:ClassExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice, ex:Carol ;
	sh:property [
		sh:predicate ex:knows ;
		sh:class ex:Person ;
	] .

ex:Alice a ex:Person .
ex:Bob ex:knows ex:Alice .
ex:Carol ex:knows ex:Bob .

4.2.2 sh:datatype

The property sh:datatype can be used to restrict the datatype of all value nodes. The values of sh:datatype must be resources representing datatypes, such as xsd:string.

Constraint Component: sh:DatatypeConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:datatype`	`rdfs:Resource`	Datatype of all value nodes (e.g., `xsd:integer`)

TEXTUAL DEFINITION

A validation result must be produced for each value node that is not a literal, or is a literal with a mismatching datatype. A literal matches a datatype if the literal's datatype has the same IRI.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER (datatype($value) = $datatype) .
}

ex:DatatypeExampleShape
	a sh:Shape ;
	sh:targetNode ex:Alice, ex:Bob ;
	sh:property [
		sh:predicate ex:age ;
		sh:datatype xsd:integer ;
	] .

ex:Alice ex:age "23"^^xsd:integer .
ex:Bob ex:age "twenty two" .

4.2.3 sh:nodeKind

The property sh:nodeKind is used to restrict the RDF node kind of each value node.

Constraint Component: sh:NodeKindConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:nodeKind`	`sh:NodeKind`	Node kind (IRI, blank node, literal or combinations of these) of all value nodes

The values of sh:nodeKind must be one of the following six instances of the class sh:NodeKind: sh:BlankNode, sh:IRI and sh:Literal as well as the following values that represent combinations of the former three, i.e. either-or: sh:BlankNodeOrIRI, sh:BlankNodeOrLiteral and sh:IRIOrLiteral.

TEXTUAL DEFINITION

A validation result must be produced for each value node that does not match the given node kind.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER ((isIRI($value) && $nodeKind IN ( sh:IRI, sh:BlankNodeOrIRI, sh:IRIOrLiteral ) ) ||
		(isLiteral($value) && $nodeKind IN ( sh:Literal, sh:BlankNodeOrLiteral, sh:IRIOrLiteral ) ) ||
		(isBlank($value)   && $nodeKind IN ( sh:BlankNode, sh:BlankNodeOrIRI, sh:BlankNodeOrLiteral ) )) .
}

ex:NodeKindExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice ;
	sh:property [
		sh:predicate ex:knows ;
		sh:nodeKind ex:IRI ;
	] .

ex:Bob ex:knows ex:Alice .
ex:Alice ex:knows "Bob" .

4.3 Cardinality Constraint Components

The constraint components in this section can be applied to a property constraint, to represent restrictions on the number of values that the focus node may have for the given property or property path.

4.3.1 sh:minCount

The property sh:minCount restricts the number of value nodes.

Constraint Component: sh:MinCountConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:minCount`	`xsd:integer`	The minimum cardinality. If the value is 0 then this constraint is always satisfied and so may be omitted.

TEXTUAL DEFINITION

A validation result must be produced if the number of value nodes is less than the value of sh:minCount.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this
WHERE {
	OPTIONAL {
		$this $PATH ?value .
	}
} 
GROUP BY $this
HAVING (COUNT(?value) < $minCount)

ex:MinCountExampleShape
	a sh:Shape ;
	sh:targetNode ex:Alice, ex:Bob ;
	sh:property [
		sh:predicate ex:name ;
		sh:minCount 1 ;
	] .

ex:Alice ex:name "Alice" .
ex:Bob ex:givenName "Bob"@en .

4.3.2 sh:maxCount

The property sh:maxCount restricts the number of value nodes.

Constraint Component: sh:MaxCountConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:maxCount`	`xsd:integer`	The maximum cardinality. If this parameter is omitted then there is no limit on the number of triples.

TEXTUAL DEFINITION

A validation result must be produced if the number of value nodes is greater than the value of sh:maxCount.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this
WHERE {
	$this $PATH ?value .
}
GROUP BY $this
HAVING (COUNT(?value) > $maxCount)

ex:MaxCountExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob ;
	sh:property [
		sh:predicate ex:birthDate ;
		sh:maxCount 1 ;
	] .

ex:Bob ex:birthDate "May 5th 1990" .

4.4 Value Range Constraint Components

The following constraint components represent range restrictions on nodes that are comparable via operators such as < and >.

4.4.1 sh:minExclusive, sh:minInclusive, sh:maxExclusive, sh:maxInclusive

The properties from the following table restrict the range of value nodes. The supported datatypes of these nodes are xsd:string, xsd:boolean, xsd:dateTime and all numeric datatypes such as xsd:integer.

Constraint Components: sh:MinExclusiveConstraintComponent, sh:MinInclusiveConstraintComponent, sh:MaxExclusiveConstraintComponent, sh:MaxInclusiveConstraintComponent

Parameters:

Property	Value Type	Summary	Definition
`sh:minExclusive`	(supported datatypes)	The minimum exclusive value	`<`
`sh:minInclusive`	(supported datatypes)	The minimum inclusive value	`<=`
`sh:maxExclusive`	(supported datatypes)	The maximum exclusive value	`>`
`sh:maxInclusive`	(supported datatypes)	The maximum inclusive value	`>=`

TEXTUAL DEFINITION

A validation result must be produced for each value node that does not match the literal range specified by the table above, using the semantics of the SPARQL operators <, <=, > and >=. A validation result must also be produced if the node cannot be compared to the specified range.

Note that if the comparison cannot be performed, for example when someone compares a string with an integer, then the validation engine will produce a validation result. This is different from, say, a plain SPARQL query, in which such failures would silently not lead to any results.

The following SPARQL definition covers sh:minExclusive - the other variations can be derived by replacing the < operator and the $minExclusive variable.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER ($minExclusive < $value)
}

ex:NumericRangeExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice, ex:Ted ;
	sh:property [
		sh:predicate ex:age ;
		sh:minInclusive 0 ;
		sh:maxInclusive 150 ;
	] .

ex:Bob ex:age 23 .
ex:Alice ex:age 220 .
ex:Ted ex:age "twenty one" .

4.5 String-based Constraint Components

The constraint components in this section have in common that they are representing restrictions on the string representation of certain nodes.

4.5.1 sh:minLength

The property sh:minLength restricts the string length of value nodes. This can be applied to any literals and IRIs, but not to blank nodes.

Constraint Component: sh:MinLengthConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:minLength`	`xsd:integer`	The minimum length. If the value is 0 then there is no restriction on the string length but this constraint is still violated if the node is a blank node.

TEXTUAL DEFINITION

A validation result must be produced for each value node where the length of its string representation (as defined by the SPARQL str function) is less than the specified minimum length, or if the node is a blank node.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER (STRLEN(str($value)) >= $minLength) .
}

4.5.2 sh:maxLength

The property sh:maxLength restricts the string length of value nodes This can be applied to any literals and IRIs, but not to blank nodes.

Constraint Component: sh:MaxLengthConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:maxLength`	`xsd:integer`	The maximum length. If this constraint is omitted then there is no restriction on the string length and no requirement that the node is a literal or IRI.

TEXTUAL DEFINITION

A validation result must be produced for each value node where the length of its string representation (as defined by the SPARQL str function) is greater than the specified maximum length, or if the node is a blank node.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER (STRLEN(str($value)) <= $maxLength) .
}

ex:PasswordExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice ;
	sh:property [
		sh:predicate ex:password ;
		sh:minLength 8 ;
		sh:maxLength 10 ;
	] .

ex:Bob ex:password "123456789" .
ex:Alice ex:password "1234567890ABC" .

4.5.3 sh:pattern

The property sh:pattern can be used to validate whether all value nodes match a given regular expression. The values of sh:pattern must be valid pattern arguments for the SPARQL REGEX function.

Constraint Component: sh:PatternConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:pattern`	`xsd:string`	Regular expression that all value nodes must match
`sh:flags`	`xsd:string` (optional)	An optional string of flags, interpreted as in SPARQL 1.1 REGEX

TEXTUAL DEFINITION

A validation result must be produced for each value node that is a blank node or where the string representation (as defined by the SPARQL str function) does not match the given regular expression (as defined by the SPARQL REGEX function). If sh:flags is present then this must be interpreted according to the third argument of the SPARQL REGEX function.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER (!isBlank($value) && IF(bound($flags), regex(str($value), $pattern, $flags), regex(str($value), $pattern)))
}

ex:PatternExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice, ex:Carol ;
	sh:property [
		sh:predicate ex:bCode ;
		sh:pattern "^B" ;    # starts with 'B'
		sh:flags "i" ;       # Ignore case
	] .

ex:Bob ex:bCode "b101" .
ex:Alice ex:bCode "B102" .
ex:Carol ex:bCode "C103" .

4.5.4 sh:stem

The property sh:stem validates whether all value nodes are IRIs and the IRI starts with a given string value.

Constraint Component: sh:StemConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:stem`	`xsd:string`	String value that an IRI must start with

TEXTUAL DEFINITION

A validation result must be produced for each value node that is not an IRI or the string representation of the IRI does not start with the given string.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	FILTER (isIRI($value) && STRSTARTS(str($value), $stem))
}

ex:StemExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob, ex:Alice, ex:Carol ;
	sh:property [
		sh:predicate ex:w3cHomepage ;
		sh:stem "https://www.w3.org/People/" ;
	] .

ex:Alice ex:w3cHomepage <https://www.w3.org/People/Alice> .
ex:Bob ex:w3cHomepage <https://example.com/People/Bob> .
ex:Carol ex:w3cHomepage "https://www.w3.org/People/Carol" .

4.5.5 sh:uniqueLang

The property sh:uniqueLang can be set to true to specify that no pair of value nodes may use the same language tag. The values of sh:uniqueLang must be xsd:booleans.

Constraint Component: sh:UniqueLangConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:uniqueLang`	`xsd:boolean`	`true` to activate this constraint

TEXTUAL DEFINITION

If sh:uniqueLang is set to true then a validation result must be produced for each non-empty language tag that is used by at least two value nodes.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT DISTINCT $this ?lang
WHERE {
	{
		FILTER ($uniqueLang) .
	}
	$this $PATH ?value .
	BIND (lang(?value) AS ?lang) .
	FILTER (bound(?lang) && ?lang != "") . 
	FILTER EXISTS {
		$this $PATH ?otherValue .
		FILTER (?otherValue != ?value && ?lang = lang(?otherValue)) .
	}
}

ex:UniqueLangExampleShape
	a sh:Shape ;
	sh:targetNode ex:Alice, ex:Bob ;
	sh:property [
		sh:predicate ex:label ;
		sh:uniqueLang true ;
	] .

ex:Alice
	ex:label "Alice" ;
	ex:label "Alice"@en ;
	ex:label "Alice"@fr .

ex:Bob
	ex:label "Bob"@en ;
	ex:label "Bobby"@en .

4.6 Property Pair Constraint Components

The constraint components in this section restrict the sets of values represented by the sh:predicate used in the property constraint, and another property that is specified as the value of the respective parameter such as sh:equals.

4.6.1 sh:equals

sh:equals constrains a pair of properties so that the sets of values of both properties at a given focus node must be equal.

Constraint Component: sh:EqualsConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:equals`	`rdfs:Resource`	Property to compare with

TEXTUAL DEFINITION

A validation result must be produced for each value node that does not exist as value at the focus node of the property specified using sh:equals and for each value of the property specified using sh:equals that does not exist as value node.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this ?value
WHERE {
	{
		$this $PATH ?value .
		MINUS {
			$this $equals ?value .
		}
	}
	UNION
	{
		$this $equals ?value .
		MINUS {
			$this $PATH ?value .
		}
	}
}

The following example illustrates the use of sh:equals in a shape to verify that certain nodes must have the same set of values for ex:firstName and ex:givenName.

ex:EqualExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob ;
	sh:property [
		sh:predicate ex:firstName ;
		sh:equals ex:givenName ;
	] .

ex:Bob
	ex:firstName "Bob" ;
	ex:givenName "Bob" .

4.6.2 sh:disjoint

sh:disjoint constrains a pair of properties so that the sets of values of both properties at a given focus node must not share any nodes.

Constraint Component: sh:DisjointConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:disjoint`	`rdfs:Resource`	The property to compare the values with

TEXTUAL DEFINITION

A validation result must be produced for each value node that also exists as value of the property specified using sh:disjoint at the given focus node.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this ?value
WHERE {
	$this $PATH ?value .
	$this $disjoint ?value .
}

The following example illustrates the use of sh:disjoint in a shape to verify that certain nodes must not share any values for ex:prefLabel and ex:altLabel.

ex:DisjointExampleShape
	a sh:Shape ;
	sh:targetNode ex:USA, ex:Germany ;
	sh:property [
		sh:predicate ex:prefLabel ;
		sh:disjoint ex:altLabel ;
	] .

ex:USA
	ex:prefLabel "USA" ;
	ex:altLabel "United States" .

ex:Germany
	ex:prefLabel "Germany" ;
	ex:altLabel "Germany" .

4.6.3 sh:lessThan

sh:lessThan constrains a pair of properties so that the values of the first property must be smaller than the values of the second property at a given focus node.

Constraint Component: sh:LessThanConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:lessThan`	`rdfs:Resource`	The property to compare the values with

TEXTUAL DEFINITION

A validation result must be produced for each pair of value nodes and the values of the property specified using sh:lessThan at the given focus node, where the first value is not less than the second value, based on SPARQL's < operator. A validation result must also be produced if the two values cannot be compared.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this ?value
WHERE {
	$this $PATH ?value .
	$this $lessThan ?otherValue .
	FILTER (!(?value < ?otherValue)) .
}

TODO: Decide what should happen if values are not comparable, i.e. < fails, similar to minExclusive etc.

The following example illustrates the use of sh:lessThan in a shape to verify that all values of ex:startDate must be "before" the values of ex:endDate.

ex:LessThanExampleShape
	a sh:Shape ;
	sh:property [
		sh:predicate ex:startDate ;
		sh:lessThan ex:endDate ;
	] .

4.6.4 sh:lessThanOrEquals

sh:lessThanOrEquals constrains a pair of properties so that the values of the first property must be smaller than or equal to the values of the second property at a given focus node.

Constraint Component: sh:LessThanOrEqualsConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:lessThanOrEquals`	`rdfs:Resource`	The property to compare the values with

TEXTUAL DEFINITION

A validation result must be produced for each pair of value nodes and the values of the property specified using sh:lessThanOrEquals at the given focus node, where the first value is not less than or equal to the second value, based on SPARQL's <= operator. A validation result must also be produced if the two values cannot be compared.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this ?value
WHERE {
	$this $PATH ?value .
	$this $lessThan ?otherValue .
	FILTER (!(?value <= ?otherValue)) .
}

4.7 Logical Constraint Components

The constraint components in this section implement the common logical operators and, or and not.

4.7.1 sh:not

SHACL supports a negation constraint component that can be used to verify that a value node does not have a given shape. This is comparable to a logical "not" operator.

Constraint Component: sh:NotConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:not`	`sh:Shape`	The shape to negate

TEXTUAL DEFINITION

A validation result must be produced for each value node that produces no validation results with severity sh:Violation for the shape given via sh:not. A failure must be reported if the validation of the shape produces a failure.

SPARQL DEFINITION (for shapes, must return no results)

SELECT $this ?failure
WHERE {
	BIND (sh:hasShape($this, $not, $shapesGraph) AS ?hasShape) .
	BIND (!bound(?hasShape) AS ?failure) .
	FILTER (?failure || ?hasShape) .
}

SPARQL DEFINITION (for property constraints, must return no results for the given $PATH)

SELECT $this ?value ?failure
WHERE {
	$this $PATH ?value .
	BIND (sh:hasShape($this, $not, $shapesGraph) AS ?hasShape) .
	BIND (!bound(?hasShape) AS ?failure) .
	FILTER (?failure || ?hasShape) .
}

The following example illustrates the use of sh:not in a shape to verify that certain nodes cannot have any value of ex:property.

ex:NotExampleShape
	a sh:Shape ;
	sh:targetNode ex:InvalidInstance1 ;
	sh:not [
		a sh:Shape ;
		sh:property [
			sh:predicate ex:property ;
			sh:minCount 1 ;
		] ;
	] .

ex:InvalidInstance1 ex:property "Some value" .

4.7.2 sh:and

SHACL supports a conjunctive constraint component that can be used to test whether a value node has all out of several shapes. This is comparable to a logical "and" operator.

Constraint Component: sh:AndConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:and`	`rdf:List` (members: `sh:Shape`)	List of shapes to validate the value nodes against

TEXTUAL DEFINITION

A validation result must be produced for each value node if the following condition is false: The validation of the value node against all of the shapes in the sh:and list produces a validation result with severity sh:Violation for at least one shape. A failure must be produced if the validation of one of the shapes produces a failure.

Note that although sh:and has an rdf:List of shapes as its value, the order of those shapes does not impact the validation results.

The following example illustrates the use of sh:and in a shape to verify that certain nodes have exactly one value of ex:property. This is achieved via the conjunction of a separate named shape (ex:SuperShape) which defines the minimum count, and a blank node shape that further constrains the maximum count. As shown here, sh:and can be used to implement a specialization mechanism between shapes.

ex:SuperShape
	a sh:Shape ;
	sh:property [
		sh:predicate ex:property ;
		sh:minCount 1 ;
	] .

ex:ExampleAndShape
	a sh:Shape ;
	sh:targetNode ex:ValidInstance, ex:InvalidInstance ;
	sh:and (
		ex:SuperShape
		[
			a sh:Shape ;
			sh:property [
				sh:predicate ex:property ;
				sh:maxCount 1 ;
			]
		]
	) .

ex:ValidInstance
	ex:property "One" .

# Invalid: more than one property
ex:InvalidInstance
	ex:property "One" ;
	ex:property "Two" .

4.7.3 sh:or

SHACL supports a high-level syntax for disjunctive constraints that can be used to test whether a value node has at least one out of several shapes. This is comparable to a logical "or" operator.

Constraint Component: sh:OrConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:or`	`rdf:List` (members: `sh:Shape`)	List of shapes to validate the value nodes against

TEXTUAL DEFINITION

A validation result must be produced for each value node if the following condition is false: The validation of the value node against all of the shapes in the sh:or list produces no validation results with severity sh:Violation for at least one shape. A failure must be produced if the validation of one of the shapes produces a failure.

Note that although sh:or has an rdf:List of shapes as its value, the order of those shapes does not impact the validation results.

The following example illustrates the use of sh:or in a shape to verify that certain nodes have at least one value of ex:firstName or at least one value of ex:givenName.

ex:OrConstraintExampleShape
	a sh:Shape ;
	sh:targetNode ex:Bob ;
	sh:or (
		[
			sh:property [
				sh:predicate ex:firstName ;
				sh:minCount 1 ;
			]
		]
		[
			sh:property [
				sh:predicate ex:givenName ;
				sh:minCount 1 ;
			]
		]
	) .

ex:Bob ex:firstName "Robert" .

The next example shows how sh:or can be used in a property constraint to state that the values of the given property ex:address may be either literals with datatype xsd:string or SHACL instances of the class ex:Address.

ex:PersonAddressShape
	a sh:Shape ;
	sh:targetClass ex:Person ;
	sh:property [
		sh:predicate ex:address ;
		sh:or (
			[
				sh:datatype xsd:string ;
			]
			[
				sh:class ex:Address ;
			]
		)
	] .

ex:Bob ex:address "123 Prinzengasse, Vaduz, Liechtenstein" .

4.8 Shape-based Constraint Components

The constraint components in this section can be used to represent complex restrictions based on applying shape definitions on value nodes.

4.8.1 sh:shape

The property sh:shape can be used verify that all value nodes must have a given shape. The value type of sh:shape is sh:Shape, but the rdf:type triple of those shapes can be omitted.

Constraint Component: sh:ShapeConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:shape`	`sh:Shape`	The required shape of all value nodes

TEXTUAL DEFINITION

A validation result must be produced for each value node where validating the value node against the shape specified by sh:shape produces any validation results with severity sh:Violation. A failure must be produced if the validation of any value node has produced a failure.

SPARQL DEFINITION (for shapes, must return no results)

SELECT $this ?failure
WHERE {
	BIND (sh:hasShape($this, $shape, $shapesGraph) AS ?hasShape) .
	BIND (!bound(?hasShape) AS ?failure) .
	FILTER (?failure || !?hasShape) .
}

SPARQL DEFINITION (for property constraints, must return no results for the given $PATH)

SELECT $this ?value ?failure
WHERE {
	$this $PATH ?value .
	BIND (sh:hasShape(?value, $shape, $shapesGraph) AS ?hasShape) .
	BIND (!bound(?hasShape) AS ?failure) .
	FILTER (?failure || !?hasShape) .
}

A shape may refer to itself directly or indirectly via sh:shape, sh:filterShape, etc. Such a shape is said to be recursive. The meaning of non-recursive shapes is always well-founded. In contrast, the meaning of a recursive shape may not be well-founded. The handling of recursive shapes in SHACL is left to implementations. Some implementations MAY reject shapes graphs containing recursive shape definitions. Some implementations MAY report a failure if a recursion has been detected at validation time.

In the following example, all values of the property ex:someProperty will validate with no results for the shape specified by a blank node that ensures that the property ex:nestedProperty has at least one value.

ex:ShapeExampleShape
	a sh:Shape ;
	sh:property [
		sh:predicate ex:someProperty ;
		sh:shape [
			a sh:Shape ;   # Optional
			sh:predicate [
				sh:predicate ex:nestedProperty ;
				sh:minCount 1 ;
			]
		]
	] .

ex:ShapeExampleValidResource
	ex:someProperty [
		ex:nestedProperty 42 ;
	] .

4.8.2 sh:qualifiedValueShape, sh:qualifiedMinCount, sh:qualifiedMaxCount

Issue 92: Relationship of QCRs with Partitions

Feature at risk: There is an ongoing proposal (see sh:partition) that strongly overlaps with the QCRs proposed here. The WG may decide to support only one of these options, but not both.

The property sh:qualifiedValueShape can be used verify that a certain number of value nodes must have a given shape. The value type of sh:qualifiedValueShape is sh:Shape, and it needs to be accompanied by a sh:qualifiedMinCount or a sh:qualifiedMaxCount (both typed xsd:integer), or both. The rdf:type of the value shapes can be omitted.

Constraint Component: sh:QualifiedValueShapeConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:qualifiedValueShape`	`sh:Shape`	The required shape of the specified values
`sh:qualifiedMinCount`	`xsd:integer`	The minimum number of values that must have the shape. If this constraint is omitted then there is no minimum number of values required.
`sh:qualifiedMaxCount`	`xsd:integer`	The maximum number of values that can have the shape. If this constraint is omitted then there is no maximum number of values required.

TEXTUAL DEFINITION of sh:qualifiedMinCount

Let C be the number of value nodes where validating the node against the shape specified by sh:qualifiedValueShape produces no validation results with severity sh:Violation. A failure must be produced if the validation of any of the value nodes produces a failure. A validation result must be produced if C is less than the specified sh:qualifiedMinCount.

SPARQL DEFINITION of sh:qualifiedMinCount (Must return no results for the given $PATH)

SELECT $this (SUM(?failed)>0 AS ?failure)
WHERE {
	{
		$this $PATH ?value .
		BIND (sh:hasShape(?value, $qualifiedValueShape, $shapesGraph) AS ?hasShape) .
		FILTER (!bound(?hasShape) || ?hasShape) .
	}
	BIND (IF(!bound(?hasShape), 1, 0) AS ?failed) .
} 
GROUP BY $this
HAVING ((COUNT(?value) < $qualifiedMinCount) || (SUM(?failed) > 0))

Note that in the SPARQL query above, we assume that the SUM operation fails if one of the values of ?s is not a number. This mechanism is used by the error handling, which sets ?s to the string 'error' whenever one of the individual sh:hasShape calls fails.

TEXTUAL DEFINITION of sh:qualifiedMaxCount

SPARQL DEFINITION of sh:qualifiedMaxCount (Must return no results for the given $PATH)

SELECT $this (SUM(?failed)>0 AS ?failure)
WHERE {
	{
		$this $PATH ?value .
		BIND (sh:hasShape(?value, $qualifiedValueShape, $shapesGraph) AS ?hasShape) .
		FILTER (!bound(?hasShape) || ?hasShape) .
	}
	BIND (IF(!bound(?hasShape), 1, 0) AS ?failed) .
} 
GROUP BY $this
HAVING ((COUNT(?value) > $qualifiedMaxCount) || (SUM(?failed) > 0))

In the following example, the property ex:parent must have exactly two values, and at least one of them needs to be female.

ex:QualifiedValueShapeExampleShape
	a sh:Shape ;
	sh:targetNode ex:QualifiedValueShapeExampleValidResource ;
	sh:property [
		sh:predicate ex:parent ;
		sh:minCount 2 ;
		sh:maxCount 2 ;
		sh:qualifiedValueShape [
			a sh:Shape ;   # Optional
			sh:property [
				sh:predicate ex:gender ;
				sh:hasValue ex:female ;
			]
		] ;
		sh:qualifiedMinCount 1 ;
	] .

ex:QualifiedValueShapeExampleValidResource
	ex:parent ex:John ;
	ex:parent ex:Jane .

ex:John
	ex:gender ex:male .

ex:Jane
	ex:gender ex:female .

4.8.3 sh:partition

In some cases a given property may be multi-valued and it may be required that the set of values be partitioned into two or more subsets, each of which satisfies certain constraints.

For example, suppose that in the Library of Congress BIBFRAME (bf:) Cultural Heritage vocabulary each person (bf:Person) must be identified by (bf:identifiedBy) exactly one identifier from id.loc.gov and may have another identifier from viaf.org. No other identifiers are allowed. Thus the set of all identifiers is partitioned into two subsets, the first of which contains exactly one member and the second of which contains zero or one members. The following example shows a snippet of some valid BIBFRAME data.

<bf_Person1>
  	bf:identifiedBy <http://id.loc.gov/authorities/names/n80103961#RWO> ;
 	bf:identifiedBy <https://viaf.org/viaf/268367832/#Knape,_Joachim> .

The following example shows a snippet of some invalid BIBFRAME data.

<bf_Person1>
  	bf:identifiedBy <http://id.loc.gov/authorities/names/n80103961#RWO> ;
 	bf:identifiedBy <https://viaf.org/viaf/268367832/#Knape,_Joachim> ;
	bf:identifiedBy "this is a mistake" . # should be an error

Qualified cardinality constraints provide a basis for expressing this type of partitioning requirement, but using them imposes a burden on the shapes author. In the BIBFRAME example the author would need to express the requirement that the set of all identifiers that are from neither id.loc.gov nor viaf.org is empty, i.e. it has a maximum cardinality of 0. Clearly, as more subsets of values are involved, the burden on the author increases. The sh:partition constraint makes it easier to express this type of requirement than it would be to use multiple qualified cardinality constraints. In effect, sh:partition chains together a sequence of qualified cardinality constraints and removes the set of value nodes matched by each from further consideration. If every value node gets matched in this process, then the sh:partition constraint reports no violations. Otherwise, any value nodes remaining are reported as violations of the constraint. The BIBFRAME example constraint is expressed as follows.

ex:BibframeShape a sh:Shape ;
	sh:property [
		sh:predicate bf:identifiedBy ;
		sh:partition (
			[sh:minCount 1; sh:maxCount 1; sh:pattern "^http://id.loc.gov/"]
			[sh:maxCount 1; sh:pattern "^https://viaf.org/"]
		)
] .

The value of the sh:partition constraint parameter MUST be an rdf:List that contains zero or more resources. Each resource in the list defines conditions on a subset of the value nodes and MAY contain the following parameters:

zero or one sh:minCount. This defines the minimum cardinality of the corresponding subset.
zero or one sh:maxCount. This defines the maximum cardinality of the corresponding subset.
any combination of parameters associated with node validation constraints. A node validation constraint is any constraint defined by a boolean function on nodes. These include the built-in constraints defined by sh:nodeKind, sh:partition, sh:minExclusive, etc. The corresponding subset consists of those remaining nodes for which the boolean function is true.

Note that a resource that contains no parameters matches all nodes. Such a resource is useful as the last member of the list where it acts as a default matching rule in the case where nodes that do not match any of the preceeding constraints are allowed. Note also that a qualified cardinality constraint defined using sh:qualifiedValueShape, sh:qualifiedMinCount, and sh:qualifiedMaxCount is equivalent to a sh:partition constraint that contains two resources with the first one containing the corresponding parameters and the last one being the default matching rule that matches any set of nodes.

Each member of the list is used by the SHACL processor to match a subset of the value nodes. The SHACL processor matches as many nodes as possible and then compares the result with the specified minimum and maximum cardinalities if specified. This is referred to as a greedy matching algorithm. Greedy pattern matching is commonly used with textual regular expressions. Nodes that match are removed from further matching. Thus the set of all value nodes becomes partitioned by the matching algorithm. The following paragraphs define this algorithm more precisely.

Let D be a data graph and let F be a focus node in D. Let S be a shapes graph, let T be a shape in S, and let C be a sh:partition constraint in T. Let N be the set of value nodes for C in D at F. Recall that N depends on how C is related to T.

If (T, sh:constraint, C) is in S then N consists of just the node F.
If (T, sh:property, C) and (C, sh:predicate, P) are in S then N consists of all the nodes X such that (F, P, X) is in D.

Let the value of the sh:partition parameter be the list (Q₁, ..., Q_n) of resources. The SHACL validator MUST perform the following steps to validate the constraint C at F.

Let R denote the set of remaining value nodes. Initialize R to N.
Repeat the following for Q = Q₁, ..., Q_n
1. Let P be the conjunction of all the node validation constraints in Q.
2. Compute R' to be the set of all nodes in R that satisfy P, i.e. R' = {X in R | P(X) = true}
3. If Q contains a minimum cardinality m_min and the number of nodes in R' is less than m_min, i.e. m_min > #R', then report a constraint violation and exit the loop.
4. If Q contains a maximum cardinality m_max and the number of nodes in R' is greater then m_max, i.e. m_max < #R', then report a constraint violation and exit the loop.
5. Remove R' from R, i.e. set R = R \ R'.
If R is non-empty and no violations have been reported yet then report a violation.

Note that the order of resources within the list is significant. In general, if the members of the list are reordered then different value node sets will be matched and different violation results will be reported.

4.9 Other Constraint Components

This section enumerates Core constraint components that did not fit into the other categories.

4.9.1 sh:closed, sh:ignoredProperties

The RDF data model offers a huge amount of flexibility. Any resource can in principle have values for any property. However, in some cases it makes sense to restrict which properties can be applied to resources. The SHACL Core language includes a property called sh:closed that can be assigned to a shape via the property sh:constraint to indicate that valid resources must only have values for those properties that have been explicitly declared via sh:property.

Constraint Component: sh:ClosedConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:closed`	`xsd:boolean`	Set to `true` to close the shape
`sh:ignoredProperties`	`rdf:List` (members: `rdf:Property`)	Optional list of properties that are also permitted in addition to those explicitly enumerated via `sh:property`

TEXTUAL DEFINITION

If sh:closed is true then a validation result must be produced for each triple that has the focus node as its subject and a predicate that is not explicitly enumerated as a sh:predicate in any of the sh:property constraints at the surrounding shape. If the parameter sh:ignoredProperties is present then the properties enumerated in this list are also permitted. The produced validation result must have the predicate of the triple as its sh:path, and the object of the triple as its sh:value.

SPARQL DEFINITION (Must return no results)

SELECT $this (?predicate AS ?path) ?value
WHERE {
	{
		FILTER ($closed) .
	}
	$this ?predicate ?value .
	FILTER (NOT EXISTS {
		GRAPH $shapesGraph {
			$currentShape sh:property/sh:predicate ?predicate .
		}
	} && (!bound($ignoredProperties) || NOT EXISTS {
		GRAPH $shapesGraph {
			$ignoredProperties rdf:rest*/rdf:first ?predicate .
		}
	}))
}

The following example illustrates the use of sh:closed in a shape to verify that certain nodes only have values for ex:exampleProperty1 and ex:exampleProperty2. The "ignored" property rdf:type would also be allowed.

ex:ClosedShapeExampleShape
	a sh:Shape ;
	sh:targetNode ex:Alice, ex:Bob ;
	sh:closed true ;
	sh:ignoredProperties (rdf:type) ;
	sh:property [
		sh:predicate ex:firstName ;
	] ;
	sh:property [
		sh:predicate ex:lastName ;
	] .

ex:Alice
	ex:firstName "Alice" .

ex:Bob
	ex:firstName "Bob" ;
	ex:middleInitial "J" .

4.9.2 sh:hasValue

The property sh:hasValue can be used to verify that one of the value nodes is a given RDF node.

Constraint Component: sh:HasValueConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:hasValue`	any	A specific required value

TEXTUAL DEFINITION

A validation result must be produced if the node specified by sh:hasValue is not among the value nodes.

SPARQL DEFINITION (Must return no results for the given $PATH)

SELECT $this
WHERE {
	FILTER NOT EXISTS { $this $PATH $hasValue }
}

ex:StanfordGraduate
	a sh:Shape ;
	sh:targetNode ex:Alice ;
	sh:property [
		sh:predicate ex:alumniOf ;
		sh:hasValue ex:Stanford ;
	] .

ex:Alice
	ex:alumniOf ex:Harvard ;
	ex:alumniOf ex:Stanford .

4.9.3 sh:in

The property sh:in exclusively enumerates the permitted value nodes. For example when specified as part of a property constraint, then each value of the given property must be a member of the specified list.

Constraint Component: sh:InConstraintComponent

Parameters:

Property	Value Type	Summary
`sh:in`	`rdf:List`	Enumeration of allowed values

TEXTUAL DEFINITION

The values of sh:in must be well-formed rdf:Lists. The members of that rdf:List must not be blank nodes. A validation result must be produced for every value node that is not a member of the given list. Matching of literals needs to be exact, e.g. "04"^^xsd:byte does not match "4"^^xsd:integer.

SPARQL DEFINITION (Must evaluate to true for each value node $value)

ASK {
	GRAPH $shapesGraph {
		$in (rdf:rest*)/rdf:first $value .
	}
}

ex:InExampleShape
	a sh:Shape ;
	sh:targetNode ex:RainbowPony ;
	sh:property [
		sh:predicate ex:color ;
		sh:in ( ex:Pink ex:Purple ) ;
	] .

ex:RainbowPony ex:color ex:Pink .

4.10 Non-Validating Constraint Characteristics

While the previous sections introduced properties that represent validation conditions, this section covers properties that are ignored by SHACL validation engines. The use of these properties is entirely optional and not subject to formal interpretation contracts. They may be used for purposes such as form building or predictable printing of RDF files.

Property constraints may have one or more values for sh:name to provide human-readable labels for the property in the target where it appears. If present, tools SHOULD prefer those locally defined labels over globally defined labels at the rdf:Property itself. For example, if a form displays a resource that is in the target of a given shape, and the shape defines a sh:property constraint with an sh:name, then the tool SHOULD use the provided name. Similarly, property constraints may have an sh:description to provide a description of the property in the given context. Both sh:name and sh:description may have multiple values, but SHOULD only have one value per language tag.

Property constraints may have one value for the property sh:order to indicate the relative order of the property constraint for purposes such as form building. The values of sh:order must be decimals. sh:order is not used for validation purposes. If present, the recommended use of sh:order is to sort the property constraints in an ascending order, for example so that properties with smaller order are placed above (or to the left) of properties with larger order.

Property constraints may link to an SHACL instance of the class sh:PropertyGroup using the property sh:group to indicate that the constraint belongs to a group of related property constraints. Each group may have additional triples that serve application purposes, such as an rdfs:label for form building. Groups may also have an sh:order property to indicate the relative ordering of groups within the same form.

Property constraints may have a single value for sh:defaultValue. The default value does not have fixed semantics in SHACL, but MAY be used by user interface tools to pre-populate input widgets. The value type of the sh:defaultValue SHOULD align with the specified sh:datatype or sh:class of the same constraint.

The following example illustrates the use of these various features together.

ex:PersonFormShape
	a sh:Shape ;
	sh:property [
		sh:predicate ex:firstName ;
		sh:name "first name" ;
		sh:description "The person's given name(s)" ;
		sh:order 0 ;
		sh:group ex:NameGroup ;
	] ;
	sh:property [
		sh:predicate ex:lastName ;
		sh:name "last name" ;
		sh:description "The person's last name" ;
		sh:order 1 ;
		sh:group ex:NameGroup ;
	] ;
	sh:property [
		sh:predicate ex:streetAddress ;
		sh:name "street address" ;
		sh:description "The street address including number" ;
		sh:order 11 ;
		sh:group ex:AddressGroup ;
	] ;
	sh:property [
		sh:predicate ex:locality ;
		sh:name "locality" ;
		sh:description "The suburb, city or town of the address" ;
		sh:order 12 ;
		sh:group ex:AddressGroup ;
	] ;
	sh:property [
		sh:predicate ex:postalCode ;
		sh:name "postal code" ;
		sh:name "zip code"@en-US ;
		sh:description "The postal code of the locality" ;
		sh:order 13 ;
		sh:group ex:AddressGroup ;
	] .

ex:NameGroup
	a sh:PropertyGroup ;
	sh:order 0 ;
	rdfs:label "Name" .

ex:AddressGroup
	a sh:PropertyGroup ;
	sh:order 1 ;
	rdfs:label "Address" .

A form building application may use the information above to display information as follows:

Name

first name:	John
last name:	Doe

Address

street address:	123 Silverado Ave
locality:	Cupertino
zip code:	54321

5. SPARQL-based Constraints

SHACL supports two mechanisms to define constraints using SPARQL:

Constraints based on a high-level vocabulary (constraint components), relying on a reusable SPARQL query in the background
Constraints directly based on a SPARQL query

The following sub-sections are about the latter.

5.1 Syntax of SPARQL-based Constraints

sh:SPARQLConstraint is an rdfs:subClassOf sh:Constraint and is the class of all SPARQL-based constraints. SPARQL-based constraints must have exactly one value for the property sh:select. The SPARQL queries linked to a constraint via sh:select must be string literals that can be parsed into legal SPARQL 1.1 queries of the query form SELECT.

Issue 105: Defined prefixes

There is an ongoing discussion about whether and how to inject prefixes into SPARQL queries, so the following paragraph may change.

Before parsing the values of sh:select, a SHACL processor must prepend PREFIX declarations for all namespace prefixes declared via the property sh:prefix in the current shapes graph. The subjects of sh:prefix triples must be IRIs, which become the IRIREF in the PREFIX declaration. The objects of sh:prefix triples must be string literals, which become the PNAME_NS in the PREFIX declaration. For the example shapes graph below, a SHACL processor would produce the line PREFIX ex: <http://example.com/ns#>. No such PREFIX must be generated if the SPARQL string already contains a PREFIX statement for the same prefix at the top-level query (ignoring prefixes from nested SELECT queries). The SHACL processor must produce a failure if the shapes graph contains multiple sh:prefix triples with the same object. Since the use of sh:prefix triples may lead to conflicts, it is recommended to only use them in closed and controlled environments or for well-established prefixes. In the rest of this document, the sh:prefix statements may have been omitted for brevity.

The following example illustrates the definition of a SPARQL-based constraint.

ex:ValidCountry a ex:Country ;
	ex:germanLabel "Spanien"@de .
  
ex:InvalidCountry a ex:Country ;
	ex:germanLabel "Spain"@en .

<http://example.com/ns#> sh:prefix "ex" .

ex:LanguageExampleShape
	a sh:Shape ;
	sh:targetClass ex:Country ;
	sh:sparql [
		a sh:SPARQLConstraint ;   # This triple is optional
		sh:message "Values must be literals with German language tag." ;
		sh:select """
			SELECT $this (ex:germanLabel AS ?path) ?value
			WHERE {
				$this ex:germanLabel ?value .
				FILTER (!isLiteral(?value) || !langMatches(lang(?value), "de"))
			}
			""" ;
	] .

The target of the shape above includes all SHACL instances of ex:Country. For those RDF nodes (represented by the variable $this), the SPARQL query walks through the values of ex:germanLabel and verifies that they are literals with a German language code. The validation results for the aforementioned data graph is shown below:

[
	a sh:ValidationResult ;
	sh:severity sh:Violation ;
	sh:focusNode ex:InvalidCountry ;
	sh:path ex:germanLabel ;
	sh:value "Spain"@en ;
	sh:sourceShape ex:LanguageExampleShape ;
	...
]

The SPARQL query returns result set rows for all bindings of ?value that violate the constraint. A validation result is produced for each row in that result set, following the mapping rules explained later: Each validation result will have $this as the sh:focusNode, ex:germanLabel as sh:path and the violating value as sh:value.

5.2 Pre-bound Variables in SPARQL Constraints ($this, $shapesGraph, $currentShape)

The following table enumerates variables that have special meaning in SPARQL constraints. When SPARQL constraints are executed, the validation engine should pre-bind values for these variables.

Variable	Interpretation
`$this`	The focus node.
`$shapesGraph`	Can be used to query the shapes graph as in `GRAPH $shapesGraph { ... }`. If the shapes graph is a named graph in the same dataset as the data graph then it is the IRI of the shapes graph in the dataset. Not all SHACL validation engines need to support this variable. Processors that do not support `$shapesGraph` MUST report a failure if they encounter a query that references this variable. Use of `GRAPH $shapesGraph { ... }` should be handled with extreme caution. It may result in constraints that are not interoperable across different SHACL validation engines and that may not run on remote RDF datasets.
`$currentShape`	The currently validated shape. Typically used in conjunction with `$shapesGraph`. The same support policies as for `$shapesGraph` apply for this variable.

5.3 Mapping of Result Variables to Validation Results

If one of the rows of the result set produced by a SELECT query contains the binding true for the variable ?failure, then the validation engine must signal a failure.

Otherwise, each row of the result set produced by a SELECT query must be converted into one validation result resource. The properties of those resources are derived by the following rules, through a combination of result variables and the properties linked to the constraint itself. The production rules are meant to be executed from top to bottom, so that the first bound value will be used.

Property	Production Rules
`sh:severity`	The value of `sh:severity` of the constraint node For constraint components, the value of `sh:severity` of the constraint component resource `sh:Violation` as default
`sh:focusNode`	The value of the variable `?focusNode` The value of the variable `$this`
`sh:path`	The value of the variable `?path` (only supports property IRIs, no complex paths)
`sh:value`	The value of the variable `?value`
`sh:message`	The value of the variable `?message` The values of `sh:message` of the subject of the `sh:select` or `sh:ask` triple. These string literals may reference any variable from the SELECT result variables via `{?varName}`. If the constraint is based on a constraint component, then the component's parameter variables can also be used. The `{?varName}` blocks SHOULD be substituted with suitable string representations of the values of said variables.
`sh:sourceConstraint`	The constraint that was validated against
`sh:sourceShape`	The shape that was validated against

5.4 Injecting Annotation Properties into Validation Results

It is possible to inject additional annotation properties into the validation result resources created for each row of the SELECT result sets. Any such property needs to be declared via a value of sh:resultAnnotation at the subject holding the sh:select or sh:ask triple. The values of sh:resultAnnotation must be IRIs or blank nodes with the following properties:

Property	Value type	Count	Description
`sh:annotationProperty`	`rdf:Property`	`1 (mandatory)`	The annotation property that shall be set
`sh:annotationVarName`	`xsd:string`	`0..1`	The name of the SPARQL variable to take the values from
`sh:annotationValue`		`0..unlimited`	Constant nodes that shall be used as default values

For each row of a SELECT result set, a SHACL processor must walk through the declared result annotations. The mapping from result annotations to SPARQL variables uses the following rules:

If a sh:resultAnnotation defines a sh:annotationVarName then the validation engine must look for the variable named after the sh:annotationVarName
Otherwise, the validation engine must derive a variable name from the value of sh:annotationProperty using the same local name mechanism as described earlier

If a variable name could be determined, then the validation engine must copy the bindings for the given variable into the constructed validation results for the current row. If the variable has no binding in the result set row, then the value of sh:annotationValue must be used, if present.

The values of sh:annotationProperty must not be from the SHACL namespace, to avoid clashes with variables that are already produced by other means.

Here is a slightly complex example, illustrating the use of result annotations.

ex:ShapeWithPathViolationExample
	a sh:Shape ;
	sh:targetNode ex:ExampleRootResource ;
	sh:sparql [
		sh:resultAnnotation [
			sh:annotationProperty ex:time ;
			sh:annotationVarName "time"
		] ;
		sh:select """
			SELECT $this (ex:property1 AS ?path) (?first AS ?value) ?message ?time
			WHERE {
				$this ex:property1 ?first .
				?subject ex:property2 ?first .
				FILTER isBlank(?value) .
				BIND (CONCAT("The ", "message.") AS ?message) .
				BIND (NOW() AS ?time) .
			}
			""" ;
	] .

ex:ExampleRootResource
	ex:property1 ex:ExampleIntermediateResource .

ex:ExampleValueResource
	ex:property2 ex:ExampleIntermediateResource .

Which produces the following validation result resource:

[
	a sh:ValidationResult ;
	sh:severity sh:Violation ;
	sh:focusNode ex:ExampleRootResource ;
	sh:path ex:property1 ;
	sh:value ex:ExampleIntermediateResource ;
	sh:message "The message." ;
	sh:sourceConstraint [ the blank node of the sh:sparql above ] ;
	sh:sourceShape ex:ShapeWithPathViolationExample ;
	ex:time "2015-03-27T10:58:00"^^xsd:dateTime ;  # Example
] .

6. SPARQL-based Constraint Components

SPARQL-based constraints as introduced in the previous section provide a lot of flexibility. However, SPARQL-based constraints may be hard to understand for some people or lead to repetition. Constraint components are a way to abstract the complexity of SPARQL and define high level reusable components similar to the Core constraint components. The definition of such constraint components can be represented in the SHACL RDF vocabulary and thus shared and reused.

sh:ConstraintComponent is the class of all constraint components. Each constraint component must define:

one or more parameters (e.g. sh:class, sh:stem)
at least one validator

6.1 An Example Constraint Component

This section is non-normative.

The following example demonstrates how SPARQL-based constraint components can be applied to define Core elements of the SHACL language itself. The example implements sh:pattern and sh:flags using a SPARQL ASK query to validate that each value node matches a given regular expression. Note that this is only an example implementation and should not be considered normative.

sh:PatternConstraintComponent
	a sh:ConstraintComponent ;
	sh:parameter [
		sh:predicate sh:pattern ;
		sh:order 0 ;
	] ;
	sh:parameter [
		sh:predicate sh:flags ;
		sh:optional true ;
		sh:order 1 ;
	] ;
	sh:validator shimpl:hasPattern .

shimpl:hasPattern
	a sh:SPARQLAskValidator ;
	sh:message "Value does not match pattern {$pattern}" ;
	sh:ask "ASK { FILTER (!isBlank($value) && IF(bound($flags), regex(str($value), $pattern, $flags), regex(str($value), $pattern))) }" .

The following sections introduce the properties that constraint components may have. Some of these properties are independent of SPARQL-based execution and apply to constraint components based on other potential extension languages such as JavaScript too.

6.2 Parameters Declaration (sh:parameter)

The parameters of a constraint component are declared via the property sh:parameter. Each parameter must be a SHACL instance of sh:Parameter, but the rdf:type triples can be omitted.

Issue 107: Specifying SPARQL variable name

There is an open issue about the relationship between SPARQL variable name and sh:predicate. Possible revisions may require an additional property similar to sh:annotationVarName.

Each sh:Parameter must have exactly one value p for the property sh:predicate and the value must be an IRI. The local name of an IRI is defined as the longest NCNAME at the end of the IRI, not immediately preceded by the first colon in the IRI. The local names of the values of sh:predicate must fulfill the following conditions (to ensure that a correct mapping from parameters into SPARQL variables is possible):

The local name must be a valid SPARQL VARNAME
There must not be any other declared parameter for the same constraint component that has a sh:predicate with the same local name
The local name must not be this, shapesGraph or currentShape.
The local name must not be focusNode, path or value.

An sh:Parameter may have its property sh:optional set to true to indicate that the parameter is not mandatory. Every sh:ConstraintComponent must have at least one non-optional parameter.

The class sh:Parameter is defined as a SHACL subclass of sh:PropertyConstraint, and all properties that are applicable to property constraints may also be used for parameters. This includes descriptive properties such as sh:name and sh:description but also constraint parameters such as sh:class. Some implementations MAY use these constraint parameters to prevent the execution of constraint components with invalid parameter values.

6.3 Label Templates (sh:labelTemplate)

The property sh:labelTemplate can be used at any constraint component to suggest how they could be rendered to humans. The values of sh:labelTemplate must be strings (possibly with language tag) that can reference the values of the declared parameters using the syntax {?varName} or {$varName}, where varName is the name of the SPARQL variable that corresponds to the parameter. At display time, these {...} blocks SHOULD be substituted with the actual parameter values. There may be multiple label templates for the same subject, assuming they do not have the same language tags.

6.4 Validators

For every supported context (i.e., property constraint or shape) the constraint component must declare a suitable validator. For a given constraint, a validator is selected from the constraint component using the following rules:

For shapes, use one of the values of sh:shapeValidator, if present.
For property constraints, use one of the values of sh:propertyValidator, if present.
Otherwise, use one of the values of sh:validator.

If no suitable validator can be found, a SHACL processor ignores the constraint. The SHACL WG is seeking practical feedback on what the default behavior should be, and whether we should report violations in those cases.

SHACL includes two types of validators, based on SPARQL SELECT (for sh:shapeValidator and sh:propertyValidator) or SPARQL ASK queries (for sh:validator).

6.4.1 Validators based on SPARQL SELECT Queries

Validators that have the rdf:type sh:SPARQLSelectValidator must point at exactly one string representation of a SPARQL SELECT query via the property sh:select. The value of sh:select must be a valid SPARQL query using the aforementioned prefix handling rules. This type of validator can be used as values of sh:shapeValidator or sh:propertyValidator.

The following example illustrates the definition of a constraint component based on a SPARQL SELECT query. It is a generalized variation of the SPARQL-based example constraint from the section on SPARQL-based constraints. That SPARQL query included two constants: the specific property ex:germanLabel and the language tag de. Constraint components make it possible to generalize such scenarios, so that constants get pre-bound with parameters. This allows the query logic to be reused in multiple places, without having to write any new SPARQL.

ex:LanguageConstraintComponentUsingSELECT
	a sh:ConstraintComponent ;
	rdfs:label "Language constraint component" ;
	sh:parameter [
		sh:predicate ex:lang ;
		sh:datatype xsd:string ;
		sh:minLength 2 ;
		sh:name "language" ;
		sh:description "The language tag, e.g. \"de\"." ;
	] ;
	sh:labelTemplate "Values must be literals with language \"{$lang}\"" ;
	sh:propertyValidator [
		a sh:SPARQLSelectValidator ;
		sh:message "Values must be literals with language \"{?lang}\"" ;
		sh:select """
			SELECT $this ?value
			WHERE {
				$this $PATH ?value .
				FILTER (!isLiteral(?value) || !langMatches(lang(?value), $lang))
			}
			"""
	] .

Once a constraint component has been defined, its parameters can be used as illustrated in the following example.

ex:LanguageExampleShape
	a sh:Shape ;
	sh:targetClass ex:Country ;
	sh:property [
		sh:predicate ex:germanLabel ;
		ex:lang "de" ;
	] ;
	sh:property [
		sh:predicate ex:englishLabel ;
		ex:lang "en" ;
	] .

The example shape above specifies that all values of ex:germanLabel must carry the language tag de while all values of ex:englishLabel must have en as their language. These details are specified via two property constraints that provide values for the ex:lang parameter required by the constraint component.

SELECT queries used in the context of property constraints must use a special variable named $PATH as a placeholder for the predicate or path used by the constraint. The only legal use of this variable is in the predicate position of a triple pattern. A query that uses the variable $PATH in any other position is invalid. Furthermore, any query that uses the variable $this in an aggregation is invalid.

A SPARQL-based SHACL validation engine executes the provided SPARQL query to produce validation results. In the context of property constraints, the engine will first substitute all occurrences of the variable $PATH with the provided property path derived from the value of either sh:predicate or sh:path in the constraint. The resulting SPARQL query is then evaluated with the same pre-bound variables as outlined in the section for SPARQL-based Constraints ($this etc). Additionally, the value of each declared parameter of the constraint component needs to be pre-bound for the variable derived by the local name of the parameter's sh:predicate. For example, if a non-optional parameter declares sh:predicate ex:lang then the variable $lang needs to be pre-bound. The result set of the SELECT query is turned into validation results using the same rules as outlined in the section for SPARQL-based Constraints. In addition to the result properties listed in that section, the property sh:sourceConstraintComponent MUST point at the IRI of the constraint component that has been evaluated. Furthermore, a sh:SPARQLSelectValidator may declare additional annotation properties via sh:resultAnnotation.

6.4.2 Validators based on SPARQL ASK Queries

Many constraint components are of the form in which all value nodes are tested individually against some boolean condition. Writing SELECT queries for these becomes burdensome, especially if a constraint component can be used for both property constraints and shapes. SHACL provides an alternative, more compact syntax for validators based on ASK queries. This type of validators can be used as values of the property sh:validator.

Validators that have the rdf:type sh:SPARQLAskValidator must point at exactly one string representation of a SPARQL ASK query via the property sh:ask. The value of sh:ask must be a valid SPARQL query using the aforementioned prefix handling rules. The ASK queries are expected to return true if a given value node (represented by the pre-bound variable $value) is valid.

Prior to evaluation, a SHACL validation engine transforms the provided ASK query into a SELECT query using the following templates. The resulting SELECT query can then be evaluated using the same algorithm as for SELECT-based validators. The engine drops the ASK keyword, leaving only the fraction between the outermost {...} pair. This block then substitutes ... in the template.

Template for sh:Shape context:

	SELECT $this ?value
	WHERE {
		BIND ($this AS ?value) .
		FILTER NOT EXISTS ...
	}

Template for sh:PropertyConstraint context:

	SELECT $this ?value
	WHERE {
		$this $PATH ?value .
		FILTER NOT EXISTS ...
	}

Once the corresponding template has been applied, the resulting SELECT query will be evaluated using the same approach as outlined above. Actual SHACL implementations may of course use a different approach internally, as long as the results are equivalent to the described approach.

The following example defines a constraint component using an ASK query.

ex:LanguageConstraintComponentUsingASK
	a sh:ConstraintComponent ;
	rdfs:label "Language constraint component" ;
	sh:parameter [
		sh:predicate ex:lang ;
		sh:datatype xsd:string ;
		sh:minLength 2 ;
		sh:name "language" ;
		sh:description "The language tag, e.g. \"de\"." ;
	] ;
	sh:labelTemplate "Values must be literals with language \"{$lang}\"" ;
	sh:validator ex:hasLang .
	
ex:hasLang
	a sh:SPARQLAskValidator ;
	sh:message "Values must be literals with language \"{$lang}\"" ;
	sh:ask """
		ASK {
			FILTER (isLiteral($value) && langMatches(lang($value), $lang))
		}
		""" .

Note that the validation condition implemented by an ASK query is "in the inverse direction" from its SELECT counterpart: ASK queries return true for valid value nodes, while SELECT queries return the invalid value nodes.

TODO: The TopBraid SHACL API uses such ASK constraint declarations to install new SPARQL functions. Time permitting we could standardize that too, so that people can reuse the same business logic in the queries.

6.5 Evaluation

A constraint component is triggered for every SHACL instance of a context that defines all non-optional parameters. TODO: This is unclear.

Parameter	Value Type	Summary
`focusNode`	`rdfs:Resource`	The focus node to validate.
`shape`	`rdfs:Resource`	The shape to validate the focus node against.
`shapesGraph`	`rdfs:Resource`	The IRI of the current shapes graph.

Abstract

Status of This Document

Revision History

Document Outline

1. Introduction

1.1 Terminology

1.2 Document Conventions

1.3 SHACL Example

1.4 Relationship between SHACL and RDFS inferencing

1.5 Relationship between SHACL and SPARQL

1.6 Conformance

Part 1: Core Features

2. Shapes

2.1 Targets

2.1.1 Node targets (sh:targetNode)

2.1.2 Class-based Targets (sh:targetClass)

2.1.2.1 Implicit Class Targets

2.1.3 Subjects-of targets (sh:targetSubjectsOf)

2.1.4 Objects-of targets (sh:targetObjectsOf)

2.2 Filter Shapes

2.3 Constraints

2.3.1 Property Constraints (sh:predicate or sh:path)

2.3.2 Focus Node Constraints

2.3.3 Multiple Parameters

2.3.4 Property Paths

3. Validation and Graphs

3.1 Shapes Graph

3.2 Data Graph

3.3 Linking to shapes graphs (sh:shapesGraph)

3.4 Validation report

3.4.1 Validation Results

3.4.1.1 Focus node (sh:focusNode)

3.4.1.2 Path (sh:path)

3.4.1.3 Value (sh:value)

3.4.1.4 Source (sh:sourceConstraint, sh:sourceShape and sh:sourceConstraintComponent)

3.4.1.5 Detail (sh:detail)

3.4.1.6 Message (sh:message)

3.4.1.7 Severity (sh:severity)

3.4.1.8 Declaring the Severity of a Constraint

4. Core Constraint Components

4.1 SPARQL definitions for built-in constraint components

4.2 Value Type Constraint Components

4.2.1 sh:class

4.2.2 sh:datatype

4.2.3 sh:nodeKind

4.3 Cardinality Constraint Components

4.3.1 sh:minCount

4.3.2 sh:maxCount

4.4 Value Range Constraint Components

4.4.1 sh:minExclusive, sh:minInclusive, sh:maxExclusive, sh:maxInclusive

4.5 String-based Constraint Components

4.5.1 sh:minLength

4.5.2 sh:maxLength

4.5.3 sh:pattern

4.5.4 sh:stem

4.5.5 sh:uniqueLang

4.6 Property Pair Constraint Components

4.6.1 sh:equals

4.6.2 sh:disjoint

4.6.3 sh:lessThan

4.6.4 sh:lessThanOrEquals

4.7 Logical Constraint Components

4.7.1 sh:not

4.7.2 sh:and

4.7.3 sh:or

4.8 Shape-based Constraint Components

4.8.1 sh:shape

4.8.2 sh:qualifiedValueShape, sh:qualifiedMinCount, sh:qualifiedMaxCount

4.8.3 sh:partition

4.9 Other Constraint Components

4.9.1 sh:closed, sh:ignoredProperties

4.9.2 sh:hasValue

4.9.3 sh:in

4.10 Non-Validating Constraint Characteristics

Part 2: Advanced Features

5. SPARQL-based Constraints

5.1 Syntax of SPARQL-based Constraints

5.2 Pre-bound Variables in SPARQL Constraints ($this, $shapesGraph, $currentShape)

5.3 Mapping of Result Variables to Validation Results

5.4 Injecting Annotation Properties into Validation Results