W3C

XSL Transformations (XSLT)
Version 2.0

W3C Working Draft 16 August 2002

This version:
http://www.w3.org/TR/2002/WD-xslt20-20020816/
Latest version:
http://www.w3.org/TR/xslt20/
Previous versions:
http://www.w3.org/TR/2002/WD-xslt20-20020430/
http://www.w3.org/TR/2001/WD-xslt20-20011220/
Editor:
Michael Kay (Software AG) <Michael.Kay@softwareag.com>

Abstract

This specification defines the syntax and semantics of XSLT 2.0, which is a language for transforming XML documents into other XML documents.

XSLT 2.0 is designed to be used in conjunction with XPath 2.0, which is defined in [XPath 2.0].

Status of this document

This document is a working draft of XSLT 2.0. It is published in order to provide the XSLT user community with a progress report on the evolving language specification and the outstanding issues that remain to be decided. Changes since the second published Working Draft (published on 30 April 2002) are indicated by background highlighting (green for modified text, yellow for new text), or in the case of deleted text, by strikethrough. A summary of the changes is included at [K.2.4 Changes since the April 2002 draft]. The most significant change is the addition of an instruction, xsl:analyze-string, to match text against a regular expression. This is designed to facilitate applications of XSLT that add XML markup to documents whose structure is implicit in the text, using conventions such as comma-separated values.

This remains a draft document and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress". While prototype implementations are encouraged, users and vendors are advised that this working draft cannot be regarded as a stable specification.

XSLT 2.0 is a revised version of the XSLT 1.0 Recommendation [XSLT 1.0] published on 16 November 1999. The changes made in this document are intended to meet the requirements for XSLT 1.1 and XSLT 2.0 described in [XSLT 1.1] and [XSLT 2.0 Requirements] and to incorporate fixes for errors that have been detected in XSLT 1.0. A summary of the changes since XSLT 1.0 is included in [K Changes from XSLT 1.0].

XSLT 2.0 is designed to be used together with XPath 2.0, which has been developed by the W3C XSL Working Group in collaboration with the XML Query Working Group. The current specification of XPath 2.0 can be found in [XPath 2.0].

NOTE: This specification supersedes XSLT 1.1, which was never developed beyond the Working Draft stage.

Comments on this specification may be sent to public-qt-comments@w3.org; archives of the comments are available. Archives of comments on earlier versions of the specification can be found at http://lists.w3.org/Archives/Public/xsl-editors/. Public discussion of XSL, including XSL Transformations, takes place on the XSL-List mailing list.

The English version of this specification is the only normative version. However, for translations of this document, see http://www.w3.org/Style/XSL/translations.html.

A list of current W3C Recommendations and other technical documents can be found at http://www.w3.org/TR/.

This specification has been produced as part of the W3C Style activity.

Patent disclosures relevant to this specification may be found on the XSL Working Group's patent disclosure page at http://www.w3.org/Style/XSL/Disclosures.html.

Table of contents

1 Introduction
    1.1 What is XSLT?
    1.2 What's new in XSLT 2.0?
2 Concepts
    2.1 Terminology
    2.2 Notation
    2.3 Initiating a Transformation
    2.4 Executing a Transformation
        2.4.1 Push Processing
        2.4.2 Pull Processing
        2.4.3 Content Constructors
    2.5 Maintaining Position: the Focus
    2.6 Parsing and Serialization
    2.7 Extensibility
    2.8 Stylesheets and Schemas
    2.9 Error Handling
3 Stylesheet Structure
    3.1 XSLT Namespace
    3.2 XSLT Media Type
    3.3 Standard Attributes
    3.4 Stylesheet Element
        3.4.1 User-defined Data Elements
    3.5 Simplified Stylesheet Modules
    3.6 Backwards-Compatible Processing
    3.7 Forwards-Compatible Processing
    3.8 Combining Stylesheet Modules
        3.8.1 Stylesheet Inclusion
        3.8.2 Stylesheet Import
    3.9 Embedded Stylesheet Modules
    3.10 Importing Schema Components
4 Data Model
    4.1 Parentless Nodes
    4.2 Document Node Children
    4.3 Unparsed Entities
    4.4 Whitespace Stripping
    4.5 Namespace Fixup
    4.6 Disable Output Escaping
    4.7 External Objects
5 Syntactic Constructs
    5.1 Qualified Names
    5.2 Expressions
    5.3 Patterns
    5.4 Unprefixed Names in Expressions and Patterns
    5.5 Attribute Value Templates
    5.6 Content Constructors
6 Template Rules
    6.1 Defining Template Rules
    6.2 Applying Template Rules
    6.3 Conflict Resolution for Template Rules
    6.4 Modes
    6.5 Built-in Template Rules
    6.6 Overriding Template Rules
7 Repetition
8 Conditional Processing
    8.1 Conditional Processing with xsl:if
    8.2 Conditional Processing with xsl:choose
9 Variables and Parameters
    9.1 Values of Variables and Parameters
    9.2 Temporary Trees
    9.3 Global Variables and Parameters
    9.4 Circular Definitions
    9.5 Local Variables and Parameters
    9.6 Range Variables
10 Callable Components
    10.1 Named Templates
        10.1.1 Passing Parameters to Templates
    10.2 Named Attribute Sets
    10.3 Stylesheet Functions
        10.3.1 Defining a Stylesheet Function
        10.3.2 Returning the Result
11 Creating New Nodes
    11.1 Literal Result Elements
        11.1.1 Setting the Type Annotation for Literal Result Elements
        11.1.2 Attribute Nodes for Literal Result Elements
        11.1.3 Namespace Nodes for Literal Result Elements
        11.1.4 Namespace Aliasing
    11.2 Creating Element Nodes using xsl:element
        11.2.1 Setting the Type Annotation for a Constructed Element Node
    11.3 Creating Attribute Nodes using xsl:attribute
        11.3.1 Setting the Type Annotation for a Constructed Attribute Node
    11.4 Creating Text Nodes
        11.4.1 Literal Text Nodes
        11.4.2 Creating Text Nodes using xsl:text
    11.5 Creating Processing Instructions
    11.6 Creating Namespace Nodes
    11.7 Creating Comments
    11.8 Copying Nodes from the Source Tree to the Result Tree
        11.8.1 Shallow Copy
        11.8.2 Deep Copy
    11.9 Generating Text with xsl:value-of
12 Numbering
    12.1 Formatting a Supplied Number
    12.2 Numbering based on Position in a Document
    12.3 Number to String Conversion Attributes
13 Sorting
    13.1 Collating Sequences
    13.2 The xsl:sort Element
    13.3 Using Unnamed Sort Specifications
    13.4 Using Named Sort Specifications
        13.4.1 Declaring a Named Sort Specification
        13.4.2 Sorting Using a Named Sort Specification
14 Grouping
    14.1 The Current Group
    14.2 The xsl:for-each-group Element
    14.3 Examples of Grouping
15 Regular Expressions
    15.1 Examples of Regular Expression Matching
16 Additional Functions
    16.1 Multiple Source Documents
    16.2 Reading Text Files
    16.3 Keys
        16.3.1 The xsl:key Declaration
        16.3.2 The key Function
    16.4 Number Formatting
        16.4.1 Defining a Decimal Format
        16.4.2 Processing the Picture String
        16.4.3 Analysing the Picture String
        16.4.4 Formatting the Number
    16.5 Miscellaneous Additional Functions
        16.5.1 current()
        16.5.2 unparsed-entity-uri()
        16.5.3 unparsed-entity-public-id()
        16.5.4 generate-id()
        16.5.5 system-property()
17 Messages
18 Extensibility and Fallback
    18.1 Extension Functions
        18.1.1 Testing Availability of Functions
        18.1.2 Calling Extension Functions
        18.1.3 External Objects
    18.2 Extension Instructions
        18.2.1 Designating an Extension Namespace
        18.2.2 Testing Availability of Instructions
        18.2.3 Fallback
19 Result Trees
    19.1 The Principal Result Tree
    19.2 Secondary Result Trees
20 Serialization
    20.1 XML Output Method
    20.2 XHTML Output Method
    20.3 HTML Output Method
    20.4 Text Output Method
    20.5 Disabling Output Escaping
21 Conformance

Appendices

A References
    A.1 Normative References
    A.2 Other References
B Glossary (Non-Normative)
C Element Syntax Summary (Non-Normative)
D Summary of Error Conditions (Non-Normative)
E Checklist of Implementation-Defined Features (Non-Normative)
F Schema for XSLT Stylesheets (Non-Normative)
G Representation of Lexical XML Constructs (Non-Normative)
H Acknowledgements (Non-Normative)
I Checklist of Requirements (Non-Normative)
J Summary of Issues (Non-Normative)
    J.1 Open Issues
    J.2 Decided Issues
    J.3 Closed Issues
K Changes from XSLT 1.0 (Non-Normative)
    K.1 Incompatible Changes
        K.1.1 XSLT 2.0 Backwards Compatibility
        K.1.2 XPath 2.0 Backwards Compatibility
        K.1.3 Compatibility in the Presence of a Schema
    K.2 Changes since XSLT 1.0
        K.2.1 Pervasive changes
        K.2.2 Major Features
        K.2.3 Minor Changes
        K.2.4 Changes since the April 2002 draft

1 Introduction

1.1 What is XSLT?

This specification defines the syntax and semantics of the XSLT 2.0 language. A transformation in the XSLT language is expressed in the form of a stylesheet, whose syntax is well-formed XML [XML] conforming to the Namespaces in XML Recommendation [XML Names]. A stylesheet generally includes elements that are defined by XSLT as well as elements that are not defined by XSLT. XSLT-defined elements are distinguished by use of the XML namespace http://www.w3.org/1999/XSL/Transform (see [3.1 XSLT Namespace]), which is referred to in this specification as the XSLT namespace. Thus this specification is a definition of the syntax and semantics of the XSLT namespace.

The term stylesheet reflects the fact that one of the important roles of XSLT is to add styling information to an XML source document, by transforming it into a document consisting of XSL formatting objects (see [XSL Formatting Objects]), or into another presentation-oriented format such as HTML, XHTML, or SVG. However, XSLT is used for a wide range of XML-to-XML transformation tasks, not exclusively for formatting and presentation applications.

A transformation expressed in XSLT describes rules for transforming one or more source trees into one or more result trees. The transformation is achieved by a set of template rules. A template rule associates a pattern, which matches nodes in the source document, with a content constructor, which can be evaluated to produce part of a result tree. The structure of the result trees can be completely different from the structure of the source trees. In constructing a result tree, nodes from the source trees can be filtered and reordered, and arbitrary structure can be added. This mechanism allows a stylesheet to be applicable to a wide class of documents that have similar source tree structures.

Ed. Note: The term "source tree" is used rather casually throughout this specification, and is not properly defined. Usually it is referring to the tree that contains the context node.

A stylesheet may consist of several stylesheet modules, contained in different XML documents. For a given transformation, one of these functions as the principal stylesheet module. The complete stylesheet is assembled by finding the stylesheet modules referenced directly or indirectly from the principal stylesheet module using xsl:include and xsl:import elements: see [3.8.1 Stylesheet Inclusion] and [3.8.2 Stylesheet Import].

1.2 What's new in XSLT 2.0?

XSLT 1.0 was published in November 1999, and version 2.0 represents a significant increase in the capability of the language. A detailed list of changes is included in [K Changes from XSLT 1.0]. XSLT 2.0 has been developed in parallel with XPath 2.0 (see [XPath 2.0]), so the changes to XPath must be considered alongside the changes to XSLT.

2 Concepts

2.1 Terminology

For a full glossary of terms, see [B Glossary].

The software responsible for transforming a source document into a result document is referred to as the processor. This is sometimes expanded to XSLT processor to avoid any confusion with other processors, for example an XML processor. A specific product that performs the functions of an XSLT processor is referred to as an implementation.

In this specification the words must, must not, should, should not, may, required, and recommended are to be interpreted as described in [RFC2119].

In this specification, the term implementation-defined refers to a feature where the implementation is allowed some flexibility, and where the choices made by the implementation should be described in the vendor's documentation.

The term implementation-dependent refers to a feature where the behavior may vary from one implementation to another, and where the vendor is not expected to provide a full specification of the behavior. (This might apply, for example, to limits on the size of source documents that can be transformed.)

In all cases where this specification leaves the behavior implementation-defined or implementation-dependent, the implementation has the option of providing mechanisms that allow the user to influence the behavior.

Paragraphs labelled as NOTEs or described as examples are non-normative.

2.2 Notation

In this document the specification of each XSLT-defined element type is preceded by a summary of its syntax in the form of a model for elements of that element type. A full list of all these specifications can be found in [C Element Syntax Summary]. The meaning of syntax summary notation is as follows:

 

The following example illustrates the notation.

<!-- Category: instruction -->
<xsl:example-element
  select = sequence-expression
  debug = { "yes" | "no" }>
  <!-- Content: ((xsl:variable | xsl:param)*, xsl:result) -->
</xsl:example-element>

This example defines a (non-existent) element xsl:example-element. The element is classified as an instruction. It takes a mandatory select attribute, whose value is an XPath expression, and an optional debug attribute, whose value must be either yes or no; the curly braces indicate that the value can be defined as an attribute value template, allowing a value such as debug="{$debug}", where the variable debug is evaluated to yield "yes" or "no" at run-time.

The content of an xsl:example-element instruction is defined to be a sequence of zero or more xsl:variable and xsl:param elements, followed by an xsl:result element.

 

[ERR001] A static error is signaled if an XSLT-defined element is used in a context where it is not permitted, if a required attribute is omitted, or if the content of the element does not correspond to the content that is allowed for the element. [ERR002] It is a static error if an attribute (other than an attribute written using curly braces in a position where an attribute value template is permitted) contains a value that is not one of the permitted values for that attribute. [ERR003] It is a dynamic error if the effective value of an attribute written using curly braces, in a position where an attribute value template is permitted, is a value that is not one of the permitted values for that attribute.

Special rules apply if the construct appears in part of the stylesheet that is processed with forwards-compatible behavior: see [3.7 Forwards-Compatible Processing].

Ed. Note: This working draft includes an XML Schema for XSLT stylesheet modules: see [F Schema for XSLT Stylesheets]. This makes the notation explained in this section somewhat redundant. Future drafts may replace the notation described here with extracts of the relevant definitions from the schema.

2.3 Initiating a Transformation

This document does not specify any application programming interfaces or other interfaces for initiating a transformation. This section, however, describes the information that must be supplied when a transformation is initiated.

Implementations may allow an transformation to run as two or more phases, for example parsing, compilation and execution. Such a distinction is outside the scope of this specification, which treats transformation as a single process controlled using a set of stylesheet modules, supplied in the form of XML documents.

The following information is supplied to execute a transformation:

  • An identification of the stylesheet module that is to act as the principal stylesheet module for the transformation. The complete stylesheet is assembled by expanding the xsl:import and xsl:include declarations in the principal stylesheet module, as described in [3.8.1 Stylesheet Inclusion] and [3.8.2 Stylesheet Import]

  • A set (possibly empty) of values for stylesheet parameters (see [9.3 Global Variables and Parameters]). These values are available for use within expressions in the stylesheet.

  • A set of nodes (possibly empty) that forms the initial input sequence. These nodes (which will often be document nodes, but may in principle be any kind of node, from the same or different documents) are available at any time during the transformation as the result of the input function described in [Functions and Operators].

  • A node that acts as the initial context node for the transformation. This node is accessible within the stylesheet as the initial value of the XPath expressions . and self::node(), as described in [2.5 Maintaining Position: the Focus]. If no initial context node is supplied explicitly, then the stylesheet is invoked with an initial context node in the form of a document node with no children.

    NOTE: It is not necessary for the initial context node to be a member of the initial input sequence, but in a simple transformation of a single document, it is convenient to set both the initial input sequence and the initial context node to the document node of the source document to be transformed.
  • Optionally, the name of a named template which is to be executed as the entry point to the transformation. This template must exist within the stylesheet. If no named template is supplied, then the transformation starts with the template rule that best matches the initial context node, according to the rules defined in [6.3 Conflict Resolution for Template Rules].

[ERR004] It is a dynamic error if the invocation of the stylesheet specifies an template name that does not match the expanded-QName of a named template defined in the stylesheet.

Issue 113 (global-variables-in-call-mode): It's not obvious what the context should be for evaluating global variables when a transformation is initiated by calling a named template. The solution adopted for the moment is to say that there must be an initial context node; this will be supplied as an empty document node if no other initial context node is forthcoming.

Parameters passed to the transformation by the client application are matched against global stylesheet parameters (see [9.3 Global Variables and Parameters]), not against the local parameters declared within the first template to be executed. All local parameters within the first template to be executed will take their default values.

A stylesheet can process further source documents in addition to those supplied when the transformation is invoked. These additional documents can be loaded using the functions document or collection described in [Functions and Operators], or they can be supplied as stylesheet parameters (see [9.3 Global Variables and Parameters]), or as the result of an extension function (see [18.1 Extension Functions]

NOTE: Sometimes it is useful to write an XSLT stylesheet that does not require input from any source document. However, the semantics of the language require that an initial context node is always present. Implementors may provide a mechanism that supplies a default initial context node, perhaps just a document node with no children, as the initial context node to be used in the absence of any other source document.

2.4 Executing a Transformation

The XSLT instructions used to control a transformation can be divided into two groups: push processing instructions, which use template rules to define the processing associated with different nodes in the source tree, and pull processing instructions, which allow the selection and processing of source tree nodes to be described together. These two groups of instructions are described in the following sections.

2.4.1 Push Processing

A stylesheet generally contains a set of template rules. A template rule has two parts: a pattern which is matched against nodes in the source tree and a content constructor which is evaluated to create new nodes that are written to a result tree.

A template rule that is selected to process a particular node in the source tree will generally write one or more nodes to the result tree, and will frequently use the xsl:apply-templates instruction to select other nodes in the source tree (often, but not invariably, the original node's children) for processing. Each of these selected nodes is processed by searching the stylesheet for a matching template rule. Sometimes all the selected nodes will be processed by the same template rule, but in general different nodes will be processed by different template rules. Either way, the chosen template rules are evaluated and the process repeats itself, typically resulting in a recursive traversal of the source tree.

Because the template rule that is written to process one node in the tree has no knowledge of which rules will be chosen to process the nodes that it selects, push processing allows a stylesheet to be applicable to a wide class of documents that have similar source tree structures.

When stylesheet execution starts, a document node is implicitly created for the result tree. Frequently, the first action of the stylesheet is to find the template rule that matches the document node of a source tree. The content constructor of this template rule then creates the children of the new document node. By the time evaluation of this content constructor is complete, these children will typically each act as the parent of further result nodes, so a complete tree is constructed.

In the process of finding a template rule to process a particular node, more than one template rule may have a pattern that matches the node. However, only one template rule will be applied. The method for deciding which template rule to apply is described in [6.3 Conflict Resolution for Template Rules]. If there is no template rule that matches the node, a built-in template rule is used (see [6.5 Built-in Template Rules]).

 

For example, suppose the source tree contains the structure below:

<doc>
  <item>1</item>
  <item>2</item>
  <item>3</item>
</doc>

If the context node is the doc element, and the xsl:apply-templates instruction is evaluated with the attribute select="*", then the input sequence will contain the three item elements. Suppose the template rule that matches an item element has the following form:

<xsl:template match="item">
<li><xsl:value-of select="."/></li>
</xsl:template>

This template rule will be evaluated once for each item element in the input sequence, and in each case it will return a new li element, which in turn will contain a single text node. The result of the xsl:apply-templates instruction will therefore be a sequence of three li elements. The following XML fragment will be generated:

  <li>1</li>
  <li>2</li>
  <li>3</li>

This fragment will be added to the result tree at the point where the doc element was being processed.

 

Very often, the template rule that matches one element in the source tree will contain an xsl:apply-templates instruction selecting the children of that element as the input sequence for processing. In this situation, each child node will be processed to create a subtree, and these subtrees will be added as children of the element produced by the parent template rule. The structure of the result tree will then correspond directly to the structure of the source tree.

 

For example, suppose that the doc element in the previous example is processed using the template rule below:

<xsl:template match="doc">
<ol><xsl:apply-templates select="*"/></ol>
</xsl:template>

Then the result of processing the doc element will be the following result tree:

<ol>
  <li>1</li>
  <li>2</li>
  <li>3</li>
</ol>
 

2.4.2 Pull Processing

A single template rule by itself has considerable power. It can create structures of arbitrary complexity; it can pull string values out of arbitrary locations in the source tree; and it can generate structures that are repeated according to the occurrence of nodes in the source tree.

A stylesheet that uses the pull processing technique is written to navigate explicitly around the source tree, accessing data from wherever it appears in order to construct nodes in the result tree. A number of XSLT instructions are provided to assist pull processing: notably xsl:for-each, xsl:choose, xsl:if, and xsl:call-template.

For simple transformations where the structure of the result tree is independent of the structure of the source tree, a stylesheet can often consist of only a single literal result element, containing a content constructor which functions as a template for building the complete result tree. Transformations on XML documents that represent data with a regular and predictable structure (for example, data extracted from a relational database) are often of this kind. XSLT allows a simplified syntax for such stylesheets (see [3.5 Simplified Stylesheet Modules]).

2.4.3 Content Constructors

Whether the stylesheet is written to use push processing or pull processing, nodes in the result tree are always generated using content constructors.

A content constructor is evaluated for a particular node in the source tree, to create part of the result tree. A content constructor can contain elements (called literal result elements) and text nodes that specify part of the result structure directly. A content constructor can also contain elements from the XSLT namespace that are instructions for creating parts of the result tree.

When a content constructor is evaluated, each instruction is evaluated to produce a sequence of zero or more nodes; the result of the content constructor as a whole is a sequence of nodes formed by concatenating the results of each of the instructions and literal results nodes that it contains, in the order that they appear in the content constructor. The resulting nodes are typically attached as children to an element or document node constructed by the instruction that contains the content constructor, thus forming a tree. During this process, adjacent text nodes will be merged into a single text node. When a content constructor is evaluated to create new nodes, the tree to which these nodes are added is referred to as the current result tree. When the transformation is initiated, a result tree is created, and becomes the current result tree. This tree is referred to as the principal result tree. Various XSLT instructions (including xsl:variable and xsl:result-document) establish a new current result tree for the nodes created by the content constructor that they contain.

The elements occurring within a content constructor are classified as being either literal result elements or instructions. If the element is in the XSLT namespace, or in a namespace designated as an extension namespace, then it is an instruction. Otherwise, it is a literal result element.

Instructions that select nodes from the source document, or that derive information from these nodes for inclusion in the result document, always access the source tree by means of an Expression in the XPath language, described in [XPath 2.0]. A stylesheet written to use XSLT 2.0 will contain expressions whose syntax and semantics are defined by XPath 2.0 (but see also [3.6 Backwards-Compatible Processing] and [3.7 Forwards-Compatible Processing]).

2.5 Maintaining Position: the Focus

When a content constructor is evaluated, the processor keeps track of which nodes are being processed by means of a set of implicit variables referred to collectively as the focus. More specifically, the focus consists of the following three values:

If the context item is a node (as distinct from an atomic value such as an integer), then it is also referred to as the context node. The context node is not an independent variable, it changes whenever the context item changes. When the context item is an atomic value, there is no context node: its value is an empty sequence. The context node is returned by the XPath expression self::node(), and it is used as the starting node for all relative path expressions.

The current function can be used within any XPath expression to select the item that was supplied as the context item to the XPath expression by the XSLT processor; unlike . this is unaffected by changes to the context item that occur within the XPath expression. The current function is described in [16.5.1 current()].

On completion of an instruction which changes the focus (such as xsl:apply-templates or xsl:for-each), the focus reverts to its previous value.

When a stylesheet function is called, the focus within the body of the function is initially undefined. [ERR005] When the focus is undefined, evaluation of any expression that references the context item, context position, or context size results in a dynamic error. The processor must signal the error.

The description above gives an outline of the way the focus works. Detailed rules for the effect of each instruction are given separately with the description of that instruction. In the absence of specific rules, an instruction uses the same focus as its parent instruction.

Sometimes the focus is based on a single node rather than a sequence. A singleton focus based on a node N has the context item (and therefore the context node) set to N, the context document set to the document containing N, and the context position and context size both set to 1 (one).

In addition to the values that make up the focus, an XSLT processor maintains a number of other internal variables that reflect aspects of the evaluation context. These variables are fully described in the sections of the specification that maintain and use these variables. They are:

As with the focus, these variables are not accessible within a stylesheet function unless they have been initialized within that function.

2.6 Parsing and Serialization

An XSLT stylesheet describes a process that constructs a set of result trees from a set of source trees.

The stylesheet does not describe how a source tree is constructed. Frequently an implementation will operate in conjunction with an XML parser (or more strictly, in the terminology of [XML], an XML processor), to build the source tree from an input XML document. An implementation may also provide an application programming interface allowing the tree to be constructed directly, or allowing it to be supplied in the form of a DOM Document object (see [DOM2]). This is outside the scope of this specification. Users should be aware, however, that since the input to the transformation is a tree conforming to the XPath data model as described in [Data Model], constructs that might exist in the original XML document, or in the DOM, but which are not within the scope of the data model, cannot be processed by the stylesheet and cannot be guaranteed to remain unchanged in the transformation output. Such constructs include CDATA section boundaries, the use of entity references, and the DOCTYPE declaration and internal DTD subset.

A frequent requirement is to output the result tree as an XML document (or in other formats such as HTML). This process is referred to as serialization. Like parsing, serialization is not part of the transformation process, and it is not required that an XSLT processor should be able to perform serialization. However, for pragmatic reasons, this specification describes a declaration (the xsl:output element, see [20 Serialization]) which allows a stylesheet to specify the desired properties of a serialized output file. Implementations that do not serialize the result tree are allowed to ignore this declaration.

Because it is a common requirement to perform a transformation on a document while retaining lexical characteristics such as CDATA section boundaries, entity references, and the like, a non-normative appendix to this specification (see [G Representation of Lexical XML Constructs]) describes a way in which these constructs can be represented within the data model by means of elements in a special namespace. If such a representation is chosen, the tree is transformed in the same way as any other tree. The process of constructing such a tree is something that happens before XSLT transformation starts, and the process of interpreting such a tree and reconstituting the lexical representation is part of the serialization process. Neither of these processes is properly within the scope of XSLT transformation, and therefore, this specification places no requirement on an XSLT processor to support this representation of lexical properties.

2.7 Extensibility

XSLT provides two hooks for extending the language, one hook for extending the set of instruction elements used in content constructors and one hook for extending the set of functions used in XPath expressions. These hooks are both based on XML namespaces: see [18 Extensibility and Fallback] for further details. Extension instructions and extension functions defined according to these rules may be provided by the implementor of the XSLT processor, and the implementor may also provide facilities to allow users to create further extension instructions and extension functions. This specification defines how extension instructions and extension functions are invoked, but the facilities for creating new extension instructions and extension functions are implementation-defined.

2.8 Stylesheets and Schemas

An XSLT stylesheet can make use of information from XML Schemas, as defined in [XML Schema]. An XSLT transformation can take place in the absence of a schema (and, indeed, in the absence of a DTD), but where the source document has undergone schema validity assessment, the XSLT processor has access to the type information associated with individual nodes, not merely to the raw text.

Ed. Note: The XSL Working Group intends to define a conformance level for XSLT processors in which schema processing, and handling of user-defined types, is not a requirement. The precise rules for such a processor (for example, the question of whether it should ignore type-related constructs in the stylesheet or should signal them as an error) have yet to be defined.

There are places within a stylesheet, and within XPath expressions and patterns with a stylesheet, where the names of schema-defined elements, attributes, and named types may appear. For example, it is possible to declare the types expected for the parameters of a function. The element, attribute, and type names that appear in such contexts must either be built-in schema types (for example xs:string or xs:integer), or they must be user-defined types that are made accessible to the XSLT processor by means of an xsl:import-schema declaration: see [3.10 Importing Schema Components].

It is only necessary to import a schema explicitly if its type definitions are mentioned statically in the stylesheet; it is not necessary to import a schema merely because the stylesheet is used to process a source document that has been assessed against that schema. It is possible to make use of the information resulting from schema assessment (for example, the fact that a particular attribute holds a date) even if no schema has been imported by the stylesheet.

Further, importing a schema does not of itself say anything about the type of the source document that the stylesheet is expected to process. The imported type definitions can be used for temporary nodes or for nodes on the result tree just as much as for nodes on the principal source document.

Issue 106 (assert-source-document-type): How should a stylesheet assert the schema type of the source documents that it is designed to process?

A stylesheet can also control the types of nodes that it constructs in a final result tree, or in temporary trees. In the case of temporary trees, this enables the tree to be processed in the same way as a source document that has undergone schema assessment. In the case of a final result tree, it enables the result tree to be passed to further processes (including further XSLT transformations) with its type information intact.

Where a stylesheet author chooses to make assertions about the types of nodes or of variables and parameters, it is possible for an XSLT processor to perform static analysis of the stylesheet (that is, analysis in the absence of any source document). Such analysis may reveal errors that would otherwise not be discovered until the transformation is actually executed. An XSLT processor is not required to perform such static type-checking, and if it does perform such checking, any errors reported are advisory only: the user must be given the option to use the stylesheet even if static analysis has revealed type errors.

Type annotations can be created in nodes of the result tree, or of a temporary tree, in a number of ways. Firstly, it is possible to request validation of the complete tree. Validation may be either strict or lax, and is performed according to the rules defined in [XML Schema]: if validation fails, the transformation fails, but if it succeeds, the element and attribute nodes of the tree will be annotated with the names of the schema-defined types to which these nodes conform, and the typed value of the nodes will be accessible using the data function. Secondly, it is possible to add annotations to individual element and attribute nodes as they are constructed. This is done using the type-annotation attribute of the xsl:element and xsl:attribute instructions, or the xsl:type-annotation attribute of a literal result element. This is permitted only for elements that have simple content. In addition, type annotations may be copied from a source tree to a result tree when using the xsl:copy-of instruction. Type annotations attached to individual nodes in this way will only be retained in the completed tree if the tree construction element (for example, xsl:variable or xsl:result-document) specifies type-information="preserve".

2.9 Error Handling

An error that is detected by examining a stylesheet before execution starts (that is, before the source document and values of stylesheet parameters are available) is referred to as a static error. Errors classified in this specification as static errors must be signaled by all implementations: that is, the processor must indicate that the error is present, and it must not use the stylesheet to produce a result tree. A static error must be signaled even if it occurs in a part of the stylesheet that is never evaluated.

There is an exception to this rule when the stylesheet specifies forwards-compatible behavior (see [3.7 Forwards-Compatible Processing]).

Generally, errors in the structure of the stylesheet, or in the syntax of XPath expressions contained in the stylesheet, are classified as static errors. Where this specification states that an element in the stylesheet must or must not appear in a certain position, or that it must or must not have a particular attribute, or that an attribute must or must not have a value satisfying specified conditions, then any contravention of this rule is a static error unless otherwise specified.

An error that is not detected until a source document is being transformed is referred to as a dynamic error. In many cases, this specification allows dynamic errors to be signaled (by reporting the error condition and terminating execution), but also defines a recovery action that can be taken. It is implementation-defined whether the error is signaled or the recovery action is taken. If the implementation does choose to take recovery action, it must take the recovery action defined in this specification.

When the implementation makes the choice between signaling a dynamic error or recovering, it is not restricted in how it makes the choice; for example, it may provide options that can be set by the user. When an implementation chooses to recover from a dynamic error, it is also allowed to take other action, such as logging a warning message.

Because different implementations may optimize execution of the stylesheet in different ways, the detection of dynamic errors is to some degree implementation-dependent. In cases where an implementation is able to produce the result tree without evaluating a particular construct, the implementation is never required to evaluate that construct solely in order to determine whether doing so causes a dynamic error. For example, if a variable is declared but never referenced, an implementation can choose whether or not to evaluate the variable declaration, which means that if evaluating the variable declaration causes a dynamic error, some implementations will signal this error and others will not.

There are some cases where this specification requires that a construct must not be evaluated: for example, the content of an xsl:if instruction must not be evaluated if the test condition is false. This means that an implementation must not report any dynamic errors that would arise if the construct were evaluated.

An implementation may signal a dynamic error before any source document is available, but only if it can determine that the error would be signaled for every possible source document and every possible set of parameter values. For example, some circularity errors fall into this category: see [9.4 Circular Definitions].

Certain errors are classified as type errors. A type error occurs when the value supplied as input to an operation is of the wrong type for that operation, for example when an integer is supplied to an operation that expects a node. If a type error occurs in an instruction that is actually evaluated, then it must be reported as a dynamic error. An implementation may also, optionally, report a type error as a static error, even if it occurs in part of the stylesheet that is never evaluated, provided it can establish that execution of a particular construct would never succeed.

It is implementation-defined whether type errors are reported statically.

For example, the following construct contains a type error, because 42 is not allowed as an operand of the xsl:apply-templates instruction. An implementation may optionally report this as a static error, even though the offending instruction will never be evaluated, and the type error would therefore never be reported as a dynamic error.

<xsl:if test="false()">
  <xsl:apply-templates select="42"/>
</xsl:if>

If more than one error arises, an implementation is not required to signal any errors other than the first one that it detects. It is implementation-dependent which of the several errors is signaled. This applies both to static errors and to dynamic errors. An implementation is allowed to signal more than one error, but if any errors have been signaled, it must not finish as if the transformation were successful.

When a transformation signals one or more dynamic errors, the final state of any persistent resources updated by the transformation is implementation-dependent. Implementations are not required to restore such resources to their initial state. In particular, where a transformation produces multiple result documents, it is possible that one or more result documents may be written successfully before the transformation terminates, but the application cannot rely on this behavior.

Everything said above about error handling applies equally to errors in evaluating XSLT instructions, and errors in evaluating XPath expressions. Static errors and dynamic errors may occur in both cases.

If a transformation has successfully produced a result tree, it is still possible that errors may occur in serializing the result tree. For example, it may be impossible to serialize the result tree using the encoding selected by the user. Such an error is referred to as a serialization error. As with other aspects of serialization, the handling of serialization errors is implementation-defined: see [20 Serialization].

3 Stylesheet Structure

A stylesheet consists of one or more stylesheet modules, each one forming all or part of a well-formed XML document. There are three kinds of stylesheet module:

A stylesheet module is either a standard stylesheet module or a simplified stylesheet module:

Both forms of stylesheet module (standard and simplified) can exist either as an entire XML document, or embedded as part of another XML document, typically a source document that is to be processed using the stylesheet. An embedded stylesheet module is a stylesheet module that is embedded within another XML document, typically the source document that is being transformed. (see [3.9 Embedded Stylesheet Modules]).

3.1 XSLT Namespace

The XSLT namespace has the URI http://www.w3.org/1999/XSL/Transform. It is used to identify elements, attributes, and other names that have a special meaning defined in this specification.

NOTE: The 1999 in the URI indicates the year in which the URI was allocated by the W3C. It does not indicate the version of XSLT being used, which is specified by attributes (see [3.4 Stylesheet Element] and [3.5 Simplified Stylesheet Modules]).

XSLT processors must use the XML namespaces mechanism [XML Names] to recognize elements and attributes from this namespace. Elements from the XSLT namespace are recognized only in the stylesheet and not in the source document. The complete list of XSLT-defined elements is specified in [C Element Syntax Summary]. Implementations must not extend the XSLT namespace with additional elements or attributes. Instead, any extension must be in a separate namespace. Any namespace that is used for additional instruction elements must be identified by means of the extension instruction mechanism specified in [18.2 Extension Instructions].

This specification uses a prefix of xsl: for referring to elements in the XSLT namespace. However, XSLT stylesheets are free to use any prefix, provided that there is a namespace declaration that binds the prefix to the URI of the XSLT namespace.

An element from the XSLT namespace may have any attribute not from the XSLT namespace, provided that the expanded-QName (see [XPath 2.0]) of the attribute has a non-null namespace URI. The presence of such attributes must not change the behavior of XSLT elements and functions defined in this document or in the XPath specification, though they may be used to modify the behavior of extension functions and extension instructions. Thus, an implementation is always free to ignore such attributes, and must ignore such attributes without giving an error if it does not recognize the namespace URI. Such attributes can provide, for example, unique identifiers, optimization hints, or documentation. The set of namespaces that are recognized for such attributes is implementation-defined.

Ed. Note: We refer to a node (or a name) that is not in any namespace as "having a null namespace URI". This is not the terminology of the Namespaces in XML Recommendation, nor of the Data Model.

 

For example, the following code might be used to indicate to a particular implementation that the xsl:message instruction is to ask the user for confirmation before continuing with the transformation:

<xsl:message
    abc:pause="yes"
    xmlns:abc="http://vendor.example.com/xslt/extensions">Phase 1 complete</xsl:message>

Implementations that do not recognize the namespace http://vendor.example.com/xslt/extensions will simply ignore the extra attribute, and evaluate the xsl:message instruction in the normal way.

 

[ERR006] It is a static error for an element from the XSLT namespace to have an attribute whose namespace is either null (i.e. an attribute with an unprefixed name) or the XSLT namespace, other than attributes defined for the element in this document.

NOTE: The conventions used for the names of XSLT elements, attributes and functions are that names are all lower-case, use hyphens to separate words, and use abbreviations only if they already appear in the syntax of a related language such as XML or HTML. Names of types defined in XML Schema however, are regarded as single words and are capitalized exactly as in XML Schema. This sometimes leads to composite function names such as current-dateTime.

The XSLT namespace, together with certain other namespaces recognized by an XSLT processor, are classified as reserved namespaces and may be used only as specified in this and related specifications. The reserved namespaces are those listed below.

Reserved namespaces may be used without restriction to refer to the names of elements and attributes in source documents and result documents. As far as the XSLT processor is concerned, reserved namespaces other than the XSLT namespace may be used without restriction in the names of literal result elements and user-defined data elements, and in the names of attributes of literal result elements or of XSLT instructions: but other processors may impose restrictions or attach special meaning to them. Reserved namespaces must not be used, however, in the names of stylesheet-defined objects such as variables and functions.

[ERR007] It is a static error to use a reserved namespace in the name of a named template, a mode, an attribute set, a key (see [16.3.2 The key Function]), a named sort specification, a decimal-format, a variable or parameter (see [9 Variables and Parameters]), a stylesheet function, or a named output definition.

Implementations may reserve additional namespaces for use by the implementation, provided they follow accepted practice to avoid naming collisions. The set of such namespaces is implementation-defined.

Future versions of this specification may reserve additional namespaces starting with http://www.w3.org/.

3.2 XSLT Media Type

The MIME media types text/xml and application/xml [RFC2376] should be used for XSLT stylesheets. It is possible that a media type will be registered specifically for XSLT stylesheets; if and when it is, that media type may also be used.

Issue 138 (xslt-media-type): We may need to define an XSLT media type after all: see the TAG decision at http://www.w3.org/2001/tag/2002/0129-mime

3.3 Standard Attributes

There are a number of standard attributes that may appear on any XSLT element: specifically version, exclude-prefixes, exclude-result-prefixes, extension-element-prefixes, and default-xpath-namespace.

These attributes may also appear on a literal result element, but in this case, to distinguish them from user-defined attributes, the names of the attributes are in the XSLT namespace. They are thus typically written as xsl:version, xsl:exclude-result-prefixes, exclude-prefixes, xsl:extension-element-prefixes, or xsl:default-xpath-namespace.

It is recommended that these attributes should also be permitted on extension instructions, but this is at the discretion of the implementor of each extension instruction. They may also be permitted on user-defined data elements, though they will only have any useful effect in the case of data elements that are designed to behave like XSLT declarations or instructions.

In the following descriptions, these attributes are referred to generically as [xsl:]version, and so on.

These attributes all affect the element they appear on, and any descendant elements of the element they appear on, together with attributes of those descendant elements. The two forms with and without the XSLT namespace have the same effect; the XSLT namespace is used for the attribute if and only if its parent element is not in the XSLT namespace.

In the case of [xsl:]version and [xsl:]default-xpath-namespace the value can be overridden by a different value for the same attribute appearing on a descendant element. The effective value of the attribute for a particular stylesheet element is determined by the innermost containing element on which the attribute appears.

In the case of [xsl:]exclude-result-prefixes, exclude-prefixes, and [xsl:]extension-element-prefixes the values are cumulative. For these attributes, the value may be given as a whitespace-separated list of namespace prefixes, and the effective value for an element is the combined set of namespace URIs designated by the prefixes that appear in this attribute for that element and any of its ancestor elements. Again, the two forms with and without the XSLT namespace are equivalent.

Because these attributes may appear on any XSLT element, they are not listed in the syntax summary of each individual element. Instead they are listed and described in the description of the xsl:stylesheet and xsl:transform elements only. This reflects the fact that these attributes are often used on the xsl:stylesheet element, in which case they apply to the entire stylesheet module.

Note that the effect of these attributes does not extend to stylesheet modules referenced by xsl:include or xsl:import declarations.

For the detailed effect of each attribute, see the following sections:

Issue 128 (version-on-xsl-output): Allowing version as a standard attribute on any XSL element, to indicate backwards or forwards compatibility resquirements, conflicts with its existing use on the xsl:output element. See also issue 122.

Ed. Note: The XSL Working Group has decided in principle to impose restrictions in the ability to mix different version attributes on different elements in a stylesheet. The form of these restrictions has yet to be decided.

3.4 Stylesheet Element

<xsl:stylesheet
  id = id
  extension-element-prefixes = tokens
  exclude-result-prefixes = tokens
  exclude-prefixes = tokens
  version = number
  default-xpath-namespace = uri>
  <!-- Content: (xsl:import*, top-level-elements) -->
</xsl:stylesheet>

<xsl:transform
  id = id
  extension-element-prefixes = tokens
  exclude-result-prefixes = tokens
  exclude-prefixes = tokens
  version = number
  default-xpath-namespace = uri>
  <!-- Content: (xsl:import*, top-level-elements) -->
</xsl:transform>

A stylesheet module is represented by an xsl:stylesheet element in an XML document. xsl:transform is allowed as a synonym for xsl:stylesheet; everything this specification says about the xsl:stylesheet element applies equally to xsl:transform.

[ERR008] An xsl:stylesheet element must have a version attribute, indicating the version of XSLT that the stylesheet requires. [ERR009] The value of the version attribute must be a number (specifically, it must be a DecimalLiteral as defined in [XPath 2.0].) For this version of XSLT, the value should normally be 2.0. When the value is less than 2.0, backwards-compatible processing behavior is enabled (see [3.6 Backwards-Compatible Processing]). When the value is greater than 2.0, forwards-compatible behavior is enabled (see [3.7 Forwards-Compatible Processing]).

[ERR010] An xsl:stylesheet element must have no text node children, other than text nodes consisting entirely of whitespace.

An element occurring as a child of an xsl:stylesheet element is called a top-level element.

Top-level elements fall into two categories: declarations, and user-defined data elements. Top-level elements whose names are in the XSLT namespace are declarations. Top-level elements in any other namespace are user-defined data elements (see [3.4.1 User-defined Data Elements])

The xsl:stylesheet element may contain the following types of declaration:

If there are xsl:import elements, these must come before any other elements. Apart from this, the child elements of the xsl:stylesheet element may appear in any order. The ordering of these elements does not affect the results of the transformation unless there are conflicting declarations (for example, two template rules with the same priority that match the same node). In general, it is an error for a stylesheet to contain such conflicting declarations, but in some cases the processor is allowed to recover from the error by choosing the declaration that appears last in the stylesheet.

3.4.1 User-defined Data Elements

In addition to declarations, the xsl:stylesheet element may contain any element not from the XSLT namespace, provided that the expanded-QName of the element has a non-null namespace URI. Such elements are referred to as user-defined data elements. [ERR011] It is a static error if the xsl:stylesheet element has a child element having a null namespace URI.

An implementation may attach meaning to user-defined data elements that appear in a namespace controlled by the vendor. The presence of a user-defined data element must not change the behavior of XSLT elements and functions defined in this document; for example, it is not permitted for a user-defined data element to specify that xsl:apply-templates should use different rules to resolve conflicts. Thus, an implementation is always free to ignore user-defined data elements, and must ignore such data elements without giving an error if it does not recognize the namespace URI. The set of namespaces that are recognized for such data elements is implementation-defined.

User-defined data elements can provide, for example,

[ERR012] A user-defined data element must not precede an xsl:import element within a stylesheet module.

3.5 Simplified Stylesheet Modules

A simplified syntax is allowed for a stylesheet module that defines only a single template rule for the document node. The stylesheet module may consist of just a literal result element (see [11.1 Literal Result Elements]) together with its contents. Such a stylesheet is equivalent to a standard stylesheet module whose xsl:stylesheet element contains a template rule containing the literal result element; the template rule has a match pattern of /.

For example:

<html xsl:version="2.0"
      xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
      xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Expense Report Summary</title>
  </head>
  <body>
    <p>Total Amount: <xsl:value-of select="expense-report/total"/></p>
  </body>
</html>

has the same meaning as

<xsl:stylesheet version="2.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
                xmlns="http://www.w3.org/1999/xhtml">
<xsl:template match="/">
<html>
  <head>
    <title>Expense Report Summary</title>
  </head>
  <body>
    <p>Total Amount: <xsl:value-of select="expense-report/total"/></p>
  </body>
</html>
</xsl:template>
</xsl:stylesheet>

More formally, a simplified stylesheet module is equivalent to the standard stylesheet module that would be generated by applying the following transformation to the simplified stylesheet module, invoking the transformation by calling the named template expand, with the containing literal result element as the context node:

<xsl:stylesheet version="2.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

<xsl:template name="expand">
  <xsl:element name="xsl:stylesheet">
    <xsl:attribute name="version">
      <xsl:value-of select="@xsl:version"/>
    </xsl:attribute>
    <xsl:element name="xsl:template">
      <xsl:attribute name="match">/</xsl:attribute>
      <xsl:copy-of select="."/>
    </xsl:element>
  </xsl:element>
</xsl:template>  

</xsl:stylesheet>

If the outermost element of the simplified stylesheet is not the document element, then in the above transformation the attribute match="/*" must be replaced by a pattern than matches this outermost element. Note that if the element has an ID attribute to enable it to be selected (for example in the fragment identifier of a URI reference), then this ID attribute will be copied to the result document in the same way as other attributes of a literal result element.

[ERR013] A literal result element that is used as the outermost element of a simplified stylesheet module must have an xsl:version attribute. This indicates the version of XSLT that the stylesheet requires. For this version of XSLT, the value should normally be 2.0; the value must be a DecimalLiteral as defined in [XPath 2.0].

Other literal result elements may also have an xsl:version attribute. When the xsl:version attribute is numerically less than 2.0, backwards-compatible processing behavior is enabled (see [3.6 Backwards-Compatible Processing]). When the xsl:version attribute is numerically greater than 2.0, forwards-compatible behavior is enabled (see [3.7 Forwards-Compatible Processing]).

The allowed content of a literal result element when used as a simplified stylesheet is the same as when it occurs within a content constructor. Thus, a literal result element used as the document element of a simplified stylesheet cannot contain declarations.

3.6 Backwards-Compatible Processing

An element enables backwards-compatible behavior for itself, its attributes, its descendants and their attributes if either it has an [xsl:]version attribute (see [3.3 Standard Attributes]) whose value is less than 2.0. An element that has an [xsl:]version attribute whose value is greater than or equal to 2.0 disables backwards-compatible behavior for itself, its attributes, its descendants and their attributes. The compatibility behavior established by an element overrides any compatibility behavior established by an ancestor element.

If an attribute containing an XPath expression is processed with backwards-compatible behavior, then:

Issue 122 (mixing-versions): Mixing version 1.0 and version 2.0 code in the same stylesheet has considerable complications. We probably need to be more restrictive than we are here.

It is implementation-defined whether a particular XSLT 2.0 implementation supports backwards-compatible behavior. [ERR015] If an implementation does not support backwards-compatible behavior, then it is a dynamic error if any element is evaluated that enables backwards-compatible behavior. The processor must signal the error.

3.7 Forwards-Compatible Processing

An element enables forwards-compatible behavior for itself, its attributes, its descendants and their attributes if it has an [xsl:]version attribute (see [3.3 Standard Attributes]) whose value is greater than 2.0. An element that has an [xsl:]version attribute whose value is less than or equal to 2.0 disables forwards-compatible behavior for itself, its attributes, its descendants and their attributes. The compatibility behavior established by an element overrides any compatibility behavior established by an ancestor element.

Within a section of a stylesheet where forwards-compatible behavior is enabled, errors that would normally be static errors are treated instead as dynamic errors. This means that no error is reported unless the construct containing the error is actually evaluated.

This means, for example, that when an element is processed with forwards-compatible behavior:

Thus, any XSLT 2.0 processor must be able to process the following stylesheet without error, although the stylesheet includes elements from the XSLT namespace that are not defined in this specification:

<xsl:stylesheet version="17.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:template match="/">
    <xsl:choose>
      <xsl:when test="system-property('xsl:version') >= 17.0">
        <xsl:exciting-new-17.0-feature/>
      </xsl:when>
      <xsl:otherwise>
        <html>
        <head>
          <title>XSLT 17.0 required</title>
        </head>
        <body>
          <p>Sorry, this stylesheet requires XSLT 17.0.</p>
        </body>
        </html>
      </xsl:otherwise>
    </xsl:choose>
  </xsl:template>
</xsl:stylesheet>
NOTE: If a stylesheet depends crucially on a declaration introduced by a version of XSLT after 2.0, then the stylesheet can use an xsl:message element with terminate="yes" (see [17 Messages]) to ensure that implementations that conform to an earlier version of XSLT will not silently ignore the declaration.

For example,

<xsl:stylesheet version="18.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:important-new-17.0-declaration/>

  <xsl:template match="/">
    <xsl:choose>
      <xsl:when test="system-property('xsl:version') < 17.0">
        <xsl:message terminate="yes">
          <xsl:text>Sorry, this stylesheet requires XSLT 17.0.</xsl:text>
        </xsl:message>
      </xsl:when>
      <xsl:otherwise>
        ...
      </xsl:otherwise>
    </xsl:choose>
  </xsl:template>
  ...
</xsl:stylesheet>

3.8 Combining Stylesheet Modules

XSLT provides two mechanisms to construct a stylesheet from multiple stylesheet modules:

3.8.1 Stylesheet Inclusion

<!-- Category: declaration -->
<xsl:include
  href = uri-reference />

A stylesheet module may include another stylesheet module using an xsl:include declaration. The xsl:include declaration has an href attribute whose value is a URI reference identifying the stylesheet module to be included. A relative URI is resolved relative to the base URI of the xsl:include declaration (see [Data Model]).

It is implementation-defined whether the URI reference may include a fragment identifier, and if so, what form of fragment identifier is supported. A future version of XSLT may define rules for the use of fragment identifiers in the URI reference, for example by reference to the XPointer specification (see [XPointer]). Note that if the implementation does not support the use of fragment identifiers in the URI reference, then it will not be possible to include an embedded stylesheet module.

The included stylesheet module may be either a standard stylesheet module or a a simplified stylesheet module. It may be a complete XML document, or (if referenced using a suitable fragment identifier) it may be an embedded stylesheet module.

[ERR016] The xsl:include element is allowed only as a top-level element.

A stylesheet level is a collection of stylesheet modules connected using xsl:include declarations: specifically, two stylesheet modules A and B are part of the same stylesheet level if one of them includes the other by means of an xsl:include declaration, or if there is a third stylesheet module C that is in the same stylesheet level as both A and B.

The declarations within a stylesheet level have a total ordering known as declaration order. The order of declarations within a stylesheet level is the same as the document order that would result if each stylesheet module were inserted textually in place of the xsl:include element that references it. In other respects, however, the effect of xsl:include is not equivalent to the effect that would be obtained by textual inclusion.

[ERR017] It is an static error if a stylesheet module directly or indirectly includes itself.

NOTE: It is not intrinsically an error for a stylesheet to include the same module more than once. However, doing so can cause errors because of duplicate definitions. Such multiple inclusions are less obvious when they are indirect. For example, if stylesheet B includes stylesheet A, stylesheet C includes stylesheet A, and stylesheet D includes both stylesheet B and stylesheet C, then A will be included indirectly by D twice. If all of B, C and D are used as independent stylesheets, then the error can be avoided by separating everything in B other than the inclusion of A into a separate stylesheet B' and changing B to contain just inclusions of B' and A, similarly for C, and then changing D to include A, B', C'.

3.8.2 Stylesheet Import

<!-- Category: declaration -->
<xsl:import
  href = uri-reference />

A stylesheet module may import another stylesheet module using an xsl:import declaration. Importing a stylesheet is the same as including it (see [3.8.1 Stylesheet Inclusion]) except that template rules and other declarations in the importing stylesheet take precedence over template rules and declarations in the imported stylesheet; this is described in more detail below. The xsl:import declaration has an href attribute whose value is a URI reference identifying the stylesheet to be imported. A relative URI is resolved relative to the base URI of the xsl:import element (see [Data Model]).

It is implementation-defined whether the URI reference may include a fragment identifier, and if so, what form of fragment identifier is supported. A future version of XSLT may define rules for the use of fragment identifiers in the URI reference, for example by reference to the XPointer specification (see [XPointer]). Note that if the implementation does not support the use of fragment identifiers in the URI reference, then it will not be possible to import an embedded stylesheet module.

The imported stylesheet module may be either a standard stylesheet module or a a simplified stylesheet module. It may be a complete XML document, or (if referenced using a suitable fragment identifier) it may be an embedded stylesheet module.

[ERR018] The xsl:import declaration is allowed only as a top-level element. [ERR019] The xsl:import element children must precede all other element children of an xsl:stylesheet element, including any xsl:include element children and any user-defined data elements.

For example,

<xsl:stylesheet version="2.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:import href="article.xsl"/>
  <xsl:import href="bigfont.xsl"/>
  <xsl:attribute-set name="note-style">
    <xsl:attribute name="font-style">italic</xsl:attribute>
  </xsl:attribute-set>
</xsl:stylesheet>

The stylesheet levels making up a stylesheet are treated as forming an import tree. In the import tree, each stylesheet level has one child for each xsl:import declaration that it contains. The ordering of the children is the declaration order of the xsl:import declarations within their stylesheet level. A declaration D in the stylesheet is defined to have lower import precedence than another declaration E if the stylesheet level containing D would be visited before the stylesheet level containing E in a post-order traversal of the import tree (that is, a traversal of the import tree in which a stylesheet level is visited after its children). Two declarations within the same stylesheet level have the same import precedence.

For example, suppose

  • stylesheet module A imports stylesheet modules B and C in that order;

  • stylesheet module B imports stylesheet module D;

  • stylesheet module C imports stylesheet module E.

Then the import tree has the following structure:

         A
         |
     +---+---+
     |       |
     B       C
     |       |
     D       E

The order of import precedence (lowest first) is D, B, E, C, A.

In general, a declaration with higher import precedence takes precedence over a declaration with lower import precedence. This is defined in detail for each kind of declaration.

[ERR020] It is a static error if a stylesheet module directly or indirectly imports itself.

NOTE: The case where a stylesheet with a particular URI is imported in multiple places is not treated specially. The resulting stylesheet will contain multiple declarations that are identical in content but that differ in their import precedence.

3.9 Embedded Stylesheet Modules

A standard stylesheet module is a complete XML document with the xsl:stylesheet element as its document element. However, a stylesheet module may also be embedded in another resource. Two forms of embedding are possible:

To facilitate the second form of embedding, the xsl:stylesheet element is allowed to have an ID attribute that specifies a unique identifier.

NOTE: In order for such an attribute to be used with the XPath id function, it must actually be declared in the DTD or schema as being of type ID. The same requirement typically applies if the identifier is to be used as a fragment identifier in a URI reference.

The following example shows how the xml-stylesheet processing instruction (see [XML Stylesheet]) can be used to allow a source document to contain its own stylesheet. The URI reference uses a relative URI with a fragment identifier to locate the xsl:stylesheet element:

<?xml-stylesheet type="text/xml" href="#style1"?>
<!DOCTYPE doc SYSTEM "doc.dtd">
<doc>
<head>
<xsl:stylesheet id="style1"
                version="2.0"
                xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
                xmlns:fo="http://www.w3.org/1999/XSL/Format">
<xsl:import href="doc.xsl"/>
<xsl:template match="id('foo')">
  <fo:block font-weight="bold"><xsl:apply-templates/></fo:block>
</xsl:template>
<xsl:template match="xsl:stylesheet">
  <!-- ignore -->
</xsl:template>
</xsl:stylesheet>
</head>
<body>
<para id="foo">
...
</para>
</body>
</doc>
NOTE: A stylesheet module that is embedded in the document to which it is to be applied or that may be included or imported into a stylesheet that is so embedded typically needs to contain a template rule that specifies that xsl:stylesheet elements are to be ignored.
NOTE: The above example uses the pseudo-attribute type="text/xml" in the xml-stylesheet processing instruction to denote an XSLT stylesheet. This usage was defined provisionally in XSLT 1.0, and is subject to change. In the absence of a registered media type for XSLT stylesheets, some vendors' products have adopted different conventions, notably type="text/xsl".
NOTE: Support for the xml-stylesheet processing instruction is not a requirement for conformance with this Recommendation.

3.10 Importing Schema Components

<!-- Category: declaration -->
<xsl:import-schema
  namespace = uri-reference
  schema-location = uri-reference />

The xsl:import-schema declaration is used to identify schema components (that is, definitions of types) which need to be available statically, that is, before any source document is available. Every type name used statically within the stylesheet other than the built-in type names defined in XML Schema must be defined in an imported schema. The declaration imports the element and attribute declarations and type definitions from the schema, and maps them to types in the XPath data model according to rules defined in [Formal Semantics].

Ed. Note: The relationship of this document to the Formal Semantics needs to be clarified.

The namespace and schema-location attributes are both optional. At least one of them must be present, and it is permissible to supply both.

The namespace attribute indicates that a schema for the given namespace is required by the stylesheet. This information may be enough on its own to enable an implementation to locate the required schema components.

The schema-location attribute gives the URI of a location where a schema document or other resource containing the required definitions may be found. An XSLT processor must have the capability to process an XML Schema that exists at this location in the form of a source XML document; implementations may also be able to access equivalent information held in other forms, for example a compiled XML Schema, or type information expressed using some other schema language.

[ERR021] It is a static error if the processor is not able to locate a schema using the namespace and/or schema-location attributes , or if the document that it locates is neither a valid XML Schema nor any other resource that the implementation can process.

[ERR022] It is a static error if two xsl:import-schema declarations yield multiple definitions for the same named type, even if the definitions are consistent with each other.

The use of a namespace in an xsl:import-schema declaration does not by itself make the namespace available for use in the stylesheet. If names from the namespace are used within the stylesheet, a prefix must be associated with the namespace by means of a namespace declaration, in the normal way.

The precise way in which an implementation uses the namespace and/or schema-location attributes to locate schema definitions is implementation-defined.

Issue 125 (schema-conformance): We need to describe a conformance level that does not require schema support.

4 Data Model

The data model used by XSLT is the XPath 2.0 and XQuery 1.0 data model, as defined in [Data Model], with the additions described in this section. XSLT operates on source, result and stylesheet documents using the same data model.

Some of the sections below describe restrictions to the data model: that is, features of the data model that are never used by XSLT. Some sections describe additions to the data model: that is, information that is required to support XSLT processing, but that is not described in the data model. Other sections describe refinements to the data model, that is, additional rules about the way in which trees in the data model are constructed. Each section is therefore marked as a restriction, and addition, or a refinement.

NOTE: Features of a source XML document that are not represented in the tree defined by the data model will have no effect on the operation of an XSLT stylesheet. Examples of such features are entity references, CDATA sections, character references, whitespace within element tags, and the choice of single or double quotes around attribute values.

Ed. Note: We need to say something here about schemas and DTDs. See http://lists.w3.org/Archives/Member/w3c-xsl-wg/2002Jan/0113.html (members only) and subsequent discussion.

4.1 Parentless Nodes

Restriction

The data model defined in [Data Model] allows a node to be part of a tree whose root is a node other than a document node.

XSLT does not allow nodes with no parent (other than document nodes) to be created: a node created during the course of XSLT processing is not available for further processing until it has been attached to a tree that is rooted at a document node. However, implementations may allow parentless nodes to be supplied as input to the transformation, for example as parameters to the stylesheet.

4.2 Document Node Children

Ed. Note: This section can be removed when it is confirmed that the data model permits "well-balanced" trees. At the time of writing, this is still an open issue in the data model (Issue 0041).

The normal restrictions on the children of the document node are relaxed for the result tree and for temporary trees constructed during the evaluation of the stylesheet. The document node of such a tree may have any sequence of nodes as children that would be possible for an element node. In particular, it may have text node children, and any number of element node children. When written out using the XML output method (see [20 Serialization]), it is possible that a result tree will not be a well-formed XML document; however, it will always be a well-formed external general parsed entity.

For example, a stylesheet might produce the following output. This is a well-formed external general parsed entity, but it is not a well-formed XML document:

<?xml version="1.0" encoding="iso-8859-1"?>A <i>fine</i> mess!

When a source tree is created by parsing a well-formed XML document, the document node of the source tree will automatically satisfy the normal restrictions of having no text node children and exactly one element child. When a source tree is created in some other way, for example by using the DOM, the usual restrictions are relaxed for the source tree as for the result tree.

4.3 Unparsed Entities

Addition

Ed. Note: Unparsed entities don't currently appear in the data model, though we have asked for them to be added. This section can be deleted when the Data Model is updated to support unparsed entities.

Ed. Note: Add a normative reference to the InfoSet.

The document node has a mapping that gives the system identifier and public identifier for each unparsed entity declared in the document's DTD. The system identifier is derived from the [system identifier] of the corresponding unparsed entity information item in the InfoSet, resolved relative to its [declaration base URI] property, while the public identifier is derived directly from the [public identifier] property .

The URI is generated from the system identifier and public identifier specified in the entity declaration. The processor may use the public identifier to generate a URI for the entity instead of the URI specified in the system identifier. If the processor does not use the public identifier to generate the URI, it must use the system identifier; if the system identifier is a relative URI, it must be resolved into an absolute URI using the URI of the resource containing the entity declaration as the base URI [RFC2396].

4.4 Whitespace Stripping

Refinement

The source document supplied as input to the transformation process may contain whitespace nodes (that is, text nodes consisting solely of whitespace characters) that are of no interest, and that do not need to be retained by the transformation. Conceptually, such whitespace nodes may be removed from the tree before the transformation commences. This process is referred to as whitespace stripping. The source tree itself must not be modified: the processor may implement whitespace stripping either by creating a copy of the tree from which the whitespace nodes have been removed, or by working on a virtual tree in which the whitespace nodes are treated as if they were absent.

The stripping process takes as input a set of element names whose child whitespace nodes must be preserved. The stripping process is applied to both stylesheets and source documents, but the set of whitespace-preserving element names is determined differently for stylesheets and for source documents.

NOTE: Where multiple transformations are to be applied to the same source document, a useful optimization is to do the whitespace stripping only once. Implementations may therefore allow whitespace stripping to be controlled as a separate operation from the rest of the transformation process.

A text node is preserved if any of the following apply:

Otherwise, the text node is stripped.

The xml:space attributes are not removed from the tree.

NOTE: This implies that if an xml:space attribute is specified on a literal result element, it will be included in the result.

For stylesheets, the set of whitespace-preserving element names consists of just xsl:text.

Processing instructions and comments in a stylesheet module are ignored: the stylesheet module is treated as if the processing instructions and comments were not there. This also means that sibling text nodes that are separated by a processing instruction or comment in a stylesheet module are concatenated into a single text node; and a text node is classified as a whitespace text node for the purpose of whitespace stripping only after this concatenation has taken place.

The content model for some XSLT elements (for example xsl:stylesheet and xsl:choose) does not permit text nodes as children of these elements. If the xml:space="preserve" attribute is used to suppress the stripping of whitespace text nodes within such elements, then any whitespace used for the layout of such elements will be retained in the stylesheet tree in the form of whitespace text nodes. Such text nodes must not be reported as an error. [ERR023] Within an XSLT element that is required to be empty, any content other than comments or processing instructions, including any whitespace-only text node preserved using the xml:space="preserve" attribute, is a static error.

<!-- Category: declaration -->
<xsl:strip-space
  elements = tokens />

<!-- Category: declaration -->
<xsl:preserve-space
  elements = tokens />

For source documents, the set of whitespace-preserving element names is specified by xsl:strip-space and xsl:preserve-space declarations. Whether an element name is included in the set of whitespace-preserving names is determined by the best match amongst xsl:strip-space or xsl:preserve-space declarations: it is included if and only if there is no match or the best match is an xsl:preserve-space element. The xsl:strip-space and xsl:preserve-space elements each have an elements attribute whose value is a whitespace-separated list of NameTests; an element name matches an xsl:strip-space or xsl:preserve-space element if it matches one of the NameTests. An element matches a NameTest if and only if the NameTest would be true for the element as an XPath node test. When more than one xsl:strip-space and xsl:preserve-space element matches, the best matching element is determined by the best matching NameTest. This is determined in the same way as with template rules:

[ERR024] It is an dynamic error if this leaves more than one match. The processor must either signal the error, of must recover by choosing, from amongst the matches that are left, the one that occurs last in declaration order.

NOTE: A source document is supplied as input to the XSLT processor in the form of a tree. Nothing in this specification states that this tree must be built by parsing an XML document; nor does it state that the application that constructs the tree is required to treat whitespace in any particular way. The provisions in this section relate only to whitespace text nodes that are present in the tree supplied as input to the processor. In particular, the processor cannot preserve whitespace text nodes unless they were actually present in the supplied tree.

4.5 Namespace Fixup

Refinement

Ed. Note: The process of namespace fixup would ideally be described along with the node construction functions defined in the XPath 2.0 data model.

Issue 13 (shared-namespace-node-fixup): This section needs to be revised if namespace nodes are to be held at document level.

In a tree constructed by parsing an XML document, the following constraints relating to namespace nodes will be satisfied:

However, when a tree is being constructed as the result of an XSLT transformation, these constraints might not be satisfied unless special action is taken. In particular, since xsl:element and xsl:attribute instructions do not create namespace nodes, they will often cause these constraints not to be satisfied. The process of namespace fixup modifies a tree by adding namespace nodes so that it satisfies all constraints affecting namespace nodes. What namespace nodes are added and where they are added by namespace fixup is implementation-dependent, provided that the resulting tree satisfies the constraints and provided that all namespaces nodes in the resulting tree are allowable, where a namespace node is allowable for an element E if any of the following conditions applies:

Namespace fixup must not result in an element having multiple namespace nodes with the same expanded-QName.

Namespace fixup is performed in two situations:

There is no requirement to perform namespace fixup for the principal source document, nor for any document loaded using the document function, nor for any document supplied as the value of a global parameter, nor for any document returned by an extension function. [ERR025] It is a dynamic error if such a document does not already satisfy the constraints listed above . The processor may signal the error, or may recover by performing namespace fixup, or may produce implementation-dependent results.

4.6 Disable Output Escaping

Addition

If an implementation supports the disable-output-escaping attribute of xsl:text, xsl:value-of, and xsl:attribute (see [20.5 Disabling Output Escaping]), then the data model for trees constructed by the processor is augmented with a boolean value representing the value of this property.

Conceptually, each character in a text node on a result tree, and each attribute node, has a boolean property indicating whether the serializer should disable the normal rules for escaping of special characters (for example, outputting of & as &amp;) in respect of this character or attribute node.

This property is preserved when a text or attribute node is copied using xsl:copy or xsl:copy-of.

NOTE: There are many ways an implementation can avoid the overhead of actually storing a boolean flag with every character.

4.7 External Objects

The type system described in [Data Model] is extended to allow an extension function to return an object whose type is unknown to the XPath or XSLT processor. Such an object is referred to as an external object. For details, see [18.1.3 External Objects].

5 Syntactic Constructs

5.1 Qualified Names

The name of a stylesheet-defined object, specifically a named template, a mode, an attribute set, a key (see [16.3.2 The key Function]), a named sort specification, a decimal-format, a variable or parameter (see [9 Variables and Parameters]), a stylesheet function, or a named output definition, is specified as a QName.

A QName is always written in the form (NCName ":")? NCName, that is, a local name optionally preceded by a namespace prefix. When two QNames are compared, however, they are considered equal if the corresponding expanded-QNames are the same.

An expanded-QName is a pair of values containing a namespace URI and a local name. A QName is expanded by replacing the namespace prefix with the corresponding namespace URI, from the namespace declarations that are in scope at the point where the QName is written. Two expanded-QNames are equal if the namespace URIs are the same and the local names are the same.

[ERR026] It is a static error to use a reserved namespace URI in the name of any stylesheet-defined object. The reserved namespaces are listed in [3.1 XSLT Namespace].

QNames always occur either as the value of an attribute node in a stylesheet module, or within an XPath expression contained in such an attribute node, or as the result of evaluating an XPath expression contained in such an attribute node. The element containing this attribute node is referred to as the defining element of the QName.

If the QName has a prefix, then the prefix is expanded into a URI reference using the namespace declarations in effect on its defining element. The expanded-QName consisting of the local part of the name and the possibly null URI reference is used as the name of the object. The default namespace (as defined by a namespace declaration of the form xmlns="some.uri") is not used for unprefixed names.

In the case of an unprefixed QName used as a NameTest within an XPath expression (see [5.2 Expressions]) or within a pattern (see [5.3 Patterns]), or in the elements attribute of the xsl:strip-space and xsl:preserve-space instructions, the namespace to be used in expanding the QName may be specified by means of the [xsl:]default-xpath-namespace attribute, as specified in [5.4 Unprefixed Names in Expressions and Patterns].

[ERR027] In the case of a QName used as the value of an attribute in the stylesheet, or appearing within the text of an XPath expression in the the stylesheet, it is a static error if the defining element has no namespace node whose name matches the prefix of the QName.

[ERR028] In the case of a QName produced by evaluating an XPath expression, it is a dynamic error if the defining element has no namespace node whose name matches the prefix of the QName. The error is a dynamic error even if the value of the expression is known statically, for example if the QName is written as a string literal. The required action depends on the defining element.

5.2 Expressions

XSLT uses the expression language defined by XPath 2.0 [XPath 2.0]. Expressions are used in XSLT for a variety of purposes including:

An expression must match the XPath production Expr.

An XPath expression may occur as the value of certain attributes on XSLT-defined elements, and also within curly braces in attribute value templates.

[ERR029] It is a static error if the value of such an attribute, or the text between curly braces in an attribute value template, does not match the XPath production Expr, or if it fails to satisfy other static constraints defined in the XPath specification, for example that all variable references must refer to variables that are in scope.

The context within a stylesheet where an XPath expression may appear determines the required type of the expression. The required type indicates the data type of value that the expression is expected to return.

[ERR030] It is a type error if an XPath expression contains a type error, or if the type of the XPath expression is incompatible with the required type. The processor must either signal a type error as a static error, or must attempt to recover by converting the result of the expression to the required type using the standard type conversion rules; if conversion is not possible under these rules, the processor must signal a dynamic error

The context for evaluation of an XPath expression is determined according to the following rules. The context has two parts: the static context, and the dynamic expression evaluation context.

The static context depends on the element in the stylesheet that contains the attribute holding the XPath expression ("the containing element") as follows:

The evaluation context, which includes the focus, is determined as follows:

Where the containing element is an instruction or a literal result element, the focus is established as follows. In other cases, the rules are given for the specific containing element.

5.3 Patterns

A template rule identifies the nodes to which it applies by means of a pattern. As well as being used in template rules, patterns are used for numbering (see [12 Numbering]), for grouping (see [14 Grouping]), and for declaring keys (see [16.3 Keys]).

A pattern specifies a set of conditions on a node. A node that satisfies the conditions matches the pattern; a node that does not satisfy the conditions does not match the pattern. The syntax for patterns is a subset of the syntax for expressions. As explained in detail below, a node matches a pattern if the node can be selected by evaluating this expression with respect to some possible context.

Here are some examples of patterns:

  • para matches any para element

  • * matches any element

  • chapter|appendix matches any chapter element and any appendix element

  • olist/item matches any item element with an olist parent

  • appendix//para matches any para element with an appendix ancestor element

  • / matches the document node of any source document

  • text() matches any text node

  • node() matches any node other than an attribute node, namespace node, or document node

  • id("W11") matches the element with unique ID W11

  • para[1] matches any para element that is the first para child element of its parent

  • item[position() mod 2 = 1] matches any item element that is an odd-numbered item child of its parent.

  • div[@class="appendix"]//p matches any p element with a div ancestor element that has a class attribute with value appendix

  • @class matches any class attribute (not any element that has a class attribute)

  • @* matches any attribute node

Issue 110 (type-matching-in-patterns): The current syntax for matching elements of a particular type is cumbersome, for example match="*[. instance of element us-address of type address]".

[ERR032] Where an attribute is defined to contain a pattern, it is a static error if the pattern does not match the production Pattern. Every pattern is a legal XPath expression, but the converse is not true: 2+2 is an example of a legal XPath expression that is not a pattern. The XPath expressions that can be used as patterns are those that match the grammar for Pattern, given below.

Informally, a Pattern is a set of path expressions separated by |, where each step in the path expression is constrained to be an AxisStepExpr that uses only the child or attribute axes. Patterns may also use the // operator, and they may start with an id or key function call, provided that the value to be matched is supplied as either a literal or a reference to a variable or parameter, and the key name (in the case of the key function) is supplied as a string literal. Predicates in a pattern (the construct enclosed between square brackets) can contain arbitrary XPath expressions in the same way as predicates in a path expression.

If a pattern occurs in part of the stylesheet where backwards compatible behavior is enabled (see [3.6 Backwards-Compatible Processing]), then the pattern is restricted to use the syntax for patterns defined in XSLT 1.0, and will match a node if and only if it would have matched that node under the rules defined in XSLT 1.0.

Patterns
[1]   Pattern   ::=   PathPattern
| Pattern ('|' | 'union') PathPattern
[2]   PathPattern   ::=   RelativePathPattern
| '/' RelativePathPattern?
| '//' RelativePathPattern
| IdKeyPattern (('/' | '//') RelativePathPattern)?
[3]   RelativePathPattern   ::=   PatternStep (('/' | '//') RelativePathPattern)?
[4]   PatternStep   ::=   PatternAxis? NodeTest ( '[' Expr ']' )*
[5]   PatternAxis   ::=   ('child' '::' | 'attribute' '::' | '@')
[6]   IdKeyPattern   ::=   'id' '(' IdKeyValue ')'
| 'key' '(' StringLiteral ',' IdKeyValue ')'
[7]   IdKeyValue   ::=   Literal | Variable

The constructs NodeTest, StringLiteral, and Expr are part of the XPath expression language, and are defined in [XPath 2.0].

The meaning of a pattern is defined formally as follows. To determine whether a node N matches a pattern PAT, evaluate the expression //(PAT) with a singleton focus based on N. If the result is a sequence of nodes that includes N, then node N matches the pattern; otherwise node N does not match the pattern. This expression is constructed by textually inserting the pattern PAT exactly as written in the stylesheet.

For example, p matches any p element, because a p element will always be present in the result of evaluating the expression //(p). Similarly, / matches a document node, and only a document node, because the result of the expression //(/) returns the document node of the document containing the context node.

NOTE: Although the semantics of patterns are specified formally in terms of expression evaluation, it is possible to understand pattern matching using a different model. In a pattern, | indicates alternatives; a pattern with one or more | separated alternatives matches if any one of the alternatives matches. A pattern such as book/chapter/section can be examined from right to left. A node will only match this pattern if it is a section element; and then, only if its parent is a chapter; and then, only if the parent of that chapter is a book. When the pattern uses the // operator, one can still read it from right to left, but this time testing the ancestors of a node rather than its parent. For example appendix//section matches every section element that has an ancestor appendix element. The formal definition, however, is useful for understanding the meaning of a pattern such as item[1]. This matches any node selected by the expression //(item[1]): that is, any item element that is the first item child of its parent.

The pattern node() matches all nodes selected by the expression //(node()), that is, all element, text, comment, and processing instruction nodes. It does not match attribute or namespace nodes because the expression does not select nodes using the attribute or namespace axes.

NOTE: An implementation, of course, may use any algorithm it wishes for evaluating patterns, so long as the result corresponds with the formal definition above. An implementation that followed the formal semantics by evaluating the equivalent expression and then testing the membership of a specific node in the result would probably be very inefficient.

5.4 Unprefixed Names in Expressions and Patterns

The attribute [xsl:]default-xpath-namespace (see [3.3 Standard Attributes]) may be used on an element in the stylesheet to define the namespace that will be used for an unprefixed name used as a NameTest within a step of an XPath PathExpression or an XSLT Pattern or in the elements attribute of the xsl:strip-space or xsl:preserve-space instructions, where the NameTest occurs in an attribute of that stylesheet element or an attribute of a descendant of that stylesheet element. The value of the attribute is the namespace URI to be used.

This default namespace URI applies only to a NameTest applied to an axis whose principal node type is elements: it does not apply when the step is using the attribute or namespace axis. The default namespace URI for such a name is the value of the [xsl:]default-xpath-namespace attribute on the innermost ancestor element that has such an attribute, considering all ancestor elements of the attribute containing the XPath expression or XSLT pattern. The [xsl:]default-xpath-namespace attribute must be in the XSLT namespace if and only if its parent element is not in the XSLT namespace.

In the absence of this attribute, an unqualified NameTest (that is, a NameTest that is an NCName) matches an expanded-QName whose namespace URI is null: the default namespace (as defined by an xmlns="some-uri" declaration) is not used.

The default-xpath-namespace only affects unqualified names (names containing no colon) used in a NameTest. It does not affect other names, for example function names, variable names, or names used as arguments to the key or system-property functions.

Ed. Note: Do we need to add this attribute to all element proformas and to the DTD?

5.5 Attribute Value Templates

In an attribute that is designated as an attribute value template, such as an attribute of a literal result element, an expression can be used by surrounding the expression with curly braces ({}).

An attribute value template consists of an alternating sequence of fixed parts and variable parts. A variable part consists of an XPath expression enclosed in curly braces ({}). A fixed part may contain any characters, except that a left curly brace must be written as {{ and a right curly brace must be written as }}.

NOTE: An expression within a variable part may contain an unescaped curly brace within a StringLiteral. It may also contain a construct, such as a comment, that is delimited by paired opening and closing braces.

[ERR033] It is a static error if a left curly brace appears in an attribute value template without a matching right curly brace.

[ERR034] It is a static error if the string contained between matching curly braces in an attribute value template does not match the XPath production Expr.

[ERR035] It is a static error if a right curly brace occurs in an attribute value template outside an expression without being followed by a second right curly brace. A right curly brace inside a StringLiteral in an expression is not recognized as terminating the expression.

The required type of each expression within an attribute value template is xs:string.

The result of evaluating an attribute value template is referred to as the effective value of the attribute. The effective value is the string obtained by concatenating the expansions of the fixed and variable parts. The expansion of a fixed part is obtained by replacing any double curly braces ({{ or }}) by the corresponding single curly brace. The expansion of a variable part is obtained by evaluating the enclosed XPath expression and converting the resulting value to a string as if by a call to the string function. This conversion is done by taking the sequence of values returned by the expression, replacing any node in this sequence by its typed value (which might itself be a sequence), and then converting each of the atomic values in the resulting sequence to a string, adding a single space after each value other than the last. If the sequence is empty, the result is a zero-length string .

NOTE: This may give a different result from XSLT 1.0. In XSLT 1.0, if the expression returned a node-set, all nodes other than the first were ignored.

Ed. Note: The Working Group is interested in receiving feedback on whether this proposed backwards incompatibility is acceptable, and on whether the same change should also be made for the xsl:value-of instruction.

Curly braces are not treated specially in an attribute value in an XSLT stylesheet unless the attribute is specifically designated as one that permits an attribute value template; in an element syntax summary, the value of such attributes is surrounded by curly braces.

NOTE: Not all attributes are interpreted as attribute value templates. Attributes whose value is an expression or pattern, attributes of top-level elements and attributes that refer to named XSLT objects are not interpreted as attribute value templates. One exception is the xsl:principal-result-document declaration: although this is a top-level element, some of its attributes are interpreted as attribute value templates. In addition, xmlns attributes are not interpreted as attribute value templates: this is because they must be interpreted in the same way by the XML parser and the XSLT processor.

The following example creates an img result element from a photograph element in the source; the value of the src attribute of the img element is computed from the value of the image-dir variable and the string-value of the href child of the photograph element; the value of the width attribute of the img element is computed from the value of the width attribute of the size child of the photograph element:

<xsl:variable name="image-dir">/images</xsl:variable>

<xsl:template match="photograph">
<img src="{$image-dir}/{href}" width="{size/@width}"/>
</xsl:template>

With this source

<photograph>
  <href>headquarters.jpg</href>
  <size width="300"/>
</photograph>

the result would be

<img src="/images/headquarters.jpg" width="300"/>
 

The following source:

<temperature readings="{(10.32, 5.50, 8.31)}"/>

produces the result:

<temperature readings="10.32 5.5 8.31"/>
 

Curly braces are not recognized recursively inside expressions.

For example:

<a href="#{id({@ref})/title}">

is not allowed. Instead, use simply:

<a href="#{id(@ref)/title}">

5.6 Content Constructors

NOTE: The term content constructor replaces template as used in XSLT 1.0. The change is made partly for clarity (to avoid confusion with template rules and named templates), but also to reflect a more formal definition of the semantics. Whereas XSLT 1.0 described a template as a sequence of instructions that write to the result tree, XSLT 2.0 describes a content constructor as something that can be evaluated to return a sequence of nodes; what happens to these nodes depends on the containing instruction.

Many XSLT elements (including literal result elements) are defined to take as their content a content constructor.

A content constructor is a sequence of nodes in the stylesheet that is used to generate nodes in the result tree.

Informally, the nodes in a content constructor act as templates for nodes to be added to the result tree. Some nodes (text nodes and literal result elements) are copied directly from the stylesheet to the result tree. Other nodes (XSLT instructions and extension instructions) are evaluated to produce new nodes, which are added to the result tree: for example, the xsl:element instruction produces an element node, and the xsl:comment instruction produces a comment node.

More formally, the semantics of a content constructor are defined in terms of the constructor functions defined in the data model (see [Data Model]). Each node in a content constructor is evaluated to produce a sequence of new nodes. These sequences are concatenated in the order of the stylesheet nodes that produced them. The resulting concatenated sequence is not available directly to the stylesheet code; rather, it is processed by the stylesheet instruction that contains the content constructor. Some instructions, such as xsl:if and xsl:for-each include the new nodes in their own result sequence, to be processed by their containing instruction. Other instructions, such as xsl:element and literal result elements, construct a new element node, and use the nodes created by the content constructor as the attributes, namespace nodes and child nodes of the new element. In general, the content constructor produces a sequence of nodes, and the way in which these nodes contribute to the result tree is defined by the containing instruction.

 

For example, consider the following stylesheet fragment:

<td>
<xsl:attribute name="valign">top</xsl:attribute>
<xsl:value-of select="@description"/>
</td>

This fragment consists of a literal result element td, acting as the parent of a content constructor that consists of two instructions: xsl:attribute and xsl:value-of. The content constructor is evaluated to produce a sequence of two nodes, an attribute node (with name valign and value top), and a text node (whose value is obtained from the description attribute of the context node in the source document). These two nodes are processed by the td literal result element, which itself constructs a td element having a valign attribute and a single text node. The sequence containing the attribute node and the text node is never available for processing directly in the stylesheet, rather it forms the input to the element constructor in the data model that creates the td element.

 

This sequence is referred to below as the result sequence.

Four kinds of nodes may be encountered in a content constructor:

The node sequences produced by each node in the content constructor are concatenated to form a single result sequence. This concatenation retains the order of the nodes within the content constructor: if while evaluating a content constructor, node M is constructed by instruction I, and node N is constructed by a different instruction J, then N will appear after M in the result sequence if and only if J follows I in document order. This does not mean that the nodes in a content constructor must be evaluated sequentially: on the contrary, they may be evaluated in any order, or in parallel, provided that their results are assembled in the correct sequence on completion.

If the result sequence contains two or more adjacent text nodes, these adjacent text nodes are concatenated to form a single text node.

The result sequence will never contain a document node. [ERR036] It is an dynamic error if an extension instruction attempts to return a sequence containing a document node. The processor must signal the error.

[ERR037] It is a dynamic error if the result sequence (after concatenating the results of individual instructions) contains a namespace node that is preceded in the sequence by a node that is not a namespace node. The processor must either signal the error, or must recover by ignoring the offending namespace node.

Issue 30 (must-namespaces-precede-attributes): It appears that several implementations currently allow a namespace node to be added after adding attributes (using xsl:copy ). This seems convenient for the user, and the Working Group is inclined to allow it. To achieve this, we will need to define some conflict resolution if the namespace clashes with an existing attribute.

[ERR038] It is a dynamic error if the result sequence (after concatenating the results of individual instructions) contains an attribute node that is preceded in the sequence by a node that is neither a namespace node nor an attribute node. The processor must either signal the error, or must recover by ignoring the offending attribute node.

What actually happens to the nodes in the result sequence depends on the element containing the content constructor. Some elements, such as xsl:if, simply return the sequence of nodes, which are added to the result of the content constructor containing the xsl:if instruction. Other elements, such as xsl:element, xsl:comment, and xsl:variable, construct a new node, and use the value returned by the content constructor to create the children or the text content of the new node.

Specifically:

Ed. Note: The above text should be rewritten to provide a formal mapping to the constructor functions defined in the data model.

Issue 99 (sequences-in-XSLT): Should we consider extending XSLT to allow construction of arbitrary sequences?

6 Template Rules

Template rules implement the push processing model described in [2.4.1 Push Processing].

6.1 Defining Template Rules

<!-- Category: declaration -->
<xsl:template
  match = pattern
  name = qname
  priority = number
  mode = tokens>
  <!-- Content: (xsl:param*, content-constructor) -->
</xsl:template>

A template rule is specified with the xsl:template element. The match attribute is a Pattern that identifies the source node or nodes to which the rule applies. The match attribute is required unless the xsl:template element has a name attribute (see [10.1 Named Templates]). The result of applying the template rule is the result of evaluating the content constructor contained in the xsl:template element, with the matching node used as the context node

For example, an XML document might contain:

This is an <emph>important</emph> point.

The following template rule matches emph elements and produces a fo:wrapper formatting object with a font-weight property of bold.

<xsl:template match="emph">
  <fo:wrapper font-weight="bold">
    <xsl:apply-templates/>
  </fo:wrapper>
</xsl:template>
NOTE: Examples in this document use the fo: prefix for the namespace http://www.w3.org/1999/XSL/Format, which is the namespace of the formatting objects defined in [XSL Formatting Objects].

As described in the next section, the xsl:apply-templates element can be used to process the children of a source element, and their children, recursively.

6.2 Applying Template Rules

<!-- Category: instruction -->
<xsl:apply-templates
  select = node-sequence-expression
  mode = token>
  <!-- Content: (xsl:sort | xsl:with-param)* -->
</xsl:apply-templates>

The xsl:apply-templates instruction takes as input a sequence of nodes in the source tree, and produces as output a sequence of nodes which are typically added to the result tree. Each node in the input sequence is processed by finding a template rule whose pattern matches that node. If there is more than one, the best among them is chosen, using rules described below; if there is none, a built-in template rule is used. The chosen template rule is evaluated, and produces a sequence of new nodes. The resulting sequences of nodes (one for each node in the input sequence) are then concatenated, to form a single sequence. They are concatenated retaining the order of the nodes in the original input sequence, unless a different order is requested using xsl:sort. The final concatenated sequence of nodes forms the result of the xsl:apply-templates instruction, and is passed to its parent instruction, which will normally add the nodes to the result tree.

This example creates a block for a chapter element and then processes its immediate children.

<xsl:template match="chapter">
  <fo:block>
    <xsl:apply-templates/>
  </fo:block>
</xsl:template>

In the absence of a select attribute, the xsl:apply-templates instruction processes all of the children of the context node, including text nodes. [ERR042] It is a dynamic error if the context item is not a node. The processor must either signal the error, or must recover by returning an empty sequence.

Whitespace text nodes that have been stripped as specified in [4.4 Whitespace Stripping] will not be processed. If stripping of whitespace nodes has not been enabled for an element, then all whitespace in the content of the element will be processed as text, and thus whitespace between child elements will count in determining the position of a child element as returned by the position function. This effect can be prevented by stripping whitespace text nodes as specified in [4.4 Whitespace Stripping].

A select attribute can be used to process nodes selected by an expression instead of processing all children. The value of the select attribute is an expression. The expression must evaluate to a sequence of nodes (it can contain zero, one, or more nodes). The order of the result nodes will be the same as the order of this sequence, unless a sorting specification is present (see [13 Sorting]).

[ERR043] It is a dynamic error if the sequence returned by the select expression contains an item that is not a node. The processor must either signal the error, or must recover by ignoring the offending items.

NOTE: In XSLT 1.0, the select attribute selected a set of nodes, which by default were processed in document order. In XSLT 2.0, it selects a sequence of nodes. In cases that would have been valid in XSLT 1.0, the expression will return a sequence of nodes in document order, so the effect is the same.

Ed. Note: Do we still need all these examples?

The following example processes all of the author children of the author-group:

<xsl:template match="author-group">
  <fo:wrapper>
    <xsl:apply-templates select="author"/>
  </fo:wrapper>
</xsl:template>

The following example processes all of the given-names of the authors that are children of author-group:

<xsl:template match="author-group">
  <fo:wrapper>
    <xsl:apply-templates select="author/given-name"/>
  </fo:wrapper>
</xsl:template>

This example processes all of the heading descendant elements of the book element.

<xsl:template match="book">
  <fo:block>
    <xsl:apply-templates select=".//heading"/>
  </fo:block>
</xsl:template>

It is also possible to process elements that are not descendants of the context node. This example assumes that a department element has group children and employee descendants. It finds an employee's department and then processes the group children of the department.

<xsl:template match="employee">
  <fo:block>
    Employee <xsl:apply-templates select="name"/> belongs to group
    <xsl:apply-templates select="ancestor::department/group"/>
  </fo:block>
</xsl:template>

Multiple xsl:apply-templates elements can be used within a single template to do simple reordering. The following example creates two HTML tables. The first table is filled with domestic sales while the second table is filled with foreign sales.

<xsl:template match="product">
  <table>
    <xsl:apply-templates select="sales/domestic"/>
  </table>
  <table>
    <xsl:apply-templates select="sales/foreign"/>
  </table>
</xsl:template>
NOTE: It is possible for there to be two matching descendants where one is a descendant of the other. This case is not treated specially: both descendants will be processed as usual.

For example, given a source document

<doc><div><div></div></div></doc>

the rule

<xsl:template match="doc">
  <xsl:apply-templates select=".//div"/>
</xsl:template>

will process both the outer div and inner div elements.

NOTE: Typically, xsl:apply-templates is used to process only nodes that are descendants of the context node. Such use of xsl:apply-templates cannot result in non-terminating processing loops. However, when xsl:apply-templates is used to process elements that are not descendants of the context node, the possibility arises of non-terminating loops. For example,
<xsl:template match="foo">
  <xsl:apply-templates select="."/>
</xsl:template>
Implementations may be able to detect such loops in some cases, but the possibility exists that a stylesheet may enter a non-terminating loop that an implementation is unable to detect. This may present a denial of service security risk.

6.3 Conflict Resolution for Template Rules

It is possible for a node in a source document to match more than one template rule. The template rule to be used is determined as follows:

  1. First, all matching template rules that have lower import precedence than the matching template rule or rules with the highest import precedence are eliminated from consideration.

  2. Next, all matching template rules that have lower priority than the matching template rule or rules with the highest priority are eliminated from consideration. The priority of a template rule is specified by the priority attribute on the template rule. [ERR044] The value of this must be a real number (positive or negative), matching the production NumericLiteral with an optional leading minus sign (-). [ERR045] If an xsl:template element does not have a match attribute, then it must not have a priority attribute.

    If no priority attribute is specified on the xsl:template element, the default priority is computed as follows:

    • If the pattern contains multiple alternatives separated by |, then it is treated equivalently to a set of template rules, one for each alternative. Note, however, that it is not an error if a node matches more than one of the alternatives.

    • If the pattern has the form of a QName optionally preceded by a PatternAxis or has the form processing-instruction(StringLiteral) optionally preceded by a PatternAxis, then the priority is 0.

    • If the pattern has the form NCName:* or *:NCName, optionally preceded by a PatternAxis, then the priority is -0.25.

    • Otherwise, if the pattern consists of just a NodeTest optionally preceded by a PatternAxis, then the priority is -0.5.

    • Otherwise, the priority is 0.5.

    NOTE: In many cases this means that highly selective patterns have higher priority than less selective patterns. The most common kind of pattern (a pattern that tests for a node with a particular type and a particular expanded-QName) has priority 0. The next less specific kind of pattern (a pattern that tests for a node with a particular type and an expanded-QName with a particular namespace URI) has priority -0.25. Patterns less specific than this (patterns that just tests for nodes with particular types) have priority -0.5. Patterns more specific than the most common kind of pattern have priority 0.5. However, it is not invariably true that a more selective pattern has higher priority than a less selective pattern. For example, the priority of the pattern node()[self::*] is higher than that of the pattern item. Therefore, to achieve clarity in a stylesheet it is good practice to allocate explicit priorities.

[ERR046] It is a dynamic error if this leaves more than one matching template rule. The processor must either signal the error, or must recover by choosing, from amongst the matching template rules that are left, the one that occurs last in declaration order.

6.4 Modes

Modes allow a node in the source tree to be processed multiple times, each time producing a different result. They also allow different sets of template rules to be active when processing different trees, for example when processing documents loaded using the document function (see [16.1 Multiple Source Documents]) or when processing temporary trees (see [9 Variables and Parameters])

Modes are identified by a QName, except for the default mode, which is unnamed.

A template rule is applicable to one or more modes. The modes to which it is applicable are defined by the mode attribute of the xsl:template element. If the attribute is omitted, then the template rule is applicable to the default mode. If the attribute is present, then its value must be a space-separated list of tokens, each of which defines a mode to which the template rule is applicable. Each token must either be a QName, which is expanded as described in [5.1 Qualified Names] to define the name of the mode, or it must be the token #default, to indicate that the template is applicable to the default mode.

The xsl:apply-templates element also has an optional mode attribute. The value of this attribute must either be a QName, which is expanded as described in [5.1 Qualified Names] to define the name of a mode, or the token #default, to indicate that the default mode is to be used, or the token #current, to indicate that the current mode is to be used. If the attribute is omitted, the default mode is used.

When searching for a template rule to process each node selected by the xsl:apply-templates instruction, only those template rules that are applicable to the selected mode are considered.

At any point in the processing of a stylesheet, there is a current mode. When the transformation is initiated, the current mode is the default mode (unless the processor provides some mechanism for starting processing in a different mode). Whenever an xsl:apply-templates instruction is evaluated, the current mode becomes the mode selected by this instruction. When a stylesheet function is called, the current mode becomes the default mode. No other instruction changes the current mode. On completion of the xsl:apply-templates instruction, the current mode reverts to its previous value. The current mode is used when an xsl:apply-templates instruction uses the syntax mode="#current"; it is also used by the xsl:apply-imports instruction (see [6.6 Overriding Template Rules]).

[ERR047] If an xsl:template element does not have a match attribute, then it is a static error if it has a mode attribute.

6.5 Built-in Template Rules

When a node is selected by xsl:apply-templates and there is no template rule in the stylesheet that can be used to process that node, a built-in template rule is evaluated instead. The built-in template rule for document nodes and elements nodes causes the children of the node to be processed; the built in rule for text nodes and attribute nodes causes the text to be copied to the result tree.

The built-in template rules apply to all modes. It is not possible for a user-written template rule to apply to all modes, but for the sake of illustration, the syntax mode="#all" is used in the examples below as shorthand for a list of all modes (including the default mode) that are used in the stylesheet.

The built-in rule for document nodes and element nodes is equivalent to calling xsl:apply-templates with no select attribute, and with the mode attribute set to #current. If the built-in rule was invoked with parameters, those parameters are passed on in the implicit xsl:apply-templates instruction.

For example, suppose the stylesheet contains the following instruction:

<xsl:apply-templates select="title" mode="mm">
  <xsl:with-param name="init" select="10"/>
</xsl:apply-template>

If there is no explicit template rule that matches the title element, then the following implicit rule is used:

<xsl:template match="title" mode="#all">
  <xsl:with-param name="init"/>
  <xsl:apply-templates mode="#current">
    <xsl:with-param name="init" select="$init"/>
  </xsl:apply-templates>
</xsl:template>

The built-in template rule for text and attribute nodes returns a text node containing the string value of the context node. It is effectively:

<xsl:template match="text()|@*" mode="#all">
  <xsl:value-of select="."/>
</xsl:template>

The built-in template rule for processing instructions and comments does nothing (it returns the empty sequence).

<xsl:template match="processing-instruction()|comment()" mode="#all"/>

The built-in template rule for namespace nodes is also to do nothing. There is no pattern that can match a namespace node; so, the built-in template rule is the only template rule that is applied for namespace nodes.

The built-in template rules have lower import precedence than all other template rules. Thus, the stylesheet author can override a built-in template rule by including an explicit template rule.

6.6 Overriding Template Rules

<!-- Category: instruction -->
<xsl:apply-imports>
  <!-- Content: xsl:with-param* -->
</xsl:apply-imports>

A template rule that is being used to override a template rule in an imported stylesheet (see [6.3 Conflict Resolution for Template Rules]) can use the xsl:apply-imports element to invoke the overridden template rule.

At any point in the processing of a stylesheet, there may be a current template rule. Whenever a template rule is chosen by matching a pattern, the template rule becomes the current template rule for the evaluation of the rule's content constructor. When an xsl:for-each or xsl:for-each-group instruction is evaluated, or when a stylesheet function is called (see [10.3 Stylesheet Functions]), the current template rule becomes null for the evaluation of that instruction or function.

The current template rule is not affected by invoking named templates (see [10.1 Named Templates]) or named attribute sets (see [10.2 Named Attribute Sets]), nor stylesheet functions (see [10.3 Stylesheet Functions]). While evaluating a global variable or parameter (see [9.3 Global Variables and Parameters]) the current template rule is null.

xsl:apply-imports searches for a template rule that matches the context node, and that is applicable to the current mode (see [6.4 Modes]). In choosing a template rule, it uses the usual criteria such as the priority and import precedence of the template rules, but it considers as candidates only those template rules contained in stylesheet levels that are descendants in the import tree of the stylesheet level that contains the current template rule.

NOTE: This is not the same as saying that the search considers all template rules whose import precedence is lower than that of the current template rule.

If no matching template rule is found that satisfies these criteria, the built-in template rule for the node type is used (see [6.5 Built-in Template Rules]).

[ERR048] It is an error if the xsl:apply-imports instruction is evaluated when the context item is not a node. The processor must either signal the error, or must recover by returning an empty sequence.

An xsl:apply-imports element may use xsl:with-param child elements to pass parameters to the chosen template rule (see [10.1.1 Passing Parameters to Templates]).

[ERR049] It is a dynamic error if xsl:apply-imports is evaluated when the current template rule is null. The processor must signal the error.

For example, suppose the stylesheet doc.xsl contains a template rule for example elements:

<xsl:template match="example">
  <pre><xsl:apply-templates/></pre>
</xsl:template>

Another stylesheet could import doc.xsl and modify the treatment of example elements as follows:

<xsl:import href="doc.xsl"/>

<xsl:template match="example">
  <div style="border: solid red">
     <xsl:apply-imports/>
  </div>
</xsl:template>

The combined effect would be to transform an example into an element of the form:

<div style="border: solid red"><pre>...</pre></div>

7 Repetition

<!-- Category: instruction -->
<xsl:for-each
  select = sequence-expression>
  <!-- Content: (xsl:sort*, content-constructor) -->
</xsl:for-each>

The xsl:for-each instruction processes each item in a sequence of items, evaluating the content constructor within the xsl:for-each instruction once for each item in that sequence.

The select attribute is required, and the expression must evaluate to a sequence, called the input sequence. If there is an xsl:sort element present (see [13 Sorting]) the input sequence is sorted to produce a sorted sequence. Otherwise, the sorted sequence is the same as the input sequence.

The xsl:for-each instruction contains a content constructor, which is evaluated once for each item in the sorted sequence. The content constructor is evaluated with the focus set as follows:

For each item in the input sequence, evaluating the content constructor produces a sequence of output nodes; these output sequences are concatenated in the same order as the sorted sequence. The result of the xsl:for-each instruction is the concatenated sequence of output nodes.

NOTE: With XSLT 1.0, the selected nodes were processed in document order. With XSLT 2.0, XPath expressions that would have been valid under XPath 1.0 (such as path expressions and union expressions) will return a sequence of nodes that is already in document order, so backwards compatibility is maintained.

For example, given an XML document with this structure

<customers>
  <customer>
    <name>...</name>
    <order>...</order>
    <order>...</order>
  </customer>
  <customer>
    <name>...</name>
    <order>...</order>
    <order>...</order>
  </customer>
</customers>

the following would create an HTML document containing a table with a row for each customer element

<xsl:template match="/">
  <html>
    <head>
      <title>Customers</title>
    </head>
    <body>
      <table>
	<tbody>
	  <xsl:for-each select="customers/customer">
	    <tr>
	      <th>
		<xsl:apply-templates select="name"/>
	      </th>
	      <xsl:for-each select="order">
		<td>
		  <xsl:apply-templates/>
		</td>
	      </xsl:for-each>
	    </tr>
	  </xsl:for-each>
	</tbody>
      </table>
    </body>
  </html>
</xsl:template>

8 Conditional Processing

There are two instructions in XSLT that support conditional processing in a template: xsl:if and xsl:choose. The xsl:if instruction provides simple if-then conditionality; the xsl:choose instruction supports selection of one choice when there are several possibilities.

8.1 Conditional Processing with xsl:if

<!-- Category: instruction -->
<xsl:if
  test = expression>
  <!-- Content: content-constructor -->
</xsl:if>

The xsl:if element has a test attribute, which specifies an expression. The content is a content constructor.

The result of the xsl:if instruction depends on the effective boolean value of the expression in the test attribute. The rules for determining the effective boolean value of an expression are given in [XPath 2.0]: they are the same as the rules used for XPath conditional expressions.

If the effective boolean value of the expression is true, then the content constructor is evaluated, and the resulting node sequence is returned as the result of the xsl:if instruction; otherwise, an empty sequence is returned.

In the following example, the names in a group of names are formatted as a comma separated list:

<xsl:template match="namelist/name">
  <xsl:apply-templates/>
  <xsl:if test="not(position()=last())">, </xsl:if>
</xsl:template>

The following colors every other table row yellow:

<xsl:template match="item">
  <tr>
    <xsl:if test="position() mod 2 = 0">
       <xsl:attribute name="bgcolor">yellow</xsl:attribute>
    </xsl:if>
    <xsl:apply-templates/>
  </tr>
</xsl:template>

8.2 Conditional Processing with xsl:choose

<!-- Category: instruction -->
<xsl:choose>
  <!-- Content: (xsl:when+, xsl:otherwise?) -->
</xsl:choose>

<xsl:when
  test = expression>
  <!-- Content: content-constructor -->
</xsl:when>

<xsl:otherwise>
  <!-- Content: content-constructor -->
</xsl:otherwise>

The xsl:choose element selects one among a number of possible alternatives. It consists of a sequence of xsl:when elements followed by an optional xsl:otherwise element. Each xsl:when element has a single attribute, test, which specifies an expression. The content of the xsl:when and xsl:otherwise elements is a content constructor.

When an xsl:choose element is processed, each of the xsl:when elements is tested in turn (that is, in document order as the elements appear in the stylesheet), until one of the xsl:when elements is satisfied. An xsl:when element is satisfied if the effective boolean value of the expression in its test attribute is true. The rules for determining the effective boolean value of an expression are given in [XPath 2.0]: they are the same as the rules used for XPath conditional expressions.

The content of the first, and only the first, xsl:when element that is satisfied is evaluated, and the resulting node sequence is returned as the result of the xsl:choose instruction. If no xsl:when element is satisfied, the content of the xsl:otherwise element is evaluated, and the resulting node sequence is returned as the result of the xsl:choose instruction. If no xsl:when element is satisfied, and no xsl:otherwise element is present, the result of the xsl:choose instruction is an empty sequence.

Only the content constructor of the selected xsl:when or xsl:otherwise instruction is evaluated. The test expressions for xsl:when instructions after the selected one are not evaluated.

The following example enumerates items in an ordered list using arabic numerals, letters, or roman numerals depending on the depth to which the ordered lists are nested.

<xsl:template match="orderedlist/listitem">
  <fo:list-item indent-start='2pi'>
    <fo:list-item-label>
      <xsl:variable name="level"
                    select="count(ancestor::orderedlist) mod 3"/>
      <xsl:choose>
        <xsl:when test='$level=1'>
          <xsl:number format="i"/>
        </xsl:when>
        <xsl:when test='$level=2'>
          <xsl:number format="a"/>
        </xsl:when>
        <xsl:otherwise>
          <xsl:number format="1"/>
        </xsl:otherwise>
      </xsl:choose>
      <xsl:text>. </xsl:text>
    </fo:list-item-label>
    <fo:list-item-body>
      <xsl:apply-templates/>
    </fo:list-item-body>
  </fo:list-item>
</xsl:template>

9 Variables and Parameters

<!-- Category: declaration -->
<!-- Category: instruction -->
<xsl:variable
  name = qname
  select = expression
  type = sequence-type
  type-information = "strict" | "lax" | "preserve" | "none">
  <!-- Content: content-constructor -->
</xsl:variable>

<!-- Category: declaration -->
<xsl:param
  name = qname
  select = expression
  type = sequence-type
  type-information = "strict" | "lax" | "preserve" | "none">
  <!-- Content: content-constructor -->
</xsl:param>

The two elements xsl:variable and xsl:param are referred to as variable-binding elements.

Issue 144 (type-attribute-names): It is inelegant to have two attributes named type and type-information on the same element.

A variable is a name that may be bound to a value. The value to which a variable is bound (the value of the variable) can be an object of any of the types that can be returned by expressions. There are two elements that can be used to bind variables: xsl:variable and xsl:param. The difference is that the value specified on the xsl:param variable is only a default value for the binding; when the template or stylesheet within which the xsl:param element occurs is invoked, parameters may be passed that are used in place of the default values.

Both xsl:variable and xsl:param have a required name attribute, which specifies the name of the variable. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names].

Both xsl:variable and xsl:param have an optional type attribute, which specifies the required type of the variable. The value of the type attribute is a SequenceType, as defined in [XPath 2.0]. The result of evaluating the expression contained in the select attribute, or in the case of xsl:param, the value supplied by the caller, is referred to as the supplied value of the variable. [ERR050] If the type attribute is specified, then the supplied value of the variable is converted to the required type, using the same rules that apply when converting a supplied value to the required type for an argument in a function call. These rules are specified in [XPath 2.0]. It is a type error if this conversion fails.

If the type attribute is omitted, the supplied value of the variable is used directly, and no conversion takes place. The effect is the same as if the type were specified as item*.

The type-information attribute is used to determine what type annotations should be present in a newly constructed tree. [ERR051] It is an error to specify the type-information attribute on a variable binding element that has empty content. For details of the meaning of this attribute, see [9.2 Temporary Trees].

NOTE: In the case of xsl:param, the type-information attribute is relevant only to the default value of the parameter; it does not affect the treatment of a value supplied as an actual parameter by the caller.

For any use of these variable-binding elements, there is a region of the stylesheet tree within which the binding is visible. The set of variable bindings in scope for an expression consists of those bindings that are visible at the point in the stylesheet where the expression occurs.

9.1 Values of Variables and Parameters

A variable-binding element can specify the supplied value of the variable in three alternative ways.

Issue 136 (default-param-is-string): The default value of a parameter defined as <xsl:param name="x" type="xs:integer"/> is an empty string; this is not a valid xs:integer, so an error will be reported.

The actual value of the variable depends on the supplied value, as described above, and the required type, which is determined by the value of the type attribute.

NOTE: When a variable is used to select nodes by position, be careful not to do:
<xsl:variable name="n">2</xsl:variable>
...
<xsl:value-of select="item[$n]"/>
This will output the value of the first item element, because the variable n will be bound to a node, not a number. Instead, do one of the following:
<xsl:variable name="n" select="2"/>
...
<xsl:value-of select="item[$n]"/>
or
<xsl:variable name="n">2</xsl:variable>
...
<xsl:value-of select="item[position()=$n]"/>
or
<xsl:variable name="n" type="xs:integer">2</xsl:variable>
...
<xsl:value-of select="item[$n]"/>

9.2 Temporary Trees

A temporary tree is constructed by evaluating an xsl:variable, xsl:param, xsl:with-param, or xsl:result element that has non-empty content. This element is referred to as the variable-binding element. The supplied value of the variable is a single node, the document node of the temporary tree.

The actual value of the variable depends on the supplied value, as described above, and the required type, which is determined by the value of the type attribute.

Issue 135 (tree-to-string-conversion): The current XPath 2.0 rules for function calling, as well as addition and equality comparison, do not allow a document node to be converted to a string.

Namespace fixup is performed on the temporary tree (see [4.5 Namespace Fixup]).

The base URI of a node in the temporary tree is determined as if all the nodes in the temporary tree came from a single entity whose URI was the base URI of the variable-binding element (see [Data Model]). Thus, the base URI of the document node will be equal to the base URI of the variable-binding element; an xml:base attribute within the temporary tree will change the base URI for its parent element and that element's descendants, just as it would within a document constructed by parsing.

A temporary tree is available for processing in exactly the same way as any source document. For example, its nodes are accessible using path expressions, and they can be processed using instructions such as xsl:apply-templates and xsl:for-each. Also, the key function can be used to find nodes within a temporary tree, provided that at the time the function is called, the context item is a node within the temporary tree.

The type-information attribute determines what type annotations will be present on the element and attribute nodes of the constructed tree. The default value is none. The values are as follows:

When requested, schema validation is performed according to the process described in [XML Schema]. As well as adding type annotations, this process may also insert nodes to represent defaulted attributes.

[ERR053] If schema validation is requested and the document is not well-formed (that is, if it contains text nodes as children of the document node, or if the number of element children of the document root is not exactly one), a dynamic error occurs. The processor must signal the error.

Issue 143 (validation-implies-well-formedness): Should we require that when validation of a temporary tree is performed, the tree must contain a single top-level element node? It would be possible to define validation without requiring this: it could mean that each top-level element is independently validated.

[ERR054] If schema validation is requested and the schema validity assessment concludes that the document is invalid, a dynamic error occurs. The processor must signal the error.

For example, the following stylesheet uses a temporary tree as the intermediate result of a two-phase transformation, using different modes for the two phases (see [6.4 Modes]):

Ed. Note: We need to add examples to clarify how the type attribute works, especially with temporary trees.

<xsl:stylesheet
  version="2.0"
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

<xsl:import href="phase1.xsl"/>
<xsl:import href="phase2.xsl"/>

<xsl:variable name="intermediate">
  <xsl:apply-templates select="/" mode="phase1"/>
</xsl:variable>

<xsl:template match="/">
  <xsl:apply-templates select="$intermediate" mode="phase2"/>
</xsl:template>

</xsl:stylesheet>
NOTE: The algorithm for matching nodes against template rules is exactly the same regardless which tree the nodes come from; if nodes from different trees cannot be distinguished by their expanded-QName, it is therefore a good idea to use modes to ensure that each tree is processed using the appropriate set of template rules.

9.3 Global Variables and Parameters

Both xsl:variable and xsl:param are allowed as top-level elements. A top-level variable-binding element declares a global variable that is visible everywhere (except where it is shadowed by another binding). A top-level xsl:param element declares a global parameter to the stylesheet; XSLT does not define the mechanism by which parameters are passed to the stylesheet.

If a stylesheet contains more than one binding for a global variable or parameter of a particular name, then the binding with the highest import precedence is used. [ERR055] It is a static error if a stylesheet contains more than one binding of a global variable with the same name and same import precedence.

For a global variable or parameter, the expression or content constructor specifying the variable value is evaluated with a singleton focus based on the document node of the document containing the initial context node.

The following example declares a global parameter para-font-size, which it references in an attribute value template.

<xsl:param name="para-font-size" type="xs:string">12pt</xsl:variable>

<xsl:template match="para">
 <fo:block font-size="{$para-font-size}">
   <xsl:apply-templates/>
 </fo:block>
</xsl:template>

The implementation is expected to provide a mechanism allowing the user to supply a value for the parameter para-font-size when invoking the stylesheet; the value 12pt acts as a default.

9.4 Circular Definitions

If the expression or content constructor specifying the value of a global variable X references a global variable Y, then the value for Y must be computed before the value of X. If it is impossible to do this for all global variable definitions, then a circularity is said to exist.

For example the following two declarations create a circularity:

<xsl:variable name="x" select="$y+1"/>

<xsl:variable name="y" select="$x+1"/>

The definition of a global variable can be circular even if no other variable is involved. For example the following two declarations (see [10.3 Stylesheet Functions] for an explanation of the xsl:function element) also create a circularity:

<xsl:variable name="x" select="my:f()"/>

<xsl:function name="my:f">
  <xsl:result select="$x"/>
</xsl:function>

The definition of a variable is also circular if the evaluation of the variable invokes an xsl:apply-templates instruction and the variable is referenced in the pattern used in the match attribute of any template rule in the stylesheet. For example the following definition is circular:

<xsl:variable name="x">
  <xsl:apply-templates select="//param[1]"/>
</xsl:variable>

<xsl:template match="param[$x]">1</xsl:template>

Similarly, a variable definition is circular if it causes a call on the key function, and the definition of that key refers to that variable in its match or use attributes. So the following definition is circular:

<xsl:variable name="x" select="my:f(10)"/>

<xsl:function name="my:f">
  <xsl:param name="arg1"/>
  <xsl:result select="key('k', $arg1)"/>
</xsl:function>

<xsl:key name="k" match="item[@code=$x]" use="@desc"/>

[ERR056] In general, a circularity in a stylesheet is a dynamic error. The processor must signal the error. However, as with all other dynamic errors, an implementation will signal the error only if it actually executes the instructions and expressions that participate in the circularity. Because different implementations may optimize the execution of a stylesheet in different ways, it is implementation-dependent whether a particular circularity will actually be signaled.

For example, in the following declarations, the function declares a default value for a parameter, but it returns a result that does not require the default value to be evaluated. It is implementation-dependent whether the default value is actually evaluated, and it is therefore implementation-dependent whether the circularity is signaled as an error:

<xsl:variable name="x" select="my:f(1)/>

<xsl:function name="my:f">
  <xsl:param name="a" select="$x"/>
  <xsl:result select="$a"/>
</xsl:function>

Circularities usually involve global variables or parameters, but they can also exist between key definitions (see [16.3 Keys]), between named attribute sets (see [10.2 Named Attribute Sets]), or between any combination of these constructs. For example, a circularity exists if a key definition invokes a function that references an attribute set that calls the key function, supplying the name of the original key definition as an argument.

Circularity is not the same as recursion. Stylesheet functions (see [10.3 Stylesheet Functions]) and named templates (see [10.1 Named Templates]) may call other functions and named templates without restriction. With careless coding, recursion may be non-terminating. Implementations are required to signal circularity as a dynamic error, but they are not required to detect non-terminating recursion.

9.5 Local Variables and Parameters

As well as being allowed as top-level elements, both xsl:variable and xsl:param are also allowed in content constructors and within the xsl:function element (see [10.3 Stylesheet Functions]. Such a variable or parameter is known as a local variable or parameter.

The result of evaluating a local xsl:variable or xsl:param element (that is, the contribution it makes to the result of the content constructor it is part of) is an empty sequence.

A binding shadows another binding if the binding occurs at a point where the other binding is visible, and the bindings have the same name. It is a static error if a binding established by a local xsl:variable or xsl:param element shadows another binding established by another local xsl:variable or xsl:param. It is not an error if a binding established by a local xsl:variable or xsl:param shadows a global binding. In this case, the global binding will not be visible in the region of the stylesheet where it is shadowed by the other binding.

Thus, the following is an error:

<xsl:template name="foo">
  <xsl:param name="x" select="1"/>
  <xsl:variable name="x" select="2"/>
</xsl:template>

However, the following is allowed:

<xsl:param name="x" select="1"/>
<xsl:template name="foo">
  <xsl:variable name="x" select="2"/>
</xsl:template>
NOTE: Once a variable has been given a value, the value cannot subsequently be changed. XSLT does not provide an equivalent to the assignment operator available in many procedural programming languages. This is because an assignment operator would make it harder to create an implementation that processes a document other than in a batch-like way, starting at the beginning and continuing through to the end.
NOTE: As well as global variables and local variables, an XPath expression may also declare range variables for use locally within an expression. For details, see [XPath 2.0]. Where a reference to a variable occurs in an XPath expression, it is resolved first by reference to range variables that are in scope, then by reference to local variables and parameters, and finally by reference to global variables and parameters. A range variable may shadow a local variable or a global variable. XPath also allows a range variable to shadow another range variable.

9.6 Range Variables

As well as global variables and local variables, an XPath expression may also declare range variables for use locally within an expression. For details, see [XPath 2.0].

Examples of range variables occur in expressions such as:

<xsl:value-of select="sum(for $x in //item return $x/price * $x/qty)"/>

or:

<xsl:if test="some $i in authors/author satisfies substring($i, 1, 1) = 'A'"/>

Where a reference to a variable occurs in an XPath expression, it is resolved first by reference to range variables that are in scope, then by reference to local variables and parameters, and finally by reference to global variables and parameters. A range variable may shadow a local variable or a global variable. XPath also allows a range variable to shadow another range variable.

Issue 38 (xpath-variable-shadowing): Can variables declared within an XPath 2.0 expression shadow variables declared at XSLT level? And should variables declared within an XPath expression be allowed to shadow each other? If so, should we change the XSLT rules to be the same?

10 Callable Components

This section describes three constructs that can be used to provide subroutine-like functionality that can be invoked from anywhere in the stylesheet: named templates (see [10.1 Named Templates]), named attribute sets (see [10.2 Named Attribute Sets]) and stylesheet functions (see [10.3 Stylesheet Functions]).

10.1 Named Templates

<!-- Category: instruction -->
<xsl:call-template
  name = qname>
  <!-- Content: xsl:with-param* -->
</xsl:call-template>

Templates can be invoked by name. An xsl:template element with a name attribute specifies a named template. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names]. If an xsl:template element has a name attribute, it may, but need not, also have a match attribute. An xsl:call-template element invokes a template by name; it has a required name attribute that identifies the template to be invoked. Unlike xsl:apply-templates, the xsl:call-template instruction does not change the focus.

The match, mode and priority attributes on an xsl:template element do not affect whether the template is invoked by an xsl:call-template element. Similarly, the name attribute on an xsl:template element does not affect whether the template is invoked by an xsl:apply-templates element.

[ERR057] It is a static error if a stylesheet contains more than one template with the same name and same import precedence.

The result of evaluating an xsl:call-template instruction is the node sequence produced by evaluating the content constructor contained in the associated xsl:template element.

10.1.1 Passing Parameters to Templates

<xsl:with-param
  name = qname
  select = expression
  type-information = "strict" | "lax" | "preserve" | "none">
  <!-- Content: content-constructor -->
</xsl:with-param>

Parameters are passed to templates using the xsl:with-param element. The required name attribute specifies the name of the parameter (the variable the value of whose binding is to be replaced). The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names].

xsl:with-param is allowed within xsl:call-template, xsl:apply-templates and xsl:apply-imports. It is a static error if a single xsl:call-template, xsl:apply-templates or xsl:apply-imports element contains more than one xsl:with-param element with the same name.

The value of the parameter is specified in the same way as for xsl:variable and xsl:param. The focus used for computing the value specified by xsl:with-param element is the same as that used for the xsl:apply-templates, xsl:apply-imports, or xsl:call-template element within which it occurs. It is not an error to pass a parameter x to a template that does not have an xsl:param element for x; the parameter is simply ignored.

The type attribute is used in the same way as for the xsl:variable element. The supplied value is converted as necessary to the required type. This then defines the actual value of the parameter that is supplied to the called template. If necessary, this value may be converted once again to the type specified in the corresponding xsl:param element within the named template definition.

The type-information attribute determines what type annotations will be present on the element and attribute nodes of the constructed tree. This attribute must be omitted if the content of the xsl:with-param element is empty. The default value is none. For details, see [9.2 Temporary Trees].

This example defines a named template for a numbered-block with an argument to control the format of the number.

<xsl:template name="numbered-block">
  <xsl:param name="format">1. </xsl:param>
  <fo:block>
    <xsl:number format="{$format}"/>
    <xsl:apply-templates/>
  </fo:block>
</xsl:template>

<xsl:template match="ol//ol/li">
  <xsl:call-template name="numbered-block">
    <xsl:with-param name="format">a. </xsl:with-param>
  </xsl:call-template>
</xsl:template>
NOTE: Arguments to stylesheet functions are supplied as part of an XPath function call: see [10.3 Stylesheet Functions]

10.2 Named Attribute Sets

<!-- Category: declaration -->
<xsl:attribute-set
  name = qname
  use-attribute-sets = qnames>
  <!-- Content: xsl:attribute* -->
</xsl:attribute-set>

The xsl:attribute-set element defines a named attribute set: that is, a collection of attribute values that can be used repeatedly on different elements in the result tree. The name attribute specifies the name of the attribute set. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names]. The content of the xsl:attribute-set element consists of zero or more xsl:attribute elements that specify the attributes in the set.

Attribute sets are used by specifying a use-attribute-sets attribute on the xsl:element, xsl:copy (see [11.8 Copying Nodes from the Source Tree to the Result Tree]) or xsl:attribute-set elements. The value of the use-attribute-sets attribute is a whitespace-separated list of names of attribute sets. Each name is specified as a QName, which is expanded as described in [5.1 Qualified Names]. Specifying a use-attribute-sets attribute is equivalent to adding xsl:attribute elements for each of the attributes in each of the named attribute sets to the beginning of the content of the element with the use-attribute-sets attribute, in the same order in which the names of the attribute sets are specified in the use-attribute-sets attribute. [ERR058] It is a dynamic error if use of use-attribute-sets attributes on xsl:attribute-set elements causes an attribute set to use itself, directly or indirectly. The processor must signal the error

Attribute sets can also be used by specifying an xsl:use-attribute-sets attribute on a literal result element. The value of the xsl:use-attribute-sets attribute is a whitespace-separated list of names of attribute sets. The xsl:use-attribute-sets attribute has the same effect as the use-attribute-sets attribute on xsl:element with the additional rule that attributes specified on the literal result element itself are treated as if they were specified by xsl:attribute elements before any actual xsl:attribute elements but after any xsl:attribute elements implied by the xsl:use-attribute-sets attribute. Thus, in the node sequence produced by evaluating the content constructor for a literal result element, attributes from attribute sets named in an xsl:use-attribute-sets attribute will appear first, in the order listed in the attribute; these will be followed by attributes specified on the literal result element will be added; finally, any attributes specified by xsl:attribute elements will appear. Since in a sequence of attribute nodes produced by a content constructor, only the last attribute with a given expanded-QName has any effect (see [5.6 Content Constructors]), this means that attributes specified in attribute sets can be overridden by attributes specified on the literal result element itself.

The content constructor within each xsl:attribute element in an xsl:attribute-set element is evaluated each time the attribute set is used; it is evaluated using the same focus as is used for evaluating the element bearing the use-attribute-sets or xsl:use-attribute-sets attribute. However, it is the position in the stylesheet of the xsl:attribute element rather than of the element bearing the use-attribute-sets or xsl:use-attribute-sets attribute that determines which variable bindings are visible (see [9 Variables and Parameters]); thus, only global variables and parameters and local variables declared within an xsl:attribute instruction are visible.

The following example creates a named attribute set title-style and uses it in a template rule.

<xsl:template match="chapter/heading">
  <fo:block font-stretch="condensed" xsl:use-attribute-sets="title-style">
    <xsl:apply-templates/>
  </fo:block>
</xsl:template>

<xsl:attribute-set name="title-style">
  <xsl:attribute name="font-size">12pt</xsl:attribute>
  <xsl:attribute name="font-weight">bold</xsl:attribute>
</xsl:attribute-set>

Multiple definitions of an attribute set with the same expanded-QName are merged. An attribute from a definition that has higher import precedence takes precedence over an attribute from a definition that has lower import precedence. [ERR059] It is a dynamic error if there are two attribute sets that have the same expanded-QName and equal import precedence and that both contain the same attribute, unless there is a definition of the attribute set with higher import precedence that also contains the attribute. The processor must either signal the error, or must recover by choosing from amongst the definitions that specify the attribute that have the highest import precedence the one that was specified last in declaration order.

Where the attributes in an attribute set were specified is relevant only in merging the attributes into the attribute set; it makes no difference when the attribute set is used. For each attribute set name occurring in a use-attribute-sets attribute on an xsl:attribute-set element, all definitions of an attribute set with that name must be merged before the use-attribute-sets attribute is replaced by the equivalent sequence of xsl:attribute child elements. Any use-attribute-sets attribute on an xsl:attribute-set element must be replaced by the equivalent sequence of xsl:attribute child elements before that xsl:attribute-set element is merged with other xsl:attribute-set elements with the same expanded-QName. When xsl:attribute-set elements with the same expanded-QName are merged, any xsl:attribute child elements added to replace a use-attribute-sets attribute are treated exactly as if they had originally been specified in the stylesheet as child elements.

10.3 Stylesheet Functions

An xsl:function declaration declares the name, parameters, and implementation of a stylesheet function that can be called from any XPath expression within the stylesheet. [ERR060] A stylesheet function must have a prefixed name, to remove any risk of a clash with a system-defined function. It is a static error if the name has no prefix.

NOTE: To prevent the namespace declaration used for the function name appearing in the result document, use the exclude-result-prefixes or exclude-prefixes attribute on the xsl:stylesheet element: see [11.1.3 Namespace Nodes for Literal Result Elements].

10.3.1 Defining a Stylesheet Function

<!-- Category: declaration -->
<xsl:function
  name = qname
  override = "yes" | "no">
  <!-- Content: (xsl:param*, (xsl:variable | xsl:message)*, xsl:result) -->
</xsl:function>

The xsl:function declaration defines a stylesheet function that can be called from any XPath expression used in the stylesheet (including an XPath expression used within a predicate in a pattern). The name attribute specifies the name of the function. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names][ERR061] The name attribute must be in a non-null namespace: that is, it must be written with a prefix..

An xsl:function declaration can only appear as a top-level element in the stylesheet.

The content of the xsl:function element consists of zero or more xsl:param elements that specify the formal arguments of the function, followed by zero or more xsl:variable elements that can be used to compute intermediate results, followed by a mandatory xsl:result element that defines the value to be returned by the function. It is also possible to include xsl:message elements after the xsl:param elements and before the xsl:result, to provide diagnostic output.

When a stylesheet function is called, the number of arguments in the function call must match the number of xsl:param elements in the function definition. Optional arguments are not allowed. [ERR062] Because arguments to a stylesheet function call must all be specified, the xsl:param elements within an xsl:function element must not specify a default value: this means they must be empty, and must have no select attribute.

NOTE: There are no special restrictions on what can appear within an xsl:variable element. For example, the xsl:variable element can contain xsl:apply-templates or xsl:call-template instructions.

If a stylesheet contains declarations of two or more stylesheet functions with the same expanded-QName, the one with highest import precedence is used. [ERR063] It is a static error for a stylesheet to contain two or more functions with the same expanded-QName and the same import precedence, unless there is another function with the same expanded-QName and a higher import precedence.

If a stylesheet function has been defined with a particular expanded-QName, then a call on function-available will return true when called with an argument that is a QName that expands to this same expanded-QName.

NOTE: Functions are not polymorphic. The only factor taken into account when binding a function call to a function implementation is the function name. The binding does not depend on the number of arguments, nor on their type.

The optional override attribute defines what happens if this function has the same name as a function provided by the implementor or made available in the static context using an implementation-defined mechanism. If the override attribute has the value yes, then this function is used in preference; if it has the value no, then the other function is used in preference. The default value is yes.

NOTE: Specifying override="yes" ensures interoperable behavior: the same code will execute with all processors. Specifying override="no" is useful when writing a fallback implementation of a function that is available with some processors but not others: it allows the vendor's implementation of the function to be used in preference to the stylesheet implementation, which is useful since the vendor's implementation will often be more efficient.

The xsl:param elements define the formal arguments to the function. These are interpreted positionally. When the function is called using a function-call in an XPath expression, the first argument supplied is assigned to the first xsl:param element, the second argument supplied is assigned to the second xsl:param element, and so on. [ERR064] It is an static error if the number of arguments supplied in the function call is different from the number of xsl:param elements in the function definition. It is not an error if there are fewer arguments supplied: any excess xsl:param elements will take their default values (see [10.1.1 Passing Parameters to Templates]).

The type attribute of the xsl:param element defines the required type of the parameter. The rules for converting the values of the actual arguments supplied in the function call to the types required by each xsl:param element are defined in [XPath 2.0]. If the value cannot be converted to the required type, a type exception is reported. If the type attribute is omitted, the effective default is item*, which means that no conversion takes place and any value is accepted.

Within the body of the function, the focus is the same as the focus at the point in the XPath expression where the function is called. This focus is used to evaluate expressions contained in xsl:param, xsl:variable, and xsl:result elements within the function. The static context for these expressions is determined by the position of the element within the stylesheet. Specifically, this means that it is not possible within the body of the stylesheet function to access the values of local variables that were in scope in the place where the function call was written. Global variables, however, remain available.

[ERR065] Within the body of a stylesheet function, the focus is initially undefined; this means that any attempt to reference the context item, context position, or context size is a dynamic error. The processor must signal the error. It is not possible within the body of the stylesheet function to access the values of local variables that were in scope in the place where the function call was written. Global variables, however, remain available.

Any xsl:variable elements within the function are evaluated in the normal way: (see [9 Variables and Parameters]).

10.3.2 Returning the Result

<xsl:result
  select = expression
  type = sequence-type
  type-information = "strict" | "lax" | "preserve" | "none">
  <!-- Content: content-constructor -->
</xsl:result>

The xsl:result element always appears (and only appears) as the last child of an xsl:function element. It defines the result that is returned when the function is evaluated.

The value is established in the same way as for an xsl:variable element: if there is a select attribute, the result is obtained by evaluating the expression contained in the select attribute; if the xsl:result element is not empty, the result is the document node of a temporary tree constructed as described in [9.1 Values of Variables and Parameters] obtained by evaluating the content constructor and adding the resulting sequence of nodes to a newly constructed document node. If there is no select attribute and no content constructor, the result is a zero-length string ; it is an static error if there is both a select attribute and a non-empty content constructor.

The optional type attribute indicates the required type of the result. The value of the type attribute is a SequenceType, as defined in [XPath 2.0]. [ERR066] If the type attribute is specified, then the calculated result is converted to the required type, using the rules defined in [XPath 2.0] for the conversion of a supplied value to a required type when calling a function. It is a type error if this conversion fails. If the type attribute is omitted, the calculated result is used as supplied, and no conversion takes place.

The type-information attribute determines what type annotations will be present on the element and attribute nodes of the constructed tree. This attribute must be omitted if the content of the xsl:result element is empty. The default value is none. For details, see [9.2 Temporary Trees].

The following example creates a stylesheet function named str:reverse that reverses the words in a supplied sentence, and then invokes this function from within a template rule.

<xsl:transform 
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
  xmlns:str="http://example.com/namespace"
  version="2.0"
  exclude-prefixes="str">

<xsl:function name="str:reverse">
  <xsl:param name="sentence"/>
  <xsl:result 
     select="if (contains($sentence, ' '))
             then concat(str:reverse(substring-after($sentence, ' ')),
                         ' ',
                         substring-before($sentence, ' '))
             else $sentence"/>             
</xsl:function>

<xsl:template match="/">
<output>
  <xsl:value-of select="str:reverse('DOG BITES MAN')"/>
</output>
</xsl:template>

</xsl:transform>

The following example illustrates the use of variables in a function definition. It returns a string containing the representation of its integer argument, expressed as a roman numeral. For example, the function call num:roman(7) will return the string "vii". This example uses the xsl:number instruction, described in [12 Numbering].

<xsl:function name="num:roman">
  <xsl:param name="value" type="xs:integer"/>
  <xsl:variable name="roman-number">
    <xsl:number value="$value" format="i"/>
  </xsl:variable>
  <xsl:result select="string($roman-number)" type="xs:string"/>             
</xsl:function>

11 Creating New Nodes

This section describes instructions that directly create new nodes. These nodes will typically be assembled to form the result tree.

11.1 Literal Result Elements

In a content constructor, an element in the stylesheet that does not belong to the XSLT namespace and that is not an extension instruction (see [18.2 Extension Instructions]) is classified as a literal result element. A literal result element is evaluated to construct a new element node with the same expanded-QName. The result of evaluating a literal result element is a node sequence containing one element, the newly constructed element node.

The content of the element is a content constructor, which is evaluated to return a sequence of nodes that define the attributes and children of the created element node. The children of the newly constructed element node will be the sequence of elements, text nodes, comment nodes, and processing instructions that results from evaluating this content constructor.

11.1.1 Setting the Type Annotation for Literal Result Elements

The attribute xsl:type-annotation may be used on a literal result element to indicate the type of the new element node. The value of the attribute is a QName identifying either a global type definition that is present in a schema imported using an xsl:import-schema declaration, or a built-in type definition. The type annotation of the new node will be set to this QName, and the typed value of the new node will be the result of casting the string-value of the node to this type. If the attribute is omitted, the new element node will have a type annotation of xs:anyType, and a typed value that is the same as the string value.

[ERR067] It is a static error if the value of the xsl:type-annotation attribute is not a QName, or is a QName whose prefix cannot be resolved using the in-scope namespace declarations, or is a QName that is not either the name of a built-in schema type, or the name of a global type definition in a schema imported using an xsl:import-schema declaration.

[ERR068] When the xsl:type-annotation attribute is used on a literal result element, it is a dynamic error to construct an element node as a child of the new element. Type annotations can be added only to element nodes that have simple content. The processor must signal the error.

[ERR069] When the xsl:type-annotation attribute is used on a literal result element, it is a dynamic error if the string-value of the new element cannot be converted to the named type under the rules of the XPath cast expression. These rules are defined in [Functions and Operators]. The processor must signal the error.

11.1.2 Attribute Nodes for Literal Result Elements

The created element node will have an attribute corresponding to each attribute node that is present on the element node in the stylesheet tree, other than attributes with names in the XSLT namespace.

The value of an attribute of a literal result element is interpreted as an attribute value template: it can therefore contain expressions contained in curly braces ({}). The attribute node created in the result tree will have the same name as the attribute in the source tree, and its string-value will be the same as the effective value of the attribute in the source tree. The type annotation on the attribute node will be xs:anySimpleType, and the typed value of the node will be the same as the string value..

Additional attributes may be generated by including xsl:attribute instructions in the content constructor, or by specifying the xsl:use-attribute-sets attribute on the literal result element itself. The way in which conflicts among these attributes are resolved is described in [10.2 Named Attribute Sets].

NOTE: The xml:base, xml:lang and xml:space attributes have two effects in XSLT. They behave as standard XSLT attributes, which means for example that if they appear on a literal result element, they will be copied to the result tree in the same way as any other attribute. In addition, they have their standard meaning as defined in the core XML specifications. Thus, an xml:base attribute in the stylesheet affects the base URI of the element on which it appears, and an xml:space attribute affects the interpretation of whitespace nodes within that element. One consequence of this is that these attributes should not be written as attribute value templates: although an XSLT processor will understand this notation, the XML parser will not.
NOTE: The same is true of the schema-defined attributes xsi:type, xsi:nil, and xsi:schemaLocation. If the stylesheet is processed by a schema processor, these attributes will be recognized and interpreted by the schema processor, but they have no special meaning to the XSLT processor. The attributes are copied to the result tree in the same way as any other attribute. If the result tree is validated, the copied attributes will again be recognized and interpreted by the schema processor.

Issue 134 (xsi-schema-location): If an xsi:schemaLocation attribute is present in a constructed tree that is subjected to validation, does the attribute have any effect?

None of these attributes will be generated in the result tree unless the stylesheet writes them to the result tree explicitly.

11.1.3 Namespace Nodes for Literal Result Elements

The created element node will also have a copy of the namespace nodes that were present on the element node in the stylesheet tree with the exception of any namespace node whose string-value (before the application of any namespace aliases: see [11.1.4 Namespace Aliasing]) is designated as an excluded namespace.

The following namespaces are designated as excluded namespaces:

  • The XSLT namespace URI (http://www.w3.org/1999/XSL/Transform)

  • A namespace URI declared as an extension namespace (see [18.2 Extension Instructions])

  • A namespace URI designated by using an [xsl:]exclude-result-prefixes or [xsl:]exclude-prefixes attribute either on the literal result element itself or on an ancestor element. The attribute must be in the XSLT namespace only if its parent element is not in the XSLT namespace.

    The value of the attribute is a whitespace-separated list of tokens, each of which is either a namespace prefix, or #default, or #all. The namespace bound to each of the prefixes is designated as an excluded namespace. It is a static error if there is no namespace bound to the prefix on the element bearing the [xsl:]exclude-result-prefixes or [xsl:]exclude-prefixes attribute.

    The default namespace (as declared by xmlns) may be designated as an excluded namespace by including #default in the list of namespace prefixes.

    The value #all indicates that all namespaces are designated as excluded namespaces. In this case, any other prefixes are ignored.

    The designation of a namespace as an excluded namespace is effective within the subtree of the stylesheet rooted at the element bearing the [xsl:]exclude-result-prefixes or [xsl:]exclude-prefixes attribute; a subtree rooted at an xsl:stylesheet element does not include any stylesheet modules imported or included by children of that xsl:stylesheet element.

The excluded namespaces, as described above, only affect namespace nodes copied from the stylesheet when processing a literal result element. There is no guarantee that an excluded namespace will not appear on the result tree for some other reason. Namespaces are also written to the result tree as part of the process of namespace fixup (see [4.5 Namespace Fixup]), or as the result of instructions such as xsl:copy and xsl:element.

NOTE: The [xsl:]exclude-prefixes attribute, which is new in XSLT 2.0, affects namespaces copied from the source document using xsl:copy-of, as well as namespaces copied from the stylesheet using literal result elements. The [xsl:]exclude-result-prefixes attribute, which is retained from XSLT 1.0, affects literal result elements only.
NOTE: When a stylesheet uses a namespace declaration only for the purposes of addressing the source tree, specifying the prefix in the [xsl:]exclude-result-prefixes or [xsl:]exclude-prefixes attribute will avoid superfluous namespace declarations in the result tree. The attribute is also useful to prevent namespaces used solely for the naming of extension functions from appearing in the result tree.

11.1.4 Namespace Aliasing

When a stylesheet is used to define a transformation whose output is itself a stylesheet module, or in certain other cases where the result document uses namespaces that it would be inconvenient to use in the stylesheet, namespace aliasing can be used to declare a mapping between a namespace URI used in the stylesheet and the corresponding namespace URI to be used in the result document.

A namespace URI in the stylesheet tree that is being used to specify a namespace URI in the result tree is called a literal namespace URI. This applies to:

  • the namespace URI in the expanded-QName of a literal result element in the stylesheet

  • the namespace URI in the expanded-QName of an attribute specified on a literal result element in the stylesheet

  • the string-value of a namespace node on a literal result element in the stylesheet.

<!-- Category: declaration -->
<xsl:namespace-alias
  stylesheet-prefix = prefix | "#default"
  result-prefix = prefix | "#default" />

A stylesheet can use the xsl:namespace-alias element to declare that one namespace URI is an alias for another namespace URI. When a literal namespace URI has been declared to be an alias for another namespace URI, then the namespace URI in the result tree will be the namespace URI that the literal namespace URI is an alias for, instead of the literal namespace URI itself.

The xsl:namespace-alias element declares that the namespace URI bound to the prefix specified by the stylesheet-prefix attribute is an alias for the namespace URI bound to the prefix specified by the result-prefix attribute. Thus, the stylesheet-prefix attribute specifies the namespace URI that will appear in the stylesheet, and the result-prefix attribute specifies the corresponding namespace URI that will appear in the result tree. The namespace prefix that will be used in the serialized output document is implementation-dependent.

The default namespace (as declared by xmlns) may be specified by using #default instead of a prefix.

If a namespace URI is declared to be an alias for multiple different namespace URIs, then the declaration with the highest import precedence is used. [ERR070] It is a dynamic error if there is more than one such declaration and different values for namespace-uri. The processor must either signal the error, or must recover by choosing, from amongst the declarations with the highest import precedence, the one that occurs last in declaration order.

Issue 126 (duplicate-ns-alias-error-type): Why is it a dynamic error, rather than a static error, to have duplicate inconsistent namespace-alias declarations? (This is unchanged from XSLT 1.0, but seems wrong.)

When literal result elements are being used to create element, attribute, or namespace nodes that use the XSLT namespace URI, the stylesheet should use an alias.

For example, the stylesheet

<xsl:stylesheet
  version="2.0"
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
  xmlns:fo="http://www.w3.org/1999/XSL/Format"
  xmlns:axsl="file://namespace.alias">

<xsl:namespace-alias stylesheet-prefix="axsl" result-prefix="xsl"/>

<xsl:template match="/">
  <axsl:stylesheet version="1.0">
    <xsl:apply-templates/>
  </axsl:stylesheet>
</xsl:template>

<xsl:template match="block">
  <axsl:template match="{.}">
     <fo:block><axsl:apply-templates/></fo:block>
  </axsl:template>
</xsl:template>

</xsl:stylesheet>

will generate an XSLT stylesheet from a document of the form:

<elements>
<block>p</block>
<block>h1</block>
<block>h2</block>
<block>h3</block>
<block>h4</block>
</elements>

The output of the transformation will be a stylesheet such as the following (whitespace added for clarity):

<xsl:stylesheet
  version="1.0"
  xmlns:xsl="http://www.w3.org/1999/XSL/Transform" 
  xmlns:fo="http://www.w3.org/1999/XSL/Format">
  
<xsl:template match="p">
  <fo:block><xsl:apply-templates/></fo:block>
</xsl:template>

<xsl:template match="h1">
  <fo:block><xsl:apply-templates/></fo:block>
</xsl:template>

<xsl:template match="h2">
  <fo:block><xsl:apply-templates/></fo:block>
</xsl:template>

<xsl:template match="h3">
  <fo:block><xsl:apply-templates/></fo:block>
</xsl:template>

<xsl:template match="h4">
  <fo:block><xsl:apply-templates/></fo:block>
</xsl:template>

</xsl:stylesheet>
NOTE: It may be necessary also to use aliases for namespaces other than the XSLT namespace URI. For example, literal result elements belonging to a namespace dealing with digital signatures might cause XSLT stylesheets to be mishandled by general-purpose security software; using an alias for the namespace would avoid the possibility of such mishandling.

11.2 Creating Element Nodes using xsl:element

<!-- Category: instruction -->
<xsl:element
  name = { qname }
  namespace = { uri-reference }
  use-attribute-sets = qnames
  type-annotation = qname>
  <!-- Content: content-constructor -->
</xsl:element>

The xsl:element instruction allows an element to be created with a computed name. The expanded-QName of the element to be created is specified by a required name attribute and an optional namespace attribute. The content of the xsl:element instruction is a content constructor for the attributes and children of the created element.

The result of evaluating the xsl:element instruction, except in error cases, is the newly constructed element node.

The name attribute is interpreted as an attribute value template. [ERR071] It is an dynamic error if the effective value is not a QName. The processor must either signal the error, or must recover by making the result of evaluating the xsl:element element be the sequence of nodes created by evaluating the content of the xsl:element element, excluding any initial attribute nodes.

If the namespace attribute is not present then the QName is expanded into an expanded-QName using the namespace declarations in effect for the xsl:element element, including any default namespace declaration.

If the namespace attribute is present, then it too is interpreted as an attribute value template. The effective value should be a URI reference. It is not an error if the string is not a syntactically legal URI reference. If the string is zero-length, then the expanded-QName of the element has a null namespace URI. Otherwise, the string is used as the namespace URI of the expanded-QName of the element to be created. The local part of the QName specified by the name attribute is used as the local part of the expanded-QName of the element to be created.

Implementations may make use of the prefix of the QName specified in the name attribute when selecting the prefix used for outputting the created element as XML; however, they are not required to do so.

For the effect of the use-attribute-sets attribute, see [10.2 Named Attribute Sets]

11.2.1 Setting the Type Annotation for a Constructed Element Node

The attribute type-annotation may be used to indicate the type of the new element node. The value of the attribute is a QName identifying either a global type definition that is present in a schema imported using an xsl:import-schema declaration, or a built-in type definition. The type annotation of the new node will be set to this QName, and the typed value of the new node will be the result of casting the string-value of the node to this type. If the attribute is omitted, the new element node will have a type annotation of xs:anyType, and a typed value that is the same as the string value.

Issue 141 (annotate-complex-types): Should the type-annotation attribute on xsl:element (and xsl:type-annotation on literal result elements) be extended to support complex types?

[ERR072] It is a static error if the value of the type-annotation attribute is not a QName, or is a QName whose prefix cannot be resolved using the in-scope namespace declarations, or is a QName that is not either the name of a built-in schema type, or the name of a global type definition in a schema imported using an xsl:import-schema declaration.

[ERR073] When the type-annotation attribute is used it is a dynamic error to construct an element node as a child of the new element. Type annotations can be added only to element nodes that have simple content. The processor must signal the error.

[ERR074] When the type-annotation attribute is used it is a dynamic error if the string-value of the new element cannot be converted to the named type under the rules of the XPath cast expression. These rules are defined in [Functions and Operators]. The processor must signal the error.

11.3 Creating Attribute Nodes using xsl:attribute

<!-- Category: instruction -->
<xsl:attribute
  name = { qname }
  namespace = { uri-reference }
  type-annotation = qname
  disable-output-escaping = "yes" | "no">
  <!-- Content: content-constructor -->
</xsl:attribute>

Issue 131 (creating-id-and-idref): What are the implications of creating attributes of type ID and IDREF[S] in a result tree? What guarantees of uniqueness or referential integrity are provided? Is there a need to create temporary trees containing IDREF values that are not resolved within the temporary tree, but will be resolved when the nodes are copied to a final result tree? What does the id function do?

The xsl:attribute element can be used to add attributes to result elements whether created by literal result elements in the stylesheet or by instructions such as xsl:element. The expanded-QName of the attribute to be created is specified by a required name attribute and an optional namespace attribute. The result of evaluating an xsl:attribute instruction is the newly constructed attribute node.

The content of the xsl:attribute element is a content constructor for the value of the created attribute.

The name attribute is interpreted as an attribute value template. [ERR075] It is a dynamic error if the effective value is not a QName or is the string xmlns. The processor must either signal the error, or must recover by not adding the attribute to the result tree.

If the namespace attribute is not present, then the QName is expanded into an expanded-QName using the namespace declarations in effect for the xsl:attribute element, not including any default namespace declaration.

If the namespace attribute is present, then it too is interpreted as an attribute value template. The effective value should be a URI reference. It is not an error if the string is not a syntactically legal URI reference. If the string is zero-length, then the expanded-QName of the attribute has a null namespace URI. Otherwise, the string is used as the namespace URI of the expanded-QName of the attribute to be created. The local part of the QName specified by the name attribute is used as the local part of the expanded-QName of the attribute to be created.

Implementations may make use of the prefix of the QName specified in the name attribute when selecting the prefix used for outputting the created attribute as XML; however, they are not required to do so and, if the prefix is xmlns, they must not do so.

Thus, although it is not an error to write:

<xsl:attribute name="xmlns:xsl" 
   namespace="file://some.namespace">http://www.w3.org/1999/XSL/Transform</xsl:attribute>

it will not result in the namespace declaration xmlns:xsl="http://www.w3.org/1999/XSL/Transform" being output. Instead, it will produce an attribute node with local name xsl, and with a system-allocated namespace prefix mapped to the namespace URI file://some.namespace.

As described in [5.6 Content Constructors], any attribute nodes must appear in the result of evaluating a content constructor before any element, text, comment, or processing instruction nodes, but after any namespace nodes. Where the result of the content constructor contains two or more attribute nodes with the same expanded-QName, the one that comes last is the only one that takes effect. When evaluating the content of a node other than an element, returning a node sequence that includes attribute nodes is a dynamic error; the recovery action (if any) depends on the instruction that makes use of the constructed node sequence.

NOTE: When an xsl:attribute element contains a text node with a newline, then the XML output must contain a character reference.

For the effect of the disable-output-escaping attribute, see [20.5 Disabling Output Escaping]

11.3.1 Setting the Type Annotation for a Constructed Attribute Node

The attribute type-annotation may be used to indicate the type of the new attribute node. The value of the attribute is a QName identifying either a global type definition that is present in a schema imported using an xsl:import-schema declaration, or a built-in type definition. The type annotation of the new node will be set to this QName, and the typed value of the new node will be the result of casting the string-value of the node to this type. If the attribute is omitted, the new attribute node will have a type annotation of xs:anySimpleType, and a typed value that is the same as the string value.

[ERR076] It is a static error if the value of the type-annotation attribute is not a QName, or is a QName whose prefix cannot be resolved using the in-scope namespace declarations, or is a QName that is not either the name of a built-in schema type, or the name of a global type definition in a schema imported using an xsl:import-schema declaration.

[ERR077] When the type-annotation attribute is used it is a dynamic error if the string-value of the new attribute cannot be converted to the named type under the rules of the XPath cast expression. These rules are defined in [Functions and Operators]. The processor must signal the error.

11.4 Creating Text Nodes

11.4.1 Literal Text Nodes

A content constructor can also contain text nodes. Each text node in a content constructor remaining after whitespace has been stripped as specified in [4.4 Whitespace Stripping] will construct a text node with the same string-value. The resulting text node is added to the result of the containing content constructor. When the resulting content is added to a result tree, adjacent text nodes in the result tree are automatically merged.

Note that text is processed at the tree level. Thus, markup of &lt; in a template will be represented in the stylesheet tree by a text node that includes the character <. This will create a text node in the result tree that contains a < character, which will be represented by the markup &lt; (or an equivalent character reference) when the result tree is externalized as an XML document (unless output escaping is disabled as described in [20.5 Disabling Output Escaping]).

11.4.2 Creating Text Nodes using xsl:text

<!-- Category: instruction -->
<xsl:text
  disable-output-escaping = "yes" | "no">
  <!-- Content: content-constructor -->
</xsl:text>

The xsl:text element is evaluated to contruct a new text node. The content of the xsl:text element is a content constructor for the string-value of the text node.

Issue 139 (nested-xsl-text): At XSLT 1.0 the xsl:text element only allowed PCDATA content. It has been generalized largely for consistency with xsl:comment and xsl:attribute, etc. Is this generalization actually useful?

The result of evaluating the xsl:text instruction is a single node, the newly constructed text node.

Text nodes that are immediate children of the xsl:text instruction will not be stripped from the stylesheet tree, even if they consist entirely of whitespace (see [4.4 Whitespace Stripping]).

For the effect of the disable-output-escaping attribute, see [20.5 Disabling Output Escaping]

Issue 132 (nested-d-o-e): The xsl:text element may now contain (or cause the evaluation of) nested xsl:text and xsl:value-of instructions. This makes it necessary to define the meaning of nested disable-output-escaping attributes. Should the default be that escaping is enabled, or should it be "no change from the previous setting"?

[ERR078] It is a static error to specify disable-output-escaping="yes" on an xsl:text instruction that has element node children.

NOTE: It is not always necessary to use the xsl:text instruction to write text nodes to the result tree. Literal text can be written to the result tree by including it anywhere in a content constructor, while computed text can be output using the xsl:value-of instruction. The principal reason for using xsl:text is that it offers improved control over whitespace handling.

11.5 Creating Processing Instructions

<!-- Category: instruction -->
<xsl:processing-instruction
  name = { ncname }>
  <!-- Content: content-constructor -->
</xsl:processing-instruction>

The xsl:processing-instruction element is evaluated to create a processing instruction node. The content of the xsl:processing-instruction element is a content constructor for the string-value of the processing instruction node. The xsl:processing-instruction element has a required name attribute that specifies the name of the processing instruction node. The value of the name attribute is interpreted as an attribute value template.

Except in error situations, the result of evaluating the xsl:processing-instruction instruction is a single node, the newly constructed processing instruction.

For example, this

<xsl:processing-instruction name="xml-stylesheet">
  <xsl:text>href="book.css" type="text/css"</xsl:text>
</xsl:processing-instruction>

would create the processing instruction

<?xml-stylesheet href="book.css" type="text/css"?>

[ERR079] It is a dynamic error if the effective value of the name attribute is not both an NCName and a PITarget. The processor must either signal the error, or must recover by returning an empty sequence.

NOTE: This means that xsl:processing-instruction cannot be used to output an XML declaration. The xsl:output declaration should be used to control this instead (see [20 Serialization]).

[ERR080] It is a dynamic error if the result of evaluating the content of the xsl:processing-instruction contains the string ?>. The processor must either signal the error, or must recover by inserting a space after any occurrence of ? that is followed by a >

11.6 Creating Namespace Nodes

<!-- Category: instruction -->
<xsl:namespace
  name = { ncname }>
  <!-- Content: content-constructor -->
</xsl:namespace>

The xsl:namespace element is evaluated to create a namespace node. The content of the xsl:namespace element is a content constructor for the string-value of the namespace node (that is, the namespace URI). The xsl:namespace element has a required name attribute that specifies the name of the namespace node (that is, the namespace prefix). The value of the name attribute is interpreted as an attribute value template.

Except in error situations, the result of evaluating the xsl:namespace instruction is a single node, the newly constructed namespace node. Note the restrictions described in [5.6 Content Constructors] for the position of a namespace node relative to other nodes in the node sequence returned by a content constructor.

For example, this

<xsl:namespace name="xsd">http://www.w3.org/2001/XMLSchema</xsl:namespace>

would typically cause the output document to contain the namespace declaration:

xmlns:xsd="http://www.w3.org/2001/XMLSchema"

[ERR081] It is a dynamic error if the effective value of the name attribute is neither a zero-length string nor an NCName. The processor must either signal the error, or must recover by returning an empty sequence.

[ERR082] It is an dynamic error if evaluating the content of xsl:namespace results in a zero-length string . The processor must either signal the error, or must recover by returning an empty sequence.

NOTE: It is rarely necessary to use xsl:namespace to create a namespace node in the result tree; in most circumstances, the required namespace nodes will be created automatically, as a side-effect of writing elements or attributes that use the namespace. An example where xsl:namespace is needed is a situation where the required namespace is used only within attribute values in the result document, not in element or attribute names; especially where the required namespace prefix or namespace URI is computed at run-time and is not present in either the source document or the stylesheet.

Ed. Note: We need to add text explaining how the new namespace node is effectively merged with other matching namespace nodes in the same document, once the data model is finalized.

11.7 Creating Comments

<!-- Category: instruction -->
<xsl:comment>
  <!-- Content: content-constructor -->
</xsl:comment>

The xsl:comment element is evaluated to contruct a new comment node. The content of the xsl:comment element is a content constructor for the string-value of the comment node.

The result of evaluating the xsl:comment instruction is a single node, the newly constructed comment node.

For example, this

<xsl:comment>This file is automatically generated. Do not edit!</xsl:comment>

would create the comment

<!--This file is automatically generated. Do not edit!-->

[ERR083] It is a dynamic error if the result of evaluating the content of the xsl:comment contains the string -- or ends with -. The processor must either signal the error, or must recover by inserting a space after any occurrence of - that is followed by another - or that ends the comment.

11.8 Copying Nodes from the Source Tree to the Result Tree

11.8.1 Shallow Copy

<!-- Category: instruction -->
<xsl:copy
  use-attribute-sets = qnames>
  <!-- Content: content-constructor -->
</xsl:copy>

The xsl:copy instruction provides an easy way of copying the context item. If the context item is a node, and is not a document node, evaluating the xsl:copy instruction constructs a copy of the context node, and the result of the xsl:copy instruction is this newly constructed node. The namespace nodes of the context node are automatically copied as well, but the attributes and children of the node are not automatically copied. The content of the xsl:copy element is a content constructor for the attributes and children of the created node; the content constructor is evaluated only for nodes of types that can have children (that is, document nodes and element nodes).

The xsl:copy element may have a use-attribute-sets attribute (see [10.2 Named Attribute Sets]). This is used only when copying element nodes.

The document node is treated specially because the document node of a result tree is created implicitly. When the context item is a document node, xsl:copy will not create a document node, but will just evaluate the content constructor, and return the resulting node sequence as the result of the xsl:copy instruction.

For example, the identity transformation can be written using xsl:copy as follows:

<xsl:template match="@*|node()">
  <xsl:copy>
    <xsl:apply-templates select="@*|node()"/>
  </xsl:copy>
</xsl:template>

[ERR084] When the context item is an attribute node, then if it would be a dynamic error to use xsl:attribute to create an attribute with the same name as the context item, then it is also a dynamic error to use xsl:copy (see [11.3 Creating Attribute Nodes using xsl:attribute]). The processor must either signal the error, or must recover by returning an empty sequence.

When the context item is a namespace node, then xsl:copy constructs a new namespace node as a copy of the context node (that is, a namespace node with the same expanded-QName and string-value. As described in detail in [5.6 Content Constructors], it will generally be an error if this namespace node is preceded in the constructed node sequence by any node other than another namespace node, or if the namespace node cannot be added to a containing element without a conflict arising.

When the context item is not a node, the effect of the xsl:copy instruction is the same as evaluating the instruction <xsl:value-of select="."/>.

The following example shows how xml:lang attributes can be easily copied through from source to result. If a stylesheet defines the following named template:

<xsl:template name="apply-templates-copy-lang">
 <xsl:for-each select="@xml:lang">
   <xsl:copy/>
 </xsl:for-each>
 <xsl:apply-templates/>
</xsl:template>

then it can simply do

<xsl:call-template name="apply-templates-copy-lang"/>

instead of

<xsl:apply-templates/>

when it wants to copy the xml:lang attribute.

Issue 123 (copying-type-annotation): We need to specify how xsl:copy and xsl:copy-of handle type annotations on the nodes being copied. (For example, does an ID attribute in the source tree become an ID attribute in the result tree?)

Ed. Note: This is a poor example because it would be easier to write <xsl:copy-of select="@xml:lang"/>

11.8.2 Deep Copy

<!-- Category: instruction -->
<xsl:copy-of
  select = expression
  separator = { string }
  copy-type-annotations = "yes" | "no" />

The xsl:copy-of instruction can be used to construct a copy of a sequence of nodes, with each new node containing copies of all the children, attributes, and (by default) namespaces of the original node, recursively. The result of evaluating the instruction is a sequence of new nodes corresponding one-to-one with the supplied node sequence, and retaining its order. (This correspondence does not apply when copying a document node: see below).

The xsl:copy-of instruction can also be used to copy atomic values.

The mandatory select attribute contains an expression. The required type of this expression is sequence, which means that any sequence can be supplied. The items in this sequence are processed as follows:

  • If the item is an element node, a new element is constructed and appended to the result sequence. The new element will have the same expanded-QName as the original, and it will have copies of the attribute nodes and children of the element node.

    The new element will also have namespace nodes copied from the original element node, unless they are excluded by means of the [xsl:]exclude-prefixes attribute appearing on the xsl:copy-of element or an ancestor element. For details of this attribute, see [11.1.3 Namespace Nodes for Literal Result Elements]. Excluding a namespace in this way will not prevent the namespace being written to the result tree if this is required by the namespace fixup process: see [4.5 Namespace Fixup].

  • If the item is a document node, the instruction copies the children of the document node (each according to the rules for its own node type) and adds the copies, in order, to the result sequence.

  • If the item is an attribute or namespace node, or a text node, a comment, or a processing instruction, the same rules apply as with xsl:copy (see [11.8 Copying Nodes from the Source Tree to the Result Tree]).

  • If the item is an atomic value, the result is converted to a string, a new text node is constructed with this string as its string-value, and the new text node is appended to the result sequence, as with xsl:value-of.

If the copy-type-annotations attribute is present and has the value yes, then all element and attribute nodes created using this instruction (including nodes copied as a result of the recursive action of the instruction) will have the same type annotations and the same typed value as the nodes from which they were copied. If the attribute is omitted, or has the value no, then the new nodes will be annotated as xs:anyType (for elements) or xs:anySimpleType (for attributes), and will have a typed value that is the same as the string value.

Issue 140 (modified-copy): Should we try to provide options for xsl:copy-of to produce a modified copy, for example, a copy omitting certain attributes? One possibility would be to define copy in terms of xsl:apply-templates with a special mode, so that template rules for that mode could override the standard action.

Issue 142 (copy-types-default): What should the default be for the copy-type-annotation attribute on the xsl:copy-of element?

If the separator attribute is present, then its effective value (a string) is inserted as a text node into the result sequence after the result of processing each item in the input sequence, other than the last. If the separator attribute is absent, the effect is the same as supplying a zero-length string .

For example, the instruction:

<x><xsl:copy-of select="(1,2,3,4)" separator="|"/></x>

produces the output:

<x>1|2|3|4</x>

11.9 Generating Text with xsl:value-of

Within a content constructor, the xsl:value-of instruction can be used to compute generated text, for example by extracting text from the source tree or by inserting the value of a variable. The xsl:value-of instruction computes this text using an expression that is specified as the value of the select attribute.

NOTE: An alternative way of computing a string value, applicable only when the resulting text is to be used within a generated attribute node, is to use an attribute value template, enclosing the expression in curly braces ({}) within an attribute of a literal result element.

<!-- Category: instruction -->
<xsl:value-of
  select = expression
  separator = { string }
  disable-output-escaping = "yes" | "no" />

The xsl:value-of instruction is evaluated to construct a new text node; the result of the instruction is the newly constructed text node. But if the rules below produce a text node whose string value is the zero-length string , the result of the instruction is an empty sequence. The required select attribute is an expression. The required type of this expression is sequence, which means that the result of the expression may be any sequence. If the sequence is empty, no text node will be created. If the sequence contains a single item, the resulting text node will have a string-value that is the same as the string-value of this item. If the sequence contains more than one item, the effect depends on whether the separator attribute is present.

If the separator attribute is present, then the string-value of the newly constructed text node will be the concatenation of the string-values of the items in the sequence that results from evaluating the select attribute, with each of these string-values except the last being followed by the string that is the effective value of the separator attribute. If the separator attribute is absent, then (for backwards compatibility with XSLT 1.0) all items in the input sequence other than the first are ignored. If the effective value of the separator attribute is a zero-length string , then all items in the input sequence are processed and the results are concatenated with no separator.

For example, the instruction:

<x><xsl:value-of select="(1,2,3,4)" separator="|"/></x>

produces the output:

<x>1|2|3|4</x>
NOTE: The separator attribute is used only when the select expression produces a sequence of items. It is not used when the select expression returns a single node whose typed-value is a sequence, for example a node of type IDREFS. The string-value of such a node is obtained in the normal way, as if by calling the string function.

Issue 121 (xsl-value-of-atomization): The rule that separators are not inserted when a single item in the sequence selected by xsl:value-of has a sequence as its value is probably wrong. It would be more logical to atomize the sequence first, and then insert separators.

NOTE: The xsl:copy-of element can be used to copy a sequence of nodes to the result tree without converting to a string. See [11.8.2 Deep Copy].

For example, the following creates an HTML paragraph from a person element with given-name and family-name attributes. The paragraph will contain the value of the given-name attribute of the context node followed by a space and the value of the family-name attribute of the context node.

<xsl:template match="person">
  <p>
   <xsl:value-of select="@given-name"/>
   <xsl:text> </xsl:text>
   <xsl:value-of select="@family-name"/>
  </p>
</xsl:template>

For another example, the following creates an HTML paragraph from a person element with given-name and family-name children elements. The paragraph will contain the string-value of the first given-name child element of the context node followed by a space and the string-value of the first family-name child element of the context node.

<xsl:template match="person">
  <p>
   <xsl:value-of select="given-name"/>
   <xsl:text> </xsl:text>
   <xsl:value-of select="family-name"/>
  </p>
</xsl:template>

The following precedes each procedure element with a paragraph containing the security level of the procedure. It assumes that the security level that applies to a procedure is determined by a security attribute on the procedure element or on an ancestor element of the procedure. It also assumes that if more than one such element has a security attribute then the security level is determined by the element that is closest to the procedure.

<xsl:template match="procedure">
  <fo:block>
    <xsl:value-of select="ancestor-or-self::*[@security][1]/@security"/>
  </fo:block>
  <xsl:apply-templates/>
</xsl:template>

For the effect of the disable-output-escaping attribute, see [20.5 Disabling Output Escaping]

12 Numbering

<!-- Category: instruction -->
<xsl:number
  level = "single" | "multiple" | "any"
  count = pattern
  from = pattern
  value = number-expression
  format = { string }
  lang = { nmtoken }
  letter-value = { "alphabetic" | "traditional" }
  grouping-separator = { char }
  grouping-size = { number } />

The xsl:number instruction is used to create a formatted number. The result of the instruction is a newly constructed text node containing the formatted number as its string-value.

The xsl:number instruction performs two tasks: firstly, determining a place marker to be formatted (this is a sequence of integers, to allow for hierarchic numbering schemes such as 1.12.2 or 3(c)ii), and secondly, formatting the place marker for output as a text node in the result tree. The place marker to be formatted can either be supplied directly, in the value attribute, or it can be computed based on the position of the context node within the tree that contains it.

NOTE: The facilities described in this section are specifically designed to enable the calculation and formatting of section numbers, paragraph numbers, and the like. For formatting of other numeric quantities, the format-number function may be more suitable: see [16.4 Number Formatting].

12.1 Formatting a Supplied Number

The place marker to be formatted may be specified by an expression. The value attribute contains the expression. The required type of the expression is xs:integer*, that is, a sequence of integers. The expression is evaluated, to produce a sequence, and each item in this sequence is converted to an integer using the rules of the XPath cast construct.

[ERR085] It is a dynamic error if any item in the sequence cannot be converted to an integer, or if the resulting integer is less than 1 (one). The processor must either signal the error, or must recover by converting that member to a string as if by a call to the string function and inserting the resulting string into the formatted result string in its proper position.

Otherwise, the sequence is formatted as a string using the effective values of the attributes specified in [12.3 Number to String Conversion Attributes]; each of these attributes is interpreted as an attribute value template. After conversion, the xsl:number element constructs a new text node containing the resulting string, and returns this node.

The following example numbers a sorted list:

<xsl:template match="items">
  <xsl:for-each select="item">
    <xsl:sort select="."/>
    <p>
      <xsl:number value="position()" format="1. "/>
      <xsl:value-of select="."/>
    </p>
  </xsl:for-each>
</xsl:template>

12.2 Numbering based on Position in a Document

If no value attribute is specified, then the xsl:number instruction returns a new text node containing a formatted place marker that is based on the position of the context node within its containing document.

[ERR086] It is a dynamic error if the xsl:number instruction is evaluated, with no value attribute, when the context item is not a node. The processor must either signal the error, or must recover by returning an empty sequence.

The following attributes control how the context node is to be numbered:

In addition, the attributes specified in [12.3 Number to String Conversion Attributes] are used for number to string conversion, as in the case when the value attribute is specified.

The xsl:number element first constructs a sequence of positive integers using the level, count and from attributes. Where level is single or any, this sequence will either be empty or contain a single number; where level is multiple, the sequence may be of any length. The sequence is constructed as follows:

Let matches-count($node) be a function that returns true if the given node matches the pattern given in the count attribute, or the implied pattern (according to the rules given above) if the count attribute is omitted.

Let matches-from($node) be a function that returns true if the given node matches the pattern given in the from attribute (this function is not used if the from attribute is omitted).

When level="single":

When level="multiple":

When level="any":

The sequence of numbers (the place marker) is then converted into a string using the effective values of the attributes specified in [12.3 Number to String Conversion Attributes]; each of these attributes is interpreted as an attribute value template. After conversion, the resulting string is inserted in the result tree.

For example, the following will number the items in an ordered list:

<xsl:template match="ol/item">
  <fo:block>
    <xsl:number/><xsl:text>. </xsl:text><xsl:apply-templates/>
  </fo:block>
<xsl:template>

The following two rules will number title elements. This is intended for a document that contains a sequence of chapters followed by a sequence of appendices, where both chapters and appendices contain sections, which in turn contain subsections. Chapters are numbered 1, 2, 3; appendices are numbered A, B, C; sections in chapters are numbered 1.1, 1.2, 1.3; sections in appendices are numbered A.1, A.2, A.3.

<xsl:template match="title">
  <fo:block>
     <xsl:number level="multiple"
                 count="chapter|section|subsection"
                 format="1.1 "/>
     <xsl:apply-templates/>
  </fo:block>
</xsl:template>

<xsl:template match="appendix//title" priority="1">
  <fo:block>
     <xsl:number level="multiple"
                 count="appendix|section|subsection"
                 format="A.1 "/>
     <xsl:apply-templates/>
  </fo:block>
</xsl:template>

The following example numbers notes sequentially within a chapter:

<xsl:template match="note">
  <fo:block>
     <xsl:number level="any" from="chapter" format="(1) "/>
     <xsl:apply-templates/>
  </fo:block>
</xsl:template>

The following example will number H4 elements in HTML with a three-part label:

<xsl:template match="H4">
 <fo:block>
   <xsl:number level="any" from="H1" count="H2"/>
   <xsl:text>.</xsl:text>
   <xsl:number level="any" from="H2" count="H3"/>
   <xsl:text>.</xsl:text>
   <xsl:number level="any" from="H3" count="H4"/>
   <xsl:text> </xsl:text>
   <xsl:apply-templates/>
 </fo:block>
</xsl:template>

12.3 Number to String Conversion Attributes

The following attributes are used to control conversion of a sequence of numbers into a string. The numbers are integers greater than 0. The attributes are all optional.

The main attribute is format. The default value for the format attribute is 1. The format attribute is split into a sequence of tokens where each token is a maximal sequence of alphanumeric characters or a maximal sequence of non-alphanumeric characters. Alphanumeric means any character that has a Unicode category of Nd, Nl, No, Lu, Ll, Lt, Lm or Lo. The alphanumeric tokens (format tokens) specify the format to be used for each number in the sequence. If the first token is a non-alphanumeric token, then the constructed string will start with that token; if the last token is non-alphanumeric token, then the constructed string will end with that token. Non-alphanumeric tokens that occur between two format tokens are separator tokens that are used to join numbers in the sequence. The nth format token will be used to format the nth number in the sequence. If there are more numbers than format tokens, then the last format token will be used to format remaining numbers. If there are no format tokens, then a format token of 1 is used to format all numbers. The format token specifies the string to be used to represent the number 1. Each number after the first will be separated from the preceding number by the separator token preceding the format token used to format that number, or, if there are no separator tokens, then by . (a period character).

Format tokens are a superset of the allowed values for the type attribute for the OL element in HTML 4.0 and are interpreted as follows:

For all format tokens other than the first kind above (one that consists of decimal digits), there may be an implementation-defined upper bound on the range of numbers that can be formatted using this format token; indeed, for some numbering sequences there may be an intrinsic limit. For the numbering sequences described above, the upper bound must not be less than 1000 (one thousand). Numbers that exceed the upper bound must be formatted using the format token 1.

When numbering with an alphabetic sequence, the lang attribute specifies which language's alphabet is to be used; it has the same range of values as xml:lang [XML]; if no lang value is specified, the language should be determined from the system environment. The set of languages for which numbering is supported is implementation-defined.

The letter-value attribute disambiguates between numbering sequences that use letters. In many languages there are two commonly used numbering sequences that use letters. One numbering sequence assigns numeric values to letters in alphabetic sequence, and the other assigns numeric values to each letter in some other manner traditional in that language. In English, these would correspond to the numbering sequences specified by the format tokens a and i. In some languages, the first member of each sequence is the same, and so the format token alone would be ambiguous. A value of alphabetic specifies the alphabetic sequence; a value of traditional specifies the other sequence. If the letter-value attribute is not specified, then it is implementation-dependent how any ambiguity is resolved.

NOTE: It is possible for two conforming implementations not to convert a number to exactly the same string. Some implementations might not support some languages. Furthermore, there may be variations possible in the way conversions are performed for any particular language that are not specifiable by the attributes on xsl:number. Future versions of XSLT may provide additional attributes to provide control over these variations. Implementations may also use implementation-defined namespaced attributes on xsl:number for this.

The grouping-separator attribute gives the separator used as a grouping (e.g. thousands) separator in decimal numbering sequences, and the optional grouping-size specifies the size (normally 3) of the grouping. For example, grouping-separator="," and grouping-size="3" would produce numbers of the form 1,000,000. If only one of the grouping-separator and grouping-size attributes is specified, then it is ignored.

Here are some examples of conversion specifications:

13 Sorting

A sort specification is a sequence of one or more adjacent xsl:sort elements which together define rules for sorting the items in an input sequence to form a sorted sequence. Within a sort specification, each xsl:sort element provides one sort key definition. The first xsl:sort element specifies the primary part of the sort specification, the second xsl:sort element specifies the secondary part of the sort specification and so on.

A sort specification may occur as the content of an xsl:sort-key declaration at the top level of a stylesheet module, or it may occur immediately within an xsl:apply-templates, xsl:for-each, or xsl:for-each-group element.

[ERR087] When used within xsl:for-each or xsl:for-each-group, xsl:sort elements must occur before any other children.

13.1 Collating Sequences

When sorting or comparing a set of strings, the question arises, what collating sequence should be used? Facilities in XSLT 2.0 and XPath 2.0 that require strings to be ordered rely on the concept of a named collation. A collation is a set of rules that determine whether two strings are equal, and if not, which of them should be sorted before the other. A collation is identified by a URI, but the manner in which this URI is associated with an actual rule or algorithm is implementation-defined.

NOTE: The reason XSLT does not provide detailed mechanisms for defining collating sequences is that many implementations will re-use collating mechanisms available from the underlying implementation platform (for example, from the operating system or from the run-time library of a chosen programming language). These will inevitably differ from one XSLT implementation to another.

In order to allow portable stylesheets to be written, implementations should provide facilities that allow an arbitrary URI to be bound to a collation offered by that implementation. The URIs that designate collations should not be hard-wired into the implementation. One way of doing this is by means of a top-level element in the stylesheet of the form <vendor:collation name="uri"> where vendor is an implementor-defined namespace and uri is the URI used to identify the collation; the attributes and/or content of this declaration can be used to bind the collation URI to a specific collation available from the implementation. As described in [3.4.1 User-defined Data Elements], such top-level elements will have no effect when the stylesheet is processed by an XSLT processor that does not recognize the namespace URI. A stylesheet that requires portability can therefore use multiple <vendor:collation declarations to bind the collation URI to different collation implementations provided by different vendors.

Ed. Note: Add something here about the default collation.

13.2 The xsl:sort Element

<xsl:sort
  select = expression
  lang = { nmtoken }
  type = { qname }
  order = { "ascending" | "descending" }
  collation = { uri }
  case-order = { "upper-first" | "lower-first" }
  data-type = { "text" | "number" | qname-but-not-ncname } />

Those attributes of the xsl:sort elements whose values are attribute value templates are evaluated using the outer focus. If the element that contains the xsl:sort elements is an xsl:sort-key declaration, then the outer focus is a singleton focus based on the document node of the document containing the initial context node. Otherwise, the outer focus is the focus used to evaluate the select attribute of the containing instruction (for example, xsl:for-each or xsl:apply-templates).

The sequence to be sorted is referred to as the initial sequence. The sequence after sorting as defined by the xsl:sort elements is referred to as the sorted sequence.

For each item in the initial sequence, a value is computed for each sort key definition within the sort specification. The value computed for an item by using the Nth sort key definition is referred to as the Nth sort key of that item. Specifically, the Nth sort key is computed by evaluating the expression contained in the select attribute of the Nth xsl:sort element, if there is such an attribute. If there is no select attribute, the sort key is computed by taking the actual item in the initial sequence if it is an atomic value, or the typed-value of this item if it is a node.

The expression in the select attribute of the xsl:sort element is evaluated with the focus set as follows:

If the xsl:sort element has a type attribute, then the sort key is converted to the target data type before comparing it with other items.

[ERR088] The target data type for each xsl:sort element is determined by the effective value of its type attribute. This must be the name of a primitive data type in XML Schema (see [XML Schema]). It is a dynamic error if any other value is supplied. The processor must either signal the error, or must recover by continuing as if the type attribute were not specified.

For backwards compatibility with XSLT 1.0, the attribute data-type is available as an alternative to type. If this has the value text, the target data type is xs:string. If it has the value number, the target data type is xs:double. If it has any other value, the effect is implementation-defined.

If the type and data-type attributes are both present, the data-type attribute is ignored. The implementation may check that its value is valid, but is not required to do so.

Each sort key is converted to the target data type using the rules for the XPath cast expression. [ERR089] It is a dynamic error if any value obtained by evaluating the select attribute of an xsl:sort element cannot be converted to the target data type. The processor must either signal the error, or must recover by treating the value as being less than any value for which conversion succeeds, but equal to any other value for which conversion fails. This means (if this is the first or only sort key) that values that cannot be converted to the target data type will appear together at the start of the sorted sequence if order is ascending, or at the end if order is descending.

Ed. Note: Why must the type be a primitive type?

If there is no type or data-type attribute, then the computed sort keys are not converted before comparison, except in the case where the data type of a computed sort key is a complex type, in which case it is converted to a string as if by the XPath string function.

The items in the initial sequence are ordered into a sorted sequence by comparing their sort keys. The relative position of two items A and B in the sorted sequence is determined as follows. The first sort key of A is compared with the first sort key of B, according to the rules of the first sort key definition. If, under these rules, A is less than B, then A will precede B in the sorted sequence, unless the order attribute of this sort key definition specifies descending, in which case B will precede A in the sorted sequence. If, however, the relevant sort keys compare equal, then the second sort key of A is compared with the second sort key of B, according to the rules of the second sort key definition. This continues until two sort keys are found that compare unequal. If all the sort keys compare equal, then A will precede B in the sorted sequence if A preceded B in the initial sequence, and vice versa.

In general, comparison of two values is performed according to the rules of the XPath lt operator. However, special rules apply to certain data types, as described below. [ERR090] It is a dynamic error if, for any sort key definition, the set of sort keys evaluated for all the items in the initial sequence, after any type conversion requested, contains a pair of values for which the result of the XPath lt operator is an error or an empty sequence. The processor must either signal the error, or must recover by assigning an arbitrary ordering to any such pair of values.

Issue 105 (sequence-valued-sort-key): What happens if the value of a sort key is a sequence?

[ERR091] It is a dynamic error if the effective value of the type attribute of the xsl:sort element is a data type for which no ordering relation is defined, other than the value xs:string or text, which are synonymous. The processor must signal the error, or must recover by continuing as if the type attribute were omitted.

For comparison of string values, special rules apply. If the xsl:sort element has a collation attribute, then the strings are compared according to the rules for the named collation: that is, they are compared using the XPath function call xf:compare($a, $b, $collation).

NOTE: XSLT provides no facilities to declare or define collations: such mechanisms are expected to be provided by implementors. Equally, this specification does not define what happens if the collation name is not recognized: the implementation may signal an error, or it may use the default collation.

The lang and case-order attributes are ignored if a collation attribute is present. But in the absence of a collation attribute, these attributes provide input to an implementation-defined algorithm to identify a suitable collation:

In the absence of any of these attributes, the default collation is used.

NOTE: The exact results of sorting are implementation-dependent. Some implementations might not support some languages. Furthermore, there may be variations possible in the sorting of any particular language that are not specified by the attributes on xsl:sort, for example, whether Hiragana or Katakana is sorted first in Japanese. Implementations may also use implementation-defined namespaced attributes on xsl:sort for this.
NOTE: It is recommended that implementors consult [UNICODE TR10] for information on internationalized sorting.

There are also special considerations where the target data type is float or double. For sorting purposes, NaN values are considered equal to each other, and less than any other value, including negative infinity. A value that cannot be converted to the target data type is treated in the same way as a NaN value. Negative zero and positive zero are considered equal, as normal.

13.3 Using Unnamed Sort Specifications

When used within xsl:for-each or xsl:apply-templates, a sort specification indicates that the sequence of items selected by that instruction should be processed in sorted order, not in the order of the supplied sequence.

For example, suppose an employee database has the form

<employees>
  <employee>
    <name>
      <given>James</given>
      <family>Clark</family>
    </name>
    ...
  </employee>
</employees>

Then a list of employees sorted by name could be generated using:

<xsl:template match="employees">
  <ul>
    <xsl:apply-templates select="employee">
      <xsl:sort select="name/family"/>
      <xsl:sort select="name/given"/>
    </xsl:apply-templates>
  </ul>
</xsl:template>

<xsl:template match="employee">
  <li>
    <xsl:value-of select="name/given"/>
    <xsl:text> </xsl:text>
    <xsl:value-of select="name/family"/>
  </li>
</xsl:template>

When used within xsl:for-each-group, a sort specification indicates the order in which the groups should be processed. For the effect of xsl:for-each-group, see [14 Grouping]

13.4 Using Named Sort Specifications

13.4.1 Declaring a Named Sort Specification

<!-- Category: declaration -->
<xsl:sort-key
  name = qname>
  <!-- Content: (xsl:sort+) -->
</xsl:sort-key>

A named sort specification is defined by an xsl:sort-key declaration. This is a top-level element in the stylesheet. The name attribute is mandatory. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names]: it need not have a prefix.

The content of the xsl:sort-key element consists of one or more xsl:sort elements that define the components of the sort specification, in major to minor order.

If a stylesheet contains declarations of two or more named sort specifications with the same expanded-QName, the one with highest import precedence is used. [ERR092] It is a static error for a stylesheet to contain two or more named sort specifications with the same expanded-QName and the same import precedence, unless there is another named sort specification with the same expanded-QName and a higher import precedence.

13.4.2 Sorting Using a Named Sort Specification

Ed. Note: We should use named arguments in function signatures, as in the F&O specification

Function: item* sort(string, sequence)

In an XPath expression used within an XSLT stylesheet, an additional function sort is available, which sorts a sequence using a named sort specification.

The first argument is evaluated as a string; its value must be a QName whose expanded-QName is the same as the name of a named sort specification defined in the stylesheet. [ERR093] It is a dynamic error if the first argument of the sort function does not match the name of any named sort specification in the stylesheet. The processor must signal the error.

The second argument is evaluated as a sequence. This sequence forms the initial sequence for the sort. The sequence is sorted according to the rules for the named sort specification, and the result of the function call is the resulting sorted sequence.

14 Grouping

The facilities described in this section are designed to allow users to group nodes in a document based on common string values, common names, or commmon values for any other expression. Since grouping identifies items with duplicate values, the same facilities also allow selection of the distinct values in a sequence of items, that is, the elimination of duplicates.

In addition these facilities allow grouping based on sequential position, e.g. selecting groups of adjacent PARA elements. The facilities also provide an easy way to do fixed-size grouping, for example identifying groups of three adjacent nodes, which is useful when arranging data in multiple columns.

For each group of items identified, it is possible to evaluate a content constructor for the group. Grouping is nestable to multiple levels so that groups of distinct items can be identified, then from among the distinct groups selected, further sub-grouping of distinct items in the current group can be done.

14.1 The Current Group

Function: sequence current-group()

The evaluation context for XPath expressions includes an additional value called the current group, which is a sequence. The current group is the collection of related items that are processed collectively in one iteration of the xsl:for-each-group element.

While an xsl:for-each-group instruction is being evaluated, the current group will be non-empty. At other times, it will be an empty sequence.

The function current-group returns the current group.

The function takes no arguments.

[ERR094] The current-group function must not be used within a pattern.

14.2 The xsl:for-each-group Element

<!-- Category: instruction -->
<xsl:for-each-group
  select = expression
  group-by = expression
  group-adjacent = expression
  group-starting-with = pattern
  group-ending-with = pattern
  collation = { uri }
  type = qname>
  <!-- Content: (xsl:sort*, content-constructor) -->
</xsl:for-each-group>

This element is an instruction that may be used anywhere within a content constructor.

The xsl:for-each-group instruction partitions a sequence into groups of items (that is, it establishes a set of sequences) based either on common values of a grouping key, or on a pattern that the initial node in a group must match. The content constructor that forms the content of the xsl:for-each-group instruction is evaluated once for each of these groups. A group is never empty.

The sequence of items to be grouped, which is referred to as the population, is determined by evaluating the XPath expression contained in the select attribute. The required type of this expression is sequence. The population is treated as a sequence; the order of items in this sequence is referred to as population order. If the population is empty, the number of groups will be zero. Each item in the population is assigned to exactly one group: the assignment of items to groups depends on the group-by, group-adjacent, group-starting-with, and group-ending-with attributes. [ERR095] These four attributes are mutually exclusive: exactly one of the four attributes must be present.

[ERR096] The type attribute, if specified, must be the name of an atomic type (either a built-in type, or a type defined in an imported schema).

[ERR097] It is an error to specify either the type attribute or the collation attribute if neither the group-by attribute nor group-adjacent attribute is specified.

For each group, the item within the group that is first in population order is known as the initial item of the group.

There is an ordering among groups referred to as the order of first appearance. A group G is defined to precede a group H in order of first appearance if the initial item of G precedes the initial item of H in population order.

There is another ordering among groups referred to as processing order.

If there are no xsl:sort elements immediately within the xsl:for-each-group element, the processing order of the groups is the order of first appearance.

Otherwise, the xsl:sort elements immediately within the xsl:for-each-group element define the processing order of the groups (see [13 Sorting]). They do not affect the order of items within each group. Multiple sort keys are allowed, and are evaluated in major-to-minor order. If two groups have the same values for all their sort keys, they are processed in order of first appearance.

The select expression of an xsl:sort element is evaluated once for each group. During this evaluation, the context item is the initial item of the group, the context position is the position of this item within the set of initial items (that is, one item for each group in the population) in population order, the context size is the number of groups, and the current group is the group whose sort key is being determined.

This means that if the grouping key is "@category", you can sort the groups in order of their grouping key by writing <xsl:sort select="@category"/>; or you can sort the groups in order of size by writing <xsl:sort select="count(current-group())" data-type="number">

If there are attribute value templates present in the xsl:sort element, the focus for evaluation of the contained expressions is the same as the focus for evaluation of the select attribute of the containing xsl:for-each-group instruction.

The content constructor contained by the xsl:for-each-group element is evaluated once for each of the groups, in processing order. The node sequences that result are concatenated, in processing order, to form the result of the xsl:for-each-group element. Within the content constructor, the context item is the initial item of the relevant group, the context position is the position of this item among the set of initial items (one item for each group) arranged in processing order of the groups, the context size is the number of groups, and the current group is the group being processed. This has the effect that within the the content constructor, a call on position() takes successive values 1, 2, ... last().

On completion of the evaluation of the xsl:for-each-group, the current group reverts to its previous value.

14.3 Examples of Grouping

The following example groups a list of nodes based on common values. The resulting groups are numbered but unsorted, and a total is calculated for each group.

Source XML document:

<cities>
  <city name="Milano"  country="Italia"      pop="5"/>
  <city name="Paris"   country="France"      pop="7"/>
  <city name="München" country="Deutschland" pop="4"/>
  <city name="Lyon"    country="France"      pop="2"/>
  <city name="Venezia" country="Italia"      pop="1"/>
</cities>

More specifically, the aim is to produce a four-column table, containing one row for each distinct country. The four columns are to contain first, a sequence number giving the number of the row; second, the name of the country, third, a comma-separated alphabetical list of the city names within that country, and fourth, the sum of the pop attribute for the cities in that country.

Desired output:

<table>
  <tr>
    <th>Position</th>
    <th>Country</th>
    <th>List of Cities</th>
    <th>Population</th>
  </tr>
  <tr>
    <td>1</td>
    <td>Italia</td>
    <td>Milano, Venezia</td>
    <td>6</td>
  </tr>
  <tr>
    <td>2</td>
    <td>France</td>
    <td>Lyon, Paris</td>
    <td>9</td>
  </tr>  
  <tr>
    <td>3</td>
    <td>Deutschland</td>
    <td>München</td>
    <td>4</td>
  </tr>  
</table>

Solution:

<table xsl:version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <tr>
    <th>Position</th>
    <th>Country</th>
    <th>City List</th>
    <th>Population</th>
  </tr>
  <xsl:for-each-group select="cities/city" group-by="@country">
    <tr>
      <td><xsl:value-of select="position()"/></td>
      <td><xsl:value-of select="@country"/></td>
      <td>
        <xsl:value-of select="current-group()/@name" separator=","/>
      </td>
      <td><xsl:value-of select="sum(current-group()/@pop)"/></td>
    </tr>
  </xsl:for-each-group>
</table>

The following example uses the same source document, this time grouping the cities according to the initial letter of the city name. The groups are sorted, and the result includes a count of the nodes within the group. The heading contains a count of the number of groups:

Desired output:

<html>
  <body>
    <h2>L (1)</h2><p>Lyon</p>
    <h2>M (2)</h2><p>Milano</p><p>München</p>
    <h2>P (1)</h2><p>Paris</p>
    <h2>V (1)</h2><p>Venezia</p>
  </body>
    </html>

Solution:

<html xsl:version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <body>
    <xsl:for-each-group select="cities/city" group-by="substring(@name,1,1)">
      <xsl:sort select="substring(@name,1,1)"/>
      <h2>
        <xsl:value-of select="upper-case(substring(@name,1,1))"/>
        <xsl:text> (</xsl:text>
        <xsl:value-of select="count(current-group())"/>
        <xsl:text>)</xsl:text>
      </h2>
      <xsl:for-each select="current-group()">
        <p><xsl:value-of select="@name"/></p>
      </xsl:for-each>
    </xsl:for-each-group>
  </body>
</html>

The next example identifies a group not by the presence of a common value, but rather by adjacency in document order. A group consists of an h2 element, followed by all the p elements up to the next h2 element.

Source XML document:

<body>
  <h2>Introduction</h2>
  <p>XSLT is used to write stylesheets.</p>
  <p>XQuery is used to query XML databases.</p>
  <h2>What is a stylesheet?</h2>
  <p>A stylesheet is an XML document used to define a transformation.</p>
  <p>Stylesheets may be written in XSLT.</p>
  <p>XSLT 2.0 introduces new grouping constructs.</p>
</body>

Desired output:

<chapter>
  <section title="Introduction">
    <para>XSLT is used to write stylesheets.</para>
    <para>XQuery is used to query XML databases.</para>
  </section> 
  <section title="What is a stylesheet?">
    <para>A stylesheet is an XML document used to define a transformation.</para>
    <para>Stylesheets may be written in XSLT.</para>
    <para>XSLT 2.0 introduces new grouping constructs.</para>
  </section>
</chapter>

Solution:

<xsl:template match="body">
  <chapter>
	<xsl:for-each-group select="*" group-starting-with="h2"	>
	  <section title="{self::h2}">
	    <xsl:for-each select="current-group()[self::p]">
	      <para><xsl:value-of select="."/></para>
	    </xsl:for-each> 
	  </section>
	</xsl:for-each-group>
  </chapter>
</xsl:template>

The use of title="{self::h2}" rather than title="{.}" is to handle the case where the first element is not an h2 element.

 

The next example illustrates how a group of related elements can be identified by the last element in the group, rather than the first. Here the absence of the attribute continued="yes" indicates the end of the group.

Source XML document:

<doc>
  <page continued="yes">Some text</page>
  <page continued="yes">More text</page>    
  <page>Yet more text</page>
  <page continued="yes">Some words</page>
  <page continued="yes">More words</page>    
  <page>Yet more words</page>        
</doc>

Desired output:

<doc>
  <pageset>
    <page>Some text</page>
    <page>More text</page>    
    <page>Yet more text</page>
  </pageset>
  <pageset>
    <page>Some words</page>
    <page>More words</page>    
    <page>Yet more words</page>
  </pageset>
</doc>

Solution:

<xsl:template match="doc">
<doc>
  <xsl:for-each-group select="*" 
                      group-ending-with="page[not(@continued='yes')]">
    <pageset>
      <xsl:for-each select="current-group()">
        <page><xsl:value-of select="."/></page>
      </xsl:for-each> 
    </pageset>
  </xsl:for-each-group>
</doc>
</xsl:template>
 

In the final example, the membership of a node within a group is based both on adjacency of the nodes in document order, and on common values. In this case, the grouping key is a boolean condition, true or false, so the effect is that a grouping establishes a maximal sequence of nodes for which the condition is true, followed by a maximal sequence for which it is false, and so on.

Source XML document:

<p>Do <em>not</em>:
    <ul>
    <li>talk,</li>
    <li>eat, or</li>
    <li>use your mobile telephone</li>
    </ul>
    while you are in the cinema.</p>

Desired output:

<p>Do <em>not</em>:</p>
    <ul>
    <li>talk,</li>
    <li>eat, or</li>
    <li>use your mobile telephone</li>
    </ul>
    <p>while you are in the cinema.</p>

Solution:

This requires creating a p element around the maximal sequence of sibling nodes that does not include a ul or ol element.

This can be done by using group-adjacent, with a grouping key that is true if the element is a ul or ol element, and false otherwise:

<xsl:template match="p">
    <xsl:for-each-group select="node()" 
            group-adjacent="self::ul or self::ol">
        <xsl:choose>
            <xsl:when test="self::ul or self::ol">
                <xsl:copy-of select="current-group()"/>  
            </xsl:when>
            <xsl:otherwise>
                <p>
                    <xsl:copy-of select="current-group()">
                </p>
            </xsl:otherwise>  
        </xsl:choose>
    </xsl:for-each-group>
</xsl:template>

15 Regular Expressions

The core function library for XPath 2.0 defines three functions that make use of regular expressions:

These functions are described in [Functions and Operators].

For more complex string processing than is possible using these functions, XSLT provides an instruction xsl:analyze-string, which is defined in this section.

The regular expressions used by this instruction, and the flags that control the interpretation of these regular expressions, must conform to the syntax defined in [Functions and Operators], which is itself based on the syntax defined in [XML Schema].

<!-- Category: instruction -->
<xsl:analyze-string
  select = expression
  regex = { string }
  flags = { string }>
  <!-- Content: (xsl:matching-substring?, xsl:non-matching-substring?) -->
</xsl:analyze-string>

<xsl:matching-substring>
  <!-- Content: content-constructor -->
</xsl:matching-substring>

<xsl:non-matching-substring>
  <!-- Content: content-constructor -->
</xsl:non-matching-substring>

The xsl:analyze-string instruction takes as input a string (the value of the select attribute) and a regular expression (the effective value of the regex attribute).

The flags attribute may be used to control the interpretation of the regular expression. If the attribute is omitted, the effect is the same as supplying a zero-length string . The mode is set using the $flags argument. If this is set to m, the match operates in multiline mode. Otherwise, it operates in string mode.

The xsl:analyze-string instruction may have two child elements: xsl:matching-substring and xsl:non-matching-substring. Both elements are optional, and neither may appear more than once.

This instruction is designed to process all the non-overlapping substrings of the input string that match the regular expression supplied.

[ERR100] It is a dynamic error if the effective value of the regex attribute does not conform to the required syntax for regular expressions, as specified in [Functions and Operators], or if the effective value of the flags attribute has a value other than the values defined in [Functions and Operators]. The processor must signal the error. If the regular expression and/or flags are known statically (for example, if the attributes do not contain any expressions enclosed in curly braces) then the processor may signal the error as a static error.

[ERR101] It is a dynamic error if the effective value of the regex attribute is a regular expression that matches a zero-length string . The processor must signal the error. If the regular expression is known statically (for example, if the attribute does not contain any expressions enclosed in curly braces) then the processor may signal the error as a static error.

The xsl:analyze-string instruction starts at the beginning of the input string and attempts to find the first substring that matches the regular expression. If there are several matches, the first match is defined to be the one whose starting position comes first in the string. Having found the first match, the method proceeds to find the second and subsequent matches by repeating the search, starting at the first character that was not included in the previous match.

The input string is thus partitioned into a sequence of non-empty substrings, some of which match the regular expression, others which do not match it. This sequence of substrings is processed using the xsl:matching-substring and xsl:non-matching-substring child instructions. A matching substring is processed using the xsl:matching-substring element, a non-matching substring using the xsl:non-matching-substring element. Each of these elements takes a content constructor as its content. If the element is absent, the effect is the same as if it were present with empty content. In processing each substring, the contents of the substring will be the context item (as a value of type xs:string); the position of the substring within the sequence of matching and non-matching substrings will be the context position; and the number of matching and non-matching substrings will be the context size.

If the input is a zero-length string , neither the xsl:matching-substring nor xsl:non-matching-substring elements will be evaluated.

Function: string regex-group(integer)

While the xsl:matching-substring instruction is active, a set of captured substrings is available, corresponding to the parenthized sub-expressions of the regular expression. These captured substrings are accessible using the function regex-group. This function takes an integer argument to identify the group, and returns a string representing the captured substring. In the absence of a captured substring with the relevant number, it returns the zero-length string .

NOTE: The function also returns a zero-length string in the case of a group that matched a zero-length string , and in the case of a group that exists in the regular expression but did not match any part of the input string.

While no xsl:matching-substring instruction is active the regex-group returns an empty sequence. The function also returns an empty sequence if an xsl:non-matching-substring instruction has been activated more recently than an xsl:matching-substring instruction.

15.1 Examples of Regular Expression Matching

Problem: replace all newline characters in the <abstract> by <br/> elements:

Solution:

<xsl:analyze-string select="abstract" regex="\n">
  <xsl:non-matching-substring>
    <xsl:value-of select="."/>
  </xsl:non-matching-substring>
  <xsl:matching-substring>
    <br/>
  </xsl:matching-substring>
</xsl:for-each>

Problem: replace all occurrences of "[...]" in the <body> by <cite>...</cite> elements:

Solution:

<xsl:analyze-string select="body" regex="\[(.*?)\]">
  <xsl:matching-substring>
    <cite><xsl:value-of select="regex-group(1)"/></cite>
  </xsl:matching-substring>
  <xsl:non-matching-substring>
    <xsl:value-of select="."/>
  </xsl:non-matching-substring>
</xsl:analyze-string>

Problem: the input string contains a date such as "23 March 2002". Convert it to the form 2002-03-23.

Solution (with no error handling if the input format is incorrect):

<xsl:variable name="months" select="('January', 'February', 'March', ...)"/>

<xsl:analyze-string select="$input" regex="\s*([0-9]+)\s+([A-Z](a-z)+)\s+([0-9]+)\s*">
    <xsl:matching-substring>
        <xsl:number value="regex-group(3)" format="0001"/>          
        <xsl:text>-</xsl:text>
        <xsl:number value="index-of($months, regex-group(2))" format="01"/>
        <xsl:text>-</xsl:text>
        <xsl:number value="regex-group(1)" format="01"/>
    </xsl:matching-substring>
</xsl:analyze-string>

16 Additional Functions

This section describes XSLT-specific additions to the core XPath function library. Some of these additional functions also make use of information specified by declarations in the stylesheet; this section also describes these declarations.

16.1 Multiple Source Documents

The document function, which in XSLT 1.0 was an additional function defined in the XSLT specification, is now a core function defined in XPath 2.0: see [Functions and Operators]. Details of the function have therefore been removed from this specification.

16.2 Reading Text Files

Function: sequence unparsed-text(sequence, string?)

The unparsed-text function reads one or more external files and returns the contents of each one as a string.

The first argument is treated as a sequence. Each item in the sequence, when converted to a string, must be a URI. The URI must contain no fragment identifier, and must identify a resource that can be read as text. If the URI is a relative URI, then:

The second argument, if present, is the name of an encoding. This encoding is used to translate the contents of the file into a string. The values for this attribute follow the same rules as for the encoding attribute in an XML declaration. The only values which every implementation is obliged to recognize are utf-8 and utf-16. If the second argument is omitted, the implementation may attempt to infer the encoding of the file by using external information (such as an HTTP header) or by auto-recognition using techniques such as those described in [XML].

Ed. Note: We decided to incorporate (as far as possible) the XInclude rules for detection of encoding.

[ERR102] It is a dynamic error if a URI cannot be used to retrieve a resource containing text. The processor must either signal the error, or must recover by treating the URI as if it referenced a resource containing a zero-length string.

[ERR103] It is a dynamic error if a resource contains characters that are not permitted XML characters. The processor must either signal the error, or must recover in a implementation-defined way; one possible outcome is that the processor will produce an output file that is not well-formed XML.

[ERR104] It is a dynamic error if a resource contains bytes that cannot be decoded into permitted XML characters using the specified encoding. This includes the case where the processor does not support the requested encoding. The processor must signal the error.

[ERR105] It is a dynamic error if the second argument of the unparsed-text function is omitted and the processor cannot infer the encoding using external information.The processor must signal the error.

The result is a sequence of strings, containing one string for each URI in the sequence supplied as the first argument; each string holds the text of the resource retrieved using the URI in the corresponding position of the sequence.

NOTE: If the text file contains characters such as < and &, these will typically be output as &lt; and &amp; when the string is written to the result tree and serialized as XML or HTML. If these characters actually represent markup (for example, if the text file contains HTML), then the stylesheet can attempt to write them as markup to the output file using the disable-output-escaping attribute of the xsl:value-of instruction (see [20.5 Disabling Output Escaping]. Note, however, that implementations are not required to support this feature.

The following example attempts to read an HTML file and copy it, as HTML, to the serialized output file:

<xsl:output method="html"/>

<xsl:template match="/">
  <xsl:value-of select="unparsed-text('header.html', 'iso-8859-1')"
                disable-output-escaping="yes"/>
  <xsl:apply-templates/>
  <xsl:value-of select="unparsed-text('footer.html', 'iso-8859-1')"
                disable-output-escaping="yes"/>
</xsl:template>

16.3 Keys

Keys provide a way to work with documents that contain an implicit cross-reference structure. They make it easier to locate the nodes within a document that have a given value for a given attribute or child element, and they provide a hint to the implementation that certain access paths in the document need to be efficient.

16.3.1 The xsl:key Declaration

<!-- Category: declaration -->
<xsl:key
  name = qname
  match = pattern
  use = expression
  type = qname
  collation = uri />

The xsl:key declaration is used to declare keys. The name attribute specifies the name of the key. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names]. The match attribute is a Pattern; an xsl:key element applies to all nodes that match the pattern specified in the match attribute. The use attribute is an expression specifying the values of the key; the expression will be evaluated with the node that matches the pattern as the context node.

The presence of an xsl:key declaration makes it easy to find a node that matches the match pattern if any of the values of the use expression (when applied to that node) are known. It also provides a hint to the implementation that access to the nodes by means of these values needs to be efficient (many implementations are likely to construct an index or hash table to achieve this.) Note that the use expression in general returns a sequence of values, and any one of these may be used to locate the node.

The type attribute defines the required type of the result of the use expression, and it also defines the rules that determine whether two key values are considered equal. The default value is xs:string, which means that the result of evaluating the expression is treated as a sequence of strings, by taking the string value of each item in the sequence. In general, the result of evaluating the use expression is converted to a sequence of items of the specified type, by invoking the rules for the cast as operator defined in [XPath 2.0]. The type must be either a built-in type defined in [XML Schema] (Part 2), or an atomic type defined in a schema imported using an xsl:import-schema declaration.

The collation attribute is used only when the type is xs:string, and it identifies a collation that is used to decide whether two strings are equal for the purposes of key matching. Specifically, two values $a and $b are considered equal if the result of the function call xf:compare($a, $b, $collation) is zero.

It is possible to have:

  • multiple key declarations with the same name;

  • a node that matches the match patterns of several different key declarations;

  • a node that returns more than one value from its use expression;

  • a key value that identifies more than one node (the key values for different nodes do not need to be unique).

An xsl:key declaration with higher import precedence does not override another of lower import precedence; all the xsl:key declarations in the stylesheet are effective regardless of their import precedence.

[ERR106] It is a static error if there are several xsl:key declarations in the stylesheet with the same key name and different data types, or if the data type is specified in one of these declarations and omitted in another. The key names and data types are the same if their expanded names match.

[ERR107] It is a static error if there are several xsl:key declarations in the stylesheet with the same key name and different collation sequences, or if the collation is specified in one of these declarations and omitted in another. The collation names are the same if their URIs consist of the same sequence of Unicode code-points.

[ERR108] It is a static error if the value of the type attribute is not an atomic type (optionally followed by an occurrence count).

[ERR109] It is a static error if a value is specified for the collation attribute unless the type is defaulted or set to xs:string.

[ERR110] It is a dynamic error if the result of evaluating the use expression , for any node that matches the pattern specified in the match attribute, cannot be converted to the data type specified by the type attribute. The processor may signal the error, or may recover by ignoring the existence of the value that cannot be converted.

16.3.2 The key Function

Function: node * key(string, sequence)

The key function does for keys what the id function does for IDs.

The first argument specifies the name of the key. The value of the argument must be a QName, which is expanded as described in [5.1 Qualified Names]. [ERR111] It is a dynamic error if the value is not a valid QName, or if there is no namespace declaration in scope for the prefix of the QName, or if the name obtained by expanding the QName is not the same as the expanded name of any key declaration in the stylesheet. The processor must signal these errors.

Issue 133 (qname-for-key-etc): Some XSLT functions such as key take an argument whose value is a string containing a lexical QName. Should we allow this argument to have type xs:QName?

The second argument to the key function is regarded as a sequence. The set of requested key values is formed by casting each item in the supplied value of the argument to the type defined in the key declaration. In general the result is a sequence of atomic values, each of which is considered as a requested key value. The result of the function is a sequence of nodes, in document order and with duplicates removed, comprising those nodes in the same document as the context node that are matched by an xsl:key declaration whose name is the same as the supplied key name, and whose value is equal to one of these requested key values, under the rules of the type and collation specified in the key declaration.

More formally, if the result of evaluating the second argument of the key function is denoted by $V, the result returned by the key function is the union of the node sequences selected by the expression

//(MATCH)/self::node()
   [some $k in (for $kk in USE return cast as TYPE ($kk)),
         $v in (for $vv in $V return cast as TYPE ($vv))
         satisfies $k eq $v]

applied to all xsl:key declarations whose name matches the name given as the first argument to the key function, where MATCH is the pattern given in the match attribute of the xsl:key declaration, USE is the expression given in its use attribute, and TYPE is the expression given in its type attribute, or xs:string if the attribute is omitted. The values of the match and use and type attributes are textually substituted into the above expression. Note that the predicate (USE) = $V typically has a sequence on both sides of the equals sign: it returns true if any of the values returned by the USE expression is equal to any of the values in $V.

If the collation attribute is specified in the key declaration, the expression above is modified to read:

//(MATCH)/self::node()
   [some $k in (for $kk in USE return cast as xs:string($kk)),
         $v in (for $vv in $V return cast as xs:string($vv))
         satisfies xf:compare($k, $v, 'COLLATION') = 0]
NOTE: The reason that /self::node() appears in the above expressions is to ensure that any calls of position() or last() within the USE expression evaluate to 1 (one). This is necessary to retain compatibility with the XSLT 1.0 specification.

For example, given a declaration

<xsl:key name="idkey" match="div" use="@id"/>

an expression key("idkey",@ref) will return the same nodes as id(@ref), assuming that the only ID attribute declared in the XML source document is:

<!ATTLIST div id ID #IMPLIED>

and that the ref attribute of the context node contains no whitespace.

Suppose a document describing a function library uses a prototype element to define functions

<prototype name="key" return-type="node-set">
<arg type="string"/>
<arg type="object"/>
</prototype>

and a function element to refer to function names

<function>key</function>

Then the stylesheet could generate hyperlinks between the references and definitions as follows:

<xsl:key name="func" match="prototype" use="@name"/>

<xsl:template match="function">
<b>
  <a href="#{generate-id(key('func',.))}">
    <xsl:apply-templates/>
  </a>
</b>
</xsl:template>

<xsl:template match="prototype">
<p><a name="{generate-id()}">
<b>Function: </b>
...
</a></p>
</xsl:template>

The key always returns nodes that are in the same document as the context node; to retrieve a key from any other document, it is necessary first to change the context node.

For example, suppose a document contains bibliographic references in the form <bibref>XSLT</bibref>, and there is a separate XML document bib.xml containing a bibliographic database with entries in the form:

<entry name="XSLT">...</entry>

Then the stylesheet could use the following to transform the bibref elements:

<xsl:key name="bib" match="entry" use="@name"/>

<xsl:template match="bibref">
  <xsl:variable name="name" select="."/>
  <xsl:apply-templates select="document('bib.xml')/key('bib',$name)"/>
</xsl:template>
NOTE: This relies on the ability in XPath 2.0 to have a function call on the right-hand side of the / operator in a path expression.

16.4 Number Formatting

Function: string format-number(double, string, string?)

The format-number function formats a number as a string using the picture string specified by the second argument and the decimal-format named by the third argument, or the default decimal-format, if there is no third argument. The number is supplied as the value of the first argument. The required type of this argument is xs:double; if the value is not of the right type, it is converted using the standard conversion rules. The decimal-format name must be a QName, which is expanded as described in [5.1 Qualified Names]. The result of the function is the formatted string representation of the supplied number.

[ERR112] It is a dynamic error if the stylesheet does not contain a declaration of the decimal-format with the expanded-QName specified as the third argument. The processor must either signal the error, or must recover by ignoring the third argument.

NOTE: Stylesheets can use other facilities in XPath to control rounding.

16.4.1 Defining a Decimal Format

<!-- Category: declaration -->
<xsl:decimal-format
  name = qname
  decimal-separator = char
  grouping-separator = char
  infinity = string
  minus-sign = char
  NaN = string
  percent = char
  per-mille = char
  zero-digit = char
  digit = char
  pattern-separator = char />

The xsl:decimal-format element declares a decimal-format, which controls the interpretation of a picture string used by the format-number function. If there is a name attribute, then the element declares a named decimal-format; otherwise, it declares the default decimal-format. The value of the name attribute is a QName, which is expanded as described in [5.1 Qualified Names]. [ERR113] It is a static error to declare either the default decimal-format or a decimal-format with a given name more than once (even with different import precedence), unless it is declared every time with the same value for all attributes (taking into account any default values). If a stylesheet does not contain a declaration of the default decimal format, a declaration equivalent to an xsl:decimal-format element with no attributes is implied.

The attributes of the xsl:decimal-format declaration establish values for a number of variables used as input to the algorithm followed by the format-number function. An outline of the purpose of each attribute is given below; however, the definitive explanations are given later, as part of the description of this algorithm.

The following attributes both control the interpretation of characters in the picture string supplied to the format-number function, and also specify characters that may appear in the result of formatting the number. In each case the value must be a single character.

  • decimal-separator specifies the character used for the decimal-separator-sign; the default value is the period character (.)

  • grouping-separator specifies the character used for the grouping-sign, which is typically used as a thousands separator; the default value is the comma character (,)

  • percent specifies the character used for the percent-sign; the default value is the percent character (%)

  • per-mille specifies the character used for the per-mille-sign; the default value is the Unicode per-mille character (#x2030)

  • zero-digit specifies the character used for the digit-zero-sign; the default value is the digit zero (0). This character must be a digit (category Nd in the Unicode property database), and it must have the numeric value zero. This attribute implicitly allocates the corresponding set of Unicode digit characters to represent the values 0 to 9.

The following attributes control the interpretation of characters in the picture string supplied to the format-number function. In each case the value must be a single character.

  • digit specifies the character used for the digit-sign in the picture string; the default value is the number sign character (#)

  • pattern-separator specifies the character used for the pattern-separator-sign, which separates positive and negative sub-pictures in a picture string; the default value is the semi-colon character (;)

The following attributes specify characters or strings that may appear in the result of formatting the number:

  • infinity specifies the string used for the infinity-symbol; the default value is the string Infinity

  • NaN specifies the string used for the NaN-symbol, which is used to represent the value NaN (not-a-number); the default value is the string NaN

  • minus-sign specifies the character used for the minus-symbol; the default value is the hyphen-minus character (-, #x2D). The value must be a single character.

[ERR114] It is a static error if, for any named or unnamed decimal format, the variables representing characters used in a picture string do not each have distinct values. These variables are decimal-separator-sign, grouping-sign, percent-sign, per-mille-sign, digit-zero-sign, digit-sign, and pattern-separator-sign.

16.4.2 Processing the Picture String

The behavior of the format-number function is described here by means of an algorithm; an implementation may use any algorithm that delivers the same result.

The formatting of a number is controlled by a picture string. The picture string is a sequence of characters, in which the characters assigned to the variables decimal-separator-sign, grouping-sign, percent-sign, per-mille-sign, zero-digit-sign, digit-sign and pattern-separator-sign are classified as active characters, and all other characters (including the percent-sign and per-mille-sign) are classified as passive characters.

Formatting is centered around the decimal-separator-sign, which can have a whole-part before and fractional-part after. The passive characters in the pattern string define a prefix and suffix portion.

The evaluation of the format-number function is described below in two phases, an anaysis phase and a formatting phase. The analysis phase takes as its inputs the picture string and the variables derived from the relevant xsl:decimal-format declaration, and produces as its output a number of variables with defined values. The formatting phase takes as its inputs the number to be formatted and the variables produced by the analysis phase, and produces as its output a string containing a formatted representation of the number.

NOTE: Numbers will always be formatted with the most significant digit on the left.

16.4.3 Analysing the Picture String

This phase of the algorithm analyses the picture string and the attribute settings of the xsl:decimal-format declaration, and has the effect of setting the values of various variables, which are used in the subsequent formatting phase. These variables are listed below. Each is shown with its initial setting and its data type. In the third column of this table, the word "single" means the one picture string that formats both positive and negative numbers, as contrasted to the two "positive" and "negative" sub-pictures that occur when the whole pattern has a pattern-separator.

[ERR115] The picture string must conform to the following rules. It is a dynamic error if the picture string does not satisfy these rules. The processor must either signal the error, or must recover by ignoring those characters in the supplied picture string that make the picture string invalid. If a valid picture string cannot be constructed, the processor may recover by returning the string obtained by applying the string function to the supplied number.

Note that in these rules "preceded" and "followed" refer to characters anywhere in the string, they are not to be read as "immediately preceded" and "immediately followed".

  • A picture-string consists either of a sub-picture, or of two sub-pictures separated by a pattern-separator-sign. A sub-picture must not contain a pattern-separator-sign. If the picture-string contains two sub-pictures, the first is used for positive values and the second for negative values; if there is only one, then it is taken as the positive sub-picture, and the negative sub-picture will be the same except for a leading minus-symbol.

  • A sub-picture must not contain more than one decimal-separator-sign.

  • A sub-picture must not contain more than one percent-sign or per-mille-sign, and it must not contain one of each.

  • A sub-picture must contain at least one digit-sign or zero-digit-sign.

  • A sub-picture must not contain a passive character that is preceded by a zero-digit-sign or digit-sign and that is followed by another zero-digit-sign or digit-sign.

  • A picture-string must not contain the character ¤ (Unicode #x00a4).

  • A sub-picture must not contain a grouping-separator-sign adjacent to a decimal-separator-sign.

  • A sub-picture must not contain a percent-sign or per-mille-sign in an illegal position.

  • A sub-picture must not contain a digit-sign that is preceded by a decimal-separator-sign and followed by a zero-digit-sign, or one that is preceded by a zero-digit-sign and followed by a decimal-separator-sign.

Inconsistent spacing of grouping separators is not an error. The only significant grouping separators are those closest to the decimal separator, on either side of it. Any others are ignored.

The following variables apply to the picture string as a whole:

  • digit-range-mapping (boolean, initially false): Indicates whether the standard ASCII digits (0-9) have been replaced by another set of digits. This property applies to the picture string as a whole, and is derived from the xsl:decimal-format declaration rather than the pattern string. If zero-digit is not "0", set digit-range-mapping to true and populate a mapping table for digits 0-9 (output characters for digits 1-9 must bear the same relationship to Basic Latin digits 1-9 as the requested zero-digit bears to Basic Latin digit 0).

  • two-sub-pictures (boolean initially false): Indicates whether separate positive and negative sub-pictures exist. It is true if the picture string includes a pattern-separator-sign.

The following variables each have two instances, one that applies to positive numbers, and one that applies to negative numbers:

  • fractional-part-grouping (boolean, initially false): Indicates whether the fractional-part will be grouped.

  • fractional-part-grouping-size (numeric, initially zero): Indicates the grouping-size for the fractional-part.

  • fractional-part-grouping-positions (sequence of integers, initially empty): Indicates the positions at which grouping separators will be output in the fractional-part.

  • integer-mode (boolean, initially true): Indicates whether the pattern lacks a decimal-separator and hence forces output as an integer.

  • is-percent (boolean, initially false). Indicates whether the number is to be processed as a percentage.

  • is-per-mille (boolean, initially false). Indicates whether the number is to be processed as a per-mille.

  • maximum-fractional-part-size (integer, initially zero): Indicates the largest number of digits possible for the fractional-part.

  • maximum-whole-part-size (integer, initially one). Indicates the largest number of digits possible for the whole-part.

  • minimum-fractional-part-size (integer, initially zero). Indicates the smallest number of digits possible for the fractional-part.

  • minimum-whole-part-size (integer, initially one). Indicates the smallest number of digits possible for the whole part.

  • overflow-threshold (integer, initially 10). The value is always a power of 10; it indicates the smallest number that is too large to fit the whole-part.

  • prefix (string, initially empty). Indicates the string of passive characters to appear before the number.

  • suffix (string, initially empty). Indicates the string of passive characters to appear after the number.

  • whole-part-grouping (boolean, initially false). Indicates whether the whole-part will be grouped.

  • whole-part-grouping-size (numeric, initially zero). Indicates the grouping-size for the whole-part.

  • whole-part-grouping-positions (sequence of integers, initially empty): Indicates the positions at which grouping separators will be output in the whole-part.

These variables are set using the following algorithm. If there are two sub-pictures, then the algorithm is applied to one sub-picture to obtain the values that apply to positive numbers, and to the other to obtain the values that apply to negative numbers. If there is only one sub-picture, then the values for both cases are derived from this sub-picture.

  1. Look for a decimal-separator-sign within the sub-picture. If one is found, set integer-mode to false. If none is found, leave integer-mode set to true; add an implicit decimal-separator-sign just after the last (least significant) digit-sign or zero-digit-sign in the sub-picture; and leave the minimum-fractional-part-size and maximum-fractional-part-size set to zero.

  2. From the decimal-separator-sign, step through characters in order of increasing significance (that is, to the left) looking for occurrences of the grouping-separator-sign. If none is found before the first occurrence of a passive character, leave the whole-part-grouping property set to false. When one is found, set the whole-part-grouping property to true and the whole-part-grouping-size to the number of characters between the decimal-separator-sign and the first grouping-separator-sign. (All other occurrences farther left are ignored.)When one is found, append an integer to the whole-part-grouping-positions list; the value of this integer is the number of digit-sign and zero-digit-sign characters between the grouping-separator-sign and the decimal-separator-sign.

  3. If there is a digit-sign immediately to the left of the (actual or implied) decimal-separator-sign, set the minimum-whole-part-size to zero (unless integer-mode is true, in which case leave it at one) and step through the characters in order of increasing significance (that is, to the left) looking for a passive character. If there is a zero-digit-sign immediately to the left of the decimal-separator, step through the characters in order of increasing significance (that is, to the left) looking for either a digit-sign or passive character. As soon as a character other than a zero-digit-sign or grouping-separator-sign is found, set minimum-whole-part-size to the number of zero-digit-signs found on this side of the decimal-separator. When a passive character is encountered, set the maximum-whole-part-size to the total number of digit-sign and zero-digit-sign characters found on this side of the decimal-separator. Set overflow-threshold to ten raised to the power of maximum-whole-part-size.

  4. Take all characters from the beginning of the sub-picture to the just-encountered passive character as the prefix. If two-sub-pictures is false, set the prefix for negative numbers by copying the prefix for positive numbers (if any) and then appending the minus-sign character (that is, making it the character that will be adjacent to the following number).

  5. From the (actual) decimal-separator-sign, step through characters in order of decreasing significance (that is, to the right) looking for an occurrence of the grouping-separator-sign. If none is found before the first occurrence of a passive character, leave fractional-part-grouping set to false. When one is found, set fractional-part-grouping to true and fractional-part-grouping-size to the number of characters between the decimal-separator-sign and the first grouping-separator-sign. (All other occurrences further to the right are ignored.) When one is found, append an integer to the fractional-part-grouping-positions list; the value of this integer is the number of digit-sign and zero-digit-sign characters between the grouping-separator-sign and the decimal-separator-sign.

  6. If there is a digit-sign adjacent to the decimal-separator-sign on the fractional side, leave the minimum-fractional-part-size set to zero and step through the characters in order of decreasing significance (that is, to the right) looking for a passive character , percent-sign, or per-mille-sign. If there is a zero-digit-sign adjacent to the decimal-separator-sign, step through the characters in order of decreasing significance (that is, to the right) looking for an either a digit-sign or a passive character, percent-sign, or per-mille-sign. As soon as a character other than zero-digit-sign or grouping-separator-sign is found, set minimum-fractional-part-size to the number of zero-digit-signs found on this side of the decimal-separator-sign. When a passive character, percent-sign, or per-mille-sign is encountered, set maximum-fractional-part-size to the total number of digit-sign and zero-digit-sign characters found on this side of the decimal-separator.

  7. If a percent-sign is present anywhere in the sub-picture, set is-percent to true. If a per-mille-sign is present anywhere in the sub-picture, set is-per-mille to true.

  8. Take all characters from the first-encountered passive character on the fractional side , or the percent-sign or per-mille-sign, to the end of the format string as the suffix. Include the percent-sign or per-mille-sign, if any, as the first character of the suffix.

NOTE: If there is only one sub-picture, then all variables for positive numbers and negative numbers will be the same, except for prefix: the prefix for negative numbers will have the minus-sign character appended.

16.4.4 Formatting the Number

This algorithm describes the second phase of processing of the format-number function. It takes as input a number to be formatted (referred to as the input number), and the variables set up by analysing the xsl:decimal-format declaration and the picture string, as described above. The result of this algorithm is a string, which forms the return value of the format-number function.

The algorithm for this second stage of processing is as follows:

  1. If the input number is NaN (not a number), return the concatenation of the prefix, the specified NaN-symbol, and the suffix, where the prefix and suffix are taken from the positive sub-picture if two-sub-pictures is true.

  2. In the rules below, select the set of variables that apply to positive numbers if the input number is positive, and the set of variables for negative numbers otherwise. Negative zero is taken as negative, positive zero as positive.

  3. If the input number is positive or negative infinity, return the concatenation of the appropriate prefix, the infinity-symbol, and the appropriate suffix.

  4. If is-percent is true, multiply the number by 100. If is-per-mille is true, multiply the number by 1000. The resulting number is referred to below as the adjusted number.

  5. [ERR116] It is a dynamic error if the absolute value of the adjusted number is numerically greater than or equal to the overflow-threshold. The processor may signal the error, or may recover by formatting the number as if the sub-picture were extended to the left (after any prefix) with a sufficient number of digit-sign characters to accommodate the adjusted number, with the addition of grouping-separator characters at each multiple of N characters, if and only if the existing whole-part-grouping-positions sequence satisfies the rule that the Mth integer in the sequence is in all cases equal to M * N.

  6. (Integer case) If integer-mode is true, round the adjusted number to the nearest integer using the rules for the XPath round function. If the number of digits is less than minimum-whole-part-size, prepend zero-digit-sign characters to pad out to that size. Map all digits if the digit-range-mapping property is true. If the whole-part-grouping-positions list is non-empty, insert a grouping-separator-sign at each position listed in whole-part-grouping-positions, provided there is at least one digit to the left of that grouping-separator-sign. Return the concatenation of the prefix, the string conversion of the number, and the suffix.

  7. Render the whole-part as a string. If the number of digits is less than minimum-whole-part-size, prepend zero-digit characters to pad out to that size. Map all digits if the digit-range-mapping property is true. If the whole-part-grouping-positions list is non-empty, insert a grouping-separator-sign at each position listed in whole-part-grouping-positions, provided there is at least one digit to the left of that grouping-separator-sign.. (Note: the result of this step could be a zero-length string.)

  8. Render the fractional-part as a string. If the number of digits is less than minimum-fractional-part-size, append zero-digit-sign characters to pad out to that size. If the number of digits is greater than maximum-fractional-part-size, set the least significant displayed digit by rounding as if it were the whole-part and following digits were the fractional-part in the XPath rounding procedure, but ignoring any sign that may result. Map all digits if the digit-range-mapping property is true. If the fractional-part-grouping-positions list is non-empty, insert a grouping-separator-sign at each position listed in fractional-part-grouping-positions, provided there is at least one digit to the right of that grouping-separator-sign.. (Note: the result of this step could be a zero-length string. It could also be a string of zeroes.)

  9. Return the concatenation of the prefix, the whole-part as rendered above, the decimal-separator-sign, the fractional-part as rendered above, and the suffix.

16.5 Miscellaneous Additional Functions

Issue 75 (format-date-time): There is a need for an additional function to format dates and times.

16.5.1 current()

Function: item current()

The current function, used within an XPath expression, returns the item that was the context item at the point where the expression was invoked from the XSLT stylesheet. This is referred to as the current item. For an outermost expression (an expression not occurring within another expression), the current item is always the same as the context item. Thus,

<xsl:value-of select="current()"/>

means the same as

<xsl:value-of select="."/>

However, within square brackets, or on the right-hand side of the / operator, the current item is generally different from the context item.

For example,

<xsl:apply-templates select="//glossary/item[@name=current()/@ref]"/>

will process all item elements that have a glossary parent element and that have a name attribute with value equal to the value of the current item's ref attribute. This is different from

<xsl:apply-templates select="//glossary/item[@name=./@ref]"/>

which means the same as

<xsl:apply-templates select="//glossary/item[@name=@ref]"/>

and so would process all item elements that have a glossary parent element and that have a name attribute and a ref attribute with the same value.

If the current function is used within a pattern, its value is the node that is being matched against the pattern.

16.5.2 unparsed-entity-uri()

Function: string unparsed-entity-uri(string)

The unparsed-entity-uri returns the URI of the unparsed entity with the specified name in the document containing the context node (see [4.3 Unparsed Entities]). It returns the zero-length string if there is no such entity.

16.5.3 unparsed-entity-public-id()

Function: string unparsed-entity-public-id(string)

The unparsed-entity-public-id returns the public identifier of the unparsed entity with the specified name in the document containing the context node (see [4.3 Unparsed Entities]). It returns the zero-length string if there is no such entity.

16.5.4 generate-id()

Function: string generate-id(node?)

The generate-id function returns a string that uniquely identifies a given node. The unique identifier must consist of ASCII alphanumeric characters and must start with an alphabetic character. Thus, the string is syntactically an XML name. An implementation is free to generate an identifier in any convenient way provided that it always generates the same identifier for the same node and that different identifiers are always generated from different nodes. An implementation is under no obligation to generate the same identifiers each time a document is transformed. There is no guarantee that a generated unique identifier will be distinct from any unique IDs specified in the source document. If the argument is an empty sequence, the empty sequence is returned. If the argument is omitted, it defaults to the context node.

[ERR117] It is a dynamic error if value of the argument is a sequence other than an empty sequence or a sequence containing a single node. If the first item in the sequence is not a node, the processor must signal the error. If the first item in the sequence is a node, the processor may signal the error, or may recover by returning the string that identifies the first node in the sequence.

Ed. Note: This should be covered by the standard fallback type conversion rules, but in the current XPath draft, no fallback rules are given for functions that expect a single node.

16.5.5 system-property()

Function: string system-property(string)

The argument must evaluate to a string that is a QName. The QName is expanded into a name using the namespace declarations in scope for the expression. [ERR118] It is a dynamic error if the value is not a valid QName, or if there is no namespace declaration in scope for the prefix of the QName. The processor must signal these errors.

The system-property function returns a string representing the value of the system property identified by the name. If there is no such system property, the zero-length string should be returned.

Implementations must provide the following system properties, which are all in the XSLT namespace:

  • xsl:version, a number giving the version of XSLT implemented by the processor; for implementations conforming to the version of XSLT specified by this document, this is the string "2.0". The value will always be a string in the lexical space of the decimal data type defined in XML Schema (see [XML Schema]) This allows the value to be converted to a number for the purpose of magnitude comparisons.
  • xsl:vendor, a string identifying the implementor of the processor
  • xsl:vendor-url, a string containing a URL identifying the implementor of the processor; typically this is the host page (home page) of the implementor's Web site.
  • xsl:product-name, a string containing the name of the implementation, as defined by the implementor. This should normally remain constant from one release of the product to the next. It should also be constant across platforms in cases where the same source code is used to produce compatible products for multiple execution platforms.
  • xsl:product-version, a string identifying the version of the implementation, as defined by the implementor. This should normally vary from one release of the product to the next, and at the discretion of the implementor it may also vary across different execution platforms.

The actual values returned for the above properties are implementation-defined.

The set of system properties that are supported, in addition to those listed above, is also implementation-defined.

NOTE: An implementation must not return the value 2.0 as the value of the xsl:version system property unless it is conformant to XSLT 2.0. It is recognized that vendors who are enhancing XSLT 1.0 processors may wish to release interim implementations before all the mandatory features of this specification are implemented. Since such products are not conformant to XSLT 2.0, this specification cannot define their behavior. However, implementors of such products are encouraged to return a value for the xsl:version system property that is intermediate between 1.0 and 2.0, and to provide the element-available and function-available functions to allow users to test which features have been fully implemented.

Implementations must not define additional system properties in the XSLT namespace.

17 Messages

<!-- Category: instruction -->
<xsl:message
  terminate = { "yes" | "no" }>
  <!-- Content: content-constructor -->
</xsl:message>

The xsl:message instruction sends a message in an implementation-defined way. The content of the xsl:message instruction is a content constructor. The content constructor is evaluated, and the resulting sequence of nodes is added to a newly created document node, to create an XML document. This XML document forms the content of the message. The document is typically serialized and output to an implementation-defined destination. The result of the xsl:message instruction is an empty sequence.

NOTE: Note: in many cases, the XML document produced using xsl:message will consist of a document node owning a single text node. However, it may contain a more complex structure.
NOTE: An implementation might implement xsl:message by popping up an alert box or by writing to a log file.

The terminate attribute is interpreted as an attribute value template.

If the effective value of the terminate attribute is yes, then the processor must terminate processing after sending the message. The default value is no.

One convenient way to do localization is to put the localized information (message text, etc.) in an XML document, which becomes an additional input file to the stylesheet. For example, suppose messages for a language L are stored in an XML file resources/L.xml in the form:

<messages>
  <message name="problem">A problem was detected.</message>
  <message name="error">An error was detected.</message>
</messages>

Then a stylesheet could use the following approach to localize messages:

<xsl:param name="lang" select="'en'"/>
<xsl:variable name="messages"
  select="document(concat('resources/', $lang, '.xml'))/messages"/>

<xsl:template name="localized-message">
  <xsl:param name="name"/>
  <xsl:message>
    <xsl:value-of select="$messages/message[@name=$name]"/>
  </xsl:message>
</xsl:template>

<xsl:template name="problem">
  <xsl:call-template name="localized-message">
    <xsl:with-param name="name">problem</xsl:with-param>
  </xsl:call-template>
</xsl:template>

18 Extensibility and Fallback

XSLT allows two kinds of extension, extension instructions and extension functions. An extension instruction is an element within a content constructor that is in a namespace (not the XSLT namespace) designated as an extension namespace. An extension function is a function that is available for use within an XPath expression, other than a core function defined in the XPath specification, an additional function defined in this XSLT specification, or a stylesheet function defined using an xsl:function declaration..

This specification does not define any mechanism for creating or binding implementations of extension instructions or extension functions, and does not require that implementations support any such mechanism. Such mechanisms, if they exist, are implementation-defined. Therefore, an XSLT stylesheet that must be portable between XSLT implementations cannot rely on particular extensions being available. XSLT provides mechanisms that allow an XSLT stylesheet to determine whether the implementation makes particular extensions available, and to specify what should happen if those extensions are not available. If an XSLT stylesheet is careful to make use of these mechanisms, it is possible for it to take advantage of extensions and still retain portability.

18.1 Extension Functions

If the FunctionName used in a FunctionCall within an XPath expression is not an NCName (that is, if it contains a colon), and if the stylesheet contains no stylesheet function with a matching expanded-QName, then it is treated as a call to an extension function. The QName used as the FunctionName is expanded using the namespace declarations in scope at the point in the stylesheet where the expression appears.

18.1.1 Testing Availability of Functions

The function-available function can be used with the xsl:choose and xsl:if instructions to explicitly control how a stylesheet should behave if a particular extension function is not available.

Function: boolean function-available(string)

A function name is said to be available if it matches the name of a core function defined in XPath, or the name of an additional function defined in this XSLT specification, or the name of a stylesheet function, or if the processor is able to locate an implementation of an extension function with a matching name.

The function-available function returns true if the function name supplied as its argument is available; otherwise it returns false.

The value of the first argument must be a string containing a QName. The QName is expanded into an expanded-QName using the namespace declarations in scope for the expression. The function-available function returns true if and only if the expanded-QName is the name of a function in the function library. If the expanded-QName has a non-null namespace URI, then it refers to a stylesheet function or extension function; otherwise, it refers to a function defined by XPath or XSLT.

[ERR119] It is a dynamic error if the argument does not evaluate to a string that is a valid QName, or if there is no namespace declaration in scope for the prefix of the QName. The processor must either signal the error, or must recover by returning the value false.

NOTE: The fact that a function with a given name is available gives no guarantee that any particular call on the function will be successful. For example, it is not possible to determine the number of arguments expected, nor their types.

[ERR120] It is a dynamic error if a FunctionCall within an XPath expression is evaluated, when the function in question is not available. The processor must signal the error. An implementation must not signal a static error merely because an expression contains a call to an extension function for which no implementation is available.

18.1.2 Calling Extension Functions

If the FunctionName used in a FunctionCall within an XPath expression identifies an extension function, then to evaluate the FunctionCall, the processor will first evaluate each of the arguments in the FunctionCall. If the processor has information about the data types expected by the extension function, then it may perform any necessary type conversions between the XPath data types and those defined by the implementation language. If multiple extension functions are available with the same name, the processor may decide which one to invoke based on the number of arguments, the types of the arguments, or any other criteria. The result returned by the implementation is returned as the result of the function call, again after any necessary conversions between the data types of the implementation language and those of XPath. The details of such type conversions are outside the scope of this specification.

[ERR121] It is a dynamic error if the arguments supplied to a call on an extension function do not satisfy the rules defined for that particular extension function, or if the extension function reports an error, or if the result of the extension function cannot be converted to an XPath value.The processor must signal the error.

NOTE: There is no prohibition on calling extension functions that have side-effects (for example, an extension function that writes data to a file). However, the order of execution of XSLT instructions is not defined in this specification, so the effects of such functions are unpredictable.
NOTE: Implementations are not required to perform full validation of values returned by extension functions. For example, the effect of returning a string containing characters that are not legal XML characters is implementation-defined.
NOTE: The ability to execute extension functions represents a potential security weakness, since untrusted stylesheets may invoke code that has privileged access to resources on the machine where the processor executes. Implementations may therefore provide mechanisms that restrict the use of extension functions by untrusted stylesheets.

18.1.3 External Objects

An implementation may allow an extension function to return an object that does not have any natural representation in the XPath data model, either as an atomic value or as a node. For example, an extension function sql:connect might return an object that represents a connection to a relational database; the resulting connection object might be passed as an argument to calls on other extension functions such as sql:insert and sql:select.

The way in which such objects are represented in the type system is implementation-defined. They might be represented by a completely new data type, or they might be mapped to existing data types such as integer, string, or anyURI.

18.2 Extension Instructions

The extension instruction mechanism allows namespaces to be designated as extension namespaces. When a namespace is designated as an extension namespace and an element with a name from that namespace occurs in a content constructor, then the element is treated as an instruction rather than as a literal result element. The namespace determines the semantics of the instruction.

NOTE: Since an element that is a child of an xsl:stylesheet element is not occurring in a content constructor, non-XSLT top-level elements are not extension elements as defined here, and nothing in this section applies to them.

18.2.1 Designating an Extension Namespace

A namespace is designated as an extension namespace by using an [xsl:]extension-element-prefixes attribute on an element in the stylesheet (see [3.3 Standard Attributes]). The attribute must be in the XSLT namespace only if its parent element is not in the XSLT namespace. The value of the attributes is a whitespace-separated list of namespace prefixes. The namespace bound to each of the prefixes is designated as an extension namespace. [ERR122] It is a static error if there is no namespace bound to the prefix on the element bearing the [xsl:]extension-element-prefixes attribute. The default namespace (as declared by xmlns) may be designated as an extension namespace by including #default in the list of namespace prefixes. The designation of a namespace as an extension namespace is effective for the element bearing the [xsl:]extension-element-prefixes attribute and for all descendants of that element within the same stylesheet module.

18.2.2 Testing Availability of Instructions

The element-available function can be used with the xsl:choose and xsl:if instructions to explicitly control how a stylesheet should behave if a particular extension instruction is not available.

Function: boolean element-available(string)

The value of the first argument must be a string containing a QName. The QName is expanded into an expanded-QName using the namespace declarations in scope for the expression. If there is a default namespace in scope, then it is used to expand an unprefixed QName. The element-available function returns true if and only if the expanded-QName is the name of an instruction. If the expanded-QName has a namespace URI equal to the XSLT namespace URI, then it refers to an element defined by XSLT. Otherwise, it refers to an extension instruction. If the expanded-QName has a null namespace URI, the element-available function will return false.

[ERR123] It is a dynamic error if the argument does not evaluate to a string that is a valid QName, or if there is no namespace declaration in scope for the prefix of the QName. The processor must either signal the error, or must recover by returning the value false.

If the processor does not have an implementation of a particular extension instruction available, then the element-available function must return false for the name of the element. When such an extension instruction is evaluated, then the processor must perform fallback for the element as specified in [18.2.3 Fallback]. An implementation must not signal an error merely because a template contains an extension instruction for which no implementation is available.

If the processor has an implementation of a particular extension instruction available, then the element-available function must return true for the name of the element.

18.2.3 Fallback

<!-- Category: instruction -->
<xsl:fallback>
  <!-- Content: content-constructor -->
</xsl:fallback>

[ERR124] When a processor performs fallback for an instruction element, if the instruction element has one or more xsl:fallback children, then the content of each of the xsl:fallback children must be evaluated; it is a dynamic error if it has no xsl:fallback children. This error must be signaled.

The content of an xsl:fallback element is a content constructor, and when performing fallback, the value returned by the xsl:fallback element is the result of evaluating this content constructor.

When not performing fallback, evaluating an xsl:fallback element returns an empty sequence: the content of the xsl:fallback element is ignored.

There are two situations where a processor performs fallback: when an extension instruction that is not available is evaluated, and when an instruction in the XSLT namespace, that is not defined in XSLT 2.0, is evaluated within a region of the stylesheet for which forwards compatible behavior is enabled.

Issue 129 (fallback-on-new-elements): To allow an XSLT 2.0 stylesheet to be used with an XSLT 1.0 processor, all new instructions should allow an xsl:fallback child element.

19 Result Trees

The output of a transformation is a set of final result trees. One of these is the principal result tree, the others are referred to as secondary result trees. The principal result tree is the current result tree when the transformation is initiated; a secondary result tree is created using an xsl:result-document instruction, and becomes the current result tree while the xsl:result-document instruction is being evaluated.

Ed. Note: The term "final result tree" seems oxymoronic. Perhaps we should just call them "result trees". In this case the term "current result tree" needs to change, perhaps to "current destination", to reflect the fact that the destination might be a temporary tree rather than a [final] result tree.

The way in which a final result tree is delivered to an application is implementation-defined.

A final result tree has a URI. If the implementation provides an API to access final result trees, then it must allow a final result tree to be identified by means of this URI.

NOTE: The URI of the result tree is not the same thing as the URI of its serialized representation on disk, if any. For example, a server (or browser client) might store the result trees only in memory, or in an internal disk cache. As long as it satisfies requests for those URIs, it is irrelevant where they are actually written on disk, if at all.

The URI of the principal result tree is specified using the href attribute of an xsl:principal-result-document declaration; the URI of a secondary result tree is specified using the href attribute of an xsl:result-document instruction. If the URI for a secondary result tree is relative, it is resolved relative to the base URI of the principal result tree. If the URI for the principal result tree is relative, it is resolved relative to an implementation-defined base URI.

NOTE: It will often be the case that one result tree contains links to another result tree produced during the same transformation, in the form of a relative URI. The mechanism of associating a URI with a final result tree has been chosen to allow the integrity of such links to be preserved when the trees are serialized.
NOTE: The URI of a result tree is unrelated to the base URI of its document node.

Serialization of final result trees is described further in [20 Serialization]

19.1 The Principal Result Tree

<!-- Category: declaration -->
<xsl:principal-result-document
  format = qname
  href = { uri-reference }
  type-information = "strict" | "lax" | "preserve" | "none" />

The xsl:principal-result-document element is used to define the URI of the principal result tree, and optionally to specify the output format to be used for serializing this tree.

The value of the format attribute, if specified, must be a QName. The QName is expanded using the namespace declarations in scope for the xsl:principal-result-document element. The expanded-QName must match the expanded QName of a named output definition in the stylesheet. This identifies the xsl:output declaration that will control the serialization of the result tree (see [20 Serialization]), if the result tree is serialized. If the format attribute is omitted, the unnamed output definition is used to control serialization of the principal result tree.

[ERR125] It is a static error if the value of the format attribute is not a valid QName, or if it does not match the expanded-QName of an output definition in the stylesheet.

If there is more than one xsl:principal-result-document element at the top level of the stylesheet, the one with highest import precedence is used. [ERR126] It is a static error if there is more than one xsl:result-document element at the top level of the stylesheet with the same import precedence, unless there is also another xsl:result-document element at the top level of the stylesheet with a higher import precedence.

If there is no xsl:principal-result-document element at the top level of the stylesheet, the effect is the same as specifying a single top-level xsl:result-document element with no attributes.

The href attribute defines the URI of the principal result tree. The attribute value template is expanded using a singleton focus based on the document node of the principal source document. The effective value of the attribute must be a URI. There may be implementation-defined restrictions on the form of absolute URI that may be used, but the implementation is not required to enforce any restrictions. Any legal relative URI must be accepted.

If the effective value is a relative URI, then it is resolved relative to an implementation-defined base URI.

The href attribute may be omitted, in which case the URI of the principal result tree is implementation-defined.

NOTE: The main reason to specify a URI for the principal result tree is to allow it to be referenced by links from a secondary result tree.

The type-information attribute controls the type annotations that will be present on element and attribute nodes in the result tree. The values and their meanings are the same as those for the type-information attribute on variable binding elements, described in [9.2 Temporary Trees].

19.2 Secondary Result Trees

<!-- Category: instruction -->
<xsl:result-document
  format = qname
  href = { uri-reference }
  type-information = "strict" | "lax" | "preserve" | "none">
  <!-- Content: content-constructor -->
</xsl:result-document>

The xsl:result-document instruction is used to create a secondary result tree. The content of the xsl:result-document element is a content constructor; this is evaluated to create a sequence of nodes. A document node is created with this sequence of nodes as its children. The tree rooted at this document node forms the secondary result tree.

The xsl:result-document instruction defines the URI of a secondary final result tree, and may optionally specify the output format to be used for serializing this tree.

The value of the format attribute, if specified, must be a QName. The QName is expanded using the namespace declarations in scope for the xsl:result-document element. The expanded-QName must match the expanded QName of a named output definition in the stylesheet. This identifies the xsl:output declaration that will control the serialization of the result tree (see [20 Serialization]), if the result tree is serialized. If the format attribute is omitted, the unnamed output definition is used to control serialization of the result tree.

[ERR127] It is a static error if the value of the format attribute is not a valid QName, or if it does not match the expanded-QName of an output definition in the stylesheet.

The href attribute is mandatory. The effective value of the attribute must be a URI. There may be implementation-defined restrictions on the form of absolute URI that may be used, but the implementation is not required to enforce any restrictions. Any legal relative URI must be accepted.

If the effective value is a relative URI, then it is resolved relative to the URI of the principal result tree.

The type-information attribute controls the type annotations that will be present on element and attribute nodes in the result tree. The values and their meanings are the same as those for the type-information attribute on variable binding elements, described in [9.2 Temporary Trees].

A processor may allow a secondary result tree to be serialized, just as it may allow the principal result tree to be serialized. Serialization is described in [20 Serialization]. However, an implementation (for example, a processor running in an environment with no access to writable filestore) is not required to support the serialization of secondary result trees. An implementation that does not support the serialization of secondary result trees must ignore the format attribute. Such an implementation must provide the application with some means of access to the (un-serialized) result tree, using its URI to identify it.

Ed. Note: We don't want to say that it must ignore the format attribute, it might want to use it in some other way.

For example, the following would create a principal result document specifying an HTML frameset with two frames, together with two secondary result documents, one for the contents of each frame:

Ed. Note: Choose an example that is less client-side focused.

<xsl:template match="/">
  <html>
    <head><title>Frame example</title></head>
    <frameset cols="20%, 80%">
      <frame src="toc.html"/>
      <xsl:result-document href="toc.html">
        <html>
          <head><title>Table of Contents</title></head>
          <body>
             <xsl:apply-templates mode="toc" select="*"/>
          </body>
        </html>
      </xsl:result-document>
      <frame src="body.html"/>
      <xsl:result-document href="body.html">
        <html>
          <head><title>Body</title></head>
          <body>
             <xsl:apply-templates select="*"/>
          </body>
        </html>
      </xsl:result-document>
    </frameset>
  </html>
</xsl:template>

[ERR128] It is a dynamic error to evaluate the xsl:result-document instruction when the current result tree is neither the principal result tree, nor a secondary result tree. The processor must signal the error.. This means, for example, that it is an error to use xsl:result-document when the current result tree is a temporary tree created using xsl:variable, or a tree created using xsl:message, xsl:attribute, etc.

[ERR129] It is a dynamic error for a transformation to generate two or more result trees with the same URI. The processor must signal the error.

Technically, the result of evaluating the xsl:result-document instruction is an empty sequence. This means it does not contribute any nodes to the result of the content constructor it is part of.

[ERR130] It is a dynamic error for a stylesheet to write to an external resource and read from the same resource during a single transformation, whether or not the same URI is used to access the resource in both cases. The effect of this error is implementation-dependent: implementations are not obliged to detect it.

20 Serialization

A processor may output a final result tree as a sequence of bytes, although it is not required to be able to do so (see [21 Conformance]). Stylesheet authors can use the xsl:output declaration to specify how they wish result trees to be serialized. If a processor serializes the result tree, it should do so as specified by these elements; however, it is not required to do so.

<!-- Category: declaration -->
<xsl:output
  name = qname
  method = "xml" | "html" | "xhtml" | "text" | qname-but-not-ncname
  version = nmtoken
  encoding = string
  omit-xml-declaration = "yes" | "no"
  standalone = "yes" | "no"
  doctype-public = string
  doctype-system = string
  cdata-section-elements = qnames
  escape-uri-attributes = "yes" | "no"
  include-content-type = "yes" | "no"
  indent = "yes" | "no"
  media-type = string />

The xsl:output declaration is optional; if used, it must always appear as a top-level element within a stylesheet.

All attributes on xsl:output except the name attribute are interpreted as attribute value templates. Expressions recognized in the attributes of xsl:output are evaluated with a singleton focus based on the document node of the principal source document. [ERR131] It is a dynamic error if the value of an attribute does not conform to the rules listed below. The processor must either signal the error, or must recover by using the default value for that attribute.

A stylesheet may contain multiple xsl:output declarations and may include or import stylesheet modules that also contain xsl:output declarations. The name of an xsl:output declaration is the value of its name attribute, if any. All the xsl:output declarations in a stylsheet that share the same name are grouped into a named output definition; those that have no name are grouped into a a single unnamed output definition.

A named output definition is used when its name matches the format attribute used in an xsl:result-document element. The unnamed output definition is used when an xsl:result-document element omits the format attribute.

All the xsl:output elements making up an output definition are effectively merged. For the cdata-section-elements attribute, the output definition uses the union of the values from all the constituent xsl:output declarations. For other attributes, the output definition uses the value of that attribute from the xsl:output declaration with the highest import precedence. [ERR132] It is a dynamic error if two xsl:output declarations within an output definition specify explicit values for the same attribute (other than cdata-section-elements), with the values of the attributes being not equal, and with neither of these declarations being overridden by an xsl:output declaration with higher import precedence that specifies an explicit value for the same attribute. The processor must either signal the error, or must recover by using the value that occurs last in declaration order.

The values of attributes are defaulted after the xsl:output elements have been merged; different output methods may have different default values for an attribute.

An implementation may allow the attributes of the xsl:output declaration to be overridden using the API that controls the transformation.

Before a final result tree is serialized, namespace fixup is performed (see [4.5 Namespace Fixup]).

The location to which result trees are serialized (whether in filestore or elsewhere) is implementation-defined (which in practice may mean that it is controlled using an implementation-defined API). However, these locations must satisfy the constraint that any relative URI used to reference one result tree from another remains valid when all the result trees are serialized. So, with the example in [19.2 Secondary Result Trees], the HTML document produced by serializing the principal result tree would contain valid references to the HTML documents produced by serializing the result trees denoted by href="toc.html" and href="body.html" respectively.

The method attribute on the xsl:output element identifies the overall method that should be used for outputting the result tree. [ERR133] The value must be a valid QName. If the QName does not have a prefix, then it identifies a method specified in this document and must be one of xml, html, xhtml, or text. If the QName has a prefix, then the QName is expanded into an expanded-QName as described in [5.1 Qualified Names]; the expanded-QName identifies the output method; the behavior in this case is not specified by this document.

The default for the method attribute is ch