Efficient XML Interchange (EXI) Format 1.0 Errata

1. Substantive Errata

Section 7

19 September 2012 (1)

Below is a paragraph excerpted from section 7 Representing Event Content.

Schemas can provide one or more enumerated values for datatypes. When the Preserve.lexicalValues option is false, EXI exploits those pre-defined values when they are available to represent values of such datatypes in a more efficient manner than would have done otherwise without using pre-defined values. The encoding rule for representing enumerated values is described in 7.2 Enumerations. Datatypes that are derived from another by union and their subtypes are always represented as String regardless of the availability of enumerated values. Representation of values of which the datatype is one of QName, Notation or a datatype derived therefrom by restriction are also not affected by enumerated values if any.

Make the above paragraph the one shown below. The modified part is highlighted in color for distinction purposes only.

Schemas can provide one or more enumerated values for datatypes. When the Preserve.lexicalValues option is false, EXI exploits those pre-defined values when they are available to represent values of such datatypes in a more efficient manner than would have done otherwise without using pre-defined values. The encoding rule for representing enumerated values is described in 7.2 Enumerations. Datatypes that are derived from another by union and their subtypes are always represented as String regardless of the availability of enumerated values. Representation of values of which the datatype is either a list datatype^XS2, or one of QName, Notation or a datatype derived therefrom by restriction are also not affected by enumerated values if any.

Section 7.2

19 September 2012 (2)

Below is a paragraph excerpted from section 7.2 Enumerations.

Exceptions are for schema types derived from others by union and their subtypes, QName or Notation and types derived therefrom by restriction. The values of such types are processed by their respective built-in EXI datatype representations instead of being represented as enumerations.

Make the above paragraph the one shown below. The modified part is highlighted in color for distinction purposes only.

Exceptions are for schema union datatypes^XS2, list datatypes^XS2, as well as QName or Notation and types derived therefrom by restriction. The values of such types are processed by their respective built-in EXI datatype representations instead of being represented as enumerations.

Section 8.4.3

08 May 2012

Change the semantics section that currently reads as follows

All productions in the built-in element grammar of the form LeftHandSide: AT (*) RightHandSide are evaluated as follows:

Let qname be the qname of the attribute matched by AT (*)

Create a production of the form LeftHandSide : AT (qname) RightHandSide with an event code 0 and increment the first part of the event code of each production in the current grammar with the non-terminal LeftHandSide on the left-hand side. Add this production to the grammar.

If qname is xsi:type, let target-type be the value of the xsi:type attribute and assign it the QName datatype representation (see 7.1.7 QName). If there is no namespace in scope for the specified qname prefix, set the uri of target-type to empty ("") and the localName to the full lexical value of the QName, including the prefix. Encode target-type according to section 7. Representing Event Content. If a grammar can be found for the target-type type using the encoded target-type representation, evaluate the element contents using the grammar for target-type type instead of RightHandSide.

to the following form

All productions in the built-in element grammar of the form LeftHandSide: AT (*) RightHandSide are evaluated as follows:

Let qname be the qname of the attribute matched by AT (*)

If qname is not xsi:type or if a production of the form LeftHandSide : AT(xsi:type) with an event code of length 1 does not exist in the current element grammar, create a production of the form LeftHandSide : AT (qname) RightHandSide with an event code 0 and increment the first part of the event code of each production in the current grammar with the non-terminal LeftHandSide on the left-hand side. Add this production to the grammar.

If qname is xsi:type, let target-type be the value of the xsi:type attribute and assign it the QName datatype representation (see 7.1.7 QName). If there is no namespace in scope for the specified qname prefix, set the uri of target-type to empty ("") and the localName to the full lexical value of the QName, including the prefix. Encode target-type according to section 7. Representing Event Content. If a grammar can be found for the target-type type using the encoded target-type representation, evaluate the element contents using the grammar for target-type type instead of RightHandSide.
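
The revised production rule can be illustrated with a minimal Python sketch. It is not taken from the specification: the tuple-based model of productions, the name `learn_attribute`, the reading of "event code of length 1" as an event code with a single part, and the omission of RightHandSide are all assumptions made for illustration.

```python
XSI_TYPE = ("http://www.w3.org/2001/XMLSchema-instance", "type")


def learn_attribute(productions, qname):
    """Return the productions of one LeftHandSide after AT(*) matched `qname`.

    Each production is modelled as (terminal, event_code), where event_code
    is a tuple of integer parts. RightHandSide is left out for brevity.
    """
    has_short_xsi_type = any(
        terminal == ("AT", XSI_TYPE) and len(code) == 1
        for terminal, code in productions
    )
    if qname == XSI_TYPE and has_short_xsi_type:
        # Revised rule: no second AT(xsi:type) production is created.
        return list(productions)
    # Increment the first part of every existing event code ...
    shifted = [
        (terminal, (code[0] + 1,) + code[1:]) for terminal, code in productions
    ]
    # ... and add AT(qname) with event code 0.
    return [(("AT", qname), (0,))] + shifted


# Hypothetical starting productions for one non-terminal.
start = [(("EE", None), (0, 0)), (("AT", "*"), (0, 1))]
once = learn_attribute(start, XSI_TYPE)
twice = learn_attribute(once, XSI_TYPE)
```

Under the text before the erratum, the second call would have added another AT(xsi:type) production; in this sketch `twice` is equal to `once`.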

Section 8.5.4.1.3

29 March 2013 (1)

Change the fourth paragraph in Section 8.5.4.1.3 Type Grammars from

Sections 8.5.4.1.3.1 Simple Type Grammars and 8.5.4.1.3.2 Complex Type Grammars describe the processes for creating Type_i and TypeEmpty_i from XML Schema simple type definitions^XS1 and complex type definitions^XS1 defined in schemas as well as built-in primitive types^XS2, built-in derived types^XS2 and simple ur-type^XS2 defined by XML Schema specification [XML Schema Datatypes]. Section 8.5.4.1.3.3 Complex Ur-Type Grammar defines the grammar used for processing instances of element contents of type xsd:anyType^XS1.

to

Sections 8.5.4.1.3.1 Simple Type Grammars and 8.5.4.1.3.2 Complex Type Grammars describe the processes for creating Type_i and TypeEmpty_i from XML Schema simple type definitions^XS1 and complex type definitions^XS1 defined in schemas as well as built-in primitive types^XS2, built-in derived types^XS2, simple ur-type^XS2 and complex ur-type^XS1 defined by XML Schema specification [XML Schema Datatypes].

Section 8.5.4.1.3.2

29 March 2013 (2)

Change the grammar that reads as follows

G_n−1, 0 :
EE

to the following form

G_n−1, 0 :
EE

G_n−1, 1 :
EE

and add the following rule just before the first note in the section:

If there is neither an attribute use nor an {attribute wildcard}, G₀ of the following form is used as an attribute use grammar.

G_0, 0 :
EE

Section 8.5.4.1.3.3

29 March 2013 (3)

Given that Section 8.5.4.1.3 Type Grammars of the EXI specification already makes clear how grammars are built, Section 8.5.4.1.3.3 Complex Ur-Type Grammar and all references to it are removed.

Appendix A.1

26 June 2013

Add the following paragraph below the Namespaces in XML reference:

Namespaces in XML 1.1

Namespaces in XML 1.1 (Second Edition), T. Bray, D. Hollander, A. Layman, and R. Tobin, Editors. World Wide Web Consortium, 4 February 2004, revised 16 August 2006. This version is http://www.w3.org/TR/2006/REC-xml-names11-20060816. The latest version is available at http://www.w3.org/TR/xml-names11/.

2. Editorial Errata

To be added upon receipt of errors.

3. Clarifications

Section 7

30 May 2011

Append the following text as 2nd paragraph right after Table 7-2.

The restricted character set for a value that would be represented as an EXI enumeration is the restricted character set of the EXI datatype representation of the enumeration base type.

Section 7.1.2

22 February 2012

The primary change is in the order of the two paragraphs. In the revised text, the special case is described first, followed by the default case. A clause clarifying the condition is added, highlighted in color above for distinction.

Section 7.4

05 October 2011

Make the above paragraph the one shown below by appending text. The appended part is highlighted in color for distinction purposes only.

Section 8.5.4.1.5

03 April 2012

Make the above text the one shown below. The modified part is highlighted in color for distinction purposes only.

Section 4

06 May 2013

The namespace of elements and attributes is specified as part of SE and AT events and hence namespace declarations can be omitted from the EXI stream if preservation of prefixes is not required by the applications. As prescribed by Table B-2 and Table B-11, [namespace attributes] representing namespace declarations are mapped to NS events and SHOULD NOT be represented by AT events. This also implies that the following AT events SHOULD NOT occur in EXI streams: (1) AT events with qname whose uri is "http://www.w3.org/2000/xmlns/"; (2) AT events with qname which has empty uri ("") and local name either of the form "xmlns" or "xmlns:*", where "*" represents a string of 0 or more characters.
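
The two cases listed above can be expressed as a small predicate. The following Python sketch is illustrative only; the function name `is_namespace_declaration_at` and its two-argument signature are assumptions, not part of the EXI specification.

```python
XMLNS_URI = "http://www.w3.org/2000/xmlns/"


def is_namespace_declaration_at(uri, local_name):
    """Return True if an AT event with this qname SHOULD NOT occur.

    Case (1): the qname's uri is the xmlns namespace.
    Case (2): the uri is empty and the local name is "xmlns" or "xmlns:*".
    """
    if uri == XMLNS_URI:
        return True
    return uri == "" and (
        local_name == "xmlns" or local_name.startswith("xmlns:")
    )
```

An encoder could use such a check to route the attribute to an NS event, or drop it when prefixes are not preserved, instead of emitting an AT event.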

Section 7.1.8

13 June 2013

Below is the first paragraph and Table 7-3 excerpted from section 7.1.8 Date-Time:

Table 7-3. Date-Time components
Component	Value	Type
Year	Offset from 2000	Integer ( 7.1.5 Integer)
MonthDay	Month * 32 + Day	9-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer) where day is a value in the range 1-31 and month is a value in the range 1-12.
Time	((Hour * 64) + Minutes) * 64 + seconds	17-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer)
FractionalSecs	Fractional seconds	Unsigned Integer ( 7.1.6 Unsigned Integer) representing the fractional part of the seconds with digits in reverse order to preserve leading zeros
TimeZone	TZHours * 64 + TZMinutes	11-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer) representing a signed integer offset by 896 ( = 14 * 64 )
presence	Boolean presence indicator	Boolean (7.1.2 Boolean)

Change the content of the paragraph and the table to the one shown below by appending the highlighted text:

The Date-Time datatype representation is a sequence of values representing the individual components of the Date-Time. The following table specifies each of the possible date-time components along with how they are encoded. The value ranges of the date-time components follow the definitions of the XML Schema specification [XML Schema Datatypes] which for example prescribes the value range of the seconds to be between 0 and 60 to account for leap second representation and hour between 0 and 24 among others.

Table 7-3. Date-Time components
Component	Value	Type
Year	Offset from 2000	Integer ( 7.1.5 Integer)
MonthDay	Month * 32 + Day	9-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer) where day is a value in the range 1-31 and month is a value in the range 1-12.
Time	((Hour * 64) + Minutes) * 64 + seconds	17-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer) where Hour is a value in the range 0-24, Minutes is a value in the range 0-59 and seconds is a value in the range 0-60
FractionalSecs	Fractional seconds	Unsigned Integer ( 7.1.6 Unsigned Integer) representing the fractional part of the seconds with digits in reverse order to preserve leading zeros
TimeZone	TZHours * 64 + TZMinutes	11-bit Unsigned Integer (7.1.9 n-bit Unsigned Integer) representing a signed integer offset by 896 ( = 14 * 64 ) where TZHours is a value in the range [-14 .. 14] and TZMinutes is a value in the range [-59 .. 59]
presence	Boolean presence indicator	Boolean (7.1.2 Boolean)
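
The component formulas in Table 7-3 can be checked with a short Python sketch. The function names are hypothetical, and the range checks simply mirror the ranges stated in the table; the sketch computes the integer value of each component only and does not produce an EXI bit stream.

```python
def encode_year(year):
    # Year is represented as an offset from 2000.
    return year - 2000


def encode_month_day(month, day):
    if not (1 <= month <= 12 and 1 <= day <= 31):
        raise ValueError("month or day out of range")
    return month * 32 + day  # fits in 9 bits


def encode_time(hour, minutes, seconds):
    if not (0 <= hour <= 24 and 0 <= minutes <= 59 and 0 <= seconds <= 60):
        raise ValueError("time component out of range")
    return (hour * 64 + minutes) * 64 + seconds  # fits in 17 bits


def encode_fractional_secs(digits):
    # `digits` is the string of decimal digits after the point, e.g. "015".
    # Reversing the digits keeps the leading zeros of the fraction.
    return int(digits[::-1])


def encode_time_zone(tz_hours, tz_minutes):
    if not (-14 <= tz_hours <= 14 and -59 <= tz_minutes <= 59):
        raise ValueError("time zone component out of range")
    value = tz_hours * 64 + tz_minutes + 896  # 896 = 14 * 64
    if not 0 <= value < 2 ** 11:
        raise ValueError("time zone does not fit in 11 bits")
    return value
```

For example, a time of 09:30:05.015 at +09:00 on 13 June yields MonthDay 205, Time 38789, FractionalSecs 510 and TimeZone 1472.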

Section 7.1.5

27 June 2013

Change the paragraph that currently reads as follows

If the associated schema datatype is derived from xsd:integer and the bounded range determined by its minInclusive^XS2, minExclusive^XS2, maxInclusive^XS2 and maxExclusive^XS2 facets has 4096 or fewer values, the value is represented as an n-bit Unsigned Integer where n is ⌈ log₂ m ⌉ and m is the bounded range of the schema datatype.

to the following form

If the associated schema datatype is derived from xsd:integer and the bounded range determined by its minInclusive^XS2, minExclusive^XS2, maxInclusive^XS2 and maxExclusive^XS2 facets has 4096 or fewer values, the value is represented as an n-bit Unsigned Integer offset from the minimum value in the range where n is ⌈ log₂ m ⌉ and m is the bounded range of the schema datatype.
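
As a worked illustration of the clarified rule, the following Python sketch computes the bit width n and the offset value for a bounded integer. The function name and the use of inclusive bounds are assumptions; exclusive facets are assumed to have been converted to inclusive bounds by the caller, and m is taken to be the number of values in the range.

```python
def encode_bounded_integer(value, minimum, maximum):
    """Return (n, offset) for a value in the inclusive range [minimum, maximum]."""
    m = maximum - minimum + 1  # number of values in the bounded range
    if not 1 <= m <= 4096:
        raise ValueError("rule applies only to ranges of 4096 or fewer values")
    if not minimum <= value <= maximum:
        raise ValueError("value outside the bounded range")
    n = (m - 1).bit_length()  # equals ceil(log2(m)) for m >= 1
    return n, value - minimum
```

With a range of -5 to 10 there are 16 values, so n is 4 and the value 3 is represented as the 4-bit unsigned integer 8.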

Section 8.5.3

19 August 2013 (1)

Remove the last sentence from Section "8.5.3 Schema-informed Element Fragment Grammar" that reads:

Section 8.5.4.1.3

19 August 2013 (2)

Section 8.5.4.1.3.1

19 August 2013 (3)

Remove the last sentence from Section "8.5.4.1.3.1 Simple Type Grammars" that reads:

Section 8.5.4.1.3.2

19 August 2013 (4)

Also remove the last sentence from Section "8.5.4.1.3.2 Complex Type Grammars" that reads:

Section 8.5.4.2.2

19 August 2013 (5)

Remove the paragraph from Section "8.5.4.2.2 Eliminating Duplicate Terminal Symbols" that reads:

Section 8.5.4.4.1

19 August 2013 (6)

Insert the following text as a second paragraph in Section "8.5.4.4.1 Adding Productions when Strict is False" right after the first sentence that reads:

Modify the second sentence from Section "8.5.4.4.1 Adding Productions when Strict is False" that reads:

	G_{{min occurs}, k} :
		EE

	G_{{min occurs}, k} :
		G_{{min occurs}, 0}

	G_{{min occurs}, 0} :
		EE

	G_{{min occurs}, k} :
		EE

27 June 2013
