XML 1.0 Internationalization Features
- Reference model: XML document is a sequence of Unicode characters
- Encoding identification (e.g.
encoding='iso-8859-1'
) and
priorities
UTF-8
and UTF-16
as encoding defaults
- Numeric character references (e.g.
ꯍ
, always
Unicode)
xml:lang
for language identification (inherits)
- Large character repertoire for element/attribute names
- System Identifiers are IRIs (Internationalized Resource
Identifiers)
- First edition said something about character normalization