Canonical XML


Are these two documents the same?

 
<workplan day="Friday" name="Nevada Barr" >
 <garden /> <work>
 <learn>XML </learn>
 <teach>Fred Smith</teach>
 <mark>CS302</mark>
 </work>
 <play>
 <tv time="17.00" location="Channel 4"></tv> 
 <radio location="BBC1" time="19.00">
 </radio>
 </play>
</workplan>
            <workplan name ="Nevada Barr"
    day ="Friday">
            <garden></garden>
 <work>
 <learn>XML </learn>
 <teach>Fred Smith</teach>
 <mark>CS302</mark> </work><play>
 <tv location="Channel 4" time="17.00" />
 <radio time="19.00" location="BBC1" />
 </play>
</workplan>
  • No space between last attribute quote mark and >
  • No space between = and opening quote of attribute
  • Namespaces and attributes in lexical order
  • Defines a physical representation that does not vary under syntactic variations of the input document
  • For each node, order is content, namespaces, attributes,child nodes