With the reference processing model, escapes become unambiguous:
NCRs are decimal (A is A) or, in HTML 4.0 and XML, hexadecimal (A is A); now in SGML corrigendum
A
A
One character = one escape, not two for surrogate pairs
François Yergeau & Martin Dürst
26 of 107