Internationalization of URIs: Problem
- Characters =>1) Bytes =>2) Characters
- 1) is undefined (originally iso-8859-1, in most cases ASCII is
ASCII)
- 2) is defined (ASCII is ASCII, rest is %HH-escaped)
- Result: Transformation factored out for ASCII, undefined for rest
- Convergence to uniform mapping 1) is needed
- [I personally favored UTF-7 because of length issues, but switched to
UTF-8 (originally proposed by François Yergeau) quickly once I knew that
the IETF went with UTF-8 in RFC 2277]