URIs define escaping for octets, not for characters
http://www.example.com/turnover/2001/March (March alone can be a relative URI)
encoding
(server side/ undefined) |
us-ascii
or %HH |
utf-8 or %HH | ||
original characters | <====> |
bytes | URI | IRI |
---|---|---|---|---|
March | us-ascii/utf-8 | 4D 61 72 63 68 |
March |
March |
März | iso-8859-1 | 4D E4 72 7A |
M%E4rz |
|
März | macintosh | 4D 8A 72 7A |
M%8Arz |
M%8Arz |
März | utf-8 | 4D C3 A4 72 7A |
M%C3%A4rz |
|
IRI functionality can only be used when URI octet encoding is UTF-8.
4 of 18 |