Applications: Checking encodings; bootstrapping if encoding information is
internal to a file
- XML/XSLT: not possible
- Java: use byte array (byte[])
- Perl: switching UTF-8 recognition on and off may be tricky and is
  version dependent
- C/C++: use char*
- Python: use bytes (Python 3); in Python 2 the plain 'string' type
  (str) holds bytes
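
A minimal Python 3 sketch of both applications named above, working on raw
bytes: checking whether data is valid UTF-8, and bootstrapping from an
XML-style encoding declaration inside the file. The function names and the
200-byte look-ahead are assumptions made for illustration, not part of these
notes, and the sniffing only handles ASCII-compatible encodings (not UTF-16
or UTF-32).

```python
import re

# Hypothetical helper names; this pattern only covers an XML-style
# declaration at the very start of the data.
_XML_DECL = re.compile(
    rb'^<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\']'
)


def is_valid_utf8(data: bytes) -> bool:
    """Check an encoding: strict decoding fails on any malformed sequence."""
    try:
        data.decode("utf-8", errors="strict")
        return True
    except UnicodeDecodeError:
        return False


def sniff_declared_encoding(data: bytes, default: str = "utf-8") -> str:
    """Bootstrap: read the declaration as bytes before decoding anything."""
    head = data[:200]  # assumed look-ahead size
    if head.startswith(b"\xef\xbb\xbf"):  # skip a UTF-8 byte order mark
        head = head[3:]
    match = _XML_DECL.match(head)
    return match.group(1).decode("ascii") if match else default


def decode_bootstrapped(data: bytes) -> str:
    """Decode the whole byte string using the encoding it declares."""
    return data.decode(sniff_declared_encoding(data))
```

The declaration has to be matched at the byte level because the encoding is
not yet known at that point; only after it has been read can the data be
turned into characters.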