Original by Markus Kuhn, adapted for HTML by Martin Dürst.

missing byte:
   3-byte sequence with last byte missing (U-0000FFFF, bytes EF BF): "ï¿"
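
A minimal sketch of how a strict decoder might be checked against this case, assuming Python's built-in UTF-8 codec as the decoder under test. The names COMPLETE, TRUNCATED, decodes_strictly and as_latin1 are hypothetical and are not part of the original test file.

```python
# Sketch: a 3-byte UTF-8 sequence whose final continuation byte is missing.
# All names here are illustrative assumptions, not part of the test file.

COMPLETE = bytes([0xEF, 0xBF, 0xBF])  # U+FFFF encoded in full
TRUNCATED = COMPLETE[:-1]             # last byte dropped, leaving EF BF


def decodes_strictly(data: bytes) -> bool:
    """Return True if data is accepted by Python's strict UTF-8 decoder."""
    try:
        data.decode("utf-8")  # error handling defaults to "strict"
        return True
    except UnicodeDecodeError:
        return False


def as_latin1(data: bytes) -> str:
    """Show how the raw bytes look when misread as Latin-1."""
    return data.decode("latin-1")


if __name__ == "__main__":
    print(decodes_strictly(COMPLETE))   # full sequence
    print(decodes_strictly(TRUNCATED))  # truncated sequence
    print(as_latin1(TRUNCATED))         # the two characters quoted above
```

A decoder that accepts TRUNCATED, or that silently consumes the byte following it as the missing continuation byte, would fail this test case.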