ISSUE-446: U+FFFD in euc_kr index
- State: CLOSED
- Product: encoding
- Raised by: Richard Ishida
- Opened on: 2015-03-30
- Description: https://www.w3.org/Bugs/Public/show_bug.cgi?id=27675
This issue tracks the bug listed above and was created as part of the WG CR process.
---
Reporter: public+w3@mearie.org
The updated euc_kr table now has the following entries:
---8<---
5916 0xFFFD � (REPLACEMENT CHARACTER)
5917 0xFFFD � (REPLACEMENT CHARACTER)
5918 0xFFFD � (REPLACEMENT CHARACTER)
5919 0xFFFD � (REPLACEMENT CHARACTER)
5920 0xFFFD � (REPLACEMENT CHARACTER)
5921 0xFFFD � (REPLACEMENT CHARACTER)
[snip]
5948 0xFFFD � (REPLACEMENT CHARACTER)
5949 0xFFFD � (REPLACEMENT CHARACTER)
5950 0xFFFD � (REPLACEMENT CHARACTER)
5951 0xFFFD � (REPLACEMENT CHARACTER)
5952 0xFFFD � (REPLACEMENT CHARACTER)
5953 0xFFFD � (REPLACEMENT CHARACTER)
---8<---
These entries correspond to byte sequences A0 5B..60 and A0 7B..80, which are
gaps between UHC ranges. I don't think Bug 16691 intended this, as they are the
only occurrences of U+FFFD anywhere in the indices at the moment. As a result,
an otherwise valid decoder accepts those sequences even when fatal mode is in
use.
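The correspondence between the listed index pointers and the byte sequences can be checked with a minimal sketch. It assumes the pointer formula used by the Encoding Standard's euc-kr decoder, pointer = (lead - 0x81) * 190 + (byte - 0x41). The names `pointer_to_bytes` and `decode_pair`, and the one-entry `buggy_index`, are hypothetical and only illustrate how a U+FFFD index entry bypasses the error path of a decoder in fatal mode; this is not any real decoder's implementation.

```python
from typing import Dict, Tuple


def pointer_to_bytes(pointer: int) -> Tuple[int, int]:
    """Invert pointer = (lead - 0x81) * 190 + (byte - 0x41)."""
    row, col = divmod(pointer, 190)
    return row + 0x81, col + 0x41


def decode_pair(lead: int, byte: int, index: Dict[int, int], fatal: bool) -> str:
    """Hypothetical two-byte decode step: look up the pointer in the index."""
    pointer = (lead - 0x81) * 190 + (byte - 0x41)
    code_point = index.get(pointer)
    if code_point is None:
        # Unmapped pointer: this is the only place an error is signalled.
        if fatal:
            raise ValueError(f"invalid sequence {lead:02X} {byte:02X}")
        return "\uFFFD"
    # A mapped pointer is returned as-is, even if it maps to U+FFFD.
    return chr(code_point)


# First and last pointer of each reported run.
for p in (5916, 5921, 5948, 5953):
    lead, trail = pointer_to_bytes(p)
    print(f"{p}: {lead:02X} {trail:02X}")

# Assumed index fragment containing one of the reported entries.
buggy_index = {5916: 0xFFFD}
print(repr(decode_pair(0xA0, 0x5B, buggy_index, fatal=True)))
```

With the entry present, the lookup succeeds and the call returns U+FFFD without raising, so fatal mode never sees an error. With the entry absent (an empty index), the same call raises.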
- Related Action Items: No related actions
- Related emails: I18N-ISSUE-446 (BUG27675): U+FFFD in euc_kr index [encoding] (from sysbot+tracker@w3.org on 2015-03-30)
Related notes:
These issues are now tracked at http://www.w3.org/International/docs/encoding/encoding-cr-doc
Richard Ishida, 16 Sep 2015, 11:51:33