ISSUE-446: U+FFFD in euc_kr index

U+FFFD in euc_kr index

State:
CLOSED
Product:
encoding
Raised by:
Richard Ishida
Opened on:
2015-03-30
Description:
https://www.w3.org/Bugs/Public/show_bug.cgi?id=27675

This issue tracks the bug listed above and was created as part of the WG CR process.

---

Reporter: public+w3@mearie.org

The updated euc_kr table now has the following entries:

---8<---
5916 0xFFFD � (REPLACEMENT CHARACTER)
5917 0xFFFD � (REPLACEMENT CHARACTER)
5918 0xFFFD � (REPLACEMENT CHARACTER)
5919 0xFFFD � (REPLACEMENT CHARACTER)
5920 0xFFFD � (REPLACEMENT CHARACTER)
5921 0xFFFD � (REPLACEMENT CHARACTER)
[snip]
5948 0xFFFD � (REPLACEMENT CHARACTER)
5949 0xFFFD � (REPLACEMENT CHARACTER)
5950 0xFFFD � (REPLACEMENT CHARACTER)
5951 0xFFFD � (REPLACEMENT CHARACTER)
5952 0xFFFD � (REPLACEMENT CHARACTER)
5953 0xFFFD � (REPLACEMENT CHARACTER)
---8<---

They correspond to byte sequences A0 5B..60 and A0 7B..80, which are gaps
between UHC ranges. I don't think Bug 16691 intended this (as they are the only
occurrences of U+FFFD throughout the indices at the moment). This causes an
otherwise valid decoder to accept those sequences even when the fatal mode is
in the use.
Related Actions Items:
No related actions
Related emails:
  1. I18N-ISSUE-446 (BUG27675): U+FFFD in euc_kr index [encoding] (from sysbot+tracker@w3.org on 2015-03-30)

Related notes:

These issues are now tracked at http://www.w3.org/International/docs/encoding/encoding-cr-doc

Richard Ishida, 16 Sep 2015, 11:51:33

Display change log ATOM feed


Addison Phillips <addisonI18N@gmail.com>, Chair, Richard Ishida <ishida@w3.org>, Bert Bos <bert@w3.org>, Fuqiao Xue <xfq@w3.org>, Atsushi Shimono <atsushi@w3.org>, Staff Contacts
Tracker: documentation, (configuration for this group), originally developed by Dean Jackson, is developed and maintained by the Systems Team <w3t-sys@w3.org>.
$Id: 446.html,v 1.1 2023/07/19 12:02:04 carcone Exp $