Bug 21151 - Don't limit set of supported encodings
Summary: Don't limit set of supported encodings
Alias: None
Product: WHATWG
Classification: Unclassified
Component: Encoding (show other bugs)
Version: unspecified
Hardware: PC All
: P2 normal
Target Milestone: Unsorted
Assignee: Anne
QA Contact: sideshowbarker+encodingspec
Depends on:
Reported: 2013-02-28 07:45 UTC by Norbert Lindenberg
Modified: 2013-02-28 11:00 UTC (History)
3 users (show)

See Also:


Note You need to log in before you can comment on or make changes to this bug.
Description Norbert Lindenberg 2013-02-28 07:45:56 UTC
Section 4, Encodings, currently contains the statement: "User agents must not support any other encodings or labels."

I don't think this restriction is acceptable. Some user agents have already implemented support for other encodings (for example, the Google search engine supports some Indian font encodings), and there's no good reason for them to stop supporting them and make web pages using them unreadable.
Comment 1 Henri Sivonen 2013-02-28 09:10:28 UTC
I think this should be WONTFIX. We should only support UTF-8 and the set of legacy encodings needed to successfully render the current Web.

Supporting other encodings just adds incompatibility and attack surface by definition without improving Web compatibility.

As for Indian font encodings specifically, pages that rely on particular fonts are already broken in the sense that they don't work correctly across different client systems. We should get Indian sites to use UTF-8 instead of perpetuating uninteroperable font dependencies of the 1990s.
Comment 2 Henri Sivonen 2013-02-28 09:13:46 UTC
Also note that the font dependency use case is server by x-user-defined for legacy purposes.

Furthermore, experience says that leaving things open-ended makes people introduce security holes by exposing Web-unsuited encodings to the Web. Being an unfortunate person who tries to clean up messes like that, I have zero sympathy for the request made here.
Comment 3 Anne 2013-02-28 11:00:18 UTC
If there's a case to be made to add an additional encoding it should be added to the standard and supported by everyone.