<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>24336</bug_id>
          
          <creation_ts>2014-01-20 18:18:42 +0000</creation_ts>
          <short_desc>Encoding names should match what people actually call them</short_desc>
          <delta_ts>2014-01-20 23:30:22 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WHATWG</product>
          <component>Encoding</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Linux</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>WONTFIX</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>Unsorted</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Geoffrey Sneddon">geoffers+w3cbugs</reporter>
          <assigned_to name="Anne">annevk</assigned_to>
          <cc>mike</cc>
    
    <cc>www-international</cc>
          
          <qa_contact>sideshowbarker+encodingspec</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>98782</commentid>
    <comment_count>0</comment_count>
    <who name="Geoffrey Sneddon">geoffers+w3cbugs</who>
    <bug_when>2014-01-20 18:18:42 +0000</bug_when>
    <thetext>http://gsnedders.html5.org/web-encoding-names/results.html shows what document.characterSet returns in current versions of browsers. Notably, Firefox and Chrome both return the uppercased names for many of these. (IE returns them all lowercase except &quot;GB18030&quot;; ZombieOpera returns them all lowercase)

Googling these encoding names it becomes clear that almost everyone refers to &quot;UTF-8&quot;, &quot;ISO-8859-n&quot;, etc. (uppercased), and as there is no interop here currently, and the proposed behaviour matches Firefox/Chrome, it would seem better to just give them their names that are in common usage.

As such, I propose to change the names to the following (thereby changing case only):

 - UTF-8
 - IBM866
 - ISO-8859-n
 - ISO-8859-8-I
 - KOI8-R
 - KOI8-U
 - HZ-GB-2312
 - Big5
 - EUC-JP
 - ISO-2022-JP
 - Shift_JIS
 - EUC-KR
 - UTF-16BE
 - UTF-16LE</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>98796</commentid>
    <comment_count>1</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2014-01-20 23:30:22 +0000</bug_when>
    <thetext>I value more that now you can predict what characterSet returns. With your proposed change you need to know that windows-1252 is not spelled Windows-1252. And that gb18030 is an exception.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>