<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>20599</bug_id>
          
          <creation_ts>2013-01-08 13:32:04 +0000</creation_ts>
          <short_desc>Remove ISO-2022-KR encoder (or the entire encoding) from the spec</short_desc>
          <delta_ts>2013-08-24 20:07:28 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WHATWG</product>
          <component>Encoding</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          <see_also>https://bugzilla.mozilla.org/show_bug.cgi?id=827796</see_also>
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>Unsorted</target_milestone>
          
          <blocked>16687</blocked>
          <blocked>16691</blocked>
          <blocked>19942</blocked>
          <everconfirmed>1</everconfirmed>
          <reporter name="Masatoshi Kimura">VYV03354</reporter>
          <assigned_to name="Anne">annevk</assigned_to>
          <cc>excors</cc>
          <cc>jshin</cc>
          <cc>mike</cc>
          <cc>pub-w3</cc>
          <cc>zackw</cc>
          
          <qa_contact>sideshowbarker+encodingspec</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>81042</commentid>
    <comment_count>0</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-01-08 13:32:04 +0000</bug_when>
    <thetext>Gecko currently doesn&apos;t support the encoder, and I would rather not implement it solely to comply with the spec. Is there any reason the spec needs it?
I&apos;d rather remove the decoder if every encoding requires both an encoder and a decoder.
Removing ISO-2022-KR has also been proposed on www-international:
http://lists.w3.org/Archives/Public/www-international/2012OctDec/0039.html</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>81044</commentid>
    <comment_count>1</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-01-08 13:44:27 +0000</bug_when>
    <thetext>Is there a bug on removing the decoder? What happens if someone needs the encoder? Does Gecko use utf-8 instead or some such?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>81045</commentid>
    <comment_count>2</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-01-08 13:59:11 +0000</bug_when>
    <thetext>(In reply to comment #1)
&gt; Is there a bug on removing the decoder?

Filed &lt;https://bugzilla.mozilla.org/show_bug.cgi?id=827796&gt;.

&gt; What happens if someone needs the
&gt; encoder? Does Gecko use utf-8 instead or some such?

Yes, Gecko uses UTF-8 instead.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83281</commentid>
    <comment_count>3</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-02-18 18:13:09 +0000</bug_when>
    <thetext>Removing stateful encodings will affect some types of attack such as:
https://www.w3.org/Bugs/Public/show_bug.cgi?id=20886#c3</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83282</commentid>
    <comment_count>4</comment_count>
    <who name="Zack Weinberg">zackw</who>
    <bug_when>2013-02-18 18:16:30 +0000</bug_when>
    <thetext>It would be unfortunate if we had to continue supporting legacy stateful encodings just to avoid a hypothetical attack based on something *not* getting interpreted in a legacy encoding.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83316</commentid>
    <comment_count>5</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-02-19 11:00:52 +0000</bug_when>
    <thetext>http://zaynar.co.uk/docs/charset-encoding-xss.html lists another problem with not supporting ISO-2022-KR but also notes that IE already has this problem. And this problem might already exist for ISO-2022-CN too.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92483</commentid>
    <comment_count>6</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2013-08-22 22:13:04 +0000</bug_when>
    <thetext>I also proposed the same by email. ISO-2022-KR should be removed from the spec entirely. It&apos;s not for the web. No sane person would use it on the web. Moreover, it&apos;s a potential security risk, like any other non-ASCII 7-bit encoding.

We have to live with ISO-2022-JP, but no other non-ASCII 7-bit encoding should be specified for the web.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92497</commentid>
    <comment_count>7</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-08-23 10:35:17 +0000</bug_when>
    <thetext>Jungshik, was that quite a while ago? I don&apos;t have any recent email from you suggesting that, but I recall you might have said that on a mailing list at some point.

In any event, two implementers willing to take the plunge works for me.

https://github.com/whatwg/encoding/commit/01f872bf168e138533d5aa67405d358f8c2fdc94</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92543</commentid>
    <comment_count>8</comment_count>
    <who name="">pub-w3</who>
    <bug_when>2013-08-24 20:07:28 +0000</bug_when>
    <thetext>(In reply to comment #6)
&gt; We have to live with ISO-2022-JP, but no other non-ASCII 7-bit encoding
&gt; should be specified for the web.

The encoding standard currently includes the 7-bit Chinese HZ encoding — should that one be removed as well?</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>