<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>21146</bug_id>
          
          <creation_ts>2013-02-27 15:32:18 +0000</creation_ts>
          <short_desc>Separate big5-hkscs from big5</short_desc>
          <delta_ts>2014-09-25 22:07:26 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WHATWG</product>
          <component>Encoding</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows NT</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>INVALID</resolution>
          
          <see_also>https://bugzilla.mozilla.org/show_bug.cgi?id=845743</see_also>
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>Unsorted</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Masatoshi Kimura">VYV03354</reporter>
          <assigned_to name="Anne">annevk</assigned_to>
          <cc>hsivonen</cc>
    
    <cc>jshin</cc>
    
    <cc>mike</cc>
    
    <cc>philipj</cc>
    
    <cc>zcorpan</cc>
          
          <qa_contact>sideshowbarker+encodingspec</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>83810</commentid>
    <comment_count>0</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-02-27 15:32:18 +0000</bug_when>
    <thetext>Please see the Gecko bug: https://bugzilla.mozilla.org/show_bug.cgi?id=845743</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83812</commentid>
    <comment_count>1</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-02-27 16:52:09 +0000</bug_when>
    <thetext>So we are going to add this back because of an intranet application? This is a tad hypocritical as we have previously refused to standardize behavior from Internet Explorer where they have said that intranet applications depended upon it.

I guess as long as Chrome supports the encoding too, though, there is not much leverage here either way.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83868</commentid>
    <comment_count>2</comment_count>
    <who name="Henri Sivonen">hsivonen</who>
    <bug_when>2013-03-01 15:03:30 +0000</bug_when>
    <thetext>Before changing the spec, we should check whether big5 in Gecko implemented the requirements of the Encoding Standard for the decoder.

And yes, it&apos;s pretty sad to do this for one intranet app (though breaking someone&apos;s CRM would be uncool, too).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>83899</commentid>
    <comment_count>3</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-03-02 03:00:08 +0000</bug_when>
    <thetext>At least Gecko&apos;s implementation passed &lt;https://code.google.com/p/stringencoding/source/browse/test-big5.js&gt;.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>85084</commentid>
    <comment_count>4</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2013-03-28 09:55:22 +0000</bug_when>
    <thetext>The test case in the Gecko bug (https://bugzilla.mozilla.org/attachment.cgi?id=718907 ) has this byte sequence for the interesting character:

0x91 0x6f

and expects this character (which is what I get in Firefox Nightly and Opera with the big5-hkscs label):

U+9C02

I cloned https://code.google.com/p/stringencoding/ and changed the test-big5.js file as follows:

test(
  function () {
    var bytes = [0x91,0x6f];
    var string = &quot;\u9c02&quot;;
    assert_equals(new TextDecoder(&quot;big5&quot;).decode(new Uint8Array(bytes)), string, &quot;decoded&quot;);
  },
  &quot;big5&quot;
);

Then I ran the tests.html file and got the following result for the above test:

Pass	big5

This means that, assuming the stringencoding project implements the Encoding Standard correctly, the spec would pass the test case in the Gecko bug.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>85085</commentid>
    <comment_count>5</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2013-03-28 09:59:58 +0000</bug_when>
    <thetext>Also note that the Web compat analysis that led to the current spec recommended a unified label:

[[
Not treating big5 and big5-hkscs as aliases is clearly breaking  
pages, so I would recommend a single mapping for both.

Of the existing mappings, opera-hk seems like the overall winner. As a  
starting point for the spec, I suggest taking the intersection of  
opera-hk, firefox-hk and chrome-hk.
]]
http://lists.w3.org/Archives/Public/public-whatwg-archive/2012Apr/0082.html</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92950</commentid>
    <comment_count>6</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-09-04 09:35:49 +0000</bug_when>
    <thetext>Masatoshi, it&apos;s not really clear to me what to do here. It seems the experiment in Gecko was done poorly and we should try again there. Do you agree?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92962</commentid>
    <comment_count>7</comment_count>
    <who name="Masatoshi Kimura">VYV03354</who>
    <bug_when>2013-09-04 12:47:47 +0000</bug_when>
    <thetext>I haven&apos;t yet verified that the Encoding spec&apos;s Big5 can replace Big5-HKSCS.
Anyway, Gecko will have to obtain agreement from the MozTW community.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>92975</commentid>
    <comment_count>8</comment_count>
    <who name="Philip Jägenstedt">philipj</who>
    <bug_when>2013-09-04 13:40:08 +0000</bug_when>
    <thetext>(In reply to comment #0)
&gt; Please see the Gecko bug: https://bugzilla.mozilla.org/show_bug.cgi?id=845743

Can someone explain the problem here? As far as I can tell, someone from Yahoo has data that needs to be interpreted as Big5-HKSCS and says that &quot;if FF don&apos;t support HKSCS, we have no choice but to go to google chrome&quot;, so how is treating Big5 and Big5-HKSCS differently a fix for this?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>93042</commentid>
    <comment_count>9</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-09-05 14:44:09 +0000</bug_when>
    <thetext>I&apos;m marking this INVALID as the implementation was done all wrong. https://bugzilla.mozilla.org/show_bug.cgi?id=912470 is the bug on implementing this better.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>112133</commentid>
    <comment_count>10</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2014-09-24 21:19:55 +0000</bug_when>
    <thetext>Anne, could you tell me how you derived the current Big5 in the spec? Is it a merge of Big5-HKSCS (2008) [1] with Windows-950 [2]?


[1] http://www.ogcio.gov.hk/en/business/tech_promotion/ccli/terms/doc/New2003cmp_2008.txt

[2] http://msdn.microsoft.com/en-us/goglobal/cc305155</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>112135</commentid>
    <comment_count>11</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2014-09-24 21:27:42 +0000</bug_when>
    <thetext>Philip can probably do that better. He did most of the work.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>112202</commentid>
    <comment_count>12</comment_count>
    <who name="Philip Jägenstedt">philipj</who>
    <bug_when>2014-09-25 22:07:26 +0000</bug_when>
    <thetext>https://bugzilla.mozilla.org/show_bug.cgi?id=912470#c48 summarizes how this came to be.

In order to separate them, we need another solution for all of the Big5-HKSCS content labeled as Big5, which was common in Hong Kong at the time I did the research.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>