<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>23971</bug_id>
          
          <creation_ts>2013-12-03 14:15:58 +0000</creation_ts>
          <short_desc>Define an encoding for formerly latin1</short_desc>
          <delta_ts>2014-11-04 15:04:13 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WHATWG</product>
          <component>Encoding</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>WONTFIX</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>Unsorted</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Anne">annevk</reporter>
          <assigned_to name="Anne">annevk</assigned_to>
          <cc>hsivonen</cc>
    
    <cc>mike</cc>
    
    <cc>www-international</cc>
          
          <qa_contact>sideshowbarker+encodingspec</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>97042</commentid>
    <comment_count>0</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-12-03 14:15:58 +0000</bug_when>
    <thetext>The web does not have latin1 (the label maps to windows-1252), but we do need a true latin1 for HTTP and similar internal uses. We should probably also expose it through the API.

&quot;identity&quot; makes some sense, but only in one direction. &quot;bikeshed&quot; also makes sense.
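
To illustrate the "only in one direction" point, here is a rough sketch. It assumes Python's built-in "latin-1" codec as a stand-in for the encoding discussed here; it is not a proposal for how user agents would implement it.

```python
# Sketch only: Python's "latin-1" codec stands in for the proposed identity encoding.
all_bytes = bytes(range(256))

# Decoding direction: byte N becomes code point U+00NN, for every possible byte.
decoded = all_bytes.decode("latin-1")
assert [ord(ch) for ch in decoded] == list(range(256))

# Encoding direction: code points up to U+00FF round-trip to the same bytes...
assert decoded.encode("latin-1") == all_bytes

# ...but anything above U+00FF has no byte to map to, so it is not an identity there.
try:
    "\u20ac".encode("latin-1")  # EURO SIGN, outside the single-byte range
    euro_encodable = True
except UnicodeEncodeError:
    euro_encodable = False
```

Decoding is total, encoding is partial, which is why "identity" only describes the byte-to-string direction well.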

We should not expose this encoding to HTTP charset= or &lt;meta&gt; overrides. This is only for internal matters (such as XMLHttpRequest) and developers using the API.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>97047</commentid>
    <comment_count>1</comment_count>
    <who name="Henri Sivonen">hsivonen</who>
    <bug_when>2013-12-03 14:45:08 +0000</bug_when>
    <thetext>It would be good to see some pointers to HTTP code that actually uses de jure ISO-8859-1 rather than windows-1252 decoding. Do you have any?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>97048</commentid>
    <comment_count>2</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-12-03 14:52:54 +0000</bug_when>
    <thetext>That&apos;s a good point. In XMLHttpRequest this is used for methods and headers. While most of that is restricted to 0x00 - 0x7F, header values can be pretty much any octet.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>97517</commentid>
    <comment_count>3</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-12-12 16:23:39 +0000</bug_when>
    <thetext>http://dump.testsuite.org/xhr/header-with-bytes.php is an example of that. The header has the byte 0x80 as its value, and it comes out as U+0080 (not as €, which windows-1252 decoding would produce).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>104567</commentid>
    <comment_count>4</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2014-04-28 15:24:52 +0000</bug_when>
    <thetext>&quot;unicodelatin1&quot; might be an acceptable name. Unicode refers to this block as &quot;Latin-1 Supplement&quot; so that does not seem too bad.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>114488</commentid>
    <comment_count>5</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2014-11-04 15:04:13 +0000</bug_when>
    <thetext>My understanding is that user agents have dedicated routines for the original &quot;latin1&quot; style of conversion in the HTTP layer and potentially elsewhere. If we do indeed want something similar, we should probably add static methods on String and ArrayBuffer, or something along those lines, for that conversion.
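
As a rough sketch of what such a pair of conversion routines might look like, written in Python purely for illustration: the function names below are hypothetical and do not correspond to any existing or proposed API. The byte 0x80 is the header value from the example earlier in this bug.

```python
def bytes_to_latin1_string(data):
    """Hypothetical helper: map each byte to the code point of the same value."""
    return "".join(chr(b) for b in data)


def latin1_string_to_bytes(text):
    """Hypothetical helper: inverse mapping; rejects code points above U+00FF."""
    out = bytearray()
    for ch in text:
        code_point = ord(ch)
        if code_point not in range(256):
            raise ValueError("U+%04X has no single-byte mapping" % code_point)
        out.append(code_point)
    return bytes(out)


# A header value consisting of the single byte 0x80.
header_value = bytes([0x80])
as_string = bytes_to_latin1_string(header_value)

# The dedicated routine yields U+0080 and round-trips back to the same byte.
assert as_string == "\u0080"
assert latin1_string_to_bytes(as_string) == header_value

# Python's cp1252 codec (its windows-1252) turns the same byte into EURO SIGN.
assert header_value.decode("cp1252") == "\u20ac"
```

Keeping this as a plain pair of functions, rather than as a named encoding, is what would let it stay out of charset= and meta handling.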

The Encoding API can then remain for actual encodings only.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>