<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>10260</bug_id>
          
          <creation_ts>2010-07-29 11:08:09 +0000</creation_ts>
          <short_desc>Meta charset handling during the parse should special-case UTF-16 and reject non-ASCII supersets</short_desc>
          <delta_ts>2010-10-04 14:56:47 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>pre-LC1 HTML5 spec (editor: Ian Hickson)</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>https://bugzilla.mozilla.org/show_bug.cgi?id=582788</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P1</priority>
          <bug_severity>critical</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Henri Sivonen">hsivonen</reporter>
          <assigned_to name="Ian &apos;Hixie&apos; Hickson">ian</assigned_to>
          <cc>ian</cc>
    
    <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>37161</commentid>
    <comment_count>0</comment_count>
    <who name="Henri Sivonen">hsivonen</who>
    <bug_when>2010-07-29 11:08:09 +0000</bug_when>
    <thetext>The tree builder spec says about meta: &quot;If the element has a charset attribute, and its value is a supported encoding, and the confidence is currently tentative, then change the encoding to the encoding given by the value of the charset attribute.&quot;

Instead of merely considering &quot;supported encoding&quot;, UTF-16 should be mapped to UTF-8, and after alias resolution, non-ASCII-superset encodings should be treated as unsupported.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>37462</commentid>
    <comment_count>1</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2010-08-16 19:19:17 +0000</bug_when>
    <thetext>Can alias resolution ever change whether something is an ASCII superset or not?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>37463</commentid>
    <comment_count>2</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2010-08-16 19:23:41 +0000</bug_when>
    <thetext>UTF-16 to UTF-8 mapping is already done by the &quot;change the encoding&quot; algorithm.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>37464</commentid>
    <comment_count>3</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2010-08-16 19:26:20 +0000</bug_when>
    <thetext>EDITOR&apos;S RESPONSE: This is an Editor&apos;s Response to your comment. If you are satisfied with this response, please change the state of this bug to CLOSED. If you have additional information and would like the editor to reconsider, please reopen this bug. If you would like to escalate the issue to the full HTML Working Group, please add the TrackerRequest keyword to this bug, and suggest title and text for the tracker issue; or you may create a tracker issue yourself, if you are able to do so. For more details, see this document:
   http://dev.w3.org/html5/decision-policy/decision-policy.html

Status: Partially Accepted
Change Description: see diff given below
Rationale: Concurred with reporter&apos;s comments, notwithstanding additional comments above.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>37465</commentid>
    <comment_count>4</comment_count>
    <who name="">contributor</who>
    <bug_when>2010-08-16 19:28:07 +0000</bug_when>
    <thetext>Checked in as WHATWG revision r5295.
Check-in comment: &lt;meta charset&gt; should only work for ASCII-compatible encodings.
http://html5.org/tools/web-apps-tracker?from=5294&amp;to=5295</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>