<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>24337</bug_id>
          
          <creation_ts>2014-01-20 18:22:29 +0000</creation_ts>
          <short_desc>Authors should be able to use both &quot;utf8&quot; and &quot;utf-8&quot; labels, case-insensitively</short_desc>
          <delta_ts>2014-01-24 22:47:23 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WHATWG</product>
          <component>Encoding</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Linux</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>Unsorted</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Geoffrey Sneddon">geoffers+w3cbugs</reporter>
          <assigned_to name="Anne">annevk</assigned_to>
          <cc>duerst</cc>
          <cc>mike</cc>
          <cc>www-international</cc>
          <qa_contact>sideshowbarker+encodingspec</qa_contact>
          <comment_sort_order>oldest_to_newest</comment_sort_order>
          <long_desc isprivate="0" >
    <commentid>98783</commentid>
    <comment_count>0</comment_count>
    <who name="Geoffrey Sneddon">geoffers+w3cbugs</who>
    <bug_when>2014-01-20 18:22:29 +0000</bug_when>
    <thetext>Currently the spec says: &apos;Authors must use the utf-8 encoding and must use the &quot;utf-8&quot; label to identify it.&apos;

Given the label matching is done case-insensitively, it is not entirely clear whether authors must use this label case-sensitively or not. This should be clarified, preferably to allow either case (there is no practical benefit of requiring it to be lowercased).
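
The gap between matching and authoring requirements can be sketched as follows. This is a minimal illustration, not the spec's algorithm text: it assumes matching strips surrounding ASCII whitespace and compares ASCII case-insensitively, it lists only the UTF-8 labels, and the function names (is_utf8_label, is_required_label) are hypothetical.

```python
import string

# Assumption: the characters treated as ASCII whitespace when matching labels.
ASCII_WHITESPACE = "\t\n\x0c\r "

# Assumption: only the UTF-8 labels are listed, for illustration.
UTF8_LABELS = frozenset({"unicode-1-1-utf-8", "utf-8", "utf8"})


def ascii_lowercase(text):
    """Lowercase only A-Z, leaving every other character untouched."""
    return "".join(c.lower() if c in string.ascii_uppercase else c for c in text)


def is_utf8_label(label):
    """Consumer side: does this label resolve to utf-8 when matched?"""
    return ascii_lowercase(label.strip(ASCII_WHITESPACE)) in UTF8_LABELS


def is_required_label(label):
    """Author side, case-insensitive reading: is this the "utf-8" label?"""
    return ascii_lowercase(label) == "utf-8"


if __name__ == "__main__":
    for candidate in ("utf-8", "UTF-8", "utf8", "UTF8"):
        print(candidate, is_utf8_label(candidate), is_required_label(candidate))
```

Under the case-insensitive reading, "UTF-8" satisfies both checks, while "utf8" resolves to utf-8 for consumers but still fails the author-side requirement.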

We should also make the &quot;utf8&quot; label conforming. Making this non-conforming is of no practical benefit and makes a large number of documents non-conforming.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>98816</commentid>
    <comment_count>1</comment_count>
    <who name="Martin Dürst">duerst</who>
    <bug_when>2014-01-21 05:34:57 +0000</bug_when>
    <thetext>(In reply to Geoffrey Sneddon from comment #0)
&gt; Currently the spec says: &apos;Authors must use the utf-8 encoding and must use
&gt; the &quot;utf-8&quot; label to identify it.&apos;
&gt; 
&gt; Given the label matching is done case-insensitively, it is not entirely
&gt; clear whether authors must use this label case-sensitively or not. This
&gt; should be clarified, preferably to allow either case (there is no practical
&gt; benefit of requiring it to be lowercased).

Agreed.

&gt; We should also make the &quot;utf8&quot; label conforming. Making this non-conforming
&gt; is of no practical benefit and makes a large number of documents
&gt; non-conforming.

This looks innocuous at first. However, some products (Oracle Database in particular) use the label &quot;utf8&quot; for a variant of UTF-8 in which each character outside the BMP is encoded as two surrogates of three bytes each, six bytes in total. For security reasons, UTF-8 itself prohibits encoded surrogates.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>99035</commentid>
    <comment_count>2</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2014-01-24 22:47:23 +0000</bug_when>
    <thetext>Yeah, allowing only &quot;utf-8&quot; was intentional. Clarified the case-sensitivity part.

https://github.com/whatwg/encoding/commit/61af3cdf199b4ab86babd47b7d48bb328c54a702</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>