<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>7524</bug_id>
          
          <creation_ts>2009-09-07 10:39:27 +0000</creation_ts>
          <short_desc>The list of forbidden Unicode characters in the authoring requirements does not match the list that generates parse errors in #tokenizing-character-references</short_desc>
          <delta_ts>2010-10-04 13:59:14 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>pre-LC1 HTML5 spec (editor: Ian Hickson)</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://whatwg.org/specs/web-apps/current-work/#character-references</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P3</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>LC</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter>contributor</reporter>
          <assigned_to name="Ian &apos;Hixie&apos; Hickson">ian</assigned_to>
          <cc>lachlan.hunt</cc>
    
    <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>26889</commentid>
    <comment_count>0</comment_count>
    <who name="">contributor</who>
    <bug_when>2009-09-07 10:39:27 +0000</bug_when>
    <thetext>Section: http://whatwg.org/specs/web-apps/current-work/#character-references

Comment:
The list of forbidden Unicode characters in the authoring requirements does not match the list that generates parse errors in #tokenizing-character-references

Posted from: 213.236.208.22</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26891</commentid>
    <comment_count>1</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2009-09-07 10:46:17 +0000</bug_when>
    <thetext>The authoring requirements disallow the following character ranges:

U+0000, U+000D, U+0080 to U+009F, U+D800 to U+DFFF

The implementation requirements require parse errors for the following additional ranges:

0x0001 to 0x0008, 0x000E to 0x001F, 0x007F, and 0xFDD0 to 0xFDEF, plus the individual code points 0x000B, 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF, 0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE, 0x5FFFF, 0x6FFFE, 0x6FFFF, 0x7FFFE, 0x7FFFF, 0x8FFFE, 0x8FFFF, 0x9FFFE, 0x9FFFF, 0xAFFFE, 0xAFFFF, 0xBFFFE, 0xBFFFF, 0xCFFFE, 0xCFFFF, 0xDFFFE, 0xDFFFF, 0xEFFFE, 0xEFFFF, 0xFFFFE, 0xFFFFF, 0x10FFFE, and 0x10FFFF

http://www.whatwg.org/specs/web-apps/current-work/#tokenizing-character-references
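
As a rough illustration of the mismatch (not spec text), the two lists above can be written out as a small Python sketch. The names AUTHORING_FORBIDDEN, PARSE_ERROR_EXTRA_RANGES, PARSE_ERROR_EXTRA_SINGLES and the helper functions are hypothetical, and the ranges are transcribed from this comment, so they should be checked against the spec before being relied on.

```python
# Hypothetical names; ranges transcribed from the comment above, not from the spec.

# Ranges (inclusive) that the authoring requirements already disallow.
AUTHORING_FORBIDDEN = [
    (0x0000, 0x0000),
    (0x000D, 0x000D),
    (0x0080, 0x009F),
    (0xD800, 0xDFFF),
]

# Additional ranges (inclusive) that only the parsing section treats as parse errors.
PARSE_ERROR_EXTRA_RANGES = [
    (0x0001, 0x0008),
    (0x000E, 0x001F),
    (0x007F, 0x007F),
    (0xFDD0, 0xFDEF),
]

# 0x000B plus the last two code points of each of the 17 planes:
# 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF, ..., 0x10FFFE, 0x10FFFF.
PARSE_ERROR_EXTRA_SINGLES = {0x000B} | {
    plane * 0x10000 + low
    for plane in range(17)
    for low in (0xFFFE, 0xFFFF)
}


def in_ranges(code_point, ranges):
    """Return True if code_point falls inside any inclusive (low, high) pair."""
    return any(code_point in range(low, high + 1) for low, high in ranges)


def forbidden_by_authoring(code_point):
    """True if the authoring requirements (as listed above) disallow it."""
    return in_ranges(code_point, AUTHORING_FORBIDDEN)


def parse_error_only(code_point):
    """True if it is a parse error per the parsing section but not
    covered by the authoring list above."""
    return (
        in_ranges(code_point, PARSE_ERROR_EXTRA_RANGES)
        or code_point in PARSE_ERROR_EXTRA_SINGLES
    )
```

For example, parse_error_only(0x007F) is true while forbidden_by_authoring(0x007F) is false, which is the gap this bug describes.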

Similarly, the requirements in section 9.1.3 Text should also forbid these control characters.

http://www.whatwg.org/specs/web-apps/current-work/#text-0</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>27501</commentid>
    <comment_count>2</comment_count>
    <who name="">contributor</who>
    <bug_when>2009-09-22 10:12:31 +0000</bug_when>
    <thetext>Checked in as WHATWG revision r3961.
Check-in comment: Bring the authoring section in line with the parsing section for allowed character references.
http://html5.org/tools/web-apps-tracker?from=3960&amp;to=3961
</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>