This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 7524 - The list of forbidden Unicode characters in the authoring requirements does not match the list that generates parse errors in #tokenizing-character-references
Summary: The list of forbidden Unicode characters in the authoring requirements does n...
Status: CLOSED FIXED
Alias: None
Product: HTML WG
Classification: Unclassified
Component: pre-LC1 HTML5 spec (editor: Ian Hickson) (show other bugs)
Version: unspecified
Hardware: All All
: P3 normal
Target Milestone: LC
Assignee: Ian 'Hixie' Hickson
QA Contact: HTML WG Bugzilla archive list
URL: http://whatwg.org/specs/web-apps/curr...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-09-07 10:39 UTC by contributor
Modified: 2010-10-04 13:59 UTC (History)
4 users (show)

See Also:


Attachments

Description contributor 2009-09-07 10:39:27 UTC
Section: http://whatwg.org/specs/web-apps/current-work/#character-references

Comment:
The list of forbidden Unicode characters in the authoring requirements does not match the list that generates parse errors in #tokenizing-character-references

Posted from: 213.236.208.22
Comment 1 Lachlan Hunt 2009-09-07 10:46:17 UTC
The authoring requirements disallow the following character ranges:

U+0000, U+000D, U+0080 to U+009F, 0xD800 to 0xDFFF

The implementation requirements require parse errors for the following additional ranges:

0x0001 to 0x0008, 0x000E to 0x001F, 0x007F, 0xFDD0 to 0xFDEF, or is one of 0x000B, 0xFFFE, 0xFFFF, 0x1FFFE, 0x1FFFF, 0x2FFFE, 0x2FFFF, 0x3FFFE, 0x3FFFF, 0x4FFFE, 0x4FFFF, 0x5FFFE, 0x5FFFF, 0x6FFFE, 0x6FFFF, 0x7FFFE, 0x7FFFF, 0x8FFFE, 0x8FFFF, 0x9FFFE, 0x9FFFF, 0xAFFFE, 0xAFFFF, 0xBFFFE, 0xBFFFF, 0xCFFFE, 0xCFFFF, 0xDFFFE, 0xDFFFF, 0xEFFFE, 0xEFFFF, 0xFFFFE, 0xFFFFF, 0x10FFFE, or 0x10FFFF

http://www.whatwg.org/specs/web-apps/current-work/#tokenizing-character-references

Similarly, the requirements in section 9.1.3 Text should also forbid these control characters.

http://www.whatwg.org/specs/web-apps/current-work/#text-0
Comment 2 contributor 2009-09-22 10:12:31 UTC
Checked in as WHATWG revision r3961.
Check-in comment: Bring the authoring section in line with the parsing section for allowed character references.
http://html5.org/tools/web-apps-tracker?from=3960&to=3961