This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 14429 - Named references: Point out HTML5's deviations from HTML4
Summary: Named references: Point out HTML5's deviations from HTML4
Status: RESOLVED FIXED
Alias: None
Product: HTML WG
Classification: Unclassified
Component: HTML5 differences from HTML4 (show other bugs)
Version: unspecified
Hardware: Other other
: P3 normal
Target Milestone: ---
Assignee: Simon Pieters
QA Contact: HTML WG Bugzilla archive list
URL: http://www.w3.org/TR/html5/named-char...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-10-11 12:57 UTC by contributor
Modified: 2012-03-02 15:16 UTC (History)
6 users (show)

See Also:


Attachments
XHTML 1.0 test file (818 bytes, application/xml)
2011-10-11 15:06 UTC, Leif Halvard Silli
Details
HTML test file (635 bytes, text/html)
2011-10-11 15:07 UTC, Leif Halvard Silli
Details

Description contributor 2011-10-11 12:57:21 UTC
Specification: http://www.w3.org/TR/html5/
Multipage: http://www.whatwg.org/C#top
Complete: http://www.whatwg.org/c#top

Comment:
Named character set references
(http://www.w3.org/TR/html5/named-character-references.html) in these draft
contains whole character set of html 4 specification
(http://www.w3.org/TR/REC-html40/sgml/entities.html). We can say that html 4
character set references is subset of html 5 character references. But
following case breaks this rule.

In html 4 ⟩ and &lang is declared as below,
<!ENTITY lang	  CDATA "&#9001;" -- left-pointing angle bracket = bra,
				     U+2329 ISOtech -->
<!-- lang is NOT the same character as U+003C 'less than' 
     or U+2039 'single left-pointing angle quotation mark' -->
<!ENTITY rang	  CDATA "&#9002;" -- right-pointing angle bracket = ket,
				     U+232A ISOtech -->

Contrary html 5 defines this characters as below,
rang;	U+027E9
lang;	U+027E8 

Except these two named references rest is same both in html 4 and html 5.

regards,
Yasin HINISLIOĞLU
yaskil@gmail.com

Posted from: 193.34.132.160
User agent: Mozilla/5.0 (Windows NT 6.0; rv:7.0.1) Gecko/20100101 Firefox/7.0.1
Comment 1 Leif Halvard Silli 2011-10-11 15:02:02 UTC
The change in HTML5 is probably motivated by this letter: 

   http://lists.whatwg.org/pipermail/whatwg-whatwg.org/2007-July/012108.html

But note that  HTML4's behavour is (still) implemented in:

  * Opera 12 pre-beta
  * Internet Explorer 9 and down (but IE9's XML 
    parser implements the HTML5 behaviour)
  * Firefox 3.6 and down
  * Oxygen (using a Xerces parser)
  * XMLmind XMLeditor (using a Xerces parser)
  * XMLlint's  HTML parser
  * Text browsers: W3m, Links, elinks, Lynx
  * Lobo Java web browser (both in XHTML 1 mode
    and HTML mode)

(current) HTML5 spec is implemented in:
  * Webkit (Safari 5.1 and Chrome current)
  * Firefox 4/Firefox 7
  * The XML parser of IE9 (regardless of whether a DTD declares it)

Unknown/Not tested:
  * Konqueror
  * IE10-beta

These entities can be used to separate HTML5 parsers from other parsers ...
Comment 2 Leif Halvard Silli 2011-10-11 15:06:00 UTC
Created attachment 1034 [details]
XHTML 1.0 test file
Comment 3 Leif Halvard Silli 2011-10-11 15:07:02 UTC
Created attachment 1035 [details]
HTML test file
Comment 4 Leif Halvard Silli 2011-10-11 15:50:38 UTC
( For convenience, the XHTML test file as data URI: <http://tinyurl.com/6gmedfy> )

* Edited the Summary:
  1) Moved 
      http://www.w3.org/TR/html5/named-character-references#entity-rang
      to the URL field
  2) Deleted http://www.w3.org/TR/REC-html40/sgml/entities.html
  3) Added this, that shouldn't be too far from reporterer's intentions: 
      "Named references: Point out HTML5's deviations from HTML4"

   Motivation: To avoid that authors and implementors assume that HTML5's named character references is a strict superset of HTML5, the deviatons from HTML4 should be pointed out via footnote or other text.

    PS: Supposedly, this deviation from HTML4/XHTML1 is defendable *only* because the old  character is not too different from the new character
Comment 5 Simon Pieters 2012-03-02 15:16:11 UTC
EDITOR'S RESPONSE: This is an Editor's Response to your comment. If you are satisfied with this response, please change the state of this bug to CLOSED. If you have additional information and would like the editor to reconsider, please reopen this bug. If you would like to escalate the issue to the full HTML Working Group, please add the TrackerRequest keyword to this bug, and suggest title and text for the tracker issue; or you may create a tracker issue yourself, if you are able to do so. For more details, see this document:
   http://dev.w3.org/html5/decision-policy/decision-policy.html

Status: Accepted
Change Description: http://dev.w3.org/cvsweb/html5/html4-differences/Overview.src.html.diff?r1=1.149;r2=1.150
Rationale: Concurred with reporter's comments