<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>10014</bug_id>
          
          <creation_ts>2010-06-25 19:43:44 +0000</creation_ts>
          <short_desc>XHTML1 Doctypes cause DOM diff in XML parsers with external entity handling</short_desc>
          <delta_ts>2010-10-05 13:07:40 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>pre-LC1 HTML/XHTML Compat. Authoring Guide (ed: Eliot Graff)</component>
          <version>unspecified</version>
          <rep_platform>Macintosh</rep_platform>
          <op_sys>Mac System 9.x</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>WONTFIX</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Leif Halvard Silli">xn--mlform-iua</reporter>
          <assigned_to name="Eliot Graff">eliotgra</assigned_to>
          <cc>eliotgra</cc>
    
    <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>36422</commentid>
    <comment_count>0</comment_count>
    <who name="Leif Halvard Silli">xn--mlform-iua</who>
    <bug_when>2010-06-25 19:43:44 +0000</bug_when>
    <thetext>Henri in bug In bug 9965, regarding CDATA and non-CDATA attributes:

Comment #12 From Henri Sivonen 2010-06-22 07:17:09 -------
With &lt;!DOCTYPE html&gt;, all attribute are CDATA attributes.

However, if you use one of the permitted XHTML 1.0 doctypes and the XML
processor processes external entities, some attributes aren&apos;t CDATA attributes.
As for it being an issue, the document tree would be *different* which implies
not polyglot.
----------------

So, there should be 4 options:

1) Say that the DOM differences are acceptable
2) Variant of 1): Say  that polyglot spec is only meant for HTML parsers and XML processor without external entity handling
3) Declare XHML1.x doctypes as incompatible with polyglot spec.
4) Require such strict authoring requirements that document is treated the same way, regardless of DTD/DOCTYPE

In case of 3) then that also decides what kind of DTD based doctypes that could be permitted: Only those that do not declare anything _but_ CDATA attributes.

This is a judgement call: even when it comes to CDATA attributes, then there are differences - namely: HTML parsers do not perform any XML-like normalization. So, it would be possible to say that even CDATA attributes cannot become similar enough.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>36745</commentid>
    <comment_count>1</comment_count>
    <who name="Eliot Graff">eliotgra</who>
    <bug_when>2010-07-10 00:27:09 +0000</bug_when>
    <thetext>If I read things correctly, most of these doctypes will cause such entities to be supported in conforming XHTML5 implementations: http://dev.w3.org/html5/spec/Overview.html#parsing-xhtml-documents 

Older implementations and other XML parsers could have issues, but the polyglot spec is focusing on HTML5 implementations--at least for now. If time and resources allow, perhaps notes for earlier implementations could be added later (if necessary). 

Thanks so much for the feedback!</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>