<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>6298</bug_id>
          
          <creation_ts>2008-12-10 16:30:02 +0000</creation_ts>
          <short_desc>Provide a parser override</short_desc>
          <delta_ts>2018-05-09 20:10:07 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>Validator</product>
          <component>Website</component>
          <version>HEAD</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows XP</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>enhancement</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Simon Pieters">zcorpan</reporter>
          <assigned_to name="This bug has no owner yet - up for the taking">dave.null</assigned_to>
          <cc>dean</cc>
    
    <cc>karl+w3c</cc>
    
    <cc>ot</cc>
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      <flag name="needinfo"
          id="186"
          type_id="3"
          status="+"
          setter="jordancarrillo530"
    />

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>22732</commentid>
    <comment_count>0</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2008-12-10 16:30:02 +0000</bug_when>
    <thetext>From Bug 6296.

Please provide a parser override, at least between HTML5 and XML/SGML.

If you provide an override between XML and SGML, and the user chooses SGML parser with XML DTD or vice versa, you could either refuse to validate or change to an appropriate DTD that works with the parser.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22782</commentid>
    <comment_count>1</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2008-12-17 14:50:44 +0000</bug_when>
    <thetext>Or you could list both HTML5 and XHTML5 in the &quot;Document Type&quot; list.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22783</commentid>
    <comment_count>2</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2008-12-17 15:16:27 +0000</bug_when>
    <thetext>(In reply to comment #1)
&gt; Or you could list both HTML5 and XHTML5 in the &quot;Document Type&quot; list.

I thought of doing that indeed. That&apos;d work. I got mixed message as to whether it is a good idea to use the XHTML5 name however, as far as I can tell it&apos;s unclear what the best name is. Any pointer to what the currently accepted best term is?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22784</commentid>
    <comment_count>3</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2008-12-17 15:36:54 +0000</bug_when>
    <thetext>The spec uses &quot;XHTML5&quot; and so does Validator.nu. The term is pretty widely used on the Web. Any other name would probably have less backing and just be confusing for users, I think.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22795</commentid>
    <comment_count>4</comment_count>
    <who name="Karl Dubost">karl+w3c</who>
    <bug_when>2008-12-18 13:20:02 +0000</bug_when>
    <thetext>(In reply to comment #3)
&gt;  The term is pretty widely used on the Web.

hmm not widely used on the Web, only a few occurences. I would go through the Working Group and discuss that with Sam Ruby (new chair starting in January) and others. Just to make it a group decision. It Seems that Sam is interested by fixing the issues of communications with XHTML 2 WG.


</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22797</commentid>
    <comment_count>5</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2008-12-18 13:33:50 +0000</bug_when>
    <thetext>(In reply to comment #4)
&gt; I would go through the Working Group and discuss that 

Got a few pointers from Mike Smith:
http://lists.w3.org/Archives/Public/public-xhtml2/2008Jun/thread.html#msg45
http://www.w3.org/html/wg/tracker/issues/52
 so things don&apos;t seem *that* simple. 

Mike suggests, as a relatively safe (if imperfect) alternative, to use something like HTML5 (XML Syntax).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22888</commentid>
    <comment_count>6</comment_count>
    <who name="Dean Edridge">dean</who>
    <bug_when>2009-01-03 15:45:10 +0000</bug_when>
    <thetext>(In reply to comment #4)
&gt; (In reply to comment #3)
&gt; &gt;  The term is pretty widely used on the Web.
&gt; 
&gt; hmm not widely used on the Web, only a few occurences.

It *is* widely used on the web, I know this because I have had Google alerts for XHTML5 for the last 3 1/2 years. XHTML5 has also been mentioned in books, magazines and on the BBC&apos;s web site. 
 
&gt; I would go through the
&gt; Working Group and discuss that with Sam Ruby (new chair starting in January)
&gt; and others. Just to make it a group decision. It Seems that Sam is interested
&gt; by fixing the issues of communications with XHTML 2 WG.

This has already been sorted out Karl [1]

The way to solve the problem for good is to throw away the rejected XHTML 2 proposal. It&apos;s not implementable, was rejected by the browser vendors 8 years ago, it offers no benefits over XHTML1 or XHTML5 and it&apos;s very existence continues to hold back the progress of the web. And despite what some people might say, it will not lead to the successful implementation or deployment of XForms. 

You can&apos;t use &quot;HTML5+XML&quot; even in the short term as both HTML5 and XHTML5 can use &quot;XML syntax&quot; (some XML syntax is valid in text/html). This will just confuse people as it&apos;s not the syntax that distinguishes HTML5 from XHTML5 [2] You&apos;ll have to use &quot;HTML5/XHTML&quot; while we wait for the issue to be &quot;officially&quot; resolved. This label would be OK for the short term as the spec makes it clear that XHTML can not be used as text/html but there is no such wording for &quot;XML syntax&quot; or &quot;HTML5+XML&quot; so that will have to be changed.

[1] http://lists.w3.org/Archives/Public/public-html/2007Oct/0386.html
[2] http://dev.w3.org/html5/spec/Overview.html#html-vs-xhtml
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22889</commentid>
    <comment_count>7</comment_count>
    <who name="Karl Dubost">karl+w3c</who>
    <bug_when>2009-01-03 16:03:40 +0000</bug_when>
    <thetext>(In reply to comment #6)
&gt; It *is* widely used on the web, I know this because I have had Google alerts
&gt; for XHTML5 for the last 3 1/2 years. XHTML5 has also been mentioned in books,
&gt; magazines and on the BBC&apos;s web site. 

I should have backed up what I said. :)

http://www.ask.com/web?q=xhtml5 Showing 1-10 of 12,700
http://www.ask.com/web?q=xhtml1 Showing 1-10 of 533,000

I&apos;m following also the discussions on different alerts ;)


The proposal of Mike is reasonable  and there are still a few issues to solve in terms of community and agreements. 
See http://intertwingly.net/blog/2008/12/15/Co-Chair-HTML-WG and the comments.

(btw I have no preferences over a term. I just pointed out that there are still two voices.)


</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>23070</commentid>
    <comment_count>8</comment_count>
    <who name="Dean Edridge">dean</who>
    <bug_when>2009-01-14 13:12:56 +0000</bug_when>
    <thetext>(In reply to comment #7)
&gt; (In reply to comment #6)
&gt; &gt; It *is* widely used on the web, I know this because I have had Google alerts
&gt; &gt; for XHTML5 for the last 3 1/2 years. XHTML5 has also been mentioned in books,
&gt; &gt; magazines and on the BBC&apos;s web site. 
&gt; 
&gt; I should have backed up what I said. :)
&gt; 
&gt; http://www.ask.com/web?q=xhtml5 Showing 1-10 of 12,700
&gt; http://www.ask.com/web?q=xhtml1 Showing 1-10 of 533,000

And what does this prove?

&gt; 
&gt; I&apos;m following also the discussions on different alerts ;)
&gt; 
&gt; 
&gt; The proposal of Mike is reasonable


Karl, I have just explained that using &quot;XML&quot; instead of &quot;XHTML&quot; is problematic. Only &quot;XHTML&quot; can distinguish between HTML5 and XHTML5 as it is possible to use XML syntax in HTML5 text/html web pages.

I&apos;m sure Mike&apos;s just trying to be nice and keep the peace, but it is not a good solution.

&gt; and there are still a few issues to solve
&gt; in terms of community and agreements.

So what? We can&apos;t let silly politics hold back the progress of the web and the validation of XHTML web sites Karl.
 
&gt; See http://intertwingly.net/blog/2008/12/15/Co-Chair-HTML-WG and the comments.
&gt; 

Yeah, I&apos;ve already seen that, so what? Sam&apos;s heading in the wrong direction and this has been pointed out on his blog and on www-html.

&gt; (btw I have no preferences over a term. I just pointed out that there are still
&gt; two voices.)
&gt; 

&lt;sigh&gt;


</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>23071</commentid>
    <comment_count>9</comment_count>
    <who name="Damien B">kame</who>
    <bug_when>2009-01-14 14:25:54 +0000</bug_when>
    <thetext>&gt; I should have backed up what I said. :)
&gt; &gt; 
&gt; &gt; http://www.ask.com/web?q=xhtml5 Showing 1-10 of 12,700
&gt; &gt; http://www.ask.com/web?q=xhtml1 Showing 1-10 of 533,000
&gt; 
&gt; And what does this prove?

If you look closely you&apos;ll see that the term &quot;XHTML5&quot; has virtually no existence outside of the WhatWG circle. Proposing &quot;HTML 5 (HTML5 syntax)&quot; and &quot;HTML 5 (XML syntax)&quot; is clearer, fully in accordance with the Draft, less misleading regarding XHTML and won&apos;t prevent the few people really accustomed to &quot;XHTML5&quot; from using the validator. </thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>