<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>14</bug_id>
          
          <creation_ts>2002-10-25 02:04:14 +0000</creation_ts>
          <short_desc>XHTML Detection is over-eager.</short_desc>
          <delta_ts>2008-12-01 03:03:31 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>Validator</product>
          <component>check</component>
          <version>0.6.0b1</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc>http://www.damowmow.com/playground/html-not-xml-2.html</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>1.0</target_milestone>
          <dependson>24</dependson>
    
    <dependson>739</dependson>
    
    <dependson>1500</dependson>
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Terje Bless">link</reporter>
          <assigned_to name="Olivier Thereaux">ot</assigned_to>
          <cc>aaz</cc>
    
    <cc>ian</cc>
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>16</commentid>
    <comment_count>0</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2002-10-25 02:04:14 +0000</bug_when>
    <thetext>Reported by Ian Hickson:

The following document:

   http://www.damowmow.com/playground/html-not-xml-2.html

....is a valid HTML 4.01 document. However, with the new validator, I get
the following error message:

| This Page Is NOT Valid XHTML 1.0 Strict!
|
| Below are the results of attempting to parse this document with an SGML
| parser.
|
| 1. Line 2, column 7: S separator in comment declaration
|
| &lt;!-- -- --&gt;
|        ^

This is probably a bug in the XHTML detection code.

Furthermore, when I force it to be handled as HTML 4.01, it still gets
autodetected as XHTML.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>96</commentid>
    <comment_count>1</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2002-10-27 11:40:00 +0000</bug_when>
    <thetext>This case is pathological and compounded by the differing comment syntax between
SGML and XML. AFAICT, the root cause is that HTML::Parser doesn&apos;t understand the
comment syntax and so detects the XHTML DOCTYPE, forcing the Validator into XML
mode. Any fix for this needs to begin by fixing HTML::Parser&apos;s comment parser
and then we can see what this leaves us with in &quot;check&quot;. This probably also
means we&apos;ll have to fix Bug #24 first.

Setting blocker on Bug #24 and target to 0.7.0 to revisit the issue then.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>3398</commentid>
    <comment_count>2</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2005-05-12 01:19:47 +0000</bug_when>
    <thetext>I doubt we&apos;ll get around to fixing this bug for 0.7.0. 
Terje, What do you think?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>3399</commentid>
    <comment_count>3</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2005-05-12 02:03:56 +0000</bug_when>
    <thetext>According to the HTML WG, a UA is non-compliant if it handles an XHTML document
sent as text/html as XHTML; such a UA must apparently handle the document as
HTML regardless of what it looks like.

# [...] documents served as text/html should be treated as HTML and not as XHTML.
 -- http://lists.w3.org/Archives/Public/www-html/2000Sep/0024.html

But I don&apos;t know if they meant to include validators in that statement.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>5362</commentid>
    <comment_count>4</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2005-08-05 00:45:20 +0000</bug_when>
    <thetext>(In reply to comment #2)
&gt; I doubt we&apos;ll get around to fixing this bug for 0.7.0. 

indeed.
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7664</commentid>
    <comment_count>5</comment_count>
    <who name="murali">murali1287</who>
    <bug_when>2006-01-06 08:33:55 +0000</bug_when>
    <thetext>

*** This bug has been marked as a duplicate of 12 ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>11312</commentid>
    <comment_count>6</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2006-08-30 02:26:05 +0000</bug_when>
    <thetext>adding dependency on Bug #1500 too, as the switching to xml mode may end up being decided not by the doctype but by the media type.
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>15254</commentid>
    <comment_count>7</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2007-05-31 01:04:01 +0000</bug_when>
    <thetext>(In reply to comment #3)
&gt; According to the HTML WG, a UA is non-compliant if it handles an XHTML document
&gt; sent as text/html as XHTML; such a UA must apparently handle the document as
&gt; HTML regardless of what it looks like.

According to the WG, XHTML is always XML, and should be validated as such.

http://lists.w3.org/Archives/Public/www-validator/2007Apr/0175.html

&gt; But I don&apos;t know if they meant to include validators in that statement.

Apparently not.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19347</commentid>
    <comment_count>8</comment_count>
      <attachid>524</attachid>
    <who name="venki">venkiktr</who>
    <bug_when>2008-03-07 05:35:38 +0000</bug_when>
    <thetext>Created attachment 524
Hello india

bug QA</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19348</commentid>
    <comment_count>9</comment_count>
      <attachid>525</attachid>
    <who name="venki">venkiktr</who>
    <bug_when>2008-03-07 05:37:11 +0000</bug_when>
    <thetext>Created attachment 525
Hello india

bug QA</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19349</commentid>
    <comment_count>10</comment_count>
      <attachid>526</attachid>
    <who name="venki">venkiktr</who>
    <bug_when>2008-03-07 05:37:39 +0000</bug_when>
    <thetext>Created attachment 526
Hello india

bug QA</thetext>
  </long_desc>
      
          <attachment
              isobsolete="1"
              ispatch="1"
              isprivate="0"
          >
            <attachid>524</attachid>
            <date>2008-03-07 05:35:38 +0000</date>
            <delta_ts>2008-03-07 05:35:38 +0000</delta_ts>
            <desc>Hello india</desc>
            <filename>Buglist_QA.txt</filename>
            <type>text/plain</type>
            <size>0</size>
            <attacher name="venki">venkiktr</attacher>
            
              <data encoding="base64"></data>

          </attachment>
          <attachment
              isobsolete="1"
              ispatch="0"
              isprivate="0"
          >
            <attachid>525</attachid>
            <date>2008-03-07 05:37:11 +0000</date>
            <delta_ts>2008-03-07 05:37:11 +0000</delta_ts>
            <desc>Hello india</desc>
            <filename>317200623702PMbhumika_chawla.jpg</filename>
            <type>image/pjpeg</type>
            <size>0</size>
            <attacher name="venki">venkiktr</attacher>
            
              <data encoding="base64"></data>

          </attachment>
          <attachment
              isobsolete="1"
              ispatch="0"
              isprivate="0"
          >
            <attachid>526</attachid>
            <date>2008-03-07 05:37:39 +0000</date>
            <delta_ts>2008-03-07 05:37:39 +0000</delta_ts>
            <desc>Hello india</desc>
            <filename>317200623702PMbhumika_chawla.jpg</filename>
            <type>image/pjpeg</type>
            <size>0</size>
            <attacher name="venki">venkiktr</attacher>
            
              <data encoding="base64"></data>

          </attachment>
      

    </bug>

</bugzilla>