<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>705</bug_id>
          
          <creation_ts>2004-05-07 19:59:12 +0000</creation_ts>
          <short_desc>The Markup Validation Service detects HTML 4.01 Strict as HTML 4.01 Transitional.</short_desc>
          <delta_ts>2005-02-05 04:45:02 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>Validator</product>
          <component>check</component>
          <version>0.6.5</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://validator.w3.org/check</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P1</priority>
          <bug_severity>major</bug_severity>
          <target_milestone>0.6.7</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Brian Sexton">bugzilla-w3c</reporter>
          <assigned_to name="Terje Bless">link</assigned_to>
          <cc>ot</cc>
    
    <cc>ville.skytta</cc>
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>1793</commentid>
    <comment_count>0</comment_count>
    <who name="Brian Sexton">bugzilla-w3c</who>
    <bug_when>2004-05-07 19:59:12 +0000</bug_when>
    <thetext>Version 0.6.5 of the W3C Markup Validation Service reports that documents with 
HTML 4.01 Strict DOCTYPE declaration are HTML 4.01 Transitional documents:

&apos;The uploaded file was checked and found to be valid HTML 4.01 Transitional. 
This means that the resource in question identified itself as &quot;HTML 4.01 
Transitional&quot; and that we successfully performed a formal validation using an 
SGML or XML Parser (depending on the markup language used).&apos;

I have tried different source spacinglots of line breaks or none at all; a 
single line for the DOCTYPE declaration or multiple linesbut always with the 
same results.  Here is a test case (in its liberally spaced form for easy 
reading):

-----[ Begin Clip ]-----

&lt;!DOCTYPE HTML PUBLIC &quot;-//W3C//DTD HTML 4.01 
Strict//EN&quot; &quot;http://www.w3.org/TR/html4/strict.dtd&quot;&gt;


&lt;HTML&gt;


&lt;HEAD&gt;

  &lt;META HTTP-EQUIV=&quot;Content-Type&quot; CONTENT=&quot;text/html; charset=ISO-8859-1&quot;&gt;

  &lt;TITLE&gt;Test Case&lt;/TITLE&gt;

&lt;/HEAD&gt;


&lt;BODY&gt;


&lt;DIV&gt;


&lt;H1&gt;Test Case&lt;/H1&gt;


&lt;P&gt;Despite the HTML 4.01 Strict DOCTYPE declaration, the W3C validator reports 
that this document is &amp;quot;HTML 4.01 Transitional&amp;quot;:&lt;/P&gt;

&lt;P&gt;&apos;The uploaded file was checked and found to be valid HTML 4.01 
Transitional. This means that the resource in question identified itself as 
&amp;quot;HTML 4.01 Transitional&amp;quot; and that we successfully performed a formal 
validation using an SGML or XML Parser (depending on the markup language 
used).&apos;&lt;/P&gt;

&lt;P&gt;I have tried different source spacing&amp;mdash;lots of line breaks or none at 
all; a single line for the DOCTYPE declaration or multiple lines&amp;mdash;but 
always with the same results.&lt;/P&gt;


&lt;/DIV&gt;


&lt;/BODY&gt;


&lt;/HTML&gt;

-----[ End Clip ]-----</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1803</commentid>
    <comment_count>1</comment_count>
      <attachid>362</attachid>
    <who name="Brian Sexton">bugzilla-w3c</who>
    <bug_when>2004-05-08 14:27:41 +0000</bug_when>
    <thetext>Created attachment 362
A Well-Spaced Test Case

This is the same test case as within the body of my report (except that it
lacks the hard line breaks that were introduced by Bugzilla), but in a separate
file for your convenience.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1826</commentid>
    <comment_count>2</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2004-05-15 06:47:17 +0000</bug_when>
    <thetext>This is caused by validator&apos;s &quot;fall back to HTML 4.01 Transitional if no known
public identifier is found&quot; behaviour.

What triggers it in your document is that you&apos;ve specified:

   &quot;-//W3C//DTD HTML 4.01 Strict//EN&quot;

...but there&apos;s no such public identifier.  It should be:

   &quot;-//W3C//DTD HTML 4.01//EN&quot;

...for HTML 4.01 Strict.

Nevertheless, the validator&apos;s output for these cases is not acceptable, I wish
we can get this fixed soon.  BTW, specifying just about anything as long as the
root element is &quot;HTML&quot; will produce the same results, for example this:

  &lt;!DOCTYPE HTML PUBLIC &quot;-//foo//DTD bar//EN&quot; &quot;quux.dtd&quot;&gt;</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1829</commentid>
    <comment_count>3</comment_count>
    <who name="Brian Sexton">bugzilla-w3c</who>
    <bug_when>2004-05-15 16:39:08 +0000</bug_when>
    <thetext>I am not sure how I got into the habit of using a non-standard DTD declaration,
but I am confirming the assertion made in Comment #2 with reference URLs:

http://www.w3.org/QA/2002/04/valid-dtd-list.html

http://www.w3.org/QA/Tips/Doctype

It should be noted, however, that the same non-standard DTD declaration does
appear in what looks like old versions of the validator&apos;s source code archived
on the W3C Web site:

http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check?rev=1.200.2.13
http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check?rev=1.200.2.14
http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check?rev=1.200.2.15
http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check?rev=1.200.2.16
http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check?rev=1.200.2.17

Naturally, it also appears in some discussion on the validator list.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1835</commentid>
    <comment_count>4</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2004-05-16 03:45:41 +0000</bug_when>
    <thetext>Argh! And if the page happens to be valid HTML 4.01 Transitional you won&apos;t even
get as much as a warning about this, much less an error. OpenSP isn&apos;t even
outputting anything we could key off of. Anyone have suggestions?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1860</commentid>
    <comment_count>5</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2004-05-17 16:45:16 +0000</bug_when>
    <thetext>*** Bug 454 has been marked as a duplicate of this bug. ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1869</commentid>
    <comment_count>6</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2004-05-17 18:15:39 +0000</bug_when>
    <thetext>One suggestion off the cuff, untested: remove the &quot;HTML&quot; ~ &quot;HTML 4.01
Transitional&quot; fallback from the catalog(s) (as well as other similar ones from
other catalogs, if there are any), and insert the doctype ourselves if none is
found, akin to how it&apos;s currently done in doctype override.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1870</commentid>
    <comment_count>7</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2004-05-17 19:13:54 +0000</bug_when>
    <thetext>I found that idea impractical to implement last time I looked at it (for several reasons). If you can come 
up with code for this I&apos;d be very happy. In either case, this is too disruptive for 0.6.6 IMO; setting target 
0.7.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1871</commentid>
    <comment_count>8</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2004-05-18 02:27:48 +0000</bug_when>
    <thetext>What about just removing the catch-all-HTML catalog fallback for 0.6.6?
Currently the problems it causes outweigh its benefits, and removing the
fallback should not be too disruptive.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1892</commentid>
    <comment_count>9</comment_count>
    <who name="niq">nick</who>
    <bug_when>2004-05-19 03:19:46 +0000</bug_when>
    <thetext>Two points Re: Terje&apos;s #4:

(1) I&apos;m confused that it&apos;s giving preference to a catalogue default over a valid
SYSTEM identifier in a case where the latter is provided.  But that&apos;s really a
side-issue: if we fix it, the problem just reappears when someone mistypes the
SYSTEM id as well.

(2) I think we could get OpenSP to emit a useful message by hacking the local
defaulted DTD to emit a custom version string we can test against.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1894</commentid>
    <comment_count>10</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2004-05-20 08:14:58 +0000</bug_when>
    <thetext>Retargetting for 0.6.7 and change Status-&gt;Accepted.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1902</commentid>
    <comment_count>11</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2004-05-20 21:42:44 +0000</bug_when>
    <thetext>Seems like Terje&apos;s tweak with &quot;OVERRIDE NO&quot; in sgml.soc fixed this. 

One interesting  consequence is that documents with no doctype are now said to be &quot;NOT VALID  &quot;. 
Which is quite true, they&apos;re not valid anything, they don&apos;t claim to be anything. I&apos;m sure some people 
would consider it a heresy that a validation claim does not express against what. 

Other than that, a run through /dev/tests/ did not show anything abnormal. </thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1903</commentid>
    <comment_count>12</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2004-05-21 03:16:37 +0000</bug_when>
    <thetext>Sort of fixed, but the &quot;friendliness&quot; ie. fallback from no doctype to HTML 4.01
Transitional seems to be gone, or at least is partially broken:
http://qa-dev.w3.org/wmvs/0.6/check?uri=http%3A%2F%2Fkoti.welho.com%2Fvskytta%2Fbug705-2.html</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>1904</commentid>
    <comment_count>13</comment_count>
    <who name="Terje Bless">link</who>
    <bug_when>2004-05-21 11:31:19 +0000</bug_when>
    <thetext>False alarm! I repeat, false alarm! :-)

The change appeared to &quot;fix&quot; it because it introduced a bug that disabled the
fallback Doctype completely. With that gone, the behaviour described in this bug
remains.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>2239</commentid>
    <comment_count>14</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2004-09-06 06:24:02 +0000</bug_when>
    <thetext>The bug is apparently fixed in current HEAD. close?</thetext>
  </long_desc>
      
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>362</attachid>
            <date>2004-05-08 14:27:41 +0000</date>
            <delta_ts>2004-05-08 14:27:41 +0000</delta_ts>
            <desc>A Well-Spaced Test Case</desc>
            <filename>Bug 705 Test Case.html</filename>
            <type>text/html</type>
            <size>933</size>
            <attacher name="Brian Sexton">bugzilla-w3c</attacher>
            
              <data encoding="base64">PCFET0NUWVBFIEhUTUwgUFVCTElDICItLy9XM0MvL0RURCBIVE1MIDQuMDEgU3RyaWN0Ly9FTiIg
Imh0dHA6Ly93d3cudzMub3JnL1RSL2h0bWw0L3N0cmljdC5kdGQiPgoKCjxIVE1MPgoKCjxIRUFE
PgoKICA8TUVUQSBIVFRQLUVRVUlWPSJDb250ZW50LVR5cGUiIENPTlRFTlQ9InRleHQvaHRtbDsg
Y2hhcnNldD1JU08tODg1OS0xIj4KCiAgPFRJVExFPlRlc3QgQ2FzZTwvVElUTEU+Cgo8L0hFQUQ+
CgoKPEJPRFk+CgoKPERJVj4KCgo8SDE+VGVzdCBDYXNlPC9IMT4KCgo8UD5EZXNwaXRlIHRoZSBI
VE1MIDQuMDEgU3RyaWN0IERPQ1RZUEUgZGVjbGFyYXRpb24sIHRoZSBXM0MgdmFsaWRhdG9yIHJl
cG9ydHMgdGhhdCB0aGlzIGRvY3VtZW50IGlzICZxdW90O0hUTUwgNC4wMSBUcmFuc2l0aW9uYWwm
cXVvdDs6PC9QPgoKPFA+J1RoZSB1cGxvYWRlZCBmaWxlIHdhcyBjaGVja2VkIGFuZCBmb3VuZCB0
byBiZSB2YWxpZCBIVE1MIDQuMDEgVHJhbnNpdGlvbmFsLiBUaGlzIG1lYW5zIHRoYXQgdGhlIHJl
c291cmNlIGluIHF1ZXN0aW9uIGlkZW50aWZpZWQgaXRzZWxmIGFzICZxdW90O0hUTUwgNC4wMSBU
cmFuc2l0aW9uYWwmcXVvdDsgYW5kIHRoYXQgd2Ugc3VjY2Vzc2Z1bGx5IHBlcmZvcm1lZCBhIGZv
cm1hbCB2YWxpZGF0aW9uIHVzaW5nIGFuIFNHTUwgb3IgWE1MIFBhcnNlciAoZGVwZW5kaW5nIG9u
IHRoZSBtYXJrdXAgbGFuZ3VhZ2UgdXNlZCkuJzwvUD4KCjxQPkkgaGF2ZSB0cmllZCBkaWZmZXJl
bnQgc291cmNlIHNwYWNpbmcmbWRhc2g7bG90cyBvZiBsaW5lIGJyZWFrcyBvciBub25lIGF0IGFs
bDsgYSBzaW5nbGUgbGluZSBmb3IgdGhlIERPQ1RZUEUgZGVjbGFyYXRpb24gb3IgbXVsdGlwbGUg
bGluZXMmbWRhc2g7YnV0IGFsd2F5cyB3aXRoIHRoZSBzYW1lIHJlc3VsdHMuPC9QPgoKCjwvRElW
PgoKCjwvQk9EWT4KCgo8L0hUTUw+
</data>

          </attachment>
      

    </bug>

</bugzilla>