This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 5222 - Page is valid from upload or copy/paste but not valid from link
Summary: Page is valid from upload or copy/paste but not valid from link
Alias: None
Product: Validator
Classification: Unclassified
Component: Parser (show other bugs)
Version: HEAD
Hardware: PC Windows XP
: P2 normal
Target Milestone: ---
Assignee: This bug has no owner yet - up for the taking
QA Contact: qa-dev tracking
Depends on:
Reported: 2007-10-23 12:24 UTC by Stuart Wilson
Modified: 2007-10-29 15:31 UTC (History)
1 user (show)

See Also:


Description Stuart Wilson 2007-10-23 12:24:33 UTC
From link it says the code is invalid:
This page is not Valid (no Doctype found)!
 No Character Encoding Found! Falling back to UTF-8.
 Unable to Determine Parse Mode!
 Line 1, Column 0: end of document in prolog.
But on upload it changes to a valid page:
The uploaded document "C:\wamp\www\links\videolinks\shinevideo.html" was checked and found to be valid XHTML 1.0 Transitional. This means that the resource in question identified itself as "XHTML 1.0 Transitional" and that we successfully performed a formal validation using an SGML or XML Parser (depending on the markup language used). 
I have a few pages with the same code in the same folder and same happens.
All other pages on my site will validate from a link.
Comment 1 Olivier Thereaux 2007-10-24 03:42:26 UTC
There seems to be a misconfiguration in your server somewhere.
Where an HTTP request is made with
Connection: close
the server returns no content at all (which is what the validator is getting, see e.g validation results with "show source" on).

Most browsers will see the page (because they use Connection: keep-alive) but a number of other agents, such as search engine crawlers, will have the same issue.

I recommend fixing your server config to allow non-keep-alive connections.

*** This bug has been marked as a duplicate of bug 4586 ***
Comment 2 Stuart Wilson 2007-10-24 09:49:10 UTC
I don't understand this even more now.
This is the opposite of bug 4586.
When I validate from http://www.sunflowertherapies.netindex.html it says valid.
Yet when I do the same at validome it says invalid on the same page. Contrary to bug 4586.
Comment 3 Stuart Wilson 2007-10-26 00:17:53 UTC
Further testing
I now have a page called
which validates correctly.
In the code there is a section that is commented out.

This is the piece i cut out and then it doesnt validate - but it says "no doctype found" - the two files are identical apart from this part...

<table style="text-align: left; width: 800px;" summary="base for the bumf" border="0" cellpadding="5">
<p align="left"><br />
Whether you&#8217;re planning a holiday or live nearby, Wendy and Stuart at<br />
<strong><em>Sunflower Holistic Therapies</em></strong> near Antequera will make sure you leave feeling refreshed, relaxed and beautiful - inside and out!  - and if you&#8217;re not sure which treatment is right for you, you can learn more from the&nbsp;&#8217;treatments&#8217; pages or contact us for guidance.&nbsp;&nbsp; Decided on your treatment already?&nbsp;&nbsp;Then simply email us or call to&nbsp;make your appointment.</p>

Down to her I cut out and then it say - no doctype found

Commented out it is fine. but when i cut it out to tidy up the page it does not validate from a link.
How can a piece of code near the bottom of the page make it say there is no doctype stated.
Comment 4 Stuart Wilson 2007-10-26 00:23:03 UTC
Further to this bug...
I have just pasted the commented out piece of code into all of the previously failed pages in the same directory and it magically makes them all validate from a link.
Now can you find out why for me please.
It cannot be a server issue.
Perhaps it is the length of the page that is causing it to fail the validation.
Comment 5 Olivier Thereaux 2007-10-29 10:28:04 UTC

The addresses you have now all redirect to a "404" page, which is what the validator is checking. I am not sure what you are doing with your server, but if you keep changing the setup, it's impossible to debug consistently.

Furthermore, I am rather confident about my original diagnosis, before you started changing things around.
Comment 6 Stuart Wilson 2007-10-29 15:31:36 UTC
(In reply to comment #5)
Sorry - I changed the folder layout without thinking.
The pages are on

and are test.html plus others.
(I have another, test1.html which is for divx player and avi files where i use the embed tag, but i dont really expect that to be valid.)

All validate with a piece of commented out code, but non without that commented code.