Re: Validation error frequencies in Alexa top 500 front pages

It turned out that the previous collapsed counts were skewed because
Google serves similar HTML from a number of country-specific
domains.

Here are the errors, with each one counted at most once per page and
with Google excluded:
http://hsivonen.iki.fi/test/moz/alexa500-page-collapsed-counts-no-google.txt

> 0256 / 428 Attribute “border” not allowed on element “img” at  
> this point.
> 0240 / 428 Attribute “language” not allowed on element  
> “script” at this point.
> 0216 / 428 Attribute “cellspacing” not allowed on element  
> “table” at this point.
> 0216 / 428 Attribute “cellpadding” not allowed on element  
> “table” at this point.
> 0199 / 428 Bad value (consolidated) for attribute “target” on  
> element “a”: Browsing context name started with the underscore  
> and used a reserved keyword “blank”.
> 0198 / 428 Text after “&” did not match an entity name.
> 0198 / 428 Almost standards mode doctype.
> 0191 / 428 Attribute “border” not allowed on element “table”  
> at this point.
> 0188 / 428 Attribute “width” not allowed on element “table”  
> at this point.
> 0153 / 428 Attribute “valign” not allowed on element “td” at  
> this point.
> 0153 / 428 Attribute “align” not allowed on element “td” at  
> this point.
> 0151 / 428 Attribute “width” not allowed on element “td” at  
> this point.
> 0144 / 428 Bad value (consolidated) for attribute “http-equiv” on  
> element “meta”.
> 0133 / 428 The internal character encoding declaration must be
> the first child of the “head” element.
> 0128 / 428 Attribute “size” not allowed on element “input” at  
> this point.
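
For readers who want to reproduce this kind of tally, here is a minimal
sketch of the collapsing described above: each distinct message is counted
at most once per page, and pages served from Google domains are skipped.
It is not the script that produced the numbers in this message. The input
shape (a mapping from page URL to a list of validator messages), the
function names, and the host-label check used to spot Google are all
assumptions made for illustration.

```python
from collections import Counter
from urllib.parse import urlparse


def is_google(url):
    """Assumed heuristic: any host with a 'google' label (www.google.fi etc.)."""
    host = urlparse(url).hostname or ""
    return "google" in host.split(".")


def collapsed_counts(errors_by_page):
    """Count each distinct message at most once per page, skipping Google.

    errors_by_page is a hypothetical mapping: page URL -> list of messages.
    Returns (Counter of message -> number of pages, number of pages counted).
    """
    counts = Counter()
    pages = 0
    for url, messages in errors_by_page.items():
        if is_google(url):
            continue
        pages += 1
        # set() collapses repeats of the same message on a single page
        counts.update(set(messages))
    return counts, pages


def format_line(count, pages, message):
    """Render one row in the 'NNNN / total message' layout used in the list."""
    return f"{count:04d} / {pages} {message}"
```

Feeding the result of collapsed_counts through counts.most_common() and
format_line gives rows in the same layout as the list above. The Google
check is deliberately crude and would also match an unrelated host such as
google.example.com, so a real run would probably want an explicit list of
domains instead.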

Of the top 15 errors, 14 would have been non-errors in HTML 4.01 /  
XHTML 1.0 Transitional.

Of the same 15, 9 would have been non-errors in HTML 4.01 / XHTML 1.0 Strict.

-- 
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/

Received on Monday, 4 February 2008 09:23:46 UTC