This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.
it would be great if the output of the validator would be an xml file with well-defined errors. Then, I could have my httpunit test each of my web-pages and collect the result in an automated way.
The validator can allready give you XML output by appending ";output=xml" to the query string. We are also investigating a SOAP interface and other options for programattic access to the Validator. Note that the XML output option is highly experimental, the format is 100% guaranteed to change (we will deliberately change it at some point!), and the entire option is fairly likely to go away when a better method is found. However, that said, the XML output has been reasonably stable for about a year now; so if you don't mind a few bumps and bruises every now and then, the XML output is available and can be used.