This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.
The doctype detection routine in preparse_doctype() uses the following regexp to detect the FPI and SI: m(<!DOCTYPE\s+(\w+)\s+(?:PUBLIC|SYSTEM)\s+... The first group, (\w+), captures the name of the document type, which has to match the root element (ref: http://www.w3.org/TR/xml/#vc-roottype ). However, \w+ is too restrictive: a root element name can also contain, among other characters, a dash or a dot (ref: http://www.w3.org/TR/xml/#IDANQDS ). This half-breaks doctype detection for languages whose root element name includes characters outside the Perl word class (alphanumerics plus _).
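To illustrate the failure, here is a minimal sketch in Python (not the validator's Perl code). The names OLD_DOCTYPE, NEW_DOCTYPE, NAME and root_name are hypothetical, the NAME pattern is only an ASCII approximation of the XML Name production, and the sample doctype (root element score-partwise and its system identifier) is an assumed MusicXML example, not taken from the actual test case or patch.

```python
import re

# Pattern in the style of the one quoted above: \w+ accepts only
# letters, digits and underscore, so a dash ends the match early.
OLD_DOCTYPE = re.compile(r'<!DOCTYPE\s+(\w+)\s+(?:PUBLIC|SYSTEM)\s+')

# Hypothetical relaxed pattern. This is an ASCII-only approximation of
# an XML Name (the real production also allows many non-ASCII characters).
NAME = r'[A-Za-z_:][A-Za-z0-9_:.\-]*'
NEW_DOCTYPE = re.compile(r'<!DOCTYPE\s+(' + NAME + r')\s+(?:PUBLIC|SYSTEM)\s+')


def root_name(pattern, text):
    """Return the captured document type name, or None if nothing matches."""
    match = pattern.search(text)
    return match.group(1) if match else None


# Assumed sample declaration with a dash in the root element name.
sample = ('<!DOCTYPE score-partwise PUBLIC '
          '"-//Recordare//DTD MusicXML 1.1 Partwise//EN" '
          '"http://www.musicxml.org/dtds/partwise.dtd">')

print(root_name(OLD_DOCTYPE, sample))
print(root_name(NEW_DOCTYPE, sample))
```

With the \w+ version the declaration is not matched at all, so no FPI is picked up; the relaxed version captures the full name. A doctype whose name contains only word characters, such as html, is matched by both.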
Patched in http://lists.w3.org/Archives/Public/www-validator-cvs/2007Sep/0071.html
Adding test case. The validator should now report on this test case as "This Page Is Valid -//Recordare//DTD MusicXML 1.1 Partwise//EN!" rather than "This Page Is Valid XML!"