This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 24831 - "the following additional information" -- ITYM "misinformation"? (What if the user wants DTD validation done?)
Summary: "the following additional information" -- ITYM "misinformation"? (What if th...
Status: RESOLVED WORKSFORME
Alias: None
Product: WHATWG
Classification: Unclassified
Component: HTML (show other bugs)
Version: unspecified
Hardware: Other other
: P3 normal
Target Milestone: Unsorted
Assignee: Ian 'Hixie' Hickson
QA Contact: contributor
URL: http://www.whatwg.org/specs/web-apps/...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-02-26 23:21 UTC by contributor
Modified: 2014-03-12 18:10 UTC (History)
3 users (show)

See Also:


Attachments

Description contributor 2014-02-26 23:21:02 UTC
Specification: http://www.whatwg.org/specs/web-apps/current-work/multipage/the-xhtml-syntax.html
Multipage: http://www.whatwg.org/C#parsing-xhtml-documents
Complete: http://www.whatwg.org/c#parsing-xhtml-documents
Referrer: http://www.whatwg.org/specs/web-apps/current-work/multipage/the-iframe-element.html

Comment:
"the following additional information" -- ITYM "misinformation"?  (What if the
user wants DTD validation done?)

Posted from: 2001:470:1f07:57:1ca0:ef6b:5048:c171 by naesten@gmail.com
User agent: Mozilla/5.0 (Windows NT 5.1; rv:24.0) Gecko/20100101 Firefox/24.0
Comment 1 Samuel Bronson 2014-02-27 00:20:59 UTC
Note that I totally support the goal of not making any attempt to download these DTDs from the provided SYSTEM identifiers, and the idea of making sure that the character entities are always defined for documents specifying any of these PUBLIC identifiers.

I'm just:

1. Nitpicking about the wording here, since the DTD you "link to" here is quite different from the ones normally named by these PUBLIC identifiers.

and:

2. Suggesting that it could be reasonable for a user-agent to instead use a complete copy of the appropriate DTD that it happens to have on hand, perhaps looking it up in a catalog file.

I was going to suggest it might make sense to treat SYSTEM identifiers corresponding to these PUBLIC identifiers in the same way, but then I noticed that my system's XML catalogs don't do this (except for identifiers starting with "http://www.w3.org/TR/MathML2/dtd/"), so it probably isn't useful.

(I may have temporarily forgotten that SYSTEM identifiers are mandatory in XML. I really would prefer that they be optional: the security, document stability[1], and performance implications of actually dereferencing these being what they are, one would often prefer to rely entirely on catalog files; having made this choice, mandatory SYSTEM identifiers are nothing but a nuisance -- one must either use bogus identifiers, or be careful to always disable networking, lest something not in a catalog sneak in ...)

[1]: Not everyone can keep a URL alive forever like the W3 or OASIS; even docbook.org makes me a bit nervous.
Comment 2 Ian 'Hixie' Hickson 2014-02-28 00:21:54 UTC
The term "additional information" is from XML:

# An XML processor attempting to retrieve the entity's content may use any 
# combination of the public and system identifiers as well as additional 
# information outside the scope of this specification to try to generate an 
# alternative URI reference.