ISSUE-174: Bug 18396 - Encoding Sniffing Algorithm: Add an XML check as a step zero

Bug 18396 - Encoding Sniffing Algorithm: Add an XML check as a step zero

State:
CLOSED
Product:
html
Raised by:
Richard Ishida
Opened on:
2012-08-01
Description:
Bugzilla: https://www.w3.org/Bugs/Public/show_bug.cgi?id=18396

Raised by: Leif Halvard Silli

About: http://dev.w3.org/html5/spec/Overview#encoding-sniffing-algorithm


Proposal: Extend the encoding sniffing algorithm by adding a new,
explicit step zero, like so:

0. If the document is an XML document, abort these steps.

Justification.

By extending the algorithm this way, then there is an *explicit*
step to 'jump out of the algorithm if XML' - for which it would also be
possible write test cases.

Currently, and especially if the XML document lives in a 'nested
browsing context'[1], then (unless there is a BOM) some browsers let
the XML doc default to the encoding of the 'parent browsing context'
instead of letting it default to the default encoding of the XML format
(UTF-8). Webkit/Chromium/Opera have this error. Firefox do not have
this error. I did not test IE9/10 yet, but suspect they are more on
Firefox' side. Regarding defaulting to the encoding of the parent
browsing context, then [see bug #foo and see bug #bar]

More data in my related blog post.[2]

[1] http://dev.w3.org/html5/spec/Overview#nested-browsing-context
[2] http://målform.no/blog/white-spots-in-html5-s-encoding-sniffing-algorithm
Related Actions Items:
No related actions
Related emails:
  1. I18N-ISSUE-174: Bug 18396 - Encoding Sniffing Algorithm: Add an XML check as a step zero [.Monitor-HTML] (from sysbot+tracker@w3.org on 2012-08-01)

Related notes:

No additional notes.

Display change log ATOM feed


Addison Phillips <addisonI18N@gmail.com>, Chair, Richard Ishida <ishida@w3.org>, Bert Bos <bert@w3.org>, Fuqiao Xue <xfq@w3.org>, Atsushi Shimono <atsushi@w3.org>, Staff Contacts
Tracker: documentation, (configuration for this group), originally developed by Dean Jackson, is developed and maintained by the Systems Team <w3t-sys@w3.org>.
$Id: 174.html,v 1.1 2023/07/19 12:01:59 carcone Exp $