This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 11055 - BOM confusion
Summary: BOM confusion
Status: RESOLVED FIXED
Alias: None
Product: HTML WG
Classification: Unclassified
Component: LC1 HTML/XHTML Compatibility Authoring Guide (ed: Eliot Graff) (show other bugs)
Version: unspecified
Hardware: PC Windows NT
: P2 normal
Target Milestone: ---
Assignee: Eliot Graff
QA Contact: HTML WG Bugzilla archive list
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2010-10-14 21:04 UTC by David Carlisle
Modified: 2011-08-04 05:07 UTC (History)
4 users (show)

See Also:


Attachments

Description David Carlisle 2010-10-14 21:04:17 UTC
section 3 of

http://dev.w3.org/html5/html-xhtml-author-guide/WD-html-polyglot-20101019.htm



When polyglot markup uses UTF-16, it  must not include a BOM. When polyglot markup uses UTF-16, it  must include the BOM 


I assume the first UTF=16 was intended to say UTF-8


It goes on to say

Therefore, polyglot markup may use <meta charset="*"/> in combination with BOM,

If there is a BOM, then this would be some flavour of UTF-16 but that would be invalid HTML5 according to the current draft

http://www.whatwg.org/specs/web-apps/current-work/multipage/semantics.html#attr-meta-charset

says

If the attribute is present in an XML document, its value must be an ASCII case-insensitive match for the string "UTF-8" (and the document is therefore forced to use UTF-8 as its encoding).
Comment 1 Eliot Graff 2010-10-29 20:36:06 UTC
In the 29 October editor's draft, I changed the relevant part of section 3 to the following:

Using <meta charset="*"/> has no effect in XML. Therefore, polyglot markup may use <meta charset="*"/> provided the document is encoded as UTF-8 and the value of charset is a case-insensitive match for the string utf-8. 

And fixed that typo.

I believe that satisfies the requests in this bug.

Thanks for the feedback!

Eliot
Comment 2 Michael[tm] Smith 2011-08-04 05:07:06 UTC
mass-move component to LC1
Comment 3 Michael[tm] Smith 2011-08-04 05:07:30 UTC
mass-move component to LC1