Internationalization (i18n)

Making the World Wide Web worldwide!


Groups/repos

i18n WG

i18n Interest Group

African LE

Americas LE

Arabic LE

Chinese LE

Ethiopic LE

European LE

Hebrew LE

India LE

Japanese LE

Korean LE

Mongolian LE

SE Asian LE

Tibetan LE

Participate!

Join a Group

Follow the work

Translate a specification or page

International­ization Sponsorship Program

News by category
News archives
July 2011 (13)
July 2009 (10)
June 2009 (10)
June 2008 (13)
Search news

I18n sponsors

APL, Japan The Paciello Group Monotype Alibaba

Tag(s): tutorial-char-enc

Posts

Article for review: Character encodings in HTML and CSS

Read the article

Comments are being sought on this article prior to final release. Please send any comments to www-international@w3.org (subscribe). We expect to publish a final version in one to two weeks.

This is an update, in a temporary location, of the tutorial Character sets & encodings in XHTML, HTML and CSS. (Please be careful about bookmarking the location, since it is only temporary.)

A lot of new material was added, eg. related to the UTF-8 BOM, normalization, etc., and the material was rearranged significantly. The rearrangement was to downplay slightly the XHTML 1.0 issues, given that that is now only relevant to IE6, but also to help readers more quickly find information they need for the format they are dealing with.

The explicit distinction between XHTML 1.0 and XHTML 1.1 with regard to MIME types was removed, since the XHTML2 WG is hopefully very close to issuing a PER that enables XHTML 1.1 to be served as text/html.

The update adds information about HTML5.

Where a section corresponds to an article that has been updated, those updates were also migrated to this document.

Updated tutorial: Serving XHTML 1.0

Read the tutorial

This tutorial was updated to reflect the fact that IE7 no longer flips into quirks mode when an XML declaration is used. For a detailed list of changes read the full post.

The following paragraphs were changed as shown by the ins and del markup.

<p>In browsers such as <ins>Internet Explorer 7, Firefox,</ins><del>Mozilla</del>, Netscape, Opera, and others, with or without the XML declaration, a page served with a DOCTYPE declaration will be rendered in standards mode.</p>
<p> With Internet Explorer<ins> 6</ins>, however, if anything appears before the DOCTYPE declaration the page is rendered in quirks mode. </p>
<p>Because Internet Explorer <ins> 6</ins> users <ins>still</ins> count for a very large proportion of browser users, this is a significant issue. If you want to ensure that your pages are rendered in the same way on all standards-compliant browsers, you need to think carefully about how you deal with this.</p>
<p>The presence of an XML declaration in an XHTML 1.0 file served as HTML will cause your file to be rendered in quirks mode on Internet Explorer <ins> 6</ins> (and therefore for a potentially large proportion of your audience).</p>

Categories: Update

New tutorial: Character sets & encodings in XHTML, HTML and CSS

Link to tutorial

If a user agent (eg. a browser) is unable to detect the character encoding used in a Web document, the user may be presented with unreadable text. This information is particularly important for those maintaining and extending a multilingual site, but declaring the character encoding of the document is important for anyone producing XHTML/HTML or CSS. This tutorial will give you an understanding of the topic that will help you make the right choices when doing so. The topic is not as straightforward as it may sometimes appear, and the advice contained here is the end result of a great deal of thought and discussion.

After reading this tutorial you should:

  • get advice on choosing an encoding for XHTML/HTML documents
  • understand when and how to declare the character encoding (charset) for documents using XHTML/HTML and CSS
  • have a grasp of aspects of serving and coding XHTML/HTML files that affect the above
  • know when and how to use escapes and entities to represent characters
Categories: New resource, Tutorials

Copyright © 2023 World Wide Web Consortium.
W3C® liability, trademark and permissive license rules apply.

Questions or comments? ishida@w3.org