New tutorial: Character sets & encodings in XHTML, HTML and CSS
If a user agent (eg. a browser) is unable to detect the character encoding used in a Web document, the user may be presented with unreadable text. This information is particularly important for those maintaining and extending a multilingual site, but declaring the character encoding of the document is important for anyone producing XHTML/HTML or CSS. This tutorial will give you an understanding of the topic that will help you make the right choices when doing so. The topic is not as straightforward as it may sometimes appear, and the advice contained here is the end result of a great deal of thought and discussion.
After reading this tutorial you should:
- get advice on choosing an encoding for XHTML/HTML documents
- understand when and how to declare the character encoding (charset) for documents using XHTML/HTML and CSS
- have a grasp of aspects of serving and coding XHTML/HTML files that affect the above
- know when and how to use escapes and entities to represent characters
Previous post: New article: When to use language negotiation
Next post: New tutorial: Using language information in XHTML, HTML and CSS
W3C® liability, trademark and permissive license rules apply.
Questions or comments? ishida@w3.org