Internationalization (i18n)

Making the World Wide Web worldwide!


Groups/repos

i18n WG

i18n Interest Group

African LE

Americas LE

Arabic LE

Chinese LE

Ethiopic LE

European LE

Hebrew LE

India LE

Japanese LE

Korean LE

Mongolian LE

SE Asian LE

Tibetan LE

Participate!

Join a Group

Follow the work

Translate a specification or page

International­ization Sponsorship Program

News by category
News archives
July 2011 (13)
July 2009 (10)
June 2009 (10)
June 2008 (13)
Search news

I18n sponsors

APL, Japan The Paciello Group Monotype Alibaba

Tag(s): article-definitions-characters

Posts

Updated article: Character encodings: Essential concepts

This article introduces a number of basic concepts needed to understand other articles that deal with characters and character encodings.

The article has been updated with explanations of the terms ‘user-perceived character’, ‘grapheme-cluster’, ‘typographic character unit’, and ‘glyph’, and a warning about the vague use of the term ‘character’.

Read the article Character encodings: Essential concepts.

New translations into Romanian

These articles were translated into Romanian thanks to George Misel.

New translations into German

These articles were translated into German thanks to Gunnar Bittersmann.

More new translations into Spanish

Codificación de caracteres: conceptos básicos (Character encodings: Essential concepts)

Selección & aplicación de codificación de caracteres (Choosing & applying a character encoding)

These articles were translated into Spanish thanks to the Spanish Translation Team, Trusted Translations, Inc.

6 new articles about character encodings and HTML/CSS

Some articles are brand new and others were originally part of a tutorial, but have been updated and amplified to bring HTML5 to the fore and incorporate feedback from various readers. The articles are:

  1. Character encodings: Essential concepts
  2. Choosing & applying a character encoding
  3. Declaring character encodings in HTML
  4. The byte-order mark (BOM) in HTML
  5. Normalization in HTML and CSS
  6. Characters or markup?

Together these articles, with several other existing articles that were updated at the same time, provide practical advice to content authors on how to handle character encodings in HTML and CSS.

For review: 7 new and 3 updated articles about character encoding

Comments are being sought on the following new articles prior to final publication:

  1. Handling character encodings in HTML and CSS
  2. Essential definitions related to character encodings
  3. Choosing & applying a character encoding
  4. Character encoding declarations in HTML
  5. The byte-order mark (BOM) in HTML
  6. Normalization in HTML and CSS
  7. Characters or markup?

These articles have been derived from the former tutorial, which has already undergone a review. Since then, HTML5 has been brought to the fore in the articles and various small changes have been added, including some short summary information.

The three updated articles are the result of merging the tutorial material with existing articles. They are:

The character encoding section of the techniques page relating to HTML and CSS authoring has also been overhauled, to include the new material.

Please send any comments to www-international@w3.org (subscribe). We hope to publish a final version in one to two weeks.


Copyright © 2023 World Wide Web Consortium.
W3C® liability, trademark and permissive license rules apply.

Questions or comments? ishida@w3.org