Home page
The W3C Internationalization (I18n) Activity works with W3C working groups and liaises with other organizations to make it possible to use Web technologies with different languages, scripts, and cultures. From this page you can find articles and other resources about Web internationalization, and information about the groups that make up the Activity.
Recent highlights
5 March 10
New First Public Working Draft: Additional Requirements for Bidi in HTML
24 February 10
W3C Workshop on Conversational Applications
9 February 10
Article for review: Character encodings in HTML and CSS
8 December 09
New article: Choosing a Language Tag
25 November 09
Internet Governance Forum Poster
23 October 09
Unicode Collation Algorithm Version 5.2 Released
All news
New First Public Working Draft: Additional Requirements for Bidi in HTML
This document arose out of the frustrations of people who have to work with bidirectional text on the Web in everyday practical situations. For example, it covers issues related to re-use of fragments of text in various new locations by web apps or scripts, or situations where users need to type in or send bidirectional form data. It proposes additions to the HTML5 specification for such situations, which are not covered by the current HTML specification. Many of the ideas in the document, however, are also relevant to markup formats in general, and there are some implications for CSS and XSL-FO (which we hope to address more directly in a subsequent document).
Please send comments on this document to public-i18n-bidi@w3.org. Join the list and view the archive). We hope to publish a new version of the document, incorporating feedback, in about a month (depending on feedback received). We will then ask the HTML WG to review that version.
Editor: Aharon Lanin, Google. [search key: tr-html-bidi]
W3C Workshop on Conversational Applications
A call has gone out for a Workshop on Conversational Applications — Use Cases and Requirements for New Models of Human Language to Support Mobile Conversational Systems, 18-19 June 2010, Hosted by Openstream, NJ, US
Scope of the Workshop Submissions must describe (1) requirements and use cases for improving W3C standards for conversational interaction and (2) how the use cases justify one or more of these topics:
- Formal notations for representing grammar in: Syntax, Morphology, Phonology, Prosodics
- Engine standards for improvement in processing: Syntax, Morphology, Phonology, Lexicography
- Lexicography standards for: parts-of-speech, grammatical features and polysemy
- Formal semantic representation of human language including: verbal tense, aspect, valency, plurality, pronouns, adverbs, etc.
- Efficient data structures for binary representation and passing of: parse trees, alternate lexical/morphologic analysis, alternate phonologic analysis
- Other suggested areas or improvements for standards based conversational systems development
Experts in the following technology areas would be welcome.
- computational linguistics
- speech prosody
- syntax
- internationalization
- mobile applications
- MMI/voice technology
For more information see http://www.w3.org/2010/02/convapps/cfp.html
New translations into Romanian
Thanks to Sorin Velescu, the following article has been translated into Romanian.
Sfaturi utile de internationalizare pentru Web (Internationalization Quick Tips for the Web)
[search keys: article-quicktips
New translation into French
Thanks to the French Translation Team, Trusted Translations Inc., the following article has been translated into French.
Création de pages SVG Tiny en arabe, hébreu et autres scripts lus de droite à gauche (Creating SVG Tiny Pages in Arabic, Hebrew and other Right-to-Left Scripts)
[search keys: tutorial-svg-tiny-bidi
New translations into Spanish
Thanks to the Spanish Translation Team, Spanish Translation US, the following articles have been translated into Spanish.
Selección de una etiqueta de idioma (Choosing a Language Tag)
Problemas de visualización provocados por BOM en UTF-8 (Display problems caused by the UTF-8 BOM)
Caracteres y glifos faltantes (Missing characters and glyphs)
[search keys: qa-choosing-language-tags qa-utf8-bom article-o-misscharglyph
Article for review: Character encodings in HTML and CSS
Comments are being sought on this article prior to final release. Please send any comments to www-international@w3.org (subscribe). We expect to publish a final version in one to two weeks.
This is an update, in a temporary location, of the tutorial Character sets & encodings in XHTML, HTML and CSS. (Please be careful about bookmarking the location, since it is only temporary.)
A lot of new material was added, eg. related to the UTF-8 BOM, normalization, etc., and the material was rearranged significantly. The rearrangement was to downplay slightly the XHTML 1.0 issues, given that that is now only relevant to IE6, but also to help readers more quickly find information they need for the format they are dealing with.
The explicit distinction between XHTML 1.0 and XHTML 1.1 with regard to MIME types was removed, since the XHTML2 WG is hopefully very close to issuing a PER that enables XHTML 1.1 to be served as text/html.
The update adds information about HTML5.
Where a section corresponds to an article that has been updated, those updates were also migrated to this document.
[search keys: tutorial-char-enc]
New translations into Romanian
Thanks to the Sorin Velescu, the following articles have been translated into Romanian.
Data si Ora (Dates and Time)
Formate pentru data (Date formats)
Capacitati de afisare (Display capabilities)
[search keys: article-o-time qa-date-format qa-display-capabilities]
New translations into Spanish
Thanks to the Spanish Translation Team, Spanish Translation US, the following articles have been translated into Spanish.
Consejos rápidos sobre internacionalización para la Web (Internationalization Quick Tips for the Web)
Introducción a las codificaciones y a los sets de caracteres (Introducing Character Sets and Encodings)
[search keys: gs-characters article-quicktips
New translations into Hungarian
Thanks to Dénes Kohn, Metaphraser Translation Company, the following articles have been translated into Hungarian.
Karakterkódolás ellenőrzése a validatorral (Checking the character encoding using the validator)
HTTP Fejlécek Ellenőrzése (Checking HTTP Headers)
Karakterkészletre vonatkozó beállítás a .htaccess fájlban (Setting charset information in .htaccess)
[search keys: qa-validator-charset-check qa-headers-charset qa-htaccess-charset]
New translations into Romanian
Thanks to the Sorin Velescu, the following articles have been translated into Romanian.
Localizare vs. Internationalizare (Localization vs. Internationalization)
Site-uri Web monolinguale vs. multilinguale (Monolingual vs. multilingual Web sites)
Site-uri internationale si multilinguale (International & multilingual web sites)
[search keys: qa-i18n qa-mono-multilingual qa-international-multilingual]
Questions or comments? ishida@w3.org
Powered by ![]()
Copyright © 1997-2010 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply. Your interactions with this site are in accordance with our public and Member privacy statements.