Resources by type
This page list various types of resource available on the W3C site relating to Internationalization. Some resources are listed on this page, others are on linked pages.
Links
-
Technical reports & notes: Documents that follow the W3C Process guidelines. The target for such documents is normally W3C Recommendation or WG Note.
-
Articles: Miscellaneous articles and notes related to the use of W3C and other more general Web technologies in an internationalization context. This list includes a number of articles based around frequently asked questions.
-
Tutorials: Tutorial materials are organized around a set of slides. The tutorial can be followed without reference to the slides, or the slides can be used to present information to others and the textual material used to provide a talk track.
-
Tests: Tests are provided that help to assess support for internationalization features in user agents. Note that these tests do not only test conformance with W3C standards. In some cases the tests also allow for exploration of the behavior of user agents in ways not described by the standards.
-
Talks: Presentations given by members of the Internationalization Working Group.
-
Reviews: Review comments of Core Working Group on specifications of other groups.
Tools
- IANA Language Subtag Registry search tool, written in XHTML and JavaScript. Look for subtags by searching on the descriptions in the registry. Look up subtags and sequences of subtags. List the various types of tag currently available. Runs on the most up-to-date version of the registry.
- Unicode Code Converter, written in XHTML and JavaScript. Convert between characters, Unicode code point numbers, UTF-8 and UTF-16 code units in hex, and Numeric Character References (hex and decimal).
- Unicode Character Pickers, written in XHTML and JavaScript. The pickers display the characters that are used for a range of languages, and allow you to compose one or more words by clicking on characters. The languages available include Arabic (including Persian and Urdu), Armenian, Bengali, Devanagari, Ethiopic, Gurmukhi, Hebrew, IPA, Latin (with diacritics), Malayalam, Myanmar, Tamil, Thai and Tifinagh. More languages are planned. Characters can be displayed as graphics or using an installed font, and their size can be altered. Other characters can be added by typing. The resulting text can be copied and pasted into other applications.
- UniView Unicode Database Viewer, written in XHTML and JavaScript. UniView is a Unicode character viewer that can display characters from a selected range using an installed font or as graphics. You can also paste in and discover unknown characters, store your own info about characters, search on character names, do hex/dec/ncr conversions, highlight character types, etc. etc. Characters can be selected so that their details are displayed, or copied into a text box from where they can be copied and pasted into other applications. It supports the supplementary planes as well as the BMP.
- Charlint, a Perl script for character normalization.
- mod_fileiri, an Apache module to support IRIs on various file systems (Talk).
Historical documents
These links point to pages that are out of date, or that contain information of historical interest only.
- Ruby
- URIs & other identifiers including non-ASCII characters
- Markup: HTML, base character set, Ruby
- HTTP
- URIs & other identifiers including non-ASCII characters
- ISO 3166 country codes
- Fonts
- Multilingualism in HTML
- Hyphenation
- History of language negotiation
- M.T. (Tomas) Carrasco Benitez's WInter (Web Internationalization) Page
- Help with creating a multilingual site
- More links...