Internationalization tools
This page list various tools related to internationalization created by members of the Internationalization Activity These are not maintained or owned by the W3C.
Links
Charlint
A Perl script for character normalization.
IANA Language Subtag Registry search tool
Written in XHTML and JavaScript. Look for subtags by searching on the descriptions in the registry. Look up subtags and sequences of subtags. List the various types of tag currently available. Runs on the most up-to-date version of the registry.
-
W3C Internationalization Checker (prototype)
This checker performs various tests on a Web Page to determine its level of internationalisation-friendliness. It also lists key internationalization settings related to character encoding, language declarations, text direction and class/id names. This information includes HTTP headers, which can be particularly useful for troubleshooting problems. The checker is still only a prototype, so there are guarranteed to be bugs and missing features. It will slowly improve over the coming months, but it has been made available for use now since it is likely to be helpful to many people already.
mod_fileiri
An Apache module to support IRIs on various file systems (Talk).
Unicode Character Pickers
Written in XHTML and JavaScript. The pickers display the characters that are used for a range of languages, and allow you to compose one or more words by clicking on characters. The list of languages supported includes Arabic (including Persian and Urdu), Armenian, Bengali, Devanagari, Ethiopic, Gujarati, Gurmukhi, Hebrew, Khmer, IPA, Lao, Latin (with diacritics), Lisu, Malayalam, Myanmar, Tamil, Thai, Tibetan, Tifinagh, Tłįchǫ (Dogrib), Urdu and Vietnamese. More languages are planned. In more recent pickers, it is possible to view characters arranged for optimised typing, by similarity in shape, or associated with a common transcription. Normalization features are also being added. The resulting text can be copied and pasted into other applications.
Unicode Code Converter
Written in XHTML and JavaScript. Convert between characters, Unicode code point numbers, UTF-8 and UTF-16 code units in hex, Numeric Character References (hex and decimal), precent-encoded text, and other character escape formats.
-
UniView (Unicode Database Viewer)
Written in XHTML and JavaScript, UniView is a Unicode character viewer/picker that displays characters from a selected range or search pattern, provides information about them, and allows you to assemble strings from them like in a character map. You can also create customizable lists of characters for pasting into documents. UniView supports the latest version of the Unicode Standard, including the supplementary planes as well as the BMP. Characters are displayed as graphics by default, but you can also display them using an installed font.
-
UniView lite
An alternative version of the UniView tool that works well on devices with small screens.