Notes for HTML i18n Guidelines Devt

These are very rough notes as a first step towards developing a set of techniques for use of HTML.

Each rule attempts to broadly list likely audiences for that rule. Current categories include (authoring) tools developers, web masters, authors, user agent developers. Note that 'author' is used in the sense described by the HTML 4.01 spec, ie as a person or program that writes or generates HTML documents.

Click on anything with a + sign to the left to expand its contents.

- Choosing character encodings and representing characters

- Choosing an encoding

- Transmitting encoded text

- Specifying the character encoding

- Detecting document encodings

- Referring to specific characters

- Dealing with undisplayable characters

- Implementing bidi text

- Setting directionality for an entire document in a bidirectional script

- Changing the directional properties of a part of the text

- Overriding the Unicode bidirectional algorithm

- Declaring the language of the text

- Declaring the language of a whole page

Use the lang and xml:lang attributes in the html tag

Give reasons for use.

Give an example.

How do I specify language codes?

- Labelling text in a different language

Use the lang and xml:lang attributes around the text.

Give reasons for use.

Give an example.

How do I specify language codes?

- Specifying the language of a linked page

Use the hreflang attribute on the a element.

Need to think about this - don't think it is supported by browsers.

Do we include detail here or under section on links?

How do I specify language codes?

- Specifying language codes

Follow the guidelines in RFC3066.

Note that the HTML spec still says rfc1766, but this has been obsoleted by rfc3066.

Explain the basic principles here.

Use the two letter ISO 639 codes for the language code and the two letter ISO 3166 codes for the country code wherever possible.

This aids interoperability, and increases the likelihood of recognition by browsers.

- Document structure

- Creating an internationalised page header

Use the lang and xml:lang attributes in the html tag

Point to or include detail

Use the META element in HTML documents to explicity declare the document's character encoding.

Point to or include detail


Richard Ishida, 31 dec 02