This update brings the article in line with recent developments in HTML5, and de-emphasizes information about legacy formats.
An attempt was also made to organize the material so that readers can find information more quickly, and also de-clutter the essential information by moving edge topics, such as UTF-16 and charset links, down the page. This led to the article being almost completely rewritten.
A new boilerplate and styling has also been applied to the article.
German, Spanish, Russian, Swedish and Ukrainian translators are asked to update their translation of this article within the next month, otherwise the translations will be removed per the translation policy, since the changes are substantive.
Inline markup and bidirectional text in HTML is a major update of the article formerly titled What You Need to Know About the Bidi Algorithm and Inline Markup, and reflects the recent changes in bidi markup in the HTML5 specification.
Technically speaking, the main change is that the dir attribute now isolates text by default with respect to the bidi algorithm. Isolation as a default is the recommendation of the Unicode Standard as of version 6.3.
For the less technical-minded, the main advantage of this change is a much simpler transition for both content authors and browser developers who want reap the benefits of isolation. At the same time, these approaches have good results for existing legacy content.
An updated version of What you need to know about the bidi algorithm and inline markup is out for wide review. We are looking for comments over the next two weeks. After the review period is over, this content will be copied to the same location as the current version of What you need to know about the bidi algorithm and inline markup and the URL of the updated version will cease to exist.
The update rewrites the article to reflect the recent changes in bidi markup in the HTML5 specification.
Technically speaking, the main change is that the
dir attribute now isolates text by default with respect to the bidi algorithm. Isolation as a default is the recommendation of the Unicode Standard as of version 6.3.
From a less technical point of view, the main advantages to the update are that the new methods introduced here reduce the need to use a new approach when the direction of content is known, and therefore makes for a much simpler transition for both content authors and browser developers to support the advances in the handling of bidirectional text content. At the same time, these approaches have good results for existing legacy content.
Please send comments to email@example.com.
Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts
This tutorial has been modified to bring it in line with the current tutorial format. Rather than contain duplicate content, it now introduces the novice to key concepts and points off to useful further reading in an organized fashion. It has been completely rewritten.
Text direction and structural markup in HTML
This article has been created from material formerly in the tutorial “Creating HTML Pages in Arabic, Hebrew and Other Right-to-left Scripts” and augmented with information about new HTML5 markup constructs that are beginning to see adoption. It should be regarded as a new article, focusing on applying bidi markup to document- and block-level content, including forms.
What you need to know about the bidi algorithm and inline markup
This is an update of an existing article, but it has been almost completely rewritten. The most significant changes are the new parts describing how to apply the new HTML5 constructs which are beginning to see adoption. Additional changes will be needed as HTML5 bidi markup is finalised over the coming months. The article also proposes a simpler way to approach markup of bidi text, particularly useful for those with less experience, that relies less on a deep understanding of the issues involved.
Visual vs. logical ordering of text
This is a new article created from material that has been removed from the previously mentioned articles. It was removed into a separate article because visual ordering is much less important these days, and to avoid duplication. Only a few changes have been made to the content itself.
ITS 2.0 provides metadata to foster the adoption of the multilingual Web.
The article The byte-order mark (BOM) in HTML was updated significantly to reflect the fact that the byte-order mark in UTF-8 is less problematic now than it used to be, and that it has a higher precedence than the HTTP header for character encoding detection.
The article was largely rewritten, and now incorporates the relevant information that used to be in the article “Display problems caused by the UTF-8 BOM”. That article has now been decommissioned.
German, Spanish, Russian and Ukrainian translations need to be updated. Translators, please contact Richard Ishida (firstname.lastname@example.org) for the source text.
Minor editorial changes have been made to Unicode in XML and other Markup Languages to fix one typo (“accent” to “acute” in Table 3.1) and update references to the Unicode Standard in the Introduction and References section.
Substantive updates are currently on hold, pending final decisions relating to new developments to be introduced with HTML5.
This document is simultaneously published by the Unicode Consortium as Unicode Technical Report #20.
The article Background images that support localization was updated as follows:
- A note was added at the beginning of the background section, mentioning that CSS now enables you to create the examples in the article, where appropriate, and that the article now contains pointers to live code samples using CSS.
- The first sentence of each section describing a technique was changed to better position and introduce the section.
- A sentence was added to the end of each of the above sections, pointing to an example of how CSS could be used to reproduce that example, for browsers that support it.
- ” Internet Explorer and Opera will split the word and the hyphen will appear at the end of the line” was changed to “recent versions of major browsers will split the word and the hyphen will appear at the end of the line”
- The section “By the way” was removed.
Spanish, Russian and Ukrainian translations need to be updated. Please contact Richard Ishida (email@example.com) for the source text. In the meantime, the note and the link text have been added to those translations in English, but not the other additions.
Just Published! New Version of Working Group Note, Requirements for Japanese Text Layout (日本語組版処理の要件)
Requirements for Japanese Text Layout describes requirements for Japanese layout realized with technologies like CSS, SVG and XSL-FO. For non-Japanese speakers it provides access to a wealth of detailed and authoritative information about Japanese typesetting. The document is mainly based on a standard for Japanese layout, JIS X 4051 and its authors include key contributors to that standard. However, it also addresses areas which are not covered by JIS X 4051.
This second version of the document contains a significant amount of additional information related to hanmen design, such as handling headings, placement of illustrations and tables, handling of notes and reference marks, etc.
A Japanese version is also available.
The Internationalization Activity home page has recently been ported to WordPress. This means that the URIs for the various RSS feeds have changed. You can find the new links at the page W3C I18n news filters and RSS feeds.
The current URIs will continue to work for a short while, to support the transition, but you should change as soon as possible.
URIs for category filters have also changed, as have those for search key text within posts (useful for finding the history of a particular article or document). The latter have been converted to tags.