The Unicode Consortium has announced Version 6.3 of the Unicode Standard and with it, significantly improved bidirectional behavior. The updated Version 6.3 Unicode Bidirectional Algorithm now ensures that pairs of parentheses and brackets have consistent layout and provides a mechanism for isolating runs of text.
Based on contributions from major browser developers, the updated Bidirectional Algorithm and five new bidi format characters will improve the display of text for hundreds of millions of users of Arabic, Hebrew, Persian, Urdu, and many others. The display and positioning of parentheses will better match the normal behavior that users expect. By using the new methods for isolating runs of text, software will be able to construct messages from different sources without jumbling the order of characters. The new bidi format characters correspond to features in markup (such as in CSS). Overall, these improvements also bring greater interoperability and an improved ability for inserting text and assembling user interface elements.
The improvements come with new rigor: the Consortium now offers two reference implementations and greatly improved testing and test data.
In a major enhancement for CJK usage, this new version adds standardized variation sequences for all 1,002 CJK compatibility ideographs. These sequences address a well-known issue of the CJK compatibility ideographs — that they could change their appearance when any process normalized the text. Using the new standardized variation sequences allows authors to write text which will preserve the specific required shapes of these CJK ideographs, even under Unicode normalization.
Version 6.3 includes other improvements as well:
- Improved Unihan data to better align with ISO/IEC 10646
- Better support for Hebrew word break behavior and for ideographic space in line breaking
The Cascading Style Sheets (CSS) Working Group has published a Working Draft of CSS Ruby Module Level 1. “Ruby” are short runs of text alongside the base text, typically used in East Asian documents to indicate pronunciation or to provide a short annotation. This module describes the rendering model and formatting controls related to displaying ruby annotations in CSS.
The pre-final Internationalization Tag Set (ITS) 2.0 specification is already getting uptake in the machine translation community. At the Machine Translation Summit XIV (2-6 September, Nice), ITS 2.0 will be introduced in a presentation, a panel and as part of the META-NET booth. MT Summit is an annual event of the machine translation community, including keynote speeches by renowned experts in the field of Machine Translation, panel discussions and presentations of submitted and invited papers organized in two program tracks – research and commercial/user. Register now to learn about the presence and future of automatic multilingual content processing on the Web.
The META-FORUM 2013 will provide an overview of the pre-final ITS 2.0 specification and other recent developments around automated processing of multilingual Web content. registration is free, but the number of participants is limited – get your seat soon! A call for active participation in META Exhibition and LT Industry Session (deadline 16. August) provides an opportunity to demonstrate other technologies around multilingual Web content.
A report is now available summarizing the Workshop on Richer Internationalization for eBooks, which took place 4 June in Tokyo.
The Workshop was Hosted by Keio University, and sponsored by Intel as well as W3C organization sponsor Google.
(Learn more about W3C’s new Digital Publishing Activity, how to get involved in the Digital Publishing Interest Group, and the agenda of the workshop on Publishing and the Open Web Platform, which takes place in September in Paris; position paper deadline 15 July.)
The two-day workshop surveyed and shared information about currently available best practices and standards that can help content creators and localizers address the needs of the multilingual Web, including the Semantic Web. Attendees also heard about gaps that need to be addressed, and enjoyed opportunities to network and share information between the various different communities involved in enabling the multilingual Web. Half of the second day was dedicated to an Open Space discussion with breakouts.
The workshop was sponsored by the EU-funded QTLaunchPad project and Verisign. “During the W3C Rome Workshop we were able to identify the gaps holding organizations back from achieving a truly multilingual Internet and discuss best practices that will further our collective goal of achieving a web void of cultural borders”, said Pat Kane, senior vice president of Naming and Directory Services at Verisign.
The recently announced Internationalization Tag Set 2.0 showcase event in Dublin now allows for remote participation. Please register by 17 June 6 p.m. UTC. We will provide dial in details to registered
participants. The number of remote participants is limited and we choose on a
first-come, first-served basis – get your seat soon!
On 18 June the MultilingualWeb-LT Working Group holds a showcase event in Dublin about the upcoming Internationalization Tag Set (ITS) 2.0 specification. Group participants demonstrate implementations for authoring ITS 2.0 data categories, for using them in localization workflows, and for improving machine translation or other language technology processes with ITS 2.0. Participation is free, but registration is required.
A report summarizing the MultilingualWeb workshop in Rome is now available from the MultilingualWeb site. It contains a summary of each session with links to presentation slides and more detailed scribing done on site in Rome. Links to video for each session will be posted soon.
With approximately 150 attendees, the Rome Workshop focused on the theme “Making the Multilingual Web Work” and emphasized information about the best practices and standards that help content creators and localizers ensure that the World-Wide Web lives up to its name, across boundaries of language and culture. Attendees heard from a variety of perspectives, with fruitful dialogue between various stakeholder groups involved in trying to expand the multilingual scope of the Web.
Taking place over two days (12 and 13 March, 2013) at the headquarters of the UN’s Food and Agriculture Organization (FAO), the Workshop featured twenty-four conference-style presentations, seven poster presentations, and an “open space” discussion that featured six breakout sessions focusing on key topics that emerged during the Workshop. In addition, it showcased technology implementations of the forthcoming internationalization Tag Set (ITS) 2.0 standard.
FEISGILLT 2013 will showcase the upcoming Internationalization Tag Set 2.0 standard, together with closely related, core localization standards like XLIFF. FEISGILTT 2013 is the preconference event of Localization World, London 2013. W3C members will get 20% discount for FEISGILTT. FEISGILTT participants are entitled to a 10% discount when registering for the main conference. However, registering for the main conference is NOT required to register for FEISGILTT.