Internationalization (i18n) Activity

Making the World Wide Web worldwide!


Group pages

Activity Statement

i18n WG

i18n Interest Group

i18n Tag Set (ITS) IG

Chinese Layout Task Force

Indic Layout Task Force

Community groups

Site links

Getting Started
Tasks
Topics
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Resource types
Reviews
Mail archives
Aggregated data
Translation
News by category
News archives
July 2011 (13)
July 2009 (10)
June 2009 (10)
June 2008 (13)

Search for news

Admin

Category: w3cXMLCore

Posts

MultilingualWeb-LT Working Group closed, ITS community continues in ITS IG

The MultilingualWeb-LT Working Group has been closed, since it successfully completed the work in its charter.

We thank the co-chairs, the editors, implementers and the Working Group for achieving the goal to publish Internationalization Tag Set (ITS) 2.0 as a W3C Recommendation, and for doing so ahead of the original schedule.

Work on enlarging the community around ITS, gathering feedback and requirements for future work will now continue in the ITS Interest Group.

Tags: , ,

Updated article: Inline markup and bidirectional text in HTML

Inline markup and bidirectional text in HTML is a major update of the article formerly titled What You Need to Know About the Bidi Algorithm and Inline Markup, and reflects the recent changes in bidi markup in the HTML5 specification.

Technically speaking, the main change is that the dir attribute now isolates text by default with respect to the bidi algorithm. Isolation as a default is the recommendation of the Unicode Standard as of version 6.3.

For the less technical-minded, the main advantage of this change is a much simpler transition for both content authors and browser developers who want reap the benefits of isolation. At the same time, these approaches have good results for existing legacy content.

Call for position statements: Linked Data for Language Technology #LiderEU

The LD4LT (Linked Data for Language Technology) Workshop will be held on 21 March, in Athens, Greece, aligned with the European Data Forum 2014. See the agenda.

The workshop is a free community event – there is no admission fee for participants, but registration is required.

You are encouraged to provide a title for a position statement in your registration form. This is a simple, short statement that summarizes your ideas / technologies / use cases related to Linked Data and Language Technology.

The meeting is supported by the LIDER project, the MultilingualWeb community, the NLP2RDF project, the Working Group for Open Data in Linguistics as well as the DBpedia Project.

As input to the discussion and the work of the LD4LT group, you may also want to fill in the first LIDER survey.

Tags: , ,

First public working draft of Encoding published

The Internationalization Working Group has published a First Public Working Draft of Encoding.

While encodings have been defined to some extent, implementations have not always implemented them in the same way, have not always used the same labels, and often differ in dealing with undefined and former proprietary areas of encodings. This specification attempts to fill those gaps so that new implementations do not have to reverse engineer encoding implementations of the market leaders and existing implementations can converge.

This is a snapshot of the Encoding Living Standard, as of the date shown on the title page. No changes have been made in the body of the W3C draft other than to align with W3C house styles. The primary reason that W3C is publishing this document is so that HTML5 and other specifications may normatively refer to a stable W3C Recommendation.

Tags:

Register now for the 7th W3C MultilingualWeb Workshop, Madrid, 7-8 May

Register early to ensure you get a place. Anyone may attend all sessions at no charge and the W3C welcomes participation by both speakers and non-speaking attendees.

Since 2010 the W3C’s Multilingual Web Workshop series has become the preeminent venue for discussion of the standards and technologies that define and enable multilingualism on the Web. The 7th Workshop, “New Horizons for the Multilingual Web,” will be held 7–8 May 2014 in Madrid, Spain.

The workshop brings together participants interested in the best practices, new technologies, and standards needed to help content creators, localizers, language tools developers, and others address the new opportunities and challenges of the multilingual Web. It will provide for networking across communities and building connections.

We are particularly interested in speakers who are facing emerging challenges or who can demonstrate novel solutions for reaching out to a global, multilingual audience. The deadline for speaker proposals is March 14, but early submission is strongly encouraged.

This workshop is made possible by the generous support of the LIDER project, which will organize a roadmapping workshop on linked data and content analytics as one of the tracks at Multilingual Web Workshop.

See the Call for Participation and register online.

Tags:

ITS 2.0 for everybody – the ITS 2.0 promotion video is now online

Several participants of the ITS Interest Group have created a video to promote ITS 2.0. The video explains usage scenarios for ITS 2.0 and also explains benefits in easy terms for a non-technical audience. It will be the main video of the ITS 2.0 video channel. In the next weeks further videos will be added, to showcase ITS 2.0 implementations in various usage scenarios.

Tags:

W3C MultilingualWeb Workshop Announced: New Horizons for the Multilingual Web

To be held 7-8 May 2014 in Madrid, Spain, W3C announced today the seventh MultilingualWeb workshop in a series of events exploring the mechanisms and processes needed to ensure that the World Wide Web lives up to its potential around the world and across barriers of language and culture.

This workshop is made possible by the generous support of the LIDER project. As part of the event, LIDER will organize a roadmapping workshop on linked data and content analytics.

Anyone may attend all sessions at no charge and the W3C welcomes participation by both speakers and non-speaking attendees. Early registration is encouraged due to limited space.

Building on the success of six highly regarded previous workshops, this workshop will emphasize new technology developments that lead to new opportunities for the Multilingual Web. The workshop brings together participants interested in the best practices and standards needed to help content creators, localizers, language tools developers, and others meet the challenges of the multilingual Web. It provides further opportunities for networking across communities. We are particularly interested in speakers who can demonstrate novel solutions for reaching out to a global, multilingual audience.

See the Call for Participation and register online.

Tags:

ITS 2.0 is a W3C Recommendation!

The MultilingualWeb-LT Working Group has published a W3C Recommendation of Internationalization Tag Set (ITS) Version 2.0.

ITS 2.0 provides a foundation for integrating automated processing of human language into core Web technologies. ITS 2.0 bears many commonalities with its predecessor, ITS 1.0, but provides additional concepts that are designed to foster the automated creation and processing of multilingual Web content.

Work on application scenarios for ITS 2.0 and gathering of usage and implementation experience will now take place in the ITS Interest Group.

Tags:

The Unicode Standard, Version 6.3 published

The Unicode Consortium has announced Version 6.3 of the Unicode Standard and with it, significantly improved bidirectional behavior. The updated Version 6.3 Unicode Bidirectional Algorithm now ensures that pairs of parentheses and brackets have consistent layout and provides a mechanism for isolating runs of text.

Based on contributions from major browser developers, the updated Bidirectional Algorithm and five new bidi format characters will improve the display of text for hundreds of millions of users of Arabic, Hebrew, Persian, Urdu, and many others. The display and positioning of parentheses will better match the normal behavior that users expect. By using the new methods for isolating runs of text, software will be able to construct messages from different sources without jumbling the order of characters. The new bidi format characters correspond to features in markup (such as in CSS). Overall, these improvements also bring greater interoperability and an improved ability for inserting text and assembling user interface elements.

The improvements come with new rigor: the Consortium now offers two reference implementations and greatly improved testing and test data.

In a major enhancement for CJK usage, this new version adds standardized variation sequences for all 1,002 CJK compatibility ideographs. These sequences address a well-known issue of the CJK compatibility ideographs — that they could change their appearance when any process normalized the text. Using the new standardized variation sequences allows authors to write text which will preserve the specific required shapes of these CJK ideographs, even under Unicode normalization.

Version 6.3 includes other improvements as well:

  • Improved Unihan data to better align with ISO/IEC 10646
  • Better support for Hebrew word break behavior and for ideographic space in line breaking
Tags:

Internationalization Tag Set (ITS) Version 2.0 Proposed Recommendation Published

The MultilingualWeb-LT Working Group has published a Proposed Recommendation of Internationalization Tag Set (ITS) Version 2.0. The technology described in this document provides a foundation for to integrating automated processing of human language into core Web technologies. ITS 2.0 bears many commonalities with its predecessor, ITS 1.0 but provides additional concepts that are designed to foster the automated creation and processing of multilingual Web content. ITS 2.0 focuses on HTML, XML-based formats in general, and can leverage processing based on the XML Localization Interchange File Format (XLIFF), as well as the Natural Language Processing Interchange Format (NIF). Comments are welcome through 22 October.

Tags:

Copyright © 2014 W3C ® (MIT, ERCIM, Keio, Beihang) Usage policies apply.
Questions or comments? ishida@w3.org