The slides from the MultilingualWeb workshop (including several posters) and the LIDER roadmapping workshop are now available for download. Additional material (videos of the presentations, a workshop report and more) will follow in the next weeks – stay tuned.
The MultilingualWeb workshop on 7-8 May will be streamed live! Follow the event online if you cannot make it to Madrid. For details about speakers and presentations see the workshop program. The workshop is supported by the LIDER project and sponsored by Verisign and Lionbridge.
See the program. The keynote speaker will be Alolita Sharma, Director of Language Engineering from the Wikimedia Foundation. She is followed by a strong line up in sessions entitled Developers, Creators, Localizers, Machines, and Users, including speakers from Microsoft, Wikimedia Foundation, the UN FAO, W3C, Yandex, SDL, Lionbridge, Asia Pacific TLD, Verisign, DFKI, and many more. On the afternoon of the second day we will hold Open Space breakout discussions. Abstracts and details about an additional poster session will be provided shortly.
The program will also feature an LD4LT event on May 8-9, focusing on text analytics and the usefulness of Wikipedia and Dbpedia for multiilngual text and content analytics, and on language resources and aspects of converting selected types of language resources into RDF.
Participation in both events is free. See the Call for Participation for details about how to register for the MultilingualWeb workshop. The LD4LT event requires a separate registration and you have the opportunity to submit position statements about language resources and RDF.
If you haven’t registered yet, note that space is limited, so please be sure to register soon to ensure that you get a place.
The MultilingualWeb workshops, funded by the European Commission and coordinated by the W3C, look at best practices and standards related to all aspects of creating, localizing and deploying the multilingual Web. The workshops are successful because they attract a wide range of participants, from fields such as localization, language technology, browser development, content authoring and tool development, etc., to create a holistic view of the interoperability needs of the multilingual Web.
We look forward to seeing you in Madrid!
Registration for Workshop on Linked Data, Language Technologies and Multilingual Content Analytics now open!
Register now for the recently announced workshop on Linked Data, Language Technologies and Multilingual Content Analytics (8-9 May, Madrid). A preliminary agenda has been created and the registration form is available.
If you are interested in contributing a position statement please indicate this in the dedicated field in the registration form. The workshop organizers will come back to you with questions to answer in the position statement. We then will select which statements are appropriate for presentations on 9 May, and inform you by 28 April.
We are looking forward to see you in Madrid, both for this event and the MultilingualWeb workshop!
This update brings the article in line with recent developments in CSS, and reorganizes the material so that readers can find information more quickly. This led to the article being almost completely rewritten.
The article addresses the question: How do I declare the character encoding of a CSS style sheet?
German, Greek, Spanish, Hebrew, Hungarian, Brazilian Portuguese, Russian, Swedish, Ukrainian and Vietnamese translators are asked to update their translation of this article within the next month, otherwise the translations will be removed per the translation policy, since the changes are substantive.
This update brings the article in line with recent developments in HTML5, and reorganizes the material so that readers can find information more quickly. This led to the article being almost completely rewritten.
The article addresses the question: Which character encoding should I use for my content, and how do I apply it to my content?
German, Spanish, Brazilian Portuguese, Russian, Swedish and Ukrainian translators are asked to update their translation of this article within the next month, otherwise the translations will be removed per the translation policy, since the changes are substantive.
In line with decisions taken by the Working Group several months ago, the editor’s copies of two Internationalization WG Notes have been significantly changed.
The information has been overhauled to bring it up to date with recent changes to various articles, in line with changes in HTML and CSS.
The new versions have also been paired down to a set of do’s and don’ts, with pointers to more detailed explanations in other articles on the i18n site, plus some introductory material that is not found elsewhere. This significantly reduces duplication of information, and thereby improves usability and maintainability of the content.
The documents in question are:
If you have any comments on these documents, please send them to firstname.lastname@example.org before 12 April. We hope to replace the current versions of these Notes with the new versions shortly after.
Extend your stay for the upcoming MultilingualWeb workshop! Join the LIDER Workshop 8-9 May, to discuss Wikipedia, Multilingual Analytics and Linked Data for Language Resources
Aligned with the MultilingualWeb workshop (7-8 May, Madrid), the LIDER project is organizing a roadmapping workshop 8-9 May. The 8 May afternoon session will provide a keynote by Seth Grimes and also focus on the topic of Wikipedia for multilingual Web content. Via several panels including contributions from key Wikipedia engineers, we will discuss cross lingual analytics and intelligent multilingual content handling in Wikipedia. On 9 May, a 1/2 day session will focus on aspects of migrating language resources into linked data.
Mark your calendar now! A dedicated registration form including ways to contribute to the workshop agenda will be made available soon.
Unicode CLDR 25 has been released, providing an update to the key building blocks for software supporting the world’s languages. This data is used by a wide spectrum of companies for their software internationalization and localization, adapting software to the conventions of different languages for such common software tasks.
Unicode CLDR 25 focused primarily on improvements to the LDML structure and tools, and on consistency of data. There are many smaller data fixes, but there was no general data submission. Changes include the following:
- New rules for plural ranges (1-2 liters) for 72 locales, plurals for 2 locales, and ordinals for 18 locales.
- Better locale matching with fallbacks for languages, default languages for continents and subcontinents, and default scripts for more languages.
- Two new locales: West Frisian (fy) and Uyghur (ug).
- Two new metazones: Mexico_Pacific and Mexico_Northwest
- Updated zh pinyin & zhuyin collations and translators for Unicode 6.3 kMandarin data
- Updated keyboard layout data for OSX, Windows and others.
This version contains data for 238 languages and 259 territories—740 locales in all.
Details are provided in http://cldr.unicode.org/index/downloads/cldr-25, along with a detailed Migration section.
This update brings the article in line with recent developments in HTML5, and de-emphasizes information about legacy formats.
An attempt was also made to organize the material so that readers can find information more quickly, and also de-clutter the essential information by moving edge topics, such as UTF-16 and charset links, down the page. This led to the article being almost completely rewritten.
A new boilerplate and styling has also been applied to the article.
German, Spanish, Russian, Swedish and Ukrainian translators are asked to update their translation of this article within the next month, otherwise the translations will be removed per the translation policy, since the changes are substantive.