<?xml version="1.0" encoding="utf-8"?><!-- generator="b2evolution/1.10.2" -->
<rdf:RDF xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"					xmlns:admin="http://webns.net/mvcb/" xmlns:content="http://purl.org/rss/1.0/modules/content/">
		<channel rdf:about="http://www.w3.org/blog/International">
			<title>W3C Internationalization (I18n) Activity - Category: Miscellaneous</title>
			<link>http://www.w3.org/blog/International</link>
			<description>Home page of W3C's Web Internationalization Activity--resources for increasing accessibility of the Web for worldwide audiences.</description>
			<dc:language>en-EU</dc:language>
			<admin:generatorAgent rdf:resource="http://b2evolution.net/?v=1.10.2"/>
			<sy:updatePeriod>hourly</sy:updatePeriod>
			<sy:updateFrequency>1</sy:updateFrequency>
			<sy:updateBase>2000-01-01T12:00+00:00</sy:updateBase>
			<items>
				<rdf:Seq>
									<rdf:li rdf:resource="http://www.w3.org/blog/International/2009/11/25/internet_governance_forum_poster"/>
									<rdf:li rdf:resource="http://www.w3.org/blog/International/2009/10/23/unicode_collation_algorithm_version_5_2_"/>
									<rdf:li rdf:resource="http://www.w3.org/blog/International/2009/10/07/unicode_5_2_0_released"/>
									<rdf:li rdf:resource="http://www.w3.org/blog/International/2009/09/07/new_language_tag_specification_rfc_5646_"/>
									<rdf:li rdf:resource="http://www.w3.org/blog/International/2009/07/29/tcworld_article_about_japanese_requireme"/>
								</rdf:Seq>
			</items>
		</channel>
		
		<item rdf:about="http://www.w3.org/blog/International/2009/11/25/internet_governance_forum_poster">
			<title>Internet Governance Forum Poster</title>
			<link>http://www.w3.org/blog/International/2009/11/25/internet_governance_forum_poster</link>
			<dc:date>2009-11-25T13:26:38Z</dc:date>
			<dc:creator>Richard Ishida</dc:creator>
			<dc:subject>Miscellaneous</dc:subject>
			<description>

The fourth annual IGF Meeting was held in Sharm El Sheikh, Egypt on 15-18 November 2009. The W3C Internationalization Activity had a poster [PDF] at the event.
</description>
			<content:encoded><![CDATA[<p style="float:left;margin: 0.5em 1em 0.5em 0;"><a href="http://www.w3.org/International/datespace/2009/IGF-poster.pdf"><img src="http://www.w3.org/International/datespace/2009/IGF-poster-icon.png" alt="Poster used at the IGF meeting." title="Poster used at the IGF meeting." /></a></p>

<p>The fourth annual <a href="http://www.intgovforum.org/cms/index.php/the-meeting">IGF Meeting</a> was held in Sharm El Sheikh, Egypt on 15-18 November 2009. The W3C Internationalization Activity had a <a href="http://www.w3.org/International/datespace/2009/IGF-poster.pdf">poster [PDF]</a> at the event.</p>
<br style="clear:both;" />]]></content:encoded>
		</item>

		
		<item rdf:about="http://www.w3.org/blog/International/2009/10/23/unicode_collation_algorithm_version_5_2_">
			<title>Unicode Collation Algorithm Version 5.2 Released</title>
			<link>http://www.w3.org/blog/International/2009/10/23/unicode_collation_algorithm_version_5_2_</link>
			<dc:date>2009-10-23T17:52:22Z</dc:date>
			<dc:creator>Richard Ishida</dc:creator>
			<dc:subject>Miscellaneous</dc:subject>
			<description>Version 5.2 of the Unicode Collation Algorithm has been released. This version resynchronizes the Unicode Collation Algorithm with all
of the updates for the Unicode Standard, Version 5.2.

The rest of this post is taken from the Unicode Consortium's release notification and details changes and issues for implementations.


The text of UTS #10 has been updated. Among other changes, the revised text for UTS #10 makes it clear that the BASE for   implicit generation of weights for Han characters does not include unassigned code points.
There are small changes in Gujarati, Telugu, Malayalam (including weighting for chillus), Tamil, and Sinhala. While these changes move in the direction of expected behavior, good   results will only come from tailoring for particular languages,  such as with CLDR.

There have been significant changes to the ordering of many  combining marks. Many combining marks that are not in customary  use in modern languages now have the same secondary weight, and  will only be distinguished on a fourth level, by code point    ordering. This can be seen by looking at the Unicode Collation   Charts (http://unicode.org/charts/collation/). In 5.2, many characters now have a white background, indicating that they   sort exactly the same as the previous character, unless a 4th  (codepoint) level is used.

Implementations of UCA should take note that the increased  number of characters may cause overflows if the implementing  code makes certain assumptions or optimizations. This can result either from the new character additions (which increase the number of distinct weights in the table) or because of changes in the way the weights, particularly for secondary weight values, are assigned in the table. The latter change may result in unexpected numbers of characters having the same weight.

</description>
			<content:encoded><![CDATA[<p>Version 5.2 of the <a href="http://www.unicode.org/reports/tr10/">Unicode Collation Algorithm</a> has been released. This version resynchronizes the Unicode Collation Algorithm with all
of the updates for the Unicode Standard, Version 5.2.</p>

<p>The rest of this post is taken from the Unicode Consortium's release notification and details changes and issues for implementations.</p>

<ul>
<li>The text of UTS #10 has been updated. Among other changes, the revised text for UTS #10 makes it clear that the BASE for   implicit generation of weights for Han characters does not include unassigned code points.</li>
<li>There are small changes in Gujarati, Telugu, Malayalam (including weighting for chillus), Tamil, and Sinhala. While these changes move in the direction of expected behavior, good   results will only come from tailoring for particular languages,  such as with CLDR.</li>

<li>There have been significant changes to the ordering of many  combining marks. Many combining marks that are not in customary  use in modern languages now have the same secondary weight, and  will only be distinguished on a fourth level, by code point    ordering. This can be seen by looking at the Unicode Collation   Charts (http://unicode.org/charts/collation/). In 5.2, many characters now have a white background, indicating that they   sort exactly the same as the previous character, unless a 4th  (codepoint) level is used.</li>

<li>Implementations of UCA should take note that the increased  number of characters may cause overflows if the implementing  code makes certain assumptions or optimizations. This can result either from the new character additions (which increase the number of distinct weights in the table) or because of changes in the way the weights, particularly for secondary weight values, are assigned in the table. The latter change may result in unexpected numbers of characters having the same weight.</li>
</ul>
]]></content:encoded>
		</item>

		
		<item rdf:about="http://www.w3.org/blog/International/2009/10/07/unicode_5_2_0_released">
			<title>Unicode 5.2.0 Released</title>
			<link>http://www.w3.org/blog/International/2009/10/07/unicode_5_2_0_released</link>
			<dc:date>2009-10-07T13:46:01Z</dc:date>
			<dc:creator>Richard Ishida</dc:creator>
			<dc:subject>Miscellaneous</dc:subject>
			<description>On 1st October, Unicode 5.2 was released! The data files, code charts, and Unicode Standard Annexes for this version are final and are posted on the Unicode site.

For Unicode 5.2, the core specification is no longer just a delta document applied to the book; instead, the entire core specification,with all textual changes integrated, will be available on the Unicode site. As of this announcement, the first five chapters are available; the other chapters will follow soon

For full details about what is new or changed in this release, see the version documentation for Unicode 5.2.


</description>
			<content:encoded><![CDATA[<p>On 1st October, Unicode 5.2 was released! The data files, code charts, and Unicode Standard Annexes for this version are final and are posted on the Unicode site.</p>

<p>For Unicode 5.2, the core specification is no longer just a delta document applied to the book; instead, the entire core specification,with all textual changes integrated, will be available on the Unicode site. As of this announcement, the first five chapters are available; the other chapters will follow soon</p>

<p>For full details about what is new or changed in this release, see the <a href="http://www.unicode.org/versions/Unicode5.2.0/">version documentation for Unicode 5.2</a>.</p>


]]></content:encoded>
		</item>

		
		<item rdf:about="http://www.w3.org/blog/International/2009/09/07/new_language_tag_specification_rfc_5646_">
			<title>New language tag specification, RFC 5646, published</title>
			<link>http://www.w3.org/blog/International/2009/09/07/new_language_tag_specification_rfc_5646_</link>
			<dc:date>2009-09-07T19:51:22Z</dc:date>
			<dc:creator>Richard Ishida</dc:creator>
			<dc:subject>Miscellaneous</dc:subject>
			<description>The IETF has published RFC 5646, an update of Tags for Identifying Languages.  This specification obsoletes former RFCs 4646, 3066 and 1766. 

RFC 5646 makes it possible to use over 7,000 three-letter ISO 639-3 language codes, in addition to the 2 letter codes that have been in use for some time.  It also introduces 220 'extended language' subtags, mainly for backwards compatibility.

It continues to be best to refer to this specification as BCP47. This is a non-changing name and web address that points to the latest relevant RFCs.

The Internationalization Working Group at the W3C is working on an article to help users choose language tags, given the various types of subtag that are now available, and the sheer number of subtags.

You can look up language and other subtags in the IANA Language Subtag Registry. 

(Richard Ishida has provided an unofficial tool for searching the registry that also provides advice for choosing subtags, and allows you to partially validate a hyphen-separated language tag.)
</description>
			<content:encoded><![CDATA[<p>The IETF has published RFC 5646, an update of Tags for Identifying Languages.  This specification obsoletes former RFCs 4646, 3066 and 1766.</p> 

<p>RFC 5646 makes it possible to use over 7,000 three-letter ISO 639-3 language codes, in addition to the 2 letter codes that have been in use for some time.  It also introduces 220 'extended language' subtags, mainly for backwards compatibility.</p>

<p>It continues to be best to refer to this specification as <a href="http://www.rfc-editor.org/rfc/bcp/bcp47.txt">BCP47</a>. This is a non-changing name and web address that points to the latest relevant RFCs.</p>

<p>The Internationalization Working Group at the W3C is working on an article to help users choose language tags, given the various types of subtag that are now available, and the sheer number of subtags.</p>

<p>You can look up language and other subtags in the <a href="http://www.iana.org/assignments/language-subtag-registry">IANA Language Subtag Registry</a>.</p> 

(Richard Ishida has provided an <a href="http://rishida.net/utils/subtags/">unofficial tool</a> for searching the registry that also provides advice for choosing subtags, and allows you to partially validate a hyphen-separated language tag.)
]]></content:encoded>
		</item>

		
		<item rdf:about="http://www.w3.org/blog/International/2009/07/29/tcworld_article_about_japanese_requireme">
			<title>tcworld article about Japanese Requirements Note</title>
			<link>http://www.w3.org/blog/International/2009/07/29/tcworld_article_about_japanese_requireme</link>
			<dc:date>2009-07-29T13:07:28Z</dc:date>
			<dc:creator>Richard Ishida</dc:creator>
			<dc:subject>Miscellaneous</dc:subject>
			<description>tcworld magazine has published an article by Tony Graham about the recently published W3C Note, Requirements for Japanese Text Layout.

Read the article</description>
			<content:encoded><![CDATA[<p>tcworld magazine has published an article by Tony Graham about the recently published W3C Note, <a href="http://www.w3.org/TR/jlreq">Requirements for Japanese Text Layout</a>.</p>

<p><a href="http://www.tcworld.info/index.php?id=64">Read the article</a></p>]]></content:encoded>
		</item>

		</rdf:RDF>
