[iip] Text units for letter-spacing are incorrect (#101) from r12a via GitHub on 2020-02-05 (public-i18n-archive@w3.org from January to March 2020)

From: r12a via GitHub <sysbot+gh@w3.org>
Date: Wed, 05 Feb 2020 06:17:56 +0000
To: public-i18n-archive@w3.org
Message-ID: <issues.opened-560164685-1580883475-sysbot+gh@w3.org>

r12a has just created a new issue for https://github.com/w3c/iip:

== Text units for letter-spacing are incorrect ==
For various reasons wherever a word needs to be broken in constituent characters in case of Latin script, Indian language words can and should be broken based on Akshara as given here for a case of Devanagari script here 

As the W3C specification points to Unicode Text Segmentation (TR 29), it is observed that some of the browsers support it (e.g. Chrome and Firefox) whereas Microsoft Edge and Interner Explorer seems to break the words in individual characters. 

It has been marked as basic as the Unicode Text Segmentation rules themselves need to be matured enough to cater to nuances of many languages that get written using Gurmukhi script. Some of the languages like Santali, require some special Nukta rules. 

Also, in cases where there is wrong Akshara formation e.g. Consonant+Matra+Matra, the breaking seems to stack ill formed akshara into one set instead of clearly breaking it separate. This breaking behaviour needs to improve. 

Please view or discuss this issue at https://github.com/w3c/iip/issues/101 using your GitHub account

Received on Wednesday, 5 February 2020 06:17:58 UTC