{"id":1314,"date":"2006-01-20T13:00:29","date_gmt":"2006-01-20T13:00:29","guid":{"rendered":"http:\/\/www.w3.org\/blog\/Internationaltmp\/2006\/01\/20\/request_for_feedback_usefulness_of_first\/"},"modified":"2011-10-11T14:52:57","modified_gmt":"2011-10-11T14:52:57","slug":"request_for_feedback_usefulness_of_first","status":"publish","type":"post","link":"https:\/\/www.w3.org\/blog\/International\/2006\/01\/20\/request_for_feedback_usefulness_of_first\/","title":{"rendered":"Request for feedback: Usefulness of ::first-letter in non-Latin scripts"},"content":{"rendered":"<p><em>The W3C i18n Working Group would like to hear from you if you have some knowledge\/thoughts in this area. We would like to gather information about the usefulness, in general, of the ::first-letter pseudo-element in non-Latin scripts, and any particular issues or differences arising from the different characteristics of the scripts.Please send your comments to www-international @ w3.org (Archive and subscription: <a href=\"http:\/\/lists.w3.org\/Archives\/Public\/www-international\/\">http:\/\/lists.w3.org\/Archives\/Public\/www-international\/<\/a>)<\/em><\/p>\n<p>The latest working draft of CSS3 Selectors proposes the ::first-letter pseudo-element.<br \/>\nSee <a href=\"http:\/\/www.w3.org\/TR\/2005\/WD-css3-selectors-20051215\/#first-letter\">http:\/\/www.w3.org\/TR\/2005\/WD-css3-selectors-20051215\/#first-letter<\/a><br \/>\nThe ::first-letter pseudo-element represents the first letter of the first line of a block, if it is not preceded by any other content (such as images or inline tables) on its line.<br \/>\nIt allows that first letter to be styled individually, without markup. It may be used for <a href=\"http:\/\/www.w3.org\/TR\/2005\/WD-css3-selectors-20051215\/#first-letter\">&#8220;initial caps&#8221; and &#8220;drop caps&#8221;<\/a>, which are common typographical effects in text in Latin script.<br \/>\nWe commented to the CSS Working Group that they need to define &#8216;letter&#8217; more carefully, and proposed that they specify that &#8216;letter&#8217; equates to &#8216;default grapheme cluster&#8217;, as described in the Unicode Standard Annex #29.<br \/>\nSee <a href=\"http:\/\/www.unicode.org\/reports\/tr29\/#Grapheme_Cluster_Boundaries\">http:\/\/www.unicode.org\/reports\/tr29\/#Grapheme_Cluster_Boundaries<\/a><br \/>\n(A rough and ready explanation of this is that base characters and any following combining characters are styled together. So<br \/>\n0065: e LATIN SMALL LETTER E + 0301: &#769; COMBINING ACUTE ACCENT<br \/>\nwould be handled as a single letter.)<br \/>\nWe also suggested that implementors should then be encouraged to provide tailored algorithms on a per language basis to cope with anomolies, particularly such as may occur in non-Latin scripts.<br \/>\nHere are some initial questions:<br \/>\n<strong>[1] Are there scripts that would never use this approach?<\/strong><br \/>\n<strong>[2] We mention &#8216;initial caps&#8217; and &#8216;drop caps&#8217; above.  What other types of styling would be commonly applied in other scripts if this feature were available?<\/strong><br \/>\n<strong>[3] What script features would cause difficulties, eg syllabic groupings (see the example of indic script example below), ligatures, cursive text (eg. Arabic, Urdu, etc.), and how would the script normally deal with them?<\/strong><br \/>\nPlease send your comments to www-international @ w3.org<br \/>\nArchive: <a href=\"http:\/\/lists.w3.org\/Archives\/Public\/www-international\/\">http:\/\/lists.w3.org\/Archives\/Public\/www-international\/<\/a><br \/>\n&#8212;&#8212;&#8212;&#8212;<br \/>\nWhat follows are some examples of questions that spring to mind.<br \/>\nSYLLABIC INDIC SCRIPTS<br \/>\nIn the Hindi word &#2360;&#2381;&#2341;&#2367;&#2340;&#2367; (&#8216;sthiti&#8217;) the sequence of characters in the first syllable is as follows in memory:<br \/>\n0938:   &#2360;  DEVANAGARI LETTER SA<br \/>\n094D:   &#2381;  DEVANAGARI SIGN VIRAMA<br \/>\n0925:   &#2341;  DEVANAGARI LETTER THA<br \/>\n093F:   &#2367;  DEVANAGARI VOWEL SIGN I<br \/>\nThe displayed text, however, is <img decoding=\"async\" alt=\"The first syllable of the word 'sthiti' in Hindi, showing the positions of the characters after display.\" src=\"http:\/\/www.w3.org\/International\/reviews\/0601-css3-selectors\/sthiti.gif\" \/><br \/>\nNote how the vowel sign appears to the left of the first character, not the third.<br \/>\nThe default grapheme clusters here are, I believe, 0938+094D, then each of the following two characters.<br \/>\nWould Devanagari-based languages use special styling for initial syllables?  If so, would they actually apply the styling to the vowel sign alone, or to the whole syllable?<br \/>\nLIGATURES<br \/>\nIf a script styles the &#8216;first letter&#8217;, but that letter is part of a ligature (ie. a single glyph representing more than one underlying character), would it be ok to split the ligature, or should the other characters that compose the ligature also be styled?<br \/>\nCURSIVE SCRIPTS<br \/>\nSince Arabic and Mongolian letters in a word are normally joined, has first letter styling been used at all in these scripts?<br \/>\nCHINESE, JAPANESE, KOREAN<br \/>\nDo languages using these scripts do first letter styling?<br \/>\nRUSSIAN, GREEK, ARMENIAN, etc.<br \/>\nIs first letter styling common practise in these scripts too?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The W3C i18n Working Group would like to hear from you if you have some knowledge\/thoughts in this area. We would like to gather information about the usefulness, in general, of the ::first-letter pseudo-element in non-Latin scripts, and any particular &hellip; <a href=\"https:\/\/www.w3.org\/blog\/International\/2006\/01\/20\/request_for_feedback_usefulness_of_first\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":79,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[28,27,3],"tags":[124],"class_list":["post-1314","post","type-post","status-publish","format-standard","hentry","category-css","category-feedback-request","category-highlight","tag-fr-first-letter"],"_links":{"self":[{"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/posts\/1314","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/users\/79"}],"replies":[{"embeddable":true,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/comments?post=1314"}],"version-history":[{"count":1,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/posts\/1314\/revisions"}],"predecessor-version":[{"id":1879,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/posts\/1314\/revisions\/1879"}],"wp:attachment":[{"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/media?parent=1314"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/categories?post=1314"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.w3.org\/blog\/International\/wp-json\/wp\/v2\/tags?post=1314"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}