ISSUE-425: Normalization and string identity issues

Normalization and string identity issues

State:
CLOSED
Product:
webvtt1
Raised by:
Addison Phillips
Opened on:
2015-02-26
Description:
Bugzilla: https://www.w3.org/Bugs/Public/show_bug.cgi?id=28259


http://www.w3.org/TR/webvtt1/#webvtt-file-structure

Various constructs such as 'cue identifier' are described as being:

--
...any sequence of one or more characters not containing the substring "-->"...
--

The document makes understood that this is a sequence of Unicode characters. However, it leaves open the question of whether different Unicode character sequences that represent the same semantic string identifier (see: Charmod [1] and Charmod-Norm [2]) are considered "the same" or not. As currently written, different UTF-8 byte sequences are considered distinct.

We would suggest that identifiers that use distinct code point sequences are considered distinct (that is, that you are what we call a "non-normalizing Specification"), which suggests that you include at least a health warning about the dangers of using different character sequences.

[1] http://www.w3.org/TR/charmod/
[2] http://www.w3.org/TR/charmod-norm/
Particularly: http://www.w3.org/TR/charmod-norm/#formal-language and http://www.w3.org/TR/charmod-norm/#non-normalizing
Related Actions Items:
No related actions
Related emails:
  1. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-11-25)
  2. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-11-23)
  3. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-10-01)
  4. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-06-08)
  5. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-06-01)
  6. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-04-21)
  7. [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-03-24)
  8. Re: [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from silviapfeiffer1@gmail.com on 2015-03-22)
  9. [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from addison@lab126.com on 2015-03-20)
  10. [minutes] Internationalization WG telecon 2015-03-19 (from ishida@w3.org on 2015-03-19)
  11. I18N-ISSUE-425: Normalization and string identity issues ⓟ [WebVTT] (from sysbot+tracker@w3.org on 2015-02-26)

Related notes:

Needs review of new text.

Richard Ishida, 23 Jul 2015, 11:20:12

Addison, can we close this?

Richard Ishida, 16 Nov 2015, 13:31:17

Closed. Satisfied.

Addison Phillips, 23 Nov 2015, 18:37:10

Display change log ATOM feed


Addison Phillips <addison@amazon.com>, Chair, Richard Ishida <ishida@w3.org>, Fuqiao Xue <xfq@w3.org>, Atsushi Shimono <atsushi@w3.org>, Staff Contacts
Tracker: documentation, (configuration for this group), originally developed by Dean Jackson, is developed and maintained by the Systems Team <w3t-sys@w3.org>.
$Id: index.php,v 1.326 2018/10/13 17:29:51 vivien Exp $