ISSUE-425: Normalization and string identity issues
Normalization and string identity issues
- State:
- CLOSED
- Product:
- webvtt1
- Raised by:
- Addison Phillips
- Opened on:
- 2015-02-26
- Description:
- Bugzilla: https://www.w3.org/Bugs/Public/show_bug.cgi?id=28259
http://www.w3.org/TR/webvtt1/#webvtt-file-structure
Various constructs such as 'cue identifier' are described as being:
--
...any sequence of one or more characters not containing the substring "-->"...
--
The document makes understood that this is a sequence of Unicode characters. However, it leaves open the question of whether different Unicode character sequences that represent the same semantic string identifier (see: Charmod [1] and Charmod-Norm [2]) are considered "the same" or not. As currently written, different UTF-8 byte sequences are considered distinct.
We would suggest that identifiers that use distinct code point sequences are considered distinct (that is, that you are what we call a "non-normalizing Specification"), which suggests that you include at least a health warning about the dangers of using different character sequences.
[1] http://www.w3.org/TR/charmod/
[2] http://www.w3.org/TR/charmod-norm/
Particularly: http://www.w3.org/TR/charmod-norm/#formal-language and http://www.w3.org/TR/charmod-norm/#non-normalizing
- Related Actions Items:
- No related actions
- Related emails:
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-11-25)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-11-23)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-10-01)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-06-08)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-06-01)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-04-21)
- [Bug 28259] [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from bugzilla@jessica.w3.org on 2015-03-24)
- Re: [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from silviapfeiffer1@gmail.com on 2015-03-22)
- [webvtt] Normalization and string identity issues [I18N-ISSUE-425] (from addison@lab126.com on 2015-03-20)
- [minutes] Internationalization WG telecon 2015-03-19 (from ishida@w3.org on 2015-03-19)
- I18N-ISSUE-425: Normalization and string identity issues ⓟ [WebVTT] (from sysbot+tracker@w3.org on 2015-02-26)
Related notes:
Needs review of new text.
Richard Ishida, 23 Jul 2015, 11:20:12Addison, can we close this?
Richard Ishida, 16 Nov 2015, 13:31:17Closed. Satisfied.
Addison Phillips, 23 Nov 2015, 18:37:10Display change log