Only comments that are publicly visible are listed below.
Such comments sometimes contain links to other items that are not publicly visible.
| C018 | Na | Na | N | Dan Connolly
| -
| 4.4 | Example unclear
Decision: Not applicable. Rationale: We have classified this as 'not applicable' because you have not asked for any actual changes of the document.
Your problem with understanding the cz example may have been due to rendering issues, but we doubt that, because the
(non-combining) cedilla that we used in the draft to represent the combining cedilla is part of iso-8859-1, which was rendered well since the very first browsers. We suspect that you may have overlooked the cedilla, a little low hook after the z.
We will try to take your suggestion for a test into consideration for our next phase. Our response (sent 2003-12-11) -- Notification
|
| C023 | E | A | S | Jeremy Carroll
| -
| 4.4 | 'normalization-sensitive' unclear
Decision: Accepted. Decision: Add examples (i) of operations which are not normalization-sensitive, and (ii) illustrating what we mean by inputs and outputs. Our response (sent 2003-05-01) -- Notification
|
| C024 | Na | Na | S | Jeremy Carroll
| -
| 3.2 | Is UTF-7 a unicode encoding form?
Decision: Not applicable. We have classified this comment as 'not applicable', because it is only a question. Our answer is that yes and no. UTF-7 can be considered an unicode encoding form, or not. It is an unicode encoding form to the extent that it encodes a sequence of unicode characters. However, it does not map a character to an identifiable sequence of bytes, and has a number of other rather undesirable properties. It was designed for use in very special cases such as Email, but has widely been replaced by UTF-8, and is no longer recommended for use, to the extent that we decided that the most adequate way to handle it in the Character Model was to completely ignore it.
|
| C025 | T | A | N | Olu Ibidunni
| -
| 3.1.5 | In 4th example, should 'o' be 'ö'?
|
| C026 | E | A | N | Ian Jacobs
| -
| 3.1.3 | mapping between character codes and units of displayed text
See also the following comments: C096
Decision: Accepted. Decision: Change 'character codes' to 'characters'. Our response (sent 2003-02-17) -- Notification |
| C027 | Na | Na | O | Joseph Reagle
| XML Sig WG
| Various | XML Sig WG comments
|
| C028 | Na | Na | N | Jeremy Carroll
| RDF Core WG
| Various | Endorsement from RDF Core
Decision: Not applicable. Rationale: We thank you for your endorsement. We have classified this comment as 'not applicable' because it does not suggest or imply any changes. We would like to note that the Character Model is written so as to make clear that specifications do not have to follow all the requirements, just those that apply in their specific case. Our response (sent 2003-02-13) -- Notification |
| C029 | Na | Na | N | Jeremy Carroll
| RDF Core WG
| 2 | breadth of scope
Our response (sent 2002-05-27) -- Re: breadth of scope Decision: Not applicable. Rationale: We have classified this comment as 'not applicable', because it is too general. Each CharMod requirement applies only where applicable. For example, if a specification doesn't deal with sorting, then requirements related to sorting do not apply. Also, specifications that don't deal with text (e.g. a bitmap format) would therefore not have any applicable requrements (except e.g. for textual comments and other metainformation embedded in the format). We would also like to point out that the term 'processing model' is taken very widely here. Even if a specification does not have an explicitly defined processing model, it implicitly defines how to process (e.g. match) characters. As an example, RDF conforms to the processing model, on the level of the abstract syntax by virtue of the fact that the abstract syntax is expressed in Unicode, and on the level of RDF/XML by virtue of being based on XML. Our response (sent 2003-02-13) -- Notification |
| C030 | E | N | N | Jeremy Carroll
| RDF Core WG
| 3.5 | non-universality of processing model
Our response (sent 2002-05-27) -- Re: non-universality of processing model Decision: Noted. Rationale: We have classified this comment as 'Noted', because it did not contain any suggestions for changes. However, in order to address the misunderstanding that we think
this comment exposes, we have added some text (just before
C014): "Also, while this document uses the term Reference <emph>Processing</emph>
Model and describes its properties in terms of processing, the model also
applies to specifications that do not explicitly define a processing model." We hope that this clarifies the situation for RDF: Even if there is
no processing model for RDF, on the level of text processing, RDF
conforms to the Charmod Reference Processing Model because of the
way the abstract syntax is defined in terms of Unicode characters
and because of the way XML is used. Our response (sent 2003-02-13) -- Notification |
| C031 | S | P | N | Jeremy Carroll
| RDF Core WG
| 8 | no dependency on IRI draft
See also the following comments: C059 C170
Decision: Partially accepted. Rationale: Our plan is that the IRI ID, referenced in this section, will have been submitted for Proposed Standard by the time CharMod moves to the next stage. IRI equality is fully addressed in the latest IRI ID version. Our response (sent 2003-02-13) -- Notification |
| C032 | Na | Na | O | Jeremy Carroll
| RDF Core WG
| Various | Overview of RDF Core feedback
-
Comment (received 2002-05-27) -- Overview of RDF Core feedback
The RDF Core WG has made feedback concerning the following sections of charmod: > 1. Introduction > 2. Conformance > 3.4 Strings > 3.5 Reference Processing Model > 4. Early Uniform Normalization > 6. String Identity Matching > 8. Characeter Encoding in URI References > 9. Referencing the Unicode Standard > A.2 Other References > C. Composing Characters > D. Resources for Normalization [...] RDF Core makes no comments on the other sections.
This comment lists the sections that have been commented on by the RDF Core WG. Please see the specific comments listed below. This comment has been split into the following comments: C028 C029 C030 C031 |
| C033 | E | P | N | Ian Jacobs
| -
| 3.1.6 | Use of word 'byte'
Decision: Improve the sentence. Decision: Partially accepted. Rationale for 'Partially accepted': We think we can improve on the suggested wording. Our response (sent 2003-02-17) -- Notification |
| C034 | S | A | S | Joseph Reagle
| XML Sig WG
| 3.6.3 | Private Use Code Points: Disagreement with our approach
|
| C035 | S | A | N | Joseph Reagle
| XML Sig WG
| Various | 'All W3C specs must conform.'
Decision: Accepted Rationale: We have originally rejected this comment. We have later, after extensive discussions, been instructed by W3C Management that it is inappropriate for a W3C spec to directly enforce requirements on other specifications, and have removed the relevant language. We have also been instructed to request a finding from the TAG corresponding to the text that we removed.
|
| C036 | E | A | N | Joseph Reagle
| XML Sig WG
| 3.1.3 | Define 'logical order'
|
| C037 | E | A | N | Joseph Reagle
| XML Sig WG
| 4.1.1 | Character 'í' hard to distinguish from 'i', particularly when
italicized
|
| C038 | E | A | N | Jim Melton
| XML Query WG
| 2 | Conformance of new vs. old specs
See also the following comments: C051 C088 C089 C135
Decision: Accepted You point out a clear inconsistency, which we have fixed a while ago. We have later been told that it is inappropriate for a W3C spec to directly enforce requirements on other specifications, and have removed the relevant language altogether. We have been instructed to request a finding from the TAG corresponding to the text that we removed. We will make sure that, if relevant, the inconsistency you pointed out will not reappear. |
| C039 | E | A | N | Jim Melton
| XML Query WG
| 3.1.5 | Determining relevant language for sorting
|
| C040 | E | A | N | Jim Melton
| XML Query WG
| 3.1.7 | How to avoid use of the term 'character'?
See also the following comments: C004 C138 C166
Decision: Accepted Decision: We'll add clarification. Our response (sent 2003-05-01) -- Notification |
| C041 | S | A | N | Jim Melton
| XML Query WG
| 3.2 | Proprietary charset identifiers
See also the following comments: C139
Decision: Rejected. See our response (below) for our rationale. Our response (sent 2002-06-05) -- Re: proprietary charset identifiers [...] Please tell us, at your earliest convenience, whether you are
satisfied with our decision or not. If not, please provide
additional rationale. Decision: Accepted. Note: We've made the requested change and will ask the XML Query WG whether they have further concerns about section 3.6.2: [S] If the unique encoding approach is not taken, specifications SHOULD mandate the use of the IANA charset registry names [...] If they do have such concerns, they need to raise a separate comment. After some discussion (see the last message on this from Jim Melton)
we have decided to accept the comment as it was made. We have changed "... is identified by an IANA charset identifier." to "... is identified by a unique identifier, such as an IANA charset identifier." However, our exchange suggests that the XML Query WG may also not be okay with some of the wording in Section 3.6.2, which (among
else) says:
"[S] If the unique encoding approach is not taken, specifications SHOULD mandate the use of the IANA charset registry names [...]"; if this is the case, please indicate so at as soon as possible, or we will have to assume that this is okay with you. |
| C042 | S | A | | Jim Melton
| XML Query WG
| 4.4 | Discussion of subsequent items
|
| C043 | S | A | | Jim Melton
| XML Query WG
| 4.4 | Objection to prohibition against receiver from normalizing text
|
| C044 | S | A | | Jim Melton
| XML Query WG
| 4.4 | Prohibition against normalizing suspect text
|
| C045 | S | A | | Jim Melton
| XML Query WG
| 4.4 | Prohibition against interim unnormalized states
|
| C046 | Na | Na | O | David Fallside
| XMLP WG
| Various | XMLP WG response to Charmod LC#2
|
| C047 | Na | Na | O | Tim Bray
| -
| Various | Comments on Character Model
|
| C048 | Na | Na | O | Yin Leng Husband
| WSArch WG
| Various | WSArch WG review of Charmod LC #2
|
| C049 | Na | Na | O | Norman Walsh
| TAG
| Various | TAG comments on Character Model for the World Wide Web 1.0
|
| C050 | S | A | S | Cliff Schmidt
| Microsoft
| Various | CharMod restricts closed systems
|
| C051 | E | A | S | Cliff Schmidt
| Microsoft
| 2 | Inconsistent/Redundant Requirements for W3C Spec Conformance
|
| C052 | S | P | S | Cliff Schmidt
| Microsoft
| 2 | W3C Spec Conformance
|
| C053 | E | P | S | Cliff Schmidt
| Microsoft
| 3.5 | Full Range of Unicode Code Points Not Allowed in XML
Decision: Partially accepted. Rationale for 'Partially accepted': It is up to each specification to provide the specific reason(s) for deviating from a 'SHOULD' requirement. We have however amended the text to read: 'Specifications SHOULD not arbitrarily exclude characters from the full range of Unicode code points from U+0000 to U+10FFFF inclusive;'. Our response (sent 2003-03-06) -- Notified and discussed during FTF mtg |
| C054 | E | R | S | Cliff Schmidt
| Microsoft
| 4.2.3 | Definition of 'Fully-Normalized'
Decision: Partially accepted. Rationale: Checking reveals that we could go either way;
change record to partially accepted, but no change. Our response (sent 2003-03-06) -- Notified and discussed during FTF mtg |
| C055 | S | R | S | Cliff Schmidt
| Microsoft
| 4.4 | Mandating NFC for All Web Content
Decision: Rejected. Note, however, the relaxation of the language in section 4.4. Rationale: 'Foreign systems' is undefined and undefinable. Our response (sent 2003-03-06) -- Notified and discussed during FTF mtg |
| C056 | S | P | S | Cliff Schmidt
| Microsoft
| 4.4 | Text-Processors MUST Perform Normalization Checking
Decision: Partially accepted. Rationale: Try to add a note explaining that in
the base case,
only a one-character lookahead is needed. In the long term, try to move material about
'composing' characters to UAX 15. Our response (sent 2003-03-06) -- Notified and discussed during FTF mtg |
| C057 | S | P | S | Cliff Schmidt
| Microsoft
| 4.4 | Content Producers and Proxies
|
| C058 | E | R | S | Cliff Schmidt
| Microsoft
| 4.4 | Web Repositories
|
| C059 | S | A | N | Cliff Schmidt
| Microsoft
| 8 | IRIs
See also the following comments: C031 C170
Decision: Accepted. Our plan is that the IRI ID, referenced in this section, will have been submitted for Proposed Standard by the time CharMod moves to the next stage. IRI equality is fully addressed in the latest IRI ID version.
|
| C060 | Na | Na | N | David Fallside
| XMLP WG
| 2 | XML Protocol LC#2 review question on implementation testing
Decision: Not applicable Rationale: We have classified this comment as 'not applicable', because it is a question, not a comment leading to a potential change of the Character Model. The test suite should test for CharMod-related requirements in the
specification(s) being tested. The tests should conform to [C] requirements
(except where they are wrong on purpose). If the test collection includes
code, then that should also conform to [I] requirements.
|
| C061 | E | Na | N | David Fallside
| XMLP WG
| 1.1 | 'All W3C specifications must conform to this document'
Decision: Rejected Rationale 'Rejected': This para states the general principle and refers to section 2 for details. The various requirements will come into force once the CharMod spec becomes a REC. New decision: Not applicable Rationale: We have classified this comment as 'not applicable' because we have been told that it is inappropriate for a W3C spec to directly enforce requirements on other specifications, and have removed the relevant language from section 2. We still define conformance to CharMod. We have been instructed to request a finding from the TAG corresponding to the text that we removed. So CharMod will be enforced by the fact of being a REC, coupled with an eventual TAG finding and ongoing reviews of relevant specs by the I18N WG.
|
| C062 | Na | Na | C | David Fallside
| XMLP WG
| 4.4 | 'XML protocol need not normalize application payloads or check to insure that they are normalized'
|
| C063 | Na | Na | C | David Fallside
| XMLP WG
| 4.4 | Is text to be
normalized when forwarded?
|
| C064 | E | R | C | David Fallside
| XMLP WG
| 4.4 | Give the reason(s) for prohibition
against normalizing suspect text
|
| C065 | Na | Na | C | David Fallside
| XMLP WG
| 4.4 | Does rejected un-normalized
text have to be
normalized before it is returned to sender?
|
| C066 | Na | Na | | David Fallside
| XMLP WG
| 4.4 | Specifications that define a mechanism for producing a document SHOULD require that the final output be normalized
Q: Is a document text-based format? A: Yes. [We must clarify that we mean the textual parts of documents] Q: Is this requirement covered by
the earlier one? A: No. There may not be a spec for the output document, as in the case of plain text, or the spec may not (yet) require N11N.
|
| C067 | S | P | S | Tim Bray
| -
| 3.1.5 | Collation
Decision: Partially accepted. Note: We'll change the 'MUST' to a 'SHOULD'. Our response (sent 2003-05-01) -- Notification
|
| C068 | S | P | S | Tim Bray
| -
| 3.6 | Unique Character Encoding
See also the following comments: C114 Decision: Partially accepted. Rationale: We have added:
"[S] When basing a protocol, format, or API on a protocol, format, or API that already has rules for character encoding, specifications SHOULD use rather than change these rules." and have added XML as an example. As said elsewhere, we prefer not to have requirements specific to a particular format. Also, the 'authored by humans' part is not necessarily true; in general, humans care about the actual text and about the tools they use, not about encodings. Our response (sent 2004-01-16) -- Notification
|
| C069 | E | P | S | Tim Bray
| -
| 3.6.2 | Admissibility of UTF-*
Decision: Partially accepted. Note: Covered by our edit resulting from C114 and your previous comment C068. Our response (sent 2004-01-16) -- Notification
|
| C070 | S | N | S | Tim Bray
| -
| 4 | Early Uniform Normalization
Decision: Noted.
We agree with the sentiment. Refer to some
mails by Mark about cost of checking/normalizing. Doing normalization
really early (when data is input or converted) is usually
very cheap because it can be done by design (e.g. keyboards
with dead keys, conversion from a specific legacy encoding).
Decision: Noted. We agree that this is an important consideration. Please refer to some earlier mails by Mark Davis about cost of checking/normalizing. Doing normalization really early (when data is input or converted) is usually very cheap because it can be done by design (e.g. keyboards with dead keys, conversion from a specific legacy encoding). Normalization is indeed best run at i/o speed, but this should be human input speed rather than network i/o speed. A general normalization algorithm needs significantly more than 10KB footprint. But there is quite a wide range of possible tradeoffs between speed and footprint. We have added references to implementations and additional material in Appendix D, resources for Normalization. http://www.w3.org/International/Group/charmod-edit/#sec-n11n-resources
There is also an FAQ at http://www.unicode.org/faq/normalization.html.
An implementation that I (MD) did for just *checking* NFC came in under 50KB (in C). Mark reported 110KB for actual normalization to NFC (in Java). Our response (sent 2004-01-16) -- Notification |
| C071 | E | R | D | Tim Bray
| -
| 6 | Bit-by-bit identity
|