<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>5948</bug_id>
          
          <creation_ts>2008-08-11 11:33:57 +0000</creation_ts>
          <short_desc>Reference to Unicode Database</short_desc>
          <delta_ts>2010-11-10 17:43:05 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XML Schema</product>
          <component>Datatypes: XSD Part 2</component>
          <version>1.1 only</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows XP</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://www.w3.org/International/reviews/0807-xsd11/</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>resolved</keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Felix Sasaki">fsasaki</reporter>
          <assigned_to name="C. M. Sperberg-McQueen">cmsmcq</assigned_to>
          <cc>David_E3</cc>
    
    <cc>mike</cc>
    
    <cc>oliver</cc>
          
          <qa_contact name="XML Schema comments list">www-xml-schema-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>21487</commentid>
    <comment_count>0</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2008-08-11 11:33:57 +0000</bug_when>
    <thetext>Hello,

this is a comment on behalf of the i18n core Working Group.
We suggest that you have a non-static reference to the Unicode database, and that you make the Note starting with &quot;[Unicode Database] is subject to future revision&quot; to normative text.
Locations:
References
http://www.w3.org/TR/2008/WD-xmlschema11-2-20080620/#UnicodeDB
and G1
http://www.w3.org/TR/2008/WD-xmlschema11-2-20080620/#charcter-classes
Thank you,
Felix</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>21798</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2008-09-05 14:36:34 +0000</bug_when>
    <thetext>I would be more inclined to agree with this if the Unicode Database offered better guarantees of backwards compatibility. But recent releases have, for example, renamed one of the character groups from &quot;Greek&quot; to &quot;Greek and Coptic&quot;, and an implementation that followed that blindly would cause any schema using the regular expression \P{IsGreek} to become invalid overnight. Similarly there are characters that have changed category, which also changes the semantics of regular expressions, causing a message that is validated by the sender to be rejected by the recipient even though both use the same schema. 

I think we need to offer XML Schema users better stability than this.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22060</commentid>
    <comment_count>2</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2008-09-29 05:57:15 +0000</bug_when>
    <thetext>Hello Michael,

we discussed your comment
http://www.w3.org/Bugs/Public/show_bug.cgi?id=5948#c1
at
http://www.w3.org/2008/09/17-core-minutes#item05
We are now contacting the Unicode Technical Committee asking for stability of block names. See the thread at 
http://lists.w3.org/Archives/Member/member-i18n-core/2008Sep/0013.html
The short summary is that there is a trade off between having a stable reference to the Unicode database and allowing for more characters in &quot;yet to come&quot; versions of Unicode. We think a good compromise would be to make it implementation-defined which version of the database is used. This proposal is also based on Bug 
http://www.w3.org/Bugs/Public/show_bug.cgi?id=5818
Mark Davis pointed out at
http://lists.w3.org/Archives/Member/member-i18n-core/2008Sep/0025.html
that the property Alias file
http://unicode.org/Public/UNIDATA/PropertyValueAliases.txt
provides information about previous block names, so you might want to take that into account as well.

Regards, Felix.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22313</commentid>
    <comment_count>3</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2008-10-31 21:14:49 +0000</bug_when>
    <thetext>WG agreed at the Face to Face:

&lt;MSM&gt; We think the text is mostly ok, but some tweaks are desirable:
&lt;MSM&gt; - replace references to &apos;current version&apos; with references to &apos;the version cited in the References&apos; (or to the actual number of that version -- we won&apos;t change it)
&lt;MSM&gt; - change references to &apos;future&apos; versions to &apos;later&apos; versions
&lt;MSM&gt; - add prose to the reference re-stating the conformance rules (ceiling/floor)</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22314</commentid>
    <comment_count>4</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2008-10-31 21:16:00 +0000</bug_when>
    <thetext>AND:
&lt;MSM&gt; - Make those two Notes normative text!</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22319</commentid>
    <comment_count>5</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2008-11-01 09:00:33 +0000</bug_when>
    <thetext>(In reply to comment #3)
&gt; WG agreed at the Face to Face:
&gt; 
&gt; &lt;MSM&gt; We think the text is mostly ok, but some tweaks are desirable:
&gt; &lt;MSM&gt; - replace references to &apos;current version&apos; with references to &apos;the version
&gt; cited in the References&apos; (or to the actual number of that version -- we won&apos;t
&gt; change it)
&gt; &lt;MSM&gt; - change references to &apos;future&apos; versions to &apos;later&apos; versions
&gt; &lt;MSM&gt; - add prose to the reference re-stating the conformance rules
&gt; (ceiling/floor)
&gt; 

Hello David, all,

it is a bit hard to see what the actual change will be, could you give a link to the updated draft later? Also, FYI and maybe of importante for this issues: the i18n core WG has talked to the Unicode consortium about various stability policies, including character class names, which resulted in an update of these policies. See 
http://lists.w3.org/Archives/Member/member-i18n-core/2008Oct/0038.html

Felix</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22320</commentid>
    <comment_count>6</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2008-11-01 10:00:17 +0000</bug_when>
    <thetext>I don&apos;t see anything in that Unicode stability policy about names of blocks such as &quot;Greek&quot;, which is my biggest concern here. Did I miss something? Also, a policy for the future doesn&apos;t solve the problem for the past - I do think we need to say something explicit to make sure that a schema using &lt;pattern value=&quot;p{IsGreek}*&quot;/&gt; continues to work.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22421</commentid>
    <comment_count>7</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2008-11-13 02:10:28 +0000</bug_when>
    <thetext>(In reply to comment #6)
&gt; I don&apos;t see anything in that Unicode stability policy about names of blocks
&gt; such as &quot;Greek&quot;, which is my biggest concern here. Did I miss something? Also,
&gt; a policy for the future doesn&apos;t solve the problem for the past - I do think we
&gt; need to say something explicit to make sure that a schema using &lt;pattern
&gt; value=&quot;p{IsGreek}*&quot;/&gt; continues to work.
&gt; 

From a mail exchange with Mark Davis on the topic:

[Soon] &quot;there will be a publicly available stability provision for all of the property aliases and property value aliases on

    * http://unicode.org/Public/UNIDATA/PropertyValueAliases.txt
    * http://unicode.org/Public/UNIDATA/PropertyAliases.txt 

with the exception of Contributory properties listed on &lt;http://www.unicode.org/Public/UNIDATA/UCD.html#Properties&gt;. This is not completely final yet, since the exact wording has to be formulated by the editorial committee, and it actually requires approval by the officers, but I don&apos;t anticipate any problems.

So that will include block names. Note that that the set of characters having a given property or property value may change (subject to the stability policies). What the above means is that the identifiers will always remain valid, so \p{script=Greek} or equivalent syntax will remain valid. That should address your concerns.

Mark&quot;
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22806</commentid>
    <comment_count>8</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2008-12-19 03:34:03 +0000</bug_when>
    <thetext>A wording proposal intended to resolve bug 5948 and bug 5950 is at

  http://www.w3.org/XML/Group/2004/06/xmlschema-2/datatypes.b5948.html

This should make it easier to see what changes, exactly, are proposed.
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22808</commentid>
    <comment_count>9</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2008-12-19 09:39:34 +0000</bug_when>
    <thetext>In the wording proposal, the Note after the table of block names refers to PropertyAliases.txt and PropertyValueAliases.txt. The block names are actually defined in Blocks.txt</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>22829</commentid>
    <comment_count>10</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2008-12-22 15:37:43 +0000</bug_when>
    <thetext>The wording proposal mentioned in comment #8 was approved by the XML Schema
WG at its telcon of 19 December 2008, with minor amendments.  We discussed the
point raised in comment #9; some WG members took comment #7 to mean that
the two files in question are in fact relevant to (changes in) block
assignment, or will be.  In the end, recalling that those two files are
already mentioned in the first note in the change proposal, not very far
above this location in the text, the WG decided just to delete that 
sentence.

The changes have been integrated into the status quo document at the usual
location.  

The WG decided not to close the issue, however.  Since XSD 1.0 refers to
version 3.1 of the Unicode database, the current draft of XSD 1.1 to 4.1,
and the Unicode Consortium&apos;s web site now carries version 5.1, we discussed
briefly which version to require of XSD 1.1 processors:  3.1 (for compatibility
with XSD 1.0)?, 4.1 (for compatibility with earlier drafts of 1.1)?, or 
5.1 (to be current)?  We decided to require 5.1, and instructed the editors
to check the block and property information and update the reference.

I&apos;m marking this needsDrafting, accordingly.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>23196</commentid>
    <comment_count>11</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2009-01-21 00:04:12 +0000</bug_when>
    <thetext>On 16 January 2009, the XML Schema WG adopted the proposal at

  http://www.w3.org/XML/Group/2004/06/xmlschema-2/datatypes.b5948b.html

with amendments proposed by Michael Kay in 

  http://lists.w3.org/Archives/Member/w3c-xml-schema-ig/2009Jan/0011.html

(both of these are member-only links).

Felix, if you could convey this decision to the i18n WG and let us know
whether it resolves the issue to your and their satisfaction, we&apos;d be
grateful. Close the issue if you&apos;re satisfied, reopen it if not.

If we don&apos;t hear from you in the next two weeks, we will assume you and
the i18n core WG are satisfied with the resolution of the issue. </thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>36404</commentid>
    <comment_count>12</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2010-06-24 12:13:04 +0000</bug_when>
    <thetext>*** Bug 10008 has been marked as a duplicate of this bug. ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>42386</commentid>
    <comment_count>13</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-11-10 17:43:05 +0000</bug_when>
    <thetext>The WG reported this bug as FIXED on 2010-06-24.  We are closing this bug
as requiring no futher work.  If there are issues remaining, you can reopen
this bug and enter a comment to indicate the problem.  Thanks very much for the
feedback.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>