<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>8732</bug_id>
          
          <creation_ts>2010-01-13 14:18:31 +0000</creation_ts>
          <short_desc>Unicode 5.2</short_desc>
          <delta_ts>2011-05-04 16:18:11 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XML Schema</product>
          <component>Datatypes: XSD Part 2</component>
          <version>1.1 only</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows NT</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>resolved</keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Michael Kay">mike</reporter>
          <assigned_to name="David Ezell">David_E3</assigned_to>
          <cc>cmsmcq</cc>
          
          <qa_contact name="XML Schema comments list">www-xml-schema-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>31018</commentid>
    <comment_count>0</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2010-01-13 14:18:31 +0000</bug_when>
    <thetext>Our current XSD 1.1 specs refer normatively to Unicode 5.1. Since 5.2 has been released, we should probably update to that.

This includes checking that the list of character categories and block names used in regular expressions is complete.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>31079</commentid>
    <comment_count>1</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-01-15 16:26:45 +0000</bug_when>
    <thetext>On the telcon Michael Kay agreed to do some research to determine the required scope of the requested change.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>36114</commentid>
    <comment_count>2</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-06-11 15:34:07 +0000</bug_when>
    <thetext>See email at from Mike Kay:
http://lists.w3.org/Archives/Member/w3c-xml-schema-wg/2010Jun/0002.html</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>36115</commentid>
    <comment_count>3</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-06-11 15:42:10 +0000</bug_when>
    <thetext>WG decided to adopt Michael Kay&apos;s proposal in comment #2.

proposal -- update &quot;Unicode 5.1&quot; to &quot;Unicode 5.2&quot; and add the code blocks listed in my email.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>42806</commentid>
    <comment_count>4</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2010-11-27 01:09:01 +0000</bug_when>
    <thetext>A problem arises in executing this change, perhaps a minor one.  We currently point to &apos;The Unicode Character Database&apos; with a URI of http://www.unicode.org/Public/5.1.0/ucd/UCD.html.  This page, in turn, follows the familiar convention of providing a link to the previous version, and to the latest version, of the document.  Following the link to the latest version, one is told that &quot;The UCD Documentation File You Requested Has Been Replaced&quot;, as of version 5.2.0.

The current version of Unicode now (as of October 2010) appears to be Unicode 6.0.0.

We need to repeat the work MK did on 5.2.0, checking character categories and block names.  

And we need to figure out which document(s) to link to.  Perhaps the best way to point to the Unicode Database itself is via http://www.unicode.org/Public/6.0.0/, which has the advantage that it has both the traditional semicolon-separated value files and an XML version of the database files in its subdirectories.  But for documentation, it would probably be better to point people to http://www.unicode.org/ucd/, which in turn links to http://www.unicode.org/versions/latest/ for the Unicode spec itself.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>42842</commentid>
    <comment_count>5</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2010-11-29 15:38:54 +0000</bug_when>
    <thetext>It seems clear that the WG needs to revisit the question of our Unicode 
reference and how tightly we should couple XSD 1.1 to any particular
Unicode version.  So after discussion with the chair I&apos;m changing the
status from needsDrafting back to needsAgreement.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>43443</commentid>
    <comment_count>6</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2010-12-17 02:47:01 +0000</bug_when>
    <thetext>I wonder, having just revisited our discussion of anyURI and its relation to the relevant RFCs, whether we should cultivate a looser coupling to the Unicode database.  I can think of two or three variants that might simplify life:

(1) Just leave the reference at 5.1.  The spec allows implementations to support later versions of Unicode, and we don&apos;t have a pressing need to require them to be up to date.

(2) Move the reference back to Unicode 3 or so, on the same logic.

(3) Make the Unicode reference wholly generic and specify that the version of Unicode supported is implementation-defined.  

(4) Move the table of block names (and any other version-sensitive information about Unicode) to a non-normative document, and in that document provide not just the block names for the current version of Unicode, but record their changes (so users and implementors have a better chance of deciphering problems in the area). 

Note that (3) is compatible with each of the others, or with updating the minimum Unicode version to 6.0.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>43451</commentid>
    <comment_count>7</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-12-17 16:45:17 +0000</bug_when>
    <thetext>Resolution: put the actual definitions in a separate document, and target version 3.1, time permitting; MSM to provide a Unicode history to the block names.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>46807</commentid>
    <comment_count>8</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2011-03-18 15:25:56 +0000</bug_when>
    <thetext>&lt;dezell&gt; MSM: proposed -- I&apos;d like to sever the tie to a particular unicode version.
&lt;dezell&gt; MSM: I&apos;d like us to have a &quot;first&quot; unicode version, and allow later.
&lt;dezell&gt; ...: no restriction on supporting more than one version.

WG approved.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>47591</commentid>
    <comment_count>9</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2011-04-21 19:54:28 +0000</bug_when>
    <thetext>A proposal intended to resolve this issue is at

  http://www.w3.org/XML/Group/2004/06/xmlschema-2/datatypes.b8732.html
  (member-only link).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>47949</commentid>
    <comment_count>10</comment_count>
    <who name="C. M. Sperberg-McQueen">cmsmcq</who>
    <bug_when>2011-05-04 16:02:07 +0000</bug_when>
    <thetext>The proposal mentioned in comment 9 has been integrated into the status-quo document, so I&apos;m closing this issue.

Michael, if you would close or reopen to signal assent/dissent?  Thanks.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>