<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>10008</bug_id>
          
          <creation_ts>2010-06-24 11:47:12 +0000</creation_ts>
          <short_desc>Use of Unicode blocks that no longer exist in regular expressions.</short_desc>
          <delta_ts>2010-11-10 17:22:38 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XML Schema</product>
          <component>Datatypes: XSD Part 2</component>
          <version>1.0/1.1 both</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows NT</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>DUPLICATE</resolution>
          <dup_id>5948</dup_id>
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          <dependson>5818</dependson>
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Oliver Hallam">oliver</reporter>
          <assigned_to name="David Ezell">David_E3</assigned_to>
          <cc>cmsmcq</cc>
    
    <cc>mike</cc>
          
          <qa_contact name="XML Schema comments list">www-xml-schema-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>36402</commentid>
    <comment_count>0</comment_count>
    <who name="Oliver Hallam">oliver</who>
    <bug_when>2010-06-24 11:47:12 +0000</bug_when>
    <thetext>Section F.1.1 states the following:

Note:  [Unicode Database] is subject to future revision. For example, the grouping of code points into blocks might be updated. All ·minimally conforming· processors ·must· support the blocks defined in the version of [Unicode Database] that is current at the time this specification became a W3C Recommendation. However, implementors are encouraged to support the blocks defined in any future version of the Unicode Standard.

Unfortunately some of these blocks no longer exist in the current Unicode specification!  I believe the changes are limited to the following:

CombiningMarksforSymbols is now CombiningDiacriticalMarksforSymbols

Greek is now GreekandCoptic

PrivateUse has been split into three groups (we think):
PrivateUseArea, SupplementaryPrivateUseAreaA and SupplementaryPrivateUseAreaB.

The behaviour for these old group names is left a bit vague.  I suggest that the correct behaviour should be one of the following, but this is not specified anywhere:

1) The old block names should no longer be valid.  This is a direct contradiction with the specification and would cause compatibility problems.

2) The old names should refer to groups in an older version of the Unicode specification that did have them.  In particular I suggest that this should be the version used in the Schema specification.

3) The old names should map to the equivalent groups in the newer version of the specification.  I can&apos;t find this mapping specified anywhere, but I believe it to be as described above (at least for the current version).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>36403</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2010-06-24 12:13:04 +0000</bug_when>
    <thetext>

*** This bug has been marked as a duplicate of bug 5948 ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>42363</commentid>
    <comment_count>2</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2010-11-10 17:22:38 +0000</bug_when>
    <thetext>The WG reported this bug as DUPLICATE on 2010-06-24.  We are closing this bug as
requiring no futher work.  If there are issues remaining, you can reopen this
bug and enter a comment to indicate the problem.  Thanks very much for the
feedback.

Comment 1 contains a reference to the duplicate bug.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>