<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>6795</bug_id>
          
          <creation_ts>2009-04-09 07:16:36 +0000</creation_ts>
          <short_desc>[FO] ^ should not be allowed by XmlCharIncDash</short_desc>
          <delta_ts>2018-10-09 00:23:40 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XML Schema</product>
          <component>Datatypes: XSD Part 2</component>
          <version>1.0 only</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>REOPENED</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard>11248</status_whiteboard>
          <keywords>decided</keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>REC</target_milestone>
          
          <blocked>5</blocked>
          <everconfirmed>1</everconfirmed>
          <reporter name="Murata">eb2m-mrt</reporter>
          <assigned_to name="Michael Kay">mike</assigned_to>
          <cc>David_E3</cc>
    
    <cc>jim.melton</cc>
    
    <cc>pumba.parker2017</cc>
          
          <qa_contact name="Mailing list for public feedback on specs from XSL and XML Query WGs">public-qt-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>24632</commentid>
    <comment_count>0</comment_count>
    <who name="Murata">eb2m-mrt</who>
    <bug_when>2009-04-09 07:16:36 +0000</bug_when>
    <thetext>^ should not be allowed by XmlCharIncDash

Because it is now allowed, &quot;[^a]&quot; can be interpreted as a posCharGroup.

charClassExpr -&gt; &apos;[&apos; charGroup &apos;]&apos; 
  -&gt;  &apos;[&apos; posCharGroup &apos;]&apos; 
  -&gt;  &apos;[&apos; charRange charRange &apos;]&apos; 
  -&gt;  &apos;[&apos; XmlCharIncDash XmlCharIncDash &apos;]&apos; 
  -&gt;  &apos;[&apos; &apos;^&apos; &apos;a&apos; &apos;]&apos;</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24633</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2009-04-09 07:28:33 +0000</bug_when>
    <thetext>Personal response:

I think this is covered by the prose rule in XML Schema Part 2:

&quot;The ^ character is only valid at the beginning of a ·positive character group· if it is part of a ·negative character group·&quot;

(and the subsequent note which points out that this rule is needed to disambiguate the grammar).

The revision to this text in XSD 1.1 Part 2 describes the rule more clearly:

&quot;If the first character in a charGroup is &apos;^&apos;, this is taken as indicating that the charGroup starts with a negCharGroup. A posCharGroup can itself start with &apos;^&apos; but only when it appears within a negCharGroup, that is, when the &apos;^&apos; is preceded by another &apos;^&apos;. &quot;

Come to think of it, I&apos;m not sure why this needs to be written in English - it feels like it could have been achieved in the BNF.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24635</commentid>
    <comment_count>2</comment_count>
    <who name="Murata">eb2m-mrt</who>
    <bug_when>2009-04-09 08:14:03 +0000</bug_when>
    <thetext>(In reply to comment #1)
&gt; Personal response:
&gt; 
&gt; I think this is covered by the prose rule in XML Schema Part 2:
&gt; 
&gt; &quot;The ^ character is only valid at the beginning of a ·positive character group·
&gt; if it is part of a ·negative character group·&quot;

I had no ideas about this sentence.  But the rewrite in W3C XML Schema 1.1 
makes much more sense.
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24716</commentid>
    <comment_count>3</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2009-04-14 15:38:55 +0000</bug_when>
    <thetext>The joint WGs agreed today to resolve this as Invalid. If this resolution is acceptable, Murata-san, I would be grateful if you could mark it as CLOSED.

Michael Kay</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24717</commentid>
    <comment_count>4</comment_count>
    <who name="Murata">eb2m-mrt</who>
    <bug_when>2009-04-14 15:48:55 +0000</bug_when>
    <thetext>(In reply to comment #3)
&gt; The joint WGs agreed today to resolve this as Invalid. If this resolution is
&gt; acceptable, Murata-san, I would be grateful if you could mark it as CLOSED.

Well, although I agree that the prose in W3C XML Schema 1.0 is correct, 
I do not think that it is very understandable.  It may be succint but is 
underspecified for non-native speakers.  Is it possible to incorporate 
that part of 1.1 into 1.0?

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24719</commentid>
    <comment_count>5</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2009-04-14 16:46:16 +0000</bug_when>
    <thetext>This is now addressing a weakness in XSD 1.0 Part 2, rather than in QT Functions and Operators. We could transfer the bug there if you like. However, I think that with the limited resources available to the XML Schema WG, the chances of this getting fixed in an erratum are small. The Schema WG made a conscious decision to put 1.0 bugs on hold until 1.1 is done, and with every year that passes, the value of &quot;improving&quot; the 1.0 spec becomes less. And there are many things more pressing than this problem - the 1.0 spec might not state the answer very elegantly, but I don&apos;t think it&apos;s ambiguous.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24721</commentid>
    <comment_count>6</comment_count>
    <who name="Murata">eb2m-mrt</who>
    <bug_when>2009-04-14 17:39:57 +0000</bug_when>
    <thetext>(In reply to comment #5)

I do not trust extensions from those who do not care defects of 
old versions.

&gt; the 1.0 spec might not state
&gt; the answer very elegantly, but I don&apos;t think it&apos;s ambiguous.

IMHO, it&apos;s understandable only when you understand the intention already.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24735</commentid>
    <comment_count>7</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2009-04-15 09:17:11 +0000</bug_when>
    <thetext>I will reopen this bug and reallocate it to XML Schema 1.0 (assuming the system allows me to do that).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>25555</commentid>
    <comment_count>8</comment_count>
    <who name="David Ezell">David_E3</who>
    <bug_when>2009-06-12 15:52:48 +0000</bug_when>
    <thetext>(from the telcon minutes of 2009-06-12)

Mike Kay summarized the bug and suggested that the best solution would be to change not just the passage objected to by Murata but all of 1.1&apos;s regex appendix into 1.0

This would resolve both this issue and some other outstanding bugs and problems.

Some retrofitting will be required.

We will NOT change the Unicode baseline for XSD 1.0 -- it will remain Unicode 3.1

We will continue to allow support for new versions of Unicode (1.0 status quo already does)

We will re-synch the table of character name to Unicode 3.1

We will rephrase the paragraphs about changes to Unicode since 3.1, to tell the story from the 1.0 pov, not the 1.1 pov. (Editorial)

We will scour the text to remove references to the implementaiton option of supporting XML 1.1.

RESOLVED:  to resolve bug 6795 by adopting Michael Kay&apos;s change proposal (take the whole regex appendix into 1.0, modulo minor changes outlined above).</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>