<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>2682</bug_id>
          
          <creation_ts>2006-01-09 10:01:31 +0000</creation_ts>
          <short_desc>Whitespace text nodes stripped from source document</short_desc>
          <delta_ts>2006-06-22 12:59:58 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XML Query Test Suite</product>
          <component>XML Query Test Suite</component>
          <version>0.8.4</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows XP</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Michael Kay">mike</reporter>
          <assigned_to name="Carmelo Montanez">carmelo</assigned_to>
          
          
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>7719</commentid>
    <comment_count>0</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2006-01-09 10:01:34 +0000</bug_when>
    <thetext>Test Parenexpr-19 should simply return a copy of the input document. In fact,
however, it returns a copy of the input document with whitespace text nodes removed.

I think there are many other tests suffering from the same problem. At present I
handle this using a comparator that ignores whitespace text nodes in the result
(with a comment that this has been done). At the very least, I think results
should be published with and without whitespace stripping, or the catalog should
say whether whitespace stripping on the source document is required. In fact,
however, I think that according to the data model spec, whitespace stripping in
this case is incorrect if the standard &quot;construction from an InfoSet&quot; is used.
In the mapping from the InfoSet, whitespace text nodes are discarded only if the
[element content whitespace] property is true, and this will only be the case if
there is a DTD.
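
For illustration only, a comparator of the kind described above might look like
the following minimal Python sketch. It assumes the standard
xml.etree.ElementTree module rather than the actual test driver, and the names
norm and equal_ignoring_whitespace_nodes are hypothetical. A whitespace-only
text node is treated as absent; everything else is compared exactly.

```python
import xml.etree.ElementTree as ET


def norm(text):
    # Hypothetical helper: treat a missing or whitespace-only text node as absent.
    if text is None or text.strip() == "":
        return None
    return text


def equal_ignoring_whitespace_nodes(a, b):
    # Compare two ElementTree elements recursively, ignoring text nodes
    # that consist solely of whitespace. Comments and processing
    # instructions are not considered (the default parser drops them).
    if a.tag != b.tag or a.attrib != b.attrib:
        return False
    if norm(a.text) != norm(b.text) or norm(a.tail) != norm(b.tail):
        return False
    if len(a) != len(b):
        return False
    return all(equal_ignoring_whitespace_nodes(x, y) for x, y in zip(a, b))
```

Two trees parsed with ET.fromstring that differ only in indentation between
elements compare as equal under this sketch, while a difference inside a text
node that also contains non-whitespace characters is still reported.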

Other tests showing this problem (in most cases I only did careful checking of
the first one in each group; the others are listed because they displayed the
same symptoms):

Axes085
NodeTest003/4/5
unabbreviatedSyntax-4/5/8/10/11/12/13/20/21/22/24/25/26/28/29/30/31
abbreviatedSyntax-2/3/5/6/14/16/17/18/19/20/21/24
filterexpressionhc1/4/5
fn-union-node-args-002/3/5/7/9/10/11/15/16
fn-intersect-node-args-011/15/16
fn-except-node-args-003/017
ForExpr005
WhereExpr004
ReturnExpr004
orderDecl-4/5/6/7/21/22
extexpr-6
position-1/3/4/5/6/7/8/9/10/13-18/20-22
last-1/3/4/5/8/9/10/13-18/20-22
ancestor-11/12/13
ancestorself-11/12/13

Michael Kay</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7732</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2006-01-09 14:47:12 +0000</bug_when>
    <thetext>Also affects

ForExprType038/039</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7954</commentid>
    <comment_count>2</comment_count>
    <who name="Carmelo Montanez">carmelo</who>
    <bug_when>2006-01-23 20:09:43 +0000</bug_when>
    <thetext>Michael et al.:

There may be other tests that are affected by this.  I will try to find a way
to automate finding those tests.  For the time being, I have corrected all the
identified ones from the NIST side.  Assigning the bug to ravindranath as this
affects some of his tests as well.

Thanks,
Carmelo</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7956</commentid>
    <comment_count>3</comment_count>
    <who name="Michael Kay">mike</who>
    <bug_when>2006-01-23 21:14:19 +0000</bug_when>
    <thetext>There was actually a certain amount of automation in the way I produced the
list: if my test driver finds a discrepancy, it tests to see if the discrepancy
is in whitespace text nodes only, and if so, it tries running the test again
with whitespace stripping applied to the source document. However, as I
mentioned, it also involved some manual checking and I did this only for a
couple of tests in each group.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8183</commentid>
    <comment_count>4</comment_count>
    <who name="Ravindranath (Ravi) Chennoju">ravindranath.chennoju</who>
    <bug_when>2006-02-11 00:13:12 +0000</bug_when>
    <thetext>Fixed following test output:
fn-union-node-args-002/3
fn-intersect-node-args-011
fn-except-node-args-003/017

Did not find any white space problems with the following:
fn-union-node-args-5/7/9/10/11/15/16
fn-intersect-node-args-015/16
fn-except-node-args-017
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8184</commentid>
    <comment_count>5</comment_count>
    <who name="Ravindranath (Ravi) Chennoju">ravindranath.chennoju</who>
    <bug_when>2006-02-11 00:14:34 +0000</bug_when>
    <thetext>Closed by mistake. Reopening and assigning to Andrew.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8284</commentid>
    <comment_count>6</comment_count>
    <who name="Carmelo Montanez">carmelo</who>
    <bug_when>2006-02-16 14:39:53 +0000</bug_when>
    <thetext>All:

All issues pertaining to this particular bug seem to be fixed.  I will go ahead
and close it as fixed for the sake of time.

Thanks,
Carmelo</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8298</commentid>
    <comment_count>7</comment_count>
    <who name="Carmelo Montanez">carmelo</who>
    <bug_when>2006-02-16 20:13:11 +0000</bug_when>
    <thetext>Some test cases are still not fixed.  I will take a look at this once more.

Carmelo</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8657</commentid>
    <comment_count>8</comment_count>
    <who name="Andrew Eisenberg">andrew.eisenberg</who>
    <bug_when>2006-03-09 20:08:18 +0000</bug_when>
    <thetext>We believe that all are done but the following:

ForExpr005
WhereExpr004
ReturnExpr004
orderDecl-4/5/6/7/21/22</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>9046</commentid>
    <comment_count>9</comment_count>
    <who name="Carmelo Montanez">carmelo</who>
    <bug_when>2006-04-05 16:53:34 +0000</bug_when>
    <thetext>Michael:

Thanks.  Corrected all the missing whitespaces.  Please close bug when able to verify.

Thanks,
Carmelo</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>9065</commentid>
    <comment_count>10</comment_count>
    <who name="Jinghao Liu">jinghaol</who>
    <bug_when>2006-04-07 00:26:07 +0000</bug_when>
    <thetext>All:

For ForExpr005, we should NOT get a whitespace text node in front of &lt;Deny&gt;.  The expected result should be:
===============
&lt;Deny&gt;
							&lt;security&gt;
								&lt;right&gt;Read&lt;/right&gt;
							&lt;/security&gt;
							&lt;user&gt;system\Toshiko&lt;/user&gt;
						&lt;/Deny&gt;

instead of 
===============
						&lt;Deny&gt;
							&lt;security&gt;
								&lt;right&gt;Read&lt;/right&gt;
							&lt;/security&gt;
							&lt;user&gt;system\Toshiko&lt;/user&gt;
						&lt;/Deny&gt;

Thanks
Jinghao

&gt; Michael:
&gt; Thanks.  Corrected all the missing whitespaces.  Please close bug when able to
&gt; verify.
&gt; Thanks,
&gt; Carmelo

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>9072</commentid>
    <comment_count>11</comment_count>
    <who name="Carmelo Montanez">carmelo</who>
    <bug_when>2006-04-07 19:08:26 +0000</bug_when>
    <thetext>Mike et al.:

Looks like this was all taken care of.  I am marking the bug as fixed.

Thanks,
Carmelo</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>