<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>5556</bug_id>
          
          <creation_ts>2008-03-10 20:24:04 +0000</creation_ts>
          <short_desc>[FT] distance for sentence/paragraph computation inconsistency</short_desc>
          <delta_ts>2008-04-03 23:10:19 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XPath / XQuery / XSLT</product>
          <component>Full Text 1.0</component>
          <version>Working drafts</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>INVALID</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P1</priority>
          <bug_severity>minor</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="zhen hua liu">zhen.liu</reporter>
          <assigned_to name="Jim Melton">jim.melton</assigned_to>
          
          
          <qa_contact name="Mailing list for public feedback on specs from XSL and XML Query WGs">public-qt-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>19396</commentid>
    <comment_count>0</comment_count>
    <who name="zhen hua liu">zhen.liu</who>
    <bug_when>2008-03-10 20:24:04 +0000</bug_when>
    <thetext>In section 4.2.4 formal semantics function:
fts:wordDistance() is defined as $tokenInfo2/@startPos - $tokenInfo1/@endPos -1
This is consistent with 4.2.7.9 FTDistance section on how fts:wordDistance() is invoked with fts:tokenInfo is ordered by (&apos;$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos).

However, such consistency is not followed for sentence &amp; paragraph computation.
 fts:sentenceDistance() is defined 
as $tokenInfo2/@startSent - $tokenInfo1/@endSent -1 in section 4.2.4.
However, fts:sentenceDistance() is involked in section 4.2.7.9 with fts:tokenInfo is order by (&apos;$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos)
instead of ($fts:tokenInfo/@startSent, fts:tokenInfo/@endSent).
The same inconsistency applies to paragraph as well. The seciton 4.2.7.9 should invoke fts:paraDistance() with fts:tokenInfo ordered by ($fts:tokenInfo/@startPara, fts:tokenInfo/@endPara).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19494</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Dyck">jmdyck</who>
    <bug_when>2008-03-16 22:36:40 +0000</bug_when>
    <thetext>(In reply to comment #0)
&gt; 
&gt; However, fts:sentenceDistance() is involked in section 4.2.7.9 with
&gt; fts:tokenInfo is order by (&apos;$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos)
&gt; instead of ($fts:tokenInfo/@startSent, fts:tokenInfo/@endSent).

Can you give an example where these two orderings would lead to different sentence-distances? 
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>19724</commentid>
    <comment_count>2</comment_count>
    <who name="zhen hua liu">zhen.liu</who>
    <bug_when>2008-04-03 17:44:47 +0000</bug_when>
    <thetext>Per group discussion, we found the distance computation algorithm needs to be revisited as action item 169-01 by Mary Holstege. So we close this bug as invalid.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>