<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>11209</bug_id>
          
          <creation_ts>2010-11-03 15:57:30 +0000</creation_ts>
          <short_desc>[FT] Wildcard and thesaurus options in combination</short_desc>
          <delta_ts>2010-11-30 05:37:06 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XPath / XQuery / XSLT</product>
          <component>Full Text 1.0</component>
          <version>Candidate Recommendation</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows NT</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Tim Mills">tim</reporter>
          <assigned_to name="Jim Melton">jim.melton</assigned_to>
          <cc>jmdyck</cc>
          
          <qa_contact name="Mailing list for public feedback on specs from XSL and XML Query WGs">public-qt-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>42042</commentid>
    <comment_count>0</comment_count>
    <who name="Tim Mills">tim</who>
    <bug_when>2010-11-03 15:57:30 +0000</bug_when>
    <thetext>Suppose an implementation&apos;s default thesaurus expands each of the tokens &quot;three&quot; and &quot;3.0&quot; to the two query items &quot;three&quot; and &quot;3.0&quot;.

Consider queries such as:

&quot;version 3.0&quot; contains text &quot;three&quot; 
using wildcards 
using default thesaurus

and

&quot;version three&quot; contains text &quot;3.0&quot; 
using wildcards 
using default thesaurus

It seems that thesaurus expansion can occur even when the wildcard option is enabled.  However, in:

 fts:lookupThesaurus( $queryTokens,
                           $matchOption/fts:thesaurusName,
                           $matchOption/@language,
                           $matchOption/fts:relationship,
                           $matchOption/fts:range )

it appears that the thesaurus won&apos;t know whether characters in the query tokens should be interpreted as wildcards or not.  Also the thesaurus might need to escape its results (&quot;three&quot; expanding to &quot;3\.0&quot;) to avoid triggering wildcard matching.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>42875</commentid>
    <comment_count>1</comment_count>
    <who name="Michael Dyck">jmdyck</who>
    <bug_when>2010-11-30 05:36:43 +0000</bug_when>
    <thetext>At their joint 2010-11-23 meeting, the XQuery/XSL WGs approved the following fix:

   -- In 4.2.5.2, have fts:applyQueryTokensAsPhrase() pass
      $noThesaurusOptions to fts:applyThesaurusOption().

   -- In 4.2.5.7, have fts:applyThesaurusOption() accept
      $noThesaurusOptions and pass it on to fts:lookupThesaurus().

   -- In 4.2.5.3, have fts:lookupThesaurus() [an external
      (implementation-defined) function] accept $noThesaurusOptions,
      and in the prose, say something to the effect that by giving it
      access to the other match options, we allow the implementation to
      apply any of those match options (whichever it deems relevant) to
      the input or output of the &quot;actual&quot; thesaurus lookup.

(They also approved the subsequent cleanup of eliminating the now-redundant $thesaurusLanguage parameter from fts:lookupThesaurus(), and the now-unnecessary &quot;language&quot; attribute from complexType ftThesaurusOption.)

I have made these changes to the editors&apos; draft of the Full Text spec, and so am marking this issue resolved-FIXED. Since you were at the meeting and indicated that these changes would be satisfactory, I will also mark the issue CLOSED.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>