<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>6668</bug_id>
          
          <creation_ts>2009-03-09 14:27:54 +0000</creation_ts>
          <short_desc>[FT] Stemming files</short_desc>
          <delta_ts>2009-03-13 14:22:51 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>XPath / XQuery / XSLT</product>
          <component>Full Text 1.0</component>
          <version>Candidate Recommendation</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://basex.org</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Christian Gruen">christian.gruen</reporter>
          <assigned_to name="Jim Melton">jim.melton</assigned_to>
          <cc>pcase</cc>
          
          <qa_contact name="Mailing list for public feedback on specs from XSL and XML Query WGs">public-qt-comments</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>24108</commentid>
    <comment_count>0</comment_count>
    <who name="Christian Gruen">christian.gruen</who>
    <bug_when>2009-03-09 14:27:54 +0000</bug_when>
    <thetext>Sorry, another one.. the stemming file &quot;english-stems.txt&quot; seems to have some inconsistencies..

[...]
test tests testing tested testers
picture pictures
use user
users user
[...]

&quot;tests, testing&quot; etc is stemmed to &quot;test&quot;, which won&apos;t work for the &quot;user&quot; term.

Thanks,

Christian, BaseX Team 
http://www.basex.org</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24227</commentid>
    <comment_count>1</comment_count>
    <who name="Pat Case">pcase</who>
    <bug_when>2009-03-13 12:27:51 +0000</bug_when>
    <thetext>Hi Christian,

I have reordered each line in stemming file to begin with the simplest form of the word. 

I have combined and added forms of words to the use/users line. It is now:
use uses using used user users

I have reviewed every test case that calls the stemming file and whose query contains use or user to be sure that this did not nullify the test cases or change the results.

If this result is acceptable, please close the bug.  

Pat Case

</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>