This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 4709 - [FT] editorial: 3.3.5 Stop Word Option
Summary: [FT] editorial: 3.3.5 Stop Word Option
Alias: None
Product: XPath / XQuery / XSLT
Classification: Unclassified
Component: Full Text 1.0
Version: Last Call drafts
Hardware: All All
Importance: P2 minor
Target Milestone: ---
Assignee: Pat Case
QA Contact: Mailing list for public feedback on specs from XSL and XML Query WGs
Depends on:
Reported: 2007-06-23 10:03 UTC by Michael Dyck
Modified: 2008-03-30 00:30 UTC
CC List: 0 users

See Also:


Description Michael Dyck 2007-06-23 10:03:22 UTC
3.3.5 Stop Word Option

'[173] FTRefOrList ::= ...'
    The name "RefOrList" is fairly generic. Perhaps "FTStopWords"?

'[174] FTInclExclStringLiteral ::= ("union" | "except") FTRefOrList'
    The "StringLiteral" in the name doesn't really make sense.
    Perhaps "FTStopwordInclExcl" ?

para 2
"they are used as they occur in the sequence."
    There's no antecedent for "the sequence".

"If applied during indexing asking for stop words to not be used during a
query, will have no effect."
    After "indexing", insert comma.

    Except for the fact that it's a Note, this appears to be a license for
    a Full-Text processor to behave non-conformantly. Surely if an
    implementation does some pre-emptive computation assuming certain
    options, and a query uses conflicting options, then the implementation
    is obliged to ignore the prepared results and do what it takes to give
    the correct answer.

    I think it would be instructive if you had an example like
        ... p ftcontains "supports users" with stop words ("the")
    As I understand it, the result is false (despite the fact that the
    text contains "supports the users"), which might not be what people
    expect.
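
The reading described above can be sketched in Python. This is an illustration only, not the specification's normative semantics. It assumes whitespace tokenization with lower-casing, that a stop word appearing in the query phrase matches any single token, and that tokens in the searched text are never removed, so a stop word in the text still occupies a position. The names `tokenize` and `phrase_matches` are hypothetical.

```python
def tokenize(text):
    """Split on whitespace and lower-case; a stand-in for real tokenization."""
    return text.lower().split()


def phrase_matches(text, phrase, stop_words):
    """Return True if `phrase` occurs in `text` at consecutive token positions.

    Assumption: a stop word in the query phrase matches any single token,
    while tokens in the text are never removed and so keep their positions.
    """
    text_tokens = tokenize(text)
    query_tokens = tokenize(phrase)
    stops = {w.lower() for w in stop_words}
    width = len(query_tokens)
    if width == 0:
        return False
    # Slide a window of the phrase's length across the text tokens.
    for start in range(len(text_tokens) - width + 1):
        window = text_tokens[start:start + width]
        if all(q in stops or q == t for q, t in zip(query_tokens, window)):
            return True
    return False


text = "The tool supports the users"
print(phrase_matches(text, "supports users", ["the"]))      # False
print(phrase_matches(text, "supports the users", ["the"]))  # True
```

Under these assumptions, "the" in the text sits between "supports" and "users", so the two-token phrase is never adjacent and the first call is false.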

'contains the "then"'
    Change to 'contains "the" and "then"' ?

'except ("the then")'
    Are you sure? Supposedly, no tokenization is performed on "the then".
    I think you mean 'except ("the", "then")', or else you need to clarify
    how string literals are handled here.
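
The difference between the two spellings can be sketched in Python. The sketch assumes that each string literal in the list is taken verbatim as one stop word (no tokenization) and that "except" removes the listed strings from the current stop word set. The helper `apply_except` and the starting set are hypothetical.

```python
def apply_except(stop_words, literals):
    """Remove each literal, taken verbatim as one stop word, from the set."""
    return set(stop_words) - set(literals)


base = {"the", "then", "a"}

# One literal "the then": it equals neither "the" nor "then", so nothing is removed.
print(sorted(apply_except(base, ["the then"])))     # ['a', 'the', 'then']

# Two literals "the" and "then": both stop words are removed.
print(sorted(apply_except(base, ["the", "then"])))  # ['a']
```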
Comment 1 Jochen Doerre 2007-11-09 18:12:48 UTC
Fixed as decided in FTTF-156. This just leaves 4b open.
(member-only link)

Comment 2 Pat Case 2008-01-24 16:14:38 UTC
[4b] No change. FTTF believes that the note is accurate.

Michael, if you agree we are OK with no change, please close this bug.
Comment 3 Michael Dyck 2008-02-16 06:59:44 UTC
(In reply to comment #2)
> [4b] No change. FTTF believes that the note is accurate.

I don't think that answers my concern. Do you agree and intend that it's a license for a Full-Text processor to behave non-conformantly? Is a processor required to answer correctly or not?

Is this the kind of Note that's a not-immediately-obvious consequence of some normative statements? If so, which ones?
Comment 4 Mary Holstege 2008-03-24 18:29:19 UTC
On additional consideration, WG agreed to clarify the situation, see comment:
Comment 5 Michael Dyck 2008-03-30 00:30:24 UTC
At meeting 169, the FTTF approved the proposed wording with minor amendments. Mary has committed the changes to the document.

I agree that the new wording resolves [4b], which was the only unresolved part of this issue. Therefore I will close this issue.