This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 5556 - [FT] distance for sentence/paragraph computation inconsistency
Summary: [FT] distance for sentence/paragraph computation inconsistency
Status: CLOSED INVALID
Alias: None
Product: XPath / XQuery / XSLT
Classification: Unclassified
Component: Full Text 1.0 (show other bugs)
Version: Working drafts
Hardware: All All
: P1 minor
Target Milestone: ---
Assignee: Jim Melton
QA Contact: Mailing list for public feedback on specs from XSL and XML Query WGs
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2008-03-10 20:24 UTC by zhen hua liu
Modified: 2008-04-03 23:10 UTC (History)
0 users

See Also:


Attachments

Description zhen hua liu 2008-03-10 20:24:04 UTC
In section 4.2.4 formal semantics function:
fts:wordDistance() is defined as $tokenInfo2/@startPos - $tokenInfo1/@endPos -1
This is consistent with 4.2.7.9 FTDistance section on how fts:wordDistance() is invoked with fts:tokenInfo is ordered by ('$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos).

However, such consistency is not followed for sentence & paragraph computation.
 fts:sentenceDistance() is defined 
as $tokenInfo2/@startSent - $tokenInfo1/@endSent -1 in section 4.2.4.
However, fts:sentenceDistance() is involked in section 4.2.7.9 with fts:tokenInfo is order by ('$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos)
instead of ($fts:tokenInfo/@startSent, fts:tokenInfo/@endSent).
The same inconsistency applies to paragraph as well. The seciton 4.2.7.9 should invoke fts:paraDistance() with fts:tokenInfo ordered by ($fts:tokenInfo/@startPara, fts:tokenInfo/@endPara).
Comment 1 Michael Dyck 2008-03-16 22:36:40 UTC
(In reply to comment #0)
> 
> However, fts:sentenceDistance() is involked in section 4.2.7.9 with
> fts:tokenInfo is order by ('$fts:tokenInfo/@startPos, fts:tokenInfo/@endPos)
> instead of ($fts:tokenInfo/@startSent, fts:tokenInfo/@endSent).

Can you give an example where these two orderings would lead to different sentence-distances? 
Comment 2 zhen hua liu 2008-04-03 17:44:47 UTC
Per group discussion, we found the distance computation algorithm needs to be revisited as action item 169-01 by Mary Holstege. So we close this bug as invalid.