<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>2877</bug_id>
          
          <creation_ts>2006-02-15 08:24:01 +0000</creation_ts>
          <short_desc>Terminology data category</short_desc>
          <delta_ts>2006-07-21 17:49:11 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>ITS</product>
          <component>ITS tagset</component>
          <version>WorkingDraft</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows XP</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>DUPLICATE</resolution>
          <dup_id>2969</dup_id>
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>LastCall20May</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Felix Sasaki">fsasaki</reporter>
          <assigned_to name="Felix Sasaki">fsasaki</assigned_to>
          
          
          <qa_contact name="ITS mailing-list">public-i18n-its</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>8265</commentid>
    <comment_count>0</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2006-02-15 08:24:02 +0000</bug_when>
    <thetext>TBD until last call.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8307</commentid>
    <comment_count>1</comment_count>
    <who name="Christian Lieske">christian.lieske</who>
    <bug_when>2006-02-17 11:11:16 +0000</bug_when>
    <thetext>While working on data category &quot;terminology&quot; I stumbled across some questions, 
which may pop up elsewhere as well:

1. What if the host vocabulary already has markup related to terms (see for 
example DITA and DocBook)? Do we recommend keeping it and mapping it via a 
documentRule? If so: Can this recommendation be generalized, and thus for 
example become part of the introduction to data categories?

2. What if the host vocabulary and our ITS markup related to terms only share 
some commonalities? Example: The DITA &quot;term&quot; element allows more than just one 
attribute with additional information? Do we suggest to

a. move stuff from ITS into host vocabulary
   
  &lt;dita:term its:dir=&quot;ltr&quot;&gt;PlateBroiler&lt;/dita:term&gt;

b. move stuff from host vocabulary into ITS

  &lt;its:term dita:platform=&quot;CoolOS&quot;&gt;PlateBroiler&lt;/its:term&gt;

Or do we suggest something completely different?

3. What if we have a clash of the information from the namespace of the host 
vocabulary and the ITS namespace? Example

&lt;head&gt;
  &lt;documentRule its:term=&quot;yes&quot; its:termSelector=&quot;//dita:term&quot;&gt;
&lt;/head&gt;
&lt;body&gt;
  &lt;p&gt;The highly visible &lt;dita:term dita:translate=&quot;no&quot;&gt;PlateBroiler&lt;/term&gt; ...
&lt;/body&gt;

4. What if the host vocabulary and ITS differ with regard to one of the 
following:

4.1 content model (for example PCDATA vs. mixed)
4.2 data type (for example NMTOKEN vs. CDATA)

In addition, I stumbled across some things which may only be relevant for the 
term data category

5. The &quot;termRef&quot; is a URI which consist of a termbase identifier prefix and a 
term identified suffix. Example:

&lt;its:documentRules&gt;
 &lt;its:documentRule its:term=&quot;yes&quot; its:termSelector=&quot;/body/p[1]/span&quot;
its:termRef=&quot;http://example.com/termdatabase/#x142539&quot;/&gt;
&lt;/its:documentRules&gt;

I wonder if there is a need to &quot;factor out&quot; the termbase identifier, since it 
will be the same for possibly dozens of terms. Example:

&lt;its:documentRules termBaseRef=&quot;http://example.com/termdatabase/# &quot;&gt;
 &lt;its:documentRule its:term=&quot;yes&quot; its:termSelector=&quot;/body/p[1]/span&quot;
its:termRef=&quot;x142539&quot;/&gt;
&lt;/its:documentRules&gt;

6. I wonder if we need a recommendation related to Yomigana (phonetic strings; 
see http://esw.w3.org/topic/its0503ReqTermIdentification). We currently have 
not foreseen this as part of the term data category. I could  imagine a 
recommendation like &apos;Use &quot;termRef&quot; and put the Yomigana into your termbase&apos;.




</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8524</commentid>
    <comment_count>2</comment_count>
    <who name="Yves Savourel">ysavourel</who>
    <bug_when>2006-03-02 20:53:45 +0000</bug_when>
    <thetext>Regarding 2:

I would think that in general, the markup of the host language should (and I 
am tempted to say must) be used if it offer an ITS-equivalent semantic.


Regarding 5:

I would think separating termRef into termRef and termBaseRef may make things 
a bit complicated from the viewpoint of implementation. It would be OK in a 
documentRule/s but locally knowing which termBase is to use may be more 
difficult.

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>8529</commentid>
    <comment_count>3</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2006-03-03 00:08:02 +0000</bug_when>
    <thetext>Im trying to answer the questions using the proposals from the ITS f2f in Mandelieu.

(In reply to comment #1)
&gt; While working on data category &quot;terminology&quot; I stumbled across some questions, 
&gt; which may pop up elsewhere as well:
&gt; 
&gt; 1. What if the host vocabulary already has markup related to terms (see for 
&gt; example DITA and DocBook)? Do we recommend keeping it and mapping it via a 
&gt; documentRule? If so: Can this recommendation be generalized, and thus for 
&gt; example become part of the introduction to data categories?

I agree with Yves. One addition: I would say

&lt;its:termRule its:select=&quot;//qterm&quot;/&gt;
selects the &lt;qterm&gt; element and says &quot;this is a term in the semantics of ITS&quot;.

&lt;its:termRule its:select=&quot;//qterm&quot;
its:termRef=&quot;http://www.example.com/termbase/#entry2332&quot;/&gt;
does in addition &quot;adding&quot;, that is adding the term reference.

&lt;its:termRule its:select=&quot;//qterm&quot; its:termRefMap=&quot;@someTermRef&quot;/&gt;
would be instead of &quot;adding&quot; a &quot;pass trough&quot; of term reference information.
Maybe the name @termRefContent would be more approriate? ;)

&gt; 
&gt; 2. What if the host vocabulary and our ITS markup related to terms only share 
&gt; some commonalities?

I&apos;d say we can select everthing which has less or equal compositional semantics
as ITS. As for the terminology data category, our semantics has the parts &quot;this
is a term&quot; and &quot;this is a term reference&quot;. Everything in an existing vocabulary
that can selected by these semantic components IMO should be selected.

&gt; Example: The DITA &quot;term&quot; element allows more than just one 
&gt; attribute with additional information? Do we suggest to
&gt; 
&gt; a. move stuff from ITS into host vocabulary
&gt;    
&gt;   &lt;dita:term its:dir=&quot;ltr&quot;&gt;PlateBroiler&lt;/dita:term&gt;
&gt; 
&gt; b. move stuff from host vocabulary into ITS
&gt; 
&gt;   &lt;its:term dita:platform=&quot;CoolOS&quot;&gt;PlateBroiler&lt;/its:term&gt;
&gt; 
&gt; Or do we suggest something completely different?

you could do &lt;its:termRule its:select=&quot;//dita:term&quot;/&gt;
but I would not know what to do about the @dita:platform attribute.

&gt; 
&gt; 3. What if we have a clash of the information from the namespace of the host 
&gt; vocabulary and the ITS namespace? Example
&gt; 
&gt; &lt;head&gt;
&gt;   &lt;documentRule its:term=&quot;yes&quot; its:termSelector=&quot;//dita:term&quot;&gt;
&gt; &lt;/head&gt;
&gt; &lt;body&gt;
&gt;   &lt;p&gt;The highly visible &lt;dita:term dita:translate=&quot;no&quot;&gt;PlateBroiler&lt;/term&gt; ...
&gt; &lt;/body&gt;

You have two tasks: identifing &lt;dita:term&gt; as a term in the sense of ITS, and
the content of this element as not being translatable. I would keep the tasks
separate, so have
&lt;its:termRule its:selector=&quot;//dita:term&quot;/&gt; and
&lt;its:translateRule its:selector=&quot;//dita:term&quot; its:translate=&quot;yes&quot;/&gt;

&gt; 
&gt; 4. What if the host vocabulary and ITS differ with regard to one of the 
&gt; following:
&gt; 
&gt; 4.1 content model (for example PCDATA vs. mixed)
&gt; 4.2 data type (for example NMTOKEN vs. CDATA)

Same as above: we can select everthing for a data category which has less or
equal compositional semantics as ITS. More fine grained information about
content models or data types will be lost.

&gt; 
&gt; In addition, I stumbled across some things which may only be relevant for the 
&gt; term data category
&gt; 
&gt; 5. The &quot;termRef&quot; is a URI which consist of a termbase identifier prefix and a 
&gt; term identified suffix. Example:
&gt; 
&gt; &lt;its:documentRules&gt;
&gt;  &lt;its:documentRule its:term=&quot;yes&quot; its:termSelector=&quot;/body/p[1]/span&quot;
&gt; its:termRef=&quot;http://example.com/termdatabase/#x142539&quot;/&gt;
&gt; &lt;/its:documentRules&gt;
&gt; 
&gt; I wonder if there is a need to &quot;factor out&quot; the termbase identifier, since it 
&gt; will be the same for possibly dozens of terms. Example:
&gt; 
&gt; &lt;its:documentRules termBaseRef=&quot;http://example.com/termdatabase/# &quot;&gt;
&gt;  &lt;its:documentRule its:term=&quot;yes&quot; its:termSelector=&quot;/body/p[1]/span&quot;
&gt; its:termRef=&quot;x142539&quot;/&gt;
&gt; &lt;/its:documentRules&gt;

I would not factor it out, since people might as well point to a place in the
current document.

&gt; 
&gt; 6. I wonder if we need a recommendation related to Yomigana (phonetic strings; 
&gt; see http://esw.w3.org/topic/its0503ReqTermIdentification). We currently have 
&gt; not foreseen this as part of the term data category. I could  imagine a 
&gt; recommendation like &apos;Use &quot;termRef&quot; and put the Yomigana into your termbase&apos;.

I don&apos;t think we need that.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>9341</commentid>
    <comment_count>4</comment_count>
    <who name="Felix Sasaki">fsasaki</who>
    <bug_when>2006-04-20 08:32:27 +0000</bug_when>
    <thetext>

*** This bug has been marked as a duplicate of 2969 ***</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>