<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>7062</bug_id>
          
          <creation_ts>2009-06-29 02:53:37 +0000</creation_ts>
          <short_desc>replace terms &quot;CDATA element&quot; and &quot;RCDATA element&quot; with... something better</short_desc>
          <delta_ts>2010-10-04 14:31:49 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>pre-LC1 HTML5 spec (editor: Ian Hickson)</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>NEEDSINFO</resolution>
          
          
          <bug_file_loc>http://www.whatwg.org/specs/web-apps/current-work/multipage/syntax.html#elements-0</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Michael[tm] Smith">mike</reporter>
          <assigned_to name="Ian &apos;Hixie&apos; Hickson">ian</assigned_to>
          <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
    
    <cc>zcorpan</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>25771</commentid>
    <comment_count>0</comment_count>
    <who name="Michael[tm] Smith">mike</who>
    <bug_when>2009-06-29 02:53:37 +0000</bug_when>
    <thetext>http://www.whatwg.org/specs/web-apps/current-work/multipage/syntax.html#elements-0

The current draft uses the terms &quot;CDATA element&quot; to describe &lt;script&gt; and &lt;style&gt; elements, and &quot;RCDATA element&quot; to describe &lt;title&gt; and &lt;textarea&gt; elements.

I think it would be good to consider replacing those with other terms.

The problem is that &quot;CDATA&quot; already has a number of meanings[1] that conflict with one another.

[1]http://www.flightlab.com/~joe/sgml/cdata.html

So it seems less than ideal to introduce a new term that potentially adds even more ambiguity and confusion around what &quot;CDATA&quot; means.

All that said, I can&apos;t at the moment think of any better terms with which to replace CDATA and RCDATA and that would themselves be accurate and less confusing. 

But maybe other people have some some specific suggestions.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>25773</commentid>
    <comment_count>1</comment_count>
    <who name="Michael[tm] Smith">mike</who>
    <bug_when>2009-06-29 03:26:26 +0000</bug_when>
    <thetext>Maybe just &quot;replaceable character data&quot; for RCDATA, and &quot;non-replaceable character data&quot; for CDATA.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26311</commentid>
    <comment_count>2</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-08-08 01:41:16 +0000</bug_when>
    <thetext>I replaced CDATA with &quot;raw text&quot;. Would you like me to change RCDATA also?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26319</commentid>
    <comment_count>3</comment_count>
    <who name="Michael[tm] Smith">mike</who>
    <bug_when>2009-08-08 13:23:36 +0000</bug_when>
    <thetext>(In reply to comment #2)
&gt; I replaced CDATA with &quot;raw text&quot;. Would you like me to change RCDATA also?

If you&apos;re going with &quot;raw text&quot;, then maybe RCDATA could be &quot;replaceable text&quot; or &quot;parsed text&quot;.

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26321</commentid>
    <comment_count>4</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-08-08 19:12:32 +0000</bug_when>
    <thetext>I don&apos;t think either of those is any clearer than RCDATA, to be honest. Replaceable how? How is it parsed any more or less than &quot;raw text&quot;?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26571</commentid>
    <comment_count>5</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2009-08-20 22:16:00 +0000</bug_when>
    <thetext>RCDATA stands for &quot;replaceable character data&quot;, so &quot;replaceable text&quot; seems like a better term for it, along with a note saying it means text that can have character references.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26573</commentid>
    <comment_count>6</comment_count>
    <who name="Michael[tm] Smith">mike</who>
    <bug_when>2009-08-21 01:46:42 +0000</bug_when>
    <thetext>(In reply to comment #5)
&gt; RCDATA stands for &quot;replaceable character data&quot;, so &quot;replaceable text&quot; seems
&gt; like a better term for it, along with a note saying it means text that can have
&gt; character references.

Calling it any kind of &quot;text&quot; at all, and thus needing to add a note to say that it&apos;s text that can contain character references, is the reason I suggested &quot;replaceable character data&quot; initially.

What the spec currently defines as &quot;text&quot; cannot contain character references. Also, what it defines &quot;text&quot; has two possible forms:

   - &quot;raw&quot; text that is allowed to contain unparsed markup characters
   - &quot;non-raw&quot; text that is not allowed to contain unparsed markup characters

...where &quot;unparsed markup characters&quot; essentially means the character &quot;&lt;&quot; and the strings &quot;&lt;!--&quot; and &quot;--&gt;&quot;.

So there are three ways in which the text/html syntax allows those two forms of text to be combined with character references:

  1. non-raw text that can be combined with character references
  2. raw text that can be combined with character references (RCDATA)
  3. raw text that cannot be combined character references

One way to describe the above more succinctly is:

  1. normal character data
  2. replaceable character data
  3. non-replaceable character data

Or maybe &quot;raw character data&quot; would be a better term for #3 (which is what the spec now calls &quot;raw text&quot; and which it previously called &quot;CDATA&quot;).

But regardless, the term &quot;character data&quot; seems very useful as a general term for describing all three of those possible combinations, and each of them could be defined specifically by preceding &quot;character data&quot; with some adjective to describe what type of character data it is.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26885</commentid>
    <comment_count>7</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-09-07 10:19:05 +0000</bug_when>
    <thetext>I don&apos;t see why we&apos;d use &quot;character data&quot; instead of the simpler &quot;text&quot;.

Distinguishing between &quot;normal text&quot; and &quot;replaceable text&quot; leads to the question of in what sense &quot;normal text&quot; isn&apos;t &quot;replaceable&quot;.

So I&apos;m still at a loss for a better term than the opaque &quot;RCDATA&quot;.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>26950</commentid>
    <comment_count>8</comment_count>
    <who name="Michael[tm] Smith">mike</who>
    <bug_when>2009-09-09 04:07:29 +0000</bug_when>
    <thetext>(In reply to comment #7)
&gt; I don&apos;t see why we&apos;d use &quot;character data&quot; instead of the simpler &quot;text&quot;.

Because text, as defined in the spec, cannot contain character references. So it&apos;s useful to have a term that means &quot;text mixed with character references&quot;.

&gt; Distinguishing between &quot;normal text&quot; and &quot;replaceable text&quot; leads to the
&gt; question of in what sense &quot;normal text&quot; isn&apos;t &quot;replaceable&quot;.

Yeah, I agree that&apos;s a problem with choosing any non-opaque term.

&gt; So I&apos;m still at a loss for a better term than the opaque &quot;RCDATA&quot;.

I agree that the term &quot;RCDATA element&quot; as defined in the spec is unambiguous.

So... lacking anything new to say here and no better suggestions than what I&apos;ve given so far, I&apos;m fine with you moving this to FIXED if you don&apos;t think there are any remaining changes that could be made.
</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>33129</commentid>
    <comment_count>9</comment_count>
    <who name="Maciej Stachowiak">mjs</who>
    <bug_when>2010-03-14 14:48:21 +0000</bug_when>
    <thetext>This bug predates the HTML Working Group Decision Policy.

If you are satisfied with the resolution of this bug, please change the state of this bug to CLOSED. If you have additional information and would like the editor to reconsider, please reopen this bug. If you would like to escalate the issue to the full HTML Working Group, please add the TrackerRequest keyword to this bug, and suggest title and text for the tracker issue; or you may create a tracker issue yourself, if you are able to do so. For more details, see this document:
  http://dev.w3.org/html5/decision-policy/decision-policy.html

This bug is now being moved to VERIFIED. Please respond within two weeks. If this bug is not closed, reopened or escalated within two weeks, it may be marked as NoReply and will no longer be considered a pending comment.
</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>