<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>6670</bug_id>
          
          <creation_ts>2009-03-09 18:49:15 +0000</creation_ts>
          <short_desc>Allow unescaped &amp;s, at least in attributes that accept URLs</short_desc>
          <delta_ts>2011-11-23 07:09:37 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>pre-LC1 HTML5 spec (editor: Ian Hickson)</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>VERIFIED</bug_status>
          <resolution>WONTFIX</resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords>NoReply</keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Ian &apos;Hixie&apos; Hickson">ian</reporter>
          <assigned_to name="Ian &apos;Hixie&apos; Hickson">ian</assigned_to>
          <cc>jens</cc>
    
    <cc>julian.reschke</cc>
    
    <cc>mathias</cc>
    
    <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>24111</commentid>
    <comment_count>0</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-03-09 18:49:15 +0000</bug_when>
    <thetext>Based on data from Google, requiring &amp;s to be escaped in URL-heavy pages (like most major sites) causes something like a 6% increase in file size. This error seems more common than the error it is trying to catch (accidentally forgetting the semicolon in an entity), but it would leave sites exposed to accidental entities, e.g. in the case of:

   http://example.com?cut&amp;copy;paste;color&amp;style&amp;bold</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24122</commentid>
    <comment_count>1</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-03-10 00:55:24 +0000</bug_when>
    <thetext>maybe only if followed by alphanumeric ASCII and an equals sign?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24147</commentid>
    <comment_count>2</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2009-03-11 08:52:44 +0000</bug_when>
    <thetext>It is useful to know if a semi-colon has accidentally been omitted from an entity reference, or if you&apos;ve unintentionally used a sequence that looks like an entity reference.  This could be defined so that it is valid when:

1. The value is alphanumeric ASCII followed an equals sign
2. The value does not match one of the defined entity references.

So the most common cases of &quot;?x&amp;foo=1&quot; would be valid.
But &quot;?x&amp;copy=1&quot; would be invalid because &amp;copy; is an entity and, in this case, would be resolved to &quot;?x©=1&quot;, which is not likely what the author wants.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>24474</commentid>
    <comment_count>3</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-04-01 01:50:10 +0000</bug_when>
    <thetext>&quot;?x&amp;copy=1&quot; in an attribute would not be treated as an entity, though, so the author still wouldn&apos;t be affected... Are you sure we need to check for matches even though they won&apos;t be handled as such? Maybe I don&apos;t really understand what you are proposing.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>25351</commentid>
    <comment_count>4</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-05-29 00:15:51 +0000</bug_when>
    <thetext>I&apos;ve tried to do this. The spec text for this is highly unintuitive, but I hope it matches practical intuition more than the previous text. I&apos;m not compeltely convinced that this is a good idea, so let me know if you think this should be changed back.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>25426</commentid>
    <comment_count>5</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2009-06-02 13:16:17 +0000</bug_when>
    <thetext>(In reply to comment #3)
&gt; &quot;?x&amp;copy=1&quot; in an attribute would not be treated as an entity

Yes it would because &quot;&amp;copy&quot; is one of the entity references that needs to be supported without the use of the semi-colon on the end.  Try it.

http://software.hixie.ch/utilities/js/live-dom-viewer/?%3C!DOCTYPE%20html%3E%0A%3Ca%20href%3D%22%3Fx%26copy%3D1%22%3Etest%3C%2Fa%3E</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>25569</commentid>
    <comment_count>6</comment_count>
    <who name="Ian &apos;Hixie&apos; Hickson">ian</who>
    <bug_when>2009-06-12 22:24:40 +0000</bug_when>
    <thetext>Reverted.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>33068</commentid>
    <comment_count>7</comment_count>
    <who name="Maciej Stachowiak">mjs</who>
    <bug_when>2010-03-14 13:17:02 +0000</bug_when>
    <thetext>This bug predates the HTML Working Group Decision Policy.

If you are satisfied with the resolution of this bug, please change the state of this bug to CLOSED. If
you have additional information and would like the editor to reconsider, please reopen this bug. If you would like to escalate the issue to the full HTML Working Group, please add the TrackerRequest keyword to this bug, and suggest title and text for the tracker issue; or you may create a tracker issue yourself, if you are able to do so. For more details, see this document:
   http://dev.w3.org/html5/decision-policy/decision-policy.html

This bug is now being moved to VERIFIED. Please respond within two weeks. If this bug is not closed, reopened or escalated within two weeks, it may be marked as NoReply and will no longer be considered a pending comment.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>49632</commentid>
    <comment_count>8</comment_count>
    <who name="Julian Reschke">julian.reschke</who>
    <bug_when>2011-06-15 10:07:49 +0000</bug_when>
    <thetext>Apparently, this change *was* applied later on. Is there another bug related to this?</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>