<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>21897</bug_id>
          
          <creation_ts>2013-05-02 01:18:44 +0000</creation_ts>
          <short_desc>Definition of &apos;valid URLs&apos; points to the URL standard, which lacks a requirement to escape spaces</short_desc>
          <delta_ts>2013-05-02 11:26:08 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>HTML5 spec</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows 3.1</op_sys>
          <bug_status>CLOSED</bug_status>
          <resolution>WORKSFORME</resolution>
          
          
          <bug_file_loc>http://www.w3.org/html/wg/drafts/html/master/infrastructure.html#valid-url</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Leif Halvard Silli">xn--mlform-iua</reporter>
          <assigned_to name="This bug has no owner yet - up for the taking">dave.null</assigned_to>
          <cc>annevk</cc>
    
    <cc>mike</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>public-html-wg-issue-tracking</cc>
    
    <cc>shadow2531</cc>
    
    <cc>xn--mlform-iua</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>87134</commentid>
    <comment_count>0</comment_count>
    <who name="Leif Halvard Silli">xn--mlform-iua</who>
    <bug_when>2013-05-02 01:18:44 +0000</bug_when>
    <thetext>ISSUE: 

1) The NU validator considers it an error if a string that is
   supposed to be a URL contains an unescaped space character.
   (Percent-encoding in valid URLs is common knowledge.)

2) HTML5 says &quot;A URL is a valid URL if it conforms to the
   authoring conformance requirements in the URL standard.&quot;
   http://www.w3.org/html/wg/drafts/html/master/infrastructure.html#valid-url

3) HOWEVER, the URL standard has no requirement that
   spaces be percent-encoded.

NOTE:

   This probably applies to other characters that need escaping too.

PROPOSAL: 

* If the URL standard editor plans to add this requirement, then clarify that this is currently not defined by the URL standard.

* If the URL standard editor has no such plans, then define the requirement in HTML 5.1. (I believe HTML5 CR does not have this issue.)</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>87146</commentid>
    <comment_count>1</comment_count>
    <who name="Anne">annevk</who>
    <bug_when>2013-05-02 09:24:23 +0000</bug_when>
    <thetext>U+0020 is not a URL unit (unless you write it %20).</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>87159</commentid>
    <comment_count>2</comment_count>
    <who name="Leif Halvard Silli">xn--mlform-iua</who>
    <bug_when>2013-05-02 11:26:08 +0000</bug_when>
    <thetext>(In reply to comment #1)
&gt; U+0020 is not a URL unit (unless you write it %20).

My bad ... for not seeing that the &apos;URL code points&apos; paragraph does not list the space character.

I think the section ought to explicitly mention - perhaps in a note - that U+0009, U+000A, and U+000D are to be escaped. Currently, the only way to grok it is to notice that space is not listed in the code points list. But that&apos;s a bug against the URL standard.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>