This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019.

Bug 19743 - Align HTML syntax and URL syntax with respect to Unicode
Status: RESOLVED FIXED
Alias: None
Product: WHATWG
Classification: Unclassified
Component: URL
Version: unspecified
Hardware: PC Windows 3.1
Importance: P2 normal
Target Milestone: Unsorted
Assignee: Anne
QA Contact: sideshowbarker+urlspec
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-10-27 21:48 UTC by Anne
Modified: 2014-01-15 14:10 UTC
CC List: 5 users

Description Anne 2012-10-27 21:48:46 UTC
The URL Standard disallows code points U+FFF0 to U+FFFD, which HTML does not disallow.

Compare: 

http://url.spec.whatwg.org/#url-units
http://www.whatwg.org/specs/web-apps/current-work/multipage/parsing.html#preprocessing-the-input-stream

See also http://tools.ietf.org/html/rfc3987#section-2.2.

It is not entirely clear to me why this difference is justified and I plan on aligning URL with HTML with regard to this in due course.
Comment 1 Anne 2014-01-15 14:10:21 UTC
Since these will be percent-encoded anyway, I don't see any reason to exclude them.

https://github.com/whatwg/url/commit/1767998279617ff773cf7f0f99d9013949f921db
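
For illustration only, here is a minimal sketch of what "percent-encoded anyway" means for the range in question. It assumes UTF-8 as the encoding and uses Python's urllib.parse.quote as a stand-in for a URL serializer; it is not the URL Standard's parser, and the helper name percent_encode_code_point is hypothetical.

```python
from urllib.parse import quote


def percent_encode_code_point(cp: int) -> str:
    """UTF-8 encode a single code point and percent-encode every byte.

    Hypothetical helper for illustration; safe="" means no character
    is left unescaped.
    """
    return quote(chr(cp), safe="")


# The range named in the report: U+FFF0 through U+FFFD inclusive.
encoded = {cp: percent_encode_code_point(cp) for cp in range(0xFFF0, 0xFFFE)}

for cp, text in encoded.items():
    print(f"U+{cp:04X} -> {text}")
```

Under these assumptions every code point in the range comes out as three percent-encoded bytes (for example, U+FFFD becomes %EF%BF%BD), so the serialized URL contains only ASCII either way.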