<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>5852</bug_id>
          
          <creation_ts>2008-07-10 11:03:26 +0000</creation_ts>
          <short_desc>Incorporate Entity Reference Table into Authoring Guide</short_desc>
          <delta_ts>2015-10-05 00:56:01 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>HTML WG</product>
          <component>HTML5 Reference (editor: Lachlan Hunt)</component>
          <version>unspecified</version>
          <rep_platform>PC</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc>http://dev.w3.org/html5/html-author/charref</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Lachlan Hunt">lachlan.hunt</reporter>
          <assigned_to name="Lachlan Hunt">lachlan.hunt</assigned_to>
          <cc>aaz</cc>
    
    <cc>mike</cc>
    
    <cc>olivier.gendrin</cc>
    
    <cc>public-html-admin</cc>
    
    <cc>zcorpan</cc>
          
          <qa_contact name="HTML WG Bugzilla archive list">public-html-bugzilla</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>21122</commentid>
    <comment_count>0</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2008-07-10 11:03:26 +0000</bug_when>
    <thetext>Obviously, the way this is listed in the HTML5 spec is aimed at implementers, and is totally unusable by authors.  This will need significant imporovement for the authoring guide.

Things to do before this can be done:

* Work out what information is useful for authors
  - Entity Ref (&amp;foo;)
  - Equivalent Hexadecimal reference (&amp;#x0000;)
  - Equivalent Decimal Reference (&amp;#0;)
  - Name of character
  - Image of character (especially in cases where the user may not have appropriate fonts avaialable)
  - Human readable description
  - Links to related characters

* Provide some way to quickly and easily search or filter the list.
* Should it be broken into some sort of categories?  If so, how?
* Should authors be able to show and hide interesting/uninteresting colums?
* It might also be useful to indicate which entities will still work without the trailing semicolon, even though that&apos;s non-conforming in HTML5.

Here are some existing tables covering HTML4 entity references.

http://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references
http://www.cookwood.com/html/extras/entities.html
http://www.digitalmediaminute.com/reference/entity/index.php

The wikipedia table contains additional info about which standard and DTD it was introduced in. That stuff isn&apos;t really relevant to HTML5.  The digitalmediaminute page takes an interesting approach, instead of a traditional table layout.

Additional feature requests and suggestions welcome.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>21254</commentid>
    <comment_count>1</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2008-07-19 21:03:34 +0000</bug_when>
    <thetext>Initial draft created. (See URL)

Contains several known bugs:

* Major layout bugs in Opera, minor bugs with Firefox
* Doesn&apos;t handle characters with either long or many named character references well
* Will need to provide images of glyphs, most system fonts still lack too many (or provide the wrong glyphs for some characters!)
* Need to add Unicode Category selection and filtering support</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>21258</commentid>
    <comment_count>2</comment_count>
    <who name="Lachlan Hunt">lachlan.hunt</who>
    <bug_when>2008-07-21 09:33:18 +0000</bug_when>
    <thetext>Useful Unicode tools:
http://rishida.net/scripts/uniview/
http://rishida.net/scripts/uniview/descn
http://rishida.net/scripts/uniview/conversion

Could obtain glyph images from here.
http://decodeunicode.org/</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>21270</commentid>
    <comment_count>3</comment_count>
    <who name="Simon Pieters">zcorpan</who>
    <bug_when>2008-07-22 09:08:49 +0000</bug_when>
    <thetext>I would prefer if the leading zeros were dropped in the hex NCR and that the NCRs were always visible -- not just on hover (I think the hover effect is a bit annoying). I&apos;d also like to have the Unicode name of the character visible so that I can search the page using my browser search feature.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>