Pencil and Paper / Reduced UTF-8

The list of HTML Entities is very long and many of the character table entries find little use in the US.  In any event, the HTML Entities have been superceeded by Unicodes.  Some characters, and combinations still need "extra" data to make sense in a diverse Public Sector + Private Sector ecosystem.  On a Regional and Community level, shared data often comes in the form of subsets of large federalized sets, and there is no good reason to redefine the point names of the data, but rather to just describe, in plain text, the new subset and perhaps the URI source.


This scheme, or something like it will do that job.  http://tinyurl.com/reduced-charset


--Gannon

Received on Saturday, 29 December 2012 19:36:59 UTC