<?xml version="1.0" encoding="utf-8"?>
<!-- $Id: character-set.xml,v 1.61 2009/11/08 16:59:37 dcarlis Exp $ -->
<!DOCTYPE spec [<!ENTITY date "20091022">]>
<spec w3c-doctype="wd">
<header>
<title>XML Entity definitions for Characters</title>
<w3c-designation>xml-entity-names-&date;</w3c-designation>
<w3c-doctype>W3C Working Draft</w3c-doctype>
<pubdate><day>22</day> <month>October</month> <year>2009</year></pubdate>
<publoc>
<loc href="http://www.w3.org/TR/2009/WD-xml-entity-names-&date;/">http://www.w3.org/TR/2009/WD-xml-entity-names-&date;/</loc>
</publoc>
<latestloc>
<loc href="http://www.w3.org/TR/xml-entity-names/">http://www.w3.org/TR/xml-entity-names/</loc>
</latestloc>
<prevlocs>
  <loc href="http://www.w3.org/TR/2008/WD-xml-entity-names-20080721/">http://www.w3.org/TR/2008/WD-xml-entity-names-20080721/</loc>
<!--
  <loc href="http://www.w3.org/TR/2007/WD-xml-entity-names-20071214/">http://www.w3.org/TR/2007/WD-xml-entity-names-20071214/</loc>
-->
</prevlocs>
<authlist>
<author>
<name>David Carlisle</name>
<affiliation>NAG</affiliation>
</author>
<author>
<name>Patrick Ion</name>
<affiliation>Mathematical Reviews, American Mathematical Society</affiliation>
</author>
</authlist>
<errataloc href="http://www.w3.org/2003/entities/2007doc/errata.html"/>
<status id="status">


<p><emph> This section describes the status of this document at the time
of its publication. Other documents may supersede this document. A
list of current W3C publications and the latest revision of this
technical report can be found in the <loc
href="http://www.w3.org/TR/">W3C technical reports index</loc> at
http://www.w3.org/TR/.</emph>
</p> 

<p>
This document is a W3C Last Call Public Working Draft produced by 
the <loc href="http://www.w3.org/Math/">W3C Math Working Group</loc> as 
part of the W3C <loc href="http://www.w3.org/Math/Activity">Math
Activity</loc>.The goals of the W3C Math Working Group are
discussed in the <loc
href="http://www.w3.org/Math/Documents/Charter2006.html"> W3C Math WG
Charter</loc>.  The authors of this document are W3C Math Working
Group members.
</p>
<p>
Publication as a Working Draft does not imply endorsement by the 
W3C Membership. This is a draft document and may be updated, 
replaced or obsoleted by other documents at any time. It is 
inappropriate to cite this document as other than work in progress.
</p>
<p>
This document contains the W3C Last Call Working Draft of the 
XML Entity definitions for Characters.  
The Last Call period ends on 18 November 2009.
</p>
<p>
Feedback should be sent to the  
<loc href="mailto:www-math@w3.org">Public W3C Math mailing list</loc>
(<loc href="http://lists.w3.org/Archives/Public/www-math/">list archives</loc>).
When sending an e-mail comment that you expect to be addressed in the 
final XML Entity definitions for Characters, please put the 
text &#x201C;last-call&#x201D; in the subject line, preferably like this: 
 &#x201C;[Entities-last-call] &#x2026;summary of comment &#x201D;.
</p>

<p> This document was produced by a group operating under the 
<loc href="http://www.w3.org/Consortium/Patent-Policy-20040205/">5 February 2004 W3C Patent Policy</loc>. 
W3C maintains a 
<loc role="disclosure" href="http://www.w3.org/2004/01/pp-impl/35549/status">
public list of any patent disclosures</loc> made in connection with 
the deliverables of the group; that page also includes instructions 
for disclosing a patent. An individual who has actual knowledge of 
a patent which the individual believes contains 
<loc href="http://www.w3.org/Consortium/Patent-Policy-20040205/#def-essential">Essential Claim(s)</loc> 
must disclose the information in accordance with 
<loc href="http://www.w3.org/Consortium/Patent-Policy-20040205/#sec-Disclosure">section 6 of the W3C Patent Policy</loc>. 
</p>

</status>
<abstract id="abstract">
<p>
This document defines several sets of names which are assigned to 
Unicode characters. Each of these sets is also implemented as a file 
of XML entity declarations.
</p>
</abstract>

<langusage>
<language id="en">English</language>
</langusage>

<revisiondesc>
<p>
First draft, derived from the MathML2 sources.
</p>
<p>
Second draft, incorporating comments from Karl Tomlinson, 
Ian Hickson and others.
</p>
<p>
Final Last Call draft, incorporating new comments from many and 
ensuring that the listings are fully up-to-date with W3C and 
Unicode development.
</p>
</revisiondesc>

</header>
<body>

<div1 id="chars_intro"><head>Introduction</head>


<p>
Notation and symbols have proved very important for scientific
documents, especially in mathematics. Mathematics has grown in part
because <phrase>its notation continually changes toward being succinct
and suggestive</phrase>. There have been many new signs
<phrase>developed</phrase> for use in mathematical notation, and
mathematicians have not held back from making use of many symbols
originally <phrase>introduced</phrase> elsewhere. The result is that
science in general, and particularly mathematics, makes use of 
a very large collection of symbols.  It is
difficult to write science fluently if these characters are not
available for use. It is difficult to read science if
corresponding glyphs are not available for presentation on specific
display devices. In the majority of cases it is preferable to store
characters directly as Unicode character data or as XML numeric
character references.  However, in some environments it is more
convenient to use the ASCII input mechanism provided by XML entity
references. Many entity names are in common use, and this 
specification aims to provide standard mappings to Unicode for each of
these names. It introduces no names that have not already been used in
earlier specifications.
</p>
<p>
Specifically, the entity names in the sets
starting with the letters <quote>iso</quote> were first standardized in SGML (<bibref
ref="SGML"/>) and updated in <bibref ref="ISO9573-13-1991"/>.
The  W3C Math Working Group has been
invited to take over the maintenance and development of these sets by the
original standards committee (ISO/IECJTC1 SC34). The sets with names
starting  <quote>mml</quote> were first standardized in 
MathML <bibref ref="MathML2"/> and those starting
with <quote>xhtml</quote> were first standardized in HTML <bibref ref="HTML4"/>.
</p>

</div1>

<div1 id="sets">
<head>Sets of names</head>
<p>This specification defines mappings to Unicode of many sets of names
that have been defined by earlier specifications.</p>
<p>We first present two tables listing all the sets combined, first in Unicode order and then in alphabetic order:
<ulist>
<item><p>All in <loc href="bycodes.html">Unicode order</loc></p></item>
<item><p>All in <loc href="byalpha.html">alphabetic order</loc>.</p></item>
</ulist>
</p>
<p>Then there come tables
documenting each of the entity sets. Each set has a link to the DTD
entity declaration for the corresponding entity set, and also a link
to an XSLT2 stylesheet that will implement a reverse mapping from
characters to entity names (this is, of course, only possible for  entity names
that map to a single Unicode code point).
</p>
<p>In addition to the
stylesheets and entity files corresponding to each individual entity
set, a <loc
href="http://www.w3.org/2003/entities/2007/entitynamesmap.xsl">combined
stylesheet</loc> is provided, as well as two combined sets of DTD
entity declarations. The first is a <loc
href="http://www.w3.org/2003/entities/2007/w3centities.ent">small file
which includes all the other entity files via parameter entity
references</loc>; the second is a <loc
href="http://www.w3.org/2003/entities/2007/w3centities-f.ent">larger
file that directly contains a definition of each entity, with all
duplicates removed</loc>.
</p>
</div1>

<div1 id="blocks">
<head>Unicode Character Blocks for Scientific Documents</head>
<p>
Certain characters are of of particular relevance to scientific document production. The following tables display 
Unicode ranges containing the characters that are most used in mathematics.
</p>
</div1>


<div1 id="alphabets">
<head>Mathematical Alphanumeric Characters</head>
<p>
Many of the entities defined by this specification relate to the
mathematical alphanumeric characters contained in the letter-like
symbols block of Unicode Plane 0, or in the Mathematical Alphanumeric Symbols
block in Unicode Plane 1.  The following tables list all these symbols,
highlighting those that are not in Plane 1, and giving entity names
where appropriate.
</p>
</div1>


<div1 id="combining">
<head>Entities for Negated and Variant Characters</head>
<p>
The majority of entity definitions in the specification expand to a
single Unicode character, however some use multiple character
combinations, as outlined in this section.
</p>

<div2 id="chars_math-negated-tables">
<head>Negated Mathematical Characters</head>

<p>
In addition to the Unicode Characters so far listed, one may use the
combining characters <phrase diff="chg">U+0338</phrase> (/), 
<phrase diff="chg">U+20D2</phrase> (|) and 
<phrase diff="chg">U+20E5</phrase> (\) to produce
negated or <phrase diff="chg">canceled</phrase> forms of 
characters. A combining character
should be placed immediately after its <quote>base</quote> character, with no
intervening markup or space, just as is the case for combining accents.
</p>

<p>
In principle, the negation characters may be applied to any Unicode
character, although fonts designed for mathematics typically have some
negated glyphs ready composed. A MathML renderer should be able to use
these pre-composed glyphs in these cases.  A compound character code
either represents a UCS character that is already available, as in the
case of <phrase diff="chg">U+003D U+0338</phrase> which amounts to 
<phrase diff="chg">U+2260</phrase>, or it does not as is the
case for <phrase diff="chg">U+2202 U+0338</phrase>. The common cases of 
negations, <phrase diff="chg">of the latter  type</phrase>,
that have been identified are listed in the tables.</p>

<p id="cancellations"/>

<p>
Note that it is the policy of the W3C and of Unicode that if a single
character is already defined for what can be achieved with a combining
character, that character must be used instead of the decomposed form.
It is also intended that no new single characters representing what
can be done by with existing compositions will be introduced. 
<phrase diff="add">For further information on these matters see
the Unicode Standard Annex 15, Unicode Normalization Forms 
<bibref ref="Unicode15"/>, especially
the discussion of Normalization Form C.</phrase>
</p>


</div2>

<div2 id="chars_math-variant-tables"><head>Variant
Mathematical Characters</head>

<p>
Unicode attempts to avoid having several character codes for simple
font variants.  For a code point to be assigned there should be
more than a nuance in glyphs to be recorded.  To record 
variants worth noting there is a special character <phrase diff="chg">in</phrase>
Unicode 3.2, U+FE00 (VARIATION SELECTOR-1), which
acts as a postfix modifier.  However the legally allowed
combinations with this variation selector are restricted to a
list recorded as part of Unicode. The  VARIATION SELECTOR-1
character may only be applied to the characters listed here.
The resulting combination is not regarded by Unicode as a separate
character, but a variation on the base character. Unicode aware systems
may render the combination as the base if the available fonts do not
support the variant glyph shape.
</p>

<p id="variants"/>

</div2>

</div1>

</body>
<back>
  <div1 id="oddities">
    <head>Special Considerations</head>
    <div2 id="epsilon">
      <head>Epsilon</head>
      <p>Historically there has been much confusion and lack of
      agreement over variant forms for lower case epsilon.</p>
      <p>This specification uses the definitions below. Note that the
      name <kw>epsilon</kw> is used for the character used in textual
      Greek (U+03B5) and <kw>varepsilon</kw> used for the epsilon
      symbol character more commonly used in mathematics
      (U+03F5). Note that this usage is compatible with the naming of
      similar pairs of characters (for example <kw>theta</kw>,
      <kw>vartheta</kw>) but <emph>incompatible</emph> with the naming
      convention used in TeX, MathML2 and some earlier mappings of the
      ISO entity sets to Unicode.</p>
      <p id="epsilontab"/>
    </div2>
    <div2 id="phi">
      <head>Phi</head>
      <p>The situation for phi is very similar to that of epsilon,
      although with the further complication that early versions of
      Unicode had the sample glyphs for U+03C6 and U+03D5 in swapped
      from the current usage, and some older fonts still in use follow
      that older convention. The definitions used in this
      specification are as listed below.</p>
      <p id="phitab"/>
    </div2>

<div2 id="chars_math-multiple-tables">
<head>Multiple Character Entities</head>
<p>
In addition to the combining and variant character combinations
listed in the previous sections,
the following table lists the remaining entity replacement texts that
consist of more than one character.
</p>

<p id="multiple"/>

<p>Unicode does not have an fj character, although the other common f ligatures such as fi (U+FB01) are contained in the Alphabetic Presentation Forms block.
The <kw>fjlig</kw> entity is mapped to the pair of characters <quote>fj</quote>, modern typesetting engines should automatically use the fj ligature for this combination ligature if the font supplies such a ligature.</p>
<p>Unicode has a range of space characters (including all multiples of
1/18 em up to 6/18, except for 5/18 em) thus this entity is made from
a pair of space characters.  An alternative would have been to use
U+2005 (1/4 em), but 1/4 em is not equal to 5/18 em, so the above definition was
chosen, despite the fact that the difference is unlikely to be visibly
noticable at most typeset font sizes.</p>
<p>The entities <kw>race</kw> and <kw>acE</kw> denote underlined
characters for which Unicode does not have codepoints, thus combining
underline characters have been used, in a way analogous to the use of
combining strokes for negated operators.</p>
<p>For reasons explained further in <bibref ref="Charmod-norm"/>, it is
unadvisable to to start the replacement text of an entity with a
combining character, as then potentially different results may be
produced depending on the order in which entity expansion and Unicode
normalisation are performed. As far as possible this specification
uses non combining characters, however in the three cases shown above
Unicode only has combining forms of the accents, and so the entity
replacement text starts with a space, to avoid the possibility that
the expansion of the entity combines with preceding text.</p>
</div2>

  </div1>
  <div1 id="changes">
    <head>Changes</head>
    <div2 id="changes20080721">
      <head>Changes since 2008-07-21</head>
      <p>The html5-uppercase set is now documented.</p>
      <p>The entities <kw>ohm</kw> and <kw>angst</kw> have changed to U+03A9 and U+00C5 to match NFC. See 
      <loc href="http://www.w3.org/Bugs/Public/show_bug.cgi?id=5897">w3c bugzilla entry</loc>.</p>
      <p>The entity <kw>race</kw>, which had been erroneously assigned to U+29DA,
      is now assigned to the combination U+223D U+0331. ( U+223D isn't
      quite the shape shown in the original ISO document which is a
      rotated S rather than a rotated tilde, but this appears to be
      the closest character in Unicode 5.2.)</p>
      <p>The entities <kw>bsolhsub</kw> and <kw>suphsol</kw> which were previously
      mapped to two-character combinations U+005C U+2282 and U+2283 U+002F
      are now mapped to the Unicode 5 characters that were added
      specifically to support these entities, U+27C8 and U+27C9.</p>
      <p>The source files have all been updated to match Unicode 5.2.</p>
      <p>The entity <kw>ThickSpace</kw> now maps to the pair
        U+205F U+200A rather than the triple U+2009 U+200A U+200A
      (4/18 + 1/18)em rather than (3/18 + 1/18 + 1/18)em.</p>
      <p>The entity <kw>UnderBar</kw> maps to the spacing character
       _ rather than the combining character U+0332.</p>
      <p>The entity <kw>OverBar</kw> maps to the spacing character
       U+203E (like the XHTML entity <kw>oline</kw>) rather than the macron character U+00AF.</p>
       <p>The entities <kw>epsiv</kw> and <kw>varepsilon</kw> are bow mapped to the epsilon symbol U+03F5 rather than being aliases for  the entity <kw>epsilon</kw>, U+03B5.</p>
       <p>The entities <kw>phiv</kw> and <kw>varphi</kw> are now mapped to the phi symbol U+03D5 rather than being aliases for  the entity <kw>phi</kw>, U+03C6.</p>
    </div2>
    <div2 id="changes20071214">
      <head>Changes between 2008-07-21 and 2007-12-14</head>
      <p>The following entity definitions have changed at this draft:</p>
      <p><kw>phi</kw>, <kw>lang</kw>, <kw>rang</kw>, 
      <kw>OverParenthesis</kw>, <kw>UnderParenthesis</kw>, 
      <kw>OverBrace</kw>, <kw>UnderBrace</kw>,
      <kw>lbbrk</kw>, <kw>rbbrk</kw>.</p>
    </div2>
  </div1>
  <div1 id="diffs">
    <head>Differences between these entities and earlier W3C DTDs</head>
    <div2 id="diff-xhtml1">
      <head>Differences from XHTML 1.0</head>
      <p>Differences between the XHTML entity definitions described here and the entity set described in the <loc href="http://www.w3.org/TR/xhtml1/dtds.html">XHTML 1.0 DTD</loc>.</p>
      <glist>
	<gitem><label><kw>lang</kw> and <kw>rang</kw></label><def><p>U+27E8 and
	U+27E9; XHTML 1.0 used U+2329 and  U+232A (which have canonical
	decomposition to U+3008 and U+3009).</p></def></gitem>
      </glist>
	
    </div2>
    <div2 id="diff-mathml2">
      <head>Differences from MathML 2.0 (second edition)</head>
      <p>The differences between MathML 2 and the current entity
      definitions are listed below.</p>
      <glist>
	<gitem><label><kw>fjlig</kw></label><def><p>ISOPUB (and MathML 1) defined an fj ligature; 
   Unicode does not have a specific character and the entity was dropped from MathML2.
   It is re-instated here for maximum compatibility with <bibref ref="SGML"/>.</p></def></gitem>
	<gitem><label><kw>phi</kw></label><def><p>U+03C6 GREEK SMALL LETTER PHI 
	(the definition used in HTML4); 
	MathML2 used  U+03D5 GREEK PHI SYMBOL. </p></def></gitem>
	<gitem><label><kw>epsiv</kw>, <kw>varepsilon</kw>, <kw>phiv</kw>, <kw>varphi</kw></label><def><p>these have been changed to map to the symbol character (to match other uses of the var prefix such as <kw>vartheta</kw>).</p></def>
<!--
      <note><p>It is very difficult for (X)HTML
      definitions to change since HTML is so widely deployed. Many of
      the assignments in the current definitions would be different if
      it were not for HTML compatibilty.  However in this case,
      perhaps this change could be made in an XHTML2/HTML5 time frame.
      Currently U+03D5 has the entity names:
      straightphi,phis. U+03C6 has the entity names phi, phgr, phiv,varphi.</p>
      <p>It is also worth noting that Unicode has changed (swapped)
      the default glyphs for U+03C6 and U+03D5 since the publication
      of HTML4. The current recommendation is to use a cursive form
      for U+03C6 (<graphic role="glyph" source="U003C6"/>), and a form
      with a straight vertical bar for  U+03D5 (<graphic role="glyph"
      source="U003D5"/>). Some newer fonts  
      use glyphs that correspond to the change made by Unicode, while a number of
      older fonts remain unchanged and hence will display the glyphs swapped
      relative to the current version of Unicode.  There is no way to guarantee
	that the intended glyph is displayed without font-specific knowledge.</p></note>
--></gitem>
	<gitem><label><kw>jmath</kw></label><def><p>U+0237; MathML 2 used U+006A (j) as
	there was no dotless j before Unicode 4.1.</p></def></gitem>
	<gitem><label><kw>trpezium</kw>, <kw>elinters</kw></label><def><p>U+23E2 and U+23E7;
	MathML 2 used U+FFFD (REPLACEMENT CHARACTER) as these characters were added at Unicode 5.0
	specifically to support these entities. </p></def></gitem>
	<gitem><label><kw>ohm</kw>, <kw>angst</kw></label><def><p>As noted above, the
	definitions of these entities has been changed so that the
	definitions use characters that are in NFC normal
	form.</p></def></gitem>
	<gitem><label><kw>bsolhsub</kw> and <kw>suphsol</kw></label><def><p> U+27C8 and
	U+27C9;
MathML2 used U+005C U+02282 and   U+2283 U+002F.</p></def></gitem>

      </glist>
      <p>The following bracket  symbols have been added to the Mathematical
      symbols block in Unicode versions between 3.1 and 5.1. MathML2 used
      similar characters intended for CJK punctuation.</p>
      <glist>
	<gitem><label><kw>lang</kw>, <kw>langle</kw>, <kw>LeftAngleBracket</kw> and 
	<kw>rang</kw>, <kw>rangle</kw>, <kw>RightAngleBracket</kw></label><def><p>U+27E8 and
	U+27E9; MathML2 used U+2329 and  U+232A (which have canonical
	decomposition to U+3008 and U+3009).</p></def></gitem>
	<gitem><label><kw>Lang</kw> and <kw>Rang</kw></label><def><p>U+27EA and U+27EB;
	MathML2 used U+300A and U+300B.</p></def></gitem>
	<gitem><label><kw>lbbrk</kw> and <kw>rbbrk</kw></label><def><p>U+2772 and
	U+2773; MathML2 used U+3014 and  U+3015.</p></def></gitem>
	<gitem><label><kw>loang</kw> and <kw>roang</kw></label><def><p>U+27EC and
	U+27ED; MathML2 used U+3018 and  U+3019.</p></def></gitem>
	<gitem><label><kw>lobrk</kw> and <kw>robrk</kw></label><def><p>U+27E6 and
	U+27E7; MathML2 used U+301A and U+301B.</p></def></gitem>

	<gitem><label><kw>OverBrace</kw> and <kw>UnderBrace</kw></label><def><p>U+23DE and U+23DF; MathML2
	used U+FE37 and  U+FE38.</p></def></gitem>
	<gitem><label><kw>OverParenthesis</kw> and
	<kw>UnderParenthesis</kw></label><def><p>U+23DC and U+23DD;
	MathML2 used U+FE35 and U+FE36.</p></def></gitem>
	<gitem><label><kw>LeftDoubleBracket</kw> and
	<kw>RightDoubleBracket</kw></label><def><p>U+27E6 and U+27E7;
	MathML2 used U+301A and U+301B.</p></def></gitem>
      </glist>
      <note><p><bibref ref="MathML3"/> uses the entity sets defined by this specification, so there will be no differences
      between MathML and the entities defined here once MathML3 is finalized.</p></note>
    </div2>

  </div1>
  <div1 id="source">
    <head>Source Files</head>
    <p>All data files used to construct the entity declarations, XSLT character maps, and HTML tables referenced from this document are available from <loc href="http://www.w3.org/2003/entities/2007xml/">http://www.w3.org/2003/entities/2007xml/</loc>.</p>
<p>

</p>
   <ulist>
     <item><p><loc
		  href="http://www.w3.org/2003/entities/2007xml/unicode.xml">unicode.xml</loc> master file detailing all unicode characters with names in various entity sets and applications, TeX equivalents and other data. This file has been maintained for many years, originally by Sebastian Rahtz as part of the jadetex distribution and since around 1999 as part of the MathML specification sources by David Carlisle. The current version encodes data for all characters in Unicode 5.2.
<emph>Note: unicode.xml is over 5MB in size and may not really be suitable for direct viewing in a browser, you may prefer to save the file rather than follow the above link to unicode.xml in a browser.</emph></p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/charlist.rnc">charlist.rnc</loc> relax NG schema for unicode.xml.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/unicode.xsl">unicode.xsl</loc> XSLT stylesheet that renders unicode.xml as an HTML table.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/character-set.xml">character-set.xml</loc> the source file for this document.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/xmlspec.xsl">xmlspec.xsl</loc> a copy of the  standard xmlspec stylesheet.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/run">run</loc> small script file that builds this collection.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/xhtml1.xml">xhtml1.xml</loc> record of XHTML 1.0 entity definitions.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/mml2.xml">mml2.xml</loc> record of MathML 2.0 (second edition) entity definitions.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/unicodedata.xsl">unicodedata.xsl</loc> stylesheet that generates a new copy of unicode.xml, incorporating data from the unicode data file, used to updated unicode.xml as new versions of Unicode are released.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/entities.xsl">entities.xsl</loc> stylesheet to generate the DTD declarations for the entities.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/charmap.xsl">charmap.xsl</loc> stylesheet to generate the XSLT character maps.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/characters.xsl">characters.xsl</loc> stylesheet to generate this document, including the referenced HTML tables.</p></item>
     <item><p><loc href="http://www.w3.org/2003/entities/2007xml/schemas.xml">schemas.xml</loc> file associating XML documents with appropriate Relax NG schema.</p></item>
   </ulist>
     
  </div1>
  <div1 id="references"><head>References</head>
  <blist>
     <bibl id="SGML">ISO/IEC 8879:1986, Information processing &#x2014;  Text and office systems &#x2014;  Standard Generalized Markup Language (SGML)</bibl>
     <bibl id="ISO9573-13-1991">ISO/IEC TR :1991, Information
technology &#x2014; SGML support facilities
Techniques for using
SGML &#x2014; Part 13: Public entity sets for
mathematics and science</bibl>
    <bibl id="Unicode">The Unicode Consortium. The Unicode Standard, Version 5.2.0, defined by: The Unicode Standard, Version 5.2 (Mountain View, CA: The Unicode Consortium, 2009. ISBN 978-1-936213-00-9).
    (<loc
    href="http://www.unicode.org/versions/Unicode5.0.0/">http://www.unicode.org/versions/Unicode5.2.0/</loc>) 
    </bibl>
    <bibl id="Unicode15">Unicode Standard Annex 15, Version 5.2.0;
<emph><loc
href="http://www.unicode.org/reports/tr15/tr15-31.html">Unicode Normalization Forms</loc></emph>, 
The Unicode Consortium, 2009-09-03. 
(<loc
href="http://www.unicode.org/reports/tr15/tr15-31.html">http://www.unicode.org/reports/tr15/tr15-31.html</loc>) 
</bibl>

    <bibl id="Unicode25">Barbara Beeton, Asmus Freytag, Murray Sargent III,
    <emph><loc
	      href="http://www.unicode.org/unicode/reports/tr25/">Unicode Support for Mathematics</loc></emph>, 
    Unicode Technical Report #25 2007-05-07.
    (<loc
    href="http://www.unicode.org/unicode/reports/tr25/">http://www.unicode.org/unicode/reports/tr25/</loc>) 
    </bibl>
    <bibl id="MathML2">David Carlisle, Patrick Ion, Robert Miner, Nico Poppelier,
    <emph><loc href="http://www.w3.org/TR/MathML2/">Mathematical Markup Language (MathML) Version 2.0 (Second Edition)</loc></emph>
    W3C Recommendation 21 October 2003
    (<loc href="http://www.w3.org/TR/2003/REC-MathML2-20031021/">http://www.w3.org/TR/2003/REC-MathML2-20031021/</loc>) 
    </bibl>
    <bibl id="MathML3">David Carlisle, Patrick Ion, Robert Miner,
    <emph><loc href="http://www.w3.org/TR/MathML3/">Mathematical Markup Language (MathML) Version 3.0</loc></emph>
    W3C Working Draft 24 September 2009 
    (<loc href="http://www.w3.org/TR/2009/WD-MathML3-20090924/">http://www.w3.org/TR/2009/WD-MathML3-20090924/</loc>) 
    </bibl>
    <bibl id="HTML4">Dave Raggett, Arnaud Le Hors, Ian Jacobs,
 <emph><loc href="http://www.w3.org/TR/html4/"/>HTML 4.01 Specification</emph>
W3C Recommendation 24 December 1999
(<loc
href="http://www.w3.org/TR/1999/REC-html401-19991224/">http://www.w3.org/TR/1999/REC-html401-19991224</loc>)</bibl>
<bibl id="Charmod-norm"> Fran&#xe7;ois Yergeau,
    Martin J. D&#xfc;rst,
    Richard Ishida,
    Addison Phillips,
    Misha Wolf,
    Tex Texin,
<emph><loc  href="http://www.w3.org/TR/charmod-norm/">Character Model
for the World Wide Web 1.0: Normalization</loc></emph>
W3C Working Draft 27 October 2005
(<loc href="http://www.w3.org/TR/charmod-norm/">http://www.w3.org/TR/charmod-norm/</loc>)
  </bibl>
</blist>
  </div1>
</back>
</spec>
