ISO/IEC 10036 and its technical corrigendum 1 allows registration of glyphs for private or limited use. This specification provides a mechanism for XML documents to reference to such registered glyphs.
This document is a submission to the World Wide Web Consortium from Center for Global Communications, International University of Japan (GLOCOM), Infoteria Corporation, and Media Fusion Co., Ltd. (see Submission Request, W3C Staff Comment).
This document is a NOTE made available by W3C for discussion only. Publication of this Note by W3C indicates no endorsement by W3C, the W3C Team, or any W3C Members. W3C has had no editorial control over the preparation of this NOTE. The acknowledgment of a Submission request does not imply that any action will be taken by W3C. It merely records publicly that the Submission request has been made by the submitting Member. This document may not be referred to as "work in progress" of the W3C. No W3C resources were, are, or will be allocated to the issues addressed by the NOTE.
A list of current W3C technical documents can be found at the Technical Reports page.
Comments on this document should be sent to firstname.lastname@example.org.
Embedding Glyph Identifiers in XML Documents [JIS TR X 0047] was originally published by Japanese Standards Association (JSA) in Japanese. It is not a standard but rather a technical report, which is intended to encourage public discussion, promote consensus among relevant parties, and eventually become a Japanese Industrial Standard (JIS), if appropriate. [JIS TR X 0047] was developed by the XML working group (XML WG2) of Information Technology Research and Standardization Center (INSTAC), JSA . This specification is core subset of [JIS TR X 0047]. But this specification has complete enough features. WG2 planned to remove functions which are not included in this specification. [JIS TR X 0047] was produced from XKP GAIJI Exchange Specification which was published by Extended Kanji Processing Council.
This section is normative.
This Specification provides an XML-based language for embedding glyph identifiers in an XML document.
NOTE: "Glyph" is defined in ISO/IEC 9541-1 as "a recognizable abstract graphic symbol which is independent of any specific design."
A glyph identifier is registered through the procedure for glyphs in ISO/IEC 10036 and its technical corrigendum 1.
NOTE: This specification does not constrain internal representations but rather provides an interchange format.
This section is normative.
Glyph Reference Language is a language for embedding glyph identifier(s) with XML documents.
The namespace name is "http://www.xml.gr.jp/xmlns/PRE/Reference". The attribute for specifying glyph identifiers (i.e. the attribute name shown below) belongs to this namespace.
<!ATTLIST someElement name CDATA #REQUIRED>
NOTE: This Specification uses the DTD syntax for convenience. In practice, a suitably declared namespace prefix shall be attached to the attribute.
Elements containing this attribute are meant to reference to glyphs. The value of this attribute is a glyph identifier according to ISO/IEC 10036 and its technical corrigendum 1.
NOTE: The attribute 'name' is typically used for searching glyph identifiers.
This XHTML document has a special glyph of '吉'. A normal glyph for '吉' has a long upper line (). While the special one has a short upper line (). In ISO/IEC 10646-1, these two variation were unified to form a single code point. But many Japanese people need to distinguish them. This example includes information for search processors to distinguish two variations, but does not include for display or printing processors. Note that 吉田茂 (Yoshida Sigeru) was Prime Minister of Japan from 1946 to 1954 A.D.
<html xmlns="http://www.w3.org/1999/xhtml"> <body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference"> <p><span glyph:name="ISO/IEC 10036/RA//Glyphs:10003290" >吉</span>田茂</p> </body> </html>
Same as Example 1, but includes information for human readers. An human readable comment was inserted. Search processors can ignore the value of span elements. As a result, the comments will not be used for search.
<html xmlns="http://www.w3.org/1999/xhtml"> <body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference"> <p><span glyph:name="ISO/IEC 10036/RA//Glyphs:10003290" >吉(The version of Short Upper Line)</span>田茂</p> </body> </html>
Same as Example 1, but includes GIF graphics to express the glyph variation of '吉' for displaying or printing. GIF graphics is merely example, and any other graphics formats are suitable for this purpose.
<html xmlns="http://www.w3.org/1999/xhtml"> <body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference"> <p><img glyph:name="ISO/IEC 10036/RA//Glyphs:10003290" src="http://www.mojikyo.gr.jp/gif/003/003290.gif" alt="吉(The version of Short Upper Line)" />田茂</p> </body> </html>
NOTE: As illustrated by this example, the scope of the attribute "name" is not limited to characters. It might be possible to use this attribute for other types of information such as SVG.