W3C

Embedding Glyph Identifiers in XML Documents

W3C Note 20 December 2002

This version:
http://www.w3.org/TR/2002/NOTE-EGIX-20021220
Latest version:
http://www.w3.org/TR/EGIX
Editor:
KAWAMATA Akira (Pie Dey CO.,Ltd.) <>
INSTAC XML WG2:
MURATA Makoto (International University of Japan, Research Institute)
KOMACHI Yushi (Panasonic)
KAWAMATA Akira (Piedey)
UCHIYAMA Mitsukazu (Toshiba)
KAMIMURA Keisuke (GLOCOM)
YAMAMOTO Yohei (RICOH)

Abstract

ISO/IEC 10036 and its technical corrigendum 1 allows registration of glyphs for private or limited use. This specification provides a mechanism for XML documents to reference to such registered glyphs.

Status of this Document

This document is a submission to the World Wide Web Consortium from Center for Global Communications, International University of Japan (GLOCOM), Infoteria Corporation, and Media Fusion Co., Ltd. (see Submission Request, W3C Staff Comment).

This document is a NOTE made available by W3C for discussion only. Publication of this Note by W3C indicates no endorsement by W3C, the W3C Team, or any W3C Members. W3C has had no editorial control over the preparation of this NOTE. The acknowledgment of a Submission request does not imply that any action will be taken by W3C. It merely records publicly that the Submission request has been made by the submitting Member. This document may not be referred to as "work in progress" of the W3C. No W3C resources were, are, or will be allocated to the issues addressed by the NOTE.

A list of current W3C technical documents can be found at the Technical Reports page.

Comments on this document should be sent to akirak@piedey.co.jp.

Embedding Glyph Identifiers in XML Documents [JIS TR X 0047] was originally published by Japanese Standards Association (JSA) in Japanese. It is not a standard but rather a technical report, which is intended to encourage public discussion, promote consensus among relevant parties, and eventually become a Japanese Industrial Standard (JIS), if appropriate. [JIS TR X 0047] was developed by the XML working group (XML WG2) of Information Technology Research and Standardization Center (INSTAC), JSA . This specification is core subset of [JIS TR X 0047]. But this specification has complete enough features. WG2 planned to remove functions which are not included in this specification. [JIS TR X 0047] was produced from XKP GAIJI Exchange Specification which was published by Extended Kanji Processing Council.

Table of Contents

1 Scope
2 Glyph Reference Language
    2.1 Attribute "name"

Appendices

A Examples
    A.1 Example 1
    A.2 Example 2
    A.3 Example 3
B References
    B.1 Normative References
    B.2 Informative References


1 Scope

This section is normative.

This Specification provides an XML-based language for embedding glyph identifiers in an XML document.

NOTE: "Glyph" is defined in ISO/IEC 9541-1 as "a recognizable abstract graphic symbol which is independent of any specific design."

A glyph identifier is registered through the procedure for glyphs in ISO/IEC 10036 and its technical corrigendum 1.

NOTE: This specification does not constrain internal representations but rather provides an interchange format.

2 Glyph Reference Language

This section is normative.

Glyph Reference Language is a language for embedding glyph identifier(s) with XML documents.

The namespace name is "http://www.xml.gr.jp/xmlns/PRE/Reference". The attribute for specifying glyph identifiers (i.e. the attribute name shown below) belongs to this namespace.

2.1 Attribute "name"

<!ATTLIST someElement name CDATA #REQUIRED>

NOTE: This Specification uses the DTD syntax for convenience. In practice, a suitably declared namespace prefix shall be attached to the attribute.

Elements containing this attribute are meant to reference to glyphs. The value of this attribute is a glyph identifier according to ISO/IEC 10036 and its technical corrigendum 1.

NOTE: The attribute 'name' is typically used for searching glyph identifiers.

Appendices

A Examples (Non-Normative)

A.1 Example 1

This XHTML document has a special glyph of '吉'. A normal glyph for '吉' has a long upper line (character 'yoshi', with
   the upper line longer than the lower line). While the special one has a short upper line (character 'yoshi', with
   the lower line longer than the upper line). In ISO/IEC 10646-1, these two variation were unified to form a single code point. But many Japanese people need to distinguish them. This example includes information for search processors to distinguish two variations, but does not include for display or printing processors. Note that 吉田茂 (Yoshida Sigeru) was Prime Minister of Japan from 1946 to 1954 A.D.

<html xmlns="http://www.w3.org/1999/xhtml">
<body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference">
<p><span glyph:name="ISO/IEC 10036/RA//Glyphs:10003290"
>吉</span>田茂</p>
</body>
</html>

A.2 Example 2

Same as Example 1, but includes information for human readers. An human readable comment was inserted. Search processors can ignore the value of span elements. As a result, the comments will not be used for search.

<html xmlns="http://www.w3.org/1999/xhtml">
<body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference">
<p><span glyph:name="ISO/IEC 10036/RA//Glyphs:10003290"
>吉(The version of Short Upper Line)</span>田茂</p>
</body>
</html>

A.3 Example 3

Same as Example 1, but includes GIF graphics to express the glyph variation of '吉' for displaying or printing. GIF graphics is merely example, and any other graphics formats are suitable for this purpose.

<html xmlns="http://www.w3.org/1999/xhtml">
<body xmlns:glyph="http://www.xml.gr.jp/PRE/Reference">
<p><img glyph:name="ISO/IEC 10036/RA//Glyphs:10003290"
src="http://www.mojikyo.gr.jp/gif/003/003290.gif"
alt="吉(The version of Short Upper Line)" />田茂</p>
</body>
</html>
NOTE: As illustrated by this example, the scope of the attribute "name" is not limited to characters. It might be possible to use this attribute for other types of information such as SVG.

B References

B.1 Normative References

XML
Extensible Markup Language (XML) 1.0 (Second Edition) , Tim Bray, Jean Paoli, C. M. Sperberg-McQueen, Eve Maler, 2000. W3C Recommendation available at: http://www.w3.org/TR/REC-xml.
namespace
Namespaces in XML, Tim Bray, Dave Hollander, Andrew Layman, 1999. W3C Recommendation available at: http://www.w3.org/TR/REC-xml-names.
ISO/IEC 9541
ISO/IEC 9541-1:1991, Information technology - Font Information Interchange - Part 1: Architecture, ISO (International Organization for Standardization), 1991
ISO/IEC 10036
ISO/IEC 10036:1996, Information Technology -- Font information interchange -- Procedures for registration of font-related identifiers, ISO (International Organization for Standardization), 1996
technical corrigendum 1
Technical Corrigendum 1:2001 to ISO/IEC 10036:1996, ISO (International Organization for Standardization), 2001

B.2 Informative References

ISO/IEC 9070
ISO/IEC 9070:1991, Information Technology -- SGML support facilities -- Registration procedures for public text owner identifiers, ISO (International Organization for Standardization), 1991
JIS TR X 0047
Japanese Industrial Standards Committee. Picture Reference Exchange by XML, Japanese Standards Association, 2001
SVG
Scalable Vector Graphics (SVG) 1.0 Specification, Jon Ferraiolo, 2001. W3C Recommendation available at: http://www.w3.org/TR/SVG/.