[ contents ]

W3C

Requirements for Japanese Text Layout

W3C Working Group Note 4 June 2009

This version:
http://www.w3.org/TR/2009/NOTE-jlreq-20090604/
Latest version:
http://www.w3.org/TR/jlreq/
Previous version:
http://www.w3.org/TR/2008/WD-jlreq-20081015/
Editors:
Yasuhiro Anan (阿南 康宏), Microsoft
Hiroyuki Chiba (千葉 弘幸), Invited Expert
Junsaburo Edamoto (枝本 順三郎), Invited Expert
Richard Ishida, W3C
Keiichiro Ishino (石野 恵一郎), Antenna House
Tatsuo Kobayashi (小林 龍生), JustSystems
Toshi Kobayashi (小林 敏), Invited Expert
Kenzou Onozawa (小野澤 賢三), Invited Expert
Felix Sasaki, University of Applied Sciences Potsdam

Please refer to the errata for this document.

A Japanese version of this document is also available. See also translations. The English version of this document is the authoritative version.


Abstract

This document describes requirements for general Japanese layout realized with technologies like CSS, SVG and XSL-FO. The document is mainly based on a standard for Japanese layout, JIS X 4051, however, it also addresses areas which are not covered by JIS X 4051.

Status of this Document

This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

This document describes requirements for general Japanese layout realized with technologies like CSS, SVG and XSL-FO. The document is mainly based on a standard for Japanese layout, JIS X 4051, however, it also addresses areas which are not covered by JIS X 4051.

This is a W3C Working Group Note. The document was originally authored in Japanese, then translated to English under the guidance of the Japanese authors. In order to reach the largest international audience, the W3C works in English, so this English version of the document is the authoritative version. However, the Japanese version of this document is also available.

This document was developed by participants from four W3C Working Groups, CSS, Internationalization Core, SVG and XSL , working together as the Japanese Layout Task Force.

Feedback about the content of this document can be sent to www-i18n-comments@w3.org. Use "[JLReq]" in the subject line of your email, followed by a brief subject. The archive for this list is public.

Publication as a Working Group Note does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

This document was produced by groups operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of disclosures for each group: CSS Working Group disclosures, i18n Core Working Group disclosures, SVG Working Group disclosures, and XSL Working Group disclosures. Those pages also include instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.


Table of Contents

1 Introduction
1.1 Purpose of This Document
1.2 How This Document was Created
1.3 Basic Principles for Development of this Document
1.4 The Structure of This Document
1.5 Reference of Definition and Others
2 Basics of Japanese Composition
2.1 Characters and the Principles of Setting them for Japanese Composition
2.1.1 Characters Used for Japanese Composition
2.1.2 Kanji, hiragana and katakana
2.1.3 Principles of Arrangement of Kanji and Kana Characters
2.2 Page Formats for Japanese Documents
2.2.1 Specification of Page Formats
2.2.2 Basic Templates of Page Formats
2.2.3 Elements of Page Formats
2.2.4 Elements of Kihon-hanmen
2.2.5 Kihon-hanmen and Examples of Real Page Format
2.3 Vertical Writing Mode and Horizontal Writing Mode
2.3.1 Directional Factors in Japanese Composition
2.3.2 Major Differences between Vertical Writing Mode and Horizontal Writing Mode
2.4 Specifying the Kihon-hanmen
2.4.1 Procedure for Defining the Kihon-hanmen
2.4.2 Considerations in Designing the Kihon-hanmen
2.5 Page wise Arrangement of Kihon-hanmen Elements
2.5.1 Examples of Items Jutting Out of the Kihon-hanmen
2.5.2 Line Positioning based on the Kihon-hanmen Design
2.5.3 Character Positioning based on Kihon-hanmen Design
2.6 Running Heads and Page Numbers
2.6.1 Positioning of Running Heads and Page Numbers
2.6.2 Principles of Arrangement of Running Heads and Page Numbers
2.6.3 Ways of Arranging Running Heads and Page Numbers
3 Line Composition
3.1 Line Composition Rules for Punctuation Marks
3.1.1 Differences in Vertical and Horizontal Composition in Use of Punctuation Marks
3.1.2 Positioning of Punctuation Marks (Commas, Periods and Brackets)
3.1.3 Exceptional Positioning of Ideographic Comma and Katakana Middle Dot
3.1.4 Positioning of Consecutive Opening Brackets, Closing Brackets, Commas, Full Stops and Middle Dots
3.1.5 Positioning of Opening Brackets at Line Head
3.1.6 Positioning of Dividing Punctuation Marks (Question Mark and Exclamation Mark) and Hyphens
3.1.7 Characters Not Starting a Line
3.1.8 Characters Not Ending a Line
3.1.9 Positioning of Closing Brackets, Full Stops, Commas and Middle Dots at Line End
3.1.10 Unbreakable Character Sequences
3.1.11 Character Sequences which Do Not Allow Space Insertion as Part of Line Adjustment Processing
3.1.12 Examples of Line Adjustment
3.2 Japanese and Western Mixed Text Composition (including Horizontal-in-Vertical Text Composition)
3.2.1 Composition of Japanese and Western Mixed Texts
3.2.2 Mixed Text Composition in Horizontal Writing Mode
3.2.3 Mixed Text Composition in Vertical Writing Mode
3.2.4 Method for Setting Full-width Latin Letters and European Numerals
3.2.5 Handling of Tate-chu-yoko (Horizontal-in-Vertical Settings)
3.2.6 Handling of Western Text in Japanese Text using Proportional Western Fonts
3.3 Ruby and Emphasis Dots
3.3.1 Usage of Ruby
3.3.2 Choice of Base Characters to be Annotated by Ruby
3.3.3 Choice of Size for Ruby Characters
3.3.4 Choice of Sides for Ruby with Respect to Base Characters
3.3.5 Positioning of Mono-ruby with Respect to Base Characters
3.3.6 Positioning of Group-ruby with Respect to Base Characters
3.3.7 Positioning of Jukugo-ruby with Respect to Base Characters
3.3.8 Adjustments of Ruby with Length Longer than that of the Base Characters
3.3.9 Composition of Emphasis Dots
3.4 Inline Cutting Note (Warichu)
3.4.1 Where the Inline Cutting Note (Warichu) is used
3.4.2 Character Size for Inline Cutting Notes and Line Gaps
3.4.3 Handling an Inline Cutting Note when it Straddles Two Kihon-hanmen Lines
3.5 Paragraph Adjustment Rules
3.5.1 Line Head Indent at the Beginning of Paragraphs
3.5.2 Line Head Indent and Line End Indent
3.5.3 Single Line Alignment Processing
3.5.4 Widow Adjustment of Paragraphs
3.6 Tab Setting
3.6.1 Usage of Tab Setting
3.6.2 Types of Tab Settings
3.6.3 The Method of Setting the Target Text
3.7 Other Rules of Japanese Typesetting
3.7.1 Superscripts and Superscripts
3.7.2 Furiwake
3.7.3 Jidori Processing
3.7.4 Processing of Math Symbols and Math Operators
3.8 Line Adjustment
3.8.1 Necessity for Line Adjustment
3.8.2 Reduction and Addition of Inter-Character Space
3.8.3 Procedures for Inter-Character Space Reduction
3.8.4 Procedures for Inter-Character Space Expansion
3.9 About Character Classes
3.9.1 Differences in Positioning of Characters and Symbols
3.9.2 Grouping of Characters and Symbols depending on their Positioning
3.9.3 Positioning Methods for each Character Class
4 Approach to Hanmen Design
4.1 Headings
4.2 Notes
4.3 Illustrations
4.4 Tables

Appendices

A Character Classes
B Spacing between Characters
C Possibilities for Line-breaking between Characters
D Opportunities for Inter-character Space Reduction during Line Adjustment
E Opportunities for Inter-character Space Expansion during Line Adjustment
F Positioning of Jukugo-ruby
G Terminology
H References
I Revision Log
J Acknowledgements


1 Introduction

1.1 Purpose of This Document

Each cultural community has its own language, script and writing system. In that sense, the transfer of each writing system into cyberspace is a task with very high importance for information and communication technology.

As one of the basic work items of this task force, this document describes issues of text composition in the Japanese writing system. The goal of the task force is not to propose actual solutions but describe important issues as basic information for actual implementations.

1.2 How This Document was Created

This document was created by the W3C Japanese Layout Task Force. The Task Force has discussed many issues and harmonized the requirements from user communities and solutions from technological experts. It includes the following participants:

  1. Japanese text composition experts (The editors of "JIS X 4051:Formatting rules for Japanese documents").

  2. Internationalization and standardization experts in Japan (from Microsoft, Antenna House, JustSystems).

  3. Members of the W3C CSS,SVG, XSL and i18n Core, Working Groups.

This task force also constitutes an important innovation due to its bilingual work-flow. Discussion is mainly conducted in Japanese, because of the Japanese composition issues, but minutes and one mailing list were in English. To support development, the task force held face-to-face meetings with participating Working Groups.

The document itself was also developed bilingually, and is published bilingually. We carefully avoided using jargon for technical terms. Even if there were English words corresponding to the Japanese, we carefully studied any potential differences in the nuances of meaning, and if there were differences between corresponding concepts, we provided the Japanese jargon in romaji (Latin transliteration) for future discussion. Moreover, we prepared as many figures as possible, with clear and understandable English, to help non-Japanese readers.

1.3 Basic Principles for Development of this Document

Japanese composition exhibits several differences from Western composition. Major differences include:

  1. The use of not only horizontal writing mode but also vertical writing mode.

  2. In principle, all character frames of ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters used in Japanese composition are designed in a square box, and these characters are composed without intervening spaces (i.e. set solid). In this document, notations such as ideographic (cl-19) and hiragana (cl-15) characters indicate character classes (see 3.9 About Character Classes).

This document mainly explains the characteristics of Japanese composition along the lines of the following policy.

  1. It does not fully cover all issues of the Japanese composition system, but mainly discusses the differences from Western composition systems.

  2. It focuses on the requirements for the Japanese visual presentation form of text composition. Technology-specific interpretations of the requirements and/or how to implement them are out of scope for this document.

  3. It explicitly refers to JIS X 4051 "Formatting rules for Japanese documents" as much as possible. This document focuses on fundamental and important issues of Japanese layout as much as possible, and for more detail references the corresponding clause of JIS X 4051. The JIS X 4051 topics that are not described in this document, are only for exceptional, corner cases or to provide some specific line composition algorithms. On the other hand, some topics that are not described in JIS X 4051 are described in detail. Accordingly, this document is sufficient to implement Japanese layout processing for most parts of the Japanese market.

    In accordance with the stated policy, this document provides tutorial- or summary-like, supplementary explanations, related background, and additional descriptions for JIS X 4051 information. This document covers all the fundamental issues of Japanese text layout, but the reader will need to refer to JIS X 4051 for advanced discussion of exceptional topics.

  4. It provides typical examples of actual use for key composition features, to enable better understanding of their usage.

  5. For non-Japanese readers, frequency of use is indicated for each requirement. These frequencies are not the outcome of any accurate research, but arise from the long experience of the authors. They are intuitive for ordinary Japanese text readers; however, for non-Japanese readers it may be difficult to imagine without explicit information. These frequencies are only rough information to prioritize the importance of issues. A couple of examples:

    "warichu (inline cutting note) is not frequently used, but is useful to simply annotate persons, things, and so on, at the place where the text appears, especially in classic texts or translations.", or "ruby is frequently used in modern documents, including newspapers."

  6. In consideration of non-Japanese readers of this document, figures are used for explanations wherever possible.

  7. Text layout rules and recommendations for readable design are different things, however these two issues are difficult to discuss independently. In this document, these two aspects are carefully separated. The aesthetic design recommendations are mainly described using notes.

  8. The main target of this document is common books. The authors' experiences are mainly related to common books, and the quality required for common books is the highest in the market. There are many kinds of books in the market, and the requirements are quite diverse. The task force has a lot of accumulated experience in requirements and solutions for Japanese text composition. Nonetheless, many issues, which have been discussed over a long period of time, are applicable for other kinds of publication.

    In terms of frequency of use, the importance of magazines, technical manuals, and Web documents rates alongside common books. However, there are several characteristics in these publications, which are different from common books. These issues should be treated more fully in future documents.

1.4 The Structure of This Document

This document consists of three parts:

1 Introduction

2 Basics of Japanese Composition

3 Line Composition

2 Basics of Japanese Composition explains the characteristics of letters and symbols which are used in Japanese composition, their differences in vertical writing mode and horizontal writing mode, and the design and adaptation of the kihon-hanmen.

3 Line Composition explains line composition methods for ideographic (cl-19), hiragana (cl-15), katakana (cl-16) characters and punctuation marks, together with ruby (inter-line pronunciation information and annotation) and Japanese and Western mixed text composition, i.e. mixtures of Japanese characters and Western characters (cl-27).

There are plans to add a fourth section in the next version of this document, which will describe construction methods and composition methods for headings, notes, illustrations and tables.

In principle, characters in Japanese composition are designed in a square box and positioned without spaces, i.e. solid setting. This is taken as a basic premise for the design of the kihon-hanmen, the basis of book layout. Furthermore, to understand Japanese layout, it is important to understand the design of the kihon-hanmen and how to position illustrations, characters, symbols etc. in relation to it. Hence, 2 Basics of Japanese Composition describes in detail the design of the kihon-hanmen and its dependencies. In particular, 2.5 Page wise Arrangement of Kihon-hanmen Elements provides prototypical patterns for the three guidelines listed after this paragraph: what recommendations need to be strictly taken into account, and what exceptions are possible. (The goal of these explanations is an understanding of Japanese composition. Since detailed explanations of the various elements of the kihon-hanmen are given in 3 Line Composition and 4 Approach to Hanmen Design, some explanations are repeated.)

  1. Keep to the basic size and column numbers of multi-column format that were decided upon in setting up the kihon-hanmen, with some exceptions.

  2. Keep to the line positions that were decided upon in setting up the kihon-hanmen, with some exceptions.

  3. Keep to the letter positions that were decided upon in setting up the kihon-hanmen, with some exceptions.

1.5 Reference of Definition and Others

The definitions of technical terms are described in the Appendix G Terminology appendix. Terms are linked to corresponding places in the Terminology appendix only at first appearance and in important places. If there is no appropriate English terminology for Japanese terminology, or the English terminology may possibly cause misunderstanding, the Japanese terminology is only transliterated to Hepburn style romaji notation (except that "m", not "n", is used before "b", "m" and "p").

Also, the definitions of terminology in the Terminology appendix are basically the same as the definitions of JIS X 8125 or JIS X 4051, with respect to common Japanese usage of terminology.

Each character class has its own character class number in parentheses. Members of each character class are listed in Appendix A Character Classes, except for CJK Ideographs. Each character in this document is named and referred to using the character names of ISO/IEC 10646 (UCS).

The formal title of the frequently mentioned Japanese Industrial Standard JIS X 4051 is as follows:

JIS X 4051 is available from the Japan Standards Association (http://www.jsa.or.jp/), but a PDF version is not available from JSA. The PDF version is accessible from the Japanese Industrial Standards Committee site (http://www.jisc.go.jp/), however it is not possible to download it.

2 Basics of Japanese Composition

2.1 Characters and the Principles of Setting them for Japanese Composition

2.1.1 Characters Used for Japanese Composition

Japanese letters used for composing Japanese text mainly consist of ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters (see [Fig.1]).

Kanji, hiragana and katakana.
[Fig.1]: Kanji, hiragana and katakana.

(note 1)

In addition to ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters, various punctuation marks (see [Fig.2]) as well as Western characters (cl-27), such as European numerals, Latin letters and/or Greek letters, may be used in Japanese text. In this document these characters are classified into character classes, for which explanations are given describing their behavior in type-setting.

Examples of punctuation marks.
[Fig.2]: Examples of punctuation marks.

(note 2)

The details of character classes used in this document will be explained in 3.9 About Character Classes, as well as in Appendix B Spacing between Characters. Also, in "Spacing between Characters" all non-Kanji characters included in ISO/IEC 10646 (UCS) Annex A collection 285 (Basic Japanese character set) and collection 286 (Extended non-Kanji character set) are explicitly classified by character class.

2.1.2 Kanji, hiragana and katakana

Ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters are the same size, and have square character frames of equal dimensions. Aligned with the vertical and horizontal center of the character frame, there is a smaller box called the letter face, which contains the actual symbol. Character size is measured by the size of the character frame (see [Fig.3]). "Character advance" is a term used to describe the advance width of the character frame of a character. By definition, it is equal to the "width" of a character in horizontal writing mode, whereas it is the height of a character in vertical writing mode (see [Fig.3]).

The size of kanji and hiragana, and the character frames.
[Fig.3]: The size of kanji and hiragana, and the character frames.

(note 1)

In vertical writing mode, the letter face of small kana (cl-11) characters (ぁぃぅァィゥ etc.) is placed at the vertical center and to the right of the horizontal center of the character frame; in horizontal writing mode, it is placed at the horizontal center and below the vertical center (see [Fig.4]). Also there are punctuation marks with letter faces that are not placed at the vertical and horizontal center of the character frame.

Small kana and the position of their letter face in the character frame.
[Fig.4]: Small kana and the position of their letter face in the character frame.

2.1.3 Principles of Arrangement of Kanji and Kana Characters

In principle, when composing a line with ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters no extra space appears between their character frame. This is called solid setting (see [Fig.5]).

Example of solid setting in horizontal writing mode.
[Fig.5]: Example of solid setting in horizontal writing mode.

(note 1)

In the letterpress printing era ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters were designed so that they were easy to read in solid setting, regardless of text direction. However, unlike the letterpress printing era, when several sizes of the original pattern of a letter were required to create matrices, in today's digital era the same original pattern is used for any size simply by enlargement or reduction. Because of this, it might be necessary to adjust the inter-character space when composing lines at large character sizes. When composing lines at small character sizes, hinting data is used to ensure that the width of the strokes that make up a character look correct.

(note 2)

Depending on the context, there are several setting methods used in addition to solid setting, as shown below.

  1. Fixed inter-character spacing: Text set with a fixed size space between each character frame (see [Fig.6]).

    Examples of fixed inter-character spacing in horizontal writing mode.
    [Fig.6]: Examples of fixed inter-character spacing in horizontal writing mode.

    Fixed inter-character spacing is used in books for the following cases:

    1. To achieve a balance between running heads with few and with many characters. Fixed inter-character spacing is used for the running heads with few characters. Examples of fixed inter-character spacing for running heads are given in JIS X 4051, annex 5.

    2. To achieve a balance between headings with few and with many characters. Fixed inter-character spacing is used for the headings with few characters. Examples of fixed inter-character spacing for headings are given in JIS X 4051, annex 6.

    3. For captions of illustrations and tables, which only have a few characters. Fixed inter-character spacing is used to balance with the size of the illustration or table.

    4. In some cases, fixed inter-character spacing is used for Chinese and Japanese poetry where one line has only a few characters.

    (note 1)

    Fixed inter-character spacing, including also even tsumegumi, is defined in JIS X 4051, sec. 4.18.1 b.

  2. Even inter-character spacing: Text set with equal inter-character spacing between characters on a given line, so that each line is aligned to the same line head and line end (see [Fig.7]).

    Example of even inter-character space setting in horizontal writing mode.
    [Fig.7]: Example of even inter-character space setting in horizontal writing mode.

    Even inter-character space setting is used in books for unifying the length of table headings with Japanese text (see [Fig.8]). There are also examples (e.g. lists of names) in which parts of a person names receive inter-character spacing.

    Example of a table with inter-character spacing.
    [Fig.8]: Example of a table with inter-character spacing.

    (note 1)

    Even inter-character spacing, including processing of jidori, is defined in JIS X 4051, sec. 4.18.1.

  3. Tsumegumi (kerning / tracking): Text is set with negative inter-character space by arranging characters so that a portion of two character frames overlap each other. This is divided further into two types, depending on the methods used for inter-character space reduction. One method involves reducing by the same amount of inter-character space (even tsumegumi or tracking, see [Fig.9]) and the other involves determining the amount of space to reduce based on the distance between the two letter faces of adjacent characters (face tsumegumi or letter face kerning, see [Fig.10]).

    Example of even tsumegumi in horizontal writing mode. (The 1st line is the same text with solid setting, for comparison.)
    [Fig.9]: Example of even tsumegumi in horizontal writing mode. (The 1st line is the same text with solid setting, for comparison.)
    Example of face tsumegumi in horizontal writing mode. (The 1st line is the same text with solid setting, for comparison.)
    [Fig.10]: Example of face tsumegumi in horizontal writing mode. (The 1st line is the same text with solid setting, for comparison.)

    In the main text of books, the most reader-friendly approach is to use solid setting. However, if the character size is larger, the distance between characters may become unbalanced, and tsumegumi will be applied. For example, there are books where tsumegumi is used with headings set in large character sizes. These methods are rarely used in books, since ease of reading is very important, but in magazines or advertisements there are many more examples of tsumegumi. Magazines tend to use type to differentiate themselves from others, and so devices like this are sometimes used for that purpose.

2.2 Page Formats for Japanese Documents

2.2.1 Specification of Page Formats

The page format of a Japanese document is specified by:

  • Firstly, preparing a template of the page format, which determines the basic appearance of pages of the document;

  • Then, specifying the details of actual page elements based on the templates.

2.2.2 Basic Templates of Page Formats

Generally, books use only one template for page format, whereas magazines often use several templates.

Although in books, as will be mentioned later, there tends to be one template for the page format, the basic pattern is typically adapted. For example, the table of contents may contain small modifications. Furthermore, there are many examples of indexes with a different page format than the basic page format, and vertically set books often have indexes in horizontal writing mode and sometimes multiple columns. This still holds true where the goal is to make the size of the hanmen for indexes close to the size of hanmen in the basic page format.

Magazines gather articles of different kinds. Often the page format will differ depending on the content of the article. For example, one part may have 9 point character size and 3 columns, and another part 8 point character size and 4 columns.

2.2.3 Elements of Page Formats

Example of a page format in vertical writing mode.
[Fig.11]: Example of a page format in vertical writing mode.

The following are the basic elements of a page format. [Fig.11] illustrates an example of a page format in vertical writing mode).

  1. Trim size and binding side (vertically set Japanese documents are bound on the right-hand side, and horizontally set documents are bound on the left-hand side. See [Fig.12].)

  2. Principal text direction (vertical writing mode or horizontal writing mode).

  3. Appearance of the kihon-hanmen and its position relative to the trim size.

  4. Appearance of running heads and page numbers, and their positions relative to the trim size and kihon-hanmen.

Binding-side (bound on the right-hand side and bound on the left-hand side).
[Fig.12]: Binding-side (bound on the right-hand side and bound on the left-hand side).

(note 1)

Establishing a kihon-hanmen may be seen as defining not only a rectangular area on a page, but also within that area an underlying, logical grid, to guide the placement of such things as characters, headings, and illustrations. However, once a kihon-hanmen is established, there is no absolute requirement to align characters with the grid, especially when setting characters inside a line. The only factors that influence the placement of characters are strong gravitational forces that (i) attract the first and last characters on a line to align with the border of the kihon-hanmen, and (ii) attract each line position to the line positions on which the kihon-hanmen is based.

It may help in understanding the basic concepts of Japanese layout and kihon-hanmen to think in terms of a slit-based model, rather than a grid-based model. Each slit is the full length of the lines on which the kihon-hanmen is based.

2.2.4 Elements of Kihon-hanmen

The kihon-hanmen is the hanmen style designed as the basis of a book. The following are the basic elements of the kihon-hanmen (see [Fig.13]).

(note 1)

To understand the characteristics of Japanese composition, it is important to understand how the various elements of the kihon-hanmen are applied to a real page. The details will be explained later.

(note 2)

The normative definition of kihon-hanmen is provided in JIS X 4051, sec. 7.5.

(note 3)

Format examples (including running heads and page numbers) and composition examples for kihon-hanmen in different trim sizes are available in JIS X 4051, annexes 3 and 4.

Elements of kihon-hanmen. (Example in vertical writing mode.)
[Fig.13]: Elements of kihon-hanmen. (Example in vertical writing mode.)
  1. Character size and typeface name

  2. Text direction (vertical writing mode or horizontal writing mode)

  3. Number of columns and column gap when using multi-column format

  4. Length of a line

  5. Number of lines per page (number of lines per column when using multi-column format)

  6. Line gap

2.2.5 Kihon-hanmen and Examples of Real Page Format

Below are several examples of how the basic page format is created, and how then various elements are placed on a real text page. (This and other aspects of how the various elements of the kihon-hanmen are arranged on each page are explained in 2.5 Page wise Arrangement of Kihon-hanmen Elements.)

  1. Realm and position of headings

    To set a heading, first establish a rectangular space based on a number of lines in the kihon-hanmen. For example, a '3 line space' means (3 * the size of the character frame used for the kihon-hanmen + 2 * the line gap in the kihon-hanmen). (Details of this processing are defined in JIS X 4051, sec. 8.3.3.d). The heading text is usually set in the centre of the rectangular space in the block direction, and indented from the line head. The size of the indent is usually specified as a number of characters in the kihon-hanmen. For example, a '4 character indent' means (4 * the size of the character frames used for establishing the kihon-hanmen). (See the example at [Fig.14].)

    Layout example of a heading based on the line positions established by the kihon-hanmen.
    [Fig.14]: Layout example of a heading based on the line positions established by the kihon-hanmen.

    (note 1)

    Details of the different types of heading, creation of headings, methods for placing headings, etc. will be explained in 4.1 Headings in a future version of this document.

  2. Size of illustrations

    In horizontal writing mode with two columns, for example, the width of illustrations should, if at all possible, be either the width of one kihon-hanmen column or the width of the kihon-hanmen (see [Fig.15]). The illustrations are usually set at the head or the foot of the page (see [Fig.15]).

    Example of illustrations in two columns, horizontal writing mode.
    [Fig.15]: Example of illustrations in two columns, horizontal writing mode.

    Also, in vertical writing mode, for example with three columns, the height of illustrations should, if at all possible, be either the height of one kihon-hanmen column or the height of the kihon-hanmen (see [Fig.16]). The illustrations are usually set at the right side or left side of the kihon-hanmen (see [Fig.16]).

    Example of illustrations in three columns, vertical writing mode.
    [Fig.16]: Example of illustrations in three columns, vertical writing mode.

    (note 1)

    Details of illustration positioning will be explained in a subsequent version of this document.

  3. Hanmen size for the table of contents

    The hanmen size for the table of contents of books is based on the size of the kihon-hanmen. There are many examples of tables of contents in vertical writing mode where the left-to-right size is identical to that of the kihon-hanmen, but the top-to-bottom size is a little bit smaller (see [Fig.17]).

    Example of the design of the table of contents (TOC) in vertical writing mode.
    [Fig.17]: Example of the design of the table of contents (TOC) in vertical writing mode.

    (note 1)

    There are cases when a different hanmen than the kihon-hanmen is used for positioning of running heads and page numbers. This will be discussed in 2.6.2 Principles of Arrangement of Running Heads and Page Numbers (see [Fig.51]).

2.3 Vertical Writing Mode and Horizontal Writing Mode

2.3.1 Directional Factors in Japanese Composition

Japanese composition has two text directions. One is vertical direction (vertical writing mode), the other is horizontal direction (horizontal writing mode).

(note 1)

Ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters for Japanese composition have basically been designed to have a square character frame from the letterpress printing era on. Thus the same collection of printing types can be used in either vertical writing mode or horizontal writing mode, simply by changing the direction of text, (see [Fig.18]). There were some attempts to develop printing types designed exclusively for horizontal writing mode, but they were not widely accepted.

Vertical writing mode and horizontal writing mode. (The arrows show the reading direction.)
[Fig.18]: Vertical writing mode and horizontal writing mode. (The arrows show the reading direction.)

(note 2)

There is little market data comparing the number of pages with vertical writing mode and horizontal writing mode, but it is said that both are almost the same.

(note 3)

For official (e.g. governmental) documentation, horizontal writing mode is recommended. Educational material (with the exception of certain topics) is mostly in horizontal writing mode. Readers of "mobile novels" are increasing, and it is expected that in the future horizontal writing mode will increase in this area as well. However, most of the large newspapers are written completely in vertical writing mode, and most of the large journals for ordinary readers are almost completely set in vertical writing mode. In addition, novels, which are the most widely read kind of book publication, are almost completely in vertical writing mode (some readers say that they cannot read a novel if it is not in vertical writing mode). Hence it can be expected that the importance of vertical writing mode for Japanese will not change for the time being.

(note 4)

There is usually only one direction for all text throughout a book, but there are cases where horizontal writing mode is used in certain parts of vertically composed books (see [Fig.19]). Tables, captions for illustrations, running heads, and page numbers are usually composed horizontally in a page with a vertical writing mode.

Example of horizontal writing mode in parts of vertically set books.
[Fig.19]: Example of horizontal writing mode in parts of vertically set books.

2.3.2 Major Differences between Vertical Writing Mode and Horizontal Writing Mode

The following are major differences between vertical writing mode and horizontal writing mode.

  1. Arrangement of characters, lines, columns and pages; direction of page progression.

    (note 1)

    The positioning of characters, lines and paragraphs in vertical and horizontal writing mode is defined in JIS X 4051, sec. 7.4.4.

    1. Vertical writing mode. See [Fig.20] for an example of vertical writing mode with two columns per page.

      Direction of arrangement of characters in vertical writing mode.
      [Fig.20]: Direction of arrangement of characters in vertical writing mode.
      1. Characters are arranged from top to bottom, lines are arranged from right to left.

      2. Columns are arranged from top to bottom. A book starts with the left (recto) side and progresses from right to left (see [Fig.21]).

        Progression of pages for a vertically set books.
        [Fig.21]: Progression of pages for a vertically set books.
    2. Horizontal composition. See [Fig.22] for an example of horizontal text layout with two-columns per page.

      Direction of arrangement of characters in horizontal writing mode.
      [Fig.22]: Direction of arrangement of characters in horizontal writing mode.
      1. Characters are arranged from left to right, and lines are arranged from top to bottom.

      2. Columns are arranged from left to right. A book starts with the right (recto) side and progresses from left to right (see [Fig.23]).

        Progression of pages for a horizontally set book.
        [Fig.23]: Progression of pages for a horizontally set book.
  2. Orientation of Latin alphanumeric characters in a line.

    1. There are three ways to arrange Latin alphanumerics in vertical writing mode:

      1. One by one with the same normal orientation as that of Japanese characters. This is usually applied to one-letter alphanumerics or capitalized abbreviations (see [Fig.24]).

        Arrangement of alphanumerics in vertical writing mode - normal orientation.
        [Fig.24]: Arrangement of alphanumerics in vertical writing mode - normal orientation.

        (note 1)

        The alphanumeric characters used for this arrangement have different typographic features than those with proportional width used for Western text. They are of fixed-width and full-width design, and have been used this way since the letterpress printing era.

      2. Rotated 90 degrees clockwise. This is usually applied to English words or sentences (see [Fig.25]).

        Arrangement of alphanumerics in vertical writing mode - rotated 90 degrees clockwise.
        [Fig.25]: Arrangement of alphanumerics in vertical writing mode - rotated 90 degrees clockwise.

        (note 1)

        In [Fig.25], there are spaces before and after the character frame for the Western word "editor". These spaces are necessary for composition of mixed Japanese and Western text, and details will be provided in a later section.

      3. Set horizontally without changing orientation (called tate-chu-yoko, which means horizontal-in-vertical composition) (see [Fig.26]). This is usually applied to two-digit numbers (see JIS X 4051, sec. 4.8 for the definition).

        Arrangement of alphanumerics in vertical writing mode - tate-chu-yoko.
        [Fig.26]: Arrangement of alphanumerics in vertical writing mode - tate-chu-yoko.
    2. In horizontal writing mode there is only one way of arranging alphanumerics, i.e. normal orientation.

  3. Arrangement of tables and/or illustrations rotated 90 degrees clockwise or counter-clockwise for reasons of size. (This processing is defined in JIS X 4051, sec. 7.3.).

    1. In vertical writing mode, align the top of tables/illustrations to the right of the page (see [Fig.27]).

      Example of arrangement of a table rotated 90 degrees clockwise in vertical writing mode.
      [Fig.27]: Example of arrangement of a table rotated 90 degrees clockwise in vertical writing mode.
    2. In horizontal writing mode, align the top of tables/illustrations to the left of the page (see [Fig.28]).

      Example of arrangement of a table rotated 90 degrees counterclockwise in horizontal writing mode.
      [Fig.28]: Example of arrangement of a table rotated 90 degrees counterclockwise in horizontal writing mode.

      (note 1)

      The orientation is chosen to minimize interference with the overall reading flow of the book.

  4. Arrangement of an incomplete number of lines on a multi-column format page due to new recto, page break or other reasons. (The processing of new recto and page break is defined in JIS X 4051, sec. 8.1.1.).

    1. In vertical writing mode, just finish the line where it ends ("nariyuki"). The number of lines in each column is not uniform (see [Fig.29]).

      How to process incomplete number of lines on a multi-column format page (vertically set book).
      [Fig.29]: How to process incomplete number of lines on a multi-column format page (vertically set book).
    2. In horizontal writing mode, re-arrange columns so that each column has the same number of lines. In case the number of lines is not divisible by the number of columns, add the smallest number to make it divisible and re-arrange columns using the quotient as the number of lines so that only the last column shall have the incomplete number of lines (see [Fig.30]).

      How to process incomplete number of lines on a multi-column format page (horizontally set book).
      [Fig.30]: How to process incomplete number of lines on a multi-column format page (horizontally set book).

      (note 1)

      Neither horizontal nor vertical balance of column arrangement would break the stability of vertical page layout very much, while horizontal balance of column arrangement is determinant for horizontal page layout. In vertical text it doesn't matter too much whether columns are balanced or not. For horizontally set text it is best to balance columns wherever possible.

2.4 Specifying the Kihon-hanmen

2.4.1 Procedure for Defining the Kihon-hanmen

In Japanese composition, first the size of the kihon-hanmen is defined, using the square character frames of characters in solid setting. Taking this as a base, the position of the kihon-hanmen with regards to the trim size is then specified. The following are procedures for determining the size and position of the kihon-hanmen (see [Fig.31]).

  1. Specifying the dimensions of the kihon-hanmen.

    1. For a document with a single column per page, specify the character size, the line length (the number of characters per line), the number of lines per page, and the line gap.

    2. For a document with multiple columns per page, specify the character size, the line length (the number of characters per line), the number of lines per column, the line-gap, and the number of columns and the column gap.

      Procedures to determine the size and position of the kihon-hanmen, step 1.
      [Fig.31]: Procedures to determine the size and position of the kihon-hanmen, step 1.
  2. Determining the position of the kihon-hanmen relative to the trim size.

    There are various alternative methods for specifying the position of the kihon-hanmen relative to the trim size (see [Fig.32]):

    1. Position vertically by centering the kihon-hanmen. Position horizontally by centering the kihon-hanmen.

    2. Position vertically by specifying the space at the head (for horizontal writing mode) or the space at the foot (for vertical writing mode). Position horizontally by centering the kihon-hanmen.

    3. Position vertically by centering the kihon-hanmen. Position horizontally by specifying the space of the gutter.

    4. Position vertically by specifying the space at the head (for horizontal writing mode) or the space at the foot (for vertical writing mode). Position horizontally by specifying the space of the gutter.

    Procedures to determine the size and position of the kihon-hanmen, step 2.
    [Fig.32]: Procedures to determine the size and position of the kihon-hanmen, step 2.

    (note 1)

    In most cases the kihon-hanmen is set at the horizontal and vertical center of the trim size, which should be the default positioning, but depending on the dimensions of the kihon-hanmen there may be cases where the default needs to be changed; for example, by moving the kihon-hanmen up, down, to the left or to the right of the default position.

    (note 2)

    It is technically possible to determine the dimensions of the kihon-hanmen by specifying the trim size and margins of all sides, but this method is not common in the tradition of Japanese composition. If this is the only way an implementation allows, the margins of each side need to be determined beforehand in relation to the dimensions of the kihon-hanmen and its position in the trim size.

2.4.2 Considerations in Designing the Kihon-hanmen

The following are considerations to take into account when designing the kihon-hanmen. (This topic is not about processing, but rather an explanation of design preferences. The definition of kihon-hanmen is given in JIS X 4051, sec. 7.4.1.)

  1. Trim size and margins. It would be best if the shape of the kihon-hanmen could be made similar to that of the trim size.

  2. Character size. Generally 9 point (about 3.2mm) type is common. Except for specialized publications such as dictionaries, the minimum size of type is 8 point (about 2.8mm).

    (note 1)

    In Western text layout, 10 point (about 3.5mm) or 12 point (about 4.2mm) type is common. This is mainly because of a difference in design principles between Japanese and Latin characters.

  3. Line length should be multiples of the character size (see [Fig.33]).

    Line length should be multiples of the character size.
    [Fig.33]: Line length should be multiples of the character size.

    (note 1)

    There are basically two reasons why line length should be multiples of the character size.

    1. For Japanese composition, all line lengths except that of the last line of the paragraph should, in principle, be the same.

    2. In principle, for printing, Japanese characters like ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters are uniformly designed in the same square character frame and they are set solid (no extra space between adjacent character frames).

    (note 2)

    The best line length (number of characters per line) is around 52 characters, maximum, in vertical writing mode, and 40 characters, maximum, in horizontal writing mode. If the trim size would take lines beyond the recommended length, consider using a multi-column format and making the line length shorter.

  4. Use the same amount of line gap throughout the book, except for special cases. The size of the kihon-hanmen in the block direction is specified using the number of lines and the size of the line-gap.

    (note 1)

    In Japanese composition, there are cases where ruby or emphasis (emphasis dots, bousen, underlines, etc.) are inserted between lines. In such cases the line gap is not changed but is kept constant (see [Fig.34]). It is also possible to insert reference marks to notes between lines within the main text. This case is handled in the same manner. If these elements are likely to occur in text, the line gap established during the kihon-hanmen design needs to be of an adequate size to accommodate them. Further explanations about the placement of ruby will be given in 3 Line Composition.

    Inserting ruby or other items between lines.
    [Fig.34]: Inserting ruby or other items between lines.

    (note 2)

    Warichu (inline cutting note) juts into the line gap on either side of a line. The basic line gap isn't changed where warichu occurs (the line gap between warichu itself and the adjacent lines looks narrower than for the rest of the line), so when warichu is likely to occur in text, the line gap for the kihon-hanmen may be set slightly larger than normal to accommodate it. The same is true for tate-chu-yoko or subscript and superscript (ornament characters). Further explanation of the placement of warichu and other items is provided in 3 Line Composition.

    Example of inter-line processing with warichu between lines.
    [Fig.35]: Example of inter-line processing with warichu between lines.

    (note 3)

    It is common that the line gap for the kihon-hanmen is set to a value between a half em space and the one em space of the character frame used for the kihon-hanmen. A half em space can be chosen in cases where the line length is short, but a one em space or close to it is more appropriate when the line length is longer than 35 characters.

    (note 4)

    Unless ruby or other design elements are placed in the space between lines (e.g. for books such as classics, with many annotations), there is no need to make the line-gap larger than full-width, since this would decrease legibility.

    (note 5)

    It is said that the standard line-gap in Western text layout is a one third em space, which is smaller than that in Japanese composition. This difference again comes from the different approach to the design of Latin and Japanese characters.

    (note 6)

    There is another method of specifying the kihon-hanmen that uses line feeds rather than line gaps. Line feed is the distance between two adjacent lines measured from their reference points (see [Fig.36]). The reference point differs from implementation to implementation, however, in vertical writing mode the horizontal center of the character frame is usually used, and with horizontal writing mode, the vertical center of the character frame is used. When the character size is the same for every character, the following calculation is used:
    line feed = character size/2 + line gap + character size/2 i.e. character size + line gap
    line gap = line feed - character size

    Specifying kihon-hanmen with line feed.
    [Fig.36]: Specifying kihon-hanmen with line feed.

    The size of the kihon-hanmen in this case can be calculated by following method:

    • Vertical writing mode with one column

      Width of kihon-hanmen = character size * number of lines per page + line gap * (number of lines per page -1)

      e.g.

      9 point * 18 lines + 8 point (18 lines -1) = 298 point

      Height of kihon-hanmen = character size * number of characters per line

      e.g.

      9 point * 52 characters = 468 points

    • Vertical writing mode with multi columns

      Width of kihon-hanmen = character size * number of lines per column + line gap * (number of lines per column -1)

      e.g.

      9 point * 21 lines + 6 point * (21 lines -1) = 309 points

      Height of kihon-hanmen = (character size * number of characters per line) * number of columns + column gap * (number of columns - 1)

      e.g.

      9 point * 25 characters + 18 point * (2-1) = 468 points

    • Horizontal writing mode with one column

      Width of kihon-hanmen = character size * number of characters per line

      e.g.

      9 point * 35 characters = 315 points

      Height of kihon-hanmen = character size * number of lines per page + line gap * (number of lines per page -1)

      e.g.

      9 point * 28 lines + 8 point * (28 lines - 1) = 468 points

    • Horizontal writing mode with multi columns

      Width of kihon-hanmen = (character size * number of characters per line) * number of columns + column gap * (number of columns - 1)

      e.g.

      (8 point * 19 characters) * 2 + 16 point * (2-1) = 320 points

      Height of kihon-hanmen = character size * number of lines per column + line gap * (number of lines per column - 1)

      e.g.

      8 point * 40 lines + 4 point * (40 lines -1) = 476 points

2.5 Page wise Arrangement of Kihon-hanmen Elements

2.5.1 Examples of Items Jutting Out of the Kihon-hanmen

The various elements of a page should remain inside the boundaries of the kihon-hanmen. However, there are exceptions such as the following:

  1. Ruby or emphasis marks (bousen, emphasis dots, etc.) at the before edge of the hanmen, are placed outside the hanmen (see [Fig.37]). The same applies in cases where ruby, underline, etc. appear beyond the after edge of the hanmen. Like the handling of exceptions mentioned below, the purpose here is to preserve the line positions established for the kihon-hanmen. This technique can also be used for reference marks associated with lines of text.

    Example of ruby annotation placed outside of the kihon-hanmen.
    [Fig.37]: Example of ruby annotation placed outside of the kihon-hanmen.
  2. When there are inline elements whose dimensions extend beyond the before edge and the after edge of a line of characters as determined by the kihon-hanmen, and when those elements appear in the first or last line of the hanmen, the parts that jut out beyond the regular line of characters also jut out of the hanmen area. For example, this is the case when the width of a sequence of characters which are set to tate-chu-yoko is wider than the characters set for the kihon-hanmen. In addition, warichu (inline cutting note) or subscript and superscript (ornament characters) are handled in the same way. (The processing rules for this item and the previous item are defined in JIS X 4051, sec. 12.1.1.)

  3. Line adjustment by hanging punctuation is only necessary for full stops (cl-06) and commas (cl-07) when they would otherwise need to be wrapped to the line head. The character is placed so that it touches the hanmen at the line end (see [Fig.38]). (Hanging punctuation is not defined in JIS X 4051, but there is an explanation in sec. 8.1, c.)

    Example of IDEOGRAPHIC COMMA and IDEOGRAPHIC FULL STOP placed below the kihon-hanmen.
    [Fig.38]: Example of IDEOGRAPHIC COMMA and IDEOGRAPHIC FULL STOP placed below the kihon-hanmen.

    (note 1)

    Line adjustment by hanging punctuation is a way of reducing the processing cost of line adjustment by reducing the need to change inter-character space.

    (note 2)

    A lot of books apply hanging punctuation.

  4. Illustrations and tables are normally placed inside the area defined by the kihon-hanmen. However, there may also be cases in which a particular illustration or table juts outside the kihon-hanmen.

    1. Cases in which it is necessary to make the illustration or table larger than the kihon-hanmen, because reducing its size would make it unreadable.

    2. For the sake of visual effect, the illustration may bleed into the complete paper area. This is not often used in books, but is often used in magazines (see [Fig.39]).

      Example of bleeds.
      [Fig.39]: Example of bleeds.
  5. Magazines may place the captions of illustrations outside the column area or in the column gap. (Some people regard this as bad style.)

2.5.2 Line Positioning based on the Kihon-hanmen Design

In principle, pagewise positioning of lines relies on the line positions established for the kihon-hanmen. This holds for lines with ruby or emphasis dots as shown in [Fig.34]. Even when lines contain characters that are smaller than the character size used for the kihon-hanmen (as shown in [Fig.40]), the line positions used for the kihon-hanmen continue to be used as the basic guide lines. This is so that following lines with normal-sized characters still naturally fall into the line positions established for the kihon-hanmen.

Positioning of lines with a smaller size of text.
[Fig.40]: Positioning of lines with a smaller size of text.

(note 1)

Characters within brackets are made smaller, since the text is an additional explanation. Such cases are handled in the following three ways. The first method, making only characters in restricted places smaller, is the most commonly used.

  1. Make the characters smaller only in restricted places, for example for references.

  2. Make all characters within brackets smaller (as shown in [Fig.40]).

  3. Make all characters within brackets the same size as the character size established for the kihon-hanmen.

The following are exceptions when handling line position:

  1. When inserting more than one illustration or table item in horizontal writing mode, assuming that there is no text to the left or right of the items, the items may either slip off the lines established for the kihon-hanmen (see [Fig.41]), or stick to the lines (see [Fig.42]). The former approach is used, whenever possible, to achieve inter-character spacing before and after illustrations or tables . (This method is often used in books.) (This processing method is defined in JIS X 4051, sec. 10.3.2., d.)

    Positioning of lines with multiple illustrations - 1.
    [Fig.41]: Positioning of lines with multiple illustrations - 1.
    Positioning of lines with multiple illustrations - 2.
    [Fig.42]: Positioning of lines with multiple illustrations - 2.
  2. The size of characters in endnotes inserted between paragraphs or those in footnotes at the bottom of the page (in horizontal writing mode) is smaller than the character size established for the kihon-hanmen. As a result, the character size and line gap are also smaller, and so the line positions are no longer identical to those established for the kihon-hanmen. As an example, [Fig.43] shows the position of an endnote between paragraphs in vertical writing mode. (The processing of endnotes is defined in JIS X 4051, sec. 9.3, and the processing of footnotes in sec. 9.4.)

    Positioning of an endnote in vertical writing mode.
    [Fig.43]: Positioning of an endnote in vertical writing mode.
  3. As mentioned above, the position of a heading may not be identical to the lines established for the kihon-hanmen. Nevertheless, in the block direction, headings base their alignment on the line positions established for the kihon-hanmen (see [Fig.14]).

2.5.3 Character Positioning based on Kihon-hanmen Design

In principle, the characters in each line follow the solid setting positions of characters established for the kihon-hanmen. However, as already shown in some of the previous figures, there are examples where this is not the case. Such cases are rather common, and here we will show some prototypical examples (details will be given in 3 Line Composition).

(note 1)

Where character sizes differ from the solid set sizes established for the kihon-hanmen, line lengths may not be identical with the line length of the kihon-hanmen; it is necessary to align the ends of the lines, with the exception of the last line in a paragraph. The processing method for this is explained in 3.8 Line Adjustment.

  1. When 9pt is the character size used to establish the kihon-hanmen, characters smaller than 9pt may be inserted in part of a line (see [Fig.40]). In such cases, the parts set at 9pt and any parts set at a smaller, say, 8pt size both use solid setting, with character frames at the respective sizes for each part.

  2. In cases where proportional Latin letters are rotated 90 degrees clockwise (see [Fig.25]), the proportional letters are placed according to their proportional widths. Hence, they do not fit to the character positions established for the kihon-hanmen (see [Fig.44]). Japanese letters following the Latin letters consequently slip away from the default positions as well.

    Positioning of a mixture of Western and Japanese letters in a line.
    [Fig.44]: Positioning of a mixture of Western and Japanese letters in a line.
  3. There are several methods for positioning opening brackets (cl-01) at the beginning of a line (details are explained in 3.1.5 Positioning of Opening Brackets at Line Head). Because an opening bracket is not a full-width character, in cases where the indentation of the first line of a paragraph is a one em space, or if the tentsuki position is used for the bracket (that is, there is no space at the line head), the character following the bracket will be in a position which does not fit to the character positions established for the kihon-hanmen (see [Fig.45]). However, the adaptations made during the alignment of line ends will ensure that the character at the end of a line is at a position that fits with the kihon-hanmen.

    Example of positioning of characters off the kihon-hanmen position due to opening brackets at the line head.
    [Fig.45]: Example of positioning of characters off the kihon-hanmen position due to opening brackets at the line head.
  4. 3 Line Composition explains that full stops (cl-06), commas (cl-07), opening brackets (cl-01) and closing brackets (cl-02) are half-width. If these punctuation marks and brackets are adjacent to ideographic (cl-19), katakana (cl-16) or hiragana (cl-15) characters, in principle there should be a half em space before or after the punctuation mark or brackets, so that these occupy in effect a full-width size. However, if they are adjacent to other punctuation marks or brackets, the half em space is not used. This is done to improve the visual appearance. In such cases, the character positions are different than the positions established when defining the kihon-hanmen (see [Fig.46]).

    Example of lines with consecutive punctuation marks.
    [Fig.46]: Example of lines with consecutive punctuation marks.
  5. 3 Line Composition explains the principle that closing brackets (cl-02), full stops (cl-06) and commas (cl-07) should not be placed at the line head. If by simple sequential placement these characters would appear at the line head or at the line end, some kind of adjustment becomes necessary. A similar adjustment is required for characters that should not be placed at the end of a line, such as opening brackets (cl-01). As a result of such adjustment, it may happen that other characters are placed at positions which are different from those established for the kihon-hanmen.

    Example of line adjustment to avoid those characters which shall not start and end a line.
    [Fig.47]: Example of line adjustment to avoid those characters which shall not start and end a line.

2.6 Running Heads and Page Numbers

2.6.1 Positioning of Running Heads and Page Numbers

Typical positions of running heads and page numbers for vertically set books with double running heads (see 2.6.3 Ways of Arranging Running Heads and Page Numbers) are as shown in [Fig.48].

Typical positioning of running heads and page numbers for vertically set books with double running heads.
[Fig.48]: Typical positioning of running heads and page numbers for vertically set books with double running heads.

Typical positions of running heads and page numbers for horizontally set books with double running heads (see 2.6.3 Ways of Arranging Running Heads and Page Numbers) are as shown in [Fig.49].

Typical positioning of running heads and page numbers for horizontally set books with double running heads.
[Fig.49]: Typical positioning of running heads and page numbers for horizontally set books with double running heads.

In principle, positions of running heads and page numbers should be specified relative to the kihon-hanmen, not with absolute coordinates in the trim size. (Positioning of running heads is defined in JIS X 4051, sec. 7.6.4. Positioning of page numbers is defined in JIS X 4051, sec. 7.5.4.)

Example:

Positioning a horizontal running head above the top left corner (to head and fore-edge) of the kihon-hanmen in a typical vertically set book (see [Fig.50]).

9 points above the kihon-hanmen (vertical space)

9 points from the left edge of the kihon-hanmen (horizontal space)

Positioning of a running head (vertical writing mode).
[Fig.50]: Positioning of a running head (vertical writing mode).

The following recommendations should be taken into account when positioning running heads and page numbers with reference to the kihon-hanmen.

  1. When positioning horizontal running heads and page numbers with reference to the kihon-hanmen in vertically set books, the amount of vertical space between the edge of the kihon-hanmen and the running head is a one em space as established for the kihon-hanmen. If the kihon-hanmen of the book is horizontally set, take more vertical space than the character size in the kihon-hanmen.

  2. Regardless of the direction of text in the kihon-hanmen of a book, horizontal running heads and page numbers on the left page should be aligned either at the left edge of the kihon-hanmen or one em space to the right of the left edge. On the right page, the tail of the running heads or page numbers should be aligned either at the right edge of the kihon-hanmen or one full-width space to left of the right edge.

  3. Regardless of the direction of text in a book, when arranging running heads and page numbers together on the same horizontal line, the space between the running head and the page number should be double or one and a half times the character size of the running head. On the left page, the page number should be set at the left side and the running head should be set at the right side. On right-hand pages, the page number should be set at the right side and the running head should be set at the left side. The exact positions of the page numbers are given by the instructions above (see b).

  4. When positioning running heads and page numbers vertically to the fore-edge of the kihon-hanmen in a vertically set book (see spread (e) in [Fig.48], for example), the minimum horizontal distance from the kihon-hanmen should be the same as that of the line gap of the kihon-hanmen. The top of the running head should be positioned approximately four kihon-hanmen characters below the head, and the bottom of the page numbers should be positioned approximately five kihon-hanmen characters above the foot.

    (note 1)

    In general, ideographic numerals (一二三四五六七八九〇) are used for vertically set page numbers, and European numerals for horizontal pagination. When using independent pagination for the front matter, small Roman numerals are used for horizontal pagination.

2.6.2 Principles of Arrangement of Running Heads and Page Numbers

Positioning of all running heads and page numbers in the same book should be consistent.

(note 1)

Even on a page with a text area smaller in size than that of the kihon-hanmen, such as for a table of contents or index, positioning of the running head and page number relative to the trim size will remain the same. Therefore, the positioning of running heads and page numbers relative to those areas smaller than the kihon-hanmen is different. [Fig.51] below demonstrates the respective positions of the hanmen for a table of contents and running heads or page numbers. As shown in [Fig.17], this hanmen is smaller than the kihon-hanmen. [Fig.52] demonstrates the related positions of running heads and page numbers and the hanmen of indexes. These hanmen are not only 4 points smaller at the left and right, but also 5 points smaller at the top and bottom.

Positioning of running heads and page numbers on TOC pages for which the hanmen is smaller in size than the kihon-hanmen.
[Fig.51]: Positioning of running heads and page numbers on TOC pages for which the hanmen is smaller in size than the kihon-hanmen.
Positioning of running heads and page numbers on index pages for which hanmen is smaller in size than the kihon-hanmen.
[Fig.52]: Positioning of running heads and page numbers on index pages for which hanmen is smaller in size than the kihon-hanmen.

Because the start of a page will be on the recto side, the right-hand page of a spread in a vertically set book is always an even page and the left-hand page is always an odd page (see [Fig.53]). Likewise, the left-hand page of a spread in a horizontally set book is always an even page and the right-hand page is always an odd page (see [Fig.54]).

Page numbers on a spread in a vertically set book.
[Fig.53]: Page numbers on a spread in a vertically set book.
Page numbers on a spread in a horizontally set book.
[Fig.54]: Page numbers on a spread in a horizontally set book.

2.6.3 Ways of Arranging Running Heads and Page Numbers

There are two ways to arrange running heads. One is the single running head method and the other is the double running head method. (Arrangement of running heads is defined in JIS X 4051, sec. 7.6.2. Page Numbers are defined in sec. 7.5.2.).

Double running head method.
[Fig.55]: Double running head method.
Single running head method.
[Fig.56]: Single running head method.

(note 1)

In general, there will be only one running head per page. However, in some cases, such as in dictionaries, multiple running heads are printed on each page to indicate contents.

(note 2)

In general, there will be only one page number per page. However in some cases multiple page numbers are printed on each page as in the following examples:

  1. When a horizontally set index and/or bibliography appears at the end of a volume in a vertically set book, both reverse pagination and continuous pagination are printed.

  2. For multivolume works, both serial page numbers throughout the work and page numbers per volume are printed.

  1. In the double running head method, a higher-level title, such as that of the chapter or book, is used for the running heads on the even pages, and a lower-level title, such as that for a section, on the odd pages. Where there are no differing levels of titles, such as on the page containing the table of contents, the same running head is used on both even and odd pages.

    (note 1)

    Which information is used for the running heads depends on the content of the book. Given that the main purpose of running heads is to signpost to readers what is written on each page, or the content of the current page, it does not make much sense to use the book title for the running head. The most common approach for a book with three levels of headings, such as chapter, section and subsection, is to use the highest level heading (i.e. chapter) and the second level heading (i.e. section).

  2. In the single running head method, one of the headings between the top and third levels is used.

  3. In principle, the contents of running heads will be the same as those of headings with the following differences:

    1. Numbers and words in Latin alphanumeric characters in vertically set headings in vertically set books should be changed to horizontal notation for horizontally set running heads (see 2.3.2 Major Differences between Vertical Writing Mode and Horizontal Writing Mode).

    2. If headings are too long, they should be made shorter by paraphrasing them in fewer characters. Running heads with too many characters will not look good.

    3. For certain publications, such as a collection of monographs, the names of authors may be added in parentheses at the end of the running head.

  4. In principle, the text direction of running heads and page numbers should be the same as that of the kihon-hanmen. For vertically set books, however, it is more common to set running heads and page numbers horizontally.

  5. In principle, for the single running head method running heads are printed on all odd pages, and for the double running head method on all even and odd pages. However, for the sake of appearance, running heads may be omitted as follows:

    1. Pages on which running heads should be hidden:

      1. Naka-tobira and han-tobira.

      2. Pages where a running head overlaps with other elements such as illustrations.

      3. Blank pages.

    2. Pages on which running heads may be hidden:

      1. Pages where a figure or a table is positioned adjacent to the running head.

      2. Pages with a heading right after a new recto or new page.

  6. In principle, page numbers are printed on all pages. However, for the sake of appearance, they may be omitted as follows:

    1. Pages on which page numbers should be hidden:

      1. Pages on which a illustration or a table is positioned adjacent to the page number.

      2. Blank pages.

    2. Pages on which page numbers may be hidden:

      1. Divisional title and simplified divisional title pages.

      2. Pages in horizontally set books with a page number placed in the margin at the top of the page, and with a heading at the beginning of a new recto or new page. (In this case, it is also possible to move the page numbers to the center of the margin at the foot of the page.)

    (note 1)

    Pages are not counted in cases such as the following:

    1. If a different type or color of paper is used for the main title page,

    2. if a frontispiece is inserted in the opening page of a book; or

    3. if an illustration of the enclosure or a divisional title is present in the main text.

  7. There are two types of page numbering. "Continuous pagination" means that page numbers continue throughout the whole book. "Independent pagination" means that page numbers start from "1" separately at beginning of the front matter and back matter. There is also, for example in manuals, the method of starting each chapter from page number "1". (In such cases, it is common that the name of the chapter is added as a prefix before the page number.)

    (note 1)

    If the front matter and the main text have different page numbers, each starts with page number "1". In this case, it is common to use Roman numerals for the pages of the front matter, in order to distinguish them from the main text.

    (note 2)

    For vertically set books with indexes in horizontal writing mode, the following methods are available.

    1. Reverse pagination. The index reads from the end of the book, and page numbers are added starting with "1" from the end of the book and flow in the same order as the index.

    2. Continuous pagination. The index reads from the end of the book, but page numbers start with "1" and flow in the same order as the book. (The index pages flow in the reverse order to the page numbers.)

    3. Both reverse pagination and continuous pagination. In this case, the page numbers for continuous pagination are in the same position as the page numbers of the main text, and page numbers in reverse pagination are in a different position (for example, if serial pagination is in the foot of the page, reverse pagination is in the head). Often other methods are applied to distinguish the different paginations. For example, Arabic numbers are used for both continuous pagination and reverse pagination, but for reverse pagination, brackets are added around the numbers.

3 Line Composition

3.1 Line Composition Rules for Punctuation Marks

3.1.1 Differences in Vertical and Horizontal Composition in Use of Punctuation Marks

There are some punctuation marks that are used uniquely in either vertical writing mode or horizontal writing mode. In this document, characters and symbols are treated as members of a character class, classified by their behavior for composition. Each class name is followed by class id, such as opening brackets (cl-01). Details are explained in 3.9 About Character Classes. The following are some typical examples:

  1. Full stops (cl-06) and commas (cl-07)

    1. In vertical writing mode, IDEOGRAPHIC FULL STOP "。" and IDEOGRAPHIC COMMA "、" are used for full stops (cl-06) and commas (cl-07).

    2. In horizontal writing mode, there are three conventions in choice of symbols for full stops (cl-06) and commas (cl-07):

      1. Using COMMA "," and FULL STOP "." (see [Fig.57]).

        Example text using COMMA and FULL STOP.
        [Fig.57]: Example text using COMMA and FULL STOP.
      2. Using COMMA "," and IDEOGRAPHIC FULL STOP "。" (see [Fig.58]).

        Example text using COMMA and IDEOGRAPHIC FULL STOP.
        [Fig.58]: Example text using COMMA and IDEOGRAPHIC FULL STOP.
      3. Using IDEOGRAPHIC COMMA "、" and IDEOGRAPHIC FULL STOP "。" (see [Fig.59]).

        Example text using IDEOGRAPHIC COMMA and IDEOGRAPHIC FULL STOP.
        [Fig.59]: Example text using IDEOGRAPHIC COMMA and IDEOGRAPHIC FULL STOP.

      (note 1)

      In horizontal writing mode, there are many cases of composition that mixes Japanese and Western text. The convention shown in (i) is a way to apply the same comma and full stop to both Western and Japanese texts for consistency, and is common in books on science and technology. The convention shown in (ii) was invented because in (i) FULL STOP "." appears too small for Japanese texts and using IDEOGRAPHIC FULL STOP "。" for period looks better. This convention has been adopted for Japanese official publications. (In the past, COMMA "," and FULL STOP "." were used for some official publications.)

  2. LEFT CORNER BRACKET "「", RIGHT CORNER BRACKET "」", LEFT DOUBLE QUOTATION MARK "“" and RIGHT DOUBLE QUOTATION MARK "”"

    1. In vertical writing mode, LEFT CORNER BRACKET "「" and RIGHT CORNER BRACKET "」" are used for quotations (see [Fig.60]).

      Examples of quoted text using LEFT CORNER BRACKET and RIGHT CORNER BRACKET.
      [Fig.60]: Examples of quoted text using LEFT CORNER BRACKET and RIGHT CORNER BRACKET.
    2. In horizontal writing mode, pairs of LEFT DOUBLE QUOTATION MARK "“" and RIGHT DOUBLE QUOTATION MARK "”" or pairs of LEFT SINGLE QUOTATION MARK "‘" and RIGHT SINGLE QUOTATION MARK "’" may be used in place of LEFT CORNER BRACKET "「" and RIGHT CORNER BRACKET "」" (see [Fig.61]).

      Examples of quoted text using LEFT DOUBLE QUOTATION MARK and RIGHT DOUBLE QUOTATION MARK.
      [Fig.61]: Examples of quoted text using LEFT DOUBLE QUOTATION MARK and RIGHT DOUBLE QUOTATION MARK.

      (note 1)

      This is because LEFT CORNER BRACKET "「" and (especially) RIGHT CORNER BRACKET "」" may not look good in horizontal writing mode, but adoption of corner brackets for horizontal writing mode seems to be increasing.

      (note 2)

      Though LOW DOUBLE PRIME QUOTATION MARK "〟" and REVERSED DOUBLE PRIME QUOTATION MARK "〝" are similar to double quotation marks in appearance (see [Fig.62]) they are exclusively used for vertical writing mode and are not to be used in horizontal writing mode.

      Examples of quoted text using LOW DOUBLE PRIME QUOTATION MARK and REVERSED DOUBLE PRIME QUOTATION MARK.
      [Fig.62]: Examples of quoted text using LOW DOUBLE PRIME QUOTATION MARK and REVERSED DOUBLE PRIME QUOTATION MARK.

      (note 3)

      LEFT DOUBLE QUOTATION MARK "“" and RIGHT DOUBLE QUOTATION MARK "”" are exclusively for horizontal writing mode and not to be used in vertical writing mode. Also, LEFT SINGLE QUOTATION MARK "‘" and RIGHT SINGLE QUOTATION MARK "’" are exclusively for horizontal writing mode and not to be used in vertical writing mode. However, in vertical writing mode, when Western characters (cl-27) are composed rotated 90 degrees clockwise, these quotation marks are sometimes used.

  3. LEFT SQUARE BRACKET "[", RIGHT SQUARE BRACKET "]", LEFT TORTOISE SHELL BRACKET "〔" and RIGHT TORTOISE SHELL BRACKET "〕"

    LEFT TORTOISE SHELL BRACKET "〔" and RIGHT TORTOISE SHELL BRACKET "〕" are vertical variants of LEFT SQUARE BRACKET "[" and RIGHT SQUARE BRACKET "]", which are used in horizontal writing mode. Square brackets should be used in horizontal writing mode except for special cases.

(note 1)

The position of the letter face of commas (cl-07) and full stops (cl-06) within the character frame differs in vertical and horizontal writing modes. The same letter face can be used for opening brackets (cl-01), closing brackets (cl-02) and hyphens (cl-03) in both vertical and horizontal writing mode by rotating clockwise 90 degrees to the inline direction. The position of the letter face of small kana (cl-11) symbols within the character frame is different in vertical and horizontal writing modes. For KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー", the difference between vertical and horizontal writing modes is not only in the orientation of the letter form to the inline direction, but also the shape of the symbol (see [Fig.63]).

KATAKANA-HIRAGANA PROLONGED SOUND MARK for vertical and horizontal writing modes.
[Fig.63]: KATAKANA-HIRAGANA PROLONGED SOUND MARK for vertical and horizontal writing modes.

3.1.2 Positioning of Punctuation Marks (Commas, Periods and Brackets)

The positioning of punctuation marks (commas, periods and brackets) in a line proceeds as follows.

(note 1)

Basic processing of characters and symbols, including punctuation marks, which are subject to considerations of line head wrapping, line end wrapping and inter-character space adjustment, will be described in detail in 3.9 About Character Classes. All combinations of character class are provided as a complete table in Appendix B Spacing between Characters.

The character advance of commas (cl-07), full stops (cl-06), opening brackets (cl-01), closing brackets (cl-02) and middle dots (cl-05) is half-width (half em). But when those punctuation marks are placed side-by-side with ideographic (cl-19), hiragana (cl-15), or katakana (cl-16) characters, in principle, a given amount of space will be inserted before or after the symbols, which makes them appear as if they were intrinsically full-width (one em) (see [Fig.64]). Space is inserted before and after middle dots (cl-05). This principle makes the symbols consistent with ideographic (cl-19), hiragana (cl-15) and katakana (cl-16) characters in character width, and at the same time the space for punctuation helps to make the organization of text clearer. The space added before or after punctuation marks is subject, in principle, to line adjustment and may eventually be removed, except for that added after full stops (cl-06). (Details of line adjustment are discussed in 3.8 Line Adjustment).

  1. After commas (cl-07), a half em space is added, in principle.

  2. After full stops (cl-06), in the middle of a line, a half em space is added. At the end of a line, a half em space is added, in principle.

  3. Before opening brackets (cl-01), a half em space is added, in principle.

  4. After closing brackets (cl-02), a half em space is added, in principle.

  5. Before and after middle dots (cl-05), a quarter em space is added, in principle.

Character widths of commas, periods, and the space appended before and/or after the symbols.
Character widths of commas, periods, and the space appended before and/or after the symbols.
[Fig.64]: Character widths of commas, periods, and the space appended before and/or after the symbols.

(note 1)

In font implementations, punctuation marks can be given a different character width, but it is expected that the font is capable of following the line composition rules explained here to produce the final result. For example, when opening brackets (cl-01) and closing brackets (cl-02) are implemented with full-width size, it is possible that a minus half em space is inserted between adjacent closing brackets (cl-02) and opening brackets (cl-01) (Some implementations prepare minus half em and quarter em spaces). In letterpress printing, it was also common practice to combine punctuation marks with a half-width body and half em spaces in order to make it easier to remove the space later for adjustment. Because of that, the types were picked up except for the punctuation marks at the type-picking phase, following the manuscript, and the punctuation marks were picked only when they were necessary in composing a page. Later, with the increasing adoption of Monotype machines, punctuation marks with a full-width body became popular and both full-width and half-width punctuation marks have been used, mixed together, since then.

(note 2)

Among opening brackets (cl-01) and closing brackets (cl-02), LEFT PARENTHESIS "(", RIGHT PARENTHESIS ")", LEFT ANGLE BRACKET "〈" and RIGHT ANGLE BRACKET "〉" are used to indicate supplementary explanations, and in that case their usage differs slightly from other opening brackets (cl-01) and closing brackets (cl-02). To reflect the difference, there is an alternative convention to not append a half em space before or after the parentheses and angle brackets, and instead just set them solid (see [Fig.65]).

Positioning of parentheses and brackets. (The left-hand side shows an example of setting them solid.)
[Fig.65]: Positioning of parentheses and brackets. (The left-hand side shows an example of setting them solid.)

3.1.3 Exceptional Positioning of Ideographic Comma and Katakana Middle Dot

The space usually added after IDEOGRAPHIC COMMA "、" and the space before and after KATAKANA MIDDLE DOT "・" are omitted, in principle, for cosmetic reasons in the following cases.

  1. In vertical writing mode, ideographic numerals and IDEOGRAPHIC COMMA "、" used as a decimal separator are set solid (as in the right line in [Fig.66]).

    Example of  exceptional positioning of the IDEOGRAPHIC COMMA.
    [Fig.66]: Example of exceptional positioning of the IDEOGRAPHIC COMMA.

    (note 1)

    In vertical writing mode, ideographic digits used with IDEOGRAPHIC COMMA "、" to represent an approximate number are expected to be set solid too (as in the right line in [Fig.67]).

    Example of the positioning of IDEOGRAPHIC COMMA with ideographic digits to represent an approximate number.
    [Fig.67]: Example of the positioning of IDEOGRAPHIC COMMA with ideographic digits to represent an approximate number.
  2. Ideographic digits and KATAKANA MIDDLE DOT "・" representing a decimal point are set solid (as in the right line in [Fig.68]). In vertical writing mode, when KATAKANA MIDDLE DOT "・" is used as a member of unit symbols (cl-25) in unit symbols, grouped numerals (cl-24), and Western characters (cl-27) in mathematical and chemical formulae, before and after KATAKANA MIDDLE DOT "・" is set solid.

    Example of the exceptional positioning of KATAKANA MIDDLE DOT.
    [Fig.68]: Example of the exceptional positioning of KATAKANA MIDDLE DOT.

3.1.4 Positioning of Consecutive Opening Brackets, Closing Brackets, Commas, Full Stops and Middle Dots

In cases where multiple punctuation marks, such as opening brackets (cl-01), closing brackets (cl-02), commas (cl-07), full stops (cl-06) and middle dots (cl-05), come one after the other, the following space adjustments are made for aesthetic reasons (see [Fig.69]). Note also that the half em and quarter em spaces added before or after punctuation marks, including the half em space after full stops (cl-06) appearing in the middle of a line, are subject, in principle, to line adjustment and may eventually be removed, except for those added after full stops (cl-06). (See 3.8 Line Adjustment for more about line adjustment.) For more information about the positioning of closing brackets (cl-02), full stops (cl-06), commas (cl-07) and middle dots (cl-05) at line end, see 3.1.9 Positioning of Closing Brackets, Full Stops, Commas and Middle Dots at Line End.

  1. When closing brackets (cl-02) come immediately after commas (cl-07) or full stops (cl-06), remove the default half em space between them and, in principle, add a half em space after the closing brackets (see [Fig.69] (1)).

  2. When commas (cl-07) come immediately after closing brackets (cl-02), remove the default half em space between them and, in principle, add a half em space after the comma. When full stops (cl-06) come immediately after closing brackets (cl-02), remove the default half em space between them and, in middle of a line, add a half em space after the full stop; at the end of a line, in principle, add a half em space after the full stop (see [Fig.69] (2)).

  3. When opening brackets (cl-01) come immediately after commas (cl-07), in principle, add a half em space between them (see [Fig.69] (3)). When opening brackets (cl-01) come immediately after full stops (cl-06) in the middle of a line, add a half em space between them. Note that when full stops (cl-06) come in the bottom of lines, in principle, insert a half space after full stops (cl-06).

  4. When opening brackets (cl-01) come immediately after closing brackets (cl-02), in principle, add a half em space between them (see [Fig.69] (4)).

  5. When opening brackets (cl-01) come immediately after other opening brackets (cl-01), set them solid and, in principle, add a half em space before the first one (see [Fig.69] (5)).

  6. When closing brackets (cl-02) come immediately after other closing brackets (cl-02), set them solid and, in principle, add a half em space after the last closing bracket (see [Fig.69] (6)).

  7. When middle dots (cl-05) come immediately after closing brackets (cl-02), in principle, add a quarter em space before the following middle dot (see [Fig.69] (7)).

  8. When opening brackets (cl-01) come immediately after middle dots (cl-05), in principle, add a quarter em space after the preceding middle dot (see [Fig.69] (7)).

Examples of line adjustment with multiple opening brackets, closing brackets, commas, full stops or middle dots.
[Fig.69]: Examples of line adjustment with multiple opening brackets, closing brackets, commas, full stops or middle dots.

The line adjustment rules shown above have been established because the default half em space before or after consecutive punctuation marks, or quarter em space before and after them, makes the line look sparse and doesn't make the line appear well-proportioned (see [Fig.70]).

Examples of bad line composition with unadjusted spaces between multiple opening brackets, closing brackets, commas, full stops or middle dots.
[Fig.70]: Examples of bad line composition with unadjusted spaces between multiple opening brackets, closing brackets, commas, full stops or middle dots.

(note 1)

Japanese composition is based on the design of full-width characters, but strictly following full-width based composition sometimes produces an unbalanced appearance. In such exceptional cases, the appearance of the resulting composition must be given higher priority than the full-width design principle. When and how to invoke such exceptional procedures has a direct bearing on the quality of the text layout. In other words, it is a matter of how to resolve the conflicts between the principle and the products of it.

3.1.5 Positioning of Opening Brackets at Line Head

When starting a new line with opening brackets (cl-01) there are some patterns as shown in [Fig.71]. Note that the amount of line indent after the line feed (the first line indent of a new paragraph) is assumed to be a one em space across all the patterns.

  1. The first line indent after the line feed is set full-width (one em) and the next line after the first line break starts with no space (so-called tentsuki) (see [Fig.71] (1)).

  2. The first line indent after the line feed is set one and a half em and the next line indent after the first line break is set to a half em (see [Fig.71] (2)).

  3. The first line indent after the line feed is set at a half em and the next line after the first line break is set tentsuki (see [Fig.71] (3)).

Examples of positioning of opening brackets at line head.
[Fig.71]: Examples of positioning of opening brackets at line head.

(note 1)

Because the inherent character width of a bracket is half-width, [Fig.71] (1) can be explained as the result of applying the principle that any line should start with no space. On the other hand, the principle represented by [Fig.71] (2) is to assume that opening brackets should be always accompanied by the preceding half em space as if they were full-width and then apply the same principle as in [Fig.71] (1). JIS X 4051 adopts the principle shown in (1) (the patterns shown in (2) and (3) are offered as options) . The pattern shown in (3) was first invented in books such as novels, which use frequent line feeds and corner brackets in dialogues, for which the first line indent with one em or one and a half em would create the appearance of too much space (then this pattern was accepted and adopted by general books). Major Japanese publishers who deal with literature, such as Kodansha, Shinchosha, Bungei Shunju, Chuoh Kouronsha, and Chikuma Shobo, have adopted the pattern shown in (3). By contrast, Iwanami Shoten and other publishers adopted the pattern shown in (1). Because Iwanami Shoten once adopted pattern (2) in vertical composition, there used to be many examples of it, but few examples of (2) can be found today.

(note 2)

The first line indent of a new paragraph is full-width in principle. However, the following exceptions can be found.

  1. The most popular scheme is to set the first line indent of all new paragraphs to full-width. However, even if there is a paragraph break and the new line looks like the beginning of new paragraph, in contexts where the new line is a continuation of the preceding line, the new line is set tentsuki as shown in [Fig.72]. (There are books such as novels which adopt full-width line indent without exception.) Similarly, in horizontal writing mode, the line indent is set tentsuki where the new line continues the preceding line of a mathematical expression connected by conjunctions such as "therefore".

    Examples of line indent followed by the preceding line with quoted text (as in dialogues).
    [Fig.72]: Examples of line indent followed by the preceding line with quoted text (as in dialogues).
  2. When headings have no line indent, the first line indent of the first paragraph after the heading can be also set tentsuki, for cosmetic reasons. However, it is not recommended to set the first line indent to tentsuki for all paragraphs, because it would make paragraph breaks unclear.

3.1.6 Positioning of Dividing Punctuation Marks (Question Mark and Exclamation Mark) and Hyphens

The dividing punctuation marks (cl-04) (QUESTION MARK "?" and EXCLAMATION MARK "!") should be full-width, and they are typeset as follows.

  1. Basically, add no space before dividing punctuation marks (cl-04) at the end of a sentence and add a one em space after them (see [Fig.73]). However when a closing bracket (cl-02) follows right after the dividing punctuation mark, add no space after the dividing punctuation mark and add a half em space after the closing bracket (see [Fig.73]).

    Positioning of dividing punctuation marks (Examples in vertical writing mode).
    [Fig.73]: Positioning of dividing punctuation marks (Examples in vertical writing mode).

    (note 1)

    Many implementations use full-width ideographic space (cl-14) for the one em space appended after dividing punctuation marks (cl-04).

    (note 2)

    No full stops (cl-06) should be appended after dividing punctuation marks (cl-04) at the end of a sentence.

    (note 3)

    There are some cases where dividing punctuation marks (cl-04) are used in the middle of a sentence, not at the end. In those cases, either add no space or a quarter em space before and after the dividing punctuation mark see [Fig.74]).

    Examples of positioning of dividing punctuation marks in the middle of a sentence (in vertical writing mode).
    [Fig.74]: Examples of positioning of dividing punctuation marks in the middle of a sentence (in vertical writing mode).

    (note 4)

    The details of composition rules for dividing punctuation marks (cl-04) and hyphens (cl-03) are described in Appendix B Spacing between Characters as a complete table, following the descriptions of character classes in 3.9 About Character Classes.

  2. When dividing punctuation marks (cl-04) at the end of a sentence reach the end of a line, apply the following rules (see [Fig.75]).

    Examples of positioning of dividing punctuation marks at the end of a line (in vertical writing mode).
    [Fig.75]: Examples of positioning of dividing punctuation marks at the end of a line (in vertical writing mode).
    1. If the line length is 13 character widths and a dividing punctuation mark (cl-04) occurs in the 12th character position, a one em space should be appended after it.

    2. If the line length is 13 character widths and a dividing punctuation mark (cl-04) occurs in the 13th character position, no space should be appended after it. In addition, do not carry over the one em space usually appended after the dividing punctuation marks to the line head of the next line; the line in this case should be set tentsuki.

The character width of hyphens (cl-03) varies according to the type of hyphen. HYPHEN "‐" should be quarter em width (i.e. one quarter of an em width), EN DASH "–" and KATAKANA-HIRAGANA DOUBLE HYPHEN "゠" should be half-width (a half em width), WAVE DASH "〜" should be full-width. Basically there should be no space before and after hyphens (cl-03). However, a half em space should be appended, in principle, when opening brackets (cl-01) follow right after a hyphen (cl-03) and a quarter em space when middle dots (cl-05) follow a hyphen (cl-03).

3.1.7 Characters Not Starting a Line

In principle, no line should begin with closing brackets (cl-02), hyphens (cl-03), dividing punctuation marks (cl-04), middle dots (cl-05), full stops (cl-06), commas (cl-07), iteration marks (cl-09), a prolonged sound mark (cl-10), small kana (cl-11) or warichu closing brackets (cl-29) (line-start prohibition rule). Otherwise the line would have an odd appearance.

(note 1)

Not a small number of books adopt a less strict set of rules which allow IDEOGRAPHIC ITERATION MARK "々" (one of the iteration marks (cl-09)), prolonged sound mark (cl-10) and small kana (cl-11) to start a line. There is another method whereby IDEOGRAPHIC ITERATION MARK "々" is replaced by a kanji character when it would otherwise be set at the head of a line. For example, 家 (at the end of a line) + 々 (at the head of the next line) will be changed to 家 (at the end of a line) + 家 (at the head of the next line).

(note 2)

There is yet another less strict rule that allows KATAKANA MIDDLE DOT "・" to start a line.

(note 3)

In the layout of newspaper text, dividing punctuation marks (cl-04) (QUESTION MARK "?" and EXCLAMATION MARK "!") are allowed to start a line. This is due to the fact that the line lengths are shorter in newspapers. When the line is very short, there are fewer opportunities for inter-character space adjustment, which makes it difficult to preserve the number of characters per line. It is thought that this is the reason why the less strict set of line head wrapping rules was adopted in newspaper text layout.

(note 4)

The details of the line-start prohibition rules and line-end prohibition rules are described in Appendix B Spacing between Characters as a complete table, following the description of character classes in 3.9 About Character Classes.

3.1.8 Characters Not Ending a Line

No line should end with opening brackets (cl-01) or warichu opening brackets (cl-28) (line-end prohibition rules). Otherwise the line would have an odd appearance.

(note 1)

The process of formatting lines to avoid non-starter characters at the line head, non-ending characters at the line end, spaces before and/or after inseparable characters, line breaking before and/or after unbreakable characters, etc., is generally called kinsokushori.

3.1.9 Positioning of Closing Brackets, Full Stops, Commas and Middle Dots at Line End

In principle, closing brackets (cl-02), commas (cl-07) or full stops (cl-06) at the line end have a half em space after them (see [Fig.76]). This half em space can be deleted for line adjustment (for more about line adjustment, see 3.8 Line Adjustment). However, the possibilities are only half em space or solid. Other spaces, such as a quarter em space should not be used. In principle, the middle dot (cl-05) character at the line end also has a quarter em space before and after, and is handled like a full-width character (see [Fig.76]). This quarter em space can also be deleted for line adjustment, namely middle dots (cl-05) can be set solid before and after (about line adjustment, see 3.8 Line Adjustment). However, in this case also, the only possibilities are quarter em space or solid setting. Other intermediate-sized spacing should not be used.

Example of handling closing brackets, full stops, commas and middle dots at the line end like full-width characters.
[Fig.76]: Example of handling closing brackets, full stops, commas and middle dots at the line end like full-width characters.

(note 1)

With regard to closing brackets (cl-02), full stops (cl-06), commas (cl-07) and middle dots (cl-05) at line end, the following processing is defined in JIS X 4051 (see [Fig.77]).

full stops (cl-06)

After full stops (cl-06), there must be a half em space, including at the line end. This half em space must not be a target for reduction during line adjustment.

commas (cl-07)

After commas (cl-07),solid setting is applied.

closing brackets (cl-02)

After closing brackets (cl-02), solid setting is applied.

middle dots (cl-05)

In principal, before middle dots (cl-05) there is a quarter em space, and after middle dots (cl-05) solid setting is applied.

Example of handling closing brackets, full stops, commas and middle dots at the line end in JIS X 4051.
[Fig.77]: Example of handling closing brackets, full stops, commas and middle dots at the line end in JIS X 4051.

(note 2)

In the letterpress printing era, the following methods were common (see [Fig.78]).

  1. For full stops (cl-06), commas (cl-07) and closing brackets (cl-02), if possible, a half em space was preserved. Using a half em space was the general approach. For middle dots (cl-05), if possible, quarter em space was preserved. Using a quarter em space was the general approach.

  2. If the line length was not sufficient or too great and line adjustment processing became necessary (see below), the first priority was to replace the half em space after full stops (cl-06), commas (cl-07) and closing brackets (cl-02) with solid setting. The reason was that this was at the line end, and no problems would arise, even though the half em space became set solid. The option of replacing the half em space after punctuation marks with a quarter em space, instead of removing the whole half em space was not used. That meant that there was the choice between either a half em space after punctuation marks or solid setting. The second priority was to replace quarter em spaces before and after middle dots (cl-05) with solid setting.

Examples of closing brackets, commas and full stops at the end of a line with either a half em space after or set solid.
[Fig.78]: Examples of closing brackets, commas and full stops at the end of a line with either a half em space after or set solid.

(note 3)

In some DTP systems etc., after full stops (cl-06), commas (cl-07) or closing brackets (cl-02) at the line end, the line end is always set solid (see [Fig.79]).

Example of always applying solid setting after closing brackets, full stops, and commas at the line end.
[Fig.79]: Example of always applying solid setting after closing brackets, full stops, and commas at the line end.

3.1.10 Unbreakable Character Sequences

If the following characters and symbols appear in sequence there will be no line break between them. The reason is that these characters and symbols are to be handled as one unit.

(note 1)

Both line head wrap and line end wrap mean that there will be no line break between characters or symbols. For line head wrap there is no break between the line head wrap character and the character or symbol before. For line end wrap, there is no break between the line end wrap character and the following character or symbol.

(note 2)

The details of unbreakable character sequences are described in Appendix C Possibilities for Line-breaking between Characters as a complete table, following the description of character classes in 3.9 About Character Classes.

  1. Between a sequence of EM DASH "—" characters (to be more specific, for a double dash, see [Fig.80]). Note that some systems implement HORIZONTAL BAR "―" with very similar behavior to EM DASH "—".

    Sequence of EM DASH characters is unbreakable.
    [Fig.80]: Sequence of EM DASH characters is unbreakable.

    (note 1)

    Other characters and punctuation marks before and after EM DASH "—" are set solid. However, in the following cases some space should be set between EM DASH "—" and other characters. As mentioned in the next note, HORIZONTAL ELLIPSIS "…", TWO DOT LEADER "‥", prefixed abbreviations (cl-12) and postfixed abbreviations (cl-13) behave similarly to EM DASH "—".

    1. Where closing brackets (cl-02) or commas (cl-07) are followed by EM DASH "—", a half em space is inserted between them, in principle. Also, in the middle of lines, where full stops (cl-06) are followed by EM DASH "—", a half em space is inserted between them. However, when full stops (cl-06) are placed at the end of a line a half em space is added after the full stops (cl-06).

    2. Where EM DASH "—" is followed by opening brackets (cl-01), a half em space is inserted between them, in principle.

    3. Where EM DASH "—" and middle dots (cl-05) are set side by side, a quarter em space is inserted between them, in principle.

    (note 2)

    A double dash is handled as one unit, hence a line break between them is forbidden. In letterpress printing, breaking of the double dash was forbidden very strongly by the fact that the double dash was created as a double full body (so it was actually not possible to break it). Nevertheless, if it was not possible to avoid a break, two EM DASH characters were used instead the double dash. That made it possible to have a line break between them.

  2. Between sequences of HORIZONTAL ELLIPSIS "…" or TWO DOT LEADER "‥" (to be more specific, double HORIZONTAL ELLIPSIS "……" or double TWO DOT LEADER "‥‥").

    Unbreakable sequence of HORIZONTAL ELLIPSIS.
    [Fig.81]: Unbreakable sequence of HORIZONTAL ELLIPSIS.

    (note 1)

    In the letterpress printing era, double HORIZONTAL ELLIPSIS etc. was created as a sequence of HORIZONTAL ELLIPSIS. Hence, compared to double dash, the line break between them was not so strongly forbidden.

  3. Between European numerals (see [Fig.82], [Fig.83], and [Fig.84].). European numerals indicate ranks via the position of a numeral.

    (note 1)

    It is possible to have a line break between ideographic numerals. Also it is possible to have a line break after IDEOGRAPHIC COMMA "、" used as a decimal separator or an indicator for approximate number, and KATAKANA MIDDLE DOT "・" as a decimal point. However, the position between ideographic numerals and IDEOGRAPHIC COMMA "、" used as a decimal separator or an indicator for approximate number, and KATAKANA MIDDLE DOT "・" used as a decimal point is unbreakable. The reason is that the natural usage of ideographic numerals is to write them like "二百三十五", that is with inserted rank indicators (Explanation: "二" means "two", "百" means "hundred". "二百" means "two hundred". "三" means "three", "十" means "ten". "三十" means "thirty"."五" means "five". The complete sequence "二百三十五" means "Two hundred and thirty five".). Hence, it is not necessary to express rank via position. In contrast, line breaks are forbidden for European numerals, since it is necessary to express the rank via the position. Furthermore, if in vertical writing mode European numerals are placed in the appropriate direction side by side, they are used like the ideographic numerals, and it is possible to have a line break between them.

    (note 2)

    When writing European numerals, FULL STOP "." is used as a decimal point, COMMA "," or space is used as a rank indicator. Line breaks cannot occur before and after these characters (see [Fig.84]: the space before "4" expresses a rank).

  4. Between prefixed abbreviations (cl-12) (YEN SIGN "¥"DOLLAR SIGN "$"CENT SIGN "¢" etc.) and the following arabic or ideographic numeral (see [Fig.82]). The reason is that such character sequences are to be handled as one unit.

    Unbreakable sequences between prefixed abbreviations and the following European numeral.
    [Fig.82]: Unbreakable sequences between prefixed abbreviations and the following European numeral.
  5. Between postfixed abbreviations (cl-13) (PERCENT SIGN "%", PER MILLE SIGN "‰" etc.) and the preceding European numeral or ideographic numeral (see [Fig.83]). The reason is that such character sequences are to be handled as one unit.

    Unbreakable sequences between postfixed abbreviations and the preceding European numeral.
    [Fig.83]: Unbreakable sequences between postfixed abbreviations and the preceding European numeral.

    (note 1)

    Some people think that it is appropriate to have a line break between the PERCENT SIGN "%" and the preceding arabic or ideographic numeral. The reason seems to be that PERCENT SIGN "%" has a high level of independence. Furthermore it is possible to have a line break between "0" and "パ" in cases like "50パーセント" (meaning "50 percent", where "percent" is written in katakana).

  6. Inter-letter space among Western characters (cl-27) in a word (or, sequence of letters, which it is not possible to hyphenate), or unit indicators (km, kg, mm etc.) in Latin letters (see [Fig.84]).

    It is not possible to break a line between letters in unit symbols using Latin letters.
    [Fig.84]: It is not possible to break a line between letters in unit symbols using Latin letters.

    (note 1)

    When using HYPHEN "‐" at the end of a line, it becomes possible to have a line break within Western characters (cl-27).

    (note 2)

    In this document, description of units with proportional Western characters, such as km and kg, are treated as unit symbols (cl-25).

    (note 3)

    In [Fig.84], there is a quarter em space between "4" and "k", because of the convention to insert a quarter em between unit symbols (cl-25) and following European numerals or Western characters (cl-27). It is permitted to break a line between "4" and "k". In this case, there is no quarter em space in either the head or the end of the line. Note that the space between "3" and "4" expresses a rank.

  7. Inter-letter space among ruby letters, when composed as mono-ruby. Note that it is possible to break a line between base characters with mono-ruby (see [Fig.85]).

  8. Inter-letter space among ruby letters or base characters, composed as group-ruby (see [Fig.85]).

    Example of unbreakable sequences of ruby.
    [Fig.85]: Example of unbreakable sequences of ruby.

    (note 1)

    With ruby used for compound words (jukugo-ruby), a group of ruby characters is attached to each base character. It is possible to have a line break between such groups of ruby letters and base characters (see [Fig.86]), however, a line break should not occur between ruby characters related to a given base character.

    Example of a line break for jukugo-ruby.
    [Fig.86]: Example of a line break for jukugo-ruby.
  9. Between a subscript or superscript and an adjacent base character (preceding or following) (see [Fig.87]), or between base characters with ornament characters, or between ornament characters themselves. The reason is that these character sequences are to be handled as one object.

    Unbreakable sequences between a character and its related subscripts.
    [Fig.87]: Unbreakable sequences between a character and its related subscripts.
  10. In order to create a correspondence between notes and the related main text, reference marks (aijirushi) are often added. Line breaks are not allowed before the reference mark or between letters of the reference mark itself (see [Fig.88]). The application of the no-line-break rule here is a matter of style.

    Unbreakable sequences before an aijirushi (reference marks, European numerals or ideographic numerals).
    [Fig.88]: Unbreakable sequences before an aijirushi (reference marks, European numerals or ideographic numerals).

    (note 1)

    In this document, characters in reference marks are treated as characters as reference marks (cl-20).

    (note 2)

    Often there are full stops (cl-06) after reference marks. In these cases, line breaks are not allowed between the reference marks and the full stops. The reason is that full stops should not be set at the head of lines (see [Fig.88]).

  11. After warichu opening brackets (cl-28), which open warichu, or before warichu closing brackets (cl-29), which close warichu.

  12. A unit of furiwake. A unit of furiwake is handled as one object.

3.1.11 Character Sequences which Do Not Allow Space Insertion as Part of Line Adjustment Processing

For line adjustment processing, space must not be added between the following characters. (This is called the inseparable characters rule.) The reason is that these characters or symbols should appear as one unit (for more about line adjustment, see 3.8 Line Adjustment).

(note 1)

The main places where solid setting may be dropped for line adjustment processing are the spaces among hiragana (cl-15), katakana (cl-16), and ideographic characters (cl-19). Furthermore, things such as the amount of space between words in Western text are also a target for space adaptation.

(note 2)

Combinations of character classes which allow spaces to be inserted for line alignment, are described as a complete table in Appendix E Opportunities for Inter-character Space Expansion during Line Adjustment, following 3.9 About Character Classes.

  1. There must be no space between any characters described in 3.1.10 Unbreakable Character Sequences.

    (note 1)

    Some people think that it is only permissible to increase space between letters in Western words for cases where there is no possibility of regular line adjustment processing.

  2. In addition to the cases mentioned above, the inseparable character rule has to be applied to the following cases.

    1. Before or after opening brackets (cl-01) or closing brackets (cl-02).

      (note 1)

      After opening brackets (cl-01) or before closing brackets (cl-02) the inseparable characters rule is always applied. In contrast, before opening brackets or after closing brackets the rule is not applied. Full stops (cl-06) and commas (cl-07) are handled the same as closing brackets.

    2. Before or after full stops (cl-06) or commas (cl-07).

    3. Before or after middle dots (cl-05).

    4. Before or after dividing punctuation marks (cl-04).

    5. Before or after hyphens (cl-03).

    6. Before or after one em, etc. spaces between Japanese characters.

    7. Among base characters with jukugo-ruby.

3.1.12 Examples of Line Adjustment

Methods of line adjustment processing are discussed in 3.8 Line Adjustment. However, since layout processing of punctuation marks is one reason for the need for line adjustment processing, we will here introduce two main examples of cases where line adjustment processing is necessary, and show adjustment examples (see [Fig.89]).

  1. The principal approach in Japanese composition is that with the exception of the last line of a paragraph, the length of all lines is the same, so all lines are aligned. As explained before, the line length is set to be n-times the character size established for the kihon-hanmen. Hence, as long as only full-width characters are used, all lines have the same length (see (1) at [Fig.89]).

  2. In [Fig.89] at (2), there is an IDEOGRAPHIC COMMA "、" followed by a LEFT CORNER BRACKET "「", and the complete space taken by the two characters is one and a half em. That means that the line overshoots or runs short of the edge of the kihon-hanmen by a half em. To restore a uniform line length, line adjustment is applied as shown at (3) in [Fig.89]. The half em space overshoot or shortage is recovered by reducing inter-character space to a quarter em before the LEFT CORNER BRACKET "「" and after the RIGHT CORNER BRACKET "」".

  3. At (4) in [Fig.89], the 15th character is an opening bracket (cl-01). This should not appear at the line end. Ideally, a full width space reduction would be applied, and the character "前" on the second line would be moved onto the first line in the 15th position. In that way, the problem could be avoided. However, in this example a full-width space reduction is not possible, so line adjustment processing is applied as shown at (5) in [Fig.89]. The opening bracket (cl-01) is moved to the second line, and line adjustment by inter-character space expansion is applied. That means that space is inserted in the first line at places where it is allowed.

    Examples of line adjustment.
    [Fig.89]: Examples of line adjustment.

3.2 Japanese and Western Mixed Text Composition (including Horizontal-in-Vertical Text Composition)

3.2.1 Composition of Japanese and Western Mixed Texts

There are a lot of examples of Japanese text in which Western and/or Greek letters are mixed among Japanese letters. Examples are as follows:

  1. Using one Latin letter as a symbol for something, like "A" and "B".

  2. Using a Western word in a Japanese context, like "editor".

  3. Using acronyms of things and organization names, like "DTP" and "GDP".

  4. Writing Western book titles and authors in lists of referred books with original spelling.

Latin letters are also used in itemized lists and numbering of headings, as well as symbols for units, symbols for chemical elements, and mathematical symbols. As can be judged from these examples, mixtures of Latin letters among Japanese letters are in daily use in Japanese composition.

(note 1)

There are some examples which include whole Western paragraphs in Japanese documents. However, there are few ordinary books that include such paragraphs. There are a lot of examples of textbooks for foreign languages, in which one or more Western paragraphs are followed by annotations in Japanese language. Also, there are some examples in treatises and journals, which contain whole Western paragraphs.

(note 2)

In vertical writing mode, symbols for units are usually described with katakana (cl-16), such as センチメートル (centimeter) or センチ(abbreviation of centimeter in katakana, "senchi"). In horizontal writing mode, the International System of Units (SI) is usually used, such as "cm".

(note 3)

The treatment of mixtures of Japanese and Western characters is also described in JIS X 4051 4.7.

3.2.2 Mixed Text Composition in Horizontal Writing Mode

In horizontal writing mode the basic approach is to use proportional Western fonts ([Fig.90]). For European numerals, both half-width fonts and proportional fonts are used. Note that Western word space (cl-26) is a one third em space, in principle, except at line head, line head of warichu, line end and line end of warichu. Western word space (cl-26) at line head, line head of warichu, line end and line end of warichu, is set solid.

Example of proportional Western fonts used in Japanese in horizontal writing mode.
[Fig.90]: Example of proportional Western fonts used in Japanese in horizontal writing mode.
Example of Western full-width fonts used in Japanese in horizontal writing mode. (In horizontal writing mode, Western full-width fonts are usually not recommended.)
[Fig.91]: Example of Western full-width fonts used in Japanese in horizontal writing mode. (In horizontal writing mode, Western full-width fonts are usually not recommended.)

(note 1)

As shown in [Fig.91], there are some examples of Western full-width fonts used in horizontal Japanese typesetting, this usage is not recommended, for cosmetic reasons.

(note 2)

Usually, in horizontal Japanese text, fonts are used with European numeric glyphs that are easy to balance and harmonize with Japanese fonts. Considering line adjustment, the use of fonts with half-width numeric glyphs is recommended. There are some cases of Japanese fonts with half-width glyphs for European digits.

(note 3)

There are two choices for including glyphs of Latin letters and numerals in Japanese and Western mixed text compositions. One way is to use the glyphs for Western characters built into the same Japanese font. The other is to combine an independent Western font for Western characters with a Japanese font for Japanese characters. (Example: [Fig.92] is composed using proportional glyphs for Latin letters and numerals included in Ryumin R-KL. [Fig.93] is composed with Ryumin R-KL for Japanese characters and Times New Roman for Latin letters and numerals.)

Example of Japanese and Western mixed text with the same font Ryumin R-KL for both Japanese characters and proportional Western
     characters.
[Fig.92]: Example of Japanese and Western mixed text with the same font Ryumin R-KL for both Japanese characters and proportional Western characters.
Example of Japanese and Western mixed text with two distinct fonts - Ryumin R-KL for Japanese characters and Times New Roman
     for Western characters.
[Fig.93]: Example of Japanese and Western mixed text with two distinct fonts - Ryumin R-KL for Japanese characters and Times New Roman for Western characters.

3.2.3 Mixed Text Composition in Vertical Writing Mode

As explained in 2.3.2 Major Differences between Vertical Writing Mode and Horizontal Writing Mode, there are three different styles for setting Latin letters and European numerals in vertical writing mode:

  1. Setting Latin letters and/or European numerals one by one in inline direction with Japanese characters (see [Fig.94]). This style is typically adopted when composing a single Latin letter or European numeral between the adjacent Japanese letters. Full-width glyphs are specified for Latin letters and European numerals in this style.

    Example of Latin letters in normal orientation.
    [Fig.94]: Example of Latin letters in normal orientation.
  2. Setting Latin letters and/or European numerals rotated 90 degrees clockwise in vertical text mode ([Fig.95]). This style is usually adopted when Latin letters compose a word or sentence. Proportional fonts are specified for characters in this style, as in horizontal writing mode (or half-width fonts for European numerals).

    Example of Latin letters rotated 90 degrees clockwise.
    [Fig.95]: Example of Latin letters rotated 90 degrees clockwise.
  3. Setting Latin letters and/or European numerals in tate-chu-yoko (horizontal-in-vertical setting, see [Fig.96]). Tate-chu-yoko layout is usually adopted when dealing with a two-digit number in European numerals, or a combination of two or three Latin letters, the length of which is equal to the default size of the line in paragraph direction or longer than that just to an acceptable extent. (A combination of two or three Latin letters may be rotated 90 degrees clockwise rather than set it in tate-chu-yoko layout.) Proportional glyphs (or half-width glyphs for European numerals) are used for characters in tate-chu-yoko layout.

    Example of European numerals in tate-chu-yoko (horizontal-in-vertical setting).
    [Fig.96]: Example of European numerals in tate-chu-yoko (horizontal-in-vertical setting).

    (note 1)

    Acronyms, such as "GNP", and abbreviations like "Web", are usually set one by one, character-wise in normal orientation (see [Fig.97]). However, there are some cases where acronyms and abbreviations are rotated 90 degrees clockwise (see [Fig.98]).

    Example of acronyms set one by one in normal orientation.
    [Fig.97]: Example of acronyms set one by one in normal orientation.
    Example of acronyms rotated 90 degrees clockwise.
    [Fig.98]: Example of acronyms rotated 90 degrees clockwise.

    (note 2)

    The ideographic numerals were traditionally used in vertical writing mode instead of European numerals. (Road numbers and car registration numbers were examples of a couple of exceptions). However, the more newspapers and other publications have been adopting European numerals in vertical writing mode, the more the use of tate-chu-yoko layout for European numerals has also been increasing.

3.2.4 Method for Setting Full-width Latin Letters and European Numerals

When full-width and fixed-width Western characters or European numerals are set in vertical writing mode as "quasi" Japanese characters, inter-character spaces between these characters and hiragana (cl-15), katakana (cl-16) or ideographic characters (cl-19) are set solid, similar to ordinary ideographic characters (cl-19) (see [Fig.99]). Also, in principle, when full-width and fixed-width Western characters or European numerals are set after full stops (cl-06), commas (cl-07) or closing brackets (cl-02), or before opening brackets (cl-01), insert a half em space after commas (cl-07) or closing brackets (cl-02), or before opening brackets (cl-01). In addition, in these cases, insert a half em space after full stops (cl-06). When full-width and fixed-width Western characters or European numerals are set before a full stop (cl-06), comma (cl-07) or closing bracket (cl-02), or after an opening bracket (cl-01), the inter-character space before the full stop, comma or closing bracket, or after the opening bracket is set solid.

Setting example of full-width Latin letters and European numerals.
[Fig.99]: Setting example of full-width Latin letters and European numerals.

(note 1)

In this document, full-width and fixed-space Western characters and European numerals are treated as members of the ideographic characters (cl-19) class. Accordingly, when KATAKANA MIDDLE DOT "・" appears before or after full-width and fixed-space Western characters and Western numerals, in principle, a quarter em space is inserted between KATAKANA MIDDLE DOT "・" and Western characters or Western numerals. However, when KATAKANA MIDDLE DOT "・" is used as a ranking symbol between Western numerals, in principle, it is set solid, like ideographic numerals.

Example of setting KATAKANA MIDDLE DOT as a ranking symbol among full-width, fixed-space European numerals.
[Fig.100]: Example of setting KATAKANA MIDDLE DOT as a ranking symbol among full-width, fixed-space European numerals.

(note 2)

The details of ideographic characters (cl-19), including full-width and fixed-width Western characters and European numerals, are described as a complete table in Appendix B Spacing between Characters, following 3.9 About Character Classes.

3.2.5 Handling of Tate-chu-yoko (Horizontal-in-Vertical Settings)

To set strings as tate-chu-yoko (horizontal-in-vertical setting), first set from left to right using solid setting, then align the whole string to the center of the vertical line ([Fig.101]). When hiragana (cl-15), katakana (cl-16) or ideographic characters (cl-19) are set before/after tate-chu-yoko, the inter-character space is set solid. In principle, when tate-chu-yoko is set after a comma (cl-07) or closing bracket (cl-02), or before an opening bracket (cl-01), a half em space is inserted. In addition, when tate-chu-yoko is set after a full stop (cl-06) in the middle of a line, a half em space is inserted. When a full stop (cl-06) is set at the end of a line, a half em space is inserted after it, in principle. When tate-chu-yoko is set before full stops, commas or closing brackets, or after opening brackets, the inter-character space is set solid.

Example of setting tate-chu-yoko (horizontal-in-vertical text setting).
[Fig.101]: Example of setting tate-chu-yoko (horizontal-in-vertical text setting).

(note 1)

The details of handling of inter-character spaces between characters in tate-chu-yoko (cl-30) and adjacent other character classes are described as a complete table in Appendix B Spacing between Characters, following 3.9 About Character Classes.

3.2.6 Handling of Western Text in Japanese Text using Proportional Western Fonts

Composition rules for Western characters, Western text and European numerals, set rotated 90 degrees clockwise in vertical writing mode, and horizontal writing mode, are as follows:

  1. A sequence of Western characters in a Western word should not be broken across a line-break, except where hyphenation is allowed.

  2. When line adjustment is done with line adjustment by inter-character space reduction, Western word space (cl-26) is used as first priority. Also, when line adjustment is done with line adjustment by inter-character space expansion, Western word spaces are used as first priority.

  3. When line adjustment by inter-character space addition is used, inter-character spaces within Western words and European numerals are not used for expansion.

  4. Inter-character space, between hiragana (cl-15), katakana (cl-16) or ideographic characters (cl-19) and Western characters or European numerals, is a quarter em space (see [Fig.102]). The issue as to whether the quarter em space can be used for line end adjustment or not is discussed in 3.8.2 Reduction and Addition of Inter-Character Space and 3.8.4 Procedures for Inter-Character Space Expansion.

    Example of a quarter em inter-character space between hiragana, katakana and ideographic characters, and Latin characters.
    [Fig.102]: Example of a quarter em inter-character space between hiragana, katakana and ideographic characters, and Latin characters.
  5. In the following cases, a quarter em space is not inserted (see [Fig.103]).

    1. At the start of a line, there is no space before Latin characters or European numerals. At the end of the line, there is no space after Latin characters or European numerals.

    2. In the case where Latin characters and European numerals follow a comma (cl-07) or closing bracket (cl-02), or are followed by opening brackets (cl-01), in principle, a half em space is inserted. In the case where Latin characters and European numerals follow a full stop (cl-06) in the middle of a line, a half em space is inserted. When the full stop (cl-06) is set at the end of a line, in principle, a half em space is inserted after the full stop (cl-06).

    3. In the case where Latin characters and European numerals are set before a full stop (cl-06), comma (cl-07) or closing bracket (cl-02), or after an opening bracket (cl-01), the inter-character space is set solid.

Example of no inter-character space before and after Latin characters and European numerals.
[Fig.103]: Example of no inter-character space before and after Latin characters and European numerals.

(note 1)

In this document, proportional Western characters and European numerals are treated as members of the Western characters (cl-27) class. Note that half- and fixed-width European numerals, when mixed with Japanese text, are treated as members of the grouped numerals (cl-24) class.

(note 2)

The reason a quarter em space is needed between Western characters or European numerals and hiragana (cl-15), katakana (cl-16) or ideographic characters (cl-19), is that the design concept of Latin fonts and Japanese fonts are different from each other, so it looks too tight without the spaces ([Fig.104]).

Example of solid setting between, katakana and ideographic characters and Latin characters and European numerals. (This method is
     not recommended).
[Fig.104]: Example of solid setting between, katakana and ideographic characters and Latin characters and European numerals. (This method is not recommended).

3.3 Ruby and Emphasis Dots

3.3.1 Usage of Ruby

Ruby is a small-sized, supplementary text attached to a character or a group of characters in the main text. A run of ruby text, usually attached to the right of the characters in vertical writing mode or immediately above them in horizontal writing mode, indicates the reading or the meaning of those characters (see [Fig.105]). The characters in the main text that are annotated by ruby are called "base characters". Kana characters are often used for ruby to indicate how to read kanji characters; this is known as ruby annotation or as "furigana".

Ruby and base characters.
[Fig.105]: Ruby and base characters.

(note 1)

Societal change in the use of kanji when composing Japanese and changes in the use of ruby implementations in text layout methods have been affecting the usage of ruby itself. Ruby was not very popular while the Touyou Kanji Table (当用漢字表) (the list of kanji characters for everyday use, issued by the Government on November 16, 1946) was in effect because, in principle, it discouraged the use of furigana in its 'directions for use' section. The Jouyou Kanji Table (常用漢字表) (the list of kanji characters in common use, issued by the Government to replace the Touyou Kanji Table on October 10, 1981) has changed the scope of the list. (The Touyou Kanji Table defined the list of all kanji characters that can be used for any documents, while the Jouyou Kanji Table was designed to be a 'reference guide' for the use of kanji to compose contemporary Japanese for general use in society.) For example, the preface of the Jouyou Kanji Table states that one may consider the use of 'furigana (annotations to indicate pronunciation)' when the text appears too difficult to read. This is why the use of ruby has been increasing today. The use of ruby is common in magazines and books, and even newspapers have begun to use ruby these days.

(note 2)

The specification of ruby composition was first introduced in the second revision of JIS X 4051, which was further revised in 2004 to add jukugo-ruby, katatsuki-ruby and the dual ruby composition with which two distinct runs of ruby are attached to the same base characters (4.12 Ruby Composition).

In terms of annotation, ruby has several purposes as described below. Depending on the purpose and/or the function of the ruby, there are different ruby composition methods (the details of the composition methods will be provided in the later sections).

  1. PURPOSE: Ruby annotation with kana (usually hiragana) to provide readings of kanji characters. There are two types of ruby for this purpose depending on the type of base character:

    1. Add one or more hiragana (cl-15) or katakana (cl-16) ruby character to indicate the reading (Japanese onyomi or kunyomi) for each base ideographic character (cl-19) (see [Fig.106]). This method, attaching one or several hiragana or katakana characters for each base ideographic character, is called mono-ruby.

      Example of ruby annotation  per Kanji Character.
      [Fig.106]: Example of ruby annotation per Kanji Character.
    2. In the Japanese writing system, compound words (jukugo) occasionally appear, usually constructed with a couple of kanji characters. There are two different methods of attaching ruby letters to base kanji characters for these compound words.

      1. Mono-ruby. Ruby letters are attached to each base kanji character, similarly to the previous section (see [Fig.107]).

        Example of mono-ruby method. Ruby letters are attached to each base kanji character in a compound word.
        [Fig.107]: Example of mono-ruby method. Ruby letters are attached to each base kanji character in a compound word.

        (note 1)

        In [Fig.107], there is a quarter em space between the base characters "凝" and "視". So when this line happens to appear in the middle of a paragraph, there needs to be some line adjustment processing.

      2. Jukugo-ruby. Pronunciation is indicated for each base character, but the positioning takes into account the fact that together they make up a compound word (see [Fig.108]). The intention when using jukugo-ruby is to handle the kanji phrase as one object.

        Example of jukugo-ruby method. Ruby letters are attached to groups of kanji characters in compound words.
        [Fig.108]: Example of jukugo-ruby method. Ruby letters are attached to groups of kanji characters in compound words.

        (note 1)

        There is no difference between [Fig.107] of mono-ruby and [Fig.108] of jukugo-ruby, when the number of ruby letters for each base kanji character is one or two. When the mono-ruby approach is adopted, the inter-character space between base kanji characters can be expanded for line adjustment (in [Fig.107], the inter-character space between "鬼" and "門", or, "方" and "角" can be expanded). When the jukugo-ruby approach is adopted, line adjustment cannot be applied to the base kanji characters.

        (note 2)

        The following examples show the relationship between ruby letters and base kanji characters.

        Example of mono-ruby:

        "凝+(ぎよう)" "視+(し)"

        Example of jukugo-ruby 1:

        "凝+(ぎよう) 視+(し)"

        Example of jukugo-ruby 2:

        "(凝視)+(ぎよう/し)"

        (note 3)

        Books commonly adopt kana-based jukugo-ruby for kanji compound words. However, due to technical difficulties for rendering jukugo-ruby in machine-assisted text layout, the adoption of kana-based mono-ruby is increasing. For example, newspapers do not use jukugo-ruby, and study aids generally use mono-ruby because it is considered more important to show the readings of each kanji character for students than to be concerned about the beauty of the layout.

        (note 4)

        Multiple kanji compound words can form one compound phrase. In this case, there are two ways to attach ruby, i.e. attaching ruby to the compound phrase as a whole, or to each word which forms the compound (see [Fig.109]). Similarly, a Japanese personal name consists of a given name and a family name, which together form a compound of a full name, and it is an editorial decision whether to attach two runs of ruby, one each for given name and family name, or to attach the full ruby text to the compound which represents the reading of the full name.

        Examples of ruby attachment for a compound phrase.
        [Fig.109]: Examples of ruby attachment for a compound phrase.

        (note 5)

        In most cases, the reading of a kanji compound word is just a concatenation of the readings of each individual kanji character, but some phrases have their own native readings (known as jukuji readings), which cannot be derived from the readings of each character. Attaching ruby to those phrases, which are usually sequences of two or three kanji characters, is essentially the same as attaching ruby in katakana to kanji and/or kana base characters (see [Fig.110]).

        Examples of ruby for jukuji readings.
        [Fig.110]: Examples of ruby for jukuji readings.
  2. PURPOSE: Ruby annotation that annotates a kanji or kana word with kana to provide its meaning, together with it's reading. In terms of ruby layout, attaching ruby text to a single character in kanji or kana is essentially the same as attaching the reading to a kanji character (e.g. attaching ruby text "バザール", 'Bazaar', in kana to a kanji character "市" is just like attaching the reading "いち" to that character). When attaching ruby text to a run of base text consisting of two or more characters in kanji and/or kana, the ruby text needs to be positioned as if it corresponds to the annotated text itself, no matter how the ruby characters are distributed across each base character. The most typical example of this is attaching ruby text to a kanji compound word to indicate a corresponding loan word in katakana (see [Fig.110]). The use of ruby text of this kind is on the increase in proportion to the growing need for translations and loan words. This type of ruby, namely ruby letters that are attached to two or more base characters as one object, is called group-ruby. Group-ruby and it's base characters are unbreakable, because of their behavior as one object (it is possible to break a line in the middle of the base characters where jukugo-ruby is in use).

    Examples of ruby for compound kanji words to indicate corresponding words in katakana.
    [Fig.111]: Examples of ruby for compound kanji words to indicate corresponding words in katakana.

    (note 1)

    Having said that, the layout of ruby text to a single kanji character is not entirely the same depending on the use of ruby, and may differ according to whether it is for indicating the reading or the meaning. In fact, the katatsuki, one of ruby layout per character which will be described in the later section, does allow the nakatsuki layout for those ruby which indicate meaning.

  3. PURPOSE: Ruby annotation, usually with katakana characters, to indicate the reading or the meaning of a Western word used in base text (see [Fig.112]). There are opposite cases where a synonymous Western word in Latin characters is attached as a ruby annotation to a Japanese word in kana or kanji (see [Fig.112]). These cases are less used than a and b, however they are quite common in study guides, translated books and travel guides.

    Examples of Latin characters used either in  base text or ruby text for Western words.
    [Fig.112]: Examples of Latin characters used either in base text or ruby text for Western words.
  4. PURPOSE: Ruby annotation using kanji for a base text word in hiragana. This is called furikanji, and is very rarely found.

Hereafter, mainly the usage of (a) and (b) will be explained.

(note 1)

There is no difference in the positioning of ruby text whether attaching ruby characters in kana to a Western word in Latin characters or attaching ruby in Latin to a Japanese word in kanji or kana. The ruby text in either case should be positioned relative to a run of base text to be annotated as in b). However, there is a difference in cases where the lengths of the base text and the ruby text are different. When the length of a ruby text in kana or kanji is shorter than the corresponding base text, the ruby text is, in general, stretched by adding inter-character spaces between ruby characters, and when the length of a ruby text in kana or kanji is longer than the corresponding base text, the base text is, in general, stretched by adding inter-character spaces between base characters, so that the ruby and base text look similar in length (see [Fig.107]). On the other hand, no inter-character space will be added to any ruby or base text in Latin characters no matter how different the ruby and base text look in length (see [Fig.112]). Details will be explained later.

3.3.2 Choice of Base Characters to be Annotated by Ruby

There are several methods of choosing how to attach ruby annotations to which base characters.

  1. 'General-ruby' is the method of attaching ruby annotations to all base characters in kanji.

  2. 'Para-ruby' is the method of attaching ruby annotations to only those base characters in kanji for which readings are difficult.

    (note 1)

    The 'para-ruby' method is further divided into the method of attaching ruby annotations regardless of multiple occurrences of the same base kanji character, and the method of attaching ruby annotations to only the first instance of the same base kanji character. Furthermore, the latter method has several variations in counting the first instance for the entire book, each chapter, or each spread, etc.

Note that ruby should be attached to all kanji characters in a compound word, to reflect the unitary nature of the text. To attach ruby to only some of the kanji characters in a compound word is not recommended (see [Fig.113]).

Examples of ruby on kanji characters in a compound word. (Left side, recommended. Right side, not recommended.)
[Fig.113]: Examples of ruby on kanji characters in a compound word. (Left side, recommended. Right side, not recommended.)

3.3.3 Choice of Size for Ruby Characters

The character size of ruby characters is, in principle, the half size of the base characters (see [Fig.114]).

Examples of ruby with half the size of the base characters.
[Fig.114]: Examples of ruby with half the size of the base characters.

The 'one-third-ruby' characters are used on rare occasions to attach three ruby characters to one full-width kanji character. One-third-ruby for vertical layout has the dimension of the half of the base character in width and the one third in height. Those for horizontal writing mode have the dimension of half of the base characters in height and one third in width (see [Fig.115]).

Examples of one third ruby.
[Fig.115]: Examples of one third ruby.

When ruby is attached to twelve point or larger base characters (usually used for headings), the size of the ruby letter is generally smaller than half the size of the base characters, considering the proportion of the sizes of base characters and ruby. When all is said and done, these cases are very rare.

Examples of ruby at a size smaller than half the size of the base characters.
[Fig.116]: Examples of ruby at a size smaller than half the size of the base characters.

(note 1)

There are cases where the body size is twelve points in books designed for infants or aged people. In these cases, the size of ruby can be six points, exactly the half size of the base characters.

(note 2)

When the size of base characters is very small (for e.g. smaller than seven points), ruby which is half the size, will be even more small and illegible. In such cases where the size of base characters is very small, ruby is not a suitable method of annotation. In those cases, consider other annotation methods such as adding the reading in parenthesis immediately after the base character.

(note 3)

Slightly different from the question of the size of ruby, there is a question of whether or not small kana (cl-11) characters should be used in ruby annotations. Because the size is so small, there was no choice of using small kana in letterpress printing. Today there are cases where small kana are used in ruby annotations, but taking legibility into consideration, these cases should be limited to those where accurate readings are needed, such as for proper nouns.

3.3.4 Choice of Sides for Ruby with Respect to Base Characters

In principle, ruby is attached to the right of base characters in vertical writing mode, and above in horizontal writing mode.

In some special cases, ruby can be seen to the left of base characters in vertical writing mode, and below in horizontal writing mode, but this is very rare.

There are cases where two kinds of ruby are attached, one to either side of the base characters, one for readings and the other for meanings (see [Fig.117]). This is also very rare.

An example of ruby attached to both sides of the base characters.
[Fig.117]: An example of ruby attached to both sides of the base characters.

In the following sections, the ruby composition methods will be explained on the assumption that the size of ruby is half the size of the base characters, and they will be attached to the right in vertical writing mode and above in horizontal writing mode. First we look at the basic composition rules of mono-ruby, group-ruby and jukugo-ruby, then the rules of positioning of ruby with respect to those characters which come before and after the base characters, and finally the composition rules at the line head and at the line end.

3.3.5 Positioning of Mono-ruby with Respect to Base Characters

When mono-ruby characters are Japanese, they are set solid. If mono-ruby characters have their own character widths such as Western characters or European numerals, they are set according to their own widths and then the ruby text is placed so that its center matches that of its base character. There are more variations depending on the combination of the base character and ruby text and accordingly various composition rules have been invented, which will be explained with examples.

When attaching two kana ruby characters to a single base character, the lengths of the ruby text and the base text are the same and they are positioned as shown in [Fig.118].

An example of composition with two ruby characters.
[Fig.118]: An example of composition with two ruby characters.

When attaching a single ruby character to a single base character, there are two ways of positioning the ruby character.

  1. In vertical writing mode, attach a ruby character so that its vertical center matches that of the base character (see [Fig.119]). In horizontal writing mode, attach a ruby character so that its horizontal center matches that of the base character (see [Fig.119]). This positioning of a ruby character is called 'nakatsuki' (center-alignment).

  2. In vertical writing mode, attach a ruby character so that the top of its virtual body is aligned with the top of that of the base character (see [Fig.119]). This positioning of a ruby character is called 'katatsuki' (top-alignment). For horizontal writing mode, 'katatsuki' should not be adopted. If a ruby character is attached so that the left-edge of its virtual body is aligned with the left-edge of that of the base character, it would result in the loss of the center of balance, which doesn't look good (see [Fig.120]).

Examples of nakatsuki and katatsuki alignment.
[Fig.119]: Examples of nakatsuki and katatsuki alignment.
Example of katatsuki alignment in horizontal layout (this is intentionally wrong and should not be applied).
[Fig.120]: Example of katatsuki alignment in horizontal layout (this is intentionally wrong and should not be applied).

(note 1)

Katatsuki alignment was commonly used in the letterpress printing era, but today the adoption of nakatsuki alignment is increasing even in vertical writing mode. However, there is still an opinion favoring katatsuki alignment, because it is familiar and readable.

When attaching more than three hiragana ruby characters to a single base character, the ruby characters are set solid. In this case, where the length of a ruby text is longer than that of its base character, positioning of the ruby text depends on which alignment has been adopted for a single ruby character. There is another issue: how to maintain the spatial balance of the ruby characters hanging over those characters which are not related base characters. The adjustment of inter-character spacing for those characters which come before and after the base character will be explained in a later section.

  1. When nakatsuki alignment is adopted for a single ruby character, position a ruby text so that its vertical center is aligned with that of its base character in vertical writing mode (see [Fig.121]). In horizontal writing mode, position a ruby text so that its horizontal center is aligned with that of its base character (see [Fig.121]).

    Example 1 of positioning of ruby text with more than three characters.
    [Fig.121]: Example 1 of positioning of ruby text with more than three characters.
  2. When katatsuki alignment is adopted for a single ruby character, there are two methods, as follows.

    1. Position the ruby text so that its vertical center is aligned with that of its base character (see [Fig.121]).

    2. Depending on the type of script of the adjacent characters to the base character, and the number of ruby characters, a decision is made about whether ruby hangover is allowed on the character before its base character, or on the character after, or on both adjacent characters. At break-even situation, the hangover is usually on the character after its base character (see [Fig.122]).

Example 2 of positioning of ruby text with more than three characters (vertical writing mode).
[Fig.122]: Example 2 of positioning of ruby text with more than three characters (vertical writing mode).

(note 1)

The terms katatsuki and nakatsuki were originally used for ruby alignment only when a single ruby character is attached to a single base character. However there are cases where the meaning is stretched so that the terms can be used when more than three ruby characters are involved. We use the terms katatsuki and nakatsuki with their original meaning throughout this document.

(note 2)

The preference for the choice of hanging the ruby over the character after its base character is due to the preference to avoid adjustment of inter-character spacing between base characters and their adjacent characters, which was common in letterpress printing.

For mono-ruby, base characters and adjacent ruby characters are handled as one object, and internal line-breaks are prohibited.

3.3.6 Positioning of Group-ruby with Respect to Base Characters

When the length of a sequence of base characters (number of characters * advance-width of each character) and that of the ruby text are the same, each text is set solid and the center of both texts are aligned with each other (see [Fig.123]).

Examples of group-ruby where the length is the same as that of the base text.
[Fig.123]: Examples of group-ruby where the length is the same as that of the base text.

When the length of the ruby text is shorter than that of its base characters, set the base text solid and stretch the ruby text by adding a certain amount of inter-character space between each adjacent ruby character, so that both texts balance each other. To be more specific, where 2 units of inter-character space are used between ruby characters, add 1 unit of space between the start of the base text and the start of the ruby text, and between the end of the ruby text and the end of the base text. This will give a balanced appearance, and is the method specified in JIS X 4051 (see [Fig.124]). Another way is to first align the leading characters for both the base text and ruby text and the ends of both trailing characters, and then add the same amount of inter-character space between the rest of the ruby characters (see [Fig.125]).

Example 1 of distribution of group-ruby alongside base characters where the length of the ruby is shorter than that of the base characters.
[Fig.124]: Example 1 of distribution of group-ruby alongside base characters where the length of the ruby is shorter than that of the base characters.
Example 2 of distribution of group-ruby alongside base characters where the length of the ruby is shorter than that of the base characters.
[Fig.125]: Example 2 of distribution of group-ruby alongside base characters where the length of the ruby is shorter than that of the base characters.

(note 1)

In letterpress printing, there were not many choices for adjustment of inter-character space between ruby characters. Therefore, depending on the number of characters in the base text and its ruby text, the choice was whether to add a certain amount of space before the leading ruby character and after the trailing character, or not. In the former case it had been said that for 2 units of inter-character space between each adjacent ruby characters, adding 1 unit of the leading and trailing space would give a balanced appearance.

(note 2)

When the length of the ruby text is far shorter than that of the base text, the method specified in JIS X 4051 could result in space twice the size of a ruby character for the leading and the trailing space, which might give a misleading appearance. Therefore, a criterion for deciding whether or not to adopt the method of JIS X 4051 is to see if the amount of the leading and the trailing space exceeds the full-width size (or up to 1.5 times the size) of a ruby character (see [Fig.126]).

Examples of distribution of group-ruby where the length is much shorter than that of the base text.
[Fig.126]: Examples of distribution of group-ruby where the length is much shorter than that of the base text.

When the length of the ruby text is longer than that of the base characters, balance the base characters with the ruby text by setting the ruby text solid and adding a certain amount of inter-character space between each adjacent base character. To be more specific, for 2 units of inter-character space, add 1 unit of space between the start of the ruby text and the start of the base text, and between the end of the base text and the end of the ruby text, as specified in JIS X 4051 (see [Fig.127]). Another way is to first align the start of both the leading characters and the end of the trailing characters, and then add a certain amount of inter-character space between each adjacent base character (see [Fig.128]).

Example 1 of distribution of group-ruby where the length is longer than that of the base characters.
[Fig.127]: Example 1 of distribution of group-ruby where the length is longer than that of the base characters.
Example 2 of distribution of group-ruby where the length is longer than that of the base characters.
[Fig.128]: Example 2 of distribution of group-ruby where the length is longer than that of the base characters.

For group-ruby, base characters and adjacent ruby characters are handled as one object, and internal line-breaks are prohibited. Also, for an object constructed with base characters and adjacent ruby characters it is prohibited to insert additional spaces between each character for line adjustment.

3.3.7 Positioning of Jukugo-ruby with Respect to Base Characters

If the number of ruby characters are two or less for each kanji character which participates in a kanji compound word (or jukugo), then for each run of ruby text associated with each base character, compose ruby characters as described in 3.3.5 Positioning of Mono-ruby with Respect to Base Characters (see [Fig.129]).

Example 1 of distribution of jukugo-ruby.
[Fig.129]: Example 1 of distribution of jukugo-ruby.

If there is any kanji character in a given kanji compound word which needs more than three ruby characters, the jukugo-ruby layout cannot be used. In this case, attach the ruby text to the kanji compound word as a whole. The available methods include the layout as specified in JIS X 4051, which is similar to the group-ruby method described in 3.3.6 Positioning of Group-ruby with Respect to Base Characters (see [Fig.130]), and layout decided by the phonetic structure of the kanji compound word and the type of script of the adjacent characters (see [Fig.131]). The latter method can be used unless a run of ruby text for the base character hangs over another base character more than a full character width (or one and a half times the full-width) of a ruby character.

Example 2 distribution of jukugo-ruby.
[Fig.130]: Example 2 distribution of jukugo-ruby.
Example 3 distribution of jukugo-ruby.
[Fig.131]: Example 3 distribution of jukugo-ruby.

(note 1)

There are often cases where the jukugo-ruby consists of one ruby character followed by three ruby characters, and vice versa, for a kanji compound word of two base characters. If the mono-ruby layout were chosen for these cases, it would look like as shown in [Fig.132], which wouldn't be very beautiful.

Example of distribution as mono-ruby for jukugo.
[Fig.132]: Example of distribution as mono-ruby for jukugo.

Jukugo-ruby can be split into two lines at the boundary of each unit of ruby text attached to one kanji character. When a kanji compound word consists of two characters, each unit will be processed using the mono-ruby method. When dividing a compound word that consists of three kanji, use the mono-ruby method for the first kanji character and use the jukugo-ruby method for the remaining two kanji characters, and vice versa. In order to maintain the correspondence of each kanji to its ruby annotation, the layout of the ruby may be different after the division (see [Fig.133]). Note that jukugo-ruby and its base characters cannot be the subject of inter-character space expansion for line adjustment.

Examples of distribution of jukugo-ruby split across two lines.
[Fig.133]: Examples of distribution of jukugo-ruby split across two lines.

3.3.8 Adjustments of Ruby with Length Longer than that of the Base Characters

When the length of any ruby text is shorter than that of the base characters, the main text can be just set solid because there is no need for any adjustment of the inter-character spacing between base characters and their adjacent characters in the main text.

Set solid when the length of ruby text is shorter than that of base characters.
[Fig.134]: Set solid when the length of ruby text is shorter than that of base characters.

When the length of the ruby text is longer than that of the base characters, the method of composing the main text depends on how much the ruby text hangs over the kanji or kana characters, or punctuation marks, which are adjacent to the principal characters. The following are the general rules (see [Fig.135] and [Fig.136]). They were established especially in order to avoid misreading and to maintain the beauty of the layout.

  1. Ruby text shall not hang over the ideographic characters (cl-19) adjacent to the base characters.

  2. When the adjacent character is a hiragana (cl-15), katakana (cl-16), prolonged sound mark (cl-10) or small kana (cl-11), the ruby text may overhang the character up to the full-width size of the ruby characters.

  3. The ruby letter may go over the base characters and overhang the half em spaces which are inserted after closing brackets (cl-02), full stops (cl-06) or commas (cl-07), set before the target ruby object, up to the full-width size of a ruby letter. Also, the ruby letter may go over the base characters and hang over the half em spaces which are inserted before opening brackets (cl-01), set after the target ruby object, up to the full -width size of a ruby letter. Note that when the spaces are reduced for line adjustment, the room for ruby letter overhang is also compressed. (For example, if the space is a quarter em in the base character size, the ruby letter can overhang by a half em in ruby letter size.)

  4. When the adjacent character is an inseparable character (cl-08), the ruby text may overhang the character up to the full-width size of a ruby character.

  5. When the adjacent character is one of the middle dots (cl-05), the ruby text may overhang the middle dots, in principle, up to the full-width size of a ruby character. But if there is any reduction of space before and after the middle dots as a result of the line adjustment, the amount of the extension shall be up to the amount of space after the middle dots plus 1/2 a ruby character size when the middle dots are set before the ruby object, or the space before the middle dots plus 1/2 a ruby character size when the middle dots are set after the ruby object.

  6. When the adjacent character is one of the closing brackets (cl-02), the ruby text may go over the principal characters up to the full-width size of a ruby character. Note that the overhang must not go beyond the closing bracket itself.

  7. When the adjacent character is a comma (cl-07) or full stop (cl-06), the ruby text may go over the base characters and overhang the comma or full stop up to the full-width size of a ruby character. Note that the overhang must not go beyond the comma or the period itself.

  8. Also, when the adjacent character is one of the opening brackets (cl-01) before the ruby object, the ruby text may go over the base characters and hang over the opening brackets up to the full-width size of a ruby character. Note that the overhang must not go beyond the opening brackets..

Example 1 of distribution of ruby characters overhanging adjacent characters.
[Fig.135]: Example 1 of distribution of ruby characters overhanging adjacent characters.
Example 2 of distribution of ruby characters overhanging adjacent characters.
[Fig.136]: Example 2 of distribution of ruby characters overhanging adjacent characters.

(note 1)

There is an opinion that it is not good style to hang ruby characters over opening brackets (cl-01), especially LEFT CORNER BRACKET "「" and RIGHT CORNER BRACKET "」". When this opinion holds, there are two possibilities. One is not allow ruby characters to overhang the opening brackets (cl-01). The other is allow ruby characters to overhang the opening brackets up to 1/2 the size of a ruby character.

(note 2)

JIS X 4051 classifies katakana characters and kanji characters in same class. Accordingly, in JIS X 4051 it is prohibited for ruby characters to overhang adjacent katakana characters.

(note 3)

There is another variation that allows ruby text to overhang any ideographic characters (cl-19), hiragana (cl-15) or katakana (cl-16) up to the full-width size of a ruby character (see [Fig.137]).

Example 3 of distribution of ruby characters overhanging adjacent characters.
[Fig.137]: Example 3 of distribution of ruby characters overhanging adjacent characters.

(note 4)

There is a further variation that does not allow ruby text to overhang any ideographic (cl-19), hiragana (cl-15) or katakana (cl-16) characters (see [Fig.138]).

Example 4 of distribution of ruby characters overhanging adjacent characters.
[Fig.138]: Example 4 of distribution of ruby characters overhanging adjacent characters.

When the line head starts with ruby annotated text where the ruby text length is shorter than that of the base characters, compose the text so that the first base character is aligned with the line head. Similarly, when ruby annotated text ends at the line end and the ruby length is shorter than that of the basic characters, compose the text so that the last basic character is aligned with the line end.

When the line head starts with ruby annotated text where the ruby text length is longer than that of the base characters, compose the text so that the first ruby character which overhangs the base text is aligned with the line head, and vice versa (see [Fig.139]). Alternatively, there is a variation by which the text is composed so that the first base character is aligned with the line head, and vice versa (see [Fig.140]).

Example 1 of positioning of ruby characters at the line head and at the line end.
[Fig.139]: Example 1 of positioning of ruby characters at the line head and at the line end.
Example 2 of positioning of ruby characters at the line head and at the line end.
[Fig.140]: Example 2 of positioning of ruby characters at the line head and at the line end.

When aligning the first base character to the line head and the last base character to the line end, ruby text is not allowed to extend beyond the hanmen or the area of the column. If it does, the following adjustments should be considered in positioning base characters and ruby characters.

  1. Mono-ruby at the line head: Make adjustments so that the top of the ruby text is aligned with that of the base characters (see [Fig.140]).

  2. Mono-ruby at the line end: Make adjustments so that the bottom of the ruby text is aligned with that of the last base character (see [Fig.140]).

  3. Group-ruby at the line head: Make adjustments so that the top of the ruby text is aligned with that of the first base character, and add the same amount of inter-character spacing between the base characters and between the end of the last base character and the end of the last ruby character after the last base character (the method specified in JIS X 4051) (see [Fig.141]).

    Example 3 of positioning of ruby characters at the line head and at the line end.
    [Fig.141]: Example 3 of positioning of ruby characters at the line head and at the line end.
  4. Group-ruby at the line end: Make adjustments so that the end of the ruby text is aligned with that of the last base character and add the same amount of inter-character space between the base characters and the space between the start of the base text and the start of the ruby text (the method specified in JIS X 4051) (see [Fig.141]).

  5. Jukugo-ruby at the line head or at the line end: Make the same adjustments as described in (c) or (d) for the group-ruby.

  6. Jukugo-ruby at the line head: Make adjustments so that the top of the ruby text is aligned with that of the first base character. A run of ruby characters for a base character may overhang the adjacent base characters of the same kanji compound word, up to the full-width size (or one and a half of it) of a ruby character. If the extension should go beyond the limit, just force the ruby text out of the base characters, or make a further adjustment by adding inter-character space between the base characters.

  7. Jukugo-ruby at the line end: Make adjustments so that the end of the ruby text is aligned with that of the last base character. A run of ruby characters for a base character may overhang the adjacent base characters of the same kanji compound word, up to the full-width size (or one and a half of it) of the ruby characters. If the extension should go beyond the limit, just force the ruby text out of the base characters, or make a further adjustment by adding inter-character space between the base characters.

  8. Jukugo-ruby split across two lines: jukugo-ruby can be split across two lines, with one part at the line end and the other at the line head. In the case of a compound word with two kanji characters, it is as the same as dealing with one kanji character with a mono-ruby text at the line end and the other kanji character with another mono-ruby text at the next line head. In the case of a phrase with three kanji characters, handle one kanji character with mono-ruby text and the remaining two kanji characters with jukugo-ruby, and vice versa. The layout of one kanji character with mono-ruby text will be composed by method (a) or (b) described above. The layout of two kanji characters with jukugo-ruby text will be composed by method (f) or (g) above.

(note 1)

The composition of jukugo-ruby changes in accordance with the construction of the jukugo, the position at the head, middle or bottom of the line, and adjacent characters before or after. The detail of this issue is too complicated to discuss here, so it is discussed in Appendix F Positioning of Jukugo-ruby.

(note 2)

The complexes of base characters with ruby characters are classified as simple-ruby character complex (cl-22) and jukugo-ruby character complex (cl-23). The handling and positioning of these complexes with adjacent characters is discussed in Appendix F Positioning of Jukugo-ruby as a complete table, following 3.9 About Character Classes.

3.3.9 Composition of Emphasis Dots

Emphasis dots (also known as bouten or side dots) are symbols placed alongside a run of kanji or kana characters to emphasize the text.

(note 1)

There are many ways to emphasize a run of text in Japanese composition. Besides attaching emphasis dots, one may emphasize a certain run of text by selecting a different typeface (for example, Mincho face for normal text and Japanese gothic face for emphasis), a different color (for example, turning to red for emphasis), by enclosing the text within brackets (for example, LEFT CORNER BRACKET "「" and RIGHT CORNER BRACKET "」" or LEFT ANGLE BRACKET "〈" and RIGHT ANGLE BRACKET "〉"), by drawing a line alongside the text (or bousen, underlining the text), and so on. It is an editorial decision which emphasis method to adopt. Selecting a different typeface and enclosing text within brackets are generally used and popular emphasis methods. Attaching emphasis dots is not very common but one of the long-established and traditional methods typically used in kanbun composition (classic Chinese text).

(note 2)

In practice, emphasis dots are not used for commas (cl-07), full stops (cl-06), opening brackets (cl-01) or closing brackets (cl-02).

Composition of emphasis dots is as follows (see [Fig.142]).

  1. The character size of emphasis dots is the half size of the base characters to be emphasized.

  2. Emphasis dots are attached to the right of the base characters in vertical writing mode, or above them in horizontal writing mode. The center of emphasis dots is aligned with that of the base characters.

  3. There are many symbols that could be specified for use as emphasis dots. SESAME DOT "﹅" in vertical writing mode and BULLET "•" in horizontal writing mode are those used for emphasis dots in general.

Composition of emphasis dots.
[Fig.142]: Composition of emphasis dots.

3.4 Inline Cutting Note (Warichu)

3.4.1 Where the Inline Cutting Note (Warichu) is used

Warichu (inline cutting note) is a type of inline notation, where two lines of small characters are inserted into the text. Warichu divides a line into two sub lines. The frequency of use of the inline cutting note is not so high. However, the inline cutting note is very important for study guides, travel guides, reference books, encyclopedias and manuals, because it is very effective for inserting notes at the point in the text where they are needed (see [Fig.143]). Inline cutting note is usually used in vertical writing mode. It is very infrequently used in horizontal writing mode.

Warichu (inline cutting note).
[Fig.143]: Warichu (inline cutting note).

(note 1)

JIS X 4051 specifies inline cutting note in 4.16 "Handling of inline cutting note".

3.4.2 Character Size for Inline Cutting Notes and Line Gaps

Character size for an inline cutting note depends on the character size established for the kihon-hanmen. Usually, around six point size is used (see [Fig.143]).

The space between adjacent lines in an inline cutting note is zero, that is to say, there is no line gap between them (see [Fig.144]).

As shown in [Fig.144], an inline cutting note usually has two lines, and is surrounded by LEFT PARENTHESIS "(" and RIGHT PARENTHESIS ")" characters that are double the size of the characters in the inline cutting note itself. There is no space between the surrounding text and parentheses for the inline cutting note.

(note 1)

There is a style of inline cutting note, which has no opening brackets (cl-01) or closing brackets (cl-02), but is surrounded by spaces that have been added explicitly to act as delimiters.

(note 2)

Handling of inter-character spacing between warichu opening brackets (cl-28) or warichu closing brackets (cl-29) and adjacent characters is described in Appendix B Spacing between Characters as a complete table, using the concept of character class described in 3.9 About Character Classes.

Example of construction of an inline cutting note.
[Fig.144]: Example of construction of an inline cutting note.

Symbols, like opening brackets (cl-01), closing brackets (cl-02), commas (cl-07) and full stops (cl-06) are also used in inline cutting note text. In such cases, the handling of such symbols is the same as for the main text.

In vertical text, the horizontal width of the inline cutting note area is wider than the width of a kihon-hanmen line. The horizontal centers of the kihon-hanmen line and inline cutting note area are aligned. The line gap used to establish the kihon-hanmen should not be affected by the horizontal width of the inline cutting note area. In other words, the line gap for the kihon-hanmen needs to be designed wider than usual in preparation for the use of the inline cutting note. Warichu is used also in horizontal text, however it is not so common, and usually occurs only in study guides and encyclopedias.

The length of the two lines of the inline cutting note should be as near as possible the same. When the inline cutting note can be set in one kihon-hanmen line, the whole inline note text should be broken at a position where line breaking is permitted, and where the two resulting lines are as close as possible to the same length. The length of the second line should not be longer than the length of the first line. Note that the same line breaking rules are used as for basic text (see [Fig.145]).

Examples showing how the inline cutting note can be set in one line of base text.
[Fig.145]: Examples showing how the inline cutting note can be set in one line of base text.

3.4.3 Handling an Inline Cutting Note when it Straddles Two Kihon-hanmen Lines

When an inline cutting note will not fit on a single kihon-hanmen line, it will wrap onto the following line, and will be set as shown in [Fig.146] or [Fig.147].

Example of an inline cutting note straddling two base text lines.
[Fig.146]: Example of an inline cutting note straddling two base text lines.
Example of an inline cutting note straddling three base text lines.
[Fig.147]: Example of an inline cutting note straddling three base text lines.

(note 1)

Normally, an inline cutting note is short, and will therefore fit on a single kihon-hanmen line. There are cases where the note wraps onto the following line, but it is rare that it extends over three or more of the kihon-hanmen lines. If the note is too large, other styles of notation should be considered.

3.5 Paragraph Adjustment Rules

3.5.1 Line Head Indent at the Beginning of Paragraphs

A paragraph, a section of a document which consists of one or more sentences to indicate a distinct idea, usually begins on a new line. For the related line head indent at the beginning of paragraphs (in JIS 4051, this is called the "paragraph line head indent") the following methods are available. The amount of space used for the indentation is, in principle, one em space using the character size in the paragraph.

(note 1)

The definition of line head indent at the beginning of paragraphs is provided in JIS X 4051, sec. 4.17 "paragraph appearance processing". Note that "paragraph appearance processing" includes "line head indent", "line end indent" and "indent" in general.

(note 2)

The layout processing for opening brackets (cl-01) and closing brackets (cl-02) in the case of full-width line head indent at the beginning of paragraphs, is described in this document, 3.1.5 Positioning of Opening Brackets at Line Head.

  1. Line head indent at the beginning of paragraphs is applied to all paragraphs. Nearly all books and magazines make use of this method (see [Fig.148]).

    Example of line head indent at the beginning of paragraphs.
    [Fig.148]: Example of line head indent at the beginning of paragraphs.

    (note 1)

    In the case of written conversational text followed by, for example, "she said", if the conversational part is bounded by LEFT CORNER BRACKET "「" and RIGHT CORNER BRACKET "」" and the "she said" text appears after a line-break, the conversational text and the "she said" text are considered to be one continuous phrase. Hence, there is no line head indent at the beginning of the "she said" phrase, after the conversation part, i.e. so-called tentsuki (see [Fig.149]). When you have a (mathematical) formula on a separate line in horizontal writing mode, and a following line containing text such as "will be" there is also no line head indent at the beginning of the paragraph. On the other hand, there is also an approach in novels etc., where the first line indent is applied to paragraphs (see [Fig.150]).

    Layout example 1 of a line immediately following a written conversation.
    [Fig.149]: Layout example 1 of a line immediately following a written conversation.
    Layout example 2 of a line immediately following a written conversation.
    [Fig.150]: Layout example 2 of a line immediately following a written conversation.
  2. Line head indent at the beginning of paragraphs is not applied for any paragraph at all, and the tentsuki position is used (see [Fig.151]). There are examples of this method being used in certain books and magazines for the sake of styling, but this is rather hard to read.

    Example of no line head indent at the beginning of paragraphs.
    [Fig.151]: Example of no line head indent at the beginning of paragraphs.
  3. In principle, line head indent is applied at the beginning of a paragraph. However, a paragraph immediately following a tentsuki-set heading is also set tentsuki, so that the beginning of the heading and the paragraph are aligned (see [Fig.152]). In some books and magazines this method is applied to text in horizontal writing mode.

    Example of no line head indent at the beginning of paragraphs immediately following headings.
    [Fig.152]: Example of no line head indent at the beginning of paragraphs immediately following headings.

On the other hand, for example with itemization, there is also the method that indents the second and following lines of the paragraph (see [Fig.153]). This is the so-called "questions and answer" (Q&A) form. It has the effect that numbers (if used) stand out.

Layout example for itemization.
[Fig.153]: Layout example for itemization.

(note1)

The definition of itemization is provided in JIS X 4051, sec. 8.4 "itemization processing".

3.5.2 Line Head Indent and Line End Indent

The line head indent is the indentation of the line head by a fixed amount, starting from the line head side of the hanmen (in the case of one column) or of the column area (in the case of several columns). In contrast, the indentation of the line end position by a fixed amount, starting from the line head, is called line end indent.

There are examples of line head indent for quotations in separate lines (see [Fig.154]) or for headings in separate lines. Line end indent is used, for example, for headings or for quotations in separate lines.

Example of line head indent for a quotation in a separate line.
[Fig.154]: Example of line head indent for a quotation in a separate line.

(note 1)

For quotations on separate lines, there are the following approaches: either (a) the character size is the same as for the main text, and the difference from the main text is expressed only by the line head indent; or (b) the character size is made smaller than that of the main text. The former approach is applied frequently. With this approach, the line head indent is often set at double the normal character size of the main text. If large quotations are inserted frequently, there is also an approach that uses a full-width line head indent with an empty line inserted before and after the quotation. The approach where characters are made smaller than the main text is not discussed in this document (but may be addressed in a future version of this document, see 4.2 Notes).

(note 2)

Line head indent is also applied for endnotes, but this is not discussed in this document (it may be addressed in a future version, see 4.2 Notes).

3.5.3 Single Line Alignment Processing

The Japanese "single line alignment method" is a process for setting alignment for a run of text that is shorter than a given line length. This method is frequently used for headings and poems. The following methods are available (see [Fig.155]).

(note 1)

The single line alignment method is defined in JIS X 4051, sec. 4.18 "Single Line Alignment Processing".

  1. Centering: The space between adjacent characters is, in principle, set solid. (If space is needed between Japanese text and western text, before opening brackets (cl-01) and after closing brackets (cl-02), that space is inserted based on the table in Appendix B Spacing between Characters.) Also, if there is an explicit instruction to insert spaces, such spaces are inserted. If there is not solid setting but a fixed space between characters, this is used; the amount of space at the line head and line end is made equal, and the center of the character sequence is unified with the center of the line.

  2. Line head alignment: The space between adjacent characters is, in principle, set solid. (If space is needed between Japanese text and western text, before opening brackets (cl-01) and after closing brackets (cl-02), that space is inserted based on the table in Appendix B Spacing between Characters.) Also, if there is an explicit instruction to insert spaces, such spaces are inserted. If there is not solid setting but a fixed space between characters, this is used; the start of the character sequence is unified with the line head, and if the line is not full, the line end is kept empty.

  3. Line end alignment: The space between adjacent characters is, in principle, set solid. (If space is needed between Japanese text and western text, before opening brackets (cl-01) and after closing brackets (cl-02), that space is inserted based on the table in Appendix B Spacing between Characters.) Also, if there is an explicit instruction to insert spaces, such spaces are inserted. If there is not solid setting but a fixed space between characters, this is used; the end of the character sequence is unified with the line end, and if the line is not full, the line head is kept empty.

  4. Even spacing distribution: The space between adjacent characters is, in principle, set solid. (If space is needed between Japanese text and western text, before opening brackets (cl-01) and after closing brackets (cl-02), that space is inserted based on the table in Appendix B Spacing between Characters.) Also, if there is an explicit instruction to insert spaces, such spaces are inserted. In addition, using the space made available during line adjustment processing, equal character spacing is applied where possible (or character space reduction is applied where possible). The start of the character sequence is aligned to the position of the line head, and the end of the character sequence to the position of the line end.

Single line alignment processing.
[Fig.155]: Single line alignment processing.

(note 1)

Several justification methods are applied for positioning of headings or items of tables. For example, centering is often used for headings in horizontal writing mode, taking the left-right balance is taken into account. However, there are also examples of line head alignment.

(note 2)

Even spacing distribution is often used for printing Haiku in separate lines (see [Fig.156]).

Example of Haiku positioning with even spacing distribution.
[Fig.156]: Example of Haiku positioning with even spacing distribution.

3.5.4 Widow Adjustment of Paragraphs

The intent of widow adjustment of paragraphs is to avoid that the last line of a paragraph contains less than a given number of characters. This is also called "widow" processing.

(note 1)

Widow adjustment of paragraphs is defined in JIS X 4051, sec. 4.20 "Widow Adjustment of Paragraphs".

(note 2)

Widow adjustment of paragraphs is not regarded as very important in Japanese composition. However, care is taken to avoid cases such as a single character in the last line of a paragraph (often this is tolerated), or - even more extreme - just one character on a page just before a page break or a new recto (see [Fig.157]).

Example of just one character on a page just before a page break (to be avoided).
[Fig.157]: Example of just one character on a page just before a page break (to be avoided).

3.6 Tab Setting

3.6.1 Usage of Tab Setting

Tab setting is useful for alignment of table data, itemized lists, etc. where a series of characters need to be set at specific alignment positions within a line (see [Fig.158]).

Example of tab setting.
[Fig.158]: Example of tab setting.

(note 1)

Tab Setting is described in "JIS X 04051 4.21 Tab Setting".

For tab setting, it is necessary to identify tab positions, tab types (how to align the characters in the tab position), and the characters to be set. For this purpose, it is necessary to insert a tab sign before the tabbed character. The series of characters just after the tab sign are the target characters (see [Fig.159]). If there is more than one tab sign, it is necessary to set the same numbers of tab positions and tab types as the number of tab signs.

Tab signs and the target text of tab setting.
[Fig.159]: Tab signs and the target text of tab setting.

3.6.2 Types of Tab Settings

There are the following types of tab setting to align texts.

  1. Start alignment tab setting: the start position of the text is aligned to the tab position (see [Fig.160]).

    Examples of start alignment tab settings.
    [Fig.160]: Examples of start alignment tab settings.
  2. End alignment tab setting: the end position of the text is aligned to the tab position (see [Fig.161]).

    Examples of end alignment tab settings.
    [Fig.161]: Examples of end alignment tab settings.
  3. Center alignment tab setting: the center of the text is aligned to the tab position (see [Fig.162]).

    Examples of center alignment tab settings.
    [Fig.162]: Examples of center alignment tab settings.
  4. Alignment with a specified character tab setting: the start position of a specified character or sign (for example, a period) in the text is aligned to the tab position (see [Fig.163]).

    Examples of specified character alignment tab settings.
    [Fig.163]: Examples of specified character alignment tab settings.

3.6.3 The Method of Setting the Target Text

Set the text from the line head to the position before the tab sign in the first tab position, set the text from the first tab sign to the next tab sign in the second tab position, and so on. The behavior of opening brackets (cl-01) and closing brackets (cl-02), etc. is same as for the main text.

Following are some examples. The behavior of text before and after the tab positions are very difficult to anticipate, so it is necessary to design using trial and error.

  1. If the target string is the first series of the line, the characters should be set in the first tab position from the start of the line, and so on, one after another (see [Fig.164]).

    Example of tab setting 1.
    [Fig.164]: Example of tab setting 1.
  2. If the target string of text is too long to be set before the next tab position and overflows, the next string of text is aligned to the tag position after the end of the preceding string (see [Fig.165]).

    Example of tab setting 2.
    [Fig.165]: Example of tab setting 2.
  3. If the beginning of the string overlaps with the end of the preceding string as the result of the tab setting indication, the following string is set just after the preceding string (see [Fig.166]).

    Example of tab setting 3.
    [Fig.166]: Example of tab setting 3.
  4. If there is no tab position corresponding to the target string, the string should be set from the tab position of the next line, and so forth (see [Fig.167]).

    Example for tab setting 4.
    [Fig.167]: Example for tab setting 4.

3.7 Other Rules of Japanese Typesetting

3.7.1 Superscripts and Superscripts

Superscripts and subscripts are small letters associated with base characters, and typically used to indicate SI unit symbols, or used for mathematical or chemical formulae.

(note 1)

Superscripts and subscripts are described in JIS X 4051 sec.4.13.

Superscripts and subscripts are usually set after the base character, with some exceptions for chemical formulae (which appear before the base character). They should be set solid.

For examples of superscripts and subscripts see [Fig.168]. In this document, superscripts and subscripts and their base characters are handled as ornamented character complex (cl-21) characters.

Examples of superscripts and subscripts.
[Fig.168]: Examples of superscripts and subscripts.

(note 1)

Inter-letter space between ornamented character complex (cl-21) and adjacent characters is described in detail in Appendix B Spacing between Characters in accordance with the character class concept in 3.9 About Character Classes.

JIS X 4051 specifies the character size and the block direction positioning of superscripts and subscripts alongside the base character to be implementation definable parameters. However it is recommended that the size of superscripts and subscripts are around 60% of the base character, depending on the size of the base character.

It is prohibited to break lines within an ornamented character complex (cl-21) sequence. Also, it is prohibited to use inter-character spacing within an ornamented character complex (cl-21) sequence for line adjustment.

(note 1)

In vertical writing mode, characters with superscripts or subscripts, that is ornamented character complex (cl-21) characters, are rotated 90 degrees clock-wise. If the length of the sequence is short enough, the sequence can be set as tate-chu-yoko.

(note 2)

When both the superscript and the subscript follow the base character, usually the subscript is set first and set solid, followed by the superscript set with solid space. For chemical formulae sometimes both superscript and subscript are set vertically in the same position with respect to the base character, and with solid space between them and the base character.

3.7.2 Furiwake

Furiwake is a typesetting style for setting multiple phrases or sentences in in the middle of a line. Furiwake is also used to indicate options (see [Fig.169]). Study guides, manuals and reference books sometimes use furiwake. In many furiwake styles, multiple lines are indicated with opening brackets (cl-01) and closing brackets (cl-02), etc.

Example of furiwake.
[Fig.169]: Example of furiwake.

Furiwake is usually done as follows (see [Fig.170]). In this document, the lines which combine to make the furiwake are called furiwake-gyou.

  1. The character size of the furiwake-gyou is usually the same as that of the base paragraph. Sometimes, the furiwake-gyou is a little bit smaller than the size of the base paragraph. Sometimes, the font style of the furiwake-gyou is different typeface the style of the base paragraph.

  2. In the same furiwake block, the top positions of all the furiwake-gyou lines are aligned.

  3. The line length of the furiwake block is the line length of the longest furiwake-gyou. However, it is permitted to indicate the length of the furiwake block, and break the furiwake-gyou lines. In this style, the start positions of the broken lines should be explicitly indicated. When there are line break marks in the furiwake-gyou, the line is broken in the indicated places. In this style, the start positions of the wrapped lines are aligned to the first line. The space between wrapped lines should be set solid.

  4. The line-feed space of each furiwake block should be explicitly indicated.

  5. The center line of the furiwake block should be aligned with the center line of the main text.

  6. When the furiwake block is enclosed by opening brackets (cl-01) and closing brackets (cl-02), etc. the width of brackets should be the same as the width of the furiwake block.

  7. One furiwake block should be set per base text line. One furiwake block should not be extended across multiple base text lines.

Setting method of furiwake.
[Fig.170]: Setting method of furiwake.

The size of the line-feed space of the paragraph which contains the furiwake block, should be explicitly indicated. The space should be decided by considering the content of the furiwake block, and may therefore differ from the size of the line-feed space of kihon-hanmen.

In general, the width of the furiwake block is larger than the width of an inline cutting note block. However, unlike in the case of the inline cutting block, the whole furiwake block should be set inside of the kihon-hanmen, or a column of the kihon-hanmen Setting a furiwake block that extends beyond the border of the kihon-hanmen is prohibited.

3.7.3 Jidori Processing

In cases such as lists of names of Japanese people, the length of some part of the text may be explicitly defined. In such cases, different numbers of characters are set, using adjustment of the inter-character spacing, so that they are all aligned to the same length. This is called jidori processing (see [Fig.171]).

Example 1 of jidori processing.
[Fig.171]: Example 1 of jidori processing.

Sometimes, in horizontal writing mode, text in running heads (with the exception of chapter and section numbers) are set using jidori processing. For example, three to six characters are set in a 7 character space (based on the size of the characters in the running head (see [Fig.172]). Two characters are set in a 6 character space to avoid too much space. Seven characters are set solid in a seven character space, and eight or more characters are set solid in a space of eight or more characters. This rule can be applied to other numbers of characters, such as five, six and eight.

Example 2 of of jidori processing.
[Fig.172]: Example 2 of of jidori processing.

Jidori processing should be done as follows:

  1. The length for the jidori processing should be defined as a whole number of full-width characters at the size defined for the surrounding text.

  2. The jidori text should be adjusted using spacing between characters so that the sides of the text are aligned at the defined length. The following, however, should be set solid:

    1. Positions where line breaks are prohibited: inter-character spaces between European numerals; between two EM DASH "—" characters; between two TWO DOT LEADER "‥" characters; between two HORIZONTAL ELLIPSIS "…" characters; and so on. These sequences should be treated as a single block.

      (note 1)

      The handling of opening brackets (cl-01) and closing brackets (cl-02) in jidori processing is also very controversial. Usually, the space before opening brackets (cl-01) and space after closing brackets (cl-02) is set solid. In jidori processing, the space before opening brackets (cl-01) and the space after closing brackets (cl-02) may be used for line adjustment, but the space after opening brackets (cl-01) and the space before closing brackets (cl-02) should not used for adjustment, because it is prohibited to break lines in these positions. [Fig.173] shows one example with even spacing for all characters, a second example as explained here, and a third example that is set solid before and after the opening brackets (cl-01) and closing brackets (cl-02).

      Examples of jidori processing including opening brackets and closing brackets.
      [Fig.173]: Examples of jidori processing including opening brackets and closing brackets.
    2. Where Western word space (cl-26) or full-width ideographic space (cl-14) are inserted, the add same space as for other inter-character spaces to the value of the Western word space or the full-width ideographic space.

  3. If there is only one character, it should be aligned to the left of the jidori block.

3.7.4 Processing of Math Symbols and Math Operators

Math symbols and math operators, such as EQUALS SIGN "=", APPROXIMATELY EQUAL TO OR THE IMAGE OF "≒", PLUS SIGN "+" and MINUS SIGN "−" are commonly used not only for scientific and technical documents but also for ordinary books. In the Japanese composition system, there are two different groups of math symbols, which are each treated differently. So in this document math symbols are classified into two different classes; math symbols (cl-17) and math operators (cl-18).

(note 1)

The members of the math symbols (cl-17) and math operators (cl-18) classes are described in 3.9 About Character Classes. Also, the handling of inter-character spaces between these math symbols and adjacent characters is described in Appendix A Character Classes as a complete table, in accordance with the concept of character class, described in 3.9 About Character Classes.

(note 2)

Because the math symbols, such as SQUARE ROOT "√", INTEGRAL "∫" and GREEK CAPITAL LETTER SIGMA "Σ", are not frequently used in common books, the handling of these math symbols is considered to be out of scope for this document. Also, mathematical and scientific notations like numerical formulae are out of scope for this document.

Composition rules for math symbols (cl-17) and math operators (cl-18) are as follows:

  1. The width of math symbols (cl-17) and math operators (cl-18) is full-width, i.e. one em (see [Fig.174]).

  2. The inter-character space between math symbols (cl-17) or math operators (cl-18) and before and after adjacent characters, such as grouped numerals (cl-24), Western characters (cl-27), and ornamented character complex (cl-21) in one line is set solid (see [Fig.174]). However, when the top and/or the bottom of the mathematical formula is grouped numerals (cl-24) or Western characters (cl-27), the space between ideographic characters (cl-19), hiragana (cl-15) and katakana (cl-16) is quarter em space.

    Example of math symbols and math operators set within an ordinary line.
    [Fig.174]: Example of math symbols and math operators set within an ordinary line.

    (note 1)

    The inter-character space between ideographic characters (cl-19), hiragana (cl-15) or katakana (cl-16) and math symbols (cl-17) or math operators (cl-18) is solid. However, it is better to insert a quarter em space between ideographic characters (cl-19),hiragana (cl-15) or katakana (cl-16) and adjacent math operators (cl-18) when the math operators (cl-18) are followed by grouped numerals (cl-24) or Western characters (cl-27) which represent some mathematical value (see [Fig.174]).

    Example of math symbols and math operators set within an ordinary line.
    [Fig.175]: Example of math symbols and math operators set within an ordinary line.
  3. When math formulae or chemical formulae are set in one independent line, the inter-character space between math symbols (cl-17) and adjacent grouped numerals (cl-24), Western characters (cl-27) and ornamented character complex (cl-21) is quarter em. Also, when math formulae or chemical formulae are set in an individual line, the inter-character space between math operators (cl-18) and adjacent grouped numerals (cl-24), Western characters (cl-27) or ornamented character complex (cl-21) is set solid.

    Example of math symbols and math operators in one independent line.
    [Fig.176]: Example of math symbols and math operators in one independent line.

    (note 1)

    In most case when a math formula or chemical formula is set in one independent line, the position of the formula is the center of the line in horizontal writing mode, and has is indented by some indicated number of characters from the head of line in vertical writing mode.

    (note 2)

    In the formula in one independent line, there is another method to set the inter-character space between grouped numerals (cl-24) or Western characters (cl-27) and math symbols (cl-17), solid or half em. When the inter-character space before and after the math symbols (cl-17) is set to a quarter em or a half em, there is another method to set the inter-character space, between math operators (cl-18) and grouped numerals (cl-24) or Western characters (cl-27), that is, a quarter em.

    Another example of setting math symbols  and math operators in one independent formula line.
    [Fig.177]: Another example of setting math symbols and math operators in one independent formula line.
  4. A line can be broken between math symbols (cl-17) or math operators (cl-18) and adjacent grouped numerals (cl-24), Western characters (cl-27) or ornamented character complex (cl-21).

    (note 1)

    In an independent formula line, when there are more than one place where the line can be broken the first priority is before the math symbols (cl-17), and the next is before the math operators (cl-18).

    (note 2)

    The inter-character space before and after KATAKANA MIDDLE DOT "・", before opening brackets (cl-01) and after closing brackets (cl-02) in an independent formula line is set solid, except for the case to set space between these characters and adjacent math symbols (cl-17) or math operators (cl-18).

3.8 Line Adjustment

3.8.1 Necessity for Line Adjustment

Line adjustment processing is applied where inter-character adjustments are needed to bring the line end into the correct alignment, e.g. because of line wrap or other reasons. Within a paragraph, lines are created by separating character sequences at places where line breaking is not prohibited. Except for the end of the last line of a paragraph, it is necessary to set the head and end of each line at predicable, aligned positions. For the last line of the paragraph, it is still necessary to set the head at the aligned position, however the line end may not aligned to the other alignment position. To achieve this, only inter-character spacing indicated in the table of Appendix B Spacing between Characters, or explicitly chosen spaces, are inserted, and other inter-character spaces are set solid.

(note 1)

In Japanese composition, there is no concept corresponding to the Western "ragged right (flush left)", "ragged left (flush right)", or "ragged center". In Japanese composition, especially common book composition, usually only "justification" is applied. Note that, in Western composition, "justification" is usually applied for only word spaces. In Japanese composition, it is applied for not only Western word space (cl-26) but also in several other places as explained in following text.

(note 2)

Usually, the last line of a paragraph needs no adjustment. However, when the last line of a paragraph is a little bit longer than the line length, inter-character space reduction is applied for opening brackets (cl-01) , closing brackets (cl-02) and at other places, if possible.

(note 3)

There is another adjustment processing, besides line adjustment, called "single line alignment". Single line alignment is applied to align a run of the text that is shorter than a given line length to designated positions. Details are explained in 3.9 About Character Classes.

There are various reasons for line adjustment processing. Examples of the most important ones will be given below.

  1. Mixed use of characters and symbols (e.g. grouped numerals (cl-24) or Western characters (cl-27)) where not all characters are full-width (see [Fig.178]).

    Example of grouped numerals and Western characters.
    [Fig.178]: Example of grouped numerals and Western characters.
  2. Sequences of punctuation marks. For example, a sequence of a closing bracket (cl-02) and a full stop (cl-06) takes one and a half em spaces together (see [Fig.179]). However, if an opening bracket (cl-01) follows immediately after the full stop, these punctuation marks will need two em spaces together. Hence, no adjustment is needed to correctly align the line end (see [Fig.179]).

    Examples of sequences of punctuation marks.
    [Fig.179]: Examples of sequences of punctuation marks.
  3. Mixtures of characters with different sizes (see [Fig.180]).

    Example of characters within brackets which are made a level smaller than the normal character size.
    [Fig.180]: Example of characters within brackets which are made a level smaller than the normal character size.

    (note 1)

    In cases where additional information like page references, explanations of terminology, etc. appear within brackets, sometimes the character size is a level smaller than the character size established by the kihon-hanmen.

  4. Cases where line head wrapping, line end wrapping or unbreakable character sequences should not be broken (see [Fig.89]).

3.8.2 Reduction and Addition of Inter-Character Space

Line adjustment processing targets places with a predefined space or solid setting. Methods for line adjustment are as follows.

  1. Line adjustment by inter-character space reduction. This means that a half em space is reduced after commas (cl-07) or closing brackets (cl-02), or before opening brackets (cl-01), and Western word space (cl-26) is reduced within a defined limit.

  2. Line adjustment by inter-character space expansion. Line adjustment by inter-character space expansion means expanding inter-character space for line adjustment, where inter-character space is allowed to be extended up to a defined limit, such as for Western word space (cl-26) or other places where it is not prohibited to extend inter-character space.

Normally line adjustment by inter-character space reduction is preferred. Only when there are no spaces that can be reduced is line adjustment by inter-character space expansion applied. The reason for the preference of line adjustment by inter-character space reduction comes from the thinking that characters in solid setting should not have more inter character space, if at all possible.

(note 1)

Line adjustment by hanging punctuation is a method of avoiding line head wrap of full stops (cl-06) and commas (cl-07). This method is not formally defined in JIS X 4051, however JIS X 4051 does provide explanatory material about it.

Line adjustment by hanging punctuation is a method which is only applied to full stops (cl-06) and commas (cl-07). These touch the hanmen and are set in a place outside the defined line length (see [Fig.181]). This is also applied, for example, in books, in order to avoid the addition of inter character space and maintain solid setting. However, line adjustment by hanging punctuation is not an appropriate method for dealing with a mixture of Japanese and Latin script text, since the latter principally does not apply line adjustment by hanging punctuation. In addition, there is another argument against line adjustment by hanging punctuation. Originally it was a method used in letterpress printing, to make the task of line adjustment easier. Furthermore, as shown at the end of line 1 and 5 of [Fig.181], if possible the full stops (cl-06) or commas (cl-07) are placed at the line end (the 18th position). In DTP there are examples of hanging punctuation like in line 3, but this may be regarded as unnecessary processing.

Examples of line adjustment by hanging punctuation.
[Fig.181]: Examples of line adjustment by hanging punctuation.

3.8.3 Procedures for Inter-Character Space Reduction

For line adjustment by inter-character space reduction decisions must first be made about the preferred order in which reduction processing options are applied, and the maximum amount of space reduction needed. Inter-character space reduction is processed with following priorities.

(note 1)

JIS X 4051 provides also definitions for the adaptation of inline cutting note, but for the purpose of this section, these are rather complex and hence left out.

(note 2)

The details of inter-character spaces and where reduction processing may be applied are described in the table of Appendix D Opportunities for Inter-character Space Reduction during Line Adjustment, following 3.9 About Character Classes.

  1. Western word space (cl-26), which is usually one third em, is reduced by equal amounts, to leave a minimum of a quarter em space between words. The same space reduction is applied to all spaces on the target line at the same time.

  2. The half em space after closing brackets (cl-02),commas (cl-07) and full stops (cl-06) at the end of a line, is deleted and set solid.

  3. The quarter em spaces both before and after the middle dots (cl-05) are deleted and set solid.

  4. The quarter em space before or after middle dots (cl-05), in the middle of a line, is reduced equally with proportional character size as far as solid setting.

  5. The half em spaces before opening brackets (cl-01) or after closing brackets (cl-02) or commas (cl-07), in the middle of a line, are reduced equally with proportional character size, as far as solid setting.

    (note 1)

    With the exception of the line end position, the half space after full stops (cl-06) should not adapted, since it plays an important role as a sentence separator.

    (note 2)

    Commas (cl-07) fulfill different roles to opening brackets (cl-01) and closing brackets (cl-02). Hence, there are examples where it is preferred to reduce the half em space before or after brackets, rather than to reduce the half em space after commas.

    (note 3)

    The reduction of the half em space before opening brackets (cl-01) or after closing brackets (cl-02) and commas (cl-07) up to solid setting is regarded as too much reduction. Hence, there are examples where the maximum amount of space reduction is up to quarter em space.

  6. The quarter em space between Japanese text (hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19)) and Latin script text (grouped numerals (cl-24), Western characters (cl-27) and unit symbols (cl-25)), is reduced equally with proportional character size, as far as one eighth em space.

    (note 1)

    There are also examples where the quarter em space between Japanese text (hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19)) and Latin script text (Western characters (cl-27), grouped numerals (cl-24) or unit symbols (cl-25)) is regarded as fixed space, and space adaptation is not applied.

In JIS X 4051, the space after closing brackets (cl-02), commas (cl-07) and middle dots (cl-05) at the end of a line are set solid, and the space after commas (cl-07) at the end of a line is set to a half em. Accordingly, JIS X 4051 defines the priority of processing as follows:

(note 1)

The details of the inter-character spaces, where the inter-space reduction processing can be applied, which is defined in JIS X 4051, is described in detail in the table 5 of Appendix D Opportunities for Inter-character Space Reduction during Line Adjustment in accordance with the character class concept in 3.9 About Character Classes.

  1. Western word space (cl-26), which is usually one third em, is reduced by equal amounts, to leave a minimum of a quarter em space between words.

  2. The quarter em space before and after middle dots (cl-05) is reduced equally with proportional character size as far as solid setting.

  3. The half em spaces before opening brackets (cl-01) and after closing brackets (cl-02) or commas (cl-07), are reduced equally with proportional character size as far as solid setting.

  4. The quarter em spaces between Japanese text (hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19)) and Latin script text (grouped numerals (cl-24), Western characters (cl-27) and unit symbols (cl-25)) are reduced equally with proportional character size, as far as a 1/8th em space.

3.8.4 Procedures for Inter-Character Space Expansion

As with line adjustment by inter-character space reduction, for line adjustment by inter-character space expansion at first the order of processing and the maximum amount of space to be added are defined. In JIS X 4051, the following processing order is defined.

  1. Western word space (cl-26), which is usually one third em, is added equally with proportional character size up to a maximum of a half em size for each space.

  2. The quarter em space between Japanese text (hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19)) and Latin script text (grouped numerals (cl-24), Western characters (cl-27) and unit symbols (cl-25)) is increased equally with proportional character size, up to half em space (or one third em space).

    (note 1)

    Like with inter-character space reduction, there are also examples there the quarter em space between Japanese text (hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19)) and Latin script text (Western characters (cl-27), grouped numerals (cl-24) and unit symbols (cl-25)) is regarded as a fixed space, and space adaptation is not applied.

  3. For places which do not fall under (a) or (b) and which do not have bunrikinshi, space is added equally with proportional character size up to a quarter em space.

  4. In addition to the adaptation in the manner of (a), (b) and (c), in cases where such processing is not possible, space is added equally with proportional character size, with the exception of places which require bunrikinshi.

    (note 1)

    JIS X 4051 provides a definition in addition to (d). This says that it depends on each layout processing system whether inter-character space should be added equally. This includes the space between Western characters (cl-27).

    (note 2)

    The detail of the places where space expansion is possible for line adjustment is described in Appendix B Spacing between Characters as a complete table, in accordance with the concept of character class in 3.9 About Character Classes.

3.9 About Character Classes

3.9.1 Differences in Positioning of Characters and Symbols

The positioning of characters and symbols may vary depending on the following.

  1. Is the character width full-width, half-width, or something else?

  2. Is it allowed or forbidden to place the character or symbol at the line head? If it is allowed, how will it be placed?

  3. Is it allowed or forbidden to place the character or symbol at the line end? If it is allowed, how will it be placed?

  4. Are characters and symbols appearing in sequence in solid setting, or will there be a fixed space between them? For example, sequences of ideographic characters (cl-19) and hiragana (cl-15) are set solid, and for Western characters (cl-27) following hiragana (cl-15) there will be quarter em space.

  5. Is it allowed to have a line break within a sequence of characters? For example, there must not be a line break in a sequence of grouped numerals (cl-24).

  6. Is it allowed to use the space between characters in a sequence during line adjustment processing? For example, is inter-character space reduction or addition possible between the characters appearing in sequence? Another issue to be decided is the preferred order for adjustment processing, and the amount of the allowed adjustment.

3.9.2 Grouping of Characters and Symbols depending on their Positioning

During layout processing, the issues mentioned in the previous section are addressed by grouping characters and symbols according to their characteristics, and handling them as character classes.

JIS X 4051 also provides similar character classes but that are slightly different from this document. Furthermore JIS X 4051 states that it is implementation-defined how to handle characters that are not explicitly mentioned, e.g. whether they should belong to either class or not.

(note 1)

In JIS X 4051 Annex 1, the member characters and symbols of each character class are specified as a mapping table to JIS X 0213 character names.

A few character classes of this document are modified from JIS X 4051. In Appendix A Character Classes, there is a whole mapping table to ISO/IEC 10646 Annex A collection 285 (BASIC JAPANESE) and collection 286 (JAPANESE NON IDEOGRAPHIC EXTENSION). All character classes of this document are as follows:

  1. Opening brackets (cl-01)

    Example:

    ‘“(〔[{〈《「『【

    etc.

  2. Closing brackets (cl-02)

    Example:

    ’”)〕]}〉》」』】

    etc.

    (note 1)

    In JIS X 4051, IDEOGRAPHIC COMMA "、" and COMMA "," are classified as closing brackets (cl-02), because they have similar positioning methods. However, in this document, the handling of IDEOGRAPHIC COMMA "、" and COMMA "," are described as an independent class, named commas (cl-07).

  3. Hyphens (cl-03)

    Example:

    ‐〜

    etc.

  4. Dividing punctuation marks (cl-04)

    Example:

    ?!

    etc.

  5. Middle dots (cl-05)

    Example:

    ・:;

  6. Full stops (cl-06)

    Example:

    。.

  7. Commas (cl-07)

    Example:

    、,

  8. Inseparable characters (cl-08)

    Example:

    —…‥

    etc.

  9. Iteration marks (cl-09)

    Example:

    ヽヾゝゞ々

    etc.

    (note 1)

    In JIS X 4051, iteration marks such as IDEOGRAPHIC ITERATION MARK "々" are classified as "no line break allowed before" characters. In this document, IDEOGRAPHIC ITERATION MARK "々" etc. are classified as iteration marks (cl-09).

    (note 2)

    There is another method where it is permitted to break a line before IDEOGRAPHIC ITERATION MARK "々". In this case, IDEOGRAPHIC ITERATION MARK "々" is regarded as a member of ideographic characters (cl-19).

  10. Prolonged sound marks (cl-10)

    Example:

    (note 1)

    In JIS X 405, KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー" is a member of "Japanese characters with no line break allowed before". In this document KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー" is the only member of prolonged sound mark (cl-10).

    (note 2)

    In JIS X 4051, it is permitted to exclude KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー" from the "Japanese characters with no line break allowed before" character class.

    (note 3)

    When it is permitted to break a line before KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー", KATAKANA-HIRAGANA PROLONGED SOUND MARK "ー" is regarded as a member of the katakana (cl-16) class.

  11. Small kana (cl-11)

    Example:

    ぁぃぅぇぉァィゥェォっゃゅょッャュョ

    etc.

    (note 1)

    In JIS X 4051, small katakana letters, such as KATAKANA LETTER SMALL TU "ッ", are regarded as members of "Japanese characters with no line break allowed before" character class. In this document, small katakana letters are classified in small kana (cl-11). Accordingly, the "Japanese characters with no line break allowed before" character class in JIS X 4051 is divided to three separate classes; iteration marks (cl-09), prolonged sound mark (cl-10) and small kana (cl-11).

    (note 2)

    In JIS X 4051, it is permitted to exclude small kana letters (ぁぃぅァィゥ etc.) from the "Japanese characters with no line break allowed before" character class as an implementation definable option.

    (note 3)

    When it is permitted to break a line before small hiragana (ぁぃぅ etc.) are regarded as members of the hiragana (cl-15) class, and small katakana (ァィゥ etc.) are regarded as members of the katakana (cl-16) class.

  12. Prefixed abbreviations (cl-12)

    Example:

    ¥$£#

    etc.

  13. Postfixed abbreviations (cl-13)

    Example:

    °′″℃¢%‰

    etc.

  14. Full-width ideographic space (cl-14)

    Example:

    U+3000 IDEOGRAPHIC SPACE

  15. Hiragana (cl-15)

    Example:

    あいうえおかがきぎ

    etc.

    (note 1)

    The reason kanji etc. (Japanese characters except 1. to 12.) and hiragana are regarded different classes in JIS X 4051, is the difference in the case of ruby, and specifically the handling in terms of characters jutting out of the base and overhanging adjacent characters.

  16. Katakana (cl-16)

    Example:

    アイウエオカガキギ

    etc.

    (note 1)

    In JIS X 4051, katakana and kanji are included in the same class (Japanese characters except for 1. to 12.). However, in this document, when ruby characters jut out of the base characters and overhang adjacent hiragana or katakana, the handling is same. That is the reason that in this document, katakana (cl-16) is an independent character class.

  17. Math symbols (cl-17)

    Example:

    =≠<>≦≧⊆⊇∪∩

    etc.

    (note 1)

    In JIS X 4051, math symbols (+-÷×etc.) and math operators (=≠<>≦≧⊆⊇∪∩etc.) are included in the "Japanese characters excluded from 1. to 12." class or Western character class. However, handling of math symbols and math operators adjacent to Western character or Arabic numerals is different from kanji. So, in this document, new math operators (cl-18) and math symbols (cl-17) classes are defined.

  18. Math operators (cl-18)

    Example:

    +-÷×

    etc.

  19. Ideographic characters (cl-19)

    Example:

    亜唖娃阿哀愛挨〃仝〆♂♀

    etc.

    (note 1)

    In JIS X 4051, corresponding character class for ideographic characters (cl-19) is "Japanese characters excluded from 1. to 12.".

  20. Characters as reference marks (cl-20)

    Characters which are inside verification seal (those are characters inside a verification seal that appear in the line just after the item applicable for reference marks of notes)

  21. Ornamented character complexes (cl-21)

    (note 1)

    The name of this class in JIS X 4051 is "Characters included in ornamented base characters complex". The meaning of "base characters complex" is characters in a complex including ruby, ornament characters and emphasis dots.

  22. Simple-ruby character complexes (cl-22)

    (note 1)

    The name of this class in JIS X 4051 is "Characters included in base characters complex with ruby (excluding jukugo-ruby)".

  23. Jukugo-ruby character complexes (cl-23)

    (note 1)

    The name of this class in JIS X 4051 is "Characters included in base characters complex with jukugo-ruby".

  24. Grouped numerals (cl-24)

    Sequences of European numerals which are not full-width and are handled as Japanese text, the decimal point or the comma and space used as grade indicator in number.

  25. Unit symbols (cl-25)

    Units described here include combinations of Latin script and Greek script characters used for international units (SI).

    (note 1)

    There are units created with combinations of Latin and Greek script characters with a full-width character frame (full-width units). Such units are not part of the characters for units described here. Furthermore, full-width characters for units are mainly used in vertical writing mode. Their usage in horizontal writing mode is regarded as bad style and should be avoided (see [Fig.182]).

    Example of a unit which encompasses a full-width unit character (upper part) and characters for Latin script text (lower part).
    [Fig.182]: Example of a unit which encompasses a full-width unit character (upper part) and characters for Latin script text (lower part).
  26. Western word space (cl-26)

  27. Western characters (cl-27)

    (note 1)

    Western characters (cl-27) include punctuation marks, such as commas, used in Western context. Among these punctuation marks, several marks are used both in a Japanese context and Western context. However, these marks have different character shapes depending on whether they are used in a Japanese context or Western context. For example, LEFT PARENTHESIS "(" and RIGHT PARENTHESIS ")" has not only different width (Japanese, half em, Western, proportional) but are also different in line position (Japanese, center of the character frame in the inline direction, Western, base line and descender line dependent) and design (Japanese, slightly bent and constant line thickness, Western, strongly curved and dynamic line thickness). The usage of these two differently designed commas should be explicit. Usually, in a Japanese context Japanese design is used, and in Western context Western design is used. However, there are some ambiguous cases, such as "エディター(editor)は……". In this case, English spelling is indicated using parentheses in a Japanese line of text. In this particular case, Japanese design is better.

  28. Warichu opening brackets (cl-28)

    Example:

    (〔[

    etc.

  29. Warichu closing brackets (cl-29)

    Example:

    )〕]

    etc.

    (note 1)

    warichu opening brackets (cl-28) and warichu closing brackets (cl-29) are used for surrounding inline cutting notes and the space before and after. They are in a separate class since they differ from normal brackets with regard to their processing.

  30. Characters in tate-chu-yoko (cl-30)

3.9.3 Positioning Methods for each Character Class

For each character class it is possible to describe whether the characters may appear at the line head or line end or not, the positioning method for the line head or line end positions (if available), the amount of space between sequences of several characters, and the combination with character classes before or after the characters (in a 2 dimensional table). In JIS X 4051 this is shown in table 5 "Amount of space (between characters)".

(note 1)

For the presentation as a two dimensional table, it becomes necessary for each class to have separate items about "line head" (the column about the character classes appearing before) and "line end" (the column about the character classes appearing after). If it is forbidden that the characters of the class appear at the line head or line end, JIS X 4051 uses an "X" mark in the columns for "line head" and "line end".

Also, it can be defined for each combination of the character classes (in a two dimensional table) whether the characters of classes appearing in sequence allow for a line break between them, or whether it is possible during line adjustment processing to add inter character space between them. In JIS X 4051 these items are also shown in a two dimensional table. Table 6 shows whether a line break is possible, and table 7 shows if it is possible to add inter character space.

(note 1)

It is also possible to define for each combination of character classes whether it is possible to apply kerning during line adjustment processing for character classes appearing in sequence. However, JIS X 4051 does not provide this information as a two dimensional table, but only as a textual description.

The width, in principle, of the space between each character or symbol in character classes used in this document is described in the table of Appendix B Spacing between Characters.

The combinations of adjacent characters and symbols in character classes used in this document, and where text is breakable or not, is described in the table of Appendix C Possibilities for Line-breaking between Characters.

The width of spaces between each character or symbol in character classes used in this document, and which can be reduced, is described in the table of Appendix D Opportunities for Inter-character Space Reduction during Line Adjustment. Also, expandable spaces are described in the table of Appendix E Opportunities for Inter-character Space Expansion during Line Adjustment.

4 Approach to Hanmen Design

4.1 Headings

This section will be written in an updated version of this document.

4.2 Notes

This section will be written in an updated version of this document.

4.3 Illustrations

This section will be written in an updated version of this document.

4.4 Tables

This section will be written in an updated version of this document.

Appendix A Character Classes

The following are lists of (non-ideographic) characters from a subset of ISO/IEC 10646 (collection number 285 "BASIC JAPANESE" and 286 "JAPANESE NON IDEOGRAPHICS EXTENSION") grouped by character class according to the classification explained in 3.9.2 Grouping of Characters and Symbols depending on their Positioning.

(note 1)

General punctuation marks are script neutral in ISO/IEC 10646 and there is no "Opening Ideographic Parenthesis" specifically encoded for CJK scripts in the standard. However, because of the typographic difference among scripts with respect to base line position as well as width and glyph design, the same set of punctuation marks usually do not fit to both Latin and Japanese text. To work around this issue, many existing implementations use compatibility characters encoded separately in the standard for the purpose of round-trip integrity with the legacy encoding standards. For example, while this document lists U+0028 LEFT PARENTHESIS as a member of opening brackets (cl-01), the compatibility counterpart U+FF08 FULLWIDTH LEFT PARENTHESIS is used in the Japanese context in most Japanese text layout software.

A.1 Opening brackets (cl-01)

CharacterUCSNameRemark
2018LEFT SINGLE QUOTATION MARKused horizontal composition
201CLEFT DOUBLE QUOTATION MARKused horizontal composition
0028LEFT PARENTHESIS
3014LEFT TORTOISE SHELL BRACKET
005BLEFT SQUARE BRACKET
007BLEFT CURLY BRACKET
3008LEFT ANGLE BRACKET
300ALEFT DOUBLE ANGLE BRACKET
300CLEFT CORNER BRACKET
300ELEFT WHITE CORNER BRACKET
3010LEFT BLACK LENTICULAR BRACKET
2985LEFT WHITE PARENTHESIS
3018LEFT WHITE TORTOISE SHELL BRACKET
3016LEFT WHITE LENTICULAR BRACKET
«00ABLEFT-POINTING DOUBLE ANGLE QUOTATION MARK
301DREVERSED DOUBLE PRIME QUOTATION MARKused vertical composition

A.2 Closing brackets (cl-02)

CharacterUCSNameRemark
2019RIGHT SINGLE QUOTATION MARKused horizontal composition
201DRIGHT DOUBLE QUOTATION MARKused horizontal composition
0029RIGHT PARENTHESIS
3015RIGHT TORTOISE SHELL BRACKET
005DRIGHT SQUARE BRACKET
007DRIGHT CURLY BRACKET
3009RIGHT ANGLE BRACKET
300BRIGHT DOUBLE ANGLE BRACKET
300DRIGHT CORNER BRACKET
300FRIGHT WHITE CORNER BRACKET
3011RIGHT BLACK LENTICULAR BRACKET
2986RIGHT WHITE PARENTHESIS
3019RIGHT WHITE TORTOISE SHELL BRACKET
3017RIGHT WHITE LENTICULAR BRACKET
»00BBRIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
301FLOW DOUBLE PRIME QUOTATION MARKused vertical composition

A.3 Hyphens (cl-03)

CharacterUCSNameRemark
2010HYPHENquarter em width
301CWAVE DASH
30A0KATAKANA-HIRAGANA DOUBLE HYPHENhalf-width
2013EN DASHhalf-width

A.4 Dividing punctuation marks (cl-04)

CharacterUCSNameRemark
003FQUESTION MARK
0021EXCLAMATION MARK
203CDOUBLE EXCLAMATION MARK
2047DOUBLE QUESTION MARK
2048QUESTION EXCLAMATION MARK
2049EXCLAMATION QUESTION MARK

A.5 Middle dots (cl-05)

CharacterUCSNameRemark
30FBKATAKANA MIDDLE DOT
003ACOLON
003BSEMICOLONused horizontal composition

A.6 Full stops (cl-06)

CharacterUCSNameRemark
3002IDEOGRAPHIC FULL STOP
002EFULL STOPused horizontal composition

A.7 Commas (cl-07)

CharacterUCSNameRemark
3001IDEOGRAPHIC COMMA
002CCOMMAused horizontal composition

A.8 Inseparable characters (cl-08)

CharacterUCSNameRemark
2014EM DASHSome systems implement U+2015 HORIZONTAL BAR very similar behavior to U+2014 EM DASH
2026HORIZONTAL ELLIPSIS
2025TWO DOT LEADER
3033VERTICAL KANA REPEAT MARK UPPER HALFused vertical composition
U+3035 follows this
3034VERTICAL KANA REPEAT WITH VOICED SOUND MARK UPPER HALFused vertical composition
U+3035 follows this
3035VERTICAL KANA REPEAT MARK LOWER HALFused vertical composition

A.9 Iteration marks (cl-09)

CharacterUCSNameRemark
30FDKATAKANA ITERATION MARK
30FEKATAKANA VOICED ITERATION MARK
309DHIRAGANA ITERATION MARK
309EHIRAGANA VOICED ITERATION MARK
3005IDEOGRAPHIC ITERATION MARK
303BVERTICAL IDEOGRAPHIC ITERATION MARK

A.10 Prolonged sound mark (cl-10)

CharacterUCSNameRemark
30FCKATAKANA-HIRAGANA PROLONGED SOUND MARK

A.11 Small kana (cl-11)

CharacterUCSNameRemark
3041HIRAGANA LETTER SMALL A
3043HIRAGANA LETTER SMALL I
3045HIRAGANA LETTER SMALL U
3047HIRAGANA LETTER SMALL E
3049HIRAGANA LETTER SMALL O
30A1KATAKANA LETTER SMALL A
30A3KATAKANA LETTER SMALL I
30A5KATAKANA LETTER SMALL U
30A7KATAKANA LETTER SMALL E
30A9KATAKANA LETTER SMALL O
3063HIRAGANA LETTER SMALL TU
3083HIRAGANA LETTER SMALL YA
3085HIRAGANA LETTER SMALL YU
3087HIRAGANA LETTER SMALL YO
308EHIRAGANA LETTER SMALL WA
3095HIRAGANA LETTER SMALL KA
3096HIRAGANA LETTER SMALL KE
30C3KATAKANA LETTER SMALL TU
30E3KATAKANA LETTER SMALL YA
30E5KATAKANA LETTER SMALL YU
30E7KATAKANA LETTER SMALL YO
30EEKATAKANA LETTER SMALL WA
30F5KATAKANA LETTER SMALL KA
30F6KATAKANA LETTER SMALL KE
31F0KATAKANA LETTER SMALL KU
31F1KATAKANA LETTER SMALL SI
31F2KATAKANA LETTER SMALL SU
31F3KATAKANA LETTER SMALL TO
31F4KATAKANA LETTER SMALL NU
31F5KATAKANA LETTER SMALL HA
31F6KATAKANA LETTER SMALL HI
31F7KATAKANA LETTER SMALL HU
31F8KATAKANA LETTER SMALL HE
31F9KATAKANA LETTER SMALL HO
31FAKATAKANA LETTER SMALL MU
31FBKATAKANA LETTER SMALL RA
31FCKATAKANA LETTER SMALL RI
31FDKATAKANA LETTER SMALL RU
31FEKATAKANA LETTER SMALL RE
31FFKATAKANA LETTER SMALL RO
ㇷ゚<31F7, 309A><KATAKANA LETTER SMALL HU, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>

A.12 Prefixed abbreviations (cl-12)

CharacterUCSNameRemark
00A5YEN SIGN
0024DOLLAR SIGN
00A3POUND SIGN
0023NUMBER SIGN
20ACEURO SIGN
2116NUMERO SIGN

A.13 Postfixed abbreviations (cl-13)

CharacterUCSNameRemark
°00B0DEGREE SIGNproportional
2032PRIMEproportional
2033DOUBLE PRIMEproportional
2103DEGREE CELSIUS
00A2CENT SIGN
0025PERCENT SIGN
2030PER MILLE SIGN
33CBSQUARE HP
2113SCRIPT SMALL L

A.14 Full-width ideographic space (cl-14)

CharacterUCSNameRemark
 3000IDEOGRAPHIC SPACE

A.15 Hiragana (cl-15)

CharacterUCSNameRemark
3042HIRAGANA LETTER A
3044HIRAGANA LETTER I
3046HIRAGANA LETTER U
3048HIRAGANA LETTER E
304AHIRAGANA LETTER O
304BHIRAGANA LETTER KA
304CHIRAGANA LETTER GA
304DHIRAGANA LETTER KI
304EHIRAGANA LETTER GI
304FHIRAGANA LETTER KU
3050HIRAGANA LETTER GU
3051HIRAGANA LETTER KE
3052HIRAGANA LETTER GE
3053HIRAGANA LETTER KO
3054HIRAGANA LETTER GO
3055HIRAGANA LETTER SA
3056HIRAGANA LETTER ZA
3057HIRAGANA LETTER SI
3058HIRAGANA LETTER ZI
3059HIRAGANA LETTER SU
305AHIRAGANA LETTER ZU
305BHIRAGANA LETTER SE
305CHIRAGANA LETTER ZE
305DHIRAGANA LETTER SO
305EHIRAGANA LETTER ZO
305FHIRAGANA LETTER TA
3060HIRAGANA LETTER DA
3061HIRAGANA LETTER TI
3062HIRAGANA LETTER DI
3064HIRAGANA LETTER TU
3065HIRAGANA LETTER DU
3066HIRAGANA LETTER TE
3067HIRAGANA LETTER DE
3068HIRAGANA LETTER TO
3069HIRAGANA LETTER DO
306AHIRAGANA LETTER NA
306BHIRAGANA LETTER NI
306CHIRAGANA LETTER NU
306DHIRAGANA LETTER NE
306EHIRAGANA LETTER NO
306FHIRAGANA LETTER HA
3070HIRAGANA LETTER BA
3071HIRAGANA LETTER PA
3072HIRAGANA LETTER HI
3073HIRAGANA LETTER BI
3074HIRAGANA LETTER PI
3075HIRAGANA LETTER HU
3076HIRAGANA LETTER BU
3077HIRAGANA LETTER PU
3078HIRAGANA LETTER HE
3079HIRAGANA LETTER BE
307AHIRAGANA LETTER PE
307BHIRAGANA LETTER HO
307CHIRAGANA LETTER BO
307DHIRAGANA LETTER PO
307EHIRAGANA LETTER MA
307FHIRAGANA LETTER MI
3080HIRAGANA LETTER MU
3081HIRAGANA LETTER ME
3082HIRAGANA LETTER MO
3084HIRAGANA LETTER YA
3086HIRAGANA LETTER YU
3088HIRAGANA LETTER YO
3089HIRAGANA LETTER RA
308AHIRAGANA LETTER RI
308BHIRAGANA LETTER RU
308CHIRAGANA LETTER RE
308DHIRAGANA LETTER RO
308FHIRAGANA LETTER WA
3090HIRAGANA LETTER WI
3091HIRAGANA LETTER WE
3092HIRAGANA LETTER WO
3093HIRAGANA LETTER N
3094HIRAGANA LETTER VU
か゚<304B, 309A><HIRAGANA LETTER KA, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
き゚<304D, 309A><HIRAGANA LETTER KI, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
く゚<304F, 309A><HIRAGANA LETTER KU, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
け゚<3051, 309A><HIRAGANA LETTER KE, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
こ゚<3053, 309A><HIRAGANA LETTER KO, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>

A.16 Katakana (cl-16)

CharacterUCSNameRemark
30A2KATAKANA LETTER A
30A4KATAKANA LETTER I
30A6KATAKANA LETTER U
30A8KATAKANA LETTER E
30AAKATAKANA LETTER O
30ABKATAKANA LETTER KA
30ACKATAKANA LETTER GA
30ADKATAKANA LETTER KI
30AEKATAKANA LETTER GI
30AFKATAKANA LETTER KU
30B0KATAKANA LETTER GU
30B1KATAKANA LETTER KE
30B2KATAKANA LETTER GE
30B3KATAKANA LETTER KO
30B4KATAKANA LETTER GO
30B5KATAKANA LETTER SA
30B6KATAKANA LETTER ZA
30B7KATAKANA LETTER SI
30B8KATAKANA LETTER ZI
30B9KATAKANA LETTER SU
30BAKATAKANA LETTER ZU
30BBKATAKANA LETTER SE
30BCKATAKANA LETTER ZE
30BDKATAKANA LETTER SO
30BEKATAKANA LETTER ZO
30BFKATAKANA LETTER TA
30C0KATAKANA LETTER DA
30C1KATAKANA LETTER TI
30C2KATAKANA LETTER DI
30C4KATAKANA LETTER TU
30C5KATAKANA LETTER DU
30C6KATAKANA LETTER TE
30C7KATAKANA LETTER DE
30C8KATAKANA LETTER TO
30C9KATAKANA LETTER DO
30CAKATAKANA LETTER NA
30CBKATAKANA LETTER NI
30CCKATAKANA LETTER NU
30CDKATAKANA LETTER NE
30CEKATAKANA LETTER NO
30CFKATAKANA LETTER HA
30D0KATAKANA LETTER BA
30D1KATAKANA LETTER PA
30D2KATAKANA LETTER HI
30D3KATAKANA LETTER BI
30D4KATAKANA LETTER PI
30D5KATAKANA LETTER HU
30D6KATAKANA LETTER BU
30D7KATAKANA LETTER PU
30D8KATAKANA LETTER HE
30D9KATAKANA LETTER BE
30DAKATAKANA LETTER PE
30DBKATAKANA LETTER HO
30DCKATAKANA LETTER BO
30DDKATAKANA LETTER PO
30DEKATAKANA LETTER MA
30DFKATAKANA LETTER MI
30E0KATAKANA LETTER MU
30E1KATAKANA LETTER ME
30E2KATAKANA LETTER MO
30E4KATAKANA LETTER YA
30E6KATAKANA LETTER YU
30E8KATAKANA LETTER YO
30E9KATAKANA LETTER RA
30EAKATAKANA LETTER RI
30EBKATAKANA LETTER RU
30ECKATAKANA LETTER RE
30EDKATAKANA LETTER RO
30EFKATAKANA LETTER WA
30F0KATAKANA LETTER WI
30F1KATAKANA LETTER WE
30F2KATAKANA LETTER WO
30F3KATAKANA LETTER N
30F4KATAKANA LETTER VU
カ゚<30AB, 309A><KATAKANA LETTER KA, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
キ゚<30AD, 309A><KATAKANA LETTER KI, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
ク゚<30AF, 309A><KATAKANA LETTER KU, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
ケ゚<30B1, 309A><KATAKANA LETTER KE, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
コ゚<30B3, 309A><KATAKANA LETTER KO, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
セ゚<30BB, 309A><KATAKANA LETTER SE, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
ツ゚<30C4, 309A><KATAKANA LETTER TU, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
ト゚<30C8, 309A><KATAKANA LETTER TO, COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK>
30F7KATAKANA LETTER VA
30F8KATAKANA LETTER VI
30F9KATAKANA LETTER VE
30FAKATAKANA LETTER VO

A.17 Math symbols (cl-17)

CharacterUCSNameRemark
003DEQUALS SIGN
2260NOT EQUAL TO
003CLESS-THAN SIGN
003EGREATER-THAN SIGN
2266LESS-THAN OVER EQUAL TO
2267GREATER-THAN OVER EQUAL TO
2208ELEMENT OF
220BCONTAINS AS MEMBER
2286SUBSET OF OR EQUAL TO
2287SUPERSET OF OR EQUAL TO
2282SUBSET OF
2283SUPERSET OF
222AUNION
2229INTERSECTION
2284NOT A SUBSET OF
2285NOT A SUPERSET OF
228ASUBSET OF WITH NOT EQUAL TO
228BSUPERSET OF WITH NOT EQUAL TO
2209NOT AN ELEMENT OF
2305PROJECTIVE
2306PERSPECTIVE
2227LOGICAL AND
2228LOGICAL OR
21D2RIGHTWARDS DOUBLE ARROW
21D4LEFT RIGHT DOUBLE ARROW
2225PARALLEL TO
2226NOT PARALLEL TO
2261IDENTICAL TO
2252APPROXIMATELY EQUAL TO OR THE IMAGE OF
226AMUCH LESS-THAN
226BMUCH GREATER-THAN
223DREVERSED TILDE (lazy S)
221DPROPORTIONAL TO
2262NOT IDENTICAL TO
2243ASYMPTOTICALLY EQUAL TO
2245APPROXIMATELY EQUAL TO
2248ALMOST EQUAL TO
2276LESS-THAN OR GREATER-THAN
2277GREATER-THAN OR LESS-THAN
22A5UP TACK
2194LEFT RIGHT ARROW
22DALESS-THAN EQUAL TO OR GREATER-THAN
22DBGREATER-THAN EQUAL TO OR LESS-THAN

A.18 Math operators (cl-18)

CharacterUCSNameRemark
002BPLUS SIGN
2212MINUS SIGN
±00B1PLUS-MINUS SIGN
×00D7MULTIPLICATION SIGN
÷00F7DIVISION SIGN
2295CIRCLED PLUS
2296CIRCLED MINUS
2297CIRCLED TIMES
2213MINUS-OR-PLUS SIGN

A.19 Ideographic characters (cl-19)

In addition to CJK Ideographs, ideographic characters (cl-19) also includes some handful of other symbols. The following is the list of all non-ideographic characters assigned to this character class.

CharacterUCSNameRemark
3003DITTO MARK
4EDDCJK UNIFIED IDEOGRAPH-4EDD
3006IDEOGRAPHIC CLOSING MARK
3007IDEOGRAPHIC NUMBER ZERO
002FSOLIDUS
005CREVERSE SOLIDUS
2016DOUBLE VERTICAL LINE
007CVERTICAL LINE
221EINFINITY
2234THEREFORE
2642MALE SIGN
2640FEMALE SIGN
0026AMPERSAND
002AASTERISK
0040COMMERCIAL AT
§00A7SECTION SIGN
2606WHITE STAR
2605BLACK STAR
25CBWHITE CIRCLE
25CFBLACK CIRCLE
25CEBULLSEYE
25C7WHITE DIAMOND
25C6BLACK DIAMOND
25A1WHITE SQUARE
25A0BLACK SQUARE
25B3WHITE UP-POINTING TRIANGLE
25B2BLACK UP-POINTING TRIANGLE
25BDWHITE DOWN-POINTING TRIANGLE
25BCBLACK DOWN-POINTING TRIANGLE
203BREFERENCE MARK
3012POSTAL MARK
2192RIGHTWARDS ARROW
2190LEFTWARDS ARROW
2191UPWARDS ARROW
2193DOWNWARDS ARROW
3013GETA MARK
303CMASU MARK
30FFKATAKANA DIGRAPH KOTO
309FHIRAGANA DIGRAPH YORI
221ASQUARE ROOT
2235BECAUSE
222BINTEGRAL
222CDOUBLE INTEGRAL
266FMUSIC SHARP SIGN
266DMUSIC FLAT SIGN
266AEIGHTH NOTE
2020DAGGER
2021DOUBLE DAGGER
00B6PILCROW SIGN
266EMUSIC NATURAL SIGN
266BBEAMED EIGHTH NOTES
266CBEAMED SIXTEENTH NOTES
2669QUARTER NOTE
25EFLARGE CIRCLE
25B7WHITE RIGHT-POINTING TRIANGLE
25B6BLACK RIGHT-POINTING TRIANGLE
25C1WHITE LEFT-POINTING TRIANGLE
25C0BLACK LEFT-POINTING TRIANGLE
2197NORTH EAST ARROW
2198SOUTH EAST ARROW
2196NORTH WEST ARROW
2199SOUTH WEST ARROW
21C4RIGHTWARDS ARROW OVER LEFTWARDS ARROW
21E8RIGHTWARDS WHITE ARROW
21E6LEFTWARDS WHITE ARROW
21E7UPWARDS WHITE ARROW
21E9DOWNWARDS WHITE ARROW
2934ARROW POINTING RIGHTWARDS THEN CURVING UPWARDS
2935ARROW POINTING RIGHTWARDS THEN CURVING DOWNWARDS
0030DIGIT ZERO
0031DIGIT ONE
0032DIGIT TWO
0033DIGIT THREE
0034DIGIT FOUR
0035DIGIT FIVE
0036DIGIT SIX
0037DIGIT SEVEN
0038DIGIT EIGHT
0039DIGIT NINE
⦿29BFCIRCLED BULLET
25C9FISHEYE
303DPART ALTERNATION MARK
25E6WHITE BULLET
2022BULLET
0041LATIN CAPITAL LETTER A
0042LATIN CAPITAL LETTER B
0043LATIN CAPITAL LETTER C
0044LATIN CAPITAL LETTER D
0045LATIN CAPITAL LETTER E
0046LATIN CAPITAL LETTER F
0047LATIN CAPITAL LETTER G
0048LATIN CAPITAL LETTER H
0049LATIN CAPITAL LETTER I
004ALATIN CAPITAL LETTER J
004BLATIN CAPITAL LETTER K
004CLATIN CAPITAL LETTER L
004DLATIN CAPITAL LETTER M
004ELATIN CAPITAL LETTER N
004FLATIN CAPITAL LETTER O
0050LATIN CAPITAL LETTER P
0051LATIN CAPITAL LETTER Q
0052LATIN CAPITAL LETTER R
0053LATIN CAPITAL LETTER S
0054LATIN CAPITAL LETTER T
0055LATIN CAPITAL LETTER U
0056LATIN CAPITAL LETTER V
0057LATIN CAPITAL LETTER W
0058LATIN CAPITAL LETTER X
0059LATIN CAPITAL LETTER Y
005ALATIN CAPITAL LETTER Z
0061LATIN SMALL LETTER A
0062LATIN SMALL LETTER B
0063LATIN SMALL LETTER C
0064LATIN SMALL LETTER D
0065LATIN SMALL LETTER E
0066LATIN SMALL LETTER F
0067LATIN SMALL LETTER G
0068LATIN SMALL LETTER H
0069LATIN SMALL LETTER I
006ALATIN SMALL LETTER J
006BLATIN SMALL LETTER K
006CLATIN SMALL LETTER L
006DLATIN SMALL LETTER M
006ELATIN SMALL LETTER N
006FLATIN SMALL LETTER O
0070LATIN SMALL LETTER P
0071LATIN SMALL LETTER Q
0072LATIN SMALL LETTER R
0073LATIN SMALL LETTER S
0074LATIN SMALL LETTER T
0075LATIN SMALL LETTER U
0076LATIN SMALL LETTER V
0077LATIN SMALL LETTER W
0078LATIN SMALL LETTER X
0079LATIN SMALL LETTER Y
007ALATIN SMALL LETTER Z
29FADOUBLE PLUS
29FBTRIPLE PLUS
Α0391GREEK CAPITAL LETTER ALPHA
Β0392GREEK CAPITAL LETTER BETA
Γ0393GREEK CAPITAL LETTER GAMMA
Δ0394GREEK CAPITAL LETTER DELTA
Ε0395GREEK CAPITAL LETTER EPSILON
Ζ0396GREEK CAPITAL LETTER ZETA
Η0397GREEK CAPITAL LETTER ETA
Θ0398GREEK CAPITAL LETTER THETA
Ι0399GREEK CAPITAL LETTER IOTA
Κ039AGREEK CAPITAL LETTER KAPPA
Λ039BGREEK CAPITAL LETTER LAMDA
Μ039CGREEK CAPITAL LETTER MU
Ν039DGREEK CAPITAL LETTER NU
Ξ039EGREEK CAPITAL LETTER XI
Ο039FGREEK CAPITAL LETTER OMICRON
Π03A0GREEK CAPITAL LETTER PI
Ρ03A1GREEK CAPITAL LETTER RHO
Σ03A3GREEK CAPITAL LETTER SIGMA
Τ03A4GREEK CAPITAL LETTER TAU
Υ03A5GREEK CAPITAL LETTER UPSILON
Φ03A6GREEK CAPITAL LETTER PHI
Χ03A7GREEK CAPITAL LETTER CHI
Ψ03A8GREEK CAPITAL LETTER PSI
Ω03A9GREEK CAPITAL LETTER OMEGA
2664WHITE SPADE SUIT
2660BLACK SPADE SUIT
2662WHITE DIAMOND SUIT
2666BLACK DIAMOND SUIT
2661WHITE HEART SUIT
2665BLACK HEART SUIT
2667WHITE CLUB SUIT
2663BLACK CLUB SUIT
α03B1GREEK SMALL LETTER ALPHA
β03B2GREEK SMALL LETTER BETA
γ03B3GREEK SMALL LETTER GAMMA
δ03B4GREEK SMALL LETTER DELTA
ε03B5GREEK SMALL LETTER EPSILON
ζ03B6GREEK SMALL LETTER ZETA
η03B7GREEK SMALL LETTER ETA
θ03B8GREEK SMALL LETTER THETA
ι03B9GREEK SMALL LETTER IOTA
κ03BAGREEK SMALL LETTER KAPPA
λ03BBGREEK SMALL LETTER LAMDA
μ03BCGREEK SMALL LETTER MU
ν03BDGREEK SMALL LETTER NU
ξ03BEGREEK SMALL LETTER XI
ο03BFGREEK SMALL LETTER OMICRON
π03C0GREEK SMALL LETTER PI
ρ03C1GREEK SMALL LETTER RHO
σ03C3GREEK SMALL LETTER SIGMA
τ03C4GREEK SMALL LETTER TAU
υ03C5GREEK SMALL LETTER UPSILON
φ03C6GREEK SMALL LETTER PHI
χ03C7GREEK SMALL LETTER CHI
ψ03C8GREEK SMALL LETTER PSI
ω03C9GREEK SMALL LETTER OMEGA
ς03C2GREEK SMALL LETTER FINAL SIGMA
24F5DOUBLE CIRCLED DIGIT ONE
24F6DOUBLE CIRCLED DIGIT TWO
24F7DOUBLE CIRCLED DIGIT THREE
24F8DOUBLE CIRCLED DIGIT FOUR
24F9DOUBLE CIRCLED DIGIT FIVE
24FADOUBLE CIRCLED DIGIT SIX
24FBDOUBLE CIRCLED DIGIT SEVEN
24FCDOUBLE CIRCLED DIGIT EIGHT
24FDDOUBLE CIRCLED DIGIT NINE
24FEDOUBLE CIRCLED NUMBER TEN
2616WHITE SHOGI PIECE
2617BLACK SHOGI PIECE
3020POSTAL MARK FACE
260EBLACK TELEPHONE
2600BLACK SUN WITH RAYS
2601CLOUD
2602UMBRELLA
2603SNOWMAN
2668HOT SPRINGS
25B1WHITE PARALLELOGRAM
А0410CYRILLIC CAPITAL LETTER A
Б0411CYRILLIC CAPITAL LETTER BE
В0412CYRILLIC CAPITAL LETTER VE
Г0413CYRILLIC CAPITAL LETTER GHE
Д0414CYRILLIC CAPITAL LETTER DE
Е0415CYRILLIC CAPITAL LETTER IE
Ё0401CYRILLIC CAPITAL LETTER IO
Ж0416CYRILLIC CAPITAL LETTER ZHE
З0417CYRILLIC CAPITAL LETTER ZE
И0418CYRILLIC CAPITAL LETTER I
Й0419CYRILLIC CAPITAL LETTER SHORT I
К041ACYRILLIC CAPITAL LETTER KA
Л041BCYRILLIC CAPITAL LETTER EL
М041CCYRILLIC CAPITAL LETTER EM
Н041DCYRILLIC CAPITAL LETTER EN
О041ECYRILLIC CAPITAL LETTER O
П041FCYRILLIC CAPITAL LETTER PE
Р0420CYRILLIC CAPITAL LETTER ER
С0421CYRILLIC CAPITAL LETTER ES
Т0422CYRILLIC CAPITAL LETTER TE
У0423CYRILLIC CAPITAL LETTER U
Ф0424CYRILLIC CAPITAL LETTER EF
Х0425CYRILLIC CAPITAL LETTER HA
Ц0426CYRILLIC CAPITAL LETTER TSE
Ч0427CYRILLIC CAPITAL LETTER CHE
Ш0428CYRILLIC CAPITAL LETTER SHA
Щ0429CYRILLIC CAPITAL LETTER SHCHA
Ъ042ACYRILLIC CAPITAL LETTER HARD SIGN
Ы042BCYRILLIC CAPITAL LETTER YERU
Ь042CCYRILLIC CAPITAL LETTER SOFT SIGN
Э042DCYRILLIC CAPITAL LETTER E
Ю042ECYRILLIC CAPITAL LETTER YU
Я042FCYRILLIC CAPITAL LETTER YA
а0430CYRILLIC SMALL LETTER A
б0431CYRILLIC SMALL LETTER BE
в0432CYRILLIC SMALL LETTER VE
г0433CYRILLIC SMALL LETTER GHE
д0434CYRILLIC SMALL LETTER DE
е0435CYRILLIC SMALL LETTER IE
ё0451CYRILLIC SMALL LETTER IO
ж0436CYRILLIC SMALL LETTER ZHE
з0437CYRILLIC SMALL LETTER ZE
и0438CYRILLIC SMALL LETTER I
й0439CYRILLIC SMALL LETTER SHORT I
к043ACYRILLIC SMALL LETTER KA
л043BCYRILLIC SMALL LETTER EL
м043CCYRILLIC SMALL LETTER EM
н043DCYRILLIC SMALL LETTER EN
о043ECYRILLIC SMALL LETTER O
п043FCYRILLIC SMALL LETTER PE
р0440CYRILLIC SMALL LETTER ER
с0441CYRILLIC SMALL LETTER ES
т0442CYRILLIC SMALL LETTER TE
у0443CYRILLIC SMALL LETTER U
ф0444CYRILLIC SMALL LETTER EF
х0445CYRILLIC SMALL LETTER HA
ц0446CYRILLIC SMALL LETTER TSE
ч0447CYRILLIC SMALL LETTER CHE
ш0448CYRILLIC SMALL LETTER SHA
щ0449CYRILLIC SMALL LETTER SHCHA
ъ044ACYRILLIC SMALL LETTER HARD SIGN
ы044BCYRILLIC SMALL LETTER YERU
ь044CCYRILLIC SMALL LETTER SOFT SIGN
э044DCYRILLIC SMALL LETTER E
ю044ECYRILLIC SMALL LETTER YU
я044FCYRILLIC SMALL LETTER YA
2153VULGAR FRACTION ONE THIRD
2154VULGAR FRACTION TWO THIRDS
2155VULGAR FRACTION ONE FIFTH
2713CHECK MARK
2318PLACE OF INTEREST SIGN
2423OPEN BOX
23CERETURN SYMBOL
3251CIRCLED NUMBER TWENTY ONE
3252CIRCLED NUMBER TWENTY TWO
3253CIRCLED NUMBER TWENTY THREE
3254CIRCLED NUMBER TWENTY FOUR
3255CIRCLED NUMBER TWENTY FIVE
3256CIRCLED NUMBER TWENTY SIX
3257CIRCLED NUMBER TWENTY SEVEN
3258CIRCLED NUMBER TWENTY EIGHT
3259CIRCLED NUMBER TWENTY NINE
325ACIRCLED NUMBER THIRTY
325BCIRCLED NUMBER THIRTY ONE
325CCIRCLED NUMBER THIRTY TWO
325DCIRCLED NUMBER THIRTY THREE
325ECIRCLED NUMBER THIRTY FOUR
325FCIRCLED NUMBER THIRTY FIVE
32B1CIRCLED NUMBER THIRTY SIX
32B2CIRCLED NUMBER THIRTY SEVEN
32B3CIRCLED NUMBER THIRTY EIGHT
32B4CIRCLED NUMBER THIRTY NINE
32B5CIRCLED NUMBER FORTY
32B6CIRCLED NUMBER FORTY ONE
32B7CIRCLED NUMBER FORTY TWO
32B8CIRCLED NUMBER FORTY THREE
32B9CIRCLED NUMBER FORTY FOUR
32BACIRCLED NUMBER FORTY FIVE
32BBCIRCLED NUMBER FORTY SIX
32BCCIRCLED NUMBER FORTY SEVEN
32BDCIRCLED NUMBER FORTY EIGHT
32BECIRCLED NUMBER FORTY NINE
32BFCIRCLED NUMBER FIFTY
25D0CIRCLE WITH LEFT HALF BLACK
25D1CIRCLE WITH RIGHT HALF BLACK
25D2CIRCLE WITH LOWER HALF BLACK
25D3CIRCLE WITH UPPER HALF BLACK
©00A9COPYRIGHT SIGN
®00AEREGISTERED SIGN
¼00BCVULGAR FRACTION ONE QUARTER
½00BDVULGAR FRACTION ONE HALF
¾00BEVULGAR FRACTION THREE QUARTERS
2776DINGBAT NEGATIVE CIRCLED DIGIT ONE
2777DINGBAT NEGATIVE CIRCLED DIGIT TWO
2778DINGBAT NEGATIVE CIRCLED DIGIT THREE
2779DINGBAT NEGATIVE CIRCLED DIGIT FOUR
277ADINGBAT NEGATIVE CIRCLED DIGIT FIVE
277BDINGBAT NEGATIVE CIRCLED DIGIT SIX
277CDINGBAT NEGATIVE CIRCLED DIGIT SEVEN
277DDINGBAT NEGATIVE CIRCLED DIGIT EIGHT
277EDINGBAT NEGATIVE CIRCLED DIGIT NINE
277FDINGBAT NEGATIVE CIRCLED NUMBER TEN
24EBNEGATIVE CIRCLED NUMBER ELEVEN
24ECNEGATIVE CIRCLED NUMBER TWELVE
24EDNEGATIVE CIRCLED NUMBER THIRTEEN
24EENEGATIVE CIRCLED NUMBER FOURTEEN
24EFNEGATIVE CIRCLED NUMBER FIFTEEN
24F0NEGATIVE CIRCLED NUMBER SIXTEEN
24F1NEGATIVE CIRCLED NUMBER SEVENTEEN
24F2NEGATIVE CIRCLED NUMBER EIGHTEEN
24F3NEGATIVE CIRCLED NUMBER NINETEEN
24F4NEGATIVE CIRCLED NUMBER TWENTY
2170SMALL ROMAN NUMERAL ONE
2171SMALL ROMAN NUMERAL TWO
2172SMALL ROMAN NUMERAL THREE
2173SMALL ROMAN NUMERAL FOUR
2174SMALL ROMAN NUMERAL FIVE
2175SMALL ROMAN NUMERAL SIX
2176SMALL ROMAN NUMERAL SEVEN
2177SMALL ROMAN NUMERAL EIGHT
2178SMALL ROMAN NUMERAL NINE
2179SMALL ROMAN NUMERAL TEN
217ASMALL ROMAN NUMERAL ELEVEN
217BSMALL ROMAN NUMERAL TWELVE
24D0CIRCLED LATIN SMALL LETTER A
24D1CIRCLED LATIN SMALL LETTER B
24D2CIRCLED LATIN SMALL LETTER C
24D3CIRCLED LATIN SMALL LETTER D
24D4CIRCLED LATIN SMALL LETTER E
24D5CIRCLED LATIN SMALL LETTER F
24D6CIRCLED LATIN SMALL LETTER G
24D7CIRCLED LATIN SMALL LETTER H
24D8CIRCLED LATIN SMALL LETTER I
24D9CIRCLED LATIN SMALL LETTER J
24DACIRCLED LATIN SMALL LETTER K
24DBCIRCLED LATIN SMALL LETTER L
24DCCIRCLED LATIN SMALL LETTER M
24DDCIRCLED LATIN SMALL LETTER N
24DECIRCLED LATIN SMALL LETTER O
24DFCIRCLED LATIN SMALL LETTER P
24E0CIRCLED LATIN SMALL LETTER Q
24E1CIRCLED LATIN SMALL LETTER R
24E2CIRCLED LATIN SMALL LETTER S
24E3CIRCLED LATIN SMALL LETTER T
24E4CIRCLED LATIN SMALL LETTER U
24E5CIRCLED LATIN SMALL LETTER V
24E6CIRCLED LATIN SMALL LETTER W
24E7CIRCLED LATIN SMALL LETTER X
24E8CIRCLED LATIN SMALL LETTER Y
24E9CIRCLED LATIN SMALL LETTER Z
32D0CIRCLED KATAKANA A
32D1CIRCLED KATAKANA I
32D2CIRCLED KATAKANA U
32D3CIRCLED KATAKANA E
32D4CIRCLED KATAKANA O
32D5CIRCLED KATAKANA KA
32D6CIRCLED KATAKANA KI
32D7CIRCLED KATAKANA KU
32D8CIRCLED KATAKANA KE
32D9CIRCLED KATAKANA KO
32DACIRCLED KATAKANA SA
32DBCIRCLED KATAKANA SI
32DCCIRCLED KATAKANA SU
32DDCIRCLED KATAKANA SE
32DECIRCLED KATAKANA SO
32DFCIRCLED KATAKANA TA
32E0CIRCLED KATAKANA TI
32E1CIRCLED KATAKANA TU
32E2CIRCLED KATAKANA TE
32E3CIRCLED KATAKANA TO
32FACIRCLED KATAKANA RO
32E9CIRCLED KATAKANA HA
32E5CIRCLED KATAKANA NI
32EDCIRCLED KATAKANA HO
32ECCIRCLED KATAKANA HE
2051TWO ASTERISKS ALIGNED VERTICALLY
2042ASTERISM
2460CIRCLED DIGIT ONE
2461CIRCLED DIGIT TWO
2462CIRCLED DIGIT THREE
2463CIRCLED DIGIT FOUR
2464CIRCLED DIGIT FIVE
2465CIRCLED DIGIT SIX
2466CIRCLED DIGIT SEVEN
2467CIRCLED DIGIT EIGHT
2468CIRCLED DIGIT NINE
2469CIRCLED NUMBER TEN
246ACIRCLED NUMBER ELEVEN
246BCIRCLED NUMBER TWELVE
246CCIRCLED NUMBER THIRTEEN
246DCIRCLED NUMBER FOURTEEN
246ECIRCLED NUMBER FIFTEEN
246FCIRCLED NUMBER SIXTEEN
2470CIRCLED NUMBER SEVENTEEN
2471CIRCLED NUMBER EIGHTEEN
2472CIRCLED NUMBER NINETEEN
2473CIRCLED NUMBER TWENTY
2160ROMAN NUMERAL ONE
2161ROMAN NUMERAL TWO
2162ROMAN NUMERAL THREE
2163ROMAN NUMERAL FOUR
2164ROMAN NUMERAL FIVE
2165ROMAN NUMERAL SIX
2166ROMAN NUMERAL SEVEN
2167ROMAN NUMERAL EIGHT
2168ROMAN NUMERAL NINE
2169ROMAN NUMERAL TEN
216AROMAN NUMERAL ELEVEN
3349SQUARE MIRI
3314SQUARE KIRO
3322SQUARE SENTI
334DSQUARE MEETORU
3318SQUARE GURAMU
3327SQUARE TON
3303SQUARE AARU
3336SQUARE HEKUTAARU
3351SQUARE RITTORU
3357SQUARE WATTO
330DSQUARE KARORII
3326SQUARE DORU
3323SQUARE SENTO
332BSQUARE PAASENTO
334ASQUARE MIRIBAARU
333BSQUARE PEEZI
339CSQUARE MM
339DSQUARE CM
339ESQUARE KM
338ESQUARE MG
338FSQUARE KG
33C4SQUARE CC
33A1SQUARE M SQUARED
216BROMAN NUMERAL TWELVE
337BSQUARE ERA NAME HEISEI
2116NUMERO SIGN
33CDSQUARE KK
2121TELEPHONE SIGN
32A4CIRCLED IDEOGRAPH HIGH
32A5CIRCLED IDEOGRAPH CENTRE
32A6CIRCLED IDEOGRAPH LOW
32A7CIRCLED IDEOGRAPH LEFT
32A8CIRCLED IDEOGRAPH RIGHT
3231PARENTHESIZED IDEOGRAPH STOCK
3232PARENTHESIZED IDEOGRAPH HAVE
3239PARENTHESIZED IDEOGRAPH REPRESENT
337ESQUARE ERA NAME MEIZI
337DSQUARE ERA NAME TAISYOU
337CSQUARE ERA NAME SYOUWA
221FRIGHT ANGLE
22BFRIGHT TRIANGLE
2756BLACK DIAMOND MINUS WHITE X
261EWHITE RIGHT POINTING INDEX

A.20 Characters as reference marks (cl-20)

Any character may participate in reference marks.

A.21 Ornamented character complexes (cl-21)

Any character may participate in ornamented character complex.

A.22 Simple-ruby character complexes (cl-22)

Any character may participate in simple-ruby character complex.

A.23 Jukugo-ruby character complexes (cl-23)

Any character may participate in jukugo-ruby character complex.

A.24 Grouped numerals (cl-24)

CharacterUCSNameRemark
00030DIGIT ZEROhalf-width
10031DIGIT ONEhalf-width
20032DIGIT TWOhalf-width
30033DIGIT THREEhalf-width
40034DIGIT FOURhalf-width
50035DIGIT FIVEhalf-width
60036DIGIT SIXhalf-width
70037DIGIT SEVENhalf-width
80038DIGIT EIGHThalf-width
90039DIGIT NINEhalf-width
0020SPACEquarter em width
,002CCOMMAquarter em width or half-width
.002EFULL STOPdecimal point
quarter em width or half-width

A.25 Unit symbols (cl-25)

CharacterUCSNameRemark
0020SPACEquarter em width
30FBKATAKANA MIDDLE DOThalf-width
/002FSOLIDUSone third em width, half-width or proportional
(0028LEFT PARENTHESIS
)0029RIGHT PARENTHESIS
2212MINUS SIGN
Å212BANGSTROM SIGNproportional
10031DIGIT ONEhalf-width or proportional
20032DIGIT TWOhalf-width or proportional
30033DIGIT THREEhalf-width or proportional
40034DIGIT FOURhalf-width or proportional
A0041LATIN CAPITAL LETTER Aproportional
B0042LATIN CAPITAL LETTER Bproportional
C0043LATIN CAPITAL LETTER Cproportional
D0044LATIN CAPITAL LETTER Dproportional
E0045LATIN CAPITAL LETTER Eproportional
F0046LATIN CAPITAL LETTER Fproportional
G0047LATIN CAPITAL LETTER Gproportional
H0048LATIN CAPITAL LETTER Hproportional
I0049LATIN CAPITAL LETTER Iproportional
J004ALATIN CAPITAL LETTER Jproportional
K004BLATIN CAPITAL LETTER Kproportional
L004CLATIN CAPITAL LETTER Lproportional
M004DLATIN CAPITAL LETTER Mproportional
N004ELATIN CAPITAL LETTER Nproportional
O004FLATIN CAPITAL LETTER Oproportional
P0050LATIN CAPITAL LETTER Pproportional
Q0051LATIN CAPITAL LETTER Qproportional
R0052LATIN CAPITAL LETTER Rproportional
S0053LATIN CAPITAL LETTER Sproportional
T0054LATIN CAPITAL LETTER Tproportional
U0055LATIN CAPITAL LETTER Uproportional
V0056LATIN CAPITAL LETTER Vproportional
W0057LATIN CAPITAL LETTER Wproportional
X0058LATIN CAPITAL LETTER Xproportional
Y0059LATIN CAPITAL LETTER Yproportional
Z005ALATIN CAPITAL LETTER Zproportional
2127INVERTED OHM SIGNproportional
a0061LATIN SMALL LETTER Aproportional
b0062LATIN SMALL LETTER Bproportional
c0063LATIN SMALL LETTER Cproportional
d0064LATIN SMALL LETTER Dproportional
e0065LATIN SMALL LETTER Eproportional
f0066LATIN SMALL LETTER Fproportional
g0067LATIN SMALL LETTER Gproportional
h0068LATIN SMALL LETTER Hproportional
i0069LATIN SMALL LETTER Iproportional
j006ALATIN SMALL LETTER Jproportional
k006BLATIN SMALL LETTER Kproportional
l006CLATIN SMALL LETTER Lproportional
m006DLATIN SMALL LETTER Mproportional
n006ELATIN SMALL LETTER Nproportional
o006FLATIN SMALL LETTER Oproportional
p0070LATIN SMALL LETTER Pproportional
q0071LATIN SMALL LETTER Qproportional
r0072LATIN SMALL LETTER Rproportional
s0073LATIN SMALL LETTER Sproportional
t0074LATIN SMALL LETTER Tproportional
u0075LATIN SMALL LETTER Uproportional
v0076LATIN SMALL LETTER Vproportional
w0077LATIN SMALL LETTER Wproportional
x0078LATIN SMALL LETTER Xproportional
y0079LATIN SMALL LETTER Yproportional
z007ALATIN SMALL LETTER Zproportional
Ω03A9GREEK CAPITAL LETTER OMEGAproportional
μ03BCGREEK SMALL LETTER MUproportional

A.26 Western word space (cl-26)

CharacterUCSNameRemark
0020SPACE

A.27 Western characters (cl-27)

CharacterUCSNameRemark
,002CCOMMAproportional
.002EFULL STOPproportional
:003ACOLONproportional
;003BSEMICOLONproportional
?003FQUESTION MARKproportional
!0021EXCLAMATION MARKproportional
´00B4ACUTE ACCENTproportional
`0060GRAVE ACCENTproportional
¨00A8DIAERESISproportional
^005ECIRCUMFLEX ACCENTproportional
203EOVERLINEproportional
_005FLOW LINEproportional
2014EM DASHproportional
2010HYPHENproportional
/002FSOLIDUSproportional
\005CREVERSE SOLIDUSproportional
2016DOUBLE VERTICAL LINEproportional
|007CVERTICAL LINEproportional
2026HORIZONTAL ELLIPSISproportional
2025TWO DOT LEADERproportional
2018LEFT SINGLE QUOTATION MARKproportional
2019RIGHT SINGLE QUOTATION MARKproportional
201CLEFT DOUBLE QUOTATION MARKproportional
201DRIGHT DOUBLE QUOTATION MARKproportional
(0028LEFT PARENTHESISproportional
)0029RIGHT PARENTHESISproportional
[005BLEFT SQUARE BRACKETproportional
]005DRIGHT SQUARE BRACKETproportional
{007BLEFT CURLY BRACKETproportional
}007DRIGHT CURLY BRACKETproportional
+002BPLUS SIGNproportional
2212MINUS SIGNproportional
±00B1PLUS-MINUS SIGNproportional
×00D7MULTIPLICATION SIGNproportional
÷00F7DIVISION SIGNproportional
=003DEQUALS SIGNproportional
2260NOT EQUAL TOproportional
<003CLESS-THAN SIGNproportional
>003EGREATER-THAN SIGNproportional
2266LESS-THAN OVER EQUAL TOproportional
2267GREATER-THAN OVER EQUAL TOproportional
2234THEREFOREproportional
2642MALE SIGNproportional
2640FEMALE SIGNproportional
°00B0DEGREE SIGNproportional
2032PRIMEproportional
2033DOUBLE PRIMEproportional
¥00A5YEN SIGNproportional
$0024DOLLAR SIGNproportional
¢00A2CENT SIGNproportional
£00A3POUND SIGNproportional
%0025PERCENT SIGNproportional
#0023NUMBER SIGNproportional
&0026AMPERSANDproportional
*002AASTERISKproportional
@0040COMMERCIAL ATproportional
§00A7SECTION SIGNproportional
2606WHITE STARproportional
2605BLACK STARproportional
25CBWHITE CIRCLEproportional
25CFBLACK CIRCLEproportional
25CEBULLSEYEproportional
25C7WHITE DIAMONDproportional
25C6BLACK DIAMONDproportional
25A1WHITE SQUAREproportional
25A0BLACK SQUAREproportional
25B3WHITE UP-POINTING TRIANGLEproportional
25B2BLACK UP-POINTING TRIANGLEproportional
25BDWHITE DOWN-POINTING TRIANGLEproportional
25BCBLACK DOWN-POINTING TRIANGLEproportional
2192RIGHTWARDS ARROWproportional
2190LEFTWARDS ARROWproportional
2191UPWARDS ARROWproportional
2193DOWNWARDS ARROWproportional
'0027APOSTROPHEproportional
"0022QUOTATION MARKproportional
-002DHYPHEN-MINUSproportional
~007ETILDEproportional
221EINFINITYproportional
2208ELEMENT OFproportional
220BCONTAINS AS MEMBERproportional
2286SUBSET OF OR EQUAL TOproportional
2287SUPERSET OF OR EQUAL TOproportional
2282SUBSET OFproportional
2283SUPERSET OFproportional
222AUNIONproportional
2229INTERSECTIONproportional
2284NOT A SUBSET OFproportional
2285NOT A SUPERSET OFproportional
228ASUBSET OF WITH NOT EQUAL TOproportional
228BSUPERSET OF WITH NOT EQUAL TOproportional
2209NOT AN ELEMENT OFproportional
2205EMPTY SETproportional
2305PROJECTIVEproportional
2306PERSPECTIVEproportional
2227LOGICAL ANDproportional
2228LOGICAL ORproportional
¬00ACNOT SIGNproportional
21D2RIGHTWARDS DOUBLE ARROWproportional
21D4LEFT RIGHT DOUBLE ARROWproportional
2200FOR ALLproportional
2203THERE EXISTSproportional
2295CIRCLED PLUSproportional
2296CIRCLED MINUSproportional
2297CIRCLED TIMESproportional
2225PARALLEL TOproportional
2226NOT PARALLEL TOproportional
2220ANGLEproportional
22A5UP TACKproportional
2312ARCproportional
2202PARTIAL DIFFERENTIALproportional
2207NABLAproportional
2261IDENTICAL TOproportional
2252APPROXIMATELY EQUAL TO OR THE IMAGE OFproportional
226AMUCH LESS-THANproportional
226BMUCH GREATER-THANproportional
221ASQUARE ROOTproportional
223DREVERSED TILDE (lazy S)proportional
221DPROPORTIONAL TOproportional
2235BECAUSEproportional
222BINTEGRALproportional
222CDOUBLE INTEGRALproportional
2262NOT IDENTICAL TOproportional
2243ASYMPTOTICALLY EQUAL TOproportional
2245APPROXIMATELY EQUAL TOproportional
2248ALMOST EQUAL TOproportional
2276LESS-THAN OR GREATER-THANproportional
2277GREATER-THAN OR LESS-THANproportional
2194LEFT RIGHT ARROWproportional
Å212BANGSTROM SIGNproportional
2030PER MILLE SIGNproportional
266FMUSIC SHARP SIGNproportional
266DMUSIC FLAT SIGNproportional
266AEIGHTH NOTEproportional
2020DAGGERproportional
2021DOUBLE DAGGERproportional
00B6PILCROW SIGNproportional
266EMUSIC NATURAL SIGNproportional
266BBEAMED EIGHTH NOTESproportional
266CBEAMED SIXTEENTH NOTESproportional
2669QUARTER NOTEproportional
25EFLARGE CIRCLEproportional
25B7WHITE RIGHT-POINTING TRIANGLEproportional
25B6BLACK RIGHT-POINTING TRIANGLEproportional
25C1WHITE LEFT-POINTING TRIANGLEproportional
25C0BLACK LEFT-POINTING TRIANGLEproportional
2197NORTH EAST ARROWproportional
2198SOUTH EAST ARROWproportional
2196NORTH WEST ARROWproportional
2199SOUTH WEST ARROWproportional
21C4RIGHTWARDS ARROW OVER LEFTWARDS ARROWproportional
21E8RIGHTWARDS WHITE ARROWproportional
21E6LEFTWARDS WHITE ARROWproportional
21E7UPWARDS WHITE ARROWproportional
21E9DOWNWARDS WHITE ARROWproportional
2934ARROW POINTING RIGHTWARDS THEN CURVING UPWARDSproportional
2935ARROW POINTING RIGHTWARDS THEN CURVING DOWNWARDSproportional
00030DIGIT ZEROproportional
10031DIGIT ONEproportional
20032DIGIT TWOproportional
30033DIGIT THREEproportional
40034DIGIT FOURproportional
50035DIGIT FIVEproportional
60036DIGIT SIXproportional
70037DIGIT SEVENproportional
80038DIGIT EIGHTproportional
90039DIGIT NINEproportional
25E6WHITE BULLETproportional
2022BULLETproportional
A0041LATIN CAPITAL LETTER Aproportional
B0042LATIN CAPITAL LETTER Bproportional
C0043LATIN CAPITAL LETTER Cproportional
D0044LATIN CAPITAL LETTER Dproportional
E0045LATIN CAPITAL LETTER Eproportional
F0046LATIN CAPITAL LETTER Fproportional
G0047LATIN CAPITAL LETTER Gproportional
H0048LATIN CAPITAL LETTER Hproportional
I0049LATIN CAPITAL LETTER Iproportional
J004ALATIN CAPITAL LETTER Jproportional
K004BLATIN CAPITAL LETTER Kproportional
L004CLATIN CAPITAL LETTER Lproportional
M004DLATIN CAPITAL LETTER Mproportional
N004ELATIN CAPITAL LETTER Nproportional
O004FLATIN CAPITAL LETTER Oproportional
P0050LATIN CAPITAL LETTER Pproportional
Q0051LATIN CAPITAL LETTER Qproportional
R0052LATIN CAPITAL LETTER Rproportional
S0053LATIN CAPITAL LETTER Sproportional
T0054LATIN CAPITAL LETTER Tproportional
U0055LATIN CAPITAL LETTER Uproportional
V0056LATIN CAPITAL LETTER Vproportional
W0057LATIN CAPITAL LETTER Wproportional
X0058LATIN CAPITAL LETTER Xproportional
Y0059LATIN CAPITAL LETTER Yproportional
Z005ALATIN CAPITAL LETTER Zproportional
2213MINUS-OR-PLUS SIGNproportional
2135ALEF SYMBOLproportional
210FPLANCK CONSTANT OVER TWO PIproportional
2127INVERTED OHM SIGNproportional
a0061LATIN SMALL LETTER Aproportional
b0062LATIN SMALL LETTER Bproportional
c0063LATIN SMALL LETTER Cproportional
d0064LATIN SMALL LETTER Dproportional
e0065LATIN SMALL LETTER Eproportional
f0066LATIN SMALL LETTER Fproportional
g0067LATIN SMALL LETTER Gproportional
h0068LATIN SMALL LETTER Hproportional
i0069LATIN SMALL LETTER Iproportional
j006ALATIN SMALL LETTER Jproportional
k006BLATIN SMALL LETTER Kproportional
l006CLATIN SMALL LETTER Lproportional
m006DLATIN SMALL LETTER Mproportional
n006ELATIN SMALL LETTER Nproportional
o006FLATIN SMALL LETTER Oproportional
p0070LATIN SMALL LETTER Pproportional
q0071LATIN SMALL LETTER Qproportional
r0072LATIN SMALL LETTER Rproportional
s0073LATIN SMALL LETTER Sproportional
t0074LATIN SMALL LETTER Tproportional
u0075LATIN SMALL LETTER Uproportional
v0076LATIN SMALL LETTER Vproportional
w0077LATIN SMALL LETTER Wproportional
x0078LATIN SMALL LETTER Xproportional
y0079LATIN SMALL LETTER Yproportional
z007ALATIN SMALL LETTER Zproportional
2013EN DASHproportional
29FADOUBLE PLUSproportional
29FBTRIPLE PLUSproportional
Α0391GREEK CAPITAL LETTER ALPHAproportional
Β0392GREEK CAPITAL LETTER BETAproportional
Γ0393GREEK CAPITAL LETTER GAMMAproportional
Δ0394GREEK CAPITAL LETTER DELTAproportional
Ε0395GREEK CAPITAL LETTER EPSILONproportional
Ζ0396GREEK CAPITAL LETTER ZETAproportional
Η0397GREEK CAPITAL LETTER ETAproportional
Θ0398GREEK CAPITAL LETTER THETAproportional
Ι0399GREEK CAPITAL LETTER IOTAproportional
Κ039AGREEK CAPITAL LETTER KAPPAproportional
Λ039BGREEK CAPITAL LETTER LAMDAproportional
Μ039CGREEK CAPITAL LETTER MUproportional
Ν039DGREEK CAPITAL LETTER NUproportional
Ξ039EGREEK CAPITAL LETTER XIproportional
Ο039FGREEK CAPITAL LETTER OMICRONproportional
Π03A0GREEK CAPITAL LETTER PIproportional
Ρ03A1GREEK CAPITAL LETTER RHOproportional
Σ03A3GREEK CAPITAL LETTER SIGMAproportional
Τ03A4GREEK CAPITAL LETTER TAUproportional
Υ03A5GREEK CAPITAL LETTER UPSILONproportional
Φ03A6GREEK CAPITAL LETTER PHIproportional
Χ03A7GREEK CAPITAL LETTER CHIproportional
Ψ03A8GREEK CAPITAL LETTER PSIproportional
Ω03A9GREEK CAPITAL LETTER OMEGAproportional
2664WHITE SPADE SUITproportional
2660BLACK SPADE SUITproportional
2662WHITE DIAMOND SUITproportional
2666BLACK DIAMOND SUITproportional
2661WHITE HEART SUITproportional
2665BLACK HEART SUITproportional
2667WHITE CLUB SUITproportional
2663BLACK CLUB SUITproportional
α03B1GREEK SMALL LETTER ALPHAproportional
β03B2GREEK SMALL LETTER BETAproportional
γ03B3GREEK SMALL LETTER GAMMAproportional
δ03B4GREEK SMALL LETTER DELTAproportional
ε03B5GREEK SMALL LETTER EPSILONproportional
ζ03B6GREEK SMALL LETTER ZETAproportional
η03B7GREEK SMALL LETTER ETAproportional
θ03B8GREEK SMALL LETTER THETAproportional
ι03B9GREEK SMALL LETTER IOTAproportional
κ03BAGREEK SMALL LETTER KAPPAproportional
λ03BBGREEK SMALL LETTER LAMDAproportional
μ03BCGREEK SMALL LETTER MUproportional
ν03BDGREEK SMALL LETTER NUproportional
ξ03BEGREEK SMALL LETTER XIproportional
ο03BFGREEK SMALL LETTER OMICRONproportional
π03C0GREEK SMALL LETTER PIproportional
ρ03C1GREEK SMALL LETTER RHOproportional
σ03C3GREEK SMALL LETTER SIGMAproportional
τ03C4GREEK SMALL LETTER TAUproportional
υ03C5GREEK SMALL LETTER UPSILONproportional
φ03C6GREEK SMALL LETTER PHIproportional
χ03C7GREEK SMALL LETTER CHIproportional
ψ03C8GREEK SMALL LETTER PSIproportional
ω03C9GREEK SMALL LETTER OMEGAproportional
ς03C2GREEK SMALL LETTER FINAL SIGMAproportional
24F5DOUBLE CIRCLED DIGIT ONEproportional
24F6DOUBLE CIRCLED DIGIT TWOproportional
24F7DOUBLE CIRCLED DIGIT THREEproportional
24F8DOUBLE CIRCLED DIGIT FOURproportional
24F9DOUBLE CIRCLED DIGIT FIVEproportional
24FADOUBLE CIRCLED DIGIT SIXproportional
24FBDOUBLE CIRCLED DIGIT SEVENproportional
24FCDOUBLE CIRCLED DIGIT EIGHTproportional
24FDDOUBLE CIRCLED DIGIT NINEproportional
24FEDOUBLE CIRCLED NUMBER TENproportional
260EBLACK TELEPHONEproportional
2600BLACK SUN WITH RAYSproportional
2601CLOUDproportional
2602UMBRELLAproportional
2603SNOWMANproportional
25B1WHITE PARALLELOGRAMproportional
А0410CYRILLIC CAPITAL LETTER Aproportional
Б0411CYRILLIC CAPITAL LETTER BEproportional
В0412CYRILLIC CAPITAL LETTER VEproportional
Г0413CYRILLIC CAPITAL LETTER GHEproportional
Д0414CYRILLIC CAPITAL LETTER DEproportional
Е0415CYRILLIC CAPITAL LETTER IEproportional
Ё0401CYRILLIC CAPITAL LETTER IOproportional
Ж0416CYRILLIC CAPITAL LETTER ZHEproportional
З0417CYRILLIC CAPITAL LETTER ZEproportional
И0418CYRILLIC CAPITAL LETTER Iproportional
Й0419CYRILLIC CAPITAL LETTER SHORT Iproportional
К041ACYRILLIC CAPITAL LETTER KAproportional
Л041BCYRILLIC CAPITAL LETTER ELproportional
М041CCYRILLIC CAPITAL LETTER EMproportional
Н041DCYRILLIC CAPITAL LETTER ENproportional
О041ECYRILLIC CAPITAL LETTER Oproportional
П041FCYRILLIC CAPITAL LETTER PEproportional
Р0420CYRILLIC CAPITAL LETTER ERproportional
С0421CYRILLIC CAPITAL LETTER ESproportional
Т0422CYRILLIC CAPITAL LETTER TEproportional
У0423CYRILLIC CAPITAL LETTER Uproportional
Ф0424CYRILLIC CAPITAL LETTER EFproportional
Х0425CYRILLIC CAPITAL LETTER HAproportional
Ц0426CYRILLIC CAPITAL LETTER TSEproportional
Ч0427CYRILLIC CAPITAL LETTER CHEproportional
Ш0428CYRILLIC CAPITAL LETTER SHAproportional
Щ0429CYRILLIC CAPITAL LETTER SHCHAproportional
Ъ042ACYRILLIC CAPITAL LETTER HARD SIGNproportional
Ы042BCYRILLIC CAPITAL LETTER YERUproportional
Ь042CCYRILLIC CAPITAL LETTER SOFT SIGNproportional
Э042DCYRILLIC CAPITAL LETTER Eproportional
Ю042ECYRILLIC CAPITAL LETTER YUproportional
Я042FCYRILLIC CAPITAL LETTER YAproportional
а0430CYRILLIC SMALL LETTER Aproportional
б0431CYRILLIC SMALL LETTER BEproportional
в0432CYRILLIC SMALL LETTER VEproportional
г0433CYRILLIC SMALL LETTER GHEproportional
д0434CYRILLIC SMALL LETTER DEproportional
е0435CYRILLIC SMALL LETTER IEproportional
ё0451CYRILLIC SMALL LETTER IOproportional
ж0436CYRILLIC SMALL LETTER ZHEproportional
з0437CYRILLIC SMALL LETTER ZEproportional
и0438CYRILLIC SMALL LETTER Iproportional
й0439CYRILLIC SMALL LETTER SHORT Iproportional
к043ACYRILLIC SMALL LETTER KAproportional
л043BCYRILLIC SMALL LETTER ELproportional
м043CCYRILLIC SMALL LETTER EMproportional
н043DCYRILLIC SMALL LETTER ENproportional
о043ECYRILLIC SMALL LETTER Oproportional
п043FCYRILLIC SMALL LETTER PEproportional
р0440CYRILLIC SMALL LETTER ERproportional
с0441CYRILLIC SMALL LETTER ESproportional
т0442CYRILLIC SMALL LETTER TEproportional
у0443CYRILLIC SMALL LETTER Uproportional
ф0444CYRILLIC SMALL LETTER EFproportional
х0445CYRILLIC SMALL LETTER HAproportional
ц0446CYRILLIC SMALL LETTER TSEproportional
ч0447CYRILLIC SMALL LETTER CHEproportional
ш0448CYRILLIC SMALL LETTER SHAproportional
щ0449CYRILLIC SMALL LETTER SHCHAproportional
ъ044ACYRILLIC SMALL LETTER HARD SIGNproportional
ы044BCYRILLIC SMALL LETTER YERUproportional
ь044CCYRILLIC SMALL LETTER SOFT SIGNproportional
э044DCYRILLIC SMALL LETTER Eproportional
ю044ECYRILLIC SMALL LETTER YUproportional
я044FCYRILLIC SMALL LETTER YAproportional
22DALESS-THAN EQUAL TO OR GREATER-THANproportional
22DBGREATER-THAN EQUAL TO OR LESS-THANproportional
2153VULGAR FRACTION ONE THIRDproportional
2154VULGAR FRACTION TWO THIRDSproportional
2155VULGAR FRACTION ONE FIFTHproportional
2713CHECK MARKproportional
2318PLACE OF INTEREST SIGNproportional
2423OPEN BOXproportional
23CERETURN SYMBOLproportional
3251CIRCLED NUMBER TWENTY ONEproportional
3252CIRCLED NUMBER TWENTY TWOproportional
3253CIRCLED NUMBER TWENTY THREEproportional
3254CIRCLED NUMBER TWENTY FOURproportional
3255CIRCLED NUMBER TWENTY FIVEproportional
3256CIRCLED NUMBER TWENTY SIXproportional
3257CIRCLED NUMBER TWENTY SEVENproportional
3258CIRCLED NUMBER TWENTY EIGHTproportional
3259CIRCLED NUMBER TWENTY NINEproportional
325ACIRCLED NUMBER THIRTYproportional
325BCIRCLED NUMBER THIRTY ONEproportional
325CCIRCLED NUMBER THIRTY TWOproportional
325DCIRCLED NUMBER THIRTY THREEproportional
325ECIRCLED NUMBER THIRTY FOURproportional
325FCIRCLED NUMBER THIRTY FIVEproportional
32B1CIRCLED NUMBER THIRTY SIXproportional
32B2CIRCLED NUMBER THIRTY SEVENproportional
32B3CIRCLED NUMBER THIRTY EIGHTproportional
32B4CIRCLED NUMBER THIRTY NINEproportional
32B5CIRCLED NUMBER FORTYproportional
32B6CIRCLED NUMBER FORTY ONEproportional
32B7CIRCLED NUMBER FORTY TWOproportional
32B8CIRCLED NUMBER FORTY THREEproportional
32B9CIRCLED NUMBER FORTY FOURproportional
32BACIRCLED NUMBER FORTY FIVEproportional
32BBCIRCLED NUMBER FORTY SIXproportional
32BCCIRCLED NUMBER FORTY SEVENproportional
32BDCIRCLED NUMBER FORTY EIGHTproportional
32BECIRCLED NUMBER FORTY NINEproportional
32BFCIRCLED NUMBER FIFTYproportional
25D0CIRCLE WITH LEFT HALF BLACKproportional
25D1CIRCLE WITH RIGHT HALF BLACKproportional
25D2CIRCLE WITH LOWER HALF BLACKproportional
25D3CIRCLE WITH UPPER HALF BLACKproportional
Ǎ01CDLATIN CAPITAL LETTER A WITH CARONproportional
ǎ01CELATIN SMALL LETTER A WITH CARONproportional
ǐ01D0LATIN SMALL LETTER I WITH CARONproportional
1E3ELATIN CAPITAL LETTER M WITH ACUTEproportional
ḿ1E3FLATIN SMALL LETTER M WITH ACUTEproportional
Ǹ01F8LATIN CAPITAL LETTER N WITH GRAVEproportional
ǹ01F9LATIN SMALL LETTER N WITH GRAVEproportional
Ǒ01D1LATIN CAPITAL LETTER O WITH CARONproportional
ǒ01D2LATIN SMALL LETTER O WITH CARONproportional
ǔ01D4LATIN SMALL LETTER U WITH CARONproportional
ǖ01D6LATIN SMALL LETTER U WITH DIAERESIS AND MACRONproportional
ǘ01D8LATIN SMALL LETTER U WITH DIAERESIS AND ACUTEproportional
ǚ01DALATIN SMALL LETTER U WITH DIAERESIS AND CARONproportional
ǜ01DCLATIN SMALL LETTER U WITH DIAERESIS AND GRAVEproportional
20ACEURO SIGNproportional
 00A0NO-BREAK SPACEproportional
¡00A1INVERTED EXCLAMATION MARKproportional
¤00A4CURRENCY SIGNproportional
¦00A6BROKEN BARproportional
©00A9COPYRIGHT SIGNproportional
ª00AAFEMININE ORDINAL INDICATORproportional
«00ABLEFT-POINTING DOUBLE ANGLE QUOTATION MARKproportional
­00ADSOFT HYPHENproportional
®00AEREGISTERED SIGNproportional
¯00AFMACRONproportional
²00B2SUPERSCRIPT TWOproportional
³00B3SUPERSCRIPT THREEproportional
·00B7MIDDLE DOTproportional
¸00B8CEDILLAproportional
¹00B9SUPERSCRIPT ONEproportional
º00BAMASCULINE ORDINAL INDICATORproportional
»00BBRIGHT-POINTING DOUBLE ANGLE QUOTATION MARKproportional
¼00BCVULGAR FRACTION ONE QUARTERproportional
½00BDVULGAR FRACTION ONE HALFproportional
¾00BEVULGAR FRACTION THREE QUARTERSproportional
¿00BFINVERTED QUESTION MARKproportional
À00C0LATIN CAPITAL LETTER A WITH GRAVEproportional
Á00C1LATIN CAPITAL LETTER A WITH ACUTEproportional
Â00C2LATIN CAPITAL LETTER A WITH CIRCUMFLEXproportional
Ã00C3LATIN CAPITAL LETTER A WITH TILDEproportional
Ä00C4LATIN CAPITAL LETTER A WITH DIAERESISproportional
Å00C5LATIN CAPITAL LETTER A WITH RING ABOVEproportional
Æ00C6LATIN CAPITAL LETTER AE (ash)proportional
Ç00C7LATIN CAPITAL LETTER C WITH CEDILLAproportional
È00C8LATIN CAPITAL LETTER E WITH GRAVEproportional
É00C9LATIN CAPITAL LETTER E WITH ACUTEproportional
Ê00CALATIN CAPITAL LETTER E WITH CIRCUMFLEXproportional
Ë00CBLATIN CAPITAL LETTER E WITH DIAERESISproportional
Ì00CCLATIN CAPITAL LETTER I WITH GRAVEproportional
Í00CDLATIN CAPITAL LETTER I WITH ACUTEproportional
Î00CELATIN CAPITAL LETTER I WITH CIRCUMFLEXproportional
Ï00CFLATIN CAPITAL LETTER I WITH DIAERESISproportional
Ð00D0LATIN CAPITAL LETTER ETH (Icelandic)proportional
Ñ00D1LATIN CAPITAL LETTER N WITH TILDEproportional
Ò00D2LATIN CAPITAL LETTER O WITH GRAVEproportional
Ó00D3LATIN CAPITAL LETTER O WITH ACUTEproportional
Ô00D4LATIN CAPITAL LETTER O WITH CIRCUMFLEXproportional
Õ00D5LATIN CAPITAL LETTER O WITH TILDEproportional
Ö00D6LATIN CAPITAL LETTER O WITH DIAERESISproportional
Ø00D8LATIN CAPITAL LETTER O WITH STROKEproportional
Ù00D9LATIN CAPITAL LETTER U WITH GRAVEproportional
Ú00DALATIN CAPITAL LETTER U WITH ACUTEproportional
Û00DBLATIN CAPITAL LETTER U WITH CIRCUMFLEXproportional
Ü00DCLATIN CAPITAL LETTER U WITH DIAERESISproportional
Ý00DDLATIN CAPITAL LETTER Y WITH ACUTEproportional
Þ00DELATIN CAPITAL LETTER THORN (Icelandic)proportional
ß00DFLATIN SMALL LETTER SHARP S (German)proportional
à00E0LATIN SMALL LETTER A WITH GRAVEproportional
á00E1LATIN SMALL LETTER A WITH ACUTEproportional
â00E2LATIN SMALL LETTER A WITH CIRCUMFLEXproportional
ã00E3LATIN SMALL LETTER A WITH TILDEproportional
ä00E4LATIN SMALL LETTER A WITH DIAERESISproportional
å00E5LATIN SMALL LETTER A WITH RING ABOVEproportional
æ00E6LATIN SMALL LETTER AE (ash)proportional
ç00E7LATIN SMALL LETTER C WITH CEDILLAproportional
è00E8LATIN SMALL LETTER E WITH GRAVEproportional
é00E9LATIN SMALL LETTER E WITH ACUTEproportional
ê00EALATIN SMALL LETTER E WITH CIRCUMFLEXproportional
ë00EBLATIN SMALL LETTER E WITH DIAERESISproportional
ì00ECLATIN SMALL LETTER I WITH GRAVEproportional
í00EDLATIN SMALL LETTER I WITH ACUTEproportional
î00EELATIN SMALL LETTER I WITH CIRCUMFLEXproportional
ï00EFLATIN SMALL LETTER I WITH DIAERESISproportional
ð00F0LATIN SMALL LETTER ETH (Icelandic)proportional
ñ00F1LATIN SMALL LETTER N WITH TILDEproportional
ò00F2LATIN SMALL LETTER O WITH GRAVEproportional
ó00F3LATIN SMALL LETTER O WITH ACUTEproportional
ô00F4LATIN SMALL LETTER O WITH CIRCUMFLEXproportional
õ00F5LATIN SMALL LETTER O WITH TILDEproportional
ö00F6LATIN SMALL LETTER O WITH DIAERESISproportional
ø00F8LATIN SMALL LETTER O WITH STROKEproportional
ù00F9LATIN SMALL LETTER U WITH GRAVEproportional
ú00FALATIN SMALL LETTER U WITH ACUTEproportional
û00FBLATIN SMALL LETTER U WITH CIRCUMFLEXproportional
ü00FCLATIN SMALL LETTER U WITH DIAERESISproportional
ý00FDLATIN SMALL LETTER Y WITH ACUTEproportional
þ00FELATIN SMALL LETTER THORN (Icelandic)proportional
ÿ00FFLATIN SMALL LETTER Y WITH DIAERESISproportional
Ā0100LATIN CAPITAL LETTER A WITH MACRONproportional
Ī012ALATIN CAPITAL LETTER I WITH MACRONproportional
Ū016ALATIN CAPITAL LETTER U WITH MACRONproportional
Ē0112LATIN CAPITAL LETTER E WITH MACRONproportional
Ō014CLATIN CAPITAL LETTER O WITH MACRONproportional
ā0101LATIN SMALL LETTER A WITH MACRONproportional
ī012BLATIN SMALL LETTER I WITH MACRONproportional
ū016BLATIN SMALL LETTER U WITH MACRONproportional
ē0113LATIN SMALL LETTER E WITH MACRONproportional
ō014DLATIN SMALL LETTER O WITH MACRONproportional
Ą0104LATIN CAPITAL LETTER A WITH OGONEKproportional
˘02D8BREVEproportional
Ł0141LATIN CAPITAL LETTER L WITH STROKEproportional
Ľ013DLATIN CAPITAL LETTER L WITH CARONproportional
Ś015ALATIN CAPITAL LETTER S WITH ACUTEproportional
Š0160LATIN CAPITAL LETTER S WITH CARONproportional
Ş015ELATIN CAPITAL LETTER S WITH CEDILLAproportional
Ť0164LATIN CAPITAL LETTER T WITH CARONproportional
Ź0179LATIN CAPITAL LETTER Z WITH ACUTEproportional
Ž017DLATIN CAPITAL LETTER Z WITH CARONproportional
Ż017BLATIN CAPITAL LETTER Z WITH DOT ABOVEproportional
ą0105LATIN SMALL LETTER A WITH OGONEKproportional
˛02DBOGONEKproportional
ł0142LATIN SMALL LETTER L WITH STROKEproportional
ľ013ELATIN SMALL LETTER L WITH CARONproportional
ś015BLATIN SMALL LETTER S WITH ACUTEproportional
ˇ02C7CARON (Mandarin Chinese third tone)proportional
š0161LATIN SMALL LETTER S WITH CARONproportional
ş015FLATIN SMALL LETTER S WITH CEDILLAproportional
ť0165LATIN SMALL LETTER T WITH CARONproportional
ź017ALATIN SMALL LETTER Z WITH ACUTEproportional
˝02DDDOUBLE ACUTE ACCENTproportional
ž017ELATIN SMALL LETTER Z WITH CARONproportional
ż017CLATIN SMALL LETTER Z WITH DOT ABOVEproportional
Ŕ0154LATIN CAPITAL LETTER R WITH ACUTEproportional
Ă0102LATIN CAPITAL LETTER A WITH BREVEproportional
Ĺ0139LATIN CAPITAL LETTER L WITH ACUTEproportional
Ć0106LATIN CAPITAL LETTER C WITH ACUTEproportional
Č010CLATIN CAPITAL LETTER C WITH CARONproportional
Ę0118LATIN CAPITAL LETTER E WITH OGONEKproportional
Ě011ALATIN CAPITAL LETTER E WITH CARONproportional
Ď010ELATIN CAPITAL LETTER D WITH CARONproportional
Ń0143LATIN CAPITAL LETTER N WITH ACUTEproportional
Ň0147LATIN CAPITAL LETTER N WITH CARONproportional
Ő0150LATIN CAPITAL LETTER O WITH DOUBLE ACUTEproportional
Ř0158LATIN CAPITAL LETTER R WITH CARONproportional
Ů016ELATIN CAPITAL LETTER U WITH RING ABOVEproportional
Ű0170LATIN CAPITAL LETTER U WITH DOUBLE ACUTEproportional
Ţ0162LATIN CAPITAL LETTER T WITH CEDILLAproportional
ŕ0155LATIN SMALL LETTER R WITH ACUTEproportional
ă0103LATIN SMALL LETTER A WITH BREVEproportional
ĺ013ALATIN SMALL LETTER L WITH ACUTEproportional
ć0107LATIN SMALL LETTER C WITH ACUTEproportional
č010DLATIN SMALL LETTER C WITH CARONproportional
ę0119LATIN SMALL LETTER E WITH OGONEKproportional
ě011BLATIN SMALL LETTER E WITH CARONproportional
ď010FLATIN SMALL LETTER D WITH CARONproportional
đ0111LATIN SMALL LETTER D WITH STROKEproportional
ń0144LATIN SMALL LETTER N WITH ACUTEproportional
ň0148LATIN SMALL LETTER N WITH CARONproportional
ő0151LATIN SMALL LETTER O WITH DOUBLE ACUTEproportional
ř0159LATIN SMALL LETTER R WITH CARONproportional
ů016FLATIN SMALL LETTER U WITH RING ABOVEproportional
ű0171LATIN SMALL LETTER U WITH DOUBLE ACUTEproportional
ţ0163LATIN SMALL LETTER T WITH CEDILLAproportional
˙02D9DOT ABOVE (Mandarin Chinese light tone)proportional
Ĉ0108LATIN CAPITAL LETTER C WITH CIRCUMFLEXproportional
Ĝ011CLATIN CAPITAL LETTER G WITH CIRCUMFLEXproportional
Ĥ0124LATIN CAPITAL LETTER H WITH CIRCUMFLEXproportional
Ĵ0134LATIN CAPITAL LETTER J WITH CIRCUMFLEXproportional
Ŝ015CLATIN CAPITAL LETTER S WITH CIRCUMFLEXproportional
Ŭ016CLATIN CAPITAL LETTER U WITH BREVEproportional
ĉ0109LATIN SMALL LETTER C WITH CIRCUMFLEXproportional
ĝ011DLATIN SMALL LETTER G WITH CIRCUMFLEXproportional
ĥ0125LATIN SMALL LETTER H WITH CIRCUMFLEXproportional
ĵ0135LATIN SMALL LETTER J WITH CIRCUMFLEXproportional
ŝ015DLATIN SMALL LETTER S WITH CIRCUMFLEXproportional
ŭ016DLATIN SMALL LETTER U WITH BREVEproportional
ɱ0271LATIN SMALL LETTER M WITH HOOKproportional
ʋ028BLATIN SMALL LETTER V WITH HOOKproportional
ɾ027ELATIN SMALL LETTER R WITH FISHHOOKproportional
ʃ0283LATIN SMALL LETTER ESHproportional
ʒ0292LATIN SMALL LETTER EZHproportional
ɬ026CLATIN SMALL LETTER L WITH BELTproportional
ɮ026ELATIN SMALL LETTER LEZHproportional
ɹ0279LATIN SMALL LETTER TURNED Rproportional
ʈ0288LATIN SMALL LETTER T WITH RETROFLEX HOOKproportional
ɖ0256LATIN SMALL LETTER D WITH TAILproportional
ɳ0273LATIN SMALL LETTER N WITH RETROFLEX HOOKproportional
ɽ027DLATIN SMALL LETTER R WITH TAILproportional
ʂ0282LATIN SMALL LETTER S WITH HOOKproportional
ʐ0290LATIN SMALL LETTER Z WITH RETROFLEX HOOKproportional
ɻ027BLATIN SMALL LETTER TURNED R WITH HOOKproportional
ɭ026DLATIN SMALL LETTER L WITH RETROFLEX HOOKproportional
ɟ025FLATIN SMALL LETTER DOTLESS J WITH STROKEproportional
ɲ0272LATIN SMALL LETTER N WITH LEFT HOOKproportional
ʝ029DLATIN SMALL LETTER J WITH CROSSED-TAILproportional
ʎ028ELATIN SMALL LETTER TURNED Yproportional
ɡ0261LATIN SMALL LETTER SCRIPT Gproportional
ŋ014BLATIN SMALL LETTER ENG (Sami)proportional
ɰ0270LATIN SMALL LETTER TURNED M WITH LONG LEGproportional
ʁ0281LATIN LETTER SMALL CAPITAL INVERTED Rproportional
ħ0127LATIN SMALL LETTER H WITH STROKEproportional
ʕ0295LATIN LETTER PHARYNGEAL VOICED FRICATIVEproportional
ʔ0294LATIN LETTER GLOTTAL STOPproportional
ɦ0266LATIN SMALL LETTER H WITH HOOKproportional
ʘ0298LATIN LETTER BILABIAL CLICKproportional
ǂ01C2LATIN LETTER ALVEOLAR CLICKproportional
ɓ0253LATIN SMALL LETTER B WITH HOOKproportional
ɗ0257LATIN SMALL LETTER D WITH HOOKproportional
ʄ0284LATIN SMALL LETTER DOTLESS J WITH STROKE AND HOOKproportional
ɠ0260LATIN SMALL LETTER G WITH HOOKproportional
Ɠ0193LATIN CAPITAL LETTER G WITH HOOKproportional
œ0153LATIN SMALL LIGATURE OEproportional
Œ0152LATIN CAPITAL LIGATURE OEproportional
ɨ0268LATIN SMALL LETTER I WITH STROKEproportional
ʉ0289LATIN SMALL LETTER U BARproportional
ɘ0258LATIN SMALL LETTER REVERSED Eproportional
ɵ0275LATIN SMALL LETTER BARRED Oproportional
ə0259LATIN SMALL LETTER SCHWAproportional
ɜ025CLATIN SMALL LETTER REVERSED OPEN Eproportional
ɞ025ELATIN SMALL LETTER CLOSED REVERSED OPEN Eproportional
ɐ0250LATIN SMALL LETTER TURNED Aproportional
ɯ026FLATIN SMALL LETTER TURNED Mproportional
ʊ028ALATIN SMALL LETTER UPSILONproportional
ɤ0264LATIN SMALL LETTER RAMS HORNproportional
ʌ028CLATIN SMALL LETTER TURNED Vproportional
ɔ0254LATIN SMALL LETTER OPEN Oproportional
ɑ0251LATIN SMALL LETTER ALPHAproportional
ɒ0252LATIN SMALL LETTER TURNED ALPHAproportional
ʍ028DLATIN SMALL LETTER TURNED Wproportional
ɥ0265LATIN SMALL LETTER TURNED Hproportional
ʢ02A2LATIN LETTER REVERSED GLOTTAL STOP WITH STROKEproportional
ʡ02A1LATIN LETTER GLOTTAL STOP WITH STROKEproportional
ɕ0255LATIN SMALL LETTER C WITH CURLproportional
ʑ0291LATIN SMALL LETTER Z WITH CURLproportional
ɺ027ALATIN SMALL LETTER TURNED R WITH LONG LEGproportional
ɧ0267LATIN SMALL LETTER HENG WITH HOOKproportional
ɚ025ALATIN SMALL LETTER SCHWA WITH HOOKproportional
æ̀<00E6, 0300><LATIN SMALL LETTER AE, COMBINING GRAVE ACCENT>proportional
ǽ01FDLATIN SMALL LETTER AE WITH ACUTE (ash)proportional
1F70GREEK SMALL LETTER ALPHA WITH VARIAproportional
ά1F71GREEK SMALL LETTER ALPHA WITH OXIAproportional
ɔ̀<0254, 0300><LATIN SMALL LETTER OPEN O, COMBINING GRAVE ACCENT>proportional
ɔ́<0254, 0301><LATIN SMALL LETTER OPEN O, COMBINING ACUTE ACCENT>proportional
ʌ̀<028C, 0300><LATIN SMALL LETTER TURNED V, COMBINING GRAVE ACCENT>proportional
ʌ́<028C, 0301><LATIN SMALL LETTER TURNED V, COMBINING ACUTE ACCENT>proportional
ə̀<0259, 0300><LATIN SMALL LETTER SCHWA, COMBINING GRAVE ACCENT>proportional
ə́<0259, 0301><LATIN SMALL LETTER SCHWA, COMBINING ACUTE ACCENT>proportional
ɚ̀<025A, 0300><LATIN SMALL LETTER SCHWA WITH HOOK, COMBINING GRAVE ACCENT>proportional
ɚ́<025A, 0301><LATIN SMALL LETTER SCHWA WITH HOOK, COMBINING ACUTE ACCENT>proportional
1F72GREEK SMALL LETTER EPSILON WITH VARIAproportional
έ1F73GREEK SMALL LETTER EPSILON WITH OXIAproportional
͡0361COMBINING DOUBLE INVERTED BREVEproportional
ˈ02C8MODIFIER LETTER VERTICAL LINEproportional
ˌ02CCMODIFIER LETTER LOW VERTICAL LINEproportional
ː02D0MODIFIER LETTER TRIANGULAR COLONproportional
ˑ02D1MODIFIER LETTER HALF TRIANGULAR COLONproportional
̆0306COMBINING BREVE (Vrachy)proportional
203FUNDERTIE (Enotikon)proportional
̋030BCOMBINING DOUBLE ACUTE ACCENTproportional
́0301COMBINING ACUTE ACCENT (Oxia, Tonos)proportional
̄0304COMBINING MACRONproportional
̀0300COMBINING GRAVE ACCENT (Varia)proportional
̏030FCOMBINING DOUBLE GRAVE ACCENTproportional
̌030CCOMBINING CARONproportional
̂0302COMBINING CIRCUMFLEX ACCENTproportional
˥02E5MODIFIER LETTER EXTRA-HIGH TONE BARproportional
˦02E6MODIFIER LETTER HIGH TONE BARproportional
˧02E7MODIFIER LETTER MID TONE BARproportional
˨02E8MODIFIER LETTER LOW TONE BARproportional
˩02E9MODIFIER LETTER EXTRA-LOW TONE BARproportional
˩˥<02E9, 02E5><MODIFIER LETTER EXTRA-LOW TONE BAR, MODIFIER LETTER EXTRA-HIGH TONE BAR>proportional
˥˩<02E5, 02E9><MODIFIER LETTER EXTRA-HIGH TONE BAR, MODIFIER LETTER EXTRA-LOW TONE BAR>proportional
̥0325COMBINING RING BELOWproportional
̬032CCOMBINING CARON BELOWproportional
̹0339COMBINING RIGHT HALF RING BELOWproportional
̜031CCOMBINING LEFT HALF RING BELOWproportional
̟031FCOMBINING PLUS SIGN BELOWproportional
̠0320COMBINING MINUS SIGN BELOWproportional
̈0308COMBINING DIAERESIS (Dialytika)proportional
̽033DCOMBINING X ABOVEproportional
̩0329COMBINING VERTICAL LINE BELOWproportional
̯032FCOMBINING INVERTED BREVE BELOWproportional
˞02DEMODIFIER LETTER RHOTIC HOOKproportional
̤0324COMBINING DIAERESIS BELOWproportional
̰0330COMBINING TILDE BELOWproportional
̼033CCOMBINING SEAGULL BELOWproportional
̴0334COMBINING TILDE OVERLAYproportional
̝031DCOMBINING UP TACK BELOWproportional
̞031ECOMBINING DOWN TACK BELOWproportional
̘0318COMBINING LEFT TACK BELOWproportional
̙0319COMBINING RIGHT TACK BELOWproportional
̪032ACOMBINING BRIDGE BELOWproportional
̺033ACOMBINING INVERTED BRIDGE BELOWproportional
̻033BCOMBINING SQUARE BELOWproportional
̃0303COMBINING TILDEproportional
̚031ACOMBINING LEFT ANGLE ABOVEproportional
2776DINGBAT NEGATIVE CIRCLED DIGIT ONEproportional
2777DINGBAT NEGATIVE CIRCLED DIGIT TWOproportional
2778DINGBAT NEGATIVE CIRCLED DIGIT THREEproportional
2779DINGBAT NEGATIVE CIRCLED DIGIT FOURproportional
277ADINGBAT NEGATIVE CIRCLED DIGIT FIVEproportional
277BDINGBAT NEGATIVE CIRCLED DIGIT SIXproportional
277CDINGBAT NEGATIVE CIRCLED DIGIT SEVENproportional
277DDINGBAT NEGATIVE CIRCLED DIGIT EIGHTproportional
277EDINGBAT NEGATIVE CIRCLED DIGIT NINEproportional
277FDINGBAT NEGATIVE CIRCLED NUMBER TENproportional
24EBNEGATIVE CIRCLED NUMBER ELEVENproportional
24ECNEGATIVE CIRCLED NUMBER TWELVEproportional
24EDNEGATIVE CIRCLED NUMBER THIRTEENproportional
24EENEGATIVE CIRCLED NUMBER FOURTEENproportional
24EFNEGATIVE CIRCLED NUMBER FIFTEENproportional
24F0NEGATIVE CIRCLED NUMBER SIXTEENproportional
24F1NEGATIVE CIRCLED NUMBER SEVENTEENproportional
24F2NEGATIVE CIRCLED NUMBER EIGHTEENproportional
24F3NEGATIVE CIRCLED NUMBER NINETEENproportional
24F4NEGATIVE CIRCLED NUMBER TWENTYproportional
24D0CIRCLED LATIN SMALL LETTER Aproportional
24D1CIRCLED LATIN SMALL LETTER Bproportional
24D2CIRCLED LATIN SMALL LETTER Cproportional
24D3CIRCLED LATIN SMALL LETTER Dproportional
24D4CIRCLED LATIN SMALL LETTER Eproportional
24D5CIRCLED LATIN SMALL LETTER Fproportional
24D6CIRCLED LATIN SMALL LETTER Gproportional
24D7CIRCLED LATIN SMALL LETTER Hproportional
24D8CIRCLED LATIN SMALL LETTER Iproportional
24D9CIRCLED LATIN SMALL LETTER Jproportional
24DACIRCLED LATIN SMALL LETTER Kproportional
24DBCIRCLED LATIN SMALL LETTER Lproportional
24DCCIRCLED LATIN SMALL LETTER Mproportional
24DDCIRCLED LATIN SMALL LETTER Nproportional
24DECIRCLED LATIN SMALL LETTER Oproportional
24DFCIRCLED LATIN SMALL LETTER Pproportional
24E0CIRCLED LATIN SMALL LETTER Qproportional
24E1CIRCLED LATIN SMALL LETTER Rproportional
24E2CIRCLED LATIN SMALL LETTER Sproportional
24E3CIRCLED LATIN SMALL LETTER Tproportional
24E4CIRCLED LATIN SMALL LETTER Uproportional
24E5CIRCLED LATIN SMALL LETTER Vproportional
24E6CIRCLED LATIN SMALL LETTER Wproportional
24E7CIRCLED LATIN SMALL LETTER Xproportional
24E8CIRCLED LATIN SMALL LETTER Yproportional
24E9CIRCLED LATIN SMALL LETTER Zproportional
2051TWO ASTERISKS ALIGNED VERTICALLYproportional
2042ASTERISMproportional
2460CIRCLED DIGIT ONEproportional
2461CIRCLED DIGIT TWOproportional
2462CIRCLED DIGIT THREEproportional
2463CIRCLED DIGIT FOURproportional
2464CIRCLED DIGIT FIVEproportional
2465CIRCLED DIGIT SIXproportional
2466CIRCLED DIGIT SEVENproportional
2467CIRCLED DIGIT EIGHTproportional
2468CIRCLED DIGIT NINEproportional
2469CIRCLED NUMBER TENproportional
246ACIRCLED NUMBER ELEVENproportional
246BCIRCLED NUMBER TWELVEproportional
246CCIRCLED NUMBER THIRTEENproportional
246DCIRCLED NUMBER FOURTEENproportional
246ECIRCLED NUMBER FIFTEENproportional
246FCIRCLED NUMBER SIXTEENproportional
2470CIRCLED NUMBER SEVENTEENproportional
2471CIRCLED NUMBER EIGHTEENproportional
2472CIRCLED NUMBER NINETEENproportional
2473CIRCLED NUMBER TWENTYproportional
222ECONTOUR INTEGRALproportional
221FRIGHT ANGLEproportional
2756BLACK DIAMOND MINUS WHITE Xproportional
261EWHITE RIGHT POINTING INDEXproportional

A.28 Warichu opening brackets (cl-28)

CharacterUCSNameRemark
0028LEFT PARENTHESIS
3014LEFT TORTOISE SHELL BRACKET
005BLEFT SQUARE BRACKET

A.29 Warichu closing brackets (cl-29)

CharacterUCSNameRemark
0029RIGHT PARENTHESIS
3015RIGHT TORTOISE SHELL BRACKET
005DRIGHT SQUARE BRACKET

A.30 Characters in tate-chu-yoko (cl-30)

Any character may participate in tate-chu-yoko composition.

Appendix B Spacing between Characters

The amount of space between two adjacent characters of given character classes explained in 3.9.2 Grouping of Characters and Symbols depending on their Positioning is determined by table 2 below. For methods to determine the amounts of space reduction and addition by line adjustments, see Appendix D Opportunities for Inter-character Space Reduction during Line Adjustment and Appendix E Opportunities for Inter-character Space Expansion during Line Adjustment.

See "Table 2 Spacing between characters" (PDF).

B.1 Legend of Table 2

  1. The left-most column, labeled "before", lists preceding character classes, and the top row, labeled "after", lists trailing character classes. Each cell indicates the amount of space between two characters of the corresponding character classes at the intersection of a given row and column.

  2. The last row, labeled "line head", lists amounts of space inserted between the line head and a character of the corresponding character class in each column. Likewise, the last column, labeled "line end", lists amounts of space inserted between a character of the corresponding character class in each row and the line end.

  3. The following notations in cells are used to indicate the amount of space or other related information.

    1. blank: Set solid between two adjacent characters (it can be that the preceding character is actually the line head or the trailing character is actually the line end).

    2. × mark: The combination is not allowed due to line breaking rules or other restrictions.

    3. 1/2 be: Insert a half em space, where the em here indicates the size of the preceding character.

    4. 1/2 af: Insert a half em space, where the em here indicates the size of the trailing character.

      (note)

      The reason for the distinction between the two half em spaces ("be" stands for "before" and "af" stands for "after") is necessary because there are cases where a line is composed with different sizes of characters, as shown in [Fig.183] where it is necessary to disambiguate which em size we are referring to. Likewise, other instances of the notation "be" or "af" in this appendix are for the same disambiguation.

      Referent for em in half em space.
      [Fig.183]: Referent for em in half em space.
    5. 1/2 be hang: Identical to "1/2 be" in the amount of space. Ruby text can be extended up to the size of the ruby character over this "1/2 be hang" space as long as it is not reduced due to line adjustments. When it is reduced, ruby text can be extended up to the size of the reduced space and shall not be extended over the other character.

    6. 1/2 af hang: Identical to "1/2 af" in the amount of space. Ruby text can be extended up to the size of the ruby character over this "1/2 af hang" space as long as it is not reduced due to line adjustments. When it is reduced, ruby text can be extended up to the size of the reduced space and shall not be extended over the other character.

    7. 1/4 be: Insert a quarter em space, where the em here indicates the size of the preceding character.

    8. 1/4 af: Insert a quarter em space, where the em here indicates the size of the trailing character.

    9. 1/4 be hang: Identical to "1/4 be" in the amount of space. Ruby text can be extended up to half the size of the ruby character over this "1/4 be hang" space as long as it is not reduced due to line adjustments. When it is reduced, ruby text can be extended up to the size of the reduced space and shall not be extended over the other character.

    10. 1/4 af hang: Identical to "1/4 af" in the amount of space. Ruby text can be extended up to half the size of the ruby character over this "1/4 af hang" space as long as it is not reduced due to line adjustments. When it is reduced, ruby text can be extended up to the size of the reduced space and shall not be extended over the other character.

    11. ruby hang: Set solid. Ruby text can be extended up to the size of the ruby character over the other character (see also 7 of B.2 Notes).

B.2 Notes

  1. When opening brackets (cl-01) are followed by a simple-ruby character complex (cl-22) or jukugo-ruby character complex (cl-23), the preferred approach is to allow the ruby text to be extended up to the size of the ruby character over the opening brackets (cl-01). One alternative approach is to not allow ruby text to be extended over opening brackets, and another is to allow it to be extended up to half the size of the ruby character.

  2. The preferred spacing between closing brackets (cl-02) and the line end is a half em. The alternative is to set solid (see 3.1.9 Positioning of Closing Brackets, Full Stops, Commas and Middle Dots at Line End).

  3. Character spacing between two consecutive middle dots (cl-05) shall be the sum of a quarter em of the preceding middle dots and a quarter em of the trailing middle dots.

  4. The preferred spacing between middle dots (cl-05) and the line end is a quarter em. The alternative is to set solid.

  5. In this document, character spacing between a full stop (cl-06) or comma (cl-07) and a following middle dot (cl-05) is the sum of the half em space of the full stop or comma and the quarter em space of the middle dot. On the other hand, JIS X 4051 classifies commas (cl-07) as a subset of closing brackets (cl-02), and, therefore, where a comma (cl-07) is followed by a middle dot (cl-05) in JIS X 4051 the character spacing between them is just the quarter em space of the following middle dot (cl-05).

  6. The preferred spacing between full stops (cl-06) or commas (cl-07) and the line end is a half em. The alternative is to set solid (see 3.1.9 Positioning of Closing Brackets, Full Stops, Commas and Middle Dots at Line End).

  7. When a simple-ruby character complex (cl-22) or jukugo-ruby character complex (cl-23) is adjacent to katakana (cl-16), the preferred approach is to allow the ruby text to be extended up to the size of the ruby character over the katakana. However, if it is required to conform to JIS X 4051, ruby text shall not be extended over the katakana because katakana characters belong to the ideographic character class in JIS X 4051.

    There are alternative methods, one of which is to allow ruby text to be extended up to the size of the ruby character over any character including ideographic (cl-19) as well as hiragana (cl-15) and katakana (cl-16) characters, and another is NOT to allow ruby text to be extended over any character from hiragana (cl-15), katakana (cl-16) and ideographic characters (cl-19).

  8. Ruby text can be extended up to the size of the ruby character over the full-width ideographic space (cl-14). The preferred approach is to apply the same for the full-width line head indent at the beginning of a paragraph. The alternative approach is not to allow ruby text to be extended over the line head indent.

  9. When two adjacent characters belong to the same ornamented character complex (cl-21) run, set them according to the method explained in 3.7.1 Superscripts and Superscripts. When two adjacent characters belong to two distinct ornamented character complex runs, set them solid.

  10. When two adjacent characters belong to the same simple-ruby character complex (cl-22) run, set them according to the method explained in 3.3.5 Positioning of Mono-ruby with Respect to Base Characters. When two adjacent characters belong to two distinct simple-ruby character complex runs, set them solid.

  11. When two adjacent characters belong to the same jukugo-ruby character complex (cl-23) run, set them according to the method explained in 3.3.7 Positioning of Jukugo-ruby with Respect to Base Characters. When two adjacent characters belong to two distinct simple-ruby character complex runs, set them solid.

  12. Character spacing between a preceding unit symbol (cl-25) and a trailing middle dot (cl-05) shall be a quarter em of the trailing character. Note that KATAKANA MIDDLE DOT "・" can be used either as a unit symbol (cl-25) or as a middle dot. When it is used as a unit symbol (cl-25), both preceding and trailing spacing of KATAKANA MIDDLE DOT "・" shall be zero.

  13. There shall be no visible space occupied by Western word space (cl-26) at the line head and that of warichu (inline cutting note), the line end and that of warichu (inline cutting note). If the condition is changed for the same text, restore the default visible space for Western word space (cl-26).

  14. In principle iteration marks (cl-09) should be placed neither at the line head nor at the head of an inline cutting note. When it happens with IDEOGRAPHIC ITERATION MARK "々", there are three ways to deal with this situation.

    1. Follow the principle by applying some sort of line adjustment. In this case, IDEOGRAPHIC ITERATION MARK "々" remains in iteration marks (cl-09).

    2. Allow IDEOGRAPHIC ITERATION MARK "々" to be placed either at the line head or at the head of an inline cutting note. In this case, the character shall be treated as part of the ideographic characters (cl-19) class.

    3. Replace IDEOGRAPHIC ITERATION MARK "々" with the corresponding character.

      e.g.

      line end: 国

      line head: 々

      <is replaced with>

      line end: 国

      line head: 国

      line end: 人

      line head: 々

      <is replaced with>

      line end: 人

      line head: 人

  15. In principle, a prolonged sound mark (cl-10) should be placed neither at the head of a line nor that of an inline cutting note. If it were allowed, the character shall be treated as part of the katakana (cl-16) class.

  16. In principle, small kana (cl-11) should be placed neither at the head of a line nor that of an inline cutting note in principle. If it were to be allowed, HIRAGANA LETTER SMALL * shall be treated as part of the hiragana (cl-15) class, and KATAKANA LETTER SMALL * as part of the katakana (cl-16) class.

  17. The preferred character spacing between the line head and opening opening brackets (cl-01) is zero. An alternative way is not to remove a conditional half em space accompanying the characters (see 3.1.5 Positioning of Opening Brackets at Line Head including methods of positioning of opening brackets at the beginning of paragraphs).

Appendix C Possibilities for Line-breaking between Characters

Line break opportunities between two adjacent characters of given character classes explained in 3.9.2 Grouping of Characters and Symbols depending on their Positioning shall be determined by table 3 below.

See "Table 3 Possibility of separation between characters" (PDF).

C.1 Legend of Table 3

  1. The left-most column, labeled "before", lists preceding character classes, and the top row, labeled "after", lists trailing character classes. Each cell indicates the type of line break opportunity between two adjacent characters of the corresponding character classes at a given row and column.

  2. The type of line break opportunity is indicated in each cell in the table using the following notation.

    1. blank: A line is allow to end between the two characters (breakable).

    2. not: A line is not allowed to end between the two characters (unbreakable).

    3. × mark: the combination is not allowed due to line breaking rules or other restrictions.

C.2 Notes

  1. If IDEOGRAPHIC ITERATION MARK "々" is allowed to appear at the line head or that of inline cutting note, the character shall be treated as a member of the ideographic character (cl-19) class. (For how it behaves in combination with other character classes, see the cells for ideographic characters (cl-19).)

  2. If a prolonged sound mark (cl-10) is allowed to appear at the head of a line or that of inline cutting note, the character shall be treated as a member of the katakana (cl-16) class. (For how it behaves in combination with other character classes, see the cells for katakana (cl-16).)

  3. If small kana (cl-11) are allowed to appear at the head of a line or that of inline cutting note, the character shall be treated as a member of the hiragana (cl-15) or katakana (cl-16) classes accordingly, depending on the script type of the character. (For how it behaves in combination with other character classes, see the cells for hiragana (cl-15) or katakana (cl-16).)

  4. For the default one em space after dividing punctuation marks (cl-04) at the end of a sentence, full-width ideographic space (cl-14) can be used. See 3.1.6 Positioning of Dividing Punctuation Marks (Question Mark and Exclamation Mark) and Hyphens for more detail on how to deal with this case.

  5. There is no line break opportunity between consecutive inseparable characters (cl-08) of the same kind. If two consecutive inseparable characters (cl-08) are of different kinds, a line break opportunity exists between them. For example, a line shall not be broken between two consecutive EM DASH "―" characters, but it is breakable between EM DASH "―" followed by HORIZONTAL ELLIPSIS "…".

  6. There is no line break opportunity between two consecutive characters belonging to the same ornamented character complex (cl-21). If two consecutive characters belong to different ornamented character complexes (cl-21), a line break opportunity exists between them.

  7. There is no line break opportunity between two consecutive characters belonging to the same simple-ruby character complex (cl-22). If two consecutive characters belong to different simple-ruby character complexes (cl-22), a line break opportunity exists between them.

  8. A line break opportunity exists between two consecutive base characters belonging to different jukugo-ruby character complexes (cl-23). There is also a line break opportunity between two consecutive base characters belonging to the same jukugo-ruby character complex (cl-23) and between two runs of ruby text accompanying the corresponding base characters. However, a base character and the accompanying ruby text shall be indivisible, hence there is no line break opportunity between any two consecutive ruby characters in a run of ruby text accompanying a base character.

  9. There is no line break opportunity between preceding grouped numerals (cl-24) and trailing postfixed abbreviations (cl-13). The alternative approach is to allow a line to break before trailing PERCENT SIGN "%", in which case PERCENT SIGN "%" shall be treated as a member of the ideographic character (cl-19) class.

  10. There are two approaches: one is to allow a line to break between preceding grouped numerals (cl-24) and trailing Western characters (cl-27), and the other is not to.

  11. A line break opportunity generally exists between preceding Western characters (cl-27) and trailing postfixed abbreviations (cl-13), unless the preceding Western character (cl-27) is used as a symbol of a quantity or a European numeral, in which case a line break is not allowed between them.

  12. There is no line break opportunity between two consecutive Western characters (cl-27). In order to break a line in the middle of a Western word, it needs to be divided into two syllables first. Then a line can be broken between the two by adding HYPHEN "-" at the line end.

  13. There is no line break opportunity between two consecutive characters belonging to the same set of characters in tate-chu-yoko (cl-30). If two consecutive characters belong to different sets of characters in tate-chu-yoko (cl-30), there a line break opportunity exists between them.

C.3 Addendum

As noted in Appendix B Spacing between Characters and C.2 Notes, there are several conventions for line-start prohibition, line-end prohibition and unbreakable character rules. The following lists four levels of convention. Note that breaking a line after opening brackets (cl-01) and before closing brackets (cl-02), full stops (cl-06) or commas (cl-07) is prohibited at all levels. Likewise, those rules common to all levels are not listed below.

  1. Very loose (Newspapers)

    Breaking a line is allowed before or after the following character classes even though table 3 prohibits it.

  2. Loose (Magazines)

    Breaking a line is allowed before or after the following character classes (or characters) even though table 3 prohibits it.

  3. Strict (Default, general publications)

    Breaking a line is allowed before or after the following character classes (or characters) even though table 3 prohibits it.

  4. Very strict (General publications)

    Breaking a line is not allowed for any place where table 3 prohibits it (no alternate rule explained in C.2 Notes is applied.).

    The difference between the strict and the very strict rules described above is as follows. The very strict line-breaking rule is the convention that attaches greater importance to the line-start prohibition and unbreakable character rules, and therefore it is more likely to lead to the need for line adjustment processing. In comparison, the strict line-breaking rule is the convention that weakens line-start prohibition in order to avoid line adjustment as much as possible to make it easier to achieve solid setting. In other words, the very strict rule is for the best appearance at the line head, while the strict rule is best to avoid inter-character space adjustment.

Appendix D Opportunities for Inter-character Space Reduction during Line Adjustment

The following tables indicate if an opportunity exists for inter-character space reduction during line adjustment between two adjacent characters of given character classes as explained in 3.9.2 Grouping of Characters and Symbols depending on their Positioning. (For more detail on line adjustment, see 3.8 Line Adjustment.) In the process of line adjustment by inter-character space reduction, the first place to look (the first stage of inter-character space reduction in priority order) is for Western word spaces (cl-26), each of which is reducible equally, to leave a minimum of a quarter em space (or a one fifth em space) with respect to the corresponding character size. The tables are for the second and subsequent stages of inter-character space reduction in priority order, assuming the first stage of the reduction for Western word spaces (cl-26) is already done. The default unadjusted space between two adjacent characters of given character classes shall be determined according to Appendix B Spacing between Characters.

There are several conventions for line adjustment, especially when it is to be achieved by inter-character space reduction. Table 4 follows the method adopted by this document, Table 5 supplies an alternative way specified by JIS X 4051, and Table 6, taking partially different approaches from the previous two, represents yet another method which can be seen in books or other publications.

See "Table 4 Opportunity of inter-character space reduction" (PDF).

See "Table 5 Opportunity of inter-character space reduction (the method specified by JIS X 4051)" (PDF).

See "Table 6 Opportunity of inter-character space reduction (the method adopted by books)" (PDF).

D.1 Legend of Tables 4, 5 and 6

  1. The left-most column, labeled "before", lists preceding character classes, and the top row, labeled "after", lists trailing character classes. Each cell indicates the type of opportunities of inter-character space reduction between two adjacent characters of the corresponding character classes at a given row and column.

  2. The cells in the row "line head" or the column "line end" should read if there exists opportunity for space reduction at line head or line end respectively. Tables 4, 5 and 6 all prohibit space reduction at line head. While Table 5 prohibits space reduction at line end too, Tables 4 and 6 allow it for several character classes at line end.

  3. The type of opportunity for space reduction is indicated in each cell in the tables using the following notation.

    1. Cells with a white background: No opportunity for space reduction because the two characters are set solid and there is no adjustable space between them, or because it is prohibited by rule to reduce the inter-character space for the given combination.

      1. Blank: No opportunity for space reduction because the given combination of characters is set solid (or there is no space before or after the given character at line head or at line end respectively).

      2. × mark:The combination is not allowed due to line breaking rules or other restrictions.

      3. 1/2: The amount of inter-character space shall be a half em, and this space shall not be reduced (no opportunity of inter-character space reduction).

      4. 1/4: The amount of inter-character space shall be a quarter em, and this space shall not be reduced (no opportunity of inter-character space reduction).

    2. Cells with a colored or gray background: An opportunity for inter-character space reduction exists.

      1. Priority order in space reduction: The second highest and subsequent priorities of inter-character space reduction are indicated by the given background colors of the cells. The cells with the colors in the left are higher in priority than ones in the right. Space reduction shall be processed in this order.

        Priority order in space reduction.
        [Fig.184]: Priority order in space reduction.
      2. Amount of space reducible: the amount of space to be reduced is indicated using the following notation.

        1. 1/2–0: A half em space by default, which is reducible equally, with respect to the corresponding character size, with no minimum space required (set solid).

        2. 1/2–1/4: A half em space by default, which is reducible equally, with respect to the corresponding character size, to leave a minimum of a quarter em space.

        3. 1/2=0: A half em space by default, which can be removed to set solid (it is not allowed to leave any intermediate size of space between zero to a half em, such as a quarter em space).

        4. 1/4–0: A quarter em space by default, which is reducible equally, with respect to the corresponding character size, with no minimum space required (set solid).

        5. 1/4–1/8: A quarter em space by default, which is reducible equally, with respect to the corresponding character size, to leave a minimum of a one eighth em space.

Note that JIS X 4051 specifies to not leave any space after the closing brackets (cl-02) or commas (cl-07) at the line end. Therefore, Table 5 also indicates that there is no opportunity for space reduction after closing brackets (cl-02) and commas (cl-07) at the line end. Likewise, because middle dots (cl-05) at the line end are supposed to have no space, Table 5 indicates there is no opportunity for space reduction for middle dots (cl-05) at the line end. On the other hand, while JIS X 4051 specifies to pad with a half em space after full stops (cl-06) at the line end, which is not allowed to reduce this space for line adjustment, Table 4 and 6 allow the removal of the default half em space after closing brackets (cl-02), full stops (cl-06) and commas (cl-07) at the line end for line adjustment. Table 4 further allows the removal of the default quarter em space padding before and after middle dots (cl-05) at the line end for line adjustment, while Table 6 does not.

D.2 Notes

  1. The default unadjusted space when a middle dot (cl-05) is followed by a middle dot (cl-05), is the sum of the conditional quarter em space accompanying the preceding middle dot (cl-05) and the conditional quarter em space accompanying the trailing middle dot (cl-05). Tables 4 and 5 allow these two quarter em spaces to be reduced, to leave no space as a minimum. The priority order in space reduction is the fourth in Table 4, and it is the second priority in Table 5.

  2. The default unadjusted space when a full stop (cl-06) is followed by a middle dot (cl-05), is the sum of the conditional half em space accompanying the preceding full stop (cl-06) and the conditional quarter em space accompanying the trailing middle dot (cl-05). Tables 4 and 5 allow the quarter em space accompanying the trailing middle dot (cl-05) to be reduced, to leave no space as a minimum. The priority order in space reduction is the fourth in Table 4, and it is the second priority in Table 5.

  3. The default unadjusted space when a comma (cl-07) is followed by a middle dot (cl-05), is the sum of the conditional half em space accompanying the preceding comma (cl-07) and the conditional quarter em space accompanying the trailing middle dot (cl-05). Table 6 allows the conditional half em space accompanying the trailing comma (cl-07) to be reduced, to leave a quarter em space as a minimum. The priority order in space reduction for the conditional space accompanying middle dots (cl-05) is the fourth in Table 4, and it is the second priority in Table 5. The priority order in space reduction for the conditional space accompanying commas (cl-07) is the fifth in Table 4, and it is the third priority in Table 5 and 6.

  4. There is no opportunity for space reduction for a Western word space (cl-26) at the line head and at the line end since there is supposed to be no visible space. The same applies to the Western word space (cl-26) at the line head or the line end of warichu (inline cutting note). If the condition is changed for the same text, restore the default visible space for Western word space (cl-26).

  5. Table 4, and only Table 4, allows the preceding and trailing conditional quarter em space accompanying middle dots (cl-05) to be reduced to leave no space. The priority order is the third.

Appendix E Opportunities for Inter-character Space Expansion during Line Adjustment

The following table indicates if an opportunity exists for inter-character space expansion during line adjustment between two adjacent characters of given character classes as explained in 3.9.2 Grouping of Characters and Symbols depending on their Positioning. (For more detail on line adjustment, see 3.8 Line Adjustment.) In the process of line adjustment by inter-character space expansion, the first place to look (the first stage of inter-character space expansion in priority order) is for Western word spaces (cl-26), each of which is expandable equally, to take up and maximum of a half em space with respect to the corresponding character size. The tables are for the second and subsequent stages of inter-character space expansion in priority order, assuming the first stage of the expansion for Western word spaces (cl-26) is already done. The default unadjusted space between two adjacent characters of given character classes shall be determined according to Appendix B Spacing between Characters.

See "Table 7 Opportunity of inter-character space expansion" (PDF).

E.1 Legend of Table 7

  1. The left-most column, labeled "before", lists preceding character classes, and the top row, labeled "after", lists trailing character classes. Each cell indicates the type of opportunities for inter-character space expansion between two adjacent characters of the corresponding character classes at a given row and column. Note that there are no cells involving "line head" or "line end" because there is no opportunity for inter-character space expansion before any character at the line head or after any character at the line end.

  2. The type of opportunity for space expansion is indicated in each cell in the table using the following notation.

    1. Cells with a white background: No opportunity for space expansion.

      1. Blank: Inter-character space expansion is not allowed because there is no line break opportunity between the given combination of characters.

      2. × mark: The combination is not allowed due to line breaking rules or other restrictions.

    2. Cells with a colored or gray background: An opportunity exists for inter-character space expansion.

      1. Priority order in space expansion: The second highest and subsequent priorities of inter-character space expansion are indicated by the given background colors of the cells. The cells with the colors in the left are higher in priority than ones in the right. Space expansion shall be processed in this order.

        Priority order in space expansion.
        [Fig.185]: Priority order in space expansion.
      2. Amount of space expandable: The amount of space to be expanded is indicated using the following notation.

        1. 1/4–1/2: A quarter em space by default, which is expandable equally, with respect to the corresponding character size, to take up to a maximum of a half em space. However if Table 5 is adopted as the method of inter-character space reduction for line adjustment, this quarter em space shall not be expanded.

        2. 1/4: No inter-character space by default, which is expandable equally, with respect to the corresponding character size, to take up to a maximum of a quarter em space.

        3. blank: Expandable equally with respect to the corresponding character size, only after no other expandable inter-character space is left.

E.2 Notes

  1. If the IDEOGRAPHIC ITERATION MARK "々" is allowed to appear at the head of a line or that of inline cutting note, the character shall be treated as a member of the ideographic character (cl-19) class. (For how it behaves in combination with other character classes, see the cells for ideographic characters (cl-19).)

  2. If a prolonged sound mark (cl-10) is allowed to appear at the line head or that of inline cutting note, the character shall be treated as a member of the katakana (cl-16) class. (For how it behaves in combination with other character classes, see the cells for katakana (cl-16).)

  3. If small kana (cl-11) are allowed to appear at the head of a line or that of inline cutting note, the character shall be treated as a member of the hiragana (cl-15) or katakana (cl-16) class accordingly, depending on the script type of the character. (For how it behaves in combination with other character classes, see the cells for hiragana (cl-15) or katakana (cl-16).)

  4. A third order opportunity exists for inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, between two consecutive inseparable characters (cl-08) which are of different kinds.

  5. A third order opportunity exists for inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, between the two consecutive characters which belong to different ornamented character complexes (cl-21)

  6. A third order opportunity exists for inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, if the two consecutive characters belong to different simple-ruby character complexes (cl-22). If not, inter-character space expansion is not allowed.

  7. A third order opportunity exists for inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, if the two consecutive base characters belonging to different jukugo-ruby character complexes (cl-23). If not, inter-character space expansion is not allowed.

  8. There is no opportunity for inter-character space expansion between a preceding grouped numeral (cl-24) and a trailing postfixed abbreviation (cl-13), unless the alternative approach is chosen which allows a line to break between a preceding grouped numeral (cl-24) and the trailing PERCENT SIGN "%", where PERCENT SIGN "%" shall be treated as a member of the ideographic character (cl-19) class, in front of which inter-character space is expandable.

  9. There is an alternative way to give a third order opportunity for inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, between a preceding grouped numeral (cl-24) and a trailing Western character (cl-27).

  10. A fourth order opportunity exists for inter-character space expansion between a preceding Western character (cl-27) and a trailing postfixed abbreviation (cl-13), unless the preceding Western character (cl-27) is used as a symbol of a quantity or a European numeral, in which case no inter-character space expansion is allowed between them.

  11. There is an alternative way to give a fourth order opportunity for inter-character space expansion with respect to the corresponding character size, between two consecutive Western characters (cl-27).

  12. A third order opportunity exists for the inter-character space expansion, to take up to a maximum of a quarter em space, with respect to the corresponding character size, if two consecutive characters belong to different runs of characters in tate-chu-yoko (cl-30). If not, inter-character space expansion is not allowed.

Appendix F Positioning of Jukugo-ruby

Positioning of ruby characters is explained in [3.3 Ruby and Emphasis Dots], including that of jukugo-ruby, however it is limited to the basic principles. This appendix provides supplementary notes on jukugo-ruby distribution in terms of the structure of a kanji compound word (jukugo) and the type of script of the characters adjacent to the kanji compound word. All explanations hereafter in this appendix assume we are going to compose ruby characters with 'katatsuki' distribution (top-alignment in vertical writing mode).

F.1 Principles of Jukugo-Ruby Distribution: Part 1

The following are principles of jukugo-ruby distribution, taking account of the structure of a kanji compound word and the type of script of the adjacent characters surrounding the compound word.

  1. Because the purpose of ruby annotation, including jukugo-ruby, is to supply the reading of each base kanji character, attach each run of ruby text which represents the reading to the corresponding base kanji character.

    If each run of ruby characters representing the reading of a given base kanji character participating in a kanji compound word is less than or equal to two, attach each ruby text to the corresponding base kanji character (see [Fig.186]). The following figures supply a pair of examples of the same text per line, with and without character frames displayed. Ruby characters and base characters in the same color indicate that they correspond to each other. Those ideographic characters (cl-19) which are not part of the jukugo-ruby, and which, therefore, ruby characters are not allowed to overhang, are presented in yet different colors.

    Jukugo-ruby distribution 1.
    [Fig.186]: Jukugo-ruby distribution 1.
  2. Because the intention when using jukugo-ruby is to handle the kanji compound word (jukugo) as one object, a run of ruby text for a base kanji character is allowed to overhang the adjacent base character participating to the same compound word, up to a maximum of one em in the ruby character size (alternatively, the maximum size for the ruby overhang can be one and a half em, but the explanation hereafter assumes the maximum overhang to be one em in the ruby size.).

  3. When the number of ruby characters is so great that the length of the ruby text is longer than that of the base characters forming the kanji compound word, the decision to what extent the ruby text can overhang the adjacent characters, if allowed, shall be made according to the methods explained in 3.3.8 Adjustments of Ruby with Length Longer than that of the Base Characters. The explanation hereafter adopts one of the methods that prohibits any ruby text to overhang ideographic characters (cl-19) but allows ruby to overhang the characters of a certain character class such as hiragana (cl-15) or katakana (cl-16), up to a maximum of one em in the ruby character size.

F.2 Principles of Jukugo-Ruby Distribution: Part 2

In letterpress printing, ruby text was composed according to "principles part one" in the previous section, but on a case by case basis. Therefore, ruby texts were often composed differently for the same kanji compound word in the same situation. In some cases they differed according to the person in charge of the composition. In this section, one consistent method of ruby composition is presented as "principles part two", which is established with reference to those adopted in books and other publications.

  1. As long as jukugo-ruby can fit the kanji compound word by allowing ruby characters to overhang an adjacent base character that is part of the same compound word or those characters adjacent to the compound word which are of a certain character class such as hiragana (cl-15), inter-character spacing between base characters in the kanji compound word and that between the kanji compound word and the adjacent characters, should not be expanded.

  2. When a base character is accompanied by more than three ruby characters, decide the positioning of the ruby characters in the following order.

    1. Let a run of ruby text for a given base character overhang either or both of the adjacent base characters up to a maximum of one em in the ruby character size. The first choice should be the succeeding base character. For example, suppose we have a kanji compound word consisting of three ideographic characters, each of which is accompanied by three, two and one ruby characters respectively. In this case, let the third character for the first base character overhang the second base character, by letting the second ruby character for the second base character overhang the third base character (see [Fig.187]). In cases where there is no possible arrangement of ruby characters that follows the first choice, let them overhang the preceding base characters as in the following illustrations. [Fig.188] shows examples where the succeeding base characters overhang, while [Fig.189] are examples where the preceding base characters overhang since there is no arrangement of ruby characters to take the succeeding ones.

      Jukugo-ruby distribution 2.
      [Fig.187]: Jukugo-ruby distribution 2.
      Jukugo-ruby distribution 3 (ruby characters to overhang the succeeding base characters).
      [Fig.188]: Jukugo-ruby distribution 3 (ruby characters to overhang the succeeding base characters).
      Jukugo-ruby distribution 4 (ruby characters to overhang the preceding base characters).
      [Fig.189]: Jukugo-ruby distribution 4 (ruby characters to overhang the preceding base characters).
    2. When there is no possible arrangement of ruby characters even by allowing them to overhang the base characters participating to the same kanji compound word, at maximum by one em in ruby character size, then look at the adjacent characters in the main text surrounding the kanji compound word if they are of such character classes which ruby characters are allowed to overhang as in hiragana (cl-15). If they are, allow ruby characters to overhang these characters. The same principle applies here as to the choice of the adjacent (base or non-base) character to be the succeeding one in priority. For example, suppose we have a kanji compound word consisting of two ideographic characters, each of which is accompanied by three and two ruby characters respectively. In this case, let the third ruby character for the first base character overhang the second base character, by letting the second ruby character for the second base character overhang the succeeding non-base character of hiragana (cl-15) class or the like (see [Fig.190]). The following are some examples. [Fig.191] shows examples of ruby characters that overhang the succeeding non-base characters and [Fig.192] are examples of ruby characters that overhang the preceding non-base characters since there is no possible arrangement to allow ruby characters to take the succeeding ones. The last illustration shows examples of ruby characters that overhang both of the non-base characters adjacent to the kanji compound word (see [Fig.193]).

      Jukugo-ruby distribution 5.
      [Fig.190]: Jukugo-ruby distribution 5.
      Jukugo-ruby distribution 6 (examples of ruby characters to overhang the succeeding non-base characters).
      [Fig.191]: Jukugo-ruby distribution 6 (examples of ruby characters to overhang the succeeding non-base characters).
      Jukugo-ruby distribution 7 (examples of ruby characters to overhang the preceding non-base characters).
      [Fig.192]: Jukugo-ruby distribution 7 (examples of ruby characters to overhang the preceding non-base characters).
      Jukugo-ruby distribution 8 (examples of ruby characters to overhang both of non-base characters).
      [Fig.193]: Jukugo-ruby distribution 8 (examples of ruby characters to overhang both of non-base characters).
    3. If the succeeding character in the main text adjacent to the kanji compound word accompanied by ruby characters is of a character class which does not allow ruby to overhang, as with ideographic characters (cl-19), then look at the preceding character and allow ruby to overhang the character if it is of a character class that allows it. If the examples shown in [Fig.190] or [Fig.191] had ideographic characters (cl-19) next to the kanji compound word accompanied by ruby characters, let ruby characters overhang the preceding character in all cases but two examples with "居候" and "古代紫" (see [Fig.194]).

      Jukugo-ruby distribution 9 (examples of ruby characters to overhang the preceding non-base characters).
      [Fig.194]: Jukugo-ruby distribution 9 (examples of ruby characters to overhang the preceding non-base characters).
    4. When both of the preceding and succeeding characters adjacent to the kanji compound word accompanied by ruby characters are of character classes which do not allow ruby to overhang, or there are too many ruby characters and it is impossible to fit them in using the approaches described above, the last thing to consider is to fit them in by expanding the inter-character spacing of the kanji compound word. There could be various methods, depending on what priorities are chosen, and in fact many different ways have been adopted. The following section will explain one of these methods.

To sum up the aforementioned, first try to let ruby characters overhang other base characters associated with the same kanji compound word, then look for adjacent non-base characters to see if they allow ruby to overhang. Finally when both approaches still cannot settle the positioning of the ruby characters, combine the method of expanding inter-character spacing of the compound word with the previous two.

F.3 Principles of Jukugo-Ruby Distribution with Inter-Character Space Expansion

Principles for a method of jukugo-ruby distribution which allows inter-character spacing to expand, are as follows.

  1. Try first to let ruby characters overhang other base characters associated with the same kanji compound word. If ruby still does not fit, then look for adjacent non-base characters that allow ruby to overhang. Lastly when both approaches still cannot settle the positioning of the ruby characters, combine the method of expanding inter-character spacing of the compound word with the previous two.

  2. Inter-character spacing can be expanded only for those base characters which are accompanied by more than two ruby characters.

  3. The total amount of inter-character spacing should be determined as follows.

    Total inter-character space = (the sum of the length of those ruby characters forced out from the corresponding base character) - (the sum of the length of those ruby characters which overhang other base characters) - (the sum of the length of those ruby characters which overhang other non-base characters).

  4. Distribute the amount of space across those base characters accompanied by more than two ruby characters in accordance with the number of ruby characters (or the length of ruby characters when set solid).

  5. For each base character, expand the preceding and succeeding inter-character space equally by half of the assigned space. Note that, depending on the position of the base character, it could be the expansion of inter-character spacing between two base characters, or it could be the expansion between the base character and a non-base character.

  6. In a special case where a jukugo-ruby character complex is at the line-head and the base character and the ruby character at the line-head are supposed to be aligned, expand only the succeeding inter-character space of the base character by all of the assigned space. Similarly, when a jukugo-ruby character complex is at the line-end and the base character and the ruby character at the line-end are supposed to be aligned, expand the preceding inter-character space of the base character by all of the assigned space.

F.4 Examples of Jukugo-Ruby Distribution with Inter-Character Space Expansion

The following are examples of jukugo-ruby distribution in accordance with the principles mentioned in the previous section.

The jukugo-ruby shown in [Fig.195] requires expansion of inter-character spacing by one ruby character. Expand the preceding and succeeding inter-character spacing for the base character "峻" accompanied by three ruby characters by a quarter em each in base character size.

Jukugo-ruby distribution with inter-character space expansion 1.
[Fig.195]: Jukugo-ruby distribution with inter-character space expansion 1.

The jukugo-ruby shown in [Fig.196] requires expansion of inter-character spacing by one ruby character. Expand the preceding and the succeeding inter-character spacing for the base character "候" accompanied by four ruby characters by a quarter em each in base character size.

Jukugo-ruby distribution with inter-character space expansion 2.
[Fig.196]: Jukugo-ruby distribution with inter-character space expansion 2.

[Fig.197] are examples with the same jukugo-ruby as in [Fig.195] and [Fig.195] except their positions in the lines are at the line-head or at the line-end, where the base character and the ruby character at the line-head or at the line-end are both aligned.

Jukugo-ruby distribution with inter-character space expansion 3 (examples at the line-head and at the line-end).
[Fig.197]: Jukugo-ruby distribution with inter-character space expansion 3 (examples at the line-head and at the line-end).

Examples shown in [Fig.198] are those with two base characters accompanied by six ruby characters. Distribution of ruby varies depending on the position of ideographic characters (cl-19) and whether it is adjacent to the kanji compound word.

Jukugo-ruby distribution with inter-character space expansion 4.
[Fig.198]: Jukugo-ruby distribution with inter-character space expansion 4.

Examples shown in [Fig.199] are those with three base characters accompanied by eight ruby characters. Distribution of ruby varies depending on the position of ideographic characters (cl-19) and whether it is adjacent to the kanji compound word.

Jukugo-ruby distribution with inter-character space expansion 5.
[Fig.199]: Jukugo-ruby distribution with inter-character space expansion 5.

Examples shown in [Fig.200] are those with three base characters accompanied by nine ruby characters. Distribution of ruby varies depending on the position of ideographic characters (cl-19) and whether it is adjacent to the kanji compound word.

Jukugo-ruby distribution with inter-character space expansion 6.
[Fig.200]: Jukugo-ruby distribution with inter-character space expansion 6.

Appendix G Terminology

The definitions with (JIS Z 8125) and (JIS X 4051) at the end are those adopted for the corresponding terms by JIS Z 8125 ("Graphic arts - Glossary - Digital printing terms") and JIS X 4051 ("Formatting rules for Japanese documents") respectively.

TerminologyJapaneseRomanized transliterationDefinition
back matter後付atozukeAppendices, supplements, glossary of terms, index and/or bibliography, and so on, appended at the end of a book.
base character親文字oya mojiA character to be annotated by ruby, ornament characters, or emphasis dots.
base line並び線narabi senA virtual line on which almost all glyphs in Western fonts are designed to be aligned. (JIS Z 8125)
bibliography参考文献sankō bunkenA list of works and papers related to the subjects in the text. (JIS Z 8125)
blank page白ページshiro pējiAn empty page.
bleed裁切りtachikiriTo print a picture or a tint to run off the edge of a trimmed page. (JIS Z 8125)
block direction行送り方向gyō okuri hōkōThe direction lines progress, one after the other. (JIS Z 8125)
bound on the left-hand side左綴じhidari tojiBinding of a book to be opened from the left.
bound on the right-hand side右綴じmigi tojiBinding of a book to be opened from the right.
bousen傍線bōsenA line drawn by the left or right side of a character or a run of text in vertical writing mode. (JIS Z 8125)
break (a line)(2行に)分割bunkatsuTo place the first of two adjacent characters at the end of a line and the second at the head of a new line.
captionキャプションkyapushonA title or a short description accompanying a picture, an illustration, or a table. (JIS Z 8125)
centering中央そろえchūō soroeTo align the center of a run of text that is shorter than a given line length to the center of a line. (JIS Z 8125)
character advance字幅jihabaSize of a character frame in the inline direction, generally indicated as a ratio of the size of a full-width character, as in full-width, half-width, or quarter em width. Character advance is the width of a given character in horizontal writing-mode, while it is the height in vertical writing-mode.
character frame(文字の)外枠sotowakuRectangular area occupied by a character when it is set solid.
character shape字形jikeiIncarnation of a character by handwriting, printing or rendering to a computer screen. (JIS Z 8125)
character size文字サイズmoji saizuDimensions of a character. Unless otherwise noted, it refers to the size of a character frame in the block direction.
characters not ending line行末禁則文字gyōmatsu kinsoku mojiAny character for which "line-end prohibition rule" is invoked. (JIS Z 8125)
characters not starting line行頭禁則文字gyōtō kinsoku mojiAny character for which "line-start prohibition rule" is invoked. (JIS Z 8125)
columndanA partition on a page in multi-column format. (JIS Z 8125)
column gap段間dankanAmount of space between columns on a page. (JIS Z 8125)
composition組版kumihanProcess of arrangement of text, figures and/or pictures, etc on a page in a desired layout (design) in preparation for printing.
compound word (jukugo)熟語jukugoA combination of two or more kanji characters which makes one word.
continuous pagination通しノンブルtōshi nonburua) To number the pages of a book continuously across all those in the front matter, the text and the back matter.
b) To number the pages continuously across those of all books, such as a series published in separate volumes. Also to number the pages continuously across those of all issues of a periodical published in a year, aside from pagination per issue.
(JIS Z 8125)
descender lineディセンダラインdisenda rainA descender is the part of a letter extending below the base line, as in 'g', 'j', 'p', 'q', or 'y'. A descender line is a virtual line drawn at the bottom of descender parallel to base line.
double running head method両柱方式ryōbashira hōshikiA method that prints running heads on both even and odd pages. (JIS Z 8125)
emphasis dots圏点kentenSymbols attached alongside a run of base characters to emphasize them. (JIS Z 8125)
endnote後注kōchūA set of notes placed at the end of a part, chapter, section, paragraph and so on, or at the end of a book. (JIS Z 8125)
European numeralsアラビア数字arabia sūjiAny of the symbols in [0-9] used to represent numbers. (JIS Z 8125)
even inter-character spacing均等割りkintō wariA text setting with uniform inter-character spacing per line so that each line is aligned on the same line-head and line-end. (JIS Z 8125)
even spacing distribution均等そろえkintō soroeTo align a text to both edges of a line by inserting the same amount of space between characters.
even tsumegumi均等詰めkintō zumeAdjustment of inter-character space by subtracting the same amount of space. (JIS Z 8125)
face tsumegumi字面詰めjizura zumeAdjustment of inter-character space by subtracting space to the extent that two letter faces are placed adjacent. (JIS Z 8125)
fixed inter-character spacingアキ組aki gumiA text setting with a uniform inter-character spacing. (JIS Z 8125)
fixed-widthモノスペースmonosupēsuA characteristic of a font where the same character advance is assigned for all glyphs. (JIS Z 8125)
fontフォントfontoA set of character glyphs of a given typeface. (JIS Z 8125)
footnote脚注kyakuchūA note in a smaller face than that of main text, placed at the bottom of a page. (JIS Z 8125)
fore-edge小口koguchia) The three front trimmed edges of pages in a book.
b) The opposite sides of the gutter in a book.
(JIS Z 8125)
front matter前付maezukeThe first part of a book followed by the text, usually consisting of a forward, preface, table of contents, list of illustrations, acknowledgement and so on.
full-width全角zenkakua) Relative index for the length which is equal to a given character size.
b) Character frame which character advance is equal to the amount referred to as a). A full-width character frame is square in shape by definition.
furigana振り仮名furiganaA method of ruby annotation using kana characters to indicate how to read kanji characters. This term derives from a Japanese verb "furu (to attach alongside)" and "kana", and has been used synonymously with "ruby". This document prefers to use the term "ruby".
furikanji振り漢字furikanjiA method of ruby annotation using Kanji characters for ruby instead of kana characters.
furiwake振分けfuriwakeA method of placing multiple runs of text in a line. (JIS Z 8125)
general-ruby総ルビsō rubiA method of ruby annotation that attaches ruby text for all Kanji characters in the text. (JIS Z 8125)
group-rubyグループルビgurūpu rubiA method of ruby character distribution such that the length of ruby text matches to that of the base text by giving the same adjusted amount of space between ruby characters.
gutterのどnodoa) The binding side of a spread of a book.
b) the margin between the binding edge of a book and the hanmen (text area).
c) The part of a book where all pages are bound together to the book spine.
(JIS Z 8125)
half em二分nibuHalf of the full-width size. (JIS Z 8125)
half em space二分アキnibu akiAmount of space that is half size of em space.
half-width半角hankakuCharacter frame which has a character advance of a half em.
hanmen (page content area)版面hanmenActual printed area in a page excluding the margins. (note: Running heads and page numbers are not part of hanmen.) (JIS Z 8125)
han-tobira半扉hantobiraA simplified version of naka-tobira, the verso side of which text of the new part starts. (JIS Z 8125)
heading見出しmidashia) A title of a paper or an article.
b) A title for each section of a book, paper or article.
(JIS Z 8125)
horizontal writing mode横組yokogumiThe process or the result of arranging characters on a line from left to right, of lines on a page from top to bottom, and/or of columns on a page from left to right. (JIS Z 8125)
hyphenationハイフネーションhaifunēshonA method of breaking a line by dividing a Western word at the end of a line and adding a hyphen at the end of the first half of the syllable.
ideographic numerals漢数字kansūjiIdeographic characters representing numbers.
illustrations図版zuhanA general term referring to a diagram, chart, cut, figure, picture and the like, to be used for printed materials.
independent pagination別ノンブルbetsu nonburuTo number the pages of the front matter, the text and the back matter independently. (JIS Z 8125)
index索引sakuinA list of terms or subjects with page numbers for where they are referred to in a single or multiple volumes of a book. (JIS Z 8125)
inline direction字詰め方向jizume hōkōText direction in a line. (JIS Z 8125)
inseparable characters rule分離禁止bunri kinshiA line adjustment rule that prohibits inserting any space between specific combinations of characters. (JIS Z 8125)
inter-character space字間jikanAmount of space between two adjacent character frames on the same line.
itemization箇条書きkajō gakiTo list ordered or unordered items one under the other. (JIS Z 8125)
Japanese and Western mixed text composition和欧文混植waō konshokuTo mix Japanese text and Western text in the same composition.
Japanese characters和文文字wabun mojiCharacters used to compose Japanese text.
Japanese gothic faceゴシック体goshikku taiA Japanese typeface, with strokes almost the same in thickness, and no special ornament on a stroke such as a triangular element commonly seen in the Mincho typeface. Used for text emphasis and/or headings.
jidori字取りjidoriA method of aligning a run of text to both edges which is specified by a position to start and the length calculated by a specified number of a given size of characters. (JIS Z 8125)
Jouyou Kanji Table常用漢字表jōyō kanji hyōThe official list of Kanji characters "for general use in society. such as in legal and official documents, newspapers, magazines, broadcasting and the like". It was established in 1981 as a reference guide for people in composing contemporary Japanese. It listed 1,945 of Kanji characters together with their orthographic shapes, Japanese native reading (Kun), Chinese derived reading (On) and other useful information.
jukugo-ruby熟語ルビjukugo rubiA method of ruby character distribution determined by two functions, one is to provide reading for each Kanji character, the other is to give a united appearance attached to a word.
kanbun composition漢文kanbunChinese classic text (or text in the same style) with various auxiliary symbols so that it can be read as Japanese text.
katatsuki (katatsuki-ruby)肩付き(肩付きルビ)katatsuki (katatsuki rubi)A method of attaching ruby at the upper right of each base character. (JIS Z 8125)
kihon-hanmen基本版面kihon hanmenThe default dimensions of the main area of a typeset page specified by text direction, number of columns, character size, number of characters in a line, number of lines in a column, inter-line spacing and inter-column spacing. (JIS X 4051)
letter face字面jizuraArea in which glyph is drawn. (JIS Z 8125)
letterpress printing活字組版katsuji kumihanThe traditional printing method using movable type.
line adjustment行の調整処理gyō no chōsei shoriA method of aligning both edges of all lines to be the same given length by removing or adding adjustable spaces.
line adjustment by hanging punctuationぶら下げ組burasage gumiA line breaking rule to avoid commas or full stops at a line head (which is prohibited in Japanese typography) by taking them back to the end of the previous line beyond the specified line length. (JIS Z 8125)
line adjustment by inter-character space expansion追出し処理oidashi shoriA line breaking rule that aligns both edges of a line by expanding inter-character spaces. (JIS Z 8125).
line adjustment by inter-character space reduction追込み処理oikomi shoriA line breaking rule that aligns both edges of a line by removing adjustable spaces such as conditional spaces for punctuation marks. (JIS Z 8125).
line breaking rules禁則処理kinsoku shoriA set of rules to avoid prohibited layout in Japanese typography, such as "line-start prohibition rule", "line-end prohibition rule", inseparable or unbreakable character sequences and so on. (JIS Z 8125)
line end行末gyōmatsuThe position at which a line ends. (JIS Z 8125)
line end alignment行末そろえgyōmatsu soroeTo align a run of text to the line end. (JIS Z 8125)
line end indent字上げjiageTo reserve a certain amount of space before the default position of a line end. (JIS Z 8125)
line-end prohibition rule行末禁則gyōmatsu kinsokuA line breaking rule that prohibits specific characters at a line end. (JIS Z 8125)
line feed行送りgyō okuriThe distance between two adjacent lines measured by their reference points. (JIS Z 8125)
line gap行間gyoukanThe smallest amount of space between adjacent lines.
line head行頭gyoutouThe position at which a line starts. (JIS Z 8125)
line head alignment行頭そろえgyōtō soroeTo align a run of text to the line head. (JIS Z 8125)
line head indent字下げjisageTo reserve a certain amount of space after the default position of a line head. (JIS Z 8125)
line length行長gyōchōLength of a line with a pre-defined number of characters. When the line is indented at the line head or the line end, it is length of the line from the specified amount of line head indent to the specified amount of line end indent.
line-start prohibition rule行頭禁則gyōtō kinsokuA line breaking rule that prohibits specific characters at a line head. (JIS Z 8125)
main text本文honbuna) The principal part of a book, usually preceded by the front matter, followed by the back matter.
b) The principal part of an article excluding figures, tables, heading, notes, leads and so on.
c) The content of a page excluding running heads and page numbers.
d) The net contents of a book excluding covers, end papers, insets and so on.
(JIS Z 8125)
matrix母型bokeiA metal mold from which movable type is cast.
mixed text composition混植konshokua) To interleave Japanese text with Western text in a line (Japanese and Western mixed text composition).
b) To compose text with different sizes of characters (mixed size composition).
c) To compose text with different typefaces (mixed typeface composition).
(JIS Z 8125)
mono-rubyモノルビmono rubiA method of ruby distribution where a run of ruby text is attached to each base character. (JIS Z 8125)
multi-column format段組dangumiA format of text on a page where text is divided into two or more sections (columns) in the inline direction and each column is separated by a certain amount of space (column space). (JIS Z 8125)
multivolume work多巻物takanmonoA set of work published in two or more volumes, as in the complete work or the first/last half volumes.
naka-tobira中扉naka tobiraA recto or a page inserted to divide two different parts in a book. It often has a title or other text to describe the new part. (JIS Z 8125)
nakatsuki (nakatsuki-ruby)中付き(中付きルビ)nakatsuki (nakatsuki rubi)A method of ruby character distribution where each ruby character is aligned to the vertical center of the corresponding base character in vertical writing mode, or to the horizontal center of the base character in horizontal writing mode. (JIS Z 8125)
new recto改丁kaichōTo start a new heading or something on a odd page. (JIS Z 8125)
notechūExplanatory information added to terms, figures or tables. (JIS Z 8125)
number of characters per line字詰めjizumeNumber of characters in a line to specify the length of lines. (JIS Z 8125)
number of columns段数dansūNumber of columns on a page. (JIS Z 8125)
one em space全角アキzenkaku akiAmount of space that is full-width size.
original pattern原図genzuAn original drawn pattern of a character image to be used for a printing type or a digitized glyph.
ornament characters添え字soejiA superscript or subscript attached to a base character. (JIS Z 8125)
pageページpējiA side of a sheet of paper in a written work such as a book. (JIS Z 8125)
page break改ページkai pējiTo end a page even if it is not full and to start a new page with the next paragraph, a new heading and so on. (JIS Z 8125)
page format組体裁kumi teisaiThe layout and presentation of a page with text, graphics and other elements for a publication such as a book.
page numberノンブルnonburuA sequential number to indicate the order of pages in a publication. (JIS Z 8125)
para-rubyパラルビpara rubiA method of ruby annotation where ruby text is only attached to selected Kanji characters in the text. (JIS Z 8125)
paragraph段落danrakuA group of sentences to be processed for line composition. A paragraph consists of one or more lines. (JIS Z 8125)
paragraph break改行kaigyōTo start a new line to indicate a new paragraph.
paragraph format段落整形danraku seikeiA format of a paragraph, as in line head indent or line end indent.
pointポイントpointoA measurement unit of character size. 1 point is equal to 0.3514mm (see JIS Z 8305). There is another unit to measure character sizes called Q, where 1Q is equivalent to 0.25mm.
printing types活字katsujiMovable type used for letterpress printing.
proportionalプロポーショナルpuropōshonaruA characteristic of a font where character advance is different per glyph. (JIS Z 8125)
punctuation marks約物yakumonoA general term referring to the symbols used in text composition to help make the meaning of text clearer, as in commas, full stops, question marks, brackets, diereses and so on. (JIS Z 8125)
quarter em四分shibuQuarter size of full-width.
quarter em space四分アキshibu akiAmount of space that is a quarter of an em space in size.
quarter em width四分角shibu kakuCharacter frame which has a character advance of a quarter em. (JIS X 4051)
quotation引用文in-yōbunExcerps from other published works. (JIS Z 8125)
reference marks合印aijirushiA symbol or short run of text attached to a specific part of text, to which notes are provided followed by the corresponding marks.
reverse pagination逆ノンブルgyaku nonburuNumbering pages of a book backwards. (JIS Z 8125)
Roman numeralsローマ数字rōma sūjiNumerals represented by upper case or lower case of Latin letters. (JIS Z 8125)
rubyルビrubiSupplementary small characters indicating pronunciation, meaning, etc. for the character or the block of characters they annotate. (JIS Z 8125) (Sometimes these annotations are referred to as "furigana".)
running headhashiraA page element which contains information on the title of the book, chapter, section and so on, printed outside the area of the hanmen. (JIS Z 8125)
single line alignment methodそろえsoroeTo align a run of text that is shorter than a given line length to designated positions.
single running head method片柱方式katabashira hōshikiA method that puts running heads only on odd pages. (JIS Z 8125)
small kana小書きの仮名kogaki no kanaKana with smaller letter faces to be used mainly for representing contracted sounds or prolonged vowels. (JIS Z 8125)
solid settingベタ組beta gumiTo arrange characters with no inter-character space between adjacent character frames.
spaceアキakiAmount of space between adjacent characters or lines. It also refers to the blank area between the edges of a hanmen or an illustration and text or other hanmen elements.
spread見開きmihirakiAny two facing pages when opening a book and the like. (JIS Z 8125)
subscript (inferior)下付きshitatsukiSmaller face of characters, attached to the lower right or the lower left of a normal size character. (JIS Z 8125)
superscript (superior)上付きuwatsukiSmaller face of characters, attached to the upper right or the upper left of a normal size character. (JIS Z 8125)
tab settingタブ処理tabu shoriA method of line composition to align one or more runs of text to designated positions on a line.
tablehyōFormatted data consisting of characters or numbers, arranged in cells and sometimes divided by lines, in order to present the data in a way that is easier to understand. (JIS Z 8125)
table of contents目次mokujiA list of headings of contents of a book in page order or arranged by subjects, with page numbers on which each section begins. (JIS Z 8125)
tate-chu-yoko縦中横tate chū yokoTo typeset a (small) group of characters horizontally within a vertical line of main text.
tentsuki天付きtentsukia) To remove conditional space from opening brackets at a line head to align the line head to the ones of the adjacent lines.
b) Not to add the default line head indent for the first line of a paragraph so as to align all line heads.
(JIS Z 8125)
text direction組方向kumi hōkōHorizontal setting or vertical setting. (JIS Z 8125)
Touyou Kanji Table当用漢字表tōyō kanji hyōThe official list of Kanji characters established in 1946, which was designed to restrict the Kanji characters for general use in society to only those 1850 specified in the list. The list together with other related tables was superseded by the Jouyou Kanji Table.
trim size仕上りサイズshiagari saizuDimensions of a full page in a publication, including margins. (JIS Z 8125)
tsumegumi詰め組tsumegumiAdjustment of inter-character space by making the distance between the letter face of adjacent characters shorter than that produced by solid setting. (JIS Z 8125)
type-picking文選bunsenTo select metal type for characters needed to print a manuscript. (Metal type is stored in a type case, but because the number of Japanese characters is very large, an extra operation was invented that involves collecting type in a so-called 'bunsen box' before typesetting a manuscript using a composing stick.)
typeface書体shotaiA set of letters or symbols, which are designed to have coherent patterns to be used for printing or rendering to a computer screen. (JIS Z 8125)
unbreakable characters rule分割禁止bunkatsu kinshiA line breaking rule that prohibits breaking a line between consecutive dashes or leaders, or between other specific combinations of characters.
underline下線kasenA line drawn under a character or a run of text in horizontal writing mode. (JIS Z 8125)
vertical writing mode縦組tate gumiThe process or the result of arranging characters on a line from top to bottom, of lines on a page from right to left, and/or of columns on a page from top to bottom. (JIS Z 8125)
warichu (inline cutting note)割注warichūA note of two or more lines inserted in the text. It includes brackets which surround the note (JIS Z 8125)
widowウィドウwidōThe term in Western text layout to describe that the last line of a paragraph with only a few words appears at the top of a new page or a column. (JIS Z 8125)
widow adjustment段落末尾処理danraku matsubi shoriA method of line composition to adjust lines in a paragraph so that the last line consists of more than a given number of characters.

Appendix H References

Appendix I Revision Log

This is the first publication of this document as a Working Group Note.

Appendix J Acknowledgements

This document has been developed with contributions from participants of the Japanese Layout Taskforce.