TTML Text and Image Profiles for Internet Media Subtitles and Captions 1.0

Abstract

This document specifies two profiles of [TTML1]: a text-only profile and an image-only profile. These profiles are intended to be used across subtitle and caption delivery applications worldwide, thereby simplifying interoperability, consistent rendering and conversion to other subtitling and captioning formats. The text profile is a superset of [ttml10-sdp-us].

The document defines extensions to [TTML1], as well as incorporates extensions specified in [ST2052-1] and [EBU-TT-D].

Both profiles are based on [SUBM].

6. Common Constraints

6.1 Document Encoding

A Document Instance SHALL use UTF-8 character encoding as specified in [UNICODE].

6.2 Foreign Element and Attributes

A Document Instance MAY contain elements and attributes that are neither specifically permitted nor forbidden by a profile.

6.3 Namespaces

The following namespaces (see [xml-names]) are used in this specification:

Name	Prefix	Value	Defining Specification
XML	xml	http://www.w3.org/XML/1998/namespace	[xml-names]
TT Parameter	ttp	http://www.w3.org/ns/ttml#parameter	[TTML1]
TT Styling	tts	http://www.w3.org/ns/ttml#styling	[TTML1]
TT Feature	none	http://www.w3.org/ns/ttml/feature/	[TTML1]
SMPTE-TT Extension	smpte	http://www.smpte-ra.org/schemas/2052-1/2010/smpte-tt	[ST2052-1]
EBU-TT Styling	ebutts	urn:ebu:tt:style	[EBU-TT-D]
IMSC 1.0 Styling	itts	http://www.w3.org/ns/ttml/profile/imsc1#styling	This specification
IMSC 1.0 Parameter	ittp	http://www.w3.org/ns/ttml/profile/imsc1#parameter	This specification
IMSC 1.0 Metadata	ittm	http://www.w3.org/ns/ttml/profile/imsc1#metadata	This specification
IMSC 1.0 Extension	none	http://www.w3.org/ns/ttml/profile/imsc1/extension/	This specification
IMSC 1.0 Text Profile Designator	none	http://www.w3.org/ns/ttml/profile/imsc1/text	This specification
IMSC 1.0 Image Profile Designator	none	http://www.w3.org/ns/ttml/profile/imsc1/image	This specification

The namespace prefix values defined above are for convenience and document instances MAY use any prefix value that conforms to [xml-names].

The namespaces defined by this specification are mutable [namespaceState]; all undefined names in these namespaces are reserved for future standardization by the W3C.

6.4 Overflow

A Document Instance SHOULD be authored assuming strict clipping of content that falls out of region areas, regardless of the computed value of tts:overflow for the region.

Note

As specified in [TTML1], tts:overflow has no effect on the extent of the region, and hence the total normalized drawing area S(En) at 9.3 Paint Regions.

6.6 Synchronization

Each intermediate synchronic document of the Document Instance is intended to be displayed on a specific frame and removed on a specific frame of the related video object.

When mapping a media time expression M to a frame F of a related video object, e.g. for the purpose of rendering a Document Instance onto the related video object, the presentation processor SHALL map M to the frame F with the presentation time that is the closest to, but not less, than M.

Note

In typical scenario, the same video program (the related video object) will be used for Document Instance authoring, delivery and user playback. The mapping from media time expression to related video object above allows the author to precisely associate subtitle video content with video frames, e.g. around scene transitions. In circumstances where the video program is downsampled during delivery, the application can specify that, at playback, the relative video object be considered the delivered video program upsampled to is original rate, thereby allowing subtitle content to be rendered at the same temporal locations it was authored.

If ttp:frameRate is specified, then the product of ttp:frameRate and ttp:frameRateMultiplier SHALL be the frame rate of the related video object.

Note

A document can be made independent of the frame rate of the related video object by never using the frames term in a time expression: as specified in 6.10 Features, ttp:frameRate is required only if the document includes one or more time expressions that uses the frames term.

6.7 Extensions

6.7.1 ittp:aspectRatio

The ittp:aspectRatio attributes allows authorial control of the mapping of the root container of a Document Instance to the related video object frame.

If present, the ittp:aspectRatio attribute SHALL conform to the following syntax:

ittp:aspectRatio
  : numerator denominator                                               // numerator != 0; denominator != 0
        
numerator | denominator
  : <digit>+

The root container of a Document Instance SHALL be mapped to the related video object frame according to the following:

If ittp:aspectRatio is present, the root container SHALL be mapped to a rectangular area within the related video object such that:
1. the ratio of the width to the height of the rectangular area is equal to ittp:aspectRatio,
2. the center of the rectangular area is collocated with the center of the related video object frame,
3. the rectangular area (including its boundary) is entirely within the related video object frame (including its boundary), and
4. the rectangular area has a height or width equal to that of the related video object frame.
Otherwise, the root container of a Document Instance SHALL be mapped to the related video object frame in its entirety. If tts:extent is present on the tt element, the extents of the root container SHALL be equal to the dimensions of the related video object frame.

ittp:aspectRatio SHALL NOT be present if tts:extent is present.

An ittp:aspectRatio attribute is considered to be significant only when specified on the tt element.

Example 2

<tt
  xmlns="http://www.w3.org/ns/ttml"
  xmlns:ttm="http://www.w3.org/ns/ttml#metadata" 
  xmlns:tts="http://www.w3.org/ns/ttml#styling"
  xmlns:ttp="http://www.w3.org/ns/ttml#parameter" 
  xmlns:ittp="http://www.w3.org/ns/ttml/profile/imsc1#parameter"
  ittp:aspectRatio="4 3"
 >
 ...
</tt>

Note

As specified in Section 6.10 Features, tts:extent is present if the px length measure is used anywhere within the document.

Integer pixel positions on the related video object frame computed from real percentage length values SHALL use half-up rounding, i.e. round(x) = floor(x+0.5).

6.7.2 ittp:progressivelyDecodable

A progressively decodable Document Instance is structured to facilitate presentation before the document is received in its entirety, and can be identified using ittp:progressivelyDecodable attribute.

A progressively decodable Document Instance is a Document Instance that conforms to the following:

no attribute or element of the TTML timing vocabulary is present within the head element;
given two intermediate synchronic documents A and B of the Document Instance, with start times TA and TB, respectively, TA is not greater than TB if A includes a p element that occurs earlier in the document than any p element that B includes;
no attribute of the TTML timing vocabulary is present on a descendant element of p; and
no element E1 explicitly references another element E2 where the opening tag of E2 occurs after the opening tag of E1.

If present, the ittp:progressivelyDecodable attribute SHALL conform to the following syntax:

ittp:progressivelyDecodable
  : "true"
  | "false"

An ittp:progressivelyDecodable attribute is considered to be significant only when specified on the tt element.

If not specified, the value of ittp:progressivelyDecodable SHALL be considered to be equal to "false".

A Document Instance for which the computed value of ittp:progressivelyDecodable is "true" SHALL be a progressively decodable Document Instance.

A Document Instance for which the computed value of ittp:progressivelyDecodable is "false" is neither asserted to be a progressively decodable Document Instance nor asserted not to be a progressively decodable Document Instance.

Example 3

<tt
  xmlns="http://www.w3.org/ns/ttml"
  xmlns:ttm="http://www.w3.org/ns/ttml#metadata" 
  xmlns:tts="http://www.w3.org/ns/ttml#styling"
  xmlns:ttp="http://www.w3.org/ns/ttml#parameter" 
  xmlns:ittp="http://www.w3.org/ns/ttml/profile/imsc1#parameter"
  ittp:progressivelyDecodable="true"
 >
 ...
</tt>

Note

[TTML1] specifies explicitly referencing of elements identified using xml:id in the following circumstances:

an element in body referencing region elements. In this case, Requirement 4 above is always satisfied.
an element in body referencing style elements. In this case, Requirement 4 above is always satisfied.
a region element referencing style elements. In this case, Requirement 4 above is always satisfied.
a style element referencing other style elements. In this case, Requirement 4 provides an optimization of style element ordering within the head element.
a ttm:actor element referencing a ttm:agent element. In this case, Requirement 4 provides optimization of metadata elements ordering within the document.
a content element referencing ttm:agent elements using the ttm:agent attribute. In this case, Requirement 4 provides optimization of metadata elements ordering within the document.

6.7.3 itts:forcedDisplay

itts:forcedDisplay allows the processor to override the computed value of tts:visibility attribute in conjunction with an application parameter displayForcedOnlyMode.

If and only if the value of displayForcedOnlyMode is "true", a content element with a itts:forcedDisplay computed value of "false" SHALL NOT produce any visible rendering, but still affect layout, regardless of the computed value of tts:visibility.

The itts:forcedDisplay attribute shall conform to the following:

Values:	`false \| true`
Initial:	`false`
Applies to:	`body`, `div`, `p`, `region`, `span`
Inherited:	yes
Percentages:	N/A
Animatable:	discrete

Annex C. Forced content (non-normative) illustrates the use of itts:forcedDisplay in an application in which a single document contains both hard of hearing captions and translated foreign language subtitles, using itts:forcedDisplay to display translation subtitles always, independently of whether the hard of hearing captions are displayed or hidden.

The presentation processor SHALL accept an optional boolean parameter called displayForcedOnlyMode, whose value MAY be set by a context external to the presentation processor. If not set, the value of displayForcedOnlyMode SHALL be assumed to be equal to "false".

The algorithm for setting the displayForcedOnlyMode parameter based on the circumstances under which the Document Instance is presented is left to the application.

Example 4

...
<head>
	...
	<region xml:id="r1" tts:origin="10% 2%" tts:extent="80% 10%" tts:color="white" itts:forcedDisplay="true" tts:backgroundColor="black"/>
	<region xml:id="r2" tts:origin="10% 80%" tts:extent="80% 88%" tts:color="white" tts:backgroundColor="black"/>
	...
</head>
...
<div>
	 <p region="r1" begin="1s" end="6s">Lycée</p>
		
	 <!-- the following will not appear if displayForcedOnlyMode='true' -->
	 <p region="r2" begin="4s" end="6s">Nous étions inscrits au même lycée.</p>
</div>
...

Note

As specified in [TTML1], the background of a region can be visible even if the computed value of tts:visibility equals "hidden" for all active content within. The background of a region for which itts:forcedDisplay equals "true" can therefore remain visible even if itts:forcedDisplay equals "false" for all active content elements within the region and displayForcedOnlyMode equals "true". Authors can avoid this situation, for instance, by ensuring that content elements and the regions that they are flowed into always have the same value of itts:forcedDisplay.

Note

Although itts:forcedDisplay, like all the TTML style attributes, has no defined semantics on a br content element, itts:forcedDisplay will apply to a br content element if it is either defined on an ancestor content element of the br content element or it is applied to a region element corresponding to a region that the br content element is being flowed into.

Note

It is expected that the functionality of itts:forcedDisplay will be mapped to a conditional style construct in a future revision of this specification.

6.7.4 ittm:altText

ittm:altText allows an author to provide a text string equivalent for an element, typically an image. This text equivalent MAY be used to support indexing of the content and also facilitate quality checking of the document during authoring.

The ittm:altText element SHALL conform to the following syntax:

<ittm:altText
  xml:id = ID
  xml:lang = string
  xml:space = (default|preserve)
  {any attribute not in the default namespace, any TT namespace or any IMSC 1.0 namespace}>
  Content: #PCDATA
</ittm:altText>

The ittm:altText element SHALL be a child of the metadata element.

8. Image Profile Constraints specifies the use of the ittm:altText element with images.

Example 5

...
<div region="r1" begin="1s" end="6s" smpte:backgroundImage="1.png">
  <metadata>
  <ittm:altText>Nous étions inscrits au même lycée.</ttm:title>
  </metadata>
</div>
...

Note

In contrast to the common use of alt attributes in [HTML5], the ittm:altText attribute content is not intended to be displayed in place of the element if the element is not loaded. The ittm:altText attribute content can however be read and used by assistive technologies.

6.8 Region

6.8.1 Presented Region

A presented region is a temporally active region that satisfies the following conditions:

the computed value of tts:opacity is not equal to "0.0"; and
the computed value of tts:display is not "none"; and
the computed value of tts:visibility is not "hidden"; and
either (a) content is selected into the region or (b) the computed value of tts:showBackground is equal to "always" and the computed value of tts:backgroundColor has non-transparent alpha.

6.8.2 Dimensions and Position

All regions SHALL NOT extend beyond the root container, i.e. the intersection of the sets of coordinates belonging to a region (including its boundary) and the sets of coordinates belonging to the root container (including its boundary) is the set of coordinates belonging to the region (including its boundary).

No two presented regions in a given intermediate synchronic document SHALL overlap, i.e. the intersection of the sets of coordinates within each region (including its boundary) is empty.

6.8.3 Maximum number

The number of presented regions in a given intermediate synchronic document SHALL NOT be greater than 4.

6.9 Hypothetical Render Model

Any sequence of consecutive intermediate synchronic documents SHALL be reproducible without error by the Hypothetical Render Model specified in Section 9. Hypothetical Render Model.

6.10 Features

Unless specified otherwise,a Document Instance SHALL conform to the following:

Feature	Provisions
Relative to the TT Feature namespace
`#animation`	MAY be used.
`#cellResolution`	MAY be used.
`#clockMode`	SHALL NOT be used.
`#content`	MAY be used.
`#core`	MAY be used.
`#display-block`	MAY be used.
`#display-inline`	MAY be used.
`#display-region`	MAY be used.
`#display`	MAY be used.
`#dropMode`	SHALL NOT be used.
`#extent-region`	MAY be used. The `tts:extent` attribute SHALL be present on all `region` elements.
`#extent-root`	MAY be used. If the document includes any length value that uses the `px` expression, `tts:extent` SHALL be present on the `tt` element.
`#extent`	MAY be used.
`#frameRate`	If the document includes any time expression that uses the frames term, the `ttp:frameRate` attribute SHALL be present on the `tt` element.
`#frameRateMultiplier`	MAY be used.
`#layout`	MAY be used.
`#length-cell`	SHALL NOT be used other than to specify the value of `ebutts:linePadding`.
`#length-integer`	MAY be used.
`#length-negative`	SHALL NOT be used.
`#length-percentage`	MAY be used.
`#length-pixel`	MAY be used.
`#length-positive`	MAY be used.
`#length-real`	MAY be used.
`#length`	MAY be used.
`#markerMode`	SHALL NOT be used.
`#metadata`	MAY be used.
`#opacity`	MAY be used.
`#origin`	MAY be used.
`#overflow`	MAY be used.
`#pixelAspectRatio`	SHALL NOT be used.
`#presentation`	MAY be used.
`#profile`	MAY be used.
`#showBackground`	MAY be used.
`#structure`	MAY be used.
`#styling-chained`	MAY be used.
`#styling-inheritance-content`	MAY be used.
`#styling-inheritance-region`	MAY be used.
`#styling-inline`	MAY be used.
`#styling-nested`	MAY be used.
`#styling-referential`	MAY be used.
`#styling`	MAY be used.
`#subFrameRate`	SHALL NOT be used.
`#tickRate`	MAY be used. `ttp:tickRate` SHALL be present on the `tt` element if the #time-offset-with-ticks feature is used in the document.
`#timeBase-clock`	SHALL NOT be used.
`#timeBase-media`	SHALL be used. `ttp:timeBase` SHALL be present on the `tt` element and SHALL be equal to "media".
`#timeBase-smpte`	SHALL NOT be used.
`#time-clock-with-frames`	MAY be used.
`#time-clock`	MAY be used.
`#time-offset-with-frames`	MAY be used.
`#time-offset-with-ticks`	MAY be used.
`#time-offset`	MAY be used.
`#timeContainer`	MAY be used.
`#timing`	MAY be used. All time expressions within a Document Instance SHOULD use the same syntax, either `clock-time` or `offset-time`.
`#transformation`	MAY be used.
`#unicodeBidi`	MAY be used.
`#visibility-block`	MAY be used.
`#visibility-inline`	MAY be used.
`#visibility-region`	MAY be used.
`#visibility`	MAY be used.
`#writingMode-horizontal-lr`	MAY be used.
`#writingMode-horizontal-rl`	MAY be used.
`#writingMode-horizontal`	MAY be used.
`#writingMode`	MAY be used.
`#zIndex`	MAY be used.
Extension	Provisions
Relative to the IMSC 1.0 Extension namespace
`#aspectRatio`	MAY be used.
`#forcedDisplay`	MAY be used.
`#progressivelyDecodable`	MAY be used.
`#altText`	MAY be used.

Note

As specified in [TTML1], a #time-offset-with-frames expression is translated to a media time M according to M = 3600 · hours + 60 · minutes + seconds + (frames ÷ (ttp:frameRateMultiplier · ttp:frameRate)).

7. Text Profile Constraints

7.1 Profile Designator

This profile is associated with the following profile designator:

Profile Name	Profile Designator
IMSC 1.0 Text	`http://www.w3.org/ns/ttml/profile/imsc1/text`

Note

As specified in 6.10 Features, the presence of the ttp:profile attribute is not required by this profile. The profile designator specified above is intended to be generally used to signal conformance of a Document Instance to the profile. The details of such signaling depends on the application, and can, for instance, use metadata structures out-of-band of the Document Instance.

7.2 Recommended Character Sets

A Document Instance SHOULD be authored using characters selected from the sets specified in B. Recommended Character Sets.

7.3 Reference Fonts

The flow of text within a region depends the dimensions and spacing (kerning) between individual glyphs. The following allows, for instance, region extents to be set such that text flows without clipping.

When processing glyphs that match the combinations of computed font family and code point listed in A. Reference Fonts, e.g. during layout, a presentation processor or transformation processor SHALL use glyph metrics equal to the metrics of the specified reference font, unless the glyph is not defined by the reference font.

Note

Implementations can use fonts other than those specified in A. Reference Fonts. Two fonts with equal metrics can have a different appearance, but flow identically.

7.4 Features

The Document Instance SHALL conform to the following table:

Feature	Provisions
Relative to the TT Feature namespace
`#backgroundColor-block`	MAY be used.
`#backgroundColor-inline`	MAY be used.
`#backgroundColor-region`	MAY be used.
`#backgroundColor`	MAY be used.
`#bidi`	MAY be used.
`#color`	MAY be used. The initial value of `tts:color` SHALL be "white". NOTE: This is consistent with [ST2052-1].
`#direction`	MAY be used.
`#displayAlign`	MAY be used. The initial value of `tts:displayAlign` SHALL be "after" for the Default Region. NOTE: This is consistent with [ST2052-1].
`#extent-region`	The `tts:extent` attribute when applied to a region element SHALL use `px` units or "percentage" representation, and SHALL NOT use `em` units.
`#fontFamily-generic`	MAY be used. A `tts:fontFamily` of either "monospaceSerif" or "proportionalSansSerif" SHOULD be specified for all presented text content. A tts:fontFamily of "default" SHALL be equivalent to "monospaceSerif".
`#fontFamily-non-generic`	MAY be used.
`#fontFamily`	MAY be used.
`#fontSize-anamorphic`	SHALL NOT be used.
`#fontSize-isomorphic`	MAY be used.
`#fontSize`	MAY be used.
`#fontStyle-italic`	MAY be used.
`#fontStyle-oblique`	MAY be used.
`#fontStyle`	MAY be used.
`#fontWeight-bold`	MAY be used.
`#fontWeight`	MAY be used.
`#length-em`	MAY be used.
`#lineBreak-uax14`	MAY be used.
`#lineHeight`	MAY be used. An explicit `<length>` SHOULD be specified as there is no uniform implementation of the "normal" value at the time of this writing.
`#nested-div`	MAY be used.
`#nested-span`	MAY be used.
`#origin`	The `tts:origin` attribute SHALL use `px` units or "percentage" representation, and SHALL NOT use `em` units.
`#padding-1`	MAY be used.
`#padding-2`	MAY be used.
`#padding-3`	MAY be used.
`#padding-4`	MAY be used.
`#padding`	MAY be used.
`#textAlign-absolute`	MAY be used.
`#textAlign-relative`	MAY be used.
`#textAlign`	MAY be used. The initial value of `tts:textAlign` SHALL be "center" for the default region. NOTE: This is consistent with [ST2052-1].
`#textDecoration-over`	MAY be used.
`#textDecoration-through`	MAY be used.
`#textDecoration-under`	MAY be used.
`#textDecoration`	MAY be used.
`#textOutline-blurred`	SHALL NOT be used.
`#textOutline-unblurred`	MAY be used.
`#textOutline`	MAY be used. If specified, the border thickness SHALL be 10% or less than the associated font size.
`#wrapOption`	MAY be used.
`#writingMode-vertical`	MAY be used.
Extension	Provisions
Relative to the SMPTE-TT Extension Namespace
`#image`	SHALL NOT be used.
Relative to the IMSC 1.0 Extension namespace
`#linePadding`	MAY be used.
`#multiRowAlign`	MAY be used.

8. Image Profile Constraints

8.1 Profile Designator

This profile is associated with the following profile designator:

Profile Name	Profile Designator
IMSC 1.0 Image	`http://www.w3.org/ns/ttml/profile/imsc1/image`

Note

8.2 Presented Image

8.2.1 Definition

A presented image is a div element with a smpte:backgroundImage attribute that does not extend beyond a presented region.

8.2.2 Number per Region

In a given synchronic document, there shall be at most one presented image per presented region.

8.3 `div` element

If a smpte:backgroundImage attribute is applied to a div element:

the width and height of the region extent associated with the div element SHALL be specified and SHALL be equal to the width and height of the image source referenced by the smpte:backgroundImage;
the metadata element of the div element SHOULD contain an instance of ittm:altText that is a verbatim text equivalent of the image referenced by the smpte:backgroundImage attribute; and
The smpte:backgroundImage attribute SHALL reference a complete image that conforms to the PNG image coding as specified in Sections 7.1.1.3 and 15.1 of [MHP]. If a pHYs chunk is present, it SHALL indicate square pixels. Note: If no pixel aspect ratio is carried, the default of square pixels is assumed.

Note

In [TTML1], tts:extent and tts:origin do not apply to div elements. In order to individually position multiple div elements, each div can be associated with a distinct region with the desired tts:extent and tts:origin.

8.4 Features

The features included in a Document Instance SHALL conform to the Table below:

Feature	Provisions
Relative to the TT Feature namespace
`#bidi`	SHALL NOT be used.
`#color`	SHALL NOT be used.
`#content`	The `p`, `span` and `br` elements SHALL NOT be present.
`#direction`	SHALL NOT be used.
`#displayAlign`	SHALL NOT be used.
`#fontFamily`	SHALL NOT be used.
`#fontSize`	SHALL NOT be used.
`#fontStyle`	SHALL NOT be used.
`#fontWeight`	SHALL NOT be used.
`#length-em`	SHALL NOT be used.
`#lineBreak-uax14`	SHALL NOT be used.
`#lineHeight`	SHALL NOT be used.
`#nested-div`	SHALL NOT be used.
`#nested-span`	SHALL NOT be used.
`#padding`	SHALL NOT be used.
`#textAlign`	SHALL NOT be used.
`#textDecoration`	SHALL NOT be used.
`#textOutline`	SHALL NOT be used.
`#wrapOption`	SHALL NOT be used.
`#writingMode-vertical`	SHALL NOT be used.
Extension	Provisions
Relative to the SMPTE-TT Extension namespace
`#image`	`smpte:backgroundImage` MAY be used. `smpte:backgroundImageHorizontal` and `smpte:backgroundImageVertical` SHALL NOT be used. `smpte:image` SHALL NOT be used.

9. Hypothetical Render Model

9.1 Overview

This Section specifies the Hypothetical Render Model illustrated in Fig. 1 Hypothetical Render Model .

The purpose of the model is to limit Document Instance complexity. It is not intended as a specification of the processing requirements for implementations. For instance, while the model defines a glyph buffer for the purpose of limiting the number of glyphs displayed at any given point in time, it neither requires the implementation of such a buffer, nor models the sub-pixel character positioning and anti-aliased glyph rendering that can be used to produce text output.

The model operates on successive intermediate synchronic documents obtained from an input Document Instance, and uses a simple double buffering model: while an intermediate synchronic document E_n is being painted into Presentation Buffer P_n (the "front buffer" of the model), the previous intermediate synchronic document E_n-1 is available for display in Presentation Buffer P_n-1 (the "back buffer" of the model).

The model specifies an (hypothetical) time required for completely painting an intermediate synchronic document as a proxy for complexity. Painting includes drawing region backgrounds, rendering and copying glyphs, and decoding and copying images. Complexity is then limited by requiring that painting of intermediate synchronic document E_n completes before the end of intermediate synchronic document E_n-1.

Whenever applicable, constraints are specified relative to root container dimensions, allowing subtitle sequences to be authored independently of related video object resolution.

To enables scenarios where the same glyphs are used in multiple successive intermediate synchronic documents, e.g. to convey a CEA-608/708-style roll-up (see [CEA-608] and [CEA-708]), the Glyph Buffers G_n and G_n-1 store rendered glyphs across intermediate synchronic documents, allowing glyphs to be copied into the Presentation Buffer instead of rendered, a more costly operation.

Similarly, Decoded Image Buffers D_n and D_n-1 store decoded images across intermediate synchronic documents, allowing images to be copied into the Presentation Buffer instead of decoded.

9.2 General

The Presentation Compositor SHALL render in Presentation Buffer P_n each successive intermediate synchronic document E_n using the following steps in order:

clear the pixels, except for the first intermediate synchronic document E₀ for the which the pixels of P₀ SHALL be assumed to have been cleared;
paint, according to stacking order, all background pixels for each region;
paint all pixels for background colors associated with text or image subtitle content; and
paint the text or image subtitle content.

The Presentation Compositor SHALL start rendering E_n:

at the presentation time of E₀ minus Initial Painting Delay (IPD), if n = 0
at the presentation time of E_n-1, if n > 0

The duration DUR(E_n) for painting an intermediate synchronic document E_n in the Presentation Buffer P_n SHALL be:

DUR(E_n) = S(E_n) / BDraw + DUR_T(E_n) + DUR_I(E_n)

Where:

S(E_n) is the total normalized drawing area for intermediate synchronic document E_n, as specified in 9.3 Paint Regions
BDraw is the normalized background drawing performance factor.
DUR_T(E_n) is the duration, in seconds, for painting the text subtitle content for intermediate synchronic document E_n, as specified in Section 9.5 Paint Text
DUR_I(E_n) is the duration, in seconds, for painting the image subtitle content for intermediate synchronic document E_n, as specified in Section 9.4 Paint Images

The contents of the Presentation Buffer P_n SHALL be transferred instantaneously to Presentation Buffer P_n-1 at the presentation time of intermediate synchronic document E_n, making the latter available for display.

Note

It is possible for the contents of Presentation Buffer P_n-1 to never be displayed. This can happen if Presentation Buffer P_n is copied twice to Presentation Buffer P_n-1 between two consecutive video frame boundaries of the related video object.

It SHALL be an error for the Presentation Compositor to fail to complete painting pixels for E_n before the presentation time of E_n.

Unless specified otherwise, the following table SHALL specify values for IPD and BDraw.

Parameter	Initial value
Initial Painting Delay (IPD)	1 s
Normalized background drawing performance factor (BDraw)	12 s^-1

Note

BDraw effectively sets a limit on fillings regions - for example, assuming that the root container is ultimately rendered at 1920×1080 resolution, a BDraw of 12 s^-1 would correspond to a fill rate of 1920×1080×12/s=23.7×2²⁰pixels s^-1.

Note

IPD effectively sets a limit on the complexity of any given intermediate synchronic document.

9.3 Paint Regions

The total normalized drawing area S(E_n) for intermediate synchronic document E_n SHALL be

S(E_n) = CLEAR(E_n) + PAINT(E_n )

where CLEAR(E₀) = 0 and CLEAR(E_{n | n > 0}) = 1, i.e. the root container in its entirety.

Note

To ensure consistency of the Presentation Buffer, a new intermediate synchronic document requires clearing of the root container.

PAINT(E_n) SHALL be the normalized area to be painted for all regions that are used in intermediate synchronic document E_n according to

PAINT(E_n) = ∑_{R_i∈R_p} SIZE(R_i) ∙ NBG(R_i)

where R_p SHALL be the set of presented regions in the intermediate synchronic document E_n.

NSIZE(R_i) SHALL be given by:

NSIZE(R_i) = (width of R_i ∙ height of R_i ) ÷ (root container height ∙ root container width)

Example 6

For a region R_i in with tts:extent="250px 50px" within a root container with tts:extent="1920px 1080px", NSIZE(R_i) = 0.603.

NBG(R_i) SHALL be the total number of tts:backgroundColor attributes associated with the given region R_i in the intermediate synchronic document. A tts:backgroundColor attribute is associated with a region when it is explicitly specified (either as an attribute in the element, or by reference to a declared style) in the following circumstances:

It is specified on the region layout element that defines the region.
It is specified on a div, p, span or br content element that is to be flowed into the region for presentation in the intermediate synchronic document (see [TTML1] for more details on when a content element is followed into a region).
It is specified on a set animation element that is to be applied to content elements that are to be flowed into the region for presentation in the intermediate synchronic document (see [TTML1] for more details on when a set animation element is applied to content elements).

Even if a specified tts:backgroundColor is the same as specified on the nearest ancestor content element or animation element, specifying any tts:backgroundColor SHALL require an additional fill operation for all region pixels.

9.4 Paint Images

The Presentation Compositor SHALL paint into the Presentation Buffer P_n all visible pixels of presented images of intermediate synchronic document E_n.

For each presented image, the Presentation Compositor SHALL either:

if an identical image is present in Decoded Image Buffer D_n, copy the image from Decoded Image Buffer D_n to the Presentation Buffer P_n using the Image Copier; or
if an identical image is present in Decoded Image Buffer D_n-1, i.e. an identical image was present in intermediate synchronic document E_n-1, copy using the Image Copier the glyph from Decoded Image Buffer D_n-1 to both the Decoded Image Buffer D_n and the Presentation Buffer P_n; or
Otherwise, decode the image using the Image Decoder the image into the Presentation Buffer P_n and Decoded Image Buffer D_n.

Two images SHALL be identical if and only if they reference the same encoded image source.

The duration DUR_I(E_n) for painting images of an intermediate synchronic document E_n in the Presentation Buffer SHALL be as follows:

DUR_I(E_n) = ∑_{I_i ∈ I_c} NRGA(I_i) / ICpy + ∑_{I_j ∈ I_d} NSIZ(I_j) / IDec

where

I_c is the set of images copied when painting intermediate synchronic document E_n
I_d is the set of images decoded when painting intermediate synchronic document E_n
IDec is the image decoding rate
ICpy is the normalized image copy performance factor.

NRGA(I_i) is the Normalized Image Area of presented image I_i and SHALL be equal to:

NRGA(I_i)= (width of I_i ∙ height of I_i ) ÷ ( root container height ∙ root container width )

NSIZ(I_i) SHALL be the number of pixels of presented image I_i.

The contents of the Decoded Image Buffer D_n SHALL be transferred instantaneously to Decoded Image Buffer D_n-1 at the presentation time of intermediate synchronic document E_n.

The total size occupied by images stored in Decoded Image Buffers D_n or D_n-1 SHALL be the sum of their Normalized Image Area.

The size of Decoded Image Buffers D_n or D_n-1 SHALL be the Normalized Decoded Image Buffer Size (NDIBS).

Unless specified otherwise, the following table SHALL specify ICpy, Idec, and NDBIS.

Parameter	Initial value
Normalized image copy performance factor (ICpy)	6
Image Decoding rate (Idec)	1 × 2²⁰ pixels s^-1
Normalized Decoded Image Buffer Size (NDIBS)	0.9885

9.5 Paint Text

For each glyph displayed in intermediate synchronic document E_n, the Presentation Compositor SHALL:

if an identical glyph is present in Glyph Buffer G_n, copy the glyph from Glyph Buffer G_n to the Presentation Buffer P_n using the Glyph Copier; or
if an identical glyph is present in Glyph Buffer G_n-1, i.e. an identical glyph was present in intermediate synchronic document E_n-1, copy using the Glyph Copier the glyph from Glyph Buffer G_n-1 to both the Glyph Buffer G_n and the Presentation Buffer P_n; or
Otherwise render using the Glyph Renderer the glyph into the Presentation Buffer P_n and Glyph Buffer G_n using the corresponding style information.

Two glyphs are identical if and only if the following [TTML1] styles are identical:

tts:color
tts:fontFamily
tts:fontSize
tts:fontStyle
tts:fontWeight
tts:textDecoration
tts:textOutline

Fig. 2 Example of Presentation Compositor Behavior for Text Rendering

The duration DUR_T(E_n) for painting the text of an intermediate synchronic document E_n in the Presentation Buffer is as follows:

DUR_T(E_n) = ∑_{G_i ∈ G_r} NRGA(G_i) / Ren(G_i) + ∑_{G_j ∈ G_c} NRGA(G_j) / GCpy

Where:

G_r is the set of glyphs rendered into the Presentation Buffer P_n using the Glyph Renderer in intermediate synchronic document E_n.
G_c is the set of glyphs copied to the Presentation Buffer P_n using the Glyph Copier in intermediate synchronic document E_n.
Ren(G_i) is the text rendering performance factor glyph G_i
GCpy is the normalized glyph copy performance factor

G_r and G_c SHALL include only glyphs in presented regions and SHALL NOT include a [UNICODE] Code Point if it does not result in a change to presentation, e.g. the Code Point is ignored.

The Normalized Rendered Glyph Area NRGA(G_i) of a glyph G_i SHALL be equal to:

NRGA(G_i)= (fontSize of G_i as percentage of root container height)²

The contents of the Glyph Buffer G_n SHALL be copied instantaneously to Glyph Buffer G_n-1 at the presentation time of intermediate synchronic document E_n.

The total size occupied by the glyphs stored in Glyph Buffers G_n or G_n-1 SHALL be the sum of their Normalized Rendered Glyph Area.

The size of Glyph Buffers G_n and G_n-1 SHALL be the Normalized Glyph Buffer Size (NGBS).

Unless specified otherwise, the following table SHALL specify GCpy, Ren and NGBS, and SHALL apply to all supported font styles (including provision of outline border).

Parameter	Initial value
Normalized glyph copy performance factor (GCpy)	12
Text rendering performance factor Ren(G_i if G_i is not a CJK Unified Ideograph as specified in [UNICODE].	1.2
Text rendering performance factor Ren(G_i) if G_i is a CJK Unified Ideograph as specified in [UNICODE].	0.6
Normalized Glyph Buffer Size (NGBS)	1

Note

NRGA(G_i) does not take into account glyph decorations (e.g. underline), glyph effects (e.g. outline) or actual glyph aspect ratio. An implementation can determine an actual buffer size needs based on worst-case glyph size complexity.

Computed Font Family	Code Points	Reference Font
monospaceSerif	All code points specified in B. Recommended Character Sets	http://www.microsoft.com/typography/fonts/family.aspx?FID=10 (Courier New)
proportionalSansSerif	All code points specified in B. Recommended Character Sets, excluding the code points defined for Semitic languages alone.	http://www.microsoft.com/typography/fonts/family.aspx?FID=8 (Arial) or http://www.linotype.com/en/526/Helvetica-family.html (Helvetica)

Primary language subtag	Characters
lv, lt, et, tr, hr, cs, pl, sl, sk	(Latin Extended-A) U+0100 – U+017F
nl	(Combining Diacritical Marks) U+0301
ro	(Latin Extended-A) U+0100 – U+017F (Latin Extended-B) U+0218 – U+0219 U+021A – U+021B
el	(Combining Diacritical Marks) U+0301 U+0308 (Greek and Coptic) U+0386 – U+0387 U+0388 – U+03CE
pt, es	(Currency symbols) U+20A1 – U+20A2 U+20B3
ar	(Arabic) U+060C – U+060D U+061B U+061E – U+061F U+0621 – U+063A U+0640 – U+0652 U+0660 – U+066D U+0670
he	(Hebrew) U+05B0 – U+05C3 U+05D0 – U+05EA U+05F3 – U+05F4
bs, bg, mk, ru, sr	(Latin Extended-A) U+0100 – U+017F (Cyrillic) U+0400 – U+045F
uk	(Latin Extended-A) U+0100 – U+017F (Cyrillic) U+0400 – U+045F U+0490 – U+0491 (Spacing Modifier Letters) U+02BC (Letterlike Symbols) U+2116
kk	(Latin Extended-A) U+0100 – U+017F (Cyrillic) U+0400 – U+045F U+0492 – U+0493 U+049A – U+049B U+04A2 – U+04A3 U+04AE – U+04B1 U+04BA – U+04BB U+04D8 – U+04D9 U+04E8 – U+04E9
hu	(Latin Extended-A) U+0100 – U+017F (General Punctuation) U+2052 (Miscellaneous Mathematical Symbols-A) U+27E8–U+27E9

Table 1. Common Character Set.
(Basic Latin)
U+0020 - U+007E (Letterlike Symbols)
U+2103 : DEGREES CELSIUS
U+2109 : DEGREES FAHRENHEIT
U+2120 : SERVICE MARK SIGN
U+2122 : TRADE MARK SIGN
(Latin-1 Supplement)
U+00A0 - U+00FF
(Number Forms)
U+2153 – U+215F : Fractions
(Latin Extended-A)
U+0152 : LATIN CAPITAL LIGATURE OE
U+0153 : LATIN SMALL LIGATURE OE
U+0160 : LATIN CAPITAL LETTER S WITH CARON
U+0161 : LATIN SMALL LETTER S WITH CARON
U+0178 : LATIN CAPITAL LETTER Y WITH DIAERESIS
U+017D : LATIN CAPITAL LETTER Z WITH CARON
U+017E : LATIN SMALL LETTER Z WITH CARON
(Box Drawing)
U+2500 : BOX DRAWINGS LIGHT HORIZONTAL
U+2502 : BOX DRAWINGS LIGHT VERTICAL
U+250C : BOX DRAWINGS LIGHT DOWN AND RIGHT
U+2510 : BOX DRAWINGS LIGHT DOWN AND LEFT
U+2514 : BOX DRAWINGS LIGHT UP AND RIGHT
U+2518 : BOX DRAWINGS LIGHT UP AND LEFT
(Latin Extended-B)
U+0192 : LATIN SMALL LETTER F WITH HOOK
(Block Elements)
U+2588 : FULL BLOCK
(Spacing Modifier Letters)
U+02DC : SMALL TILDE
(Geometric Shapes)
U+25A1 : WHITE SQUARE
(General Punctuation)
U+2010 - U+2015 : Dashes
U+2016 - U+2027 : General punctuation
U+2030 - U+203A : General punctuation
(Musical Symbols)
U+2669 : QUARTER NOTE
U+266A : EIGHTH NOTE
U+266B : BEAMED EIGHTH NOTES

Abstract

Status of This Document

Table of Contents

1. Scope

2. Documentation Conventions

3. Terms and Definitions

4. Conformance

5. Profiles

5.1 General

5.2 Text Profile

5.3 Image Profile

6. Common Constraints

6.1 Document Encoding

6.2 Foreign Element and Attributes

6.3 Namespaces

6.4 Overflow

6.5 Related Video Object

6.6 Synchronization

6.7 Extensions

6.7.1 ittp:aspectRatio

6.7.2 ittp:progressivelyDecodable

6.7.3 itts:forcedDisplay

6.7.4 ittm:altText

6.8 Region

6.8.1 Presented Region

6.8.2 Dimensions and Position

6.8.3 Maximum number

6.9 Hypothetical Render Model

6.10 Features

7. Text Profile Constraints

7.1 Profile Designator

7.2 Recommended Character Sets

7.3 Reference Fonts

7.4 Features

8. Image Profile Constraints

8.1 Profile Designator

8.2 Presented Image

8.2.1 Definition

8.2.2 Number per Region

8.3 div element

8.4 Features

9. Hypothetical Render Model

9.1 Overview

9.2 General

9.3 Paint Regions

9.4 Paint Images

9.5 Paint Text

A. Reference Fonts

B. Recommended Character Sets

C. Forced content (non-normative)

D. WCAG Considerations

E. Sample Document Instance (non-normative)

F. Extensions

F.1 General

F.2 #progressivelyDecodable

F.3 #aspectRatio

F.4 #forcedDisplay

F.5 #altText

F.6 #linePadding

F.7 #multiRowAlign

G. References

G.1 Normative references

G.2 Informative references

8.3 `div` element