Project review 30 June 2005: Requirements of the Internationalization Tag Set Working Group (i18n ITS WG) and possible technologies for their realization.

Presenter: Felix Sasaki

This presentation sketches the work of the i18n ITS WG:

Goals of ITS

Requirements for ITS

Identifying date and time

It should be possible to indicate in markup that a string represents an ISO 8601 date and/or time.
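
As a minimal sketch of what such an indication could enable, the following Python snippet reads strings that markup has flagged as dates or times and parses them. The namespace URI, the attribute name `its:dateTime`, its values, and the time of day in the sample are assumptions made for illustration, not names defined by the working group. Note also that `fromisoformat` accepts only part of ISO 8601 in older Python versions.

```python
import xml.etree.ElementTree as ET
from datetime import date, datetime

ITS_NS = "http://example.org/its"  # hypothetical namespace URI, for illustration only

def parse_marked_dates(xml_text):
    """Return parsed values for every element flagged as an ISO 8601 date or date-time."""
    root = ET.fromstring(xml_text)
    results = []
    for el in root.iter():
        kind = el.get(f"{{{ITS_NS}}}dateTime")  # hypothetical attribute name
        text = (el.text or "").strip()
        if kind == "date":
            results.append(date.fromisoformat(text))
        elif kind == "dateTime":
            results.append(datetime.fromisoformat(text))
    return results

# Sample content; the time of day is invented for the example.
SAMPLE = (
    f'<para xmlns:its="{ITS_NS}">The review took place on '
    '<span its:dateTime="date">2005-06-30</span> and started at '
    '<span its:dateTime="dateTime">2005-06-30T09:30:00</span>.</para>'
)

parsed = parse_marked_dates(SAMPLE)
```

Without the markup, a tool would have to guess which strings are dates; with it, a localization step can reformat exactly the flagged values for the target locale.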

Bidirectional text

There has to be a mechanism to indicate the directionality of text.

Figure: wrong directionality, as visualized when relying only on the Unicode bidirectional algorithm.
Figure: feasible directionality.
Figure: markup to produce the feasible visualization.
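
The need for explicit markup can be sketched in Python. Without higher-level information, the base direction of a paragraph is guessed from its first strongly directional character, in the spirit of the paragraph-level rule of the Unicode bidirectional algorithm. The function names and the `declared` parameter below are assumptions for illustration. The parameter stands in for a direction supplied by markup.

```python
import unicodedata

def first_strong_direction(text):
    """Guess the base direction from the first strongly directional character."""
    for ch in text:
        bidi_class = unicodedata.bidirectional(ch)
        if bidi_class == "L":
            return "ltr"
        if bidi_class in ("R", "AL"):
            return "rtl"
    return "ltr"  # no strong character found: fall back to left-to-right

def base_direction(text, declared=None):
    """Prefer a direction declared in markup over the heuristic."""
    if declared in ("ltr", "rtl"):
        return declared
    return first_strong_direction(text)

# A Hebrew phrase that happens to begin with a Latin acronym.
SAMPLE = "W3C \u05e2\u05d1\u05e8\u05d9\u05ea"
```

For the sample string the heuristic yields a left-to-right base direction because of the leading acronym, even if the surrounding content is right-to-left. Only a declared direction can correct that.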

Identification of content that should not be translated

The content of the trademark element should not be translated.

<trademark its:translate="no" ...
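
A localization tool could use such a flag to decide which text to hand to translators. The sketch below is one possible reading, not a defined processing model. The namespace URI and the product name in the sample are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

ITS_NS = "http://example.org/its"  # hypothetical namespace URI, for illustration only
TRANSLATE = f"{{{ITS_NS}}}translate"

def translatable_text(el):
    """Collect text runs, skipping any element flagged with its:translate="no"."""
    if el.get(TRANSLATE) == "no":
        return []
    parts = []
    if el.text and el.text.strip():
        parts.append(el.text.strip())
    for child in el:
        parts.extend(translatable_text(child))
        # Text following a child belongs to the parent, so it is kept
        # even when the child itself is excluded.
        if child.tail and child.tail.strip():
            parts.append(child.tail.strip())
    return parts

# "ExampleWriter" is an invented trademark.
SAMPLE = (
    f'<para xmlns:its="{ITS_NS}">Install '
    '<trademark its:translate="no">ExampleWriter</trademark> before use.</para>'
)

segments = translatable_text(ET.fromstring(SAMPLE))
```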

Selection of information with i18n and l10n related characteristics

The information might be the textual content of an element, in which case an element like span could be used. For attribute values the requirement is harder to meet, since markup cannot be placed inside an attribute value.

Specification of inheritance features and exceptions

Example: No svg element should be localized, except the text element in a given context.
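
One way to picture inheritance with exceptions is an ordered rule list in which later rules override earlier ones and children inherit the value of their parent. The following Python sketch assumes that "a given context" means a text element directly inside svg. The rule format and the sample document are likewise invented for illustration. The paths use only the XPath subset supported by `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Hypothetical rules: (path, translate?). Later rules override earlier ones.
RULES = [
    (".//svg", False),      # svg and everything inside it: do not localize
    (".//svg/text", True),  # exception: text directly inside svg
]

def translate_flags(root, rules):
    """Map every element to a translate flag, applying rules and inheritance."""
    explicit = {}
    for path, flag in rules:
        for el in root.findall(path):
            explicit[el] = flag

    flags = {}

    def walk(el, inherited):
        flag = explicit.get(el, inherited)  # an explicit rule beats inheritance
        flags[el] = flag
        for child in el:
            walk(child, flag)

    walk(root, True)  # default: content is translatable
    return flags

SAMPLE = "<doc><p>Hello</p><svg><title>Chart</title><text>Label</text></svg></doc>"

root = ET.fromstring(SAMPLE)
by_tag = {el.tag: flag for el, flag in translate_flags(root, RULES).items()}
```

Here title inherits "do not localize" from svg, while text is switched back on by the exception rule.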

Requiring uniqueness of IDs across documents

Linking mechanisms for external information

The error message in this software documentation should be retrieved from an external database with locale-specific terms.

<para>If you create a typing error like "strs(s)",
you will get the message
 <xref id="resfile.resx">
  <subst>
   <search>{0}</search>
   <replace>&lt;Filename&gt;</replace>
  </subst>
 </xref>
.</para>
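
How a processor might resolve such a link can be sketched as follows. The lookup table stands in for the external database. Its keys, the message texts, and the function name are assumptions for illustration, since the example above does not say how messages are stored or selected.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for an external, locale-specific resource database.
RESOURCES = {
    ("resfile.resx", "en"): "The file {0} could not be opened.",
    ("resfile.resx", "de"): "Die Datei {0} konnte nicht geöffnet werden.",
}

def resolve_xref(xref, locale):
    """Fetch the message for the given locale and apply each search/replace pair."""
    message = RESOURCES[(xref.get("id"), locale)]
    for subst in xref.findall("subst"):
        message = message.replace(subst.findtext("search"), subst.findtext("replace"))
    return message

SAMPLE = (
    '<xref id="resfile.resx"><subst><search>{0}</search>'
    '<replace>&lt;Filename&gt;</replace></subst></xref>'
)

xref = ET.fromstring(SAMPLE)
english = resolve_xref(xref, "en")
german = resolve_xref(xref, "de")
```

The documentation source stays the same for every locale. Only the lookup changes.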

Overview of further requirements

All requirements are work items on the ITS home page.

Problems we face with the ITS requirements

Our current state:

The current state of ITS

Schema languages

Alternatives

XML DTDs, XML Schema, RELAX NG; in addition: Schematron?

ITS is meant, among others, for schema authors. The question is: are we formulating requirements that cannot be fulfilled with every schema language?

Important criteria:

Namespace sectioning

General idea

Rules describe which namespace should be validated against which schema. According to the rules, documents are separated into schema-specific element and attribute sections.
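
The idea can be sketched in Python for element sections only (attribute sections are left out). A new section starts wherever the namespace of an element differs from that of its parent, and a rule table assigns a schema to each namespace. The schema file names and the function names are assumptions for illustration. No actual validation is performed.

```python
import xml.etree.ElementTree as ET

XHTML = "http://www.w3.org/1999/xhtml"
SVG = "http://www.w3.org/2000/svg"

# Hypothetical rules: namespace URI -> schema to validate against.
RULES = {XHTML: "xhtml.rng", SVG: "svg.rng"}

def split_tag(tag):
    """Split ElementTree's '{namespace}local' notation into its two parts."""
    if tag.startswith("{"):
        ns, local = tag[1:].split("}", 1)
        return ns, local
    return "", tag

def sections(root):
    """List (section root, schema) pairs in document order."""
    found = []

    def walk(el, parent_ns):
        ns, local = split_tag(el.tag)
        if ns != parent_ns:  # namespace changed: a new section begins here
            found.append((local, RULES.get(ns)))
        for child in el:
            walk(child, ns)

    walk(root, None)
    return found

SAMPLE = (
    f'<html xmlns="{XHTML}"><body><p>Text</p>'
    f'<svg xmlns="{SVG}"><text>Label</text></svg></body></html>'
)

result = sections(ET.fromstring(SAMPLE))
```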

Benefits

No need to change the existing schemas. Existing XPath expressions written for a given schema still work within the corresponding sections!

Problems

Will this be supported widely? Will this be part of a W3C standard?

Schema annotation

General idea

ITS data categories are realized as additional information to the given schema.

Benefits

Allows for reusing existing schemas, without a need to change them.

Problems

Which annotation format should be used? SGML-based architectural forms? RDF-based annotations? DSDL part 8? An ITS-specific format?

XML processing model

General idea

The interrelation between processing steps such as namespace sectioning, validation, and linking to external information is described in a declarative format.
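
What "declarative" could mean here is sketched below: the order of the steps is plain data, and a small driver interprets it. The step names are taken from the sentence above. The step functions are placeholders that only record that they ran, and all identifiers are assumptions for illustration rather than part of any proposed processing model.

```python
# The pipeline description is data, not code: reordering steps means editing this list.
PIPELINE = ["namespace-sectioning", "validation", "external-linking"]

def make_step(name):
    """Build a placeholder step that only records its own name in the log."""
    def step(document):
        return {**document, "log": document["log"] + [name]}
    return step

# Registry of available steps (hypothetical implementations).
STEPS = {name: make_step(name) for name in PIPELINE}

def run_pipeline(pipeline, steps, document):
    """Apply the named steps in the declared order."""
    for name in pipeline:
        document = steps[name](document)
    return document

result = run_pipeline(PIPELINE, STEPS, {"source": "<doc/>", "log": []})
```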

Benefits

Holds everything together :)

Problems

When will it come? There is an urgent need for ITS, and we cannot wait too long!

Questions to discuss