ITS Implementations
From ITS
Contents
Purpose & how to contribute
This page lists finalized and ongoing implementations of the Internationalization Tag Set (ITS) Version 2.0 specification. Feel free to update the wiki with new information about your work and also to announce it in the ITS Interest Group mailling list.
Parsers
Tilde ITS Parser
- Written in C#
- More info: https://github.com/tilde-nlp/taws/wiki/TAWS-Technical-Documentation#tilde-its
jQuery ITS Parser by Cocomore AG
- jQuery library
- Supports global rules and local markup for all HTML data categories.
- Supports only HTML markup.
- More info: http://plugins.jquery.com/its-parser/ (Documentation: https://github.com/attrib/jquery-its2)
Okapi ITS Engine
- Supports global rules and local markup for all data categories
- Support XML and HTML5
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=ITS_Components
Editors
BlueGriffon
- ITS 2.0 editing functionality for HTML5 documents and the data categories Translate, Localization Note and Terminology has been added.
- More info: http://www.bluegriffon.org/
- Note: the functionality is not yet part of the official BlueGriffon build. See instead the nightly build.
LibreOffice
- See http://www.init.de/en/LibreOfficeWriter?ILO2
- Note: this is a plugin that is not part of the official LibreOffice distribution
oXygen, DocBook5 and ePub / XHTML5
- See Its2-in-oXygen
- Note: this is not part of the official oXygen distribution
Validators
W3C Validator
- More info: http://validator.w3.org/
Validator.nu
- More info: http://validator.nu/
Annotations Services
IJS Enrycher
- Provides Text Analysis annotations on HTML5 documents.
- Cross-platform (Web service)
- More info: http://enrycher.ijs.si/
Tilde TAWS
- Provides Terminology annotations on text, HTML5 and XLIFF 1.2 documents.
- Cross-platform (Web service)
- More info: http://taws.tilde.com/
Okapi LanguageTool Step
- Invokes the LanguageTool library to annotate any content extracted with one of the Okapi filters with Localization Quality Issue items.
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=LanguageTool_Step
text4all ITS Term Tagger
- Provides terminology annotations on text or HTML. It uses fuzzy matching to identify terms that are not in the canonical form.
- Cross-platform (Web service)
- More info: http://www.text4all.net/itstagger.html
Babelfy
- Provides named entity annotation.
- The HTML output does not use ITS; the RDF output uses the ITS-RDF mapping.
- More info: http://www.babelfy.org/
Consumers
Okapi Enrycher Step
- Annotates any content extracted with one of the Okapi filters with Text Analysis annotations using the Enrycher Web service.
- Cross-platform (Java).
- More info: http://www.opentag.com/okapi/wiki/index.php?title=Enrycher_Step
Okapi Term Extraction Step
- Generates list of term candidates from any content extracted with one of the Okapi filters based on Terminology and/or Text Analysis annotations.
- Cross-platform (Java).
- More info: http://www.opentag.com/okapi/wiki/index.php?title=Enrycher_Step
Okapi Microsoft Batch Translation Step
- Uses the Domain information found in any content extracted with one of the Okapi filters to select which deployed MT system of the Microsoft Translator Hub is to be used to perform MT on the source text.
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=Microsoft_Batch_Translation_Step
ITSTool PO Extraction and Merging
- Uses Translate and Elements Within Text categories to segment XML into messages in PO files.
- Outputs localized XML documents based on source document, ITS information, and translation data in PO files.
- Also supports Localization Note, Preserve Space, Locale Filter, ID Value, and External Resource.
- Cross-platform (Python)
- More info: http://itstool.org/
Okapi Ocelot
- XLIFF+ITS Editor directed at the Linguistic Review and Post-editing processes. (Can also be used as a general purpose XLIFF Editor).
- Reads and renders Localization Quality Issue, Provenance and MT Confidence metadata.
- Adds Provenance metadata to edited segments.
- Retains original translation as XLIFF alternative translations.
- More info: http://open.vistatec.com/ocelot
qLabel
- JavaScript library that processes ITS Text Analysis markup.
- qLabel looks up and displays labels for entities marked up with Text Analysis markup in a Website.
- The look up queries various multilingual data sources like Wikidata.
- More info: http://googleknowledge.github.io/qlabel/
CMS Integration
Drupal (Cocomore AG)
- Support for ITS 2.0 in Drupal facilitates the localization/translation of Drupal-based content.
- Add the ability to apply ITS 2.0 local metadata through Drupal's WYSIWYG editor.
- Add the ability to apply global ITS 2.0 metadata at content mode level.
- Implemented jQuery plugin to optimize the GUI of the Translation Management tool.
- The Drupal modules facilitate the roundtripping process from WCMS with systems of Localization Service Provider (including automatic content re-integration).
- The Drupal modules enable tracking of provenance information (e.g. to identify translation post-editors).
- More info: http://www.drupal.org/project/its
GBC Server and PLINT (Linguaserve I.S. S.A.)
- Support for Drupal 7 ITS 2.0 compliant with GBC Server interoperability
- Content is roundtripped between a Drupal CMS and PLINT TMS, via GBC Server interoperability: The content originates in the CMS, and gets exposed/serialized as XHTML + ITS 2.0. This is sent to the TMS, and processed in a workflow. Upon completion, the TMS exposes/serializes localized/translated XHTML + ITS 2.0 to the CMS.
- L10N Workflow facilities
- Omit selected not translatable contents
- Send notifications to the project manager and tooltip visualization in the workflow.
- Quality checks in source language content, webservice parameters, and target language content.
- Inform to the translator about specific issues.
- Possibility to reassign the same translator/reviewer in new versions of the same content.
- Automatic selection and storage of terminology and translation memories.
- (Readiness as ITS 2.0 extension) Set processes to be done and expected finalization date.
- More info:
- More detailed description about the process: http://tinyurl.com/8woablr
- Videos of the different processing steps:
- 1) L10n workflow interaction for the pre-production phase: http://www.w3.org/International/multilingualweb/lt/wiki/images/6/67/Linguaserve_Preproduction_step_demo.zip
- 2) CAT tool usage with ITS 2.0: http://www.w3.org/International/multilingualweb/lt/wiki/images/c/ca/Linguaserve_ITS_CAT_Tool_usage_demo.zip
- 3) L10n workflow interaction for the post-production phase: http://www.w3.org/International/multilingualweb/lt/wiki/images/a/aa/Linguaserve_Postproduction_step_demo.zip
- 4) Demonstration of how the pre-production/post-production engine for Drupal XHTML files with ITS 2.0: https://www.linguaserve.net/las_demos/control/MLWLTWP3DemoEngine (user: demos, password: demosLingu@serve)
Apache JackRabbit / Sling (Adobe)
- Provides foundation features for round-tripping of content from the CMS out into a translation workflow and back
- Uses ITS 2.0 metadata to identify,manage and store multilingual content, within the JCR (Java Content Repository)
- Provides each translatable segment with a unique identity that persists throughout the translation workflow
- Applies ITS 2.0 rules across the entire CMS via centrally managed global rules.
- Includes support for embedded global rules and local rules.
- Import and Export ITS 2.0 Enabled XLIFF
- XLIFF + ITS2.0: The Industry-Standard content interchange format coupled with the latest standard metadata format.
- 100% Open Source.
- Built on free and open technologies such as Apache Jackrabbit, Apache Sling and Okapi
- (Download link available soon) [code should be resurrected on public repo soon, status March 2015]
Filters
Okapi XML Filter
- Extracts XML documents based on ITS defaults and rules
- Maps relevant ITS information to the internal Okapi data model
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=XML_Filter
Okapi HTML5-ITS Filter
- Extracts HTML5 documents based on ITS defaults and rules
- Maps relevant ITS information to the internal Okapi data model
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=HTML5-ITS_Filter
Okapi XLIFF Filter
- Extracts XLIFF 1.2 documents
- Maps relevant ITS information to the internal Okapi data model
- Cross-platform (Java)
- More info: http://www.opentag.com/okapi/wiki/index.php?title=XLIFF_Filter
Real Time Multilingual Publishing
Linguaserve's ATLAS Real Time, Lucy LT, DCU MaTrEx
- Enable an HTML5 content reviser (language editor, translation post-editor) to add ITS 2.0 metadata to the contents of web documents.
- Use the ITS 2.0 metadata to control the behavior of different Machine Translation Systems (Lucy LT from Lucy Software and MaTrEx from DCU)and Multilingual Publication System (ATLAS Real Time from Linguaserve).
- Covers post-editing of translations generated by MT.
- The ITS 2.0 markup:
- provides key information to drive the reliable extraction of translation-relevant content from HTML5;
- helps to control workflow dimensions such as selection of domain-specific vocabulary to improve the Machine Translation results;
- provides information for post-editing.
- Data category usage
- Translate - Parts that are not translation-relevant are marked (and protected).
- Implementers: Linguaserve, DCU, LucySoftware
- Localization Note - Provides additional information for language or translation editors to facilitate translation.
- Implementers: Linguaserve
- Language Information - Controls workflow dimensions such as setting the source language, and the target language (via the lang attribute of the output), it also protects the translation of contents where the lang attribute is different from the source language.
- Implementers: Linguaserve, DCU, LucySoftware
- Domain - Domain values are mapped to the domains used by the individual MT systems, and used to select the appropriate vocabulary.
- Implementers: Linguaserve, DCU, LucySoftware
- Provenance - Allows tracking of human agents (language or translation editors) or software agents (MT systems) that processed the content.
- Implementers: Linguaserve
- Localization Quality Issue - Can be provided for the translated content by the reviser. Can be utilized for example by MT developers to improve the MT System.
- Implementers: Linguaserve
- Locale Filter - Reveals that content is only relevant for certain locales (useful in localization).
- Implementers: DCU
- MT Confidence - Assesses the confidence in the quality of the translation generated by the MT system.
- Implementers: DCU
- Translate - Parts that are not translation-relevant are marked (and protected).
- Cross-platform (PHP)
- More info:
- Running software:
- ITS 2.0 ATLAS PW1 Testing Page (credentials: user=mlw-lt_public password=MLWLT4atlaspw1$).
- ITS 2.0 ATLAS PW1 Prototype (credentials: user=mlw-lt password=its2-wp4).
- Spanish Tax Agency showcase (Pre-production http://its2demo.aeat.es/)
- MaTrEx
- ITS 2.0 LucySoftware prototype (credentials: user=mlwlt password=ltweb11).
- General documentation:
- Running software:
Previewers
WICS - ITS 2.0 Metadata Previewer
- Supports HTML files enriched with ITS 2.0 metadata
- JavaScript-based preview of ITS 2.0 metadata (all categories) in supported Web browsers
- Compatible with Microsoft IE 10.0 and Mozilla Firefox 24.0 browsers
- Can be used to display context information for any content processing
- More info: https://github.com/renatb/ITS2.0-WICS-viewer
File Format Converters
WICS - ITS 2.0 File Format Converters
- Support XML, HTML, XLIFF input files enriched with ITS 2.0 metadata
- Produce output HTML files with ITS 2.0 metadata preserved
- Can be used to prepare the content and ITS metadata for previewing in Web browsers
- More info: https://github.com/renatb/ITS2.0-WICS-converter
Miscellaneous
Okapi XLIFF Writer
- Library to create XLIFF 1.2 documents with ITS information
- Maps internal Okapi data model to ITS/ITSXLF namespaces
- Cross-platform (Java)
- More info: http://okapi.opentag.com/javadoc/net/sf/okapi/common/filterwriter/XLIFFWriter.html
DocBook + ITS support
- DocBook schema extended with ITS elements/attributes allows to use all ITS features inside DocBook documents
- DocBook XSL Stylesheets pass ITS metadata from source DocBook into resulting HTML pages
- More info: http://xmlguru.cz/2013/05/docbook-and-its2
Experimental ITS 1.0 parser
See ITS Processing
MLOD4CON
- Goal: make multilingual linked open data sources accessible for content architects & content creators
- JavaScript library relying on jQuery
- More info: https://github.com/fsasaki/mlod4con