Internationalization (i18n)

Making the World Wide Web worldwide!


Groups/repos

i18n WG

i18n Interest Group

African LE

Americas LE

Arabic LE

Chinese LE

Ethiopic LE

European LE

Hebrew LE

India LE

Japanese LE

Korean LE

Mongolian LE

SE Asian LE

Tibetan LE

Participate!

Join a Group

Follow the work

Translate a specification or page

International­ization Sponsorship Program

News by category
News archives
July 2011 (13)
July 2009 (10)
June 2009 (10)
June 2008 (13)
Search news

I18n sponsors

APL, Japan The Paciello Group Monotype Alibaba

Tag(s): tr-charmod-norm

Posts

For review: Update to Character Model for the World Wide Web: String Matching

The Editor’s Draft of the Working Group Note “Character Model: String Matching” is out for wide review. We are looking for comments by Wednesday, 21 October.

The Character Model series of documents contains background material, best practices, and recommendations for specification authors, implementers, and content authors related to the use of character encodings and Unicode on the Web. The “String Matching” document deals with specifying content restrictions and the matching of identifiers in protocols and document formats for W3C specifications.

The new revision adds two new terminology definitions (“application internal identifiers”, “user-facing identifiers”) and a new section targeted to specification authors on “Specifying Content Restrictions” (Section 3.1). The Working Group would like comments about these specific changes; comments about other aspects of the document are also welcome.

Please send any comments as github issues.

Categories: For review

Working Group Note: Character Model for the World Wide Web: String Matching

The Internationalization Working Group at the W3C has published a new Working Group Note. Character Model for the World Wide Web: String Matching provides authors of specifications, software developers, and content developers a common reference on string identity matching on the World Wide Web and thereby increase interoperability.

String identity matching is the process by which a specification or implementation defines whether two string values are the same or different from one another. It describes the ways in which texts that are semantically equivalent can be encoded differently and the impact this has on matching operations important to formal languages. Topics include normalization and case folding.

For last call review: Character Model for the World Wide Web: String Matching

A final draft of Character Model for the World Wide Web: String Matching is out for wide review. We are looking for comments by Tuesday 29 May.

This document builds on the document Character Model for the World Wide Web 1.0: Fundamentals to provide authors of specifications, software developers, and content developers with a common reference on string identity matching on the World Wide Web, in order to increase interoperability.

All comments are welcome. Please raise them as github issues. To make it easier to track comments, please raise separate issues or emails for each comment, and point to the section you are commenting on using a URL.

Updated Working Draft: Character Model for the World Wide Web: String Matching and Searching

Character Model for the World Wide Web: String Matching and Searching builds upon Character Model for the World Wide Web 1.0: Fundamentals to provide authors of specifications, software developers, and content developers a common reference on string identity matching on the World Wide Web and thereby increase interoperability.

This new version introduces numerous editorial changes as well as replacing some temporary terminology with better terms, and integrating the case folding text from the string matching algorithm into the case folding section. The document template was also adapated to match the new Internationalization publication process. See details of changes.

Character Model for the World Wide Web: String Matching and Searching Draft Published

This document builds upon on the Character Model for the World Wide Web 1.0: Fundamentals to provide authors of specifications, software developers, and content developers a common reference on string matching on the World Wide Web and thereby increase interoperability. String matching is the process by which a specification or implementation defines whether two string values are the same or different from one another.

The main target audience of this specification is W3C specification developers. This specification and parts of it can be referenced from other W3C specifications and it defines conformance criteria for W3C specifications, as well as other specifications.

This version of this document represents a significant change from its previous edition. Much of the content is changed and the recommendations are significantly altered. This fact is reflected in a change to the name of the document from “Character Model: Normalization” to “Character Model for the World Wide Web: String Matching and Searching”.

Updated Working Draft: Character Model for the World Wide Web 1.0: Normalization

A new version of the Character Model for the World Wide Web 1.0: Normalization was published. The only significant change was a note to clarify that content of the Working Draft is currently out of date, and the Internationalization Core Working Group intends to substantially alter or replace the recommendations found in this document with very different recommendations in the near future.

Character Model for the World Wide Web 1.0: Normalization

Updated Working Draft

The Internationalization Core Working Group has published an updated Working Draft of Character Model for the World Wide Web 1.0: Normalization to improve text manipulation on the Web. Based on the character model Fundamentals W3C Recommendation, the draft provides authors of specifications, software developers, and content developers with a common reference for text normalization and string identity matching.

Categories: New draft

Copyright © 2023 World Wide Web Consortium.
W3C® liability, trademark and permissive license rules apply.

Questions or comments? ishida@w3.org