Intended audience: XHTML/HTML coders (using editors or scripting), script developers (PHP, JSP, etc.), Web project managers, and anyone who is wondering whether they should use Unicode.
Updated 2010-09-02 16:08
Are Web sites using Unicode right now?
It is sometimes assumed that Unicode encodings are popular "behind the scenes" but rarely used on the pages of major Web sites.
The initial version of this article, in 2003, pointed to the home pages of the following Web sites, which all used Unicode at the time, as evidence that Unicode was gaining ground:
A blog post by Google in January 2010 indicated that almost 50% of the Web in their sample of several billion pages was now using Unicode. According to Google, the figure for Unicode encoded pages then passed the 50% mark in August 2010. Add to that the figure for ASCII-only web pages (since ASCII is a subset of UTF-8), and the figure rises near to 70%.
To find out if a Web page is using Unicode, simply select the "encoding" feature of your Web browser (as shown below using Internet Explorer):
Many more companies do rely on Unicode "behind the scenes" – that is, within their content databases. For example, a content database may hold all text in Unicode format; when the content is pulled to the Web server, it is transcoded into the native encoding.
Tell us what you think (English).
Content first published 2003-07-31. Last substantive update 2010-09-02 16:08 GMT. This version 2010-09-04 5:51 GMT
For the history of document changes, search for qa-who-uses-unicode in the i18n blog.
Copyright © 2003-2010 W3C® (MIT, ERCIM, Keio, Beihang), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply. Your interactions with this site are in accordance with our public and Member privacy statements.