This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019.

Bug 10947 - CHARACTER_ENCODING_SUPPORT-5: may be reported for files correctly encoded as UTF-8
Status: NEW
Alias: None
Product: mobileOK Basic checker
Classification: Unclassified
Component: Java Library
Version: unspecified
Hardware: PC Linux
Importance: P2 normal
Target Milestone: ---
Assignee: fd
QA Contact:
URL:
Whiteboard:
Keywords:
Depends on: 10952
Blocks:
Reported: 2010-10-01 07:43 UTC by fd
Modified: 2010-10-01 09:18 UTC (History)
0 users

See Also:


Description fd 2010-10-01 07:43:51 UTC
The CHARACTER_ENCODING_SUPPORT-5 error message seems to be reported for files that are correctly encoded in UTF-8.

See user report at:
http://lists.w3.org/Archives/Public/public-mobile-dev/2010Sep/0000.html

... for page:
http://honte.eu/

Possible leads to find the bug:
- content is SVG served as image/svg+xml. Could the Checker simply end up with empty content internally and think empty content is not properly encoded as UTF-8?
- content is compressed. Could the Checker run the UTF-8 encoding check on compressed content?
Comment 1 fd 2010-10-01 09:18:40 UTC
The second lead is the right one: the mobileOK Checker does not support compression and tries to parse the received body directly as HTML, without decompressing it first. See Bug 10952 for more details.
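
For illustration only, here is a minimal sketch of why a UTF-8 check run on a still-compressed body would fail. This is not the mobileOK Checker's code: the class name, the helper methods and the sample SVG string are all made up for the example. It assumes the server sent the body gzip-compressed and that the check is a strict UTF-8 decode. It also shows that an empty body passes such a check, so the first lead would not produce the error under this assumption.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class, not part of the mobileOK Checker library.
final class Utf8CompressionSketch {

    // Strict UTF-8 check: malformed byte sequences are reported, not replaced.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // Compresses the bytes the way a server using "Content-Encoding: gzip" would.
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(raw);
        }
        return out.toByteArray();
    }

    // Restores the original bytes; this is the step the check needs to run after.
    static byte[] gunzip(byte[] compressed) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPInputStream gz = new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            byte[] buffer = new byte[4096];
            int read;
            while ((read = gz.read(buffer)) != -1) {
                out.write(buffer, 0, read);
            }
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Invented sample content with one non-ASCII character (U+00E9).
        byte[] svg = "<svg xmlns=\"http://www.w3.org/2000/svg\"><text>h\u00e9llo</text></svg>"
                .getBytes(StandardCharsets.UTF_8);
        byte[] compressed = gzip(svg);

        System.out.println("empty body valid:        " + isValidUtf8(new byte[0]));
        System.out.println("raw body valid:          " + isValidUtf8(svg));
        // The gzip header starts with 0x1f 0x8b; 0x8b alone is not valid UTF-8.
        System.out.println("compressed body valid:   " + isValidUtf8(compressed));
        System.out.println("decompressed body valid: " + isValidUtf8(gunzip(compressed)));
    }
}
```

Under these assumptions, only the compressed body fails the check. The same content passes once it has been decompressed, which matches the behaviour reported for the page above.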