This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 2275 - Charset/encoding issue
Summary: Charset/encoding issue
Status: RESOLVED DUPLICATE of bug 1833
Alias: None
Product: Validator
Classification: Unclassified
Component: Parser (show other bugs)
Version: 0.7.0
Hardware: PC Windows XP
: P2 major
Target Milestone: ---
Assignee: Terje Bless
QA Contact: qa-dev tracking
Depends on:
Reported: 2005-09-21 12:17 UTC by Jeppe H
Modified: 2005-09-21 13:07 UTC (History)
0 users

See Also:


Description Jeppe H 2005-09-21 12:17:49 UTC
When using the "Validate by direct input" (great feature!), I get an error,
which I believe is related to encoding or charset.

I try to validate this:

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
<html xml:lang="se" xmlns="">
	<title>Encoding/charset test</title>
	<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

and I get 3 errors that relates to "non SGML character". All 3 errors are in
this line:

If I change charset from iso-8859-1 to utf-8, the XHTML will validate without
errors, I would have expected this behaviour to be the other way around.

If save the XHTML as .html and use the "Validate by File Upload" feature, I get
no errors, the XHTML validates fine. Also if this .html file is placed on a
webserver, using the "Validate by URL" feature will also validate fine.

So, to sum up, I believe there is an issue with the "Validate by direct input"
and encoding/charset - it does not seem to be set properly, even though it does
say "Encoding: iso-8859-1" a the top of the validation results page.
Comment 1 Olivier Thereaux 2005-09-21 13:07:05 UTC
This is a known bug we fixed today (in cvs). See Bug #1833 for details.

*** This bug has been marked as a duplicate of 1833 ***