<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>6567</bug_id>
          
          <creation_ts>2009-02-12 21:27:10 +0000</creation_ts>
          <short_desc>Transcoding should modify encoding in XML declaration, meta elements</short_desc>
          <delta_ts>2009-02-12 21:38:05 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>Validator</product>
          <component>check</component>
          <version>0.8.4</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>NEW</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Ville Skyttä">ville.skytta</reporter>
          <assigned_to name="This bug has no owner yet - up for the taking">dave.null</assigned_to>
          
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>23687</commentid>
    <comment_count>0</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2009-02-12 21:27:10 +0000</bug_when>
    <thetext>In addition to plain charset conversion, transcoding should also rewrite the encoding given in the XML declaration, as well as in &lt;meta http-equiv&gt; and &lt;meta charset&gt; (HTML5), preferably the same way the doctype override does (it leaves the existing declaration in place, commented out).

If these declarations are not rewritten, problems arise when the transcoded content is passed to other validators that care about the encoding specified in one or more of them.  There&apos;s already a hack in place for XML::LibXML (bug 4867), and html5_validate() attempts some workarounds for the HTML5 validator, but those are not enough when a charset or doctype override is in effect. I think it would be better to do this centrally (as part of the transcoding process?) and get rid of the parser-specific hacks and workarounds.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>23689</commentid>
    <comment_count>1</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2009-02-12 21:38:05 +0000</bug_when>
    <thetext>More info:
- http://www.w3.org/mid/200902122336.17233.ville.skytta%40iki.fi
- http://dev.w3.org/cvsweb/validator/httpd/cgi-bin/check.diff?r1=1.626&amp;r2=1.627
</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>