<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>31</bug_id>
          
          <creation_ts>2002-10-25 22:51:57 +0000</creation_ts>
          <short_desc>checklink: Try GET if server responds 501 (or 405) to HEAD</short_desc>
          <delta_ts>2013-11-03 07:35:21 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>LinkChecker</product>
          <component>checklink</component>
          <version>unspecified</version>
          <rep_platform>Other</rep_platform>
          <op_sys>other</op_sys>
          <bug_status>ASSIGNED</bug_status>
          <resolution></resolution>
          
          
          <bug_file_loc></bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Ville Skyttä">ville.skytta</reporter>
          <assigned_to name="This bug has no owner yet - up for the taking">dave.null</assigned_to>
          <cc>frank.ellermann</cc>
    
    <cc>gonzo1lee</cc>
    
    <cc>quenneville456</cc>
    
    <cc>sporosbe</cc>
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>52</commentid>
    <comment_count>0</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2002-10-25 22:51:57 +0000</bug_when>
    <thetext>The link checker should do a GET request when the server replies 501 to a HEAD
request.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>551</commentid>
    <comment_count>1</comment_count>
    <who name="Frank Ellermann">frank.ellermann</who>
    <bug_when>2003-06-23 06:01:37 +0000</bug_when>
    <thetext>JFTR, some servers don&apos;t send 501 if they don&apos;t like HEAD, e.g. 
http://www.rsasecurity.com/rsalabs/challenges/factoring/numbers.html

While trying to write my own checklink (REXX + rxsock.dll on OS/2),
I obviously caused some serious trouble for some servers. My
new strategy is to never test any given host again if it sent
400, 405, or 5??.  Until now I haven&apos;t seen any 501, and if I
understand RFC 2616 correctly, 501 *_should not_* be used in
replies to HEAD requests.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>757</commentid>
    <comment_count>2</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2003-09-15 13:10:45 +0000</bug_when>
    <thetext>You&apos;re right, the results with servers that do not support HEAD vary a lot,
almost any status can be the result.  For example as reported in
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=188123

But RFCs 1945, 2068 and 2616 (section 5.1.1) all mention 501 as a &quot;SHOULD&quot; for
unrecognized methods; 2068 and 2616 additionally define the possibility of a 405.
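
As a rough sketch of the fallback this bug asks for (illustrative Python, not
the actual checklink code; the names RETRY_WITH_GET, fetch_status and
check_link are made up for the example, and other statuses are deliberately
left alone):

```python
import urllib.error
import urllib.request

# Statuses that suggest the server rejected the HEAD method itself:
# 501 Not Implemented and 405 Method Not Allowed.
RETRY_WITH_GET = frozenset({501, 405})


def fetch_status(url, method):
    """Return the HTTP status code for one request made with the given method."""
    request = urllib.request.Request(url, method=method)
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises for 4xx/5xx responses; the status is on the exception.
        return error.code


def check_link(url, fetch=fetch_status):
    """Try HEAD first; repeat with GET only if HEAD looks unsupported."""
    status = fetch(url, "HEAD")
    if status in RETRY_WITH_GET:
        status = fetch(url, "GET")
    return status
```

The fetch parameter is only there so the retry logic can be exercised without
touching the network.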

I have always thought that support for HEAD is a &quot;MUST&quot; in HTTP 1.1, but RFC 2616
(and 2068, FWIW) say:

   &quot;The methods GET
   and HEAD MUST be supported by all general-purpose servers.&quot;

Note &quot;general-purpose&quot;.  *sigh*</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>758</commentid>
    <comment_count>3</comment_count>
    <who name="Frank Ellermann">frank.ellermann</who>
    <bug_when>2003-09-15 14:37:52 +0000</bug_when>
    <thetext>Oops... tnx for this &quot;general purpose&quot; hint. That&apos;s a bit
like the many lines explaining GMT in date-headers, and
ending with the conclusion, that a server can omit this
header if it has difficulties to determine Zulu time... ;-)

Actually this makes sense (routers without clock etc.),
but not supporting HEAD is hard. Better stay away from GET,
at least as long as you don&apos;t support robots.txt.
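
A robots.txt check before any GET could look roughly like this (a Python
sketch using the standard urllib.robotparser module; may_get, the
example-checker agent name and the sample rules are invented for
illustration):

```python
from urllib import robotparser


def may_get(robots_lines, user_agent, url):
    """Return True if the given robots.txt lines let user_agent fetch url."""
    parser = robotparser.RobotFileParser()
    # parse() takes the file content as a list of lines, so no network
    # access is needed here; a real checker would fetch /robots.txt first.
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


rules = ["User-agent: *", "Disallow: /private/"]
print(may_get(rules, "example-checker", "http://example.org/private/page.html"))
print(may_get(rules, "example-checker", "http://example.org/index.html"))
```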

BTW, do you have admin rights on this Bugzilla?  If so,
please (re)enable the option to modify mail addresses.
It exists; I&apos;ve seen it on the distributed Bugzilla, where
I could change my address.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>759</commentid>
    <comment_count>4</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2003-09-15 14:50:01 +0000</bug_when>
    <thetext>I haven&apos;t seen the option to change a mail address in any Bugzilla.  Not to say
such a thing doesn&apos;t exist, but I believe it&apos;s a recent addition, most likely
available in newer versions than this one.  And no, my rights aren&apos;t up to the
task anyway, I suggest contacting Terje Bless &lt;link@pobox.com&gt; and/or Olivier
Thereaux &lt;ot@w3.org&gt; on Bugzilla issues, I believe they have the necessary admin
rights.  Hm, maybe a component named &quot;Bugzilla&quot; wouldn&apos;t be a bad idea here... :)
</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>