<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://www.w3.org/Bugs/Public/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4"
          urlbase="https://www.w3.org/Bugs/Public/"
          
          maintainer="sysbot+bugzilla@w3.org"
>

    <bug>
          <bug_id>1131</bug_id>
          
          <creation_ts>2005-02-23 13:58:07 +0000</creation_ts>
          <short_desc>LinkChecker should behave sensibly for news:// and irc:// URIs</short_desc>
          <delta_ts>2009-04-09 19:26:36 +0000</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>LinkChecker</product>
          <component>checklink</component>
          <version>unspecified</version>
          <rep_platform>All</rep_platform>
          <op_sys>All</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>FIXED</resolution>
          
          
          <bug_file_loc>http://qa-dev.w3.org/wlc/checklink?uri=http%3A%2F%2Fwww.mozilla.org%2Fprojects%2Fsvg%2Ffaq.html&amp;summary=on&amp;hide_type=all&amp;depth=10&amp;cookie=set&amp;check=Check</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P3</priority>
          <bug_severity>minor</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Jonathan Watt">jonathan.watt</reporter>
          <assigned_to name="Ville Skyttä">ville.skytta</assigned_to>
          <cc>gonzo1lee</cc>
    
    <cc>ville.skytta</cc>
          
          <qa_contact name="qa-dev tracking">www-validator-cvs</qa_contact>

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>4077</commentid>
    <comment_count>0</comment_count>
    <who name="Jonathan Watt">jonathan.watt</who>
    <bug_when>2005-02-23 13:58:07 +0000</bug_when>
    <thetext>LinkChecker should behave sensibly for news:// and irc:// URIs and not always
show errors.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>4078</commentid>
    <comment_count>1</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2005-02-23 18:03:50 +0000</bug_when>
    <thetext>Link checker&apos;s URI scheme/protocol support comes from the libwww-perl library. 
At least in theory, all schemes supported by it should be also supported in the
link checker.  Now, regarding the cases from your example URL:

news://news.mozilla.org/netscape.public.mozilla.svg
This URI is not RFC1738 compliant.  There&apos;s no hostname part in news: URIs
according to the RFC.  Maybe it should be nntp://...?

news:netscape.public.mozilla.svg
This looks ok, but the link checker @ qa-dev.w3.org does not have a default NNTP
server configured.  Olivier, is there one @w3.org or somewhere we (c|sh)ould use
for qa-dev and v.w.o?

As far as I know, there&apos;s no standard for irc:// URIs.  That&apos;s probably why it
isn&apos;t supported in libwww-perl (and consequently nor in link checker).  I
suggest you contact the libwww-perl authors if you feel it should be supported.
http://www.iana.org/assignments/uri-schemes
http://www.w3.org/Addressing/draft-mirashi-url-irc-01.txt (expired 8 years ago)

Finally, news: and nntp: URIs are not too well supported in libwww-perl.  In
particular, they will only work sufficiently for link checker use for URIs that
identify an article; the implementation for URIs identifying a newsgroup is
pretty incomplete.  Yes, that means that even if the news-&gt;nntp &quot;bug&quot; would be
fixed and a default NNTP server configured for qa-dev and v.w.o, your URIs (both
of which identify a newsgroup, not an article) wouldn&apos;t be completely supported
by libwww-perl (nor link checker) at the moment.

(This made me remember that I have some patches that I should send to
libwww-perl upstream, those make the nntp support a bit better.)</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>4079</commentid>
    <comment_count>2</comment_count>
    <who name="Bj">bjoern</who>
    <bug_when>2005-02-23 18:10:25 +0000</bug_when>
    <thetext>I suggest making news: and nntp: explicitly unsupported.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>4080</commentid>
    <comment_count>3</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2005-02-23 18:40:05 +0000</bug_when>
    <thetext>For good, or for now (if/until LWP gets proper support for them)?</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>4081</commentid>
    <comment_count>4</comment_count>
    <who name="Bj">bjoern</who>
    <bug_when>2005-02-23 18:55:22 +0000</bug_when>
    <thetext>Well, v.w3.org should only report links as broken if there is no doubt that the 
link is broken, for most news: and nntp: links it cannot know that. We might 
support nntp://example.org which could check whether there is a DNS entry for 
example.org and maybe even whether example.org accepts connections to :nntp, 
but we should not do much more. As not supporting them at all is easier...</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>4082</commentid>
    <comment_count>5</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2005-02-23 19:25:55 +0000</bug_when>
    <thetext>I just sent a patch to the LWP mailing list which makes the news:/nntp: support
in it complete enough for link checker use, excluding the case where the NNTP
server&apos;s hostname is not present in the URI.  And that case and how to work
around it has been documented in the link checker documentation for some time now.

If my patch is accepted to LWP, I&apos;m not in favour of dropping the feature
altogether.  We could possibly add a special message and short-circuit without
even trying for the no-server-in-URI case.  Or do that only if a default NNTP
server is not defined, but the problem with this approach is that Net::NNTP&apos;s
default server resolution is somewhat hairy and not exposed through the API.

Keeping the feature requires no changes to checklink, removing it does :)</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7445</commentid>
    <comment_count>6</comment_count>
    <who name="Ville Skyttä">ville.skytta</who>
    <bug_when>2005-12-14 21:27:53 +0000</bug_when>
    <thetext>My news/nntp patch is included in LWP &gt;= 5.804, so as far as I&apos;m concerned, this
bug can be resolved by upgrading LWP on v.w.o.  Reassigning to Olivier for that.
 Note: I haven&apos;t personally tested libwww-perl &gt;= 5.804.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>7593</commentid>
    <comment_count>7</comment_count>
    <who name="Olivier Thereaux">ot</who>
    <bug_when>2006-01-03 22:19:52 +0000</bug_when>
    <thetext>Currently, all that&apos;s available on any debian distrib is libwww-perl 5.803-4.
http://packages.debian.org/libwww-perl 
For the sake of maintenance of the validator servers, I&apos;d like to wait and see whether/when 5.804+ gets 
into debian.</thetext>
  </long_desc>
      
      

    </bug>

</bugzilla>