This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 18802 - Web pages in Spanish are ignored
Summary: Web pages in Spanish are ignored
Status: RESOLVED INVALID
Alias: None
Product: LinkChecker
Classification: Unclassified
Component: checklink (show other bugs)
Version: unspecified
Hardware: PC Windows NT
: P2 normal
Target Milestone: ---
Assignee: Ville Skyttä
QA Contact: qa-dev tracking
URL: http://www.amelox.com
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-09-07 21:59 UTC by Ralph Seabrook
Modified: 2012-09-10 06:14 UTC (History)
0 users

See Also:


Attachments

Description Ralph Seabrook 2012-09-07 21:59:00 UTC
All our web pages are properly link-checked.
However, a few days ago I added two more with the content in the Spanish languge. There will be more in the future.
They are ignored.  Is there any reason for this?
Comment 1 Ville Skyttä 2012-09-08 08:36:36 UTC
More information is needed to investigate: exactly which documents or links are being ignored?
Comment 2 Ralph Seabrook 2012-09-08 15:31:20 UTC
These URLs are listed and evaluated:
www.amelox.com/orderpage.html
www.Amelox.com/Tutor-Start.html

These URLs are not listed nor evaluated:
www.amelox.com/orderpage-ES.html
www.Amelox.com/Tutor-Start-ES.html

The difference between the two is that the content in the latter is in Spanish.
But the HTML5 in both is in English.
Comment 3 Ralph Seabrook 2012-09-08 15:34:17 UTC
sorry, the computer capitalized amelox because there is a period in front.
Comment 4 Ville Skyttä 2012-09-08 18:51:54 UTC
What is the URL of the document that contains those links? http://www.amelox.com/ (the "root" page) does not contain any of those four.
Comment 5 Ralph Seabrook 2012-09-08 20:29:17 UTC
(In reply to comment #4)
> What is the URL of the document that contains those links?
> http://www.amelox.com/ (the "root" page) does not contain any of those four.

Yes, the first two are in the directory. The Link-Checker finds them, too. Please look again. The server is capitalization sensitive.

These URLs are listed and evaluated:
www.amelox.com/orderpage.html
www.amelox.com/Tutor-Start.html

These URLs are not listed nor evaluated:
www.amelox.com/Orderpage-ES.html
www.amelox.com/Tutor-Start-ES.html

Thank you,
Rolf
Comment 6 Ville Skyttä 2012-09-09 07:19:49 UTC
What's needed is the URL of the document that links to the documents you mentioned. The main page does not:

$ curl -s http://www.amelox.com/ | grep -Pi '(orderpage|tutor-start)'
(produces no output)
Comment 7 Ralph Seabrook 2012-09-09 17:44:48 UTC
Thank you.
I see now what is 'wrong':  Each page needs a referrer page. I fixed that even though I had not intended it at this time.

I also noted that sub-directory stand-alone pages can be checked separately.
That was not the case previously and it solves the problem. 

Thank you,
Rolf
Comment 8 Ville Skyttä 2012-09-10 06:14:21 UTC
Yes, if there are no links to a page, the link checker has no way of knowing that it exists.