This is an archived snapshot of W3C's public bugzilla bug tracker, decommissioned in April 2019. Please see the home page for more details.

Bug 5221 - Implement SkipHost
Summary: Implement SkipHost
Status: RESOLVED FIXED
Alias: None
Product: LogValidator
Classification: Unclassified
Component: core (show other bugs)
Version: unspecified
Hardware: All All
: P2 enhancement
Target Milestone: ---
Assignee: Olivier Thereaux
QA Contact:
URL: http://lists.w3.org/Archives/Public/w...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2007-10-23 07:03 UTC by Olivier Thereaux
Modified: 2008-11-18 17:40 UTC (History)
1 user (show)

See Also:


Attachments
Patch to check for new ExcludeHosts configuration directive (1.78 KB, patch)
2007-10-24 17:26 UTC, Martin B. Smith
Details

Description Olivier Thereaux 2007-10-23 07:03:05 UTC
As suggested by Martin Smith, a way for the log parser to skip certain log records  if they came from specific IP (similar to the SkipHost [1] directive in awstats) would be useful.

[1] http://awstats.sourceforge.net/docs/awstats_config.html#SkipHosts

Ways to filter out on other criteria (referer, etc) also suggested.
Comment 1 Martin B. Smith 2007-10-24 17:26:43 UTC
Created attachment 495 [details]
Patch to check for new ExcludeHosts configuration directive

Patch to add new configuration behavior and corresponding configuration file entry:
-- begin
## ExcludeHosts : records that should not be processed ##
## space seperated list of expressions
## regexp-like syntax for match on the remote host or address
## DEFAULT = None
## NOTE: This uses whatever is in your log file, so you may want to ask Apache to start/stop resolving
##       remote addresses to host names if you want to do hostname-based matching
# ExcludeHosts ^localhost ^127\.0\.0\.1$
ExcludeHosts www\.bebr\.ufl\.edu ^inp-susanf
Comment 2 Martin B. Smith 2007-10-24 17:27:50 UTC
I chose to implement a directive with a name similar to that of ExcludeAreas:

## ExcludeHosts : records that should not be processed ##
## space seperated list of expressions
## regexp-like syntax for match on the remote host or address
## DEFAULT = None
## NOTE: This uses whatever is in your log file, so you may want to ask Apache to start/stop resolving
##       remote addresses to host names if you want to do hostname-based matching
# ExcludeHosts ^localhost ^127\.0\.0\.1$
Comment 3 Martin B. Smith 2007-10-24 17:29:24 UTC
(In reply to comment #1)
By the way, my original comment had an extra line that I was using to test -- it can be removed in favor of the next one :)

Comment 4 Olivier Thereaux 2008-11-18 15:48:51 UTC
Hi Martin,

Not sure why this disappeared from my radar for so long. I'm looking at your patch(es) now.
Comment 5 Olivier Thereaux 2008-11-18 17:15:08 UTC
OK, modulo one small issue with the excluding of records when a host/IP was matching, the patches were functional. I made a little fix and we're good to go (in CVS, and in CPAN by the end of the week).
Comment 6 Martin B. Smith 2008-11-18 17:40:44 UTC
Thanks again Olivier. I'm glad it finally made it in :)