Bismi'llah ir-Rahman ir-Rahim
Code
    Free software.

daily link  Thursday, February 20, 2003

« filestats 1.1 »

I just uploaded filestats.py version 1.1 to fix some stupid stuff in 1.0 -- it wouldn't run unless the dns.cache file already existed.

8:33:58 AM  comment ()   permalink  comment on this post by email  


« filestats »

I have just uploaded version 1.0 of a specialized web server log analysis program called filestats. Not a big deal, but it does something that Analog and Webalizer doesn't do, AFAIK. Given a list of one or more files on a server, filestats builds a report that shows who (i.e., what client machines) have accessed those files, by referer (the default) or by UserAgent. So, for example, if you'd like to see who is downloading your RSS feed, grouped by referer, you would say something like this (except all on one line):

python filestats -i access_log
                 -o feedstats.opml
                 -s http://www.example.com
                 -f /weblog/rss.xml

To see the same data, but grouped by UserAgent, just add the switch "-a".

I decided to use OPML as the output format, because I use Radio and ActiveRenderer, which gives me editing, rendering, and publishing without any extra work. And besides, filestats produces hierarchical data, so using an outline format just makes sense.

It's written in Python and relies on Mark Nottingham's very useful Weblog package.

I'm publishing it under the Creative Commons Attribution License, which (as I understand it) means that it's free for any use, commercial or non-commercial, and can be modified or derived from, you just have to give me credit.

I'm open to suggestions and bug reports.

8:30:59 AM  comment ()   permalink  comment on this post by email  


Copyright 2003 © Dave Seidel