WordPress.com 7 Day Referrer Parser
This is the perl script I used for parsing the WordPress.com 7 Day Referrer logs to generate the statistics and graphs for my “I Digg It” series on getting dugg by digg.com. I’m release it for my own archival purposes and because someone might find it useful — but I don’t intend it for general usage and I’m not supporting it.
- Sample output – I Digg It
- Instructions for installing Perl on Windows.
Download the Script
You’ll have to rename the file extension from .txt to .pl.
WordPress.com 7 Day Referrer Perl Script
View the Script
View the perl script as a web page
Sample CSV File
This will make more sense if you’ve looked at the script. :)
arturogoga.com,long tail adrian.warnock.info,long tail bitelia.com,long tail blog.guykawasaki.com,other articles blog.muehe.eu.org,long tail blog.outer-court.com,long tail blogcritics.org,other articles bloglines.com,REMOVE board.progaming.it,other articles chris.pirillo.com,chris.pirillo.com coolthingoftheday.blogspot.com,long tail del.icio.us,del.icio.us digg.com,digg.com diggdot.us,diggdot.us downloadsquad.com,downloadsquad.com en.wikipedia.org,normal traffic to article engtech.wordpress.com,other articles featured.gigaom.com,other articles forums.mozillazine.org,long tail furl.net,furl.net garyfeng.com,long tail geeknewscentral.com,normal traffic to article gigaom.com,other articles google.com,REMOVE googlesystem.blogspot.com,googlesystem.blogspot.com grinn.net,normal traffic to article itpro.no,long tail jkontherun.blogs.com,long tail joel.reddit.com,other articles lifehack.org,lifehack.org lifehacker.com,lifehacker.com mail.google.com,REMOVE morgat.blogspot.com,long tail motoricerca.net,long tail myweb.yahoo.com,REMOVE myweb2.search.yahoo.com,REMOVE netvibes.com,REMOVE newsgator.com,REMOVE newshutch.com,REMOVE offline.computerra.ru,long tail paul.kedrosky.com,long tail popurls.com,popurls.com rr-bb.com,other articles programming.reddit.com,other articles randsinrepose.com,other articles sandbox.sourcelabs.com,REMOVE scheduleworld.com,normal traffic to article scobleizer.wordpress.com,other articles solsie.com,long tail spratmedia.pbwiki.com,long tail stumbleupon.com,stumbleupon.com superdeluxe.ch,long tail tech-tag.com,long tail techcrunch.com,other articles techmeme.com,techmeme.com technorati.com,other articles theweblist.net,REMOVE tinyscreenfuls.com,normal traffic to article verden.abcsok.no,REMOVE wmugperu.org,long tail wordpress.com,other articles wptheme.wordpress.com,other articles