// Internet Duct Tape

WordPress.com 7 Day Referrer Parser

This is the Perl script I used to parse the WordPress.com 7 Day Referrer logs and generate the statistics and graphs for my “I Digg It” series on getting dugg by digg.com. I’m releasing it for my own archival purposes and because someone might find it useful, but I don’t intend it for general use and I’m not supporting it.

Download the Script

You’ll have to rename the file extension from .txt to .pl.

WordPress.com 7 Day Referrer Perl Script

View the Script

View the perl script as a web page

Sample CSV File

This will make more sense if you’ve looked at the script. :)

arturogoga.com,long tail
adrian.warnock.info,long tail
bitelia.com,long tail
blog.guykawasaki.com,other articles
blog.muehe.eu.org,long tail
blog.outer-court.com,long tail
blogcritics.org,other articles
bloglines.com,REMOVE
board.progaming.it,other articles
chris.pirillo.com,chris.pirillo.com
coolthingoftheday.blogspot.com,long tail
del.icio.us,del.icio.us
digg.com,digg.com
diggdot.us,diggdot.us
downloadsquad.com,downloadsquad.com
en.wikipedia.org,normal traffic to article
engtech.wordpress.com,other articles
featured.gigaom.com,other articles
forums.mozillazine.org,long tail
furl.net,furl.net
garyfeng.com,long tail
geeknewscentral.com,normal traffic to article
gigaom.com,other articles
google.com,REMOVE
googlesystem.blogspot.com,googlesystem.blogspot.com
grinn.net,normal traffic to article
itpro.no,long tail
jkontherun.blogs.com,long tail
joel.reddit.com,other articles
lifehack.org,lifehack.org
lifehacker.com,lifehacker.com
mail.google.com,REMOVE
morgat.blogspot.com,long tail
motoricerca.net,long tail
myweb.yahoo.com,REMOVE
myweb2.search.yahoo.com,REMOVE
netvibes.com,REMOVE
newsgator.com,REMOVE
newshutch.com,REMOVE
offline.computerra.ru,long tail
paul.kedrosky.com,long tail
popurls.com,popurls.com
rr-bb.com,other articles
programming.reddit.com,other articles
randsinrepose.com,other articles
sandbox.sourcelabs.com,REMOVE
scheduleworld.com,normal traffic to article
scobleizer.wordpress.com,other articles
solsie.com,long tail
spratmedia.pbwiki.com,long tail
stumbleupon.com,stumbleupon.com
superdeluxe.ch,long tail
tech-tag.com,long tail
techcrunch.com,other articles
techmeme.com,techmeme.com
technorati.com,other articles
theweblist.net,REMOVE
tinyscreenfuls.com,normal traffic to article
verden.abcsok.no,REMOVE
wmugperu.org,long tail
wordpress.com,other articles
wptheme.wordpress.com,other articles
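
If you haven’t looked at the script, here is a rough idea of how a mapping file like this could be applied. This is a minimal Python sketch, not the original Perl script. It assumes the first column is a referrer domain, the second is the category its hits get counted under, and that REMOVE means "leave this referrer out of the stats". The function names, the "www." stripping, the "long tail" default for unlisted domains, and the hit counts are all made up for illustration.

```python
import csv
import io
from collections import Counter


def load_mapping(csv_text):
    """Parse 'domain,category' rows into a dict keyed by lowercase domain."""
    mapping = {}
    for row in csv.reader(io.StringIO(csv_text)):
        if len(row) != 2:
            continue  # skip blank or malformed lines
        domain, category = row[0].strip().lower(), row[1].strip()
        mapping[domain] = category
    return mapping


def categorize(referrer_hits, mapping, default="long tail"):
    """Sum hits per category, dropping domains mapped to REMOVE.

    referrer_hits is an iterable of (domain, hits) pairs.
    The default category for unlisted domains is an assumption.
    """
    totals = Counter()
    for domain, hits in referrer_hits:
        domain = domain.lower()
        if domain.startswith("www."):
            domain = domain[4:]  # assumption: treat www.example.com as example.com
        category = mapping.get(domain, default)
        if category == "REMOVE":
            continue
        totals[category] += hits
    return totals


# A few rows copied from the sample CSV above.
SAMPLE = """\
digg.com,digg.com
google.com,REMOVE
techcrunch.com,other articles
bitelia.com,long tail
"""

if __name__ == "__main__":
    mapping = load_mapping(SAMPLE)
    # Hypothetical hit counts, not real data from the logs.
    hits = [
        ("digg.com", 1200),
        ("www.google.com", 300),
        ("techcrunch.com", 45),
        ("bitelia.com", 7),
    ]
    for category, count in categorize(hits, mapping).most_common():
        print(category, count)
```

Big referrers such as digg.com map to themselves so they get their own line in a graph, while the small ones are lumped into shared buckets like "long tail" or "other articles".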
