Corpus, NOT Corpse: A very stupid wget crawler