Are you into scraping web content and that kinf of shady stuff?
Then go check out these datasets:
http://www.seomoz.org/blog/datasets-for-seo
And check this scraper program/plugin thingy:
http://www.outwit.com/products/hub/
There might even be some white hat uses for this crap 🙂