Coping With Noise in a Real-World Weblog Crawler and Retrieval System