Is This Collection Worth My LLM's Time? Automatically Measuring Information Potential in Text Corpora