Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation