A Continuous Information Gain Measure to Find the Most Discriminatory Problems for AI Benchmarking