Learning to Benchmark: Determining Best Achievable Misclassification Error from Training Data