Direct Prediction Set Minimization via Bilevel Conformal Classifier Training