Exploring the Generalization Capabilities of AID-based Bi-level Optimization