Stochastic Nested Compositional Bi-level Optimization for Robust Feature Learning