Reviews: Joint Active Feature Acquisition and Classification with Variable-Size Set Encoding

Neural Information Processing Systems 

I would think one would start with the feature acquisition aspect and then proceed to deep learning (although I think it really is secondary). The related work section is basically a'laundry list' of papers. Without reading them, they are mostly just citations -- while this is an opportunity to provide a general framework and demonstrate how this fits together.