Optimizing Shallow Networks for Binary Classification