Learning sum of diverse features: computational hardness and efficient gradient-based training for ridge combinations