Data-Efficient Learning via Minimizing Hyperspherical Energy