$f$-Divergence Based Classification: Beyond the Use of Cross-Entropy