A study of local optima for learning feature interactions using neural networks