Differentiable TAN Structure Learning for Bayesian Network Classifiers