Integrating Probabilistic Rules into Neural Networks: A Stochastic EM Learning Algorithm