Reinforcement Learning Based on On-Line EM Algorithm