Training Factor Graphs with Reinforcement Learning for Efficient MAP Inference