Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective