PEPS: Quantum-Inspired Reinforcement Learning for Coherent Reasoning Traces in LLMs