Multi-Path Collaborative Reasoning via Reinforcement Learning