Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers