Towards Scalable Oversight with Collaborative Multi-Agent Debate in Error Detection