Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey
