AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Open in new window