s3: You Don't Need That Much Data to Train a Search Agent via RL

Open in new window