TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks

Open in new window