On the Optimality of Tracking Fisher Information in Adaptive Testing with Stochastic Binary Responses
Kim, Sanghwa, Ahn, Dohyun, Min, Seungki
Adaptive testing and sequential estimation problems have recently gained substantial attention due to their foundational role in modern artificial intelligence and interactive systems. Prominent applications include online preference learning, where systems dynamically adapt to user feedback to refine personalized recommendations, and reinforcement learning from human feedback (RLHF), which aims to align AI agents with human values by adaptively querying users. In these contexts, the main focus is to efficiently extract maximal information from human responses, which are inherently stochastic and limited in quantity. Among various types of such problems, this work particularly considers a fundamental yet illustrative case involving stochastic binary responses. Here, a decision-maker sequentially selects questions of varying difficulty from a continuous pool to pose to a candidate and aims to efficiently estimate the candidate's ability (represented by an unknown continuous parameter) by utilizing the binary feedback (e.g., correct/incorrect) collected, which depends probabilistically on the candidate's ability and the question's difficulty. This setup is arguably the simplest scenario that captures the essence of continuous parameter estimation under uncertainty, making it an ideal benchmark for developing fundamental theoretical insights and practical algorithms. Variants of this fundamental adaptive estimation problem have been studied in several communities.
Oct-10-2025
- Country:
- Asia
- China > Hong Kong (0.04)
- South Korea > Seoul
- Seoul (0.04)
- North America > United States
- New Jersey > Middlesex County > Piscataway (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.45)
- Industry:
- Education (0.92)