On the Optimality of Tracking Fisher Information in Adaptive Testing with Stochastic Binary Responses

Oct-10-2025–arXiv.org Machine Learning

Adaptive testing and sequential estimation problems have recently gained substantial attention because of their foundational role in modern artificial intelligence and interactive systems. Prominent applications include online preference learning, where a system adapts to user feedback to refine personalized recommendations, and reinforcement learning from human feedback (RLHF), which aims to align AI agents with human values by adaptively querying users. In these contexts, the goal is to extract as much information as possible from human responses, which are inherently stochastic and limited in quantity. This work considers a fundamental yet illustrative case involving stochastic binary responses. A decision-maker sequentially selects questions of varying difficulty from a continuous pool to pose to a candidate, and aims to efficiently estimate the candidate's ability (an unknown continuous parameter) from the binary feedback collected (e.g., correct/incorrect), which depends probabilistically on both the candidate's ability and the question's difficulty. This setup is arguably the simplest scenario that captures the essence of continuous parameter estimation under uncertainty, making it an ideal benchmark for developing fundamental theoretical insights and practical algorithms. Variants of this adaptive estimation problem have been studied in several communities.
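The setup described above can be made concrete with a small simulation. The sketch below is illustrative only and is not the paper's algorithm. It assumes a logistic (Rasch-style) response model, which the abstract does not specify, and it uses a simple Robbins-Monro style update in place of whatever estimator the authors analyze. The names `p_correct`, `fisher_information`, and `simulate`, the gain of 4, and the clipping range are all hypothetical choices made for illustration. Under the assumed model, the Fisher information of a single response is p(1 - p), which is largest when the question's difficulty equals the candidate's ability, so the sketch always asks the question whose difficulty matches the current estimate.

```python
import math
import random


def p_correct(theta, b):
    """Assumed logistic model: P(correct | ability theta, difficulty b)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def fisher_information(theta, b):
    """Fisher information about theta carried by one binary response
    under the assumed logistic model: p * (1 - p)."""
    p = p_correct(theta, b)
    return p * (1.0 - p)


def simulate(true_theta, n_questions, seed=0, lo=-4.0, hi=4.0):
    """Adaptively estimate true_theta from simulated binary responses.

    Each round asks a question whose difficulty equals the current
    estimate (the information-maximizing choice if the estimate were
    exact), then nudges the estimate up after a correct answer and
    down after an incorrect one, with a step size that shrinks as 1/t.
    """
    rng = random.Random(seed)
    estimate = 0.0
    gain = 4.0  # assumption: 1 / (maximum per-response information of 0.25)
    for t in range(1, n_questions + 1):
        b = estimate  # difficulty of the question posed this round
        y = 1 if rng.random() < p_correct(true_theta, b) else 0
        # The predicted success probability at b == estimate is exactly 0.5.
        estimate += (gain / t) * (y - 0.5)
        estimate = min(hi, max(lo, estimate))  # keep within the assumed range
    return estimate


if __name__ == "__main__":
    print(simulate(true_theta=1.0, n_questions=5000))
```

Matching the difficulty to the current estimate keeps each response close to a fair coin flip, which is where a binary answer is most informative under this model. Questions that are far too easy or far too hard yield nearly deterministic answers and therefore little information.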


