Single-partition adaptive Q-learning