A Bounded rationality maximum entropy and Boltzmann rational policies
–Neural Information Processing Systems
Given the constraint that the human's expected reward is satisfactory, how should we pick a distribution to model the human's choices?
Neural Information Processing Systems
Oct-2-2025, 14:13:28 GMT
- Technology: