A Bounded rationality maximum entropy and Boltzmann rational policies

Neural Information Processing Systems 

Given the constraint that the human's expected reward is satisfactory, how should we pick a distribution to model the human's choices?

Similar Docs  Excel Report  more

TitleSimilaritySource
None found