Hindsight Credit Assignment

Neural Information Processing Systems 

A reinforcement learning (RL) agent is tasked with two fundamental, interdependent problems: exploration (how to discover useful data), and credit assignment (how to incorporate it). The simplest way of estimating the value function is by averaging returns (future discounted sums of rewards) starting from taking action a in state x.
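
The return-averaging baseline described above can be illustrated with a minimal sketch. This is not code from the paper: the function names, the trajectory format (lists of `(state, action, reward)` tuples), and the every-visit averaging scheme are assumptions chosen for illustration.

```python
from collections import defaultdict


def discounted_return(rewards, gamma):
    """Discounted sum of rewards: r_0 + gamma * r_1 + gamma^2 * r_2 + ..."""
    g = 0.0
    for r in reversed(rewards):
        g = r + gamma * g
    return g


def mc_action_values(episodes, gamma=0.9):
    """Every-visit Monte Carlo estimate of the action value Q(x, a).

    episodes: list of trajectories, each a list of
              (state, action, reward) tuples (a hypothetical format).
    Returns a dict mapping (state, action) to the average observed return.
    """
    totals = defaultdict(float)  # sum of returns seen from each (x, a)
    counts = defaultdict(int)    # number of visits to each (x, a)
    for episode in episodes:
        g = 0.0
        # Walk backwards so the return from each step is built incrementally.
        for state, action, reward in reversed(episode):
            g = reward + gamma * g
            totals[(state, action)] += g
            counts[(state, action)] += 1
    return {key: totals[key] / counts[key] for key in totals}


# Two short hypothetical episodes that both start with action "a" in state "x".
episodes = [
    [("x", "a", 1.0), ("y", "b", 2.0)],
    [("x", "a", 0.0), ("y", "b", 4.0)],
]
q = mc_action_values(episodes, gamma=0.5)
```

Each estimate here is simply the mean of the discounted returns that followed a given state-action pair, so every reward after the pair is credited to it regardless of whether the action actually influenced that reward.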
