A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning

Neural Information Processing Systems 

In contrast, the mathematics needed to analyze such schemes is the focus of Stochastic Approximation (SA) theory [2, 4]. More generally, SA refers to an iterative scheme for finding zeroes or optimal points of a function for which only noisy evaluations are possible.
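
As a rough illustration of such an iterative scheme, the following is a minimal sketch of a classical Robbins-Monro style update, x_{n+1} = x_n + a_n (h(x_n) + noise), with step size a_n = 1/n. The target function h(x) = 2 - x, the Gaussian noise model, and the names `robbins_monro` and `noisy_h` are assumptions made for this example only; they are not taken from the paper.

```python
import random


def robbins_monro(noisy_h, x0, n_steps=20000, seed=0):
    """Seek a zero of h using only noisy evaluations of h.

    Hypothetical helper for illustration: applies the update
    x <- x + (1/n) * noisy_h(x) for n = 1, ..., n_steps.
    """
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    x = x0
    for n in range(1, n_steps + 1):
        step = 1.0 / n  # decaying step size: sum diverges, sum of squares converges
        x = x + step * noisy_h(x, rng)
    return x


def noisy_h(x, rng):
    # Assumed example: h(x) = 2 - x has its zero at x* = 2.
    # Each evaluation is corrupted by zero-mean Gaussian noise.
    return (2.0 - x) + rng.gauss(0.0, 1.0)


estimate = robbins_monro(noisy_h, x0=0.0)
print(f"estimated zero: {estimate:.4f}")
```

With this particular choice of h and step size, the iterate reduces to a running average of the noisy observations, so the estimate settles near the zero at 2 even though no single evaluation of h is exact.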
