TowardtheFundamentalLimitsofImitation Learning

Neural Information Processing Systems 

We then propose a novel algorithm based on minimum-distance functionals in the setting where the transition model is given and the expert is deterministic.Thealgorithmissuboptimalby .|S|H3/2/N,matchingourlower

Similar Docs  Excel Report  more

TitleSimilaritySource
None found