Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models

Neural Information Processing Systems 

Unlike most reinforcement learning agents which require an unrealistic amount of environment interactions to learn a new behaviour, humans excel at learning quickly by merely observing and imitating others.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found