Goto

Collaborating Authors

 Education




Small batch deep reinforcement learning

Neural Information Processing Systems

Since the policy used to collect transitions is changing throughout learning, the replay memory contains data coming from a mixture of policies (that differ from the agent's current policy), and


Few-ShotContinualActiveLearningbyaRobot

Neural Information Processing Systems

The framework also uses uncertainty measures on the Gaussian representations of thepreviously learned classes tofindthemost informativesamples tobelabeled in an increment. We evaluate our approach on the CORe-50 dataset and on a real humanoid robot for the object classification task.


Best-of-All-WorldsBoundsfor OnlineLearningwithFeedbackGraphs

Neural Information Processing Systems

Here, θ() is the clique coveringnumberofthegraph. Online learning models a repeated interaction between a learner and an environment.