Goto

Collaborating Authors

 tothebestofourknowledge





Exponentially Weighted Imitation Learning for Batched Historical Data

Qing Wang, Jiechao Xiong, Lei Han, peng sun, Han Liu, Tong Zhang

Neural Information Processing Systems

We consider deep policy learning with only batched historical trajectories. The main challenge of this problem is that the learner no longer has a simulator or "environment oracle" as in most reinforcement learning settings.




21be9a4bd4f81549a9d1d241981cec3c-AuthorFeedback.pdf

Neural Information Processing Systems

We agree with the reviewers that reporting classification accuracy is important. As described in the paper and as noted by Reviewer 1, augmentation by adding more26 channels has been used inthe contextofResNets (e.g.




Self-SupervisedLearningbyCross-Modal Audio-VideoClustering

Neural Information Processing Systems

The first challenge is the exorbitant cost of scaling up the size of manually-labeled video datasets. The recent creation of large-scale action recognition datasets [5,15,25,26]hasundoubtedly enabled amajor leap forwardinvideo models accuracies.