Goto

Collaborating Authors

 appendixb





AppendixA AppendixB) AppendixC

Neural Information Processing Systems

A.2 ExpertRollouts The expert rollouts consist of acollection of HDF5 files, one file per clip. A.3 HostingPlan The link to the dataset can be found on the project website. The dataset website also includes the policies we trained in Section 5, i.e., the multi-clip tracking policies, RL-trained taskpolicies, andtheGPTpolicy. Training clip experts to track long clips is potentially slow and laborious, so wefollowMerel etal.[2019]bydividing Each expert is a neural network with three hidden layers, 1024 neurons in each hidden layer, and thetanh activation.



learning

Neural Information Processing Systems

Consideranews recommendation website that, when presented with a new user, sequentially offers a selection of currently trending articles. Such asystem may only haveafewopportunities tomakerecommendations before the user decides to navigate away, leaving little time to correct for misspecified or underspecified prior knowledge.