00989c20ff1386dc386d8124ebcba1a5-AuthorFeedback.pdf

Neural Information Processing Systems

We thank all the reviewers for their helpful feedback and positive view of our work. To address the reviewers' concerns (R3, R4, R5), we have added a comparison to Duan et al.'s One-Shot Imitation Learning in Tables 3 and 2, a comparison to a non-normalized TECNet ablation, as well as an evaluation on a ViZDoom navigation task in Table 1. We believe that these additions address all of the main reviewer concerns. We evaluate our method in ViZDoom, where the goal is to visit waypoints in a predetermined order. Is the embedding normalized in the same way in your model? We do not normalize the CPV embeddings. To tease apart the effects of normalization, we have added a comparison to non-normalized versions of TECNet, which are labeled "TE" (task embedding) in Tables 3 and 2.
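As an illustration only, the difference between a normalized and a non-normalized embedding might be sketched as follows. This assumes that "normalized" means scaling the vector to unit L2 norm, which the feedback does not state explicitly; the function name `l2_normalize` and the example vector are hypothetical and are not taken from the authors' code.

```python
import math


def l2_normalize(embedding, eps=1e-12):
    """Scale a vector to unit Euclidean length.

    `eps` guards against division by zero for an all-zero vector.
    """
    norm = math.sqrt(sum(x * x for x in embedding))
    return [x / max(norm, eps) for x in embedding]


# Hypothetical task embedding, left as produced by an encoder.
raw = [3.0, 4.0]

# Normalized variant: same direction, unit length.
unit = l2_normalize(raw)
```

A non-normalized embedding keeps its magnitude, so comparing the two variants isolates whatever effect the rescaling step has.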




AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments

Neural Information Processing Systems

Similar to audio-visual navigation tasks, the goal of our embodied agent is to localize an audio event via navigating the 3D visual world; however, the agent may also seek help from a human (oracle), where the assistance is provided in free-form natural language.