Review for NeurIPS paper: Semantic Visual Navigation by Watching YouTube Videos
–Neural Information Processing Systems
Weaknesses: The first weakness of this work is the lack of analysis of the overall video-to-experience framework. Each component in this pipeline can introduce error(s) and assumption(s) that must be carefully considered and analyzed. It would greatly aid this work to include discussion of the assumptions taken on by each component, provide discussion about error introduced by each component, and discuss alternative components (and why the chosen ones were used over them). As an example, for the inverse dynamics model: What are the "handful of environments" that are used to train the inverse dynamics model? How different are they from the evaluation setting?
Neural Information Processing Systems
Jan-23-2025, 01:23:36 GMT
- Technology: