Review for NeurIPS paper: Neural Execution Engines: Learning to Execute Subroutines


Weaknesses: In general, I think the technical novelty of this work is limited. In particular, the authors claim that an additional mask-prediction component is necessary to achieve generalization. My understanding is that the training supervision for the NEE includes the desired mask at each execution step, which corresponds to the data pointers. However, it is unclear whether the training supervision for the baseline Transformer also includes the ground-truth masks, or only the output value at each step. In short, I want to know whether the improvement comes from the finer-grained supervision or from the architectural design.
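To make the distinction concrete, the two supervision regimes I have in mind can be sketched as follows. This is a toy illustration of my question, not the paper's actual training code; all names, shapes, and loss choices are hypothetical.

```python
import math

def cross_entropy(logits, target):
    """Softmax cross-entropy for a single categorical prediction."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def binary_ce(p, y):
    """Binary cross-entropy for one mask bit with predicted probability p."""
    eps = 1e-9
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

# Hypothetical per-step model outputs and targets
value_logits = [0.1, 2.0, -1.0]  # predicted output value (3 classes, illustrative)
true_value = 1                   # ground-truth output value at this step
mask_probs = [0.9, 0.2]          # predicted data-pointer mask (2 slots, illustrative)
true_mask = [1, 0]               # ground-truth mask at this step

# Regime A: value-only supervision (what the baseline Transformer may receive)
loss_a = cross_entropy(value_logits, true_value)

# Regime B: value + mask supervision (what the NEE appears to receive)
loss_b = loss_a + sum(binary_ce(p, y) for p, y in zip(mask_probs, true_mask))
```

If the baseline was trained only under regime A, an ablation of the baseline under regime B would disentangle the contribution of the extra supervision from that of the architecture.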