LearningCompositionalNeuralPrograms withRecursiveTreeSearchandPlanning
–Neural Information Processing Systems
NPI contributes structural biases in the form of modularity, hierarchy and recursion, which are helpful to reduce sample complexity, improve generalization and increase interpretability. AlphaZero contributes powerful neural network guided search algorithms, which we augment with recursion. AlphaNPI only assumes a hierarchical program specification with sparse rewards: 1 when the program execution satisfies the specification, and 0otherwise. This specification enables us to overcome the need for strong supervision in the form of execution traces andconsequently trainNPImodels effectivelywithreinforcement learning.
Neural Information Processing Systems
Feb-13-2026, 00:54:09 GMT