We contributeNeuralSolver, a novel recurrent solver that can efficiently and consistently extrapolate, i.e., learn algorithms from smaller problems (in terms of observation size) and execute those algorithms in large problems.
Toevaluate A2-winner, awidesettestenvir, including features searched importantly robotsof different games [1]). Rainbow DrQ [22]Random Human Mean Human-Norm' d0.568 0.381 0.285 0.3570.000
After annotating the entire dataset in each language, there was an additional annotator for each language who reviewed the entire set. Annotators were volunteers, and theyare acknowledged at theendofthiswork.