Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting
Neural Information Processing Systems
In the much more complex football game, URI's policy beat the built-in AIs with a 37% winning rate, while GPT-based agents achieved only a 6% winning rate.
Feb-9-2026, 09:43:58 GMT