AITopics | revenge

PoE World Compositional World Modeling with Products of Experts

Neural Information Processing SystemsJun-23-2026, 06:08:14 GMT

Learning how the world works is central to building AI agents that can adapt to complex environments. Traditional world models based on deep learning demand vast amounts of training data, and do not flexibly update their knowledge from sparse observations. Recent advances in program synthesis using Large Language Models (LLMs) give an alternate approach which learns world models represented as source code, supporting strong generalization from little data. To date, application of program-structured world models remains limited to natural language and grid-world domains. We introduce a novel program synthesis method for effectively modeling complex, non-gridworld domains by representing a world model as an exponentially-weighted product of programmatic experts (PoE-World) synthesized by LLMs. We show that this approach can learn complex, stochastic world models from just a few observations. We evaluate the learned world models by embedding them in a model-based planning agent, demonstrating efficient performance and generalization to unseen levels on Atari's Pong and Montezuma's Revenge.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre: Research Report > Experimental Study (1.00)

Industry:

Education (1.00)
Health & Medicine (0.92)
Leisure & Entertainment > Games > Computer Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

3b712de48137572f3849aabd5666a4e3-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 23:07:23 GMT

artificial intelligence, landmark, training step, (18 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (0.30)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.33)

Add feedback

Parametrically Seek

Neural Information Processing SystemsFeb-11-2026, 22:42:24 GMT

artificial intelligence, pmax, turneretal, (15 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology: Information Technology > Artificial Intelligence (0.96)

Add feedback

Entropic Desired Dynamics for Intrinsic Control: Supplemental Material Steven Hansen

Neural Information Processing SystemsFeb-8-2026, 22:46:44 GMT

While this is not close to the state-of-the-art in general (c.f. Figure 2 shows the effect of action entropy on exploratory behavior in Montezuma's Revenge. Number of unique avatar positions visited. Full training curves across all 6 Atari games are shown in Figure 1, including the random policy baseline. To ensure this didn't hamper performance, we At each state visited by the agent evaluator during training, the agent's state (consisting of the avatar's The full curves are included for completeness. The compute cluster we performed experiments on is heterogenous, and has features such as host-sharing, adaptive load-balancing, etc.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: Oceania > Australia > New South Wales > Sydney (0.04)

Industry: Leisure & Entertainment > Games > Computer Games (0.57)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.52)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 13:44:10 GMT

abstract representation, exploration, forward intrinsic reward planning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

PoE-World: Compositional World Modeling with Products of Programmatic Experts

Piriyakulkij, Wasu Top, Liang, Yichao, Tang, Hao, Weller, Adrian, Kryven, Marta, Ellis, Kevin

arXiv.org Artificial IntelligenceNov-21-2025

Learning how the world works is central to building AI agents that can adapt to complex environments. Traditional world models based on deep learning demand vast amounts of training data, and do not flexibly update their knowledge from sparse observations. Recent advances in program synthesis using Large Language Models (LLMs) give an alternate approach which learns world models represented as source code, supporting strong generalization from little data. To date, application of program-structured world models remains limited to natural language and grid-world domains. We introduce a novel program synthesis method for effectively modeling complex, non-gridworld domains by representing a world model as an exponentially-weighted product of programmatic experts (PoE-World) synthesized by LLMs. We show that this approach can learn complex, stochastic world models from just a few observations. We evaluate the learned world models by embedding them in a model-based planning agent, demonstrating efficient performance and generalization to unseen levels on Atari's Pong and Montezuma's Revenge. We release our code and display the learned world models and videos of the agent's gameplay at https://topwasu.github.io/poe-world.

machine learning, natural language, world model, (18 more...)

arXiv.org Artificial Intelligence

2505.10819

Genre: Research Report > Experimental Study (1.00)

Industry:

Education (1.00)
Health & Medicine (0.92)
Leisure & Entertainment > Games > Computer Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback

The Right Is Attacking a Franchise It Once Loved. The Reason Why Is Laughable.

SlateOct-11-2025, 14:00:00 GMT

A new video game sparked fury and accusations of wokeness in entertainment. But we've played this game before--and it's boring. Back in the summer of 2020, during the first year of COVID lockdowns, two first-party PlayStation games were released back-to-back, just a month apart: and . Upon release, was pretty beloved by a specific right-wing culture-war gamer crowd, who placed it on a pedestal specifically as a way to directly attack . While is far from perfect (for example, Neil Druckmann, the game's creator and co-director, took inspiration from the Israel-Palestine conflict that was criticized for both-sidesism), but the game's sin on release for many on the political right was that it took a series whose lead was previously a man and continued its story with one lead who was a lesbian and another whose appearance was deemed too masculine for these players to be attracted to her.

advertisement advertisement, once loved, sign, (12 more...)

Slate

Country:

Asia > Middle East > Palestine (0.25)
Asia > Middle East > Israel (0.25)
North America > United States (0.05)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Games (0.64)
Information Technology > Communications > Social Media (0.51)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 00:36:45 GMT

abstract representation, artificial intelligence, exploration, (13 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

3b712de48137572f3849aabd5666a4e3-AuthorFeedback.pdf

Neural Information Processing SystemsOct-2-2025, 13:56:35 GMT

artificial intelligence, landmark, training step, (18 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (0.30)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.33)

Add feedback

AI has created a new breed of cat video: addictive, disturbing and nauseatingly quick soap operas

The GuardianAug-17-2025, 15:00:33 GMT

At the (tail) end of 2024, Billie Eilish sat cross-legged on stage and began to miaow. Her fans erupted in harmony, each belting out an off-key miaow of their own. This is because Eilish's Oscar-winning track What Was I Made For? – a lachrymose Barbie cut lamenting adulthood's entailing ennui – has become the default soundtrack for a new breed of cat video. You may recognise it: the song often plays over the top of these AI-generated fantasias featuring a cartoonishly fat cat or an equally buff feline with a suspiciously veiny human body. The cat cheats on her lover, falls pregnant or seeks revenge in a weirdly condensed soap opera.

nauseatingly quick soap opera, soap opera, video, (9 more...)

The Guardian

Country: North America > United States > California > Los Angeles County > Beverly Hills (0.05)

Industry: