AITopics | Agents

4a5876b450b45371f6cfe5047ac8cd45-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 12:55:30 GMT

The goal is to find the global optimal arm, and agents are able to pull any arm; however, they can only observe the reward when the selected arm is local.

artificial intelligence, external arm, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)

Add feedback

4a3050ae2c77da4f9c90e2e58e8e520f-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 12:45:54 GMT

equilibrium, nash equilibrium, refinement, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Games > Poker (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

4a3050ae2c77da4f9c90e2e58e8e520f-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 12:45:50 GMT

artificial intelligence, equilibrium, game theory, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Games > Poker (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai 1,2 Yifei Zhou

Neural Information Processing SystemsFeb-8-2026, 12:17:59 GMT

While training with static demonstrations has shown some promise, we show that such methods fall short for controlling real GUIs due to their failure to deal with real world stochasticity and non-stationarity not captured in static observational data.

large language model, machine learning, reinforcement learning, (22 more...)

Neural Information Processing Systems

Country:

South America > Chile (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Illinois (0.04)
(2 more...)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology > Services (0.68)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

LanguageandVisualEntityRelationshipGraph forAgentNavigation

Neural Information Processing SystemsFeb-8-2026, 12:06:02 GMT

Vision-and-language navigation in the real-world is an important step towards building mobile agents that perceivetheir environments and complete specific tasks following human instructions.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.34)

Add feedback

Efficient Subgame Refinement for Extensive-form Games

Neural Information Processing SystemsFeb-8-2026, 11:46:26 GMT

However, directly applying existing subgame solving techniques may be difficult, due to the intricate nature and substantial size of many real-world games.

artificial intelligence, machine learning, subgame, (18 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
North America > United States > Texas (0.05)
North America > United States > Washington > King County > Redmond (0.04)

Genre: Research Report (0.46)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)

Add feedback

3d17b7f7d52c83ab6e97e2dc0bda2e71-Paper-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 11:26:35 GMT

arxiv preprint arxiv, deepgsb, mfg, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Santa Monica (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Add feedback

47951a40efc0d2f7da8ff1ecbfde80f4-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 11:25:28 GMT

dataset, social posterior collapse, vehicle, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

Add feedback

47951a40efc0d2f7da8ff1ecbfde80f4-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 11:25:22 GMT

Modeling the behavior of intelligent agents is an essential subject for autonomous systems. Safe operations of autonomous agents require accurate prediction of other agents' future motions.

artificial intelligence, arxivpreprintarxiv, trajectory, (16 more...)

Neural Information Processing Systems

Country: North America > United States > California > Alameda County > Berkeley (0.05)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Off-PolicyEvaluationforAction-Dependent Non-StationaryEnvironments

Neural Information Processing SystemsFeb-8-2026, 10:56:25 GMT

Methods for sequential decision making are often built upon a foundational assumption that the underlying decision process is stationary [Sutton and Barto, 2018]. While this assumption was a cornerstone when laying the theoretical foundations of the field, and while is often reasonable, it isseldom trueinpractice andcanbeunreasonable [Dulac-Arnold etal.,2019].

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: