AITopics | Agents

We first take a theoretical approach to analyzing debate and provide a framework through which debate can be mathematically examined. Building on this framework, we provide several theoretical results for multi-agent debate.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > West Virginia (0.04)
North America > United States > Virginia (0.04)
North America > United States > California > Santa Cruz County > Santa Cruz (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry:

Media (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Joint Policy Search for Multi-agent Collaboration with Imperfect Information

Neural Information Processing Systems Feb-10-2026, 21:37:26 GMT

To learn good joint policies for multi-agent collaboration with imperfect information remains a fundamental challenge.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Leisure & Entertainment > Games > Bridge (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.70)

Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes Yi Tian

Neural Information Processing Systems Feb-10-2026, 21:32:26 GMT

We study minimax optimal reinforcement learning in episodic factored Markov decision processes (FMDPs), which are MDPs with conditionally independent transition components.

factored structure, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > Canada (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.85)

e4d8163c7a068b65a64c89bd745ec360-Paper.pdf

Neural Information Processing Systems Feb-10-2026, 21:03:26 GMT

graph, interaction graph, prediction, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.04)
North America > Canada (0.04)
Europe (0.04)
Asia > China > Guangxi Province > Nanning (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

e366d105cfd734677897aaccf51e97a3-Paper.pdf

Neural Information Processing Systems Feb-10-2026, 20:27:43 GMT

chance move, correlated equilibrium, information, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
North America > Canada > British Columbia > Vancouver (0.04)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Fair Scheduling for Time-dependent Resources

Neural Information Processing Systems Feb-10-2026, 20:26:39 GMT

The machines gain possibly different utilities by processing different jobs, and all jobs assigned to the same machine should be processed without overlap.

algorithm, allocation, artificial intelligence, (17 more...)

Neural Information Processing Systems

Country: Europe > Monaco (0.04)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)

411fa9d368b5485be4c6bb62615b365e-Supplemental-Conference.pdf

411fa9d368b5485be4c6bb62615b365e-Paper-Conference.pdf

e97c864e8ac67f7aed5ce53ec28638f5-Paper.pdf

e7c573c14a09b84f6b7782ce3965f335-Paper.pdf

Multi-LLM Debate: Framework, Principals, and Interventions
