AITopics | Agents

To promote better performance-bandwidth trade-off for multi-agent perception, weproposeanovel distilledcollaborationgraph (DiscoGraph)tomodeltrainable, pose-aware, and adaptive collaboration among agents. Our key novelties lie in twoaspects.

artificial intelligence, collaboration, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

f3f1fa1e4348bfbebdeee8c80a04c3b9-Paper.pdf

Neural Information Processing SystemsFeb-19-2026, 11:58:54 GMT

factorization, linear value factorization, value factorization, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs

Neural Information Processing SystemsFeb-19-2026, 11:24:16 GMT

However, calculating the Laplacian spectrum can be expensive in a matrix base.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
North America > United States > District of Columbia > Washington (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

ba5f85ce126aad12075a3ffa68a3e969-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 10:52:35 GMT

algorithm, equilibrium, inequality follow, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Hong Kong (0.04)

Genre: Workflow (0.46)

Industry:

Health & Medicine (0.46)
Energy (0.46)
Government (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

4b0eea69deea512c9e2c469187643dc2-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 10:21:54 GMT

Ta s k: Your task is to boil tin.

large language model, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Asia > China > Hong Kong (0.04)

Genre: Workflow (0.46)

Industry:

Education (0.67)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)
(3 more...)

Add feedback

RMIX: LearningRisk-SensitivePoliciesfor CooperativeReinforcementLearningAgents

Neural Information Processing SystemsFeb-19-2026, 08:36:25 GMT

Current value-based multi-agent reinforcement learning methods optimize individual Q values to guide individuals' behaviours via centralized training with decentralized execution (CTDE). However, such expected, i.e., risk-neutral, Q value is not sufficient even with CTDE due to the randomness of rewards and the uncertainty in environments, which causes the failure of these methods to train coordinating agents incomplexenvironments. Toaddress these issues, we propose RMIX, anovelcooperativeMARL method with theConditional Value at Risk (CVaR) measure over the learned distributions of individuals' Q values. Specifically, we first learn the return distributions of individuals to analytically calculate CVaRfordecentralized execution. Then,tohandle thetemporal nature of the stochastic outcomes during executions, we propose a dynamic risk level predictorforriskleveltuning.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: