AITopics | Optimization

The joint decisions of the agents influence both individual rewards and the transition of the environment. MARL in general is occupied with leading the multi-agent system to a favorable outcome. Through the lens of game theory, the notion of a "favorable outcome" is formally defined through concepts like a Nash

adversary, artificial intelligence, optimization problem, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Middle East > Jordan (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(6 more...)

Genre: Research Report > Experimental Study (0.92)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

d2f6f1dfbf9cd89a78c5a58ef0dec245-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 06:54:59 GMT

artificial intelligence, machine learning, reinforcement learning, (21 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(4 more...)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Efficient Model-Free Exploration in Low-Rank MDPs

Neural Information Processing SystemsFeb-17-2026, 06:54:43 GMT

What are the right computational primitives for exploration?

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > North Carolina > Wake County > Raleigh (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.46)

Add feedback

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees Dohyeong Kim

Neural Information Processing SystemsFeb-17-2026, 06:53:18 GMT

However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)

Add feedback

d242dafdb2c5407ae420bc54c9325fdf-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 06:53:11 GMT

artificial intelligence, machine learning, maximization, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Xuzhou (0.04)
North America > United States (0.04)
Asia > China > Beijing > Beijing (0.04)

Industry: Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Communications (0.94)
Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Unraveling the Gradient Descent Dynamics of Transformers

Neural Information Processing SystemsFeb-17-2026, 06:30:27 GMT

By analyzing the loss landscape of a single Transformer layer using Softmax and Gaussian attention kernels, our work provides concrete answers to these questions.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: