AITopics | system

system

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Representation Balancing MDPs for Off-policy Policy Evaluation

Yao Liu, Omer Gottesman, Aniruddh Raghu, Matthieu Komorowski, Aldo A. Faisal, Finale Doshi-Velez, Emma Brunskill

Neural Information Processing Systems | Mar-13-2026, 06:54:47 GMT

See Corollary 2 in Appendix for detail.

artificial intelligence, machine learning, representation balancing mdp, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Industry: Health & Medicine (0.32)

Technology: Information Technology > Artificial Intelligence (0.73)

Multi-Agent Common Knowledge Reinforcement Learning

Christian Schroeder de Witt, Jakob Foerster, Gregory Farquhar, Philip Torr, Wendelin Boehmer, Shimon Whiteson

Neural Information Processing Systems | Feb-15-2026, 05:56:21 GMT

Figure 3: Game matrices A (top) and B (bottom) [left]. All experiments use SMAC settings for comparability (see Samvelyan et al. (2019) and Appendix B for details).

machine learning, mackrl, reinforcement learning, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Africa > Sudan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.66)

Re-evaluating evaluation

David Balduzzi, Karl Tuyls, Julien Perolat, Thore Graepel

Neural Information Processing Systems | Feb-14-2026, 15:22:46 GMT

Consider An n= grad (r)+ A grad (r)+ C| 0 B @ 01 10 ... 1 C

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry: Leisure & Entertainment > Games (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.48)

A Unified Framework for Extensive-Form Game Abstraction with Bounds

Christian Kroer, Tuomas Sandholm

Neural Information Processing Systems | Feb-14-2026, 02:06:46 GMT

In '00: Revised the Second International Conference on Computers and Games, 333-345, 2000.

artificial intelligence, intelligence, sandholm, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.15)
North America > United States > Texas (0.05)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.72)

Online Reciprocal Recommendation with Theoretical Performance Guarantees

Claudio Gentile, Nikos Parotsidis, Fabio Vitale

Neural Information Processing Systems | Feb-13-2026, 20:21:36 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, assumption, smile, (17 more...)

Neural Information Processing Systems

Country:

Europe > Italy > Lazio > Rome (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)
North America > United States (0.04)
(2 more...)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.97)
Information Technology > Communications > Social Media (0.95)

Credit Assignment For Collective Multiagent RL With Global Rewards

Duc Thien Nguyen, Akshat Kumar, Hoong Chuin Lau

Neural Information Processing Systems | Feb-13-2026, 19:15:43 GMT

Neural Information Processing Systems http://nips.cc/

agent, conference, credit assignment, (13 more...)

Neural Information Processing Systems

Country:

Asia > Singapore (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Industry:

Transportation (0.94)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)

Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks

Neural Information Processing Systems | Dec-23-2025, 18:53:25 GMT

This paper considers a class of reinforcement learning problems that involve systems with two types of states: stochastic and pseudo-stochastic. In such systems, stochastic states follow a stochastic transition kernel, while the transitions of pseudo-stochastic states are deterministic given the stochastic states/transitions. We refer to such systems as mixed systems, which are widely used in various applications, including manufacturing systems, communication networks, and queueing networks. We propose a sample-efficient RL method that accelerates learning by generating augmented data samples. The proposed algorithm is data-driven (model-free), but it learns the policy from both real and augmented samples. This method significantly improves learning by reducing the sample complexity such that the dataset only needs to have sufficient coverage of the stochastic states. We analyze the sample complexity of the proposed method under Fitted Q Iteration (FQI) and demonstrate that the optimality gap decreases as $O\left(\sqrt{\frac{1}{n}}+\sqrt{\frac{1}{m}}\right)$, where $n$ represents the number of real samples, and $m$ is the number of augmented samples per real sample. It is important to note that without augmented samples, the optimality gap is $O(1)$ due to the insufficient data coverage of the pseudo-stochastic states. Our experimental results on multiple queueing network applications confirm that the proposed method indeed significantly accelerates both deep Q-learning and deep policy gradient.
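
The augmented-sample idea in the abstract above can be illustrated with a toy single-server queue. This is only a sketch under stated assumptions, not the paper's algorithm: the arrival count is treated as the stochastic part, the queue length as the pseudo-stochastic part, and the names `queue_step` and `augment`, the update rule, and the reward `-q_next` are all hypothetical choices made for illustration. Each observed arrival is reused with `m` virtual queue lengths, so the data only has to cover the stochastic arrivals.

```python
import random


def queue_step(q, arrivals, served):
    """Deterministic queue-length update once the arrival count is known.

    Assumed dynamics (illustrative): q' = max(q + arrivals - served, 0).
    """
    return max(q + arrivals - served, 0)


def augment(real_sample, m, max_queue, rng):
    """Build m augmented transitions from one real transition.

    real_sample = (q, action, arrivals, q_next). Only `arrivals` is
    stochastic; the queue length is pseudo-stochastic, so the observed
    arrival count can be replayed from any virtual queue length.
    """
    _, action, arrivals, _ = real_sample
    augmented = []
    for _ in range(m):
        # Virtual pseudo-stochastic state, drawn uniformly (an assumption).
        q_virtual = rng.randint(0, max_queue)
        # Next state follows deterministically from the observed arrivals.
        q_next = queue_step(q_virtual, arrivals, action)
        # Hypothetical reward: penalize the resulting backlog.
        reward = -q_next
        augmented.append((q_virtual, action, reward, q_next))
    return augmented


if __name__ == "__main__":
    rng = random.Random(0)
    # One real transition: queue 3, serve 1 job, 2 arrivals, next queue 4.
    for transition in augment((3, 1, 2, 4), m=5, max_queue=10, rng=rng):
        print(transition)
```

The resulting tuples could be mixed with the real transitions in a replay buffer before running a standard value-based or policy-gradient update.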

application, augmented sample, sample efficient reinforcement learning, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.85)

40cf27290cc2bd98a428b567ba25075c-Paper-Conference.pdf

Neural Information Processing Systems | Oct-10-2025, 00:27:29 GMT

al-rnn, linear subregion, subregion, (14 more...)

Neural Information Processing Systems

Country:

Europe > United Kingdom > England (0.14)
North America > United States > New York (0.14)
Europe > Germany > Baden-Württemberg (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Communications (0.93)
(2 more...)

Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems

Neural Information Processing Systems | Oct-8-2025, 11:34:50 GMT

Conversational Recommender Systems (CRS) actively elicit user preferences to generate adaptive recommendations. Mainstream reinforcement learning-based CRS solutions heavily rely on handcrafted reward functions, which may not be aligned with user intent in CRS tasks.

artificial intelligence, machine learning, optimization problem, (16 more...)

Neural Information Processing Systems

Country: North America > United States (0.29)

Genre: Research Report > New Finding (0.46)

Technology: