AITopics | Europe

Collaborating Authors

Europe

Learning to Communicate with Deep Multi-Agent Reinforcement Learning

Jakob Foerster, Ioannis Alexandros Assael, Nando de Freitas, Shimon Whiteson

Neural Information Processing SystemsApr-22-2026, 03:44:24 GMT

We consider the problem of multiple agents sensing and acting in environments with the goal of maximising their shared utility. In these environments, agents must learn communication protocols in order to share information that is needed to solve the tasks. By embracing deep neural networks, we are able to demonstrate endto-end learning of protocols in complex environments inspired by communication riddles and multi-agent computer vision problems with partial observability. We propose two approaches for learning in these domains: Reinforced Inter-Agent Learning (RIAL) and Differentiable Inter-Agent Learning (DIAL). The former uses deep Q-learning, while the latter exploits the fact that, during learning, agents can backpropagate error derivatives through (noisy) communication channels. Hence, this approach uses centralised learning but decentralised execution. Our experiments introduce new environments for studying the learning of communication protocols and present a set of engineering innovations that are essential for success in these domains.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Without-Replacement Sampling for Stochastic Gradient Methods Ohad Shamir Department of Computer Science and Applied Mathematics Weizmann Institute of Science Rehovot, Israel ohad.shamir@weizmann.ac.il

Neural Information Processing SystemsApr-22-2026, 03:44:07 GMT

Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled with replacement. In contrast, sampling without replacement is far less understood, yet in practice it is very common, often easier to implement, and usually performs better. In this paper, we provide competitive convergence guarantees for without-replacement sampling under several scenarios, focusing on the natural regime of few passes over the data. Moreover, we describe a useful application of these results in the context of distributed optimization with randomly-partitioned data, yielding a nearly-optimal algorithm for regularized least squares (in terms of both communication complexity and runtime complexity) under broad parameter regimes. Our proof techniques combine ideas from stochastic optimization, adversarial online learning and transductive learning theory, and can potentially be applied to other stochastic optimization and learning problems.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.40)
Europe (0.28)

Industry: Education (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.74)

Add feedback

A drone delivered her lethal dose of fentanyl in a church parking lot. Now her dealer is going to prison

Los Angeles TimesApr-22-2026, 03:09:08 GMT

Things to Do in L.A. Tap to enable a layout that focuses on the article. A drone delivered her lethal dose of fentanyl in a church parking lot. The Drug Enforcement Administration was among agencies involved in the investigation. This is read by an automated voice. Please report any issues or inconsistencies here .

artificial intelligence, housing & homelessness politics science, social media, (8 more...)

Los Angeles Times

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.07)
North America > Mexico (0.05)
North America > Guatemala (0.05)
Europe > Ukraine (0.05)

Genre: Personal (0.49)

Industry:

Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine > Therapeutic Area (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback

Meta to capture U.S. employee mouse movements and keystrokes to train AI

The Japan TimesApr-22-2026, 02:37:00 GMT

Meta to capture U.S. employee mouse movements and keystrokes to train AI NEW YORK - Meta is installing new tracking software on U.S.-based employees' computers to capture mouse movements, clicks and keystrokes for use in training its artificial intelligence models, part of a broad initiative to build AI agents that can perform work tasks autonomously, the company told staffers in internal memos. The tool, called Model Capability Initiative (MCI), will run on work-related apps and websites and will also take occasional snapshots of the content on employees' screens, according to one of the memos, posted by a staff AI research scientist on Tuesday in a channel for the company's model-building Meta SuperIntelligence Labs team. The purpose, according to the memo, was to improve the company's AI models in areas where they struggle to replicate how humans interact with computers, like choosing from dropdown menus and using keyboard shortcuts. In a time of both misinformation and too much information, quality journalism is more crucial than ever. By subscribing, you can help us get the story right.

artificial intelligence, iran war earthquake sanae takaichi, social media, (8 more...)

The Japan Times

Country:

Asia > Middle East > Iran (0.45)
North America > United States > New York (0.25)
Oceania > Australia (0.16)
(7 more...)

Industry:

Media > News (0.71)
Consumer Products & Services > Travel (0.56)
Government (0.53)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.78)

Add feedback

Michael and Susan Dell surpass 1 billion in donations backing AI driven hospital project

FOX NewsApr-22-2026, 02:35:52 GMT

Michael Dell and Susan Dell became the first donors to give more than $1 billion to the University of Texas at Austin, funding a massive AI-native hospital and research campus.

artificial intelligence, lifestyle real estate tech science, social media, (8 more...)

FOX News

Country:

North America > United States > Texas > Travis County > Austin (0.35)
North America > Mexico (0.05)
North America > United States > Oregon (0.04)
(5 more...)

Genre: Research Report (0.47)

Industry:

Media > News (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.50)
Government > Regional Government > North America Government > United States Government (0.48)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.32)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Clustering with Bregman Divergences: an Asymptotic Analysis

Chaoyue Liu, Mikhail Belkin

Neural Information Processing SystemsApr-22-2026, 01:55:03 GMT

Clustering, in particular k-means clustering, is a central topic in data analysis. Clustering with Bregman divergences is a recently proposed generalization of k-means clustering which has already been widely used in applications. In this paper we analyze theoretical properties of Bregman clustering when the number of the clusters k is large. We establish quantization rates and describe the limiting distribution of the centers as k, extending well-known results for k-means clustering.

artificial intelligence, bregman divergence, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
Europe (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Safe and Efficient Off-Policy Reinforcement Learning

Remi Munos, Tom Stepleton, Anna Harutyunyan, Marc Bellemare

Neural Information Processing SystemsApr-22-2026, 01:53:54 GMT

In this work, we take a fresh look at some old and new algorithms for off-policy, return-based reinforcement learning. Expressing these in a common form, we derive a novel algorithm, Retrace(λ), with three desired properties: (1) it has low variance; (2) it safely uses samples collected from any behaviour policy, whatever its degree of "off-policyness"; and (3) it is efficient as it makes the best use of samples collected from near on-policy behaviour policies. We analyze the contractive nature of the related operator under both off-policy policy evaluation and control settings and derive online sample-based algorithms. We believe this is the first return-based off-policy control algorithm converging a.s. to Q without the GLIE assumption (Greedy in the Limit with Infinite Exploration). As a corollary, we prove the convergence of Watkins' Q(λ), which was an open problem since 1989. We illustrate the benefits of Retrace(λ) on a standard suite of Atari 2600 games. One fundamental trade-off in reinforcement learning lies in the definition of the update target: should one estimate Monte Carlo returns or bootstrap from an existing Q-function?

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Junhua Mao, Jiajing Xu, Kevin Jing, Alan L. Yuille

Neural Information Processing SystemsApr-22-2026, 01:53:07 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > California (0.28)
Europe (0.28)

Industry: Information Technology > Services (0.49)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Add feedback

Learning to Poke by Poking: Experiential Learning of Intuitive Physics

Pulkit Agrawal, Ashvin V. Nair, Pieter Abbeel, Jitendra Malik, Sergey Levine

Neural Information Processing SystemsApr-22-2026, 01:52:32 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, robot, (19 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

SpaceX secures option to buy AI startup Cursor for 60bn or partner for 10bn

The GuardianApr-22-2026, 00:13:51 GMT

Elon Musk speaks at the SpaceX Hyperloop Pod Competition II in Hawthorne, California, in 2017. Elon Musk speaks at the SpaceX Hyperloop Pod Competition II in Hawthorne, California, in 2017. Cursor is a Silicon Valley startup using AI to automate coding as Elon Musk's firm seeks foothold in the AI market SpaceX said it has secured an option to either acquire code-generation startup Cursor for $60bn later this year, or pay $10bn for their new partnership, as it pushes deeper into the lucrative market for AI developer tools. Along with OpenAI and Anthropic, Cursor is one of several Silicon Valley startups that has drawn waves of developers by using artificial intelligence to automate coding, a business where AI companies have found early commercial traction. The deal could give xAI, the Grok chatbot maker that SpaceX merged with in February, a stronger foothold in the AI coding market where it has so far lagged rivals.

artificial intelligence, chatbot, natural language, (11 more...)

The Guardian

Country:

North America > United States > California > Los Angeles County > Hawthorne (0.47)
Europe > Ukraine (0.08)
Oceania > Australia (0.05)

Industry:

Transportation > Passenger (0.99)
Transportation > Ground > Rail (0.99)
Leisure & Entertainment > Sports (0.75)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.56)

Add feedback