AITopics | Industry

Industry

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

Neural Information Processing Systems | Mar-17-2026, 13:31:36 GMT

In this work, we propose to apply trust region optimization to deep reinforcement learning using a recently proposed Kronecker-factored approximation to the curvature. We extend the framework of natural policy gradient and propose to optimize both the actor and the critic using Kronecker-factored approximate curvature (K-FAC) with trust region; hence we call our method Actor Critic using Kronecker-Factored Trust Region (ACKTR). To the best of our knowledge, this is the first scalable trust region natural gradient method for actor-critic methods. It is also a method that learns non-trivial tasks in continuous control as well as discrete control policies directly from raw pixel inputs. We tested our approach across discrete domains in Atari games as well as continuous domains in the MuJoCo environment. With the proposed methods, we are able to achieve higher rewards and a 2- to 3-fold improvement in sample efficiency on average, compared to previous state-of-the-art on-policy actor-critic methods.

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)


A multi-agent reinforcement learning model of common-pool resource appropriation

Neural Information Processing Systems | Mar-17-2026, 13:04:03 GMT

Humanity faces numerous problems of common-pool resource appropriation. This class of multi-agent social dilemma includes the problems of ensuring sustainable use of fresh water, common fisheries, grazing pastures, and irrigation systems. Abstract models of common-pool resource appropriation based on non-cooperative game theory predict that self-interested agents will generally fail to find socially positive equilibria, a phenomenon called the tragedy of the commons. However, in reality, human societies are sometimes able to discover and implement stable cooperative solutions. Decades of behavioral game theory research have sought to uncover aspects of human behavior that make this possible.

artificial intelligence, machine learning, reinforcement learning, (8 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.39)


Online Learning of Optimal Bidding Strategy in Repeated Multi-Commodity Auctions

Neural Information Processing Systems | Mar-17-2026, 13:03:03 GMT

We study the online learning problem of a bidder who participates in repeated auctions. With the goal of maximizing his $T$-period payoff, the bidder determines the optimal allocation of his budget among his bids for $K$ goods at each period. As a bidding strategy, we propose a polynomial-time algorithm, inspired by the dynamic programming approach to the knapsack problem. The proposed algorithm, referred to as dynamic programming on discrete set (DPDS), achieves a regret order of $O(\sqrt{T\log{T}})$. By showing that the regret is lower bounded by $\Omega(\sqrt{T})$ for any strategy, we conclude that DPDS is order optimal up to a $\sqrt{\log{T}}$ term. We evaluate the performance of DPDS empirically in the context of virtual trading in wholesale electricity markets by using historical data from the New York market. Empirical results show that DPDS consistently outperforms benchmark heuristic methods that are derived from machine learning and online learning approaches.
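The abstract does not spell out DPDS itself, so the following is only a minimal sketch of the knapsack-style dynamic program it alludes to, not the authors' algorithm. It assumes a hypothetical table `payoff[k][j]` of estimated payoffs for bidding discrete level `j` on good `k`, and a hypothetical integer `cost[j]` per level (with level 0 meaning "no bid" at zero cost). How such estimates would be built from past auction outcomes is not covered here.

```python
from typing import List, Tuple


def allocate_budget(
    payoff: List[List[float]], cost: List[int], budget: int
) -> Tuple[float, List[int]]:
    """Choose one bid level per good to maximize total estimated payoff.

    payoff[k][j]: estimated payoff of bid level j on good k (assumed input).
    cost[j]: integer budget consumed by bid level j; cost[0] must be 0.
    Returns the best total payoff and the chosen level for each good.
    """
    num_goods = len(payoff)
    neg_inf = float("-inf")
    # best[k][b]: best payoff over the first k goods spending at most b.
    best = [[0.0] * (budget + 1)]
    best += [[neg_inf] * (budget + 1) for _ in range(num_goods)]
    choice = [[0] * (budget + 1) for _ in range(num_goods)]

    for k in range(num_goods):
        for b in range(budget + 1):
            for j, c in enumerate(cost):
                if c <= b:
                    value = best[k][b - c] + payoff[k][j]
                    if value > best[k + 1][b]:
                        best[k + 1][b] = value
                        choice[k][b] = j

    # Walk back through the table to recover the chosen levels.
    levels = [0] * num_goods
    remaining = budget
    for k in range(num_goods - 1, -1, -1):
        levels[k] = choice[k][remaining]
        remaining -= cost[levels[k]]
    return best[num_goods][budget], levels
```

The table has `num_goods * (budget + 1)` cells and each cell scans every bid level, so the running time is polynomial in the number of goods, the discretized budget, and the number of levels.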

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Country: North America > United States > New York (0.28)

Industry: Education (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


Variational Laws of Visual Attention for Dynamic Scenes

Neural Information Processing Systems | Mar-17-2026, 13:01:48 GMT

Computational models of visual attention are at the crossroads of disciplines like cognitive science, computational neuroscience, and computer vision. This paper proposes a model of attentional scanpath that is based on the principle that there are foundational laws that drive the emergence of visual attention. We devise variational laws of eye movement that rely on a generalized view of the Least Action Principle in physics.

artificial intelligence, neural information processing system 30, neurips proceedings variational law, (6 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.61)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.99)


Aqara's Matter-compatible camera promises easier smart home integration

Engadget | Mar-17-2026, 12:45:00 GMT

The company says it's the first Matter-certified camera. Smart home company Aqara has launched what it says is the first camera certified for Matter, the open source standard that enables interoperability across brands such as Google and Amazon. The Aqara G350 is an indoor security cam that also functions as a Zigbee and Matter hub in the Aqara Home app, which means the camera will enable you to control various devices across smart home protocols from different brands within one location. The camera itself comes with a 4K wide-angle and a 2.5K telephoto lens, providing both panoramic and closeup views. It also has 9x hybrid zoom and a pan-tilt mechanism that can give you 360-degree coverage of the room it's in.

artificial intelligence, engadget, internet of things, (13 more...)

Engadget

Industry: Information Technology > Smart Houses & Appliances (1.00)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)


Hybrid Reward Architecture for Reinforcement Learning

Neural Information Processing Systems | Mar-17-2026, 12:30:49 GMT

One of the main challenges in reinforcement learning (RL) is generalisation. In typical deep RL methods this is achieved by approximating the optimal value function with a low-dimensional representation using a deep network. While this approach works well in many domains, in domains where the optimal value function cannot easily be reduced to a low-dimensional representation, learning can be very slow and unstable. This paper contributes towards tackling such challenging domains by proposing a new method, called Hybrid Reward Architecture (HRA). HRA takes as input a decomposed reward function and learns a separate value function for each component reward function. Because each component typically only depends on a subset of all features, the corresponding value function can be approximated more easily by a low-dimensional representation, enabling more effective learning. We demonstrate HRA on a toy problem and the Atari game Ms. Pac-Man, where HRA achieves above-human performance.
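To make the idea of one value function per reward component concrete, here is a small tabular sketch. It is an illustration under stated assumptions, not the paper's deep-network architecture: the class name `DecomposedQ`, the tabular Q-learning update, and the choice to act greedily on the sum of the component values are all assumptions made for this example.

```python
from collections import defaultdict
from typing import Hashable, List, Sequence


class DecomposedQ:
    """One tabular Q-function per reward component (hypothetical helper)."""

    def __init__(self, n_components: int, n_actions: int,
                 alpha: float = 0.1, gamma: float = 0.9) -> None:
        self.n_actions = n_actions
        self.alpha = alpha
        self.gamma = gamma
        # Each component keeps its own state -> action-values table.
        self.q = [defaultdict(lambda: [0.0] * n_actions)
                  for _ in range(n_components)]

    def total(self, state: Hashable) -> List[float]:
        """Aggregate action values by summing over components (assumed rule)."""
        return [sum(q[state][a] for q in self.q)
                for a in range(self.n_actions)]

    def act(self, state: Hashable) -> int:
        """Greedy action with respect to the aggregated values."""
        values = self.total(state)
        return max(range(self.n_actions), key=values.__getitem__)

    def update(self, state: Hashable, action: int, rewards: Sequence[float],
               next_state: Hashable, done: bool) -> None:
        """Update every component from its own reward signal only."""
        for q, r in zip(self.q, rewards):
            target = r if done else r + self.gamma * max(q[next_state])
            q[state][action] += self.alpha * (target - q[state][action])
```

Each call to `update` takes a list with one reward per component, so an environment with a single scalar reward would first need a decomposition such as one component per collectible object.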

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.98)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)


Continual Learning with Deep Generative Replay

Neural Information Processing Systems | Mar-17-2026, 12:30:17 GMT

Attempts to train a comprehensive artificial intelligence capable of solving multiple tasks have been impeded by a chronic problem called catastrophic forgetting. Although simply replaying all previous data alleviates the problem, it requires large memory and, even worse, is often infeasible in real-world applications where access to past data is limited. Inspired by the generative nature of the hippocampus as a short-term memory system in the primate brain, we propose Deep Generative Replay, a novel framework with a cooperative dual-model architecture consisting of a deep generative model ("generator") and a task-solving model ("solver"). With only these two models, training data for previous tasks can easily be sampled and interleaved with those for a new task. We test our methods in several sequential learning settings involving image classification tasks.

artificial intelligence, machine learning, proceedings, (2 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)


WATCH: Wall-climbing robot swarms crawl US Navy warships as China's fleet surges

FOX News | Mar-17-2026, 12:14:24 GMT

Navy robots from Gecko Robotics will inspect U.S. warships in $71 million effort to reduce maintenance delays as only 60% of fleet remains operational amid China's naval expansion.

artificial intelligence, lifestyle real estate tech science, navy warship, (8 more...)

FOX News

Country:

Asia > China (0.65)
Asia > Middle East > Iran (0.48)
North America > United States > Illinois (0.05)
(7 more...)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military > Navy (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (0.42)


High resolution neural connectivity from incomplete tracing data using nonnegative spline regression

Neural Information Processing Systems | Mar-17-2026, 12:06:57 GMT

Whole-brain neural connectivity data are now available from viral tracing experiments, which reveal the connections between a source injection site and elsewhere in the brain.

artificial intelligence, machine learning, proceedings, (9 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)


An AI image generator for non-English speakers

AIHub | Mar-17-2026, 11:49:45 GMT

Although text-to-image generation is rapidly advancing, these AI models are mostly English-centric. Researchers at the University of Amsterdam Faculty of Science have created NeoBabel, an AI image generator that can work in six different languages. Because the researchers have made all elements of their research open source, anyone can build on the model and help push inclusive AI research. When you generate an image with AI, the results are often better when your prompt is in English. This is because many AI models are English at their core: if you use another language, your prompt is translated into English before the image is created.

generator, machine learning, natural language, (19 more...)

AIHub

Country:

Europe > Netherlands > North Holland > Amsterdam (0.27)
Asia > Singapore (0.05)

Genre: Research Report (0.35)

Industry: Government (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.90)
Information Technology > Communications > Social Media (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.52)
