AITopics | Industry

Collaborating Authors

Industry

Task-based End-to-end Model Learning in Stochastic Optimization

Neural Information Processing SystemsMar-17-2026, 14:03:12 GMT

With the increasing popularity of machine learning techniques, it has become common to see prediction algorithms operating within some larger process. However, the criteria by which we train these algorithms often differ from the ultimate criteria on which we evaluate them. This paper proposes an end-to-end approach for learning probabilistic machine learning models in a manner that directly captures the ultimate task-based objective for which they will be used, within the context of stochastic programming. We present three experimental evaluations of the proposed approach: a classical inventory stock problem, a real-world electrical grid scheduling task, and a real-world energy storage arbitrage task. We show that the proposed approach can outperform both traditional modeling and purely black-box policy optimization approaches in these applications.

artificial intelligence, machine learning, proceedings, (3 more...)

Neural Information Processing Systems

Industry: Energy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

ELF: An Extensive, Lightweight and Flexible Research Platform for Real-time Strategy Games

Neural Information Processing SystemsMar-17-2026, 14:03:09 GMT

In this paper, we propose ELF, an Extensive, Lightweight and Flexible platform for fundamental reinforcement learning research. Using ELF, we implement a highly customizable real-time strategy (RTS) engine with three game environments (Mini-RTS, Capture the Flag and Tower Defense). Mini-RTS, as a miniature version of StarCraft, captures key game dynamics and runs at 165K frame-per-second (FPS) on a laptop. When coupled with modern reinforcement learning methods, the system can train a full-game bot against built-in AIs end-to-end in one day with 6 CPUs and 1 GPU. In addition, our platform is flexible in terms of environment-agent communication topologies, choices of RL methods, changes in game parameters, and can host existing C/C++-based game environments like ALE. Using ELF, we thoroughly explore training parameters and show that a network with Leaky ReLU and Batch Normalization coupled with long-horizon training and progressive curriculum beats the rule-based built-in AI more than 70% of the time in the full game of Mini-RTS. Strong performance is also achieved on the other two games. In game replays, we show our agents learn interesting strategies.

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.82)

Add feedback

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

Neural Information Processing SystemsMar-17-2026, 13:31:36 GMT

In this work, we propose to apply trust region optimization to deep reinforcement learning using a recently proposed Kronecker-factored approximation to the curvature. We extend the framework of natural policy gradient and propose to optimize both the actor and the critic using Kronecker-factored approximate curvature (K-FAC) with trust region; hence we call our method Actor Critic using Kronecker-Factored Trust Region (ACKTR). To the best of our knowledge, this is the first scalable trust region natural gradient method for actor-critic methods. It is also the method that learns non-trivial tasks in continuous control as well as discrete control policies directly from raw pixel inputs. We tested our approach across discrete domains in Atari games as well as continuous domains in the MuJoCo environment. With the proposed methods, we are able to achieve higher rewards and a 2-to 3-fold improvement in sample efficiency on average, compared to previous state-of-the-art on-policy actor-critic methods.

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.67)

Add feedback

A multi-agent reinforcement learning model of common-pool resource appropriation

Neural Information Processing SystemsMar-17-2026, 13:04:03 GMT

Humanity faces numerous problems of common-pool resource appropriation. This class of multi-agent social dilemma includes the problems of ensuring sustainable use of fresh water, common fisheries, grazing pastures, and irrigation systems. Abstract models of common-pool resource appropriation based on non-cooperative game theory predict that self-interested agents will generally fail to find socially positive equilibria---a phenomenon called the tragedy of the commons. However, in reality, human societies are sometimes able to discover and implement stable cooperative solutions. Decades of behavioral game theory research have sought to uncover aspects of human behavior that make this possible.

artificial intelligence, machine learning, reinforcement learning, (8 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.39)

Add feedback

Online Learning of Optimal Bidding Strategy in Repeated Multi-Commodity Auctions

Neural Information Processing SystemsMar-17-2026, 13:03:03 GMT

We study the online learning problem of a bidder who participates in repeated auctions. With the goal of maximizing his T-period payoff, the bidder determines the optimal allocation of his budget among his bids for $K$ goods at each period. As a bidding strategy, we propose a polynomial-time algorithm, inspired by the dynamic programming approach to the knapsack problem. The proposed algorithm, referred to as dynamic programming on discrete set (DPDS), achieves a regret order of $O(\sqrt{T\log{T}})$. By showing that the regret is lower bounded by $\Omega(\sqrt{T})$ for any strategy, we conclude that DPDS is order optimal up to a $\sqrt{\log{T}}$ term. We evaluate the performance of DPDS empirically in the context of virtual trading in wholesale electricity markets by using historical data from the New York market. Empirical results show that DPDS consistently outperforms benchmark heuristic methods that are derived from machine learning and online learning approaches.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Country: North America > United States > New York (0.28)

Industry: Education (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Variational Laws of Visual Attention for Dynamic Scenes

Neural Information Processing SystemsMar-17-2026, 13:01:48 GMT

Computational models of visual attention are at the crossroad of disciplines like cognitive science, computational neuroscience, and computer vision. This paper proposes a model of attentional scanpath that is based on the principle that there are foundational laws that drive the emergence of visual attention. We devise variational laws of the eye-movement that rely on a generalized view of the Least Action Principle in physics.

artificial intelligence, neural information processing system 30, neurips proceedings variational law, (6 more...)

Neural Information Processing Systems

Industry: Health & Medicine > Therapeutic Area > Neurology (0.61)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (0.99)

Add feedback

Aqara's Matter-compatible camera promises easier smart home integration

EngadgetMar-17-2026, 12:45:00 GMT

Aqara's Matter-compatible camera promises easier smart home integration The company says it's the first Matter-certified camera. Smart home company Aqara has launched what it says is the first camera certified for Matter, the open source standard that enables interoperability across brands, like Google and Amazon. The Aqara G350 is an indoor security cam that also functions as a Zigbee and Matter hub in the Aqara Home app, which means the camera will enable you to control various devices across smart home protocols from different brands within one location. The camera itself comes with a 4K wide-angle and a 2.5K telephoto lens, providing both panoramic and closeup views. It also has 9x hybrid zoom and a pan-tilt mechanism that can give you 360-degree coverage of the room it's in.

artificial intelligence, engadget, internet of things, (13 more...)

Engadget

Industry: Information Technology > Smart Houses & Appliances (1.00)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Hybrid Reward Architecture for Reinforcement Learning

Neural Information Processing SystemsMar-17-2026, 12:30:49 GMT

One of the main challenges in reinforcement learning (RL) is generalisation. In typical deep RL methods this is achieved by approximating the optimal value function with a low-dimensional representation using a deep network. While this approach works well in many domains, in domains where the optimal value function cannot easily be reduced to a low-dimensional representation, learning can be very slow and unstable. This paper contributes towards tackling such challenging domains, by proposing a new method, called Hybrid Reward Architecture (HRA). HRA takes as input a decomposed reward function and learns a separate value function for each component reward function. Because each component typically only depends on a subset of all features, the corresponding value function can be approximated more easily by a low-dimensional representation, enabling more effective learning. We demonstrate HRA on a toy-problem and the Atari game Ms. Pac-Man, where HRA achieves above-human performance.

artificial intelligence, machine learning, reinforcement learning, (7 more...)

Neural Information Processing Systems

Industry: Leisure & Entertainment > Games > Computer Games (0.98)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Add feedback

Continual Learning with Deep Generative Replay

Neural Information Processing SystemsMar-17-2026, 12:30:17 GMT

Attempts to train a comprehensive artificial intelligence capable of solving multiple tasks have been impeded by a chronic problem called catastrophic forgetting. Although simply replaying all previous data alleviates the problem, it requires large memory and even worse, often infeasible in real world applications where the access to past data is limited. Inspired by the generative nature of the hippocampus as a short-term memory system in primate brain, we propose the Deep Generative Replay, a novel framework with a cooperative dual model architecture consisting of a deep generative model ("generator") and a task solving model ("solver"). With only these two models, training data for previous tasks can easily be sampled and interleaved with those for a new task. We test our methods in several sequential learning settings involving image classification tasks.

artificial intelligence, machine learning, proceedings, (2 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.62)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

WATCH: Wall-climbing robot swarms crawl US Navy warships as China's fleet surges

FOX NewsMar-17-2026, 12:14:24 GMT

Navy robots from Gecko Robotics will inspect U.S. warships in $71 million effort to reduce maintenance delays as only 60% of fleet remains operational amid China's naval expansion.

artificial intelligence, lifestyle real estate tech science, navy warship, (8 more...)

FOX News

Country:

Asia > China (0.65)
Asia > Middle East > Iran (0.48)
North America > United States > Illinois (0.05)
(7 more...)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Government > Military > Navy (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (0.42)

Add feedback