AITopics

Country: Europe > Germany (0.28)

Genre: Research Report (0.46)

Industry:

Education (0.67)
Energy > Oil & Gas (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing Systems, Aug-21-2025, 05:49:18 GMT

Model-Based Transfer Learning for Contextual Reinforcement Learning

Deep reinforcement learning (RL) is a powerful approach to complex decision making. However, one issue that limits its practical application is its brittleness: training sometimes fails in the presence of small changes in the environment.

machine learning, reinforcement learning, variation, (15 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.28)
Europe > Netherlands (0.14)
Asia > Taiwan (0.14)
Asia > Middle East > Republic of Türkiye (0.14)

Genre: Research Report > Experimental Study (0.93)

Industry:

Education (1.00)
Transportation > Ground > Road (0.69)
Transportation > Infrastructure & Services (0.48)
Energy > Oil & Gas (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Neural Information Processing Systems, Aug-21-2025, 04:57:33 GMT

c529dba08a146ea8d6cf715ae8930cbe-Paper-Conference.pdf

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Country: Europe > Germany (0.46)

Genre: Research Report (0.46)

Industry:

Energy > Oil & Gas (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Neural Information Processing Systems, Aug-21-2025, 02:46:48 GMT

ba74855789913e5ed36f87288af79e5b-Supplemental-Datasets_and_Benchmarks.pdf

distribution shift, machine learning, reinforcement learning, (18 more...)

Country: North America > United States > California (0.46)

Industry:

Transportation > Ground > Road (1.00)
Transportation > Electric Vehicle (1.00)
Energy > Power Industry (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Natural Language (0.93)

Neural Information Processing Systems, Aug-21-2025, 02:46:44 GMT

SustainGym: Reinforcement Learning Environments for Sustainable Energy Systems. Christopher Yeh, Victor Li

Developing better RL algorithms to address these challenges requires a means of empirically benchmarking and comparing the performance of different algorithms in real-world settings.

distribution shift, machine learning, reinforcement learning, (18 more...)

Country: North America > United States > California (0.29)

Industry:

Transportation > Ground > Road (1.00)
Energy > Renewable (1.00)
Energy > Power Industry (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Schütz, Valter, Wu, Han, Rezvan, Reza, Aronsson, Linus, Chehreghani, Morteza Haghir

AFABench: A Generic Framework for Benchmarking Active Feature Acquisition

In many real-world scenarios, acquiring all features of a data instance can be expensive or impractical due to monetary cost, latency, or privacy concerns. Active Feature Acquisition (AFA) addresses this challenge by dynamically selecting a subset of informative features for each data instance, trading predictive performance against acquisition cost. While numerous methods have been proposed for AFA, ranging from greedy information-theoretic strategies to non-myopic reinforcement learning approaches, fair and systematic evaluation of these methods has been hindered by the lack of standardized benchmarks. In this paper, we introduce AFABench, the first benchmark framework for AFA. Our benchmark includes a diverse set of synthetic and real-world datasets, supports a wide range of acquisition policies, and provides a modular design that enables easy integration of new methods and tasks. We implement and evaluate representative algorithms from all major categories, including static, greedy, and reinforcement learning-based approaches. To test the lookahead capabilities of AFA policies, we introduce a novel synthetic dataset, AFAContext, designed to expose the limitations of greedy selection. Our results highlight key trade-offs between different AFA strategies and provide actionable insights for future research. The benchmark code is available at: https://github.com/Linusaronsson/AFA-Benchmark.
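The greedy family of acquisition policies mentioned in the abstract can be illustrated with a toy sketch. This is not AFABench code and not any method from the paper: the function `greedy_acquire`, the `gain_fn` callback, the cost values, and the budget are all hypothetical, and the gain function stands in for whatever information-theoretic score a real method would estimate. At each step the policy picks the affordable feature with the best estimated gain per unit cost, given what has already been acquired.

```python
def greedy_acquire(gain_fn, costs, budget):
    """Myopic (greedy) feature acquisition under a cost budget.

    gain_fn(acquired, j) -> estimated gain of acquiring feature j given the
    list of already acquired feature indices (hypothetical scoring function).
    costs: list of strictly positive acquisition costs, one per feature.
    """
    acquired, spent = [], 0.0
    remaining = set(range(len(costs)))
    while remaining:
        # Only consider features that still fit in the budget.
        affordable = [j for j in sorted(remaining) if spent + costs[j] <= budget]
        if not affordable:
            break
        # Greedy step: best estimated gain per unit cost, one step ahead only.
        best = max(affordable, key=lambda j: gain_fn(acquired, j) / costs[j])
        if gain_fn(acquired, best) <= 0:
            break  # nothing left that is expected to help
        acquired.append(best)
        spent += costs[best]
        remaining.remove(best)
    return acquired, spent


# Toy gain model (an assumption for illustration): feature 1 is largely
# redundant once feature 0 has been acquired.
BASE_GAIN = [0.5, 0.4, 0.1]

def toy_gain(acquired, j):
    if j == 1 and 0 in acquired:
        return BASE_GAIN[1] * 0.25
    return BASE_GAIN[j]

selected, total_cost = greedy_acquire(toy_gain, costs=[1.0, 1.0, 0.5], budget=2.0)
print(selected, total_cost)
```

Because the policy scores only the next acquisition, it cannot plan for a feature that is useless alone but informative in combination, which is the kind of limitation a lookahead-oriented dataset is meant to expose.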

artificial intelligence, machine learning, reinforcement learning, (16 more...)

2508.14734

Country: North America > United States (0.67)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine (1.00)
Information Technology > Security & Privacy (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Yazdannik, Saman, Tayefi, Morteza, Sanisales, Shamim

Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks

The performance of Deep Q-Networks (DQN) is critically dependent on the ability of its underlying neural network to accurately approximate the action-value function. Standard function approximators, such as multi-layer perceptrons, may struggle to efficiently represent the complex value landscapes inherent in many reinforcement learning problems. This paper introduces a novel architecture, the Chebyshev-DQN (Ch-DQN), which integrates a Chebyshev polynomial basis into the DQN framework to create a more effective feature representation. By leveraging the powerful function approximation properties of Chebyshev polynomials, we hypothesize that the Ch-DQN can learn more efficiently and achieve higher performance. We evaluate our proposed model on the CartPole-v1 benchmark and compare it against a standard DQN with a comparable number of parameters. Our results demonstrate that the Ch-DQN with a moderate polynomial degree (N=4) achieves significantly better asymptotic performance, outperforming the baseline by approximately 39%. However, we also find that the choice of polynomial degree is a critical hyperparameter, as a high degree (N=8) can be detrimental to learning. This work validates the potential of using orthogonal polynomial bases in deep reinforcement learning while also highlighting the trade-offs involved in model complexity.
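As a rough illustration of what a Chebyshev polynomial basis looks like as a feature map, the sketch below expands each state dimension into T_0 through T_N using the standard three-term recurrence. The abstract does not say how Ch-DQN wires this into the network, so the function name `chebyshev_features`, the per-dimension expansion, and the `tanh` squashing into [-1, 1] are assumptions made for this example only.

```python
import numpy as np

def chebyshev_features(x, degree):
    """Expand each input dimension into Chebyshev polynomials T_0..T_degree.

    x: array of shape (batch, dim), assumed already scaled to [-1, 1].
    Returns an array of shape (batch, dim * (degree + 1)).
    """
    x = np.asarray(x, dtype=float)
    terms = [np.ones_like(x), x]  # T_0(x) = 1, T_1(x) = x
    for _ in range(2, degree + 1):
        # Three-term recurrence: T_{n+1}(x) = 2 x T_n(x) - T_{n-1}(x)
        terms.append(2.0 * x * terms[-1] - terms[-2])
    stacked = np.stack(terms[: degree + 1], axis=-1)  # (batch, dim, degree + 1)
    return stacked.reshape(x.shape[0], -1)

# Hypothetical usage: a 4-dimensional CartPole-like state, squashed into
# [-1, 1] with tanh (an assumption, not taken from the paper).
state = np.tanh(np.array([[0.1, -0.5, 0.02, 1.3]]))
features = chebyshev_features(state, degree=4)
print(features.shape)  # (1, 20)
```

With degree N the feature width grows as dim * (N + 1), and higher-order terms oscillate more rapidly, so the degree directly controls model complexity.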

ch-dqn, machine learning, reinforcement learning, (16 more...)

2508.14536

Country: Asia > Middle East > Iran (0.15)

Genre: Research Report > New Finding (0.87)

Industry: Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Tholl, Konur, Mezouar, Mariam El, Mallah, Ranwa Al

A Comparative Evaluation of Teacher-Guided Reinforcement Learning Techniques for Autonomous Cyber Operations

Autonomous Cyber Operations (ACO) rely on Reinforcement Learning (RL) to train agents to make effective decisions in the cybersecurity domain. However, existing ACO applications require agents to learn from scratch, leading to slow convergence and poor early-stage performance. While teacher-guided techniques have demonstrated promise in other domains, they have not yet been applied to ACO. In this study, we implement four distinct teacher-guided techniques in the simulated CybORG environment and conduct a comparative evaluation. Our results demonstrate that teacher integration can significantly improve training efficiency in terms of early policy performance and convergence speed, highlighting its potential benefits for autonomous cybersecurity.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

2508.1434

Country: North America > Canada > Ontario > Kingston (0.29)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (0.56)
Government > Military > Cyberwarfare (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Li, Xin

Beyond Turing: Memory-Amortized Inference as a Foundation for Cognitive Computation

Intelligence is fundamentally non-ergodic: it emerges not from uniform sampling or optimization from scratch, but from the structured reuse of prior inference trajectories. We introduce Memory-Amortized Inference (MAI) as a formal framework in which cognition is modeled as inference over latent cycles in memory, rather than recomputation through gradient descent. MAI systems encode inductive biases via structural reuse, minimizing entropy and enabling context-aware, structure-preserving inference. This approach reframes cognitive systems not as ergodic samplers, but as navigators over constrained latent manifolds, guided by persistent topological memory. Through the lens of delta-homology, we show that MAI provides a principled foundation for Mountcastle's Universal Cortical Algorithm, modeling each cortical column as a local inference operator over cycle-consistent memory states. Furthermore, we establish a time-reversal duality between MAI and reinforcement learning: whereas RL propagates value forward from reward, MAI reconstructs latent causes backward from memory. This inversion paves a path toward energy-efficient inference and addresses the computational bottlenecks facing modern AI. MAI thus offers a unified, biologically grounded theory of intelligence based on structure, reuse, and memory. We also briefly discuss the profound implications of MAI for achieving artificial general intelligence (AGI).

machine learning, natural language, reinforcement learning, (18 more...)