AITopics | minto

Collaborating Authors

minto

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning

Hendawy, Ahmed, Metternich, Henrik, Vincent, Théo, Kallel, Mahdi, Peters, Jan, D'Eramo, Carlo

arXiv.org Artificial IntelligenceOct-6-2025

The use of target networks is a popular approach for estimating value functions in deep Reinforcement Learning (RL). While effective, the target network remains a compromise solution that preserves stability at the cost of slowly moving targets, thus delaying learning. Conversely, using the online network as a bootstrapped target is intuitively appealing, albeit well-known to lead to unstable learning. In this work, we aim to obtain the best out of both worlds by introducing a novel update rule that computes the target using the MINimum estimate between the Target and Online network, giving rise to our method, MINTO. Through this simple, yet effective modification, we show that MINTO enables faster and stable value function learning, by mitigating the potential overestimation bias of using the online network for bootstrapping. Notably, MINTO can be seamlessly integrated into a wide range of value-based and actor-critic algorithms with a negligible cost. We evaluate MINTO extensively across diverse benchmarks, spanning online and of-fline RL, as well as discrete and continuous action spaces. Across all benchmarks, MINTO consistently improves performance, demonstrating its broad applicability and effectiveness. Reinforcement Learning (RL) has demonstrated exceptional performance and achieved major breakthroughs across a diverse spectrum of decision-making challenges. Noteworthy applications include learning complex locomotion skills (Haarnoja et al., 2018b; Rudin et al., 2022) and enabling sophisticated, real-world capabilities such as robotic manipulation (Andrychowicz et al., 2020; Lu et al., 2025). The foundation of this success lies primarily in Deep RL, initiated by the introduction of the Deep Q-Network (DQN) (Mnih et al., 2013), which marked the first successful application of deep neural networks in RL. To make that happen, Mnih et al. (2013) introduce various techniques to mitigate mainly the deadly triad issue (V an Hasselt et al., 2018) due to the usage of function approximators, off-policy data, and target bootstrapping.

machine learning, minto, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2510.0259

Country: Europe > Germany (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Breaking the silence after 16 years

BBC NewsMar-20-2016, 12:02:26 GMT

Voiceless in his life so far, a severely disabled 16-year-old is marvelling at being able to speak for the first time after breaking his silence with the words "Hello Mum", using a digital communication aid. James Walker is a rugby fan, likes pop music, lives with his family in Hull and has a girlfriend - Emily. He has a condition which caused hundreds of daily seizures when he was a child. Known as Lennox-Gastaut Syndrome, it left him with a severe learning disability and without the ability to walk or move. He says it's "funny" after being silent for so long that he can now communicate with friends and family and, as he puts it, "learn something exciting".

artificial intelligence, cognitive ability, seizure, (10 more...)

BBC News

Industry:

Education (0.50)
Leisure & Entertainment (0.35)
Health & Medicine > Therapeutic Area > Neurology (0.32)

Technology: Information Technology > Artificial Intelligence (0.33)

Add feedback