A Unified Noise-Curvature View of Loss of Trainability

Baveja, Gunbir Singh, Lewandowski, Alex, Schmidt, Mark

arXiv.org Artificial Intelligence

Loss of trainability refers to a phenomenon in continual learning where parameter updates no longer make progress on the optimization objective, so accuracy stalls or degrades as the learning problem changes over time. In this paper, we analyze loss of trainability through an optimization lens and find that the phenomenon is not reliably predicted by existing individual indicators such as Hessian rank, sharpness level, weight or gradient norms, gradient-to-parameter ratios, and unit-sign entropy. Motivated by our analysis, we introduce two complementary indicators: a batch-size-aware gradient-noise bound and a curvature-volatility-controlled bound. We then combine these two indicators into a per-layer adaptive noise threshold on the effective step-size that anticipates trainability behavior. Using this insight, we propose a step-size scheduler that keeps each layer's effective parameter update below this bound, thereby avoiding loss of trainability. We demonstrate that our scheduler can improve the accuracy maintained by previously proposed approaches, such as concatenated ReLU (CReLU), the Wasserstein regularizer, and L2 weight decay. Surprisingly, our scheduler produces adaptive step-size trajectories that, without tuning, mirror manually engineered step-size decay schedules.
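The abstract does not give the scheduler's exact formulas, but the idea of capping each layer's step-size by the smaller of a batch-size-aware gradient-noise bound and a curvature-volatility bound can be sketched as follows. All function names, the combination rule (taking the minimum of the two bounds), and the constant `c` are illustrative assumptions, not the paper's actual method:

```python
import numpy as np

def clipped_layer_lr(base_lr, grad, noise_var, curvature_vol, batch_size, c=1.0):
    """Hypothetical per-layer step-size cap (a sketch, not the paper's scheduler).

    Combines two heuristic bounds:
      - a gradient-noise bound that shrinks with noisier gradients and grows
        with batch size (noise averages out over the batch), and
      - a curvature-volatility bound that shrinks when curvature fluctuates.
    """
    g_norm = np.linalg.norm(grad)
    # Batch-size-aware noise bound: relative gradient noise per update.
    noise_bound = c * np.sqrt(noise_var / batch_size) / (g_norm + 1e-12)
    # Curvature-volatility bound: volatile curvature forces smaller steps.
    curv_bound = c / (curvature_vol + 1e-12)
    # The effective step-size never exceeds either bound.
    return min(base_lr, noise_bound, curv_bound)
```

Applied per layer at each update, such a cap would shrink the step-size automatically as gradient noise or curvature volatility grows, which is consistent with the decay-like trajectories the authors report.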


Golden retrievers and humans share 'striking' genetic similarities

Popular Science

The same genes influence intelligence, anxiety, and depression in both species. You're likely not reading too much into your dog's mood: according to researchers at the University of Cambridge, certain genes influencing golden retriever behavior are also traceable to human traits including intelligence, depression, and anxiety. "The findings are really striking," Eleanor Raffan, a neuroscience researcher and coauthor of the study, said in a statement. "They provide strong evidence that humans and golden retrievers have shared genetic roots for their behavior."


Dissecting Quantum Reinforcement Learning: A Systematic Evaluation of Key Components

Lazaro, Javier, Vazquez, Juan-Ignacio, Garcia-Bringas, Pablo

arXiv.org Artificial Intelligence

Parameterised quantum circuit (PQC) based Quantum Reinforcement Learning (QRL) has emerged as a promising paradigm at the intersection of quantum computing and reinforcement learning (RL). By design, PQCs create hybrid quantum-classical models, but their practical applicability remains uncertain due to training instabilities, barren plateaus (BPs), and the difficulty of isolating the contribution of individual pipeline components. In this work, we dissect PQC based QRL architectures through a systematic experimental evaluation of three aspects recurrently identified as critical: (i) data embedding strategies, with Data Reuploading (DR) as an advanced approach; (ii) ansatz design, particularly the role of entanglement; and (iii) post-processing blocks after quantum measurement, with a focus on the underexplored Output Reuse (OR) technique. Using a unified PPO-CartPole framework, we perform controlled comparisons between hybrid and classical agents under identical conditions. Our results show that OR, though purely classical, exhibits distinct behaviour in hybrid pipelines, that DR improves trainability and stability, and that stronger entanglement can degrade optimisation, offsetting classical gains. Together, these findings provide controlled empirical evidence of the interplay between quantum and classical contributions, and establish a reproducible framework for systematic benchmarking and component-wise analysis in QRL.
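The Data Reuploading (DR) strategy mentioned in (i) can be illustrated with a minimal single-qubit statevector simulation: the input is re-encoded between trainable rotation layers instead of being embedded once. This is a toy sketch in plain NumPy under assumed conventions (RY rotations, measurement in the computational basis), not the authors' PQC pipeline:

```python
import numpy as np

def ry(theta):
    """Single-qubit RY rotation matrix."""
    c, s = np.cos(theta / 2), np.sin(theta / 2)
    return np.array([[c, -s], [s, c]])

def reuploading_circuit(x, weights):
    """Alternate a re-encoding of input x with a trainable rotation per layer.

    Returns the probability of measuring |0> on the final state.
    """
    state = np.array([1.0, 0.0])  # start in |0>
    for w in weights:
        state = ry(w) @ ry(x) @ state  # re-upload x, then apply trainable gate
    return abs(state[0]) ** 2
```

With all weights at zero the circuit reduces to repeated encodings of `x` alone; training adjusts the interleaved rotations, which is what gives re-uploading circuits their increased expressivity relative to a single embedding layer.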

When Expressivity Meets Trainability: Fewer than n Neurons Can Work

Neural Information Processing Systems

Modern neural networks are often quite wide, causing large memory and computation costs. It is thus of great interest to train a narrower network. However, training narrow neural nets remains a challenging task. We ask two theoretical questions: Can narrow networks have as strong expressivity as wide ones?