To Learn or Not to Learn, That is the Question -- A Feature-Task Dual Learning Model of Perceptual Learning

Neural Information Processing Systems

Perceptual learning refers to the process through which practice improves participants' performance in perceiving sensory stimuli. Two seemingly conflicting phenomena, specificity and transfer, have been widely observed in perceptual learning. Here, we propose a dual-learning model to reconcile these two phenomena. The model consists of two learning processes. One is task-based learning, which is fast and enables the brain to adapt to a task rapidly by using existing feature representations.


RoboScape-R: Unified Reward-Observation World Models for Generalizable Robotics Training via RL

Tang, Yinzhou, Shang, Yu, Chen, Yinuo, Wei, Bingwen, Zhang, Xin, Yu, Shu'ang, Shi, Liangzhi, Yu, Chao, Gao, Chen, Wu, Wei, Li, Yong

arXiv.org Artificial Intelligence

Achieving generalizable embodied policies remains a key challenge. Traditional policy learning paradigms, including both Imitation Learning (IL) and Reinforcement Learning (RL), struggle to cultivate generalizability across diverse scenarios: IL policies often overfit to specific expert trajectories, while RL suffers from the inherent lack of a unified, general reward signal necessary for effective multi-scene generalization. We posit that the world model is uniquely capable of serving as a universal environment proxy to address this limitation. However, current world models primarily focus on predicting observations and still rely on task-specific, handcrafted reward functions, thereby failing to provide a truly general training environment. To address this problem, we propose RoboScape-R, a framework leveraging the world model as a versatile, general-purpose proxy for the embodied environment within the RL paradigm. We introduce a novel world model-based general reward mechanism that generates "endogenous" rewards derived from the model's intrinsic understanding of real-world state transition dynamics. Extensive experiments demonstrate that RoboScape-R effectively addresses the limitations of traditional RL methods by providing an efficient and general training environment that substantially enhances the generalization capability of embodied policies. Our approach offers critical insights into using the world model as an online training strategy and achieves an average 37.5% performance improvement over baselines in out-of-domain scenarios.
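The abstract's "endogenous" reward, derived from the world model's grasp of state-transition dynamics, can be sketched as a reward proportional to how predictable a transition is under the model. This is a minimal illustration, not the paper's actual mechanism; the function name and vector inputs are hypothetical.

```python
import math

def endogenous_reward(predicted_next, actual_next):
    """Hypothetical endogenous reward: the negative prediction error of a
    world model on a state transition, so transitions the model judges
    plausible score higher. Illustrative only."""
    err = math.sqrt(sum((p - a) ** 2 for p, a in zip(predicted_next, actual_next)))
    return -err

# A transition the world model predicts well earns a higher reward
good = endogenous_reward([0.1, 0.2], [0.1, 0.2])
bad = endogenous_reward([0.1, 0.2], [0.9, -0.5])
```

Any policy trained against such a signal is rewarded for producing transitions the world model considers consistent with real-world dynamics, which is what makes the reward task-agnostic.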


Sensing and Understanding the World over Air: A Large Multimodal Model for Mobile Networks

Duan, Zhuoran, Wei, Yuhao, Nan, Guoshun, Wang, Zijun, Yan, Yan, Xiong, Lihua, Ran, Yuhan, Zhang, Ji, Li, Jian, Cui, Qimei, Tao, Xiaofeng, Quek, Tony Q. S.

arXiv.org Artificial Intelligence

Large models (LMs), such as ChatGPT, have made a significant impact across diverse domains and hold great potential to facilitate the evolution of network intelligence. Wireless-native multi-modal large models (WMLMs) can sense and understand the physical world through multi-modal data, serving as a key enabler that integrates communication, sensing, and intelligence, and can thus boost various smart services for billions of users. However, research on WMLMs remains in its infancy, and the construction of domain-specific multi-modal large models for wireless networks is still underexplored. In this paper, we outline the key characteristics of WMLMs and summarize existing methods, on the basis of which a wireless-native multimodal training paradigm is proposed. Specifically, we construct a GPT-style WMLM and train it on a real-world large-scale dataset, leveraging wireless signals as an anchor modality for contrastive learning. Our approach demonstrates outstanding performance compared with existing small-scale models and large multi-modal models, validating the feasibility of using wireless signals as a universal modality and highlighting WMLMs' potential to emerge as a new paradigm for future wireless networks. The advent of large AI models (LMs) such as ChatGPT has propelled network intelligence into a new evolutionary phase. These remarkable enablers are poised to revolutionize future wireless networks through their advanced performance and generalization capability.
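Using one modality as a contrastive "anchor" typically means pairing its embeddings with those of every other modality and training with an InfoNCE-style objective, where matching pairs in a batch are positives and all other pairs are negatives. The sketch below shows that objective in miniature; the function name, toy embeddings, and temperature value are assumptions for illustration, not the paper's implementation.

```python
import math

def info_nce(anchor, other, temperature=0.1):
    """InfoNCE-style loss between an anchor-modality batch (e.g.
    wireless-signal embeddings) and another modality's embeddings.
    Row i of each batch is a matching pair; other rows are negatives."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))
    def normalize(u):
        n = math.sqrt(dot(u, u))
        return [a / n for a in u]
    a = [normalize(x) for x in anchor]
    b = [normalize(x) for x in other]
    loss = 0.0
    for i in range(len(a)):
        logits = [dot(a[i], b[j]) / temperature for j in range(len(b))]
        m = max(logits)
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        loss += log_z - logits[i]  # cross-entropy with target = index i
    return loss / len(a)

# Correctly paired batches should score a lower loss than mismatched ones
aligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
mismatched = info_nce([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

Anchoring every modality to wireless signals this way pulls all modalities into a shared embedding space without needing pairwise data between every modality combination.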



Guided Graph Compression for Quantum Graph Neural Networks

Casals, Mikel, Belis, Vasilis, Combarro, Elias F., Alarcón, Eduard, Vallecorsa, Sofia, Grossi, Michele

arXiv.org Artificial Intelligence

Graph Neural Networks (GNNs) are effective for processing graph-structured data but face challenges with large graphs due to high memory requirements and inefficient sparse matrix operations on GPUs. Quantum Computing (QC) offers a promising avenue to address these issues and inspires new algorithmic approaches. In particular, Quantum Graph Neural Networks (QGNNs) have been explored in recent literature. However, current quantum hardware limits the dimension of the data that can be effectively encoded. Existing approaches either simplify datasets manually or use artificial graph datasets. This work introduces the Guided Graph Compression (GGC) framework, which uses a graph autoencoder to reduce both the number of nodes and the dimensionality of node features. The compression is guided to enhance the performance of a downstream classification task, which can be performed with either a quantum or a classical classifier. The framework is evaluated on the Jet Tagging task, a classification problem of fundamental importance in high energy physics that involves distinguishing particle jets initiated by quarks from those initiated by gluons. GGC is compared against using the autoencoder as a standalone preprocessing step and against a baseline classical GNN classifier. Our numerical results demonstrate that GGC outperforms both alternatives, while also facilitating the testing of novel QGNN ansatzes on realistic datasets.
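The compression target, fewer nodes and lower-dimensional features, can be illustrated with a toy pooling step. This is only a stand-in for the learned graph autoencoder: the clustering and truncation below are hypothetical placeholders for what GGC's encoder would learn under classification guidance.

```python
def compress_graph(node_feats, clusters, out_dim):
    """Toy stand-in for a graph autoencoder's compression step: merge the
    nodes in each cluster by averaging their features, then keep only the
    first out_dim feature dimensions. A learned, guided encoder would
    replace both hand-coded steps."""
    pooled = []
    for cluster in clusters:
        members = [node_feats[i] for i in cluster]
        mean = [sum(col) / len(members) for col in zip(*members)]
        pooled.append(mean[:out_dim])
    return pooled

# 4 nodes with 3-d features -> 2 nodes with 2-d features,
# small enough to encode on near-term quantum hardware
feats = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0], [0.0, 0.0, 0.0], [2.0, 2.0, 2.0]]
small = compress_graph(feats, clusters=[[0, 1], [2, 3]], out_dim=2)
# small == [[2.0, 2.0], [1.0, 1.0]]
```

The "guidance" in GGC amounts to training such a compressor jointly with the downstream classifier, so the retained structure is the part most useful for telling quark jets from gluon jets.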


Schrödinger Bridge Mamba for One-Step Speech Enhancement

Yang, Jing, Wang, Sirui, Wu, Chao, Fan, Fan

arXiv.org Artificial Intelligence

We propose Schrödinger Bridge Mamba (SBM), a new training-inference framework motivated by the inherent compatibility between the Schrödinger Bridge (SB) training paradigm and the selective state-space model Mamba. Experiments on a joint denoising and dereverberation task using four benchmark datasets demonstrate that SBM, with only 1-step inference, outperforms strong baselines with 1-step or iterative inference and achieves the best real-time factor (RTF). Beyond speech enhancement, we discuss the integration of the SB paradigm and the selective state-space model architecture based on their underlying alignment, which indicates a promising direction for exploring new deep generative models potentially applicable to a broad range of generative tasks. Deep generative models have been increasingly employed for speech enhancement (SE). By learning the underlying distribution of clean audio given its degraded counterpart, generative models can generate high-quality speech from low-quality inputs degraded by noise, reverberation, clipping, bandwidth limitation, or a mixture of these artifacts.
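Schrödinger Bridge training for enhancement typically supervises the model on stochastic interpolations between a clean signal and its degraded counterpart. A common simplified form is a Brownian-bridge sample whose variance vanishes at both endpoints; the coefficients below are that textbook simplification, not SBM's exact parameterization.

```python
import random

def bridge_sample(x0, x1, t, sigma=0.5):
    """Illustrative SB-style training sample: a Brownian-bridge interpolation
    between clean speech x0 and degraded speech x1 at time t in [0, 1].
    The noise scale sigma * sqrt(t * (1 - t)) is zero at both endpoints,
    so the bridge is pinned to x0 at t=0 and x1 at t=1."""
    std = sigma * (t * (1.0 - t)) ** 0.5
    return [(1.0 - t) * a + t * b + std * random.gauss(0.0, 1.0)
            for a, b in zip(x0, x1)]
```

Because the bridge prescribes the full path between the two endpoints, a model trained on it can be queried once to jump from the degraded endpoint toward the clean one, which is what makes 1-step inference plausible.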


Personalized Learning Path Planning with Goal-Driven Learner State Modeling

Lim, Joy Jia Yin, He, Ye, Yu, Jifan, Cong, Xin, Zhang-Li, Daniel, Liu, Zhiyuan, Liu, Huiqin, Hou, Lei, Li, Juanzi, Xu, Bin

arXiv.org Artificial Intelligence

Personalized Learning Path Planning (PLPP) aims to design adaptive learning paths that align with individual goals. While large language models (LLMs) show potential for personalizing learning experiences, existing approaches often lack mechanisms for goal-aligned planning. We introduce Pxplore, a novel framework for PLPP that integrates a reinforcement-based training paradigm and an LLM-driven educational architecture. We design a structured learner state model and an automated reward function that transforms abstract objectives into computable signals. We train the policy by combining supervised fine-tuning (SFT) and Group Relative Policy Optimization (GRPO), and deploy it within a real-world learning platform. Extensive experiments validate Pxplore's effectiveness in producing coherent, personalized, and goal-driven learning paths. We release our code and dataset to facilitate future research.
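GRPO's core move is to score each sampled response (here, a candidate learning path) against the other samples in its group rather than against a learned value function. A minimal sketch of that group-relative advantage, with hypothetical reward values:

```python
import math

def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantage: normalize each sample's reward by its group's
    mean and standard deviation, so no critic network is needed. The
    reward values passed in are hypothetical."""
    n = len(rewards)
    mean = sum(rewards) / n
    std = math.sqrt(sum((r - mean) ** 2 for r in rewards) / n)
    return [(r - mean) / (std + eps) for r in rewards]

# Four candidate learning paths scored by the automated reward function:
# above-average paths get positive advantage, below-average negative
adv = group_relative_advantages([0.2, 0.8, 0.5, 0.5])
```

In a pipeline like the one described, the automated reward function would supply the scalar rewards, and these advantages would weight the policy-gradient update after the SFT stage.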