adaptive loss function


Quantifying Multimodal Imbalance: A GMM-Guided Adaptive Loss for Audio-Visual Learning

Liu, Zhaocheng, Yu, Zhiwen, Liu, Xiaoqing

arXiv.org Artificial Intelligence

The heterogeneity of multimodal data leads to inconsistencies and imbalance, allowing a dominant modality to steer gradient updates. Existing solutions mainly focus on optimization- or data-based strategies but rarely exploit the information inherent in multimodal imbalance or conduct its quantitative analysis. To address this gap, we propose a novel quantitative analysis framework for multimodal imbalance and design a sample-level adaptive loss function. We define the Modality Gap as the Softmax score difference between modalities for the correct class and model its distribution using a bimodal Gaussian Mixture Model (GMM), whose components represent balanced and imbalanced samples. Using Bayes' theorem, we estimate each sample's posterior probability of belonging to these two groups. Based on this, our adaptive loss (1) minimizes the overall Modality Gap, (2) aligns imbalanced samples with balanced ones, and (3) adaptively penalizes each sample according to its degree of imbalance. A two-stage training strategy, with warm-up and adaptive phases, yields state-of-the-art performance on CREMA-D (80.65%), AVE (70.40%), and KineticSound (72.42%). Fine-tuning with high-quality samples identified by the GMM further improves results, highlighting their value for effective multimodal fusion.
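The gap-modeling step the abstract describes can be sketched in a few lines: compute the per-sample Softmax score difference between the two modalities for the correct class, fit a two-component GMM to it, and read off Bayes posteriors. This is a minimal illustration using scikit-learn; the array names and the random data are hypothetical, not taken from the paper.

```python
import numpy as np
from scipy.special import softmax
from sklearn.mixture import GaussianMixture

def modality_gap(audio_logits, visual_logits, labels):
    """Softmax score difference between modalities for the correct class."""
    idx = np.arange(len(labels))
    p_audio = softmax(audio_logits, axis=1)[idx, labels]
    p_visual = softmax(visual_logits, axis=1)[idx, labels]
    return p_audio - p_visual

# Hypothetical stand-in data: 500 samples, 10 classes.
rng = np.random.default_rng(0)
labels = rng.integers(0, 10, size=500)
audio_logits = rng.normal(size=(500, 10))
visual_logits = rng.normal(size=(500, 10))

gaps = modality_gap(audio_logits, visual_logits, labels).reshape(-1, 1)

# Bimodal GMM: one component for balanced, one for imbalanced samples.
gmm = GaussianMixture(n_components=2, random_state=0).fit(gaps)

# predict_proba applies Bayes' theorem: each row is the posterior
# probability of the sample belonging to the two groups.
posterior = gmm.predict_proba(gaps)
```

These posteriors could then weight a per-sample penalty term, in the spirit of the adaptive loss the paper proposes; the exact loss form is not reproduced here.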


FedDUAL: A Dual-Strategy with Adaptive Loss and Dynamic Aggregation for Mitigating Data Heterogeneity in Federated Learning

Sahoo, Pranab, Tripathi, Ashutosh, Saha, Sriparna, Mondal, Samrat

arXiv.org Artificial Intelligence

Federated Learning (FL) marks a transformative approach to distributed model training by combining locally optimized models from various clients into a unified global model. While FL preserves data privacy by eliminating centralized storage, it encounters significant challenges such as performance degradation, slower convergence, and reduced robustness of the global model due to the heterogeneity in client data distributions. Among the various forms of data heterogeneity, label skew emerges as a particularly formidable and prevalent issue, especially in domains such as image classification. To address these challenges, we begin with comprehensive experiments to pinpoint the underlying issues in the FL training process. Based on our findings, we then introduce an innovative dual-strategy approach designed to effectively resolve these issues. First, we introduce an adaptive loss function for client-side training, meticulously crafted to preserve previously acquired knowledge while maintaining an optimal equilibrium between local optimization and global model coherence. Secondly, we develop a dynamic aggregation strategy for aggregating client models at the server. This approach adapts to each client's unique learning patterns, effectively addressing the challenges of diverse data across the network. Our comprehensive evaluation, conducted across three diverse real-world datasets, coupled with theoretical convergence guarantees, demonstrates the superior efficacy of our method compared to several established state-of-the-art approaches.
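The server-side idea of aggregating client models with per-client coefficients, rather than uniform FedAvg weights, can be sketched as below. The scoring rule (softmax over negated client losses) is an illustrative assumption, not the paper's actual aggregation formula, and all names are hypothetical.

```python
import numpy as np

def dynamic_aggregate(client_params, client_losses):
    """Combine flattened client parameter vectors with coefficients
    derived from a per-client learning signal (here: recent local loss,
    lower loss -> larger weight). Illustrative only."""
    coeffs = np.exp(-np.asarray(client_losses, dtype=float))
    coeffs /= coeffs.sum()
    stacked = np.stack(client_params)     # shape: (n_clients, n_params)
    return coeffs @ stacked               # weighted average, (n_params,)
```

With equal client losses this reduces to plain FedAvg; as one client's loss grows, its contribution to the global model shrinks smoothly.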


OWAdapt: An adaptive loss function for deep learning using OWA operators

Maldonado, Sebastián, Vairetti, Carla, Jara, Katherine, Carrasco, Miguel, López, Julio

arXiv.org Artificial Intelligence

In this paper, we propose a fuzzy adaptive loss function for enhancing deep learning performance in classification tasks. Specifically, we redefine the cross-entropy loss to effectively address class-level noise conditions, including the challenging problem of class imbalance. Our approach introduces aggregation operators, leveraging the power of fuzzy logic to improve classification accuracy. The rationale behind our proposed method lies in the iterative up-weighting of class-level components within the loss function, focusing on those with larger errors. To achieve this, we employ the ordered weighted average (OWA) operator and combine it with an adaptive scheme for gradient-based learning. Through extensive experimentation, our method outperforms other commonly used loss functions, such as the standard cross-entropy or focal loss, across various binary and multiclass classification tasks. Furthermore, we explore the influence of hyperparameters associated with the OWA operators and present a default configuration that performs well across different experimental settings.
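The core OWA mechanism, up-weighting the class-level loss components with the largest errors, can be illustrated in a few lines. The decreasing weight vector below is a hypothetical choice, not the paper's tuned configuration.

```python
import numpy as np

def owa_weighted_loss(class_losses, owa_weights):
    """OWA aggregation: sort class-level losses in descending order and
    take a weighted sum, so classes with larger errors are up-weighted."""
    sorted_losses = np.sort(np.asarray(class_losses, dtype=float))[::-1]
    return float(np.dot(owa_weights, sorted_losses))

# Hypothetical per-class cross-entropy components.
class_losses = [0.2, 1.5, 0.7]
# Decreasing weights emphasize the worst-performing classes.
owa_weights = np.array([0.5, 0.3, 0.2])
loss = owa_weighted_loss(class_losses, owa_weights)
```

Uniform weights recover the plain mean of the class losses; the paper's adaptive scheme additionally adjusts the weights during gradient-based training, which is not shown here.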


Adaptive Hybrid Model for Enhanced Stock Market Predictions Using Improved VMD and Stacked Informer

Zhang, Jianan, Duan, Hongyi

arXiv.org Artificial Intelligence

Financial markets play a pivotal role in global economic activities, and their operations and dynamic evolutions are intricately linked to a myriad of chaotic and complex factors, including economic configurations, seasonal components, and the international milieu [1] [2]. As the economy progresses and financial markets expand continuously, time series analysis in finance has become indispensable [3]. This analytical approach has significantly advanced the understanding of market dynamics, refined intelligent decision-making processes, and bolstered developments in forecasting investment returns [4][2]. Consequently, it has garnered immense scholarly attention, leading to abundant research contributions in this domain. In stark contrast to conventional time series prediction endeavors characterizing various scientific domains--such as the temporal allocation mechanisms associated with wind energy integration [5], the granular analysis of protracted energy consumption patterns in architectural structures [6], or the intricate forecasting of load dynamics within thermal frameworks [7]--the sphere of financial time series forecasting is imbued with an elevated level of complexity and unpredictability.


A generalized forecasting solution to enable future insights of COVID-19 at sub-national level resolutions

Marikkar, Umar, Weligampola, Harshana, Perera, Rumali, Hassan, Jameel, Sritharan, Suren, Jayatilaka, Gihan, Godaliyadda, Roshan, Herath, Vijitha, Ekanayake, Parakrama, Ekanayake, Janaka, Rathnayake, Anuruddhika, Dharmaratne, Samath

arXiv.org Artificial Intelligence

COVID-19 continues to cause a significant impact on public health. To minimize this impact, policy makers undertake containment measures that, when carried out disproportionately to the actual threat as a result of erroneous threat assessment, cause undesirable long-term socio-economic complications. In addition, macro-level or national-level decision making fails to consider the localized sensitivities in small regions. Hence, the need arises for region-wise threat assessments that provide insights on the behaviour of COVID-19 through time, enabled through accurate forecasts. In this study, a forecasting solution is proposed to predict daily new cases of COVID-19 in regions small enough for containment measures to be locally implemented, by targeting three main shortcomings in the literature: the unreliability of existing data caused by inconsistent testing patterns in smaller regions, weak deployability of forecasting models towards predicting cases in previously unseen regions, and model training biases caused by the imbalanced nature of data in COVID-19 epi-curves. Hence, the contributions of this study are three-fold: an optimized smoothing technique to smoothen less deterministic epi-curves based on the epidemiological dynamics of each region, a Long Short-Term Memory (LSTM)-based forecasting model trained using data from select regions to create a representative and diverse training set that maximizes deployability in regions lacking historical data, and an adaptive loss function used during training to mitigate the data imbalances seen in epi-curves. The proposed smoothing technique, the generalized training strategy and the adaptive loss function largely increased the overall accuracy of the forecast, which enables efficient containment measures at a more localized micro-level.
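One simple way to realize an adaptive loss for imbalanced epi-curves, in the spirit of the third contribution above, is to weight each sample inversely to the frequency of its case-count bin, so the few high-case days are not drowned out by the many low-case days. The function below is a hypothetical sketch, not the paper's actual loss.

```python
import numpy as np

def adaptive_weighted_mse(y_true, y_pred, n_bins=5):
    """Weighted MSE: each sample's weight is inversely proportional to
    the population of its quantile bin over y_true, up-weighting rare
    high-case days in an imbalanced epi-curve. Illustrative sketch."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Interior quantile edges split y_true into n_bins groups.
    edges = np.quantile(y_true, np.linspace(0, 1, n_bins + 1)[1:-1])
    idx = np.digitize(y_true, edges)
    counts = np.bincount(idx, minlength=n_bins).astype(float)
    weights = 1.0 / np.maximum(counts[idx], 1.0)
    weights /= weights.sum()
    return float(np.sum(weights * (y_true - y_pred) ** 2))
```

A perfect forecast gives zero loss regardless of the weighting; errors on sparsely populated bins contribute more per sample than errors on dense ones.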