AITopics | different loss function

Collaborating Authors

different loss function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

0f2818101a7ac4b96ceeba38de4b934c-Supplemental.pdf

Neural Information Processing SystemsApr-24-2026, 17:30:40 GMT

artificial intelligence, loss function, machine learning, (12 more...)

Neural Information Processing Systems

Industry: Health & Medicine (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

cdce17de141c9fba3bdf175a0b721941-Paper-Conference.pdf

Neural Information Processing SystemsFeb-11-2026, 23:29:07 GMT

arxiv preprint arxiv, classifier, loss function, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Ohio (0.04)
North America > United States > Michigan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

SEVIR: AStormEventImageryDatasetforDeep LearningApplicationsinRadarandSatellite Meteorology

Neural Information Processing SystemsFeb-11-2026, 05:08:34 GMT

Modern deep learning approaches haveshown promising results inmeteorological applications like precipitation nowcasting, synthetic radar generation, front detection and several others. Inorder toeffectively train and validate these complex algorithms, large and diverse datasets containing high-resolution imagery are required. Petabytes of weather data, such as from the Geostationary Environmental SatelliteSystem(GOES)andtheNext-Generation Radar(NEXRAD) system, are available to the public; however, the size and complexity of these datasets isahindrance todeveloping and training deep models.

artificial intelligence, machine learning, sevir, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > Texas > Travis County > Austin (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry: Government (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

Add feedback

Training Uncertainty

Neural Information Processing SystemsFeb-10-2026, 16:17:20 GMT

The first subset (in red) is utilized to evaluate a traditional accuracy-basedlossfunction `a,suchasthecrossentropy. This benchmark is based on a loss function designed to incentivize the trained model to produce the smallest possible conformal prediction sets with the desired coverage (e.g., 90% ifα = 0.1). The hybrid training procedure is similar to Algorithm 1, in the sense that it relies on analogous soft-sorting, soft-ranking, and soft-indexing algorithms toevaluate adifferentiable approximation Wi oftheconformity scoreWi in(8). Above, the second equality follows directly from the fact thatS(x,U;π,t), defined in (A2), is by construction increasing in t, and therefore Y / S(x,U;π,1 α) if and only if min{t [0,1]:Y S(x,U;π,t)}>1 α. The proof consists of showing that`a and`u are separately minimized by ˆπ = π,although only approximately inthelatter case.

artificial intelligence, early stopping, machine learning, (10 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.04)
North America > United States (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)

Add feedback

22bb543b251c39ccdad8063d486987bb-Paper.pdf

Neural Information Processing SystemsFeb-7-2026, 19:33:19 GMT

However, both L2 and BL have two deficiencies. First, the noise in the annotation process is not considered in a principled way. L2 and BL make an assumption about per-pixel i.i.d.

artificial intelligence, density map, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Shanghai > Shanghai (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.69)

Add feedback

165a59f7cf3b5c4396ba65953d679f17-Supplemental.pdf

Neural Information Processing SystemsFeb-7-2026, 14:32:08 GMT

conv, different loss function, leaky relu 128 16 16, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

Assessment of different loss functions for fitting equivalent circuit models to electrochemical impedance spectroscopy data

Jaberi, Ali, Sadeghi, Amin, Zhang, Runze, Zhao, Zhaoyang, Shi, Qiuyu, Black, Robert, Sadighi, Zoya, Hattrick-Simpers, Jason

arXiv.org Artificial IntelligenceOct-14-2025

Electrochemical impedance spectroscopy (EIS) data is typically modeled using an equivalent circuit model (ECM), with parameters obtained by minimizing a loss function via nonlinear least squares fitting. This paper introduces two new loss functions, log-B and log-BW, derived from the Bode representation of EIS. Using a large dataset of generated EIS data, the performance of proposed loss functions was evaluated alongside existing ones in terms of R2 scores, chi-squared, computational efficiency, and the mean absolute percentage error (MAPE) between the predicted component values and the original values. Statistical comparisons revealed that the choice of loss function impacts convergence, computational efficiency, quality of fit, and MAPE. Our analysis showed that X2 loss function (squared sum of residuals with proportional weighting) achieved the highest performance across multiple quality of fit metrics, making it the preferred choice when the quality of fit is the primary goal. On the other hand, log-B offered a slightly lower quality of fit while being approximately 1.4 times faster and producing lower MAPE for most circuit components, making log-B as a strong alternative. This is a critical factor for large-scale least squares fitting in data-driven applications, such as training machine learning models on extensive datasets or iterations.

artificial intelligence, machine learning, optimization problem, (16 more...)

arXiv.org Artificial Intelligence

2510.09662

Country: North America > Canada > Ontario > Toronto (0.14)

Genre: Research Report (0.64)

Industry: Energy > Energy Storage (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Add feedback

165a59f7cf3b5c4396ba65953d679f17-Supplemental.pdf

Neural Information Processing SystemsOct-2-2025, 05:23:00 GMT

artificial intelligence, conv, different loss function, (12 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

Appendices

Neural Information Processing SystemsAug-19-2025, 00:23:54 GMT

The appendix is organized as follows. We first introduce the basic definitions and inequalities used throughout the appendices. In Appendix A, we provide more details about the datasets, computational resources, and more experiment results on CIFAR10, CIFAR100 and miniImageNet datasets. In Appendix B, we prove that CE, FL and LS satisfy the contrastive property in Definition 1. In Appendix C, we provide a detailed proof for Theorem 1, showing that the Simplex ETFs are the only global minimizers, as long as the loss function satisfies the Definition 1. Finally, in Appendix D, we present the whole proof for Theorem 2 that the FL function is a locally strict saddle function with no spurious local minimizers existing locally and LS function is a globally strict saddle function with no spurious local minimizers existing globally. The following Lemma extends the standard variational form of the nuclear norm.

artificial intelligence, loss function, minimizer, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.68)

Add feedback