
Neural Information Processing Systems

Absent assumptions on the nature of shift, the problem is underspecified. Multiple assumptions may be compatible with the same observations while implying different courses of action.


A Unified View of Label Shift Estimation

Neural Information Processing Systems

Under label shift, the label distribution $p(y)$ might change but the class-conditional distributions $p(x|y)$ do not. There are two dominant approaches for estimating the label marginal. BBSE, a moment-matching approach based on confusion matrices, is provably consistent and provides interpretable error bounds. However, a maximum likelihood estimation approach, which we call MLLS, dominates empirically. In this paper, we present a unified view of the two methods and the first theoretical characterization of MLLS. Our contributions include (i) consistency conditions for MLLS, which include calibration of the classifier and a confusion matrix invertibility condition that BBSE also requires; (ii) a unified framework, casting BBSE as roughly equivalent to MLLS for a particular choice of calibration method; and (iii) a decomposition of MLLS's finite-sample error into terms reflecting miscalibration and estimation error. Our analysis attributes BBSE's statistical inefficiency to a loss of information due to coarse calibration.
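The moment-matching step behind BBSE can be sketched in a few lines: estimate the confusion matrix $p(\hat{y}, y)$ on labeled source data and the hard-prediction marginal on unlabeled target data, then solve the resulting linear system for the importance weights $w(y) = q(y)/p(y)$. The function name and toy setup below are illustrative assumptions, not code from the paper.

```python
import numpy as np

def bbse_weights(y_true_src, y_pred_src, y_pred_tgt, n_classes):
    """Black-Box Shift Estimation (moment matching on a confusion matrix).

    Solves C w = mu for the importance weights w_k = q(y=k) / p(y=k), where
    C[i, j] = p(y_hat=i, y=j) on source data and mu[i] is the fraction of
    target points predicted as class i.
    """
    # Source joint confusion matrix p(y_hat, y)
    C = np.zeros((n_classes, n_classes))
    for yp, yt in zip(y_pred_src, y_true_src):
        C[yp, yt] += 1.0
    C /= len(y_true_src)

    # Target marginal of hard predictions
    mu = np.bincount(y_pred_tgt, minlength=n_classes) / len(y_pred_tgt)

    # BBSE requires C to be invertible (the condition shared with MLLS);
    # clip small negatives arising from finite-sample noise
    w = np.linalg.solve(C, mu)
    return np.clip(w, 0.0, None)
```

For a perfect classifier the confusion matrix is diagonal and the recovered weights are exactly the ratio of target to source class frequencies; the target marginal is then `q(y) = w * p(y)`.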




Reviewer #1: We thank you for appreciating our contributions and providing valuable feedback, which will be taken into account.

Neural Information Processing Systems

The empirical results comparing parameter tying vs. the naive design are in fact reported in Table 3 of Appendix C.2, and the works you point to (e.g., Zhou, 2018) are related to IPVI, as you have suggested. We would like to address your comments and questions below. Regarding the necessity of parameter tying, we think overfitting is still an issue to be addressed, and we provide some experimental evidence below, as you have suggested: train/test mean log-likelihood (MLL) achieved by IPVI with and without parameter tying over 10 runs.




Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

Kasa, Siva Rajesh, Goel, Aniket, Gupta, Karan, Roychowdhury, Sumegh, Bhanushali, Anish, Pattisapu, Nikhil, Murthy, Prasanna Srinivasa

arXiv.org Artificial Intelligence

Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have primarily focused on modifying existing or creating novel loss functions that explicitly account for the ordinal nature of labels. However, with the advent of Pretrained Language Models (PLMs), it became possible to tackle ordinality through the implicit semantics of the labels as well. This paper provides a comprehensive theoretical and empirical examination of both these approaches. Furthermore, we also offer strategic recommendations regarding the most effective approach to adopt based on specific settings.
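One explicit way to encode ordinality, loosely in the spirit of the loss-function approaches surveyed above, is to train against soft targets that decay with ordinal distance from the true label (a SORD-style encoding). The helper names and the decay parameter below are illustrative assumptions; the paper's exact losses may differ.

```python
import numpy as np

def ordinal_soft_targets(y, n_classes, alpha=1.0):
    """SORD-style soft targets: probability mass decays with the ordinal
    distance |k - y|, so a 1-star review mislabelled as 2-star costs less
    than one mislabelled as 5-star."""
    ranks = np.arange(n_classes)
    logits = -alpha * np.abs(ranks - y)
    e = np.exp(logits - logits.max())
    return e / e.sum()

def ordinal_cross_entropy(probs, y, alpha=1.0):
    """Cross-entropy of predicted class probabilities against the
    distance-aware soft targets (an explicit ordinal loss)."""
    probs = np.asarray(probs, dtype=float)
    targets = ordinal_soft_targets(y, len(probs), alpha)
    return float(-np.sum(targets * np.log(probs + 1e-12)))
```

Compared with one-hot cross-entropy, predictions concentrated near the true rank incur a smaller loss than equally confident predictions far from it, which is exactly the ordinal structure the explicit approaches try to exploit.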


Rethinking Loss Functions for Fact Verification

Mukobara, Yuta, Shigeto, Yutaro, Shimbo, Masashi

arXiv.org Artificial Intelligence

We explore loss functions for fact verification in the FEVER shared task. While the cross-entropy loss is a standard objective for training verdict predictors, it fails to capture the heterogeneity among the FEVER verdict classes. In this paper, we develop two task-specific objectives tailored to FEVER. Experimental results confirm that the proposed objective functions outperform the standard cross-entropy loss. Performance is further improved when these objectives are combined with simple class weighting, which effectively overcomes the imbalance in the training data. The source code is available at https://github.com/yuta-mukobara/RLF-KGAT
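The "simple class weighting" mentioned above is commonly implemented as inverse-frequency weights folded into the cross-entropy loss. A minimal sketch follows; the helper names are hypothetical, and the paper's task-specific objectives are separate from this baseline weighting.

```python
import numpy as np

def inverse_frequency_weights(labels, n_classes):
    """Per-class weights inversely proportional to training frequency,
    normalised so the sample-weighted average weight equals 1."""
    counts = np.bincount(labels, minlength=n_classes).astype(float)
    return counts.sum() / (n_classes * counts)

def weighted_cross_entropy(probs, y, class_weights):
    """Cross-entropy for one example, scaled by its class weight so that
    rare verdict classes contribute more to the training signal."""
    probs = np.asarray(probs, dtype=float)
    return float(-class_weights[y] * np.log(probs[y] + 1e-12))
```

With a heavily imbalanced label distribution, the rarest class receives the largest weight while the normalisation keeps the overall loss scale comparable to unweighted training.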


Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving

Singh, Angad, Makhlouf, Omar, Igl, Maximilian, Messias, Joao, Doucet, Arnaud, Whiteson, Shimon

arXiv.org Artificial Intelligence

Multi-object state estimation is a fundamental problem for robotic applications where a robot must interact with other moving objects. Typically, other objects' relevant state features are not directly observable and must instead be inferred from observations. Particle filtering can perform such inference given approximate transition and observation models. However, these models are often unknown a priori, yielding a difficult parameter estimation problem since observations jointly carry transition and observation noise. In this work, we consider learning maximum-likelihood parameters using particle methods. Recent methods addressing this problem typically differentiate through time in a particle filter, which requires workarounds for the non-differentiable resampling step that yield biased or high-variance gradient estimates. By contrast, we exploit Fisher's identity to obtain a particle-based approximation of the score function (the gradient of the log-likelihood) that yields a low-variance estimate while only requiring stepwise differentiation through the transition and observation models. We apply our method to real data collected from autonomous vehicles (AVs) and show that it learns better models than existing techniques and is more stable in training, yielding an effective smoother for tracking the trajectories of vehicles around an AV.
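Fisher's identity, $\nabla_\theta \log p(y_{1:T}; \theta) = \mathbb{E}\left[\nabla_\theta \log p(x_{1:T}, y_{1:T}; \theta) \mid y_{1:T}\right]$, suggests the following particle sketch: accumulate stepwise gradients of the log transition density along each particle's ancestral path and average at the end, never differentiating through resampling. The toy linear-Gaussian model and function name below are illustrative assumptions, not the authors' AV models.

```python
import numpy as np

def particle_score(y, theta, sigma_v=1.0, sigma_w=1.0, n_particles=500, seed=0):
    """Particle approximation of d/dtheta log p(y_{1:T}; theta) via
    Fisher's identity, for the toy state space model
        x_t = theta * x_{t-1} + N(0, sigma_v^2),  y_t = x_t + N(0, sigma_w^2).
    Gradients of the log transition density are accumulated along each
    particle's ancestral path; resampling is never differentiated through.
    """
    rng = np.random.default_rng(seed)
    x = rng.normal(0.0, 1.0, n_particles)  # initial particle cloud
    grad = np.zeros(n_particles)           # running per-path gradient
    for y_t in y:
        x_prev = x
        x = theta * x_prev + rng.normal(0.0, sigma_v, n_particles)
        # stepwise gradient: d/dtheta log N(x_t; theta * x_{t-1}, sigma_v^2)
        grad = grad + (x - theta * x_prev) * x_prev / sigma_v**2
        # reweight by the observation likelihood, then resample; the index
        # array carries the accumulated gradients along particle ancestry
        logw = -0.5 * ((y_t - x) / sigma_w) ** 2
        w = np.exp(logw - logw.max())
        w /= w.sum()
        idx = rng.choice(n_particles, size=n_particles, p=w)
        x, grad = x[idx], grad[idx]
    return float(grad.mean())
```

The returned score can drive gradient ascent on $\theta$. Note that this basic bootstrap version suffers from path degeneracy over long sequences; fixed-lag or smoothing variants reduce its variance, which is part of what the paper addresses.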