which completes the proof.
Neyman-Pearson multiclass classification under label noise via empirical likelihood
Qiong Zhang, Qinglong Tian, Pengfei Li
In many classification problems, the costs of misclassifying observations from different classes can be highly unequal. The Neyman-Pearson multiclass classification (NPMC) framework addresses this issue by minimizing a weighted misclassification risk while imposing upper bounds on class-specific error probabilities. Existing NPMC methods typically assume that training labels are correctly observed. In practice, however, labels are often corrupted due to measurement or annotation error, and the effect of such label noise on NPMC procedures remains largely unexplored. We study the NPMC problem when only noisy labels are available in the training data. We propose an empirical likelihood (EL)-based method that relates the distributions of noisy and true labels through an exponential tilting density ratio model. The resulting maximum EL estimators recover the class proportions and posterior probabilities of the clean labels required for error control. We establish consistency, asymptotic normality, and optimal convergence rates for these estimators. Under mild conditions, the resulting classifier satisfies NP oracle inequalities with respect to the true labels asymptotically. An expectation-maximization algorithm computes the maximum EL estimators. Simulations show that the proposed method performs comparably to the oracle classifier under clean labels and substantially improves over procedures that ignore label noise.
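The abstract does not spell out the form of the exponential tilting density ratio model. A common parameterization from the density-ratio literature (an assumption here, not necessarily the paper's exact specification) links the class-conditional densities of the covariates to a baseline class through a log-linear tilt:

```latex
\frac{f_k(x)}{f_0(x)} = \exp\{\alpha_k + \beta_k^{\top} x\}, \qquad k = 1, \dots, K,
```

where $f_k$ denotes the covariate density in class $k$, class $0$ serves as the baseline, and $(\alpha_k, \beta_k)$ are tilting parameters to be estimated; the empirical likelihood then profiles out the unspecified baseline $f_0$.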
e9bf14a419d77534105016f5ec122d62-Supplemental.pdf
Therefore, if ν(·) < +∞, then we can bound (10) with e^α ν(·). To avoid crowded notation, we drop the conditioning on z from Pr[· | ρ = z]. The issue is how to proceed. Let φ be the standard normal density function and Φ be the CDF. The algorithm uses SVT such that it only releases the private answers to the queries if the answer is sufficiently different from the "guess".
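The release rule described above follows the standard Sparse Vector Technique (SVT) pattern. A minimal sketch of that pattern is below; the function name, noise scales, and parameters are illustrative assumptions, not taken from this supplement:

```python
import numpy as np

def sparse_vector(queries, guesses, threshold, epsilon, sensitivity=1.0, rng=None):
    """SVT sketch: release a query's answer only when it differs from the
    public 'guess' by more than a noisy threshold; otherwise output the guess."""
    rng = rng or np.random.default_rng(0)
    # Laplace noise on the threshold (scale split is one common convention)
    rho = rng.laplace(0.0, 2 * sensitivity / epsilon)
    out = []
    for q, g in zip(queries, guesses):
        # fresh Laplace noise on each query's gap from its guess
        nu = rng.laplace(0.0, 4 * sensitivity / epsilon)
        if abs(q - g) + nu > threshold + rho:
            out.append(q)   # sufficiently different: release the answer
        else:
            out.append(g)   # otherwise fall back to the guess
    return out
```

The privacy savings come from the fallback branch: answers close to the guess leak nothing beyond the (noisy) comparison itself.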
a376033f78e144f494bfc743c0be3330-Supplemental.pdf
In this section, we provide a theoretical analysis of HSPG. Moreover, we further point out that: (1) the Sub-gradient Descent Step we used to achieve a "close enough" solution can be replaced by other methods, and (2) Assumption 4 is only a sufficient condition that we could use to show the "close enough" condition. B.1 Related Work Problem (12) has been well studied in deterministic optimization, with various algorithms capable of returning solutions with both low objective value and high group sparsity under proper λ (95; 73; 42; 64). For example, the proximal stochastic variance-reduced gradient method (Prox-SVRG) (88) and proximal spider (Prox-Spider) (97) are developed to adopt multi-stage schemes based on the well-known variance reduction technique SVRG proposed in (46) and Spider developed in (22), respectively. Under Assumption 1, the search direction d_k is a descent direction for ψ_{B_k}(x_k), i.e., d_k^T ∇ψ_{B_k}(x_k) < 0.
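The proximal methods cited above (Prox-SVRG, Prox-Spider) all rely on the proximal operator of the group-sparsity regularizer. A minimal sketch of that operator for the group-lasso penalty λ Σ_g ||x_g||_2 is below; the function name and group encoding are illustrative assumptions, not this paper's implementation:

```python
import numpy as np

def prox_group_l2(x, groups, lam, step):
    """Proximal operator of step * lam * sum_g ||x_g||_2:
    shrink each group's Euclidean norm, zeroing out small groups entirely."""
    out = x.copy()
    for g in groups:  # each g is a list of coordinate indices
        norm = np.linalg.norm(x[g])
        # blockwise soft-thresholding: groups with norm <= step*lam vanish
        scale = max(0.0, 1.0 - step * lam / norm) if norm > 0 else 0.0
        out[g] = scale * x[g]
    return out
```

Zeroing whole groups at once is what produces the group sparsity these methods target; a plain entrywise soft-threshold would only zero individual coordinates.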
789ba2ae4d335e8a2ad283a3f7effced-Supplemental.pdf
A is given by A_{k,ℓ} := Pr[y_k(x) = ℓ], which represents the probability of the kth service producing label ℓ. The scalar function F_{k,ℓ}(X) := Pr[q_k(x) ≤ X | y(x) = ℓ] is the probability that the quality score produced by the kth service is less than a threshold X, conditional on its predicted label being ℓ. There are 3 steps for solving Problem 3.3. Thus, ψ_i(·) and ψ_{i,j}(·) are piecewise quadratic functions. To solve Problem 3.2, let us first denote Ω3 = {x … In other words, z0 has the same objective function value as z.
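The quantities A_{k,ℓ} and F_{k,ℓ}(X) defined above have natural plug-in estimates from held-out data for a single service k. A minimal sketch under that assumption (function name and array layout are illustrative, not from this supplement):

```python
import numpy as np

def estimate_service_stats(preds, scores, labels, num_classes):
    """For one service k: A[l] is the empirical frequency of predicted
    label l, and F(l, X) is the empirical CDF of the quality score
    restricted to examples whose conditioning label equals l."""
    A = np.bincount(preds, minlength=num_classes) / len(preds)
    def F(l, X):
        s = scores[labels == l]
        return float(np.mean(s <= X)) if len(s) else 0.0
    return A, F
```

Because F(l, ·) is an empirical CDF, it is a piecewise-constant nondecreasing step function, which is what makes the ψ functions built from it piecewise quadratic after integration.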