AITopics | cross entropy loss

4c4c937b67cc8d785cea1e42ccea185c-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 19:14:41 GMT

Proof of Proposition 1. Due to Jensen's inequality and the fact that, by assumption, the distribution of human predictions P(h|x) is not a point-mass, it holds that Eh[`(h(x),y) |x] > `(µh(x),y). Proof of Theorem 3. We first provide the proof of the unconstrained case. Note that the above problem is a linear program and it decouples with respect to x. Therefore, for each x, the optimal solution is clearly given by: π m(d= 1 |x) = 1 if Ey|x[`(m(x),y) Eh|x[`(h,y)]] >0 0 otherwise Next, we provide the proof of the constrained case. To this aim, we consider the dual formulation of the optimization problem, where we only introduce a Lagrangian multiplier τP,b for the first constraint, i.e., maximize Ex π(x) Ey,h|x[`(h,y)] Ey|x[`(m(x),y)] + Ex [τP,b(π(x) b)] (13) subject to 0 π(x) 1 x X. (14) 13 The inner minimization problem can be solved using the similar argument for the unconstrained case.

artificial intelligence, machine learning, test sample, (15 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Add feedback

33d6548e48d4318ceb0e3916a79afc84-Supplemental.pdf

Neural Information Processing SystemsApr-25-2026, 10:29:14 GMT

artificial intelligence, machine learning, probability, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.93)

Add feedback

03a90e1bb2ceb2ea165424f2d96aa3a1-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 08:34:26 GMT

artificial intelligence, classification, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > Japan (0.28)

Genre: Research Report (1.00)

Industry: Social Sector (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

03a90e1bb2ceb2ea165424f2d96aa3a1-Paper-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 08:34:23 GMT

artificial intelligence, classification, machine learning, (16 more...)

Neural Information Processing Systems

Country: Asia > Japan (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

b432f34c5a997c8e7c806a895ecc5e25-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 19:41:14 GMT

loss function, prediction, vit, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota (0.04)
Asia > Singapore (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

b432f34c5a997c8e7c806a895ecc5e25-Paper.pdf

Neural Information Processing SystemsFeb-10-2026, 19:41:10 GMT

incorrect prediction, prediction, trustworthiness, (13 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Asia > Singapore (0.04)
North America > United States > Minnesota (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Education (0.93)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Vision (0.94)
(2 more...)

Add feedback

In our method and all the baselines except surrogate-based triage, we use the cross-entropy loss and implement SGD using Adam optimizer [40] with initial learning rate set by cross validation independently foreachmethod andleveloftriageb. Insurrogate-based triage, weusethelossand optimization method used by the authors in their public implementation. Moreover, we use early stopping with the patience parameterep = 10,i.e.,we stop the training process ifno reduction of cross entropy loss is observed on the validation set. This suggests that the humans aremore accurate than thepredictivemodel throughout theentire feature space. This suggests that the humans are less accurate than the predictive model in some regions of the featurespace.

artificial intelligence, machine learning, pb isfoundusingcrossvalidation, (16 more...)

Neural Information Processing Systems

Technology: