AITopics | spo loss

Neural Information Processing Systems http://nips.cc/

generalization, spo loss, spö, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

a70145bf8b173e4496b554ce57969e24-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-13-2026, 10:56:35 GMT

classification, representation, spo framework, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

RiskBoundsandCalibrationforaSmart Predict-then-OptimizeMethod

Neural Information Processing SystemsFeb-10-2026, 21:33:22 GMT

Moreover, since the SPO loss is not continuous nor convex in general [Elmachtoub and Grigas, 2021], which makesthe training ofaprediction model computationally intractable, Elmachtoub and Grigas [2021] introduced a novel convex surrogate loss, referred to as the SPO+ loss.

artificial intelligence, loss function, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Decision-Focused Sequential Experimental Design: A Directional Uncertainty-Guided Approach

Wan, Beichen, Liu, Mo, Grigas, Paul, Shen, Zuo-Jun Max

arXiv.org Machine LearningFeb-6-2026

We consider the sequential experimental design problem in the predict-then-optimize paradigm. In this paradigm, the outputs of the prediction model are used as coefficient vectors in a downstream linear optimization problem. Traditional sequential experimental design aims to control the input variables (features) so that the improvement in prediction accuracy from each experimental outcome (label) is maximized. However, in the predict-then-optimize setting, performance is ultimately evaluated based on the decision loss induced by the downstream optimization, rather than by prediction error. This mismatch between prediction accuracy and decision loss renders traditional decision-blind designs inefficient. To address this issue, we propose a directional-based metric to quantify predictive uncertainty. This metric does not require solving an optimization oracle and is therefore computationally tractable. We show that the resulting sequential design criterion enjoys strong consistency and convergence guarantees. Under a broad class of distributions, we demonstrate that our directional uncertainty-based design attains an earlier stopping time than decision-blind designs. This advantage is further supported by real-world experiments on an LLM job allocation problem.

large language model, machine learning, natural language, (20 more...)

arXiv.org Machine Learning

2602.0534

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > North Carolina > Orange County > Chapel Hill (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.86)

Industry:

Media > Film (0.46)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Neural Information Processing SystemsDec-24-2025, 20:06:55 GMT

The predict-then-optimize framework is fundamental in practical stochastic decision-making problems: first predict unknown parameters of an optimization model, then solve the problem using the predicted values. A natural loss function in this setting is defined by measuring the decision error induced by the predicted parameters, which was named the Smart Predict-then-Optimize (SPO) loss by Elmachtoub and Grigas [2021]. Since the SPO loss is typically nonconvex and possibly discontinuous, Elmachtoub and Grigas [2021] introduced a convex surrogate, called the SPO+ loss, that importantly accounts for the underlying structure of the optimization model. In this paper, we greatly expand upon the consistency results for the SPO+ loss provided by Elmachtoub and Grigas [2021]. We develop risk bounds and uniform calibration results for the SPO+ loss relative to the SPO loss, which provide a quantitative way to transfer the excess surrogate risk to excess true risk. By combining our risk bounds with generalization bounds, we show that the empirical minimizer of the SPO+ loss achieves low excess true risk with high probability. We first demonstrate these results in the case when the feasible region of the underlying optimization problem is a polyhedron, and then we show that the results can be strengthened substantially when the feasible region is a level set of a strongly convex function. We perform experiments to empirically demonstrate the strength of the SPO+ surrogate, as compared to standard $\ell_1$ and squared $\ell_2$ prediction error losses, on portfolio allocation and cost-sensitive multi-class classification problems.

name change, risk bound and calibration, smart predict-then-optimize method, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.82)
Information Technology > Artificial Intelligence > Machine Learning (0.75)

Add feedback

Generalization Bounds in the Predict-then-Optimize Framework

Othman El Balghiti, Adam Elmachtoub, Paul Grigas, Ambuj Tewari

Neural Information Processing SystemsAug-19-2025, 22:49:02 GMT

The predict-then-optimize framework is fundamental in many practical settings: predict the unknown parameters of an optimization problem, and then solve the problem using the predicted values of the parameters.

generalization, spo loss, spö, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > California > Alameda County > Berkeley (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

a70145bf8b173e4496b554ce57969e24-AuthorFeedback.pdf

Neural Information Processing SystemsAug-19-2025, 22:48:48 GMT

classification, representation, spo framework, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

b943325cc7b7422d2871b345bf9b067f-Paper.pdf

Neural Information Processing SystemsAug-17-2025, 02:09:45 GMT

artificial intelligence, machine learning, optimization problem, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Add feedback

Reviews: Generalization Bounds in the Predict-then-Optimize Framework

Neural Information Processing SystemsJan-26-2025, 07:21:44 GMT

This paper considers a learning framework called predict-then-optimize. The problem in this setting is that parameters which are used in making predictions are not necessarily at hand when predictions should be made (costs of taking certain roads at particular moment are needed when a route has to be planned), and should be predicted before optimizing over them (in previous example, costs in the past are known and associated with other features known also at the moment). The interesting part of the framework is, that the learning problem used in learning the costs uses a loss function over error on the decision of the optimizer (SPO loss), instead of a direct error over the learned cost. In this framework, the authors provide several generalization bounds in different settings over a linear objective function, such as when feasible region where problem is solved is either polyhedron or any compact and convex region. They further work with stronger convexity assumptions and in their framework generalize margin guarantees for binary classification, and also give two modified versions of the SPO loss.

generalization bound, predict-then-optimize framework, spo loss, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.97)

Add feedback

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Neural Information Processing SystemsJan-18-2025, 21:37:25 GMT

The predict-then-optimize framework is fundamental in practical stochastic decision-making problems: first predict unknown parameters of an optimization model, then solve the problem using the predicted values. A natural loss function in this setting is defined by measuring the decision error induced by the predicted parameters, which was named the Smart Predict-then-Optimize (SPO) loss by Elmachtoub and Grigas [2021]. Since the SPO loss is typically nonconvex and possibly discontinuous, Elmachtoub and Grigas [2021] introduced a convex surrogate, called the SPO loss, that importantly accounts for the underlying structure of the optimization model. In this paper, we greatly expand upon the consistency results for the SPO loss provided by Elmachtoub and Grigas [2021]. We develop risk bounds and uniform calibration results for the SPO loss relative to the SPO loss, which provide a quantitative way to transfer the excess surrogate risk to excess true risk.

risk bound and calibration, smart predict-then-optimize method, spo loss, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.65)

Add feedback

Filters

Collaborating Authors

spo loss

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Generalization Bounds in the Predict-then-Optimize Framework

a70145bf8b173e4496b554ce57969e24-AuthorFeedback.pdf

RiskBoundsandCalibrationforaSmart Predict-then-OptimizeMethod

Decision-Focused Sequential Experimental Design: A Directional Uncertainty-Guided Approach

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Generalization Bounds in the Predict-then-Optimize Framework

a70145bf8b173e4496b554ce57969e24-AuthorFeedback.pdf

b943325cc7b7422d2871b345bf9b067f-Paper.pdf

Reviews: Generalization Bounds in the Predict-then-Optimize Framework

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method