split conformal prediction
Conformalized Quantile Regression
Yaniv Romano, Evan Patterson, Emmanuel Candes
Conformal prediction is atechnique for constructing prediction intervals that attainvalidcoverage infinite samples, without making distributional assumptions. Despite this appeal, existing conformal methods can be unnecessarily conservativebecause theyform intervals ofconstant orweakly varying length across the input space.
- North America > United States > Pennsylvania (0.04)
- North America > United States > Alaska (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
Predictive inference for time series: why is split conformal effective despite temporal dependence?
Barber, Rina Foygel, Pananjady, Ashwin
We consider the problem of uncertainty quantification for prediction in a time series: if we use past data to forecast the next time point, can we provide valid prediction intervals around our forecasts? To avoid placing distributional assumptions on the data, in recent years the conformal prediction method has been a popular approach for predictive inference, since it provides distribution-free coverage for any iid or exchangeable data distribution. However, in the time series setting, the strong empirical performance of conformal prediction methods is not well understood, since even short-range temporal dependence is a strong violation of the exchangeability assumption. Using predictors with "memory" -- i.e., predictors that utilize past observations, such as autoregressive models -- further exacerbates this problem. In this work, we examine the theoretical properties of split conformal prediction in the time series setting, including the case where predictors may have memory. Our results bound the loss of coverage of these methods in terms of a new "switch coefficient", measuring the extent to which temporal dependence within the time series creates violations of exchangeability. Our characterization of the coverage probability is sharp over the class of stationary, $β$-mixing processes. Along the way, we introduce tools that may prove useful in analyzing other predictive inference methods for dependent data.
- North America > United States > New York (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- North America > United States > Colorado > Jefferson County > Golden (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > Canada (0.04)
- Asia (0.04)
Singleton-Optimized Conformal Prediction
Wang, Tao, Sun, Yan, Dobriban, Edgar
Conformal prediction can be used to construct prediction sets that cover the true outcome with a desired probability, but can sometimes lead to large prediction sets that are costly in practice. The most useful outcome is a singleton prediction-an unambiguous decision-yet existing efficiency-oriented methods primarily optimize average set size. Motivated by this, we propose a new nonconformity score that aims to minimize the probability of producing non-singleton sets. Starting from a non-convex constrained optimization problem as a motivation, we provide a geometric reformulation and associated algorithm for computing the nonconformity score and associated split conformal prediction sets in O(K) time for K-class problems. Using this score in split conformal prediction leads to our proposed Singleton-Optimized Conformal Prediction (SOCOP) method. We evaluate our method in experiments on image classification and LLM multiple-choice question-answering, comparing with standard nonconformity scores such as the (negative) label probability estimates and their cumulative distribution function; both of which are motivated by optimizing length. The results show that SOCOP increases singleton frequency (sometimes by over 20%) compared to the above scores, with minimal impact on average set size.
- North America > United States > Pennsylvania (0.04)
- North America > United States > New Jersey (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- (2 more...)
- North America > United States > Pennsylvania (0.04)
- North America > United States > Alaska (0.04)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > United States > Pennsylvania (0.05)
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Europe > France (0.04)
Improving the statistical efficiency of cross-conformal prediction
Gasparin, Matteo, Ramdas, Aaditya
Conformal prediction has emerged as a general and versatile framework for constructing prediction sets in regression and classification tasks (Shafer and Vovk, 2008). Unlike traditional methods, which often depend on rigid distributional assumptions, conformal prediction transforms point predictions from any prediction (or black-box) algorithm into prediction sets that guarantee valid finite-sample marginal coverage. Originally introduced by Vovk et al. (2005), it has become increasingly influential, with numerous methods and extensions being proposed since its introduction. In particular, full conformal prediction by Vovk et al. (2005), demonstrates favorable properties regarding the coverage and the size of the prediction set. However, these advantages are counterbalanced by a substantial computational cost, which limits its practical application.
- North America > United States (0.14)
- Europe > Greece (0.04)
- Europe > Finland > Uusimaa > Helsinki (0.04)
Projected random forests and conformal prediction of circular data
F., Paulo C. Marques, Artes, Rinaldo, Graziadei, Helton
We apply split conformal prediction techniques to regression problems with circular responses by introducing a suitable conformity score, leading to prediction sets with adaptive arc length and finite-sample coverage guarantees for any circular predictive model under exchangeable data. Leveraging the high performance of existing predictive models designed for linear responses, we analyze a general projection procedure that converts any linear response regression model into one suitable for circular responses. When random forests serve as basis models in this projection procedure, we harness the out-of-bag dynamics to eliminate the necessity for a separate calibration sample in the construction of prediction sets. For synthetic and real datasets the resulting projected random forests model produces more efficient out-of-bag conformal prediction sets, with shorter median arc length, when compared to the split conformal prediction sets generated by two existing alternative models.
- Europe > Austria > Vienna (0.14)
- South America > Brazil > São Paulo (0.04)
- South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
- Africa > Cabo Verde > Praia > Praia (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.96)
- Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.86)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)
Conformalized Physics-Informed Neural Networks
Podina, Lena, Rad, Mahdi Torabi, Kohandel, Mohammad
Physics-informed neural networks (PINNs) are an influential method of solving differential equations and estimating their parameters given data. However, since they make use of neural networks, they provide only a point estimate of differential equation parameters, as well as the solution at any given point, without any measure of uncertainty. Ensemble and Bayesian methods have been previously applied to quantify the uncertainty of PINNs, but these methods may require making strong assumptions on the data-generating process, and can be computationally expensive. Here, we introduce Conformalized PINNs (C-PINNs) that, without making any additional assumptions, utilize the framework of conformal prediction to quantify the uncertainty of PINNs by providing intervals that have finite-sample, distribution-free statistical validity.
- North America > Canada > Ontario > Waterloo Region > Waterloo (0.05)
- Europe > Finland > Uusimaa > Helsinki (0.04)