AITopics | heteroscedasticity

Collaborating Authors

heteroscedasticity

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Absolute Neighbour Difference based Correlation Test for Detecting Heteroscedastic Relationships

Neural Information Processing SystemsApr-27-2026, 06:53:41 GMT

It is a challenge to detect complicated data relationships thoroughly. Here, we propose a new statistical measure, named the absolute neighbour difference based neighbour correlation coefficient, to detect the associations between variables through examining the heteroscedasticity of the unpredictable variation of dependent variables. Different from previous studies, the new method concentrates on measuring nonfunctional relationships rather than functional or mixed associations. Either used alone or in combination with other measures, it enables not only a convenient test of heteroscedasticity, but also measuring functional and nonfunctional relationships separately that obviously leads to a deeper insight into the data associations. The method is concise and easy to implement that does not rely on explicitly estimating the regression residuals or the dependencies between variables so that it is not restrict to any kind of model assumption. The mechanisms of the correlation test are proved in theory and demonstrated with numerical analyses.

artificial intelligence, heteroscedasticity, machine learning, (15 more...)

Neural Information Processing Systems

Country:

North America (0.46)
Asia > China (0.28)

Genre: Research Report > New Finding (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

d5cfead94f5350c12c322b5b664544c1-Paper.pdf

Neural Information Processing SystemsFeb-11-2026, 08:47:28 GMT

heteroscedasticity, nullnull, nullnullnull, (14 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > United Kingdom (0.04)
(2 more...)

Genre: Research Report > New Finding (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)

Add feedback

Supplementary Material of Absolute Neighbour Difference based Correlation T est for Detecting Heteroscedastic Relationships

Neural Information Processing SystemsAug-17-2025, 15:47:42 GMT

According to the Cauchy Schwarz inequality, it should also have a value between 1. 2 Second, consider the numerator of (7). For M > 2, it can be proved in the same way as above. White test was set to be 15, otherwise it may fail to detect the heteroscedasticity of the residuals. This is because when 99% or even 99.9% of the variance of Four existing association measures were also implemented for make comparisons with the proposed method. These approaches are typical and well-established.

artificial intelligence, machine learning, subdomain, (13 more...)

Neural Information Processing Systems

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > Canada (0.04)
Asia > China > Beijing > Beijing (0.04)

Industry: Health & Medicine > Therapeutic Area (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Conformalized Regression for Continuous Bounded Outcomes

Wu, Zhanli, Leisen, Fabrizio, Rubio, F. Javier

arXiv.org Machine LearningJul-21-2025

Regression problems with bounded continuous outcomes frequently arise in real-world statistical and machine learning applications, such as the analysis of rates and proportions. A central challenge in this setting is predicting a response associated with a new covariate value. Most of the existing statistical and machine learning literature has focused either on point prediction of bounded outcomes or on interval prediction based on asymptotic approximations. We develop conformal prediction intervals for bounded outcomes based on transformation models and beta regression. We introduce tailored non-conformity measures based on residuals that are aligned with the underlying models, and account for the inherent heteroscedasticity in regression settings with bounded outcomes. We present a theoretical result on asymptotic marginal and conditional validity in the context of full conformal prediction, which remains valid under model misspecification. For split conformal prediction, we provide an empirical coverage analysis based on a comprehensive simulation study. The simulation study demonstrates that both methods provide valid finite-sample predictive coverage, including settings with model misspecification. Finally, we demonstrate the practical performance of the proposed conformal prediction intervals on real data and compare them with bootstrap-based alternatives.

artificial intelligence, machine learning, regression model, (15 more...)

arXiv.org Machine Learning

2507.14023

Country: Europe > United Kingdom > England > Greater London > London (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Education > Curriculum > Subject-Specific Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.32)

Add feedback

ALPCAH: Subspace Learning for Sample-wise Heteroscedastic Data

Cavazos, Javier Salazar, Fessler, Jeffrey A., Balzano, Laura

arXiv.org Machine LearningMay-13-2025

Principal component analysis (PCA) is a key tool in the field of data dimensionality reduction. However, some applications involve heterogeneous data that vary in quality due to noise characteristics associated with each data sample. Heteroscedastic methods aim to deal with such mixed data quality. This paper develops a subspace learning method, named ALPCAH, that can estimate the sample-wise noise variances and use this information to improve the estimate of the subspace basis associated with the low-rank structure of the data. Our method makes no distributional assumptions of the low-rank component and does not assume that the noise variances are known. Further, this method uses a soft rank constraint that does not require subspace dimension to be known. Additionally, this paper develops a matrix factorized version of ALPCAH, named LR-ALPCAH, that is much faster and more memory efficient at the cost of requiring subspace dimension to be known or estimated. Simulations and real data experiments show the effectiveness of accounting for data heteroscedasticity compared to existing algorithms. Code available at https://github.com/javiersc1/ALPCAH.

artificial intelligence, lr-alpcah, machine learning, (14 more...)

arXiv.org Machine Learning

doi: 10.1109/TSP.2025.3537867

2505.07272

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.28)
North America > United States > Wisconsin > Dane County > Madison (0.14)
North America > United States > Texas > Tarrant County > Arlington (0.14)
(4 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

H-AddiVortes: Heteroscedastic (Bayesian) Additive Voronoi Tessellations

Stone, Adam J., Gosling, John Paul

arXiv.org Machine LearningMar-17-2025

This paper introduces the Heteroscedastic AddiVortes model, a Bayesian non-parametric regression framework that simultaneously models the conditional mean and variance of a response variable using adaptive Voronoi tessellations. By employing a sum-of-tessellations approach for the mean and a product-of-tessellations approach for the variance, the model provides a flexible and interpretable means to capture complex, predictor-dependent relationships and heteroscedastic patterns in data. This dual-layer representation enables precise inference, even in high-dimensional settings, while maintaining computational feasibility through efficient Markov Chain Monte Carlo (MCMC) sampling and conjugate prior structures. We illustrate the model's capability through both simulated and real-world datasets, demonstrating its ability to capture nuanced variance structures, provide reliable predictive uncertainty quantification, and highlight key predictors influencing both the mean response and its variability. Empirical results show that the Heteroscedastic AddiVortes model offers a substantial improvement in capturing distributional properties compared to both homoscedastic and heteroscedastic alternatives, making it a robust tool for complex regression problems in various applied settings.

addivorte model, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

2503.13037

Country:

North America > United States (0.04)
Europe > Switzerland (0.04)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Heteroscedastic Double Bayesian Elastic Net

Kimura, Masanari

arXiv.org Machine LearningFeb-4-2025

In many practical applications, regression models are employed to uncover relationships between predictors and a response variable, yet the common assumption of constant error variance is frequently violated. This issue is further compounded in high-dimensional settings where the number of predictors exceeds the sample size, necessitating regularization for effective estimation and variable selection. To address this problem, we propose the Heteroscedastic Double Bayesian Elastic Net (HDBEN), a novel framework that jointly models the mean and log-variance using hierarchical Bayesian priors incorporating both $\ell_1$ and $\ell_2$ penalties. Our approach simultaneously induces sparsity and grouping in the regression coefficients and variance parameters, capturing complex variance structures in the data. Theoretical results demonstrate that proposed HDBEN achieves posterior concentration, variable selection consistency, and asymptotic normality under mild conditions which justifying its behavior. Simulation studies further illustrate that HDBEN outperforms existing methods, particularly in scenarios characterized by heteroscedasticity and high dimensionality.

artificial intelligence, heteroscedasticity, machine learning, (16 more...)

arXiv.org Machine Learning

2502.02032

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Advancing sleep detection by modelling weak label sets: A novel weakly supervised learning approach

Boeker, Matthias, Thambawita, Vajira, Riegler, Michael, Halvorsen, Pål, Hammer, Hugo L.

arXiv.org Artificial IntelligenceFeb-27-2024

Understanding sleep and activity patterns plays a crucial role in physical and mental health. This study introduces a novel approach for sleep detection using weakly supervised learning for scenarios where reliable ground truth labels are unavailable. The proposed method relies on a set of weak labels, derived from the predictions generated by conventional sleep detection algorithms. Introducing a novel approach, we suggest a novel generalised non-linear statistical model in which the number of weak sleep labels is modelled as outcome of a binomial distribution. The probability of sleep in the binomial distribution is linked to the outcomes of neural networks trained to detect sleep based on actigraphy. We show that maximizing the likelihood function of the model, is equivalent to minimizing the soft cross-entropy loss. Additionally, we explored the use of the Brier score as a loss function for weak labels. The efficacy of the suggested modelling framework was demonstrated using the Multi-Ethnic Study of Atherosclerosis dataset. A \gls{lstm} trained on the soft cross-entropy outperformed conventional sleep detection algorithms, other neural network architectures and loss functions in accuracy and model calibration. This research not only advances sleep detection techniques in scenarios where ground truth data is scarce but also contributes to the broader field of weakly supervised learning by introducing innovative approach in modelling sets of weak labels.

algorithm, neural network, prediction uncertainty, (14 more...)

arXiv.org Artificial Intelligence

2402.17601

Country:

Europe > Norway > Eastern Norway > Oslo (0.04)
Oceania > Australia (0.04)
North America > United States > Pennsylvania > Westmoreland County > Murrysville (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Promising Solution (0.88)
Overview > Innovation (0.74)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Add feedback

Statistical Agnostic Regression: a machine learning method to validate regression models

Gorriz, Juan M, Ramirez, J., Segovia, F., Martinez-Murcia, F. J., Jiménez-Mesa, C., Suckling, J.

arXiv.org Machine LearningFeb-23-2024

Regression analysis is a central topic in statistical modeling, aiming to estimate the relationships between a dependent variable, commonly referred to as the response variable, and one or more independent variables, i.e., explanatory variables. Linear regression is by far the most popular method for performing this task in several fields of research, such as prediction, forecasting, or causal inference. Beyond various classical methods to solve linear regression problems, such as Ordinary Least Squares, Ridge, or Lasso regressions - which are often the foundation for more advanced machine learning (ML) techniques - the latter have been successfully applied in this scenario without a formal definition of statistical significance. At most, permutation or classical analyses based on empirical measures (e.g., residuals or accuracy) have been conducted to reflect the greater ability of ML estimations for detection. In this paper, we introduce a method, named Statistical Agnostic Regression (SAR), for evaluating the statistical significance of an ML-based linear regression based on concentration inequalities of the actual risk using the analysis of the worst case. To achieve this goal, similar to the classification problem, we define a threshold to establish that there is sufficient evidence with a probability of at least 1-eta to conclude that there is a linear relationship in the population between the explanatory (feature) and the response (label) variables. Simulations in only two dimensions demonstrate the ability of the proposed agnostic test to provide a similar analysis of variance given by the classical $F$ test for the slope parameter.

correlation level, statistical agnostic regression, validate regression model, (13 more...)

arXiv.org Machine Learning

2402.15213

Country:

North America > United States > New York (0.04)
North America > United States > District of Columbia > Washington (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(3 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

A general framework for multi-step ahead adaptive conformal heteroscedastic time series forecasting

Sousa, Martim, Tomé, Ana Maria, Moreira, José

arXiv.org Machine LearningOct-11-2023

While considerable effort has been dedicated to identifying the most effective approaches, with the widely recognized M forecasting competition, initiated in 1982 (Makridakis et al., 1982), serving as a arena for these breakthroughs, quantifying the uncertainty of such predictions has received diminished attention. Indeed, the inclusion of prediction intervals (PIs) began to be considered in forecasting competitions only from the M4 competition (Makridakis et al., 2020) onward. Another example of this discrepancy was observed in the M5 forecasting competitions. The M5 "Accuracy" competition (Makridakis et al., 2022a), which centered on optimizing point predictions, garnered significant interest and participation, with a staggering 7, 092 participants 2 vying for top honors. In stark contrast, the M5 "Uncertainty" competition (Makridakis et al., 2022b), which aimed to assess the quality of estimated conditional quantiles, drew considerably less attention, involving only 1, 137 participants despite offering the same prize incentives.

aenbmimocqr, forecasting, machine learning, (21 more...)

arXiv.org Machine Learning

2207.14219

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
Europe > Portugal > Aveiro > Aveiro (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine (0.67)
Energy > Power Industry (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback