AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

Automated Benchmark-Driven Design and Explanation of Hyperparameter Optimizers

Moosbauer, Julia, Binder, Martin, Schneider, Lennart, Pfisterer, Florian, Becker, Marc, Lang, Michel, Kotthoff, Lars, Bischl, Bernd

arXiv.org Machine LearningNov-29-2021

Automated hyperparameter optimization (HPO) has gained great popularity and is an important ingredient of most automated machine learning frameworks. The process of designing HPO algorithms, however, is still an unsystematic and manual process: Limitations of prior work are identified and the improvements proposed are -- even though guided by expert knowledge -- still somewhat arbitrary. This rarely allows for gaining a holistic understanding of which algorithmic components are driving performance, and carries the risk of overlooking good algorithmic design choices. We present a principled approach to automated benchmark-driven algorithm design applied to multifidelity HPO (MF-HPO): First, we formalize a rich space of MF-HPO candidates that includes, but is not limited to common HPO algorithms, and then present a configurable framework covering this space. To find the best candidate automatically and systematically, we follow a programming-by-optimization approach and search over the space of algorithm candidates via Bayesian optimization. We challenge whether the found design choices are necessary or could be replaced by more naive and simpler ones by performing an ablation analysis. We observe that using a relatively simple configuration, in some ways simpler than established methods, performs very well as long as some critical configuration parameters have the right value.

algorithm, configuration, optimization, (10 more...)

arXiv.org Machine Learning

2111.14756

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > Canada > Ontario > Toronto (0.14)
Europe > Spain > Andalusia > Cádiz Province > Cadiz (0.04)
(11 more...)

Genre:

Research Report > New Finding (0.67)
Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Data Science > Data Mining (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.92)
(2 more...)

Add feedback

From Kepler to Newton: Explainable AI for Science Discovery

Li, Zelong, Ji, Jianchao, Zhang, Yongfeng

arXiv.org Artificial IntelligenceNov-29-2021

The Observation--Hypothesis--Prediction--Experimentation loop paradigm for scientific research has been practiced by researchers for years towards scientific discoveries. However, with data explosion in both mega-scale and milli-scale scientific research, it has been sometimes very difficult to manually analyze the data and propose new hypothesis to drive the cycle for scientific discovery. In this paper, we discuss the role of Explainable AI in scientific discovery process by demonstrating an Explainable AI-based paradigm for science discovery. The key is to use Explainable AI to help derive data or model interpretations as well as scientific discoveries or insights. We show how computational and data-intensive methodology -- together with experimental and theoretical methodology -- can be seamlessly integrated for scientific research. To demonstrate the AI-based science discovery process, and to pay our respect to some of the greatest minds in human history, we show how Kepler's laws of planetary motion and the Newton's law of universal gravitation can be rediscovered by (Explainable) AI based on Tycho Brahe's astronomical observation data, whose works were leading the scientific revolution in the 16-17th century. This work also highlights the important role of Explainable AI (as compared to Blackbox AI) in science discovery to help humans prevent or better prepare for the possible technological singularity that may happen in the future.

discovery, explainable ai, kepler, (15 more...)

arXiv.org Artificial Intelligence

2111.1221

Country:

North America > United States > Washington > King County > Redmond (0.04)
North America > United States > New Jersey > Middlesex County > New Brunswick (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Education (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
(2 more...)

Add feedback

How Can Regression Models Help Us in Making Better Predictions?

#artificialintelligenceNov-27-2021, 11:50:22 GMT

You'd probably enjoy being able to make predictions about something important to you, right? In this post, I'll show you how to use regression information to analyze predictions and see if they're both unbiased and accurate. In these best data analytics courses online, you will have a better understanding of data analytics. Predictions can be made using regression equations. After fitting a model, regression equations are an important part of the statistical output.

dependent variable, prediction, regression, (8 more...)

#artificialintelligence

Industry: Education > Educational Setting > Online (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.73)

Add feedback

The 'How' Behind Machine Learning: An Overview of ML Algorithms

#artificialintelligenceNov-26-2021, 00:05:06 GMT

Now, we get into the more technical part of this article: ML Algorithms. I'll include a TL;DR, although I encourage you to browse through the different algorithms. I'll also include a really helpful AI course video that I personally used to learn and write this article at the very end. Without further wait, let's start going in-depth on the different algorithms used in machine learning. Linear regression is a supervised learning model used for regression that predicts a dependent variable (Y) based on the independent variable(s) (X) by fitting a line linearly.

algorithm, decision tree, regression, (12 more...)

#artificialintelligence

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.80)

Add feedback

Feature Selection for Causal Inference from High Dimensional Observational Data with Outcome Adaptive Elastic Net

Islam, Md Saiful, Noor-E-Alam, Md.

arXiv.org Artificial IntelligenceNov-26-2021

Feature selection is an extensively studied technique in the machine learning literature where the main objective is to identify the subset of features that provides the highest predictive power. However, in causal inference, our goal is to identify the set of variables that are associated with both the treatment variable and outcome (i.e., the confounders). While controlling for the confounding variables helps us to achieve an unbiased estimate of causal effect, recent research shows that controlling for purely outcome predictors along with the confounders can reduce the variance of the estimate. In this paper, we propose an Outcome Adaptive Elastic-Net (OAENet) method specifically designed for causal inference to select the confounders and outcome predictors for inclusion in the propensity score model or in the matching mechanism. OAENet provides two major advantages over existing methods: it performs superiorly on correlated data, and it can be applied to any matching method and any estimates. In addition, OAENet is computationally efficient compared to state-of-the-art methods.

causal inference, predictor, selection, (12 more...)

arXiv.org Artificial Intelligence

2111.138

Country:

North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Intro to Deep Learning project in TensorFlow 2.x and Python

#artificialintelligenceNov-25-2021, 10:45:05 GMT

The Black Friday Udemy sale begins. Shop to save on thousands of online courses. Welcome to the Course Introduction to Deep Learning with TensorFlow 2.0: In this course, you will learn advanced linear regression technique process and with this, you can be able to build any regression problem. Using this you can solve real-world problems like customer lifetime value, predictive analytics, etc. All the above-mentioned techniques are explained in TensorFlow.

deep learning project, python, tensorflow 2, (2 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.66)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.62)

Add feedback

MARS via LASSO

Ki, Dohyeong, Fang, Billy, Guntuboyina, Adityanand

arXiv.org Machine LearningNov-23-2021

MARS is a popular method for nonparametric regression introduced by Friedman in 1991. MARS fits simple nonlinear and non-additive functions to regression data. We propose and study a natural LASSO variant of the MARS method. Our method is based on least squares estimation over a convex class of functions obtained by considering infinite-dimensional linear combinations of functions in the MARS basis and imposing a variation based complexity constraint. We show that our estimator can be computed via finite-dimensional convex optimization and that it is naturally connected to nonparametric function estimation techniques based on smoothness constraints. Under a simple design assumption, we prove that our estimator achieves a rate of convergence that depends only logarithmically on dimension and thus avoids the usual curse of dimensionality to some extent. We implement our method with a cross-validation scheme for the selection of the involved tuning parameter and show that it has favorable performance compared to the usual MARS method in simulation and real data settings.

estimator, guntuboyina, mars, (16 more...)

arXiv.org Machine Learning

2111.11694

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York (0.04)
(4 more...)

Genre: Research Report (0.81)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Flexible Bayesian Nonlinear Model Configuration

Hubin, Aliaksandr | Storvik, Geir (University of Oslo) | Frommlet, Florian (Medical University of Vienna)

Journal of Artificial Intelligence ResearchNov-22-2021

Regression models are used in a wide range of applications providing a powerful scientific tool for researchers from different fields. Linear, or simple parametric, models are often not sufficient to describe complex relationships between input variables and a response. Such relationships can be better described through flexible approaches such as neural networks, but this results in less interpretable models and potential overfitting. Alternatively, specific parametric nonlinear functions can be used, but the specification of such functions is in general complicated. In this paper, we introduce a flexible approach for the construction and selection of highly flexible nonlinear parametric regression models. Nonlinear features are generated hierarchically, similarly to deep learning, but have additional flexibility on the possible types of features to be considered. This flexibility, combined with variable selection, allows us to find a small set of important features and thereby more interpretable models. Within the space of possible functions, a Bayesian approach, introducing priors for functions based on their complexity, is considered. A genetically modified mode jumping Markov chain Monte Carlo algorithm is adopted to perform Bayesian inference and estimate posterior probabilities for model averaging. In various applications, we illustrate how our approach is used to obtain meaningful nonlinear models. Additionally, we compare its predictive performance with several machine learning algorithms.

algorithm, bgnlm, transformation, (17 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.13047

AI Access Foundation

13047

Journal of Artificial Intelligence Research

Country:

Europe > Norway > Eastern Norway > Oslo (0.04)
Europe > Austria > Vienna (0.04)
Oceania > Australia > Tasmania (0.04)
(5 more...)

Genre: Research Report (0.68)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
(3 more...)

Add feedback

Decorrelated Variable Importance

Verdinelli, Isabella, Wasserman, Larry

arXiv.org Machine LearningNov-21-2021

Because of the widespread use of black box prediction methods such as random forests and neural nets, there is renewed interest in developing methods for quantifying variable importance as part of the broader goal of interpretable prediction. A popular approach is to define a variable importance parameter - known as LOCO (Leave Out COvariates) - based on dropping covariates from a regression model. This is essentially a nonparametric version of R-squared. This parameter is very general and can be estimated nonparametrically, but it can be hard to interpret because it is affected by correlation between covariates. We propose a method for mitigating the effect of correlation by defining a modified version of LOCO. This new parameter is difficult to estimate nonparametrically, but we show how to estimate it using semiparametric models.

artificial intelligence, influence function, machine learning, (17 more...)

arXiv.org Machine Learning

2111.10853

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback

A Data-Driven Line Search Rule for Support Recovery in High-dimensional Data Analysis

Li, Peili, Jiao, Yuling, Lu, Xiliang, Kang, Lican

arXiv.org Machine LearningNov-21-2021

In this work, we consider the algorithm to the (nonlinear) regression problems with $\ell_0$ penalty. The existing algorithms for $\ell_0$ based optimization problem are often carried out with a fixed step size, and the selection of an appropriate step size depends on the restricted strong convexity and smoothness for the loss function, hence it is difficult to compute in practical calculation. In sprite of the ideas of support detection and root finding \cite{HJK2020}, we proposes a novel and efficient data-driven line search rule to adaptively determine the appropriate step size. We prove the $\ell_2$ error bound to the proposed algorithm without much restrictions for the cost functional. A large number of numerical comparisons with state-of-the-art algorithms in linear and logistic regression problems show the stability, effectiveness and superiority of the proposed algorithms.

algorithm, regression problem, sdarl algorithm, (12 more...)

arXiv.org Machine Learning

2111.10806

Country:

Asia > China > Hubei Province > Wuhan (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report > New Finding (0.50)
Research Report > Experimental Study (0.50)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback