AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.90)

Dantas, Cassio, Soubies, Emmanuel, Févotte, Cédric

Accelerating Non-Negative and Bounded-Variable Linear Regression Algorithms with Safe Screening

arXiv.org Machine Learning Feb-15-2022

Non-negative and bounded-variable linear regression problems arise in a variety of applications in machine learning and signal processing. In this paper, we propose a technique to accelerate existing solvers for these problems by identifying saturated coordinates in the course of iterations. This is akin to safe screening techniques previously proposed for sparsity-regularized regression problems. The proposed strategy is provably safe as it provides theoretical guarantees that the identified coordinates are indeed saturated in the optimal solution. Experimental results on synthetic and real data show compelling accelerations for both non-negative and bounded-variable problems.
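To make the notion of a saturated coordinate concrete, here is a minimal sketch. It is not the screening method from the paper, only a plain projected-gradient solver for non-negative least squares in pure Python. The function names `nnls_projected_gradient` and `saturated_coordinates` and the toy data are illustrative assumptions. A coordinate is saturated when it sits on its bound (here, zero) at the solution; these are the coordinates the abstract says are identified during the iterations.

```python
def nnls_projected_gradient(A, y, iterations=500):
    """Minimize 0.5 * ||A x - y||^2 subject to x >= 0 (illustrative solver)."""
    m, n = len(A), len(A[0])
    # The squared Frobenius norm upper-bounds the Lipschitz constant of the
    # gradient, so its inverse is a conservative but valid step size.
    step = 1.0 / sum(a * a for row in A for a in row)
    x = [0.0] * n
    for _ in range(iterations):
        residual = [sum(A[i][j] * x[j] for j in range(n)) - y[i] for i in range(m)]
        gradient = [sum(A[i][j] * residual[i] for i in range(m)) for j in range(n)]
        # Gradient step followed by projection onto the non-negative orthant.
        x = [max(0.0, x[j] - step * gradient[j]) for j in range(n)]
    return x


def saturated_coordinates(x, tol=0.0):
    """Indices of coordinates sitting on the lower bound x_j = 0."""
    return [j for j, value in enumerate(x) if value <= tol]


# Toy problem (assumed data): the second target is negative, so the
# non-negativity constraint pins the second coordinate at zero.
A = [[1.0, 0.0], [0.0, 1.0]]
y = [1.0, -1.0]
solution = nnls_projected_gradient(A, y)
print(solution, saturated_coordinates(solution))
```

In this toy case the second coordinate never leaves zero, so a solver that knew this in advance could restrict its work to the first coordinate.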

accelerating non-negative, bounded-variable linear regression algorithm, safe screening

2202.07258

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

#artificialintelligence Feb-14-2022, 23:10:09 GMT

Glossary of Machine Learning Terminology: A Beginner's Guide

Machine learning algorithms, models, and strategies are helping us unlock a wide range of applications. These computer systems are capable of self-learning and making business decisions, as well as assisting research and improving technology. As machine learning finds new applications across various sectors, the demand for professionals in the field is growing. According to the US Bureau of Labor Statistics, employment of computer and information research scientists is projected to grow 22 percent by 2030. Whichever area of machine learning interests you most, you must first familiarize yourself with machine learning terminology.

algorithm, application, learning, (14 more...)

Country: North America > United States (0.55)

Genre: Research Report (0.33)

Industry: Government > Regional Government > North America Government > United States Government (0.55)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)

#artificialintelligence Feb-13-2022, 03:30:05 GMT

How to Learn Math for Machine Learning - KDnuggets

One of the most common questions that data science aspirants have is "how much math do I need to know for machine learning?" Students looking to break into machine learning often see math as a huge barrier to entry. Gatekeepers in the industry don't help with this concern, labelling students as unqualified unless they have a Master's degree or PhD in the subject. So how much math do you need to know in order to work in the data science industry? The answer: Not as much as you think.

linear algebra, machine learning, math, (10 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting (0.93)
Education > Curriculum > Subject-Specific Education (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.30)

#artificialintelligence Feb-12-2022, 16:05:48 GMT

Logistic Regression using Python and Excel - Analytics Vidhya

This article was published as a part of the Data Science Blogathon. Will tomorrow be a sunny day? What are the chances that a student will get into that dream university? These and many more real-world "decision" scenarios need a standard mechanism. Enter Logistic Regression, which may be stated very simply as an estimation of the probability of an event occurring. In the next few minutes, we shall understand Logistic Regression from A-to-Z.
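As a minimal sketch of what "estimating the probability of an event" means here, the snippet below applies the logistic (sigmoid) function to a weighted sum of features. The names `sigmoid` and `predict_probability`, along with the weights, bias, and feature value, are made-up illustrations and are not taken from the article.

```python
import math


def sigmoid(z):
    """Map a real-valued score to a value between 0 and 1."""
    # Branch on the sign so math.exp never receives a large positive argument.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_probability(features, weights, bias):
    """Logistic regression prediction: sigmoid of a linear score."""
    score = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(score)


# Hypothetical one-feature model: a higher feature value raises the
# predicted probability of the event.
p = predict_probability([3.5], weights=[1.2], bias=-4.0)
print(f"estimated probability: {p:.3f}")
```

A score of zero corresponds to a probability of exactly 0.5, which is why 0.5 is the usual default threshold when the probability is turned into a yes/no decision.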

dataset, logistic regression, regression coefficient, (11 more...)

Genre:

Research Report > New Finding (0.90)
Research Report > Experimental Study (0.90)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Xu, Jing, Teng, Jiaye, Yao, Andrew Chi-Chih

Relaxing the Feature Covariance Assumption: Time-Variant Bounds for Benign Overfitting in Linear Regression

arXiv.org Machine Learning Feb-12-2022

Benign overfitting demonstrates that overparameterized models can perform well on test data while fitting noisy training data. However, it only considers the final min-norm solution in linear regression, which ignores the algorithm information and the corresponding training procedure. In this paper, we generalize the idea of benign overfitting to the whole training trajectory instead of the min-norm solution and derive a time-variant bound based on the trajectory analysis. Starting from the time-variant bound, we further derive a time interval that suffices to guarantee a consistent generalization error for a given feature covariance. Unlike existing approaches, the newly proposed generalization bound is characterized by a time-variant effective dimension of feature covariance. By introducing the time factor, we relax the strict assumption on the feature covariance matrix required in previous benign overfitting analyses in the regime of overparameterized linear regression with gradient descent. This paper extends the scope of benign overfitting, and experimental results indicate that the proposed bound accords better with empirical evidence.
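The relationship between the training trajectory and the min-norm solution can be illustrated with a small sketch. It relies on a standard fact rather than on the paper's bound: for least squares with more features than samples, gradient descent started from zero converges to the min-norm interpolating solution. The function name `gradient_descent_trajectory`, the step size, and the toy data are assumptions chosen for illustration.

```python
def gradient_descent_trajectory(X, y, step, iterations):
    """Run gradient descent on 0.5 * ||X w - y||^2 from w = 0, keeping every iterate."""
    n_samples, n_features = len(X), len(X[0])
    w = [0.0] * n_features
    trajectory = [list(w)]
    for _ in range(iterations):
        residual = [
            sum(X[i][j] * w[j] for j in range(n_features)) - y[i]
            for i in range(n_samples)
        ]
        gradient = [
            sum(X[i][j] * residual[i] for i in range(n_samples))
            for j in range(n_features)
        ]
        w = [w[j] - step * gradient[j] for j in range(n_features)]
        trajectory.append(list(w))
    return trajectory


# One sample, two features: infinitely many weight vectors fit the data
# exactly (any w with w[0] + w[1] == 2). The min-norm one is [1, 1].
X = [[1.0, 1.0]]
y = [2.0]
trajectory = gradient_descent_trajectory(X, y, step=0.25, iterations=100)
print("after 2 steps:", trajectory[2])
print("after 100 steps:", trajectory[-1])
```

Each entry of `trajectory` is the model at a different training time, which is the kind of object a time-variant analysis studies; the last entry approximates the min-norm solution that earlier analyses considered on its own.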

excess risk, gradient descent, min-norm solution, (13 more...)

2202.06054

Country:

North America > United States > California > Los Angeles County > Long Beach (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(13 more...)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.82)

#artificialintelligence Feb-11-2022, 05:05:21 GMT

Flux.jl-A simplified way to build custom ML models with ease 🤖

Congratulations! You have just created a linear regression model using Julia's Flux.jl library. Hopefully this blog has helped you understand the basics of the Flux.jl library so that you can create your own personalized ML models from scratch. I am planning a series of which this blog is part one. The other parts could include deploying this regression model using streamlit/flask and developing deep learning models using Flux.jl.

build custom ml model, flux, jl library, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

arXiv.org Machine Learning Feb-11-2022

Bernstein Flows for Flexible Posteriors in Variational Bayes

Dürr, Oliver, Hörling, Stephan, Dold, Daniel, Kovylov, Ivonne, Sick, Beate

Variational inference (VI) is a technique to approximate difficult-to-compute posteriors by optimization. In contrast to MCMC, VI scales to many observations. In the case of complex posteriors, however, state-of-the-art VI approaches often yield unsatisfactory posterior approximations. This paper presents Bernstein flow variational inference (BF-VI), a robust and easy-to-use method, flexible enough to approximate complex multivariate posteriors. BF-VI combines ideas from normalizing flows and Bernstein polynomial-based transformation models. In benchmark experiments, we compare BF-VI solutions with exact posteriors, MCMC solutions, and state-of-the-art VI methods including normalizing-flow-based VI. We show for low-dimensional models that BF-VI accurately approximates the true posterior; in higher-dimensional models, BF-VI outperforms other VI methods. Further, we develop with BF-VI a Bayesian model for the semi-structured Melanoma challenge data, combining a CNN model part for image data with an interpretable model part for tabular data, and demonstrate for the first time the use of VI in semi-structured models.

artificial intelligence, machine learning, posterior, (20 more...)

2202.0565

Country:

Europe (0.28)
Asia (0.14)

Genre: Research Report (1.00)

Industry:

Energy > Oil & Gas (1.00)
Health & Medicine > Therapeutic Area > Dermatology (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)

Kato, Masahiro, Imaizumi, Masaaki

Benign-Overfitting in Conditional Average Treatment Effect Prediction with Linear Regression

arXiv.org Machine Learning Feb-11-2022

We study the benign overfitting theory in the prediction of the conditional average treatment effect (CATE) with linear regression models. With the development of machine learning for causal inference, a wide range of large-scale models for causality are gaining attention. One problem is that suspicions have been raised that large-scale models are prone to overfitting to observations with sample selection, and hence may not be suitable for causal prediction. In this study, to resolve this suspicion, we investigate the validity of causal inference methods for overparameterized models by applying the recent theory of benign overfitting (Bartlett et al., 2020). Specifically, we consider samples whose distribution switches depending on an assignment rule, and study the prediction of CATE with linear models whose dimension diverges to infinity. We focus on two methods: the T-learner, which is based on a difference between estimators constructed separately for each treatment group, and the inverse probability weight (IPW)-learner, which solves another regression problem approximated by a propensity score. In both methods, the estimator consists of interpolators that fit the samples perfectly. As a result, we show that the T-learner fails to achieve consistency except under random assignment, while the IPW-learner drives the risk to zero if the propensity score is known. This difference stems from the fact that the T-learner is unable to preserve the eigenspaces of the covariances, which is necessary for benign overfitting in the overparameterized setting. Our result provides new insights into the usage of causal inference methods in the overparameterized setting, in particular, doubly robust estimators.

conditional average treatment effect prediction, linear regression, probability, (10 more...)

2202.05245

Country:

North America > United States (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Oceania > Australia (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (1.00)
Research Report > Strength High (0.93)
Research Report > New Finding (0.86)

Industry: Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

arXiv.org Machine Learning Feb-11-2022

Fast and Robust Sparsity Learning over Networks: A Decentralized Surrogate Median Regression Approach

Liu, Weidong, Mao, Xiaojun, Zhang, Xin

Decentralized sparsity learning has attracted a significant amount of attention recently due to its rapidly growing applications. To obtain robust and sparse estimators, a natural idea is to adopt the non-smooth median loss combined with an $\ell_1$ sparsity regularizer. However, most of the existing methods suffer from slow convergence caused by the doubly non-smooth objective. To accelerate the computation, in this paper, we propose a decentralized surrogate median regression (deSMR) method for efficiently solving the decentralized sparsity learning problem. We show that our proposed algorithm enjoys a linear convergence rate with a simple implementation. We also investigate the statistical guarantee, showing that our proposed estimator achieves a near-oracle convergence rate without any restriction on the number of network nodes. Moreover, we establish theoretical results for sparse support recovery. Thorough numerical experiments and a real-data study are provided to demonstrate the effectiveness of our method.

algorithm, convergence rate, estimator, (16 more...)

2202.05498

Country:

Asia > China > Shanghai > Shanghai (0.04)
North America > United States > Iowa > Story County > Ames (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Education (0.34)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)