AITopics | Regression


Regression


Logistic Regression

#artificialintelligence May-28-2021, 01:45:40 GMT

In this article, we will learn how to implement logistic regression in Python. You may be wondering what logistic regression is and what theory lies behind it. Which Python packages are involved in implementing it? You probably have many more questions, and I will try to answer as many of them as possible. You have chosen the right article.
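As a rough illustration of what such an implementation can look like, here is a minimal sketch of logistic regression trained by batch gradient descent with NumPy. It is not the code from the linked article: the function names, the learning rate, the iteration count, and the toy data are all assumptions chosen for illustration.

```python
import numpy as np

def sigmoid(z):
    """Map real-valued scores to probabilities in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def fit_logistic_regression(X, y, learning_rate=0.1, n_iterations=5000):
    """Fit weights and bias by batch gradient descent on the mean log-loss.

    learning_rate and n_iterations are illustrative defaults, not tuned values.
    """
    n_samples, n_features = X.shape
    weights = np.zeros(n_features)
    bias = 0.0
    for _ in range(n_iterations):
        probabilities = sigmoid(X @ weights + bias)
        error = probabilities - y  # gradient of the log-loss w.r.t. the scores
        weights -= learning_rate * (X.T @ error) / n_samples
        bias -= learning_rate * error.mean()
    return weights, bias

def predict(X, weights, bias, threshold=0.5):
    """Return class 1 where the predicted probability reaches the threshold."""
    return (sigmoid(X @ weights + bias) >= threshold).astype(int)

# Hypothetical toy data: a single feature, with class 1 for the larger values.
X_toy = np.array([[0.0], [1.0], [2.0], [3.0], [4.0], [5.0]])
y_toy = np.array([0, 0, 0, 1, 1, 1])

weights, bias = fit_logistic_regression(X_toy, y_toy)
predictions = predict(X_toy, weights, bias)
print(predictions)
```

In practice a library implementation such as scikit-learn's `LogisticRegression` would usually be preferred; the from-scratch version is only meant to show where the sigmoid function enters.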

algorithm, dataset, sigmoid function, (10 more...)

#artificialintelligence

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)


Stochastic Intervention for Causal Inference via Reinforcement Learning

Duong, Tri Dung, Li, Qian, Xu, Guandong

arXiv.org Artificial Intelligence May-27-2021

Causal inference methods are widely applied in various decision-making domains such as precision medicine, optimal policy and economics. Central to causal inference is the treatment effect estimation of intervention strategies, such as changes in drug dosing and increases in financial aid. Existing methods are mostly restricted to deterministic treatments and compare outcomes under different treatments. However, they are unable to address the substantial recent interest in treatment effect estimation under stochastic treatment, e.g., "how would all units' health status change if they adopted a 50% dose reduction". In other words, they lack the capability of providing fine-grained treatment effect estimation to support sound decision-making. In our study, we advance causal inference research by proposing a new effective framework to estimate the treatment effect of stochastic interventions. In particular, we develop a stochastic intervention effect estimator (SIE) based on a nonparametric influence function, with theoretical guarantees of robustness and fast convergence rates. Additionally, we construct a customised reinforcement learning algorithm based on the random search solver which can effectively find the optimal policy to produce the greatest expected outcomes for the decision-making process. Finally, we conduct an empirical study to show that our framework can achieve significant performance gains in comparison with state-of-the-art baselines.

intervention, propensity score, treatment effect, (13 more...)

arXiv.org Artificial Intelligence

2105.13514

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.88)

Industry: Health & Medicine > Consumer Health (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)


Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

Bu, Zhiqi, Klusowski, Jason, Rush, Cynthia, Su, Weijie J.

arXiv.org Machine Learning May-27-2021

Sorted l1 regularization has been incorporated into many methods for solving high-dimensional statistical estimation problems, including the SLOPE estimator in linear regression. In this paper, we study how this relatively new regularization technique improves variable selection by characterizing the optimal SLOPE trade-off between the false discovery proportion (FDP) and true positive proportion (TPP) or, equivalently, between measures of type I error and power. Assuming a regime of linear sparsity and working under Gaussian random designs, we obtain an upper bound on the optimal trade-off for SLOPE, showing its capability of breaking the Donoho-Tanner power limit. To put it into perspective, this limit is the highest possible power that the Lasso, which is perhaps the most popular l1-based method, can achieve even with arbitrarily strong effect sizes. Next, we derive a tight lower bound that delineates the fundamental limit of sorted l1 regularization in optimally trading the FDP off for the TPP. Finally, we show that on any problem instance, SLOPE with a certain regularization sequence outperforms the Lasso, in the sense of having a smaller FDP, larger TPP and smaller l2 estimation risk simultaneously. Our proofs are based on a novel technique that reduces a variational calculus problem to a class of infinite-dimensional convex optimization problems and a very recent result from approximate message passing theory.
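For readers unfamiliar with the quantities named in this abstract, the following sketch shows how the sorted l1 penalty and the FDP/TPP pair are commonly computed. It follows the standard definitions (the penalty pairs the largest weight with the largest absolute coefficient; FDP and TPP are ratios of false and true discoveries), not code from the paper. The function names and the example vectors are assumptions for illustration.

```python
import numpy as np

def sorted_l1_penalty(coefficients, lambdas):
    """Sorted l1 (SLOPE) penalty: sum_i lambda_i * |b|_(i).

    |b|_(1) >= |b|_(2) >= ... are the absolute coefficients in decreasing
    order, and lambdas must be non-negative and non-increasing.
    """
    magnitudes = np.sort(np.abs(np.asarray(coefficients, dtype=float)))[::-1]
    lambdas = np.asarray(lambdas, dtype=float)
    if lambdas.shape != magnitudes.shape:
        raise ValueError("lambdas and coefficients must have the same length")
    if np.any(lambdas < 0) or np.any(np.diff(lambdas) > 0):
        raise ValueError("lambdas must be non-negative and non-increasing")
    return float(magnitudes @ lambdas)

def fdp_and_tpp(estimated, truth):
    """False discovery proportion and true positive proportion of a fit."""
    selected = np.asarray(estimated) != 0
    signal = np.asarray(truth) != 0
    false_discoveries = np.sum(selected & ~signal)
    true_discoveries = np.sum(selected & signal)
    fdp = false_discoveries / max(selected.sum(), 1)
    tpp = true_discoveries / max(signal.sum(), 1)
    return float(fdp), float(tpp)

# Hypothetical coefficient vector and penalty weights.
beta = np.array([0.5, -2.0, 0.0, 1.0])
print(sorted_l1_penalty(beta, [3.0, 2.0, 1.0, 0.5]))  # 2*3 + 1*2 + 0.5*1 + 0*0.5 = 8.5
print(sorted_l1_penalty(beta, [1.0, 1.0, 1.0, 1.0]))  # constant weights give the l1 norm, 3.5
```

With a constant weight sequence the penalty reduces to the ordinary l1 norm, which is why the Lasso can be viewed as a special case of SLOPE.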

eff, lasso, penalty, (15 more...)

arXiv.org Machine Learning

2105.13302

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)


Reaction GIFs Offer A New Key To Emotion Recognition In NLP

#artificialintelligence May-26-2021, 09:05:08 GMT

New research out of China is offering a novel method for Natural Language Processing (NLP) to perform sentiment analysis on social media forums and language research datasets by categorizing and labeling animated GIFs that are posted in response to text announcements. The researchers, led by Boaz Shmueli of National Tsing Hua University in Taiwan, have used Twitter's in-built database of reaction GIFs as an index to quantify the affective state of a user's response, obviating the need to negotiate multiple language responses, the challenge of detecting sarcasm, or of identifying core emotional temperature from ambiguous or excessively brief responses. Clicking the 'GIF' button when composing a Twitter post offers a standard set of labeled animated GIFs that are potentially easier for NLP to parse into 'identified' emotions than plain-text language. The paper characterizes the use of reaction GIFs in this way as 'a new type of label, not yet available in NLP emotion datasets', and notes that existing datasets either use the dimensional model of emotion or the discrete emotions model, neither of which offers this kind of insight. An animated GIF response to a user post.

category, dataset, gif, (12 more...)

#artificialintelligence

Country:

Asia > Taiwan (0.25)
Asia > China (0.25)

Genre: Research Report (0.71)

Industry: Information Technology > Services (0.39)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.31)


Networked Federated Multi-Task Learning

SarcheshmehPour, Yasmin, Tian, Yu, Zhang, Linli, Jung, Alexander

arXiv.org Machine Learning May-26-2021

Many important application domains generate distributed collections of heterogeneous local datasets. These local datasets are often related via an intrinsic network structure that arises from domain-specific notions of similarity between local datasets. Different notions of similarity are induced by spatiotemporal proximity, statistical dependencies, or functional relations. We use this network structure to adaptively pool similar local datasets into nearly homogeneous training sets for learning tailored models. Our main conceptual contribution is to formulate networked federated learning using the concept of generalized total variation (GTV) minimization as a regularizer. This formulation is highly flexible and can be combined with almost any parametric model including Lasso or deep neural networks. We unify and considerably extend some well-known approaches to federated multi-task learning. Our main algorithmic contribution is a novel federated learning algorithm that is well suited for distributed computing environments such as edge computing over wireless networks. This algorithm is robust against model misspecification and numerical errors arising from limited computational resources including processing time or wireless channel bandwidth. As our main technical contribution, we offer precise conditions on the local models as well as on their network structure such that our algorithm learns nearly optimal local models. Our analysis reveals an interesting interplay between the (information-) geometry of local models and the (cluster-) geometry of their network.
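To make the idea of a total-variation regularizer over a network concrete, here is a small sketch. It assumes that the GTV of a set of local model parameters is a weighted sum, over the edges of the network, of a penalty applied to the difference between neighbouring parameter vectors. That form, the function name, the choice of the Euclidean norm as the penalty, and the toy graph are assumptions for illustration and are not taken from the paper's code.

```python
import numpy as np

def generalized_total_variation(local_weights, edges, penalty=np.linalg.norm):
    """Sum of edge_weight * penalty(w_i - w_j) over all edges (i, j, edge_weight).

    local_weights: array of shape (n_nodes, n_parameters), one row per local model.
    edges: iterable of (i, j, edge_weight) tuples describing the network.
    penalty: function applied to each parameter difference (assumed: Euclidean norm).
    """
    return float(sum(
        edge_weight * penalty(local_weights[i] - local_weights[j])
        for i, j, edge_weight in edges
    ))

# Hypothetical network: nodes 0 and 1 hold identical models, node 2 differs.
local_weights = np.array([[1.0, 0.0],
                          [1.0, 0.0],
                          [4.0, 4.0]])
edges = [(0, 1, 1.0), (1, 2, 0.5)]

gtv = generalized_total_variation(local_weights, edges)
print(gtv)  # 1.0 * 0 + 0.5 * ||(-3, -4)|| = 2.5
```

A small value means that strongly connected nodes hold similar models, which is the sense in which such a regularizer pools similar local datasets.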

algorithm 1, local dataset, network structure, (13 more...)

arXiv.org Machine Learning

2105.12769

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)


Stochastic Intervention for Causal Effect Estimation

Duong, Tri Dung, Li, Qian, Xu, Guandong

arXiv.org Artificial Intelligence May-26-2021

Causal inference methods are widely applied in various decision-making domains such as precision medicine, optimal policy and economics. Central to these applications is the treatment effect estimation of intervention strategies. Current estimation methods are mostly restricted to deterministic treatments and are therefore unable to address stochastic treatment policies. Moreover, previous methods can only make binary yes-or-no decisions based on the treatment effect, lacking the capability to provide the fine-grained effect estimates needed to explain the decision-making process. In our study, we therefore advance causal inference research to estimate the stochastic intervention effect by devising a new stochastic propensity score and a stochastic intervention effect estimator (SIE). Meanwhile, we design a customized genetic algorithm specific to stochastic intervention effect (Ge-SIO) with the aim of providing causal evidence for decision making. We provide a theoretical analysis and conduct an empirical study to show that our proposed measures and algorithms can achieve a significant performance lift in comparison with state-of-the-art baselines.

intervention, propensity score, treatment effect, (14 more...)

arXiv.org Artificial Intelligence

2105.12898

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia (0.04)

Genre:

Research Report > Experimental Study (0.94)
Research Report > New Finding (0.66)

Industry:

Health & Medicine (1.00)
Marketing (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)


Logistic Regression - Beginners Guide in Python - Analytics India Magazine

#artificialintelligence May-25-2021, 17:25:15 GMT

Most supervised learning problems in machine learning are classification problems. Classification is the task of assigning a suitable class to a data point. Consider a pet classification problem. If we input certain features, the machine learning model will tell us whether the given features belong to a cat or a dog. Cat and dog are the two classes here.

binary classification problem, classification problem, probability distribution, (10 more...)

#artificialintelligence

Country: Asia > India (0.40)

Genre:

Research Report > Experimental Study (0.62)
Research Report > New Finding (0.51)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.96)


15 Best YouTube Channels to Learn Data Science in 2021

#artificialintelligence May-25-2021, 13:53:10 GMT

YouTube is a great platform for learners and has some of the best channels for learning data science. That's why I thought I would share with you the 15 Best YouTube Channels to Learn Data Science. So if you are planning to learn data science, these YouTube channels will help you understand the fundamentals. Now, without further ado, let's start finding the best YouTube channels to learn data science. Math is essential in data science and machine learning for understanding how machine learning algorithms work. So if you want to learn math concepts, you should check out this YouTube channel.

data science, learning, youtube channel, (12 more...)

#artificialintelligence

Country: North America > United States > California (0.05)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Online (0.91)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.33)


Group selection and shrinkage with application to sparse semiparametric modeling

Thompson, Ryan, Vahid, Farshid

arXiv.org Machine Learning May-25-2021

Sparse regression and classification estimators capable of group selection have application to an assortment of statistical problems, from multitask learning to sparse additive modeling to hierarchical selection. This work introduces a class of group-sparse estimators that combine group subset selection with group lasso or ridge shrinkage. We develop an optimization framework for fitting the nonconvex regularization surface and present finite-sample error bounds for estimation of the regression function. Our methods and analyses accommodate the general setting where groups overlap. As an application of group selection, we study sparse semiparametric modeling, a procedure that allows the effect of each predictor to be zero, linear, or nonlinear. For this task, the new estimators improve across several metrics on synthetic data compared to alternatives. Finally, we demonstrate their efficacy in modeling supermarket foot traffic and economic recessions using many predictors. All of our proposals are made available in the scalable implementation grpsel.

estimator, selection, subset, (15 more...)

arXiv.org Machine Learning

2105.12081

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Industry: Banking & Finance > Economy (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)


Sometimes more data can hurt!

#artificialintelligence May-24-2021, 10:20:11 GMT

In a recent blog post I discussed a scalable sparse linear regression model I've developed at work. One of its interesting properties is that it's an interpolating model, meaning it has zero training error. This is because it's over-parameterized and thus can fit the training data perfectly. While zero training error is usually associated with overfitting, the model seems to perform pretty well on the test set. Reports of hugely over-parameterized models that seem not to suffer from overfitting (especially in deep learning) have been accumulating in recent years, and so has the literature on the subject.
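The interpolation property is easy to reproduce on a toy problem. The sketch below is not the author's sparse model: it swaps in the minimum-norm least-squares solution, computed with NumPy's pseudo-inverse, as a stand-in for an over-parameterized linear regression. The problem sizes, the random seed, and the variable names are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical over-parameterized setting: far more features than samples.
n_samples, n_features = 20, 100
X_train = rng.normal(size=(n_samples, n_features))
y_train = rng.normal(size=n_samples)  # even pure-noise targets can be fit exactly

# Minimum-l2-norm solution of X_train @ coefficients = y_train.
coefficients = np.linalg.pinv(X_train) @ y_train

# Mean squared error on the training set; zero up to floating-point rounding.
train_error = np.mean((X_train @ coefficients - y_train) ** 2)
print(train_error)
```

Because the design matrix has more columns than rows and full row rank, an exact fit always exists; whether such a fit generalizes is the question the post goes on to discuss.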

0-training error, inverse, nakkiran

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.55)
