Regression
Coded Computing for Federated Learning at the Edge
Prakash, Saurav, Dhakal, Sagar, Akdeniz, Mustafa, Avestimehr, A. Salman, Himayat, Nageen
Federated Learning (FL) is an exciting new paradigm that enables training a global model from data generated locally at the client nodes, without moving client data to a centralized server. Performance of FL in a multi-access edge computing (MEC) network suffers from slow convergence due to heterogeneity and stochastic fluctuations in compute power and communication link qualities across clients. A recent work, Coded Federated Learning (CFL), proposes to mitigate stragglers and speed up training for linear regression tasks by assigning redundant computations at the MEC server. Coding redundancy in CFL is computed by exploiting statistical properties of compute and communication delays. We develop CodedFedL that addresses the difficult task of extending CFL to distributed non-linear regression and classification problems with multioutput labels. The key innovation of our work is to exploit distributed kernel embedding using random Fourier features that transforms the training task into distributed linear regression. We provide an analytical solution for load allocation, and demonstrate significant performance gains for CodedFedL through experiments over benchmark datasets using practical network parameters.
Recovery of Sparse Signals from a Mixture of Linear Samples
Mazumdar, Arya, Pal, Soumyabrata
Mixture of linear regressions is a popular learning theoretic model that is used widely to represent heterogeneous data. In the simplest form, this model assumes that the labels are generated from either of two different linear models and mixed together. Recent works of Yin et al. and Krishnamurthy et al., 2019, focus on an experimental design setting of model recovery for this problem. It is assumed that the features can be designed and queried with to obtain their label. When queried, an oracle randomly selects one of the two different sparse linear models and generates a label accordingly. How many such oracle queries are needed to recover both of the models simultaneously? This question can also be thought of as a generalization of the well-known compressed sensing problem (Cand\`es and Tao, 2005, Donoho, 2006). In this work, we address this query complexity problem and provide efficient algorithms that improves on the previously best known results.
Logistic Regression for Classification Task
With pattern recognition approach, there are many methods to implement a handwriting digits recognition task. In my previous stories, I have introduced the Linear Discriminant Analysis base on the Gaussian model maximum likelihood estimation. In this post, I apply the Logistic Regression model on the English numeral handwriting digits recognition task. In the logistic regression model, an occurrence probability of an event is represented by a logistic function. For example, in a two-class problem, the logistic sigmoid function is commonly used.
Statistical Analysis of Simple Linear Regression
Regression is unarguably one of the most used models in data science and statistics. It is prevalent in almost every field in industry and academia. I will go through in this blog the statistical concepts involved in Simple Linear Regression i.e. regression involving only one predictor variable. The readers are assumed to have some basic knowledge of probability theory and statistics, although I have given references to the concepts. Now, we need to estimate the parameters (Beta_0, Beta_1) of model and also the value of sigma squared, which is the variance of error term.
How to Choose a Machine Learning Technique
Why are there so many machine learning techniques? The thing is that different algorithms solve various problems. The results that you get directly depend on the model you choose. That is why it is so important to know how to match a machine learning algorithm to a particular problem. In this post, we are going to talk about just that. First of all, to choose an algorithm for your project, you need to know about what kinds of them exist.
A unified survey on treatment effect heterogeneity modeling and uplift modeling
Zhang, Weijia, Li, Jiuyong, Liu, Lin
A central question in many fields of scientific research is to determine how an outcome would be affected by an action, or to measure the effect of an action (a.k.a treatment effect). In recent years, a need for estimating the heterogeneous treatment effects conditioning on the different characteristics of individuals has emerged from research fields such as personalized healthcare, social science, and online marketing. To meet the need, researchers and practitioners from different communities have developed algorithms by taking the treatment effect heterogeneity modeling approach and the uplift modeling approach, respectively. In this paper, we provide a unified survey of these two seemingly disconnected yet closely related approaches under the potential outcome framework. We then provide a structured survey of existing methods by emphasizing on their inherent connections with a set of unified notations to make comparisons of the different methods easy. We then review the main applications of the surveyed methods in personalized marketing, personalized medicine, and social studies. Finally, we summarize the existing software packages and present discussions based on the use of methods on synthetic, semi-synthetic and real world data sets and provide some general guidelines for choosing methods.
Writing your first hello world in machine learning with Python
Machine Learning is the process of taking data as input, identifying the trends and patterns in data and giving a program as output. This program also or model is the representation of the patterns that define the data. New data can be given as input to this model/program and it will be able to classify it or make predictions on it. Python has been found to be a simple and easy to learn language that works very well with requirements of machine learning. Python has been designed to favor data analysis.
9 Key Machine Learning Algorithms Explained in Plain English
Machine learning [https://gum.co/pGjwd] is changing the world. Google uses machine learning to suggest search results to users. Netflix uses it to recommend movies for you to watch. Facebook uses machine learning to suggest people you may know. Machine learning has never been more important. At the same time, understanding machine learning is hard. The field is full of jargon. And the number of different ML algorithms grows each year. This article will introduce you to the fundamental concepts
Predicting and Mapping of Soil Organic Carbon Using Machine Learning Algorithms in Northern Iran
Emadi, Mostafa, Taghizadeh-Mehrjardi, Ruhollah, Cherati, Ali, Danesh, Majid, Mosavi, Amir, Scholten, Thomas
Estimation of the soil organic carbon content is of utmost importance in understanding the chemical, physical, and biological functions of the soil. This study proposes machine learning algorithms of support vector machines, artificial neural networks, regression tree, random forest, extreme gradient boosting, and conventional deep neural network for advancing prediction models of SOC. Models are trained with 1879 composite surface soil samples, and 105 auxiliary data as predictors. The genetic algorithm is used as a feature selection approach to identify effective variables. The results indicate that precipitation is the most important predictor driving 15 percent of SOC spatial variability followed by the normalized difference vegetation index, day temperature index of moderate resolution imaging spectroradiometer, multiresolution valley bottom flatness and land use, respectively. Based on 10 fold cross validation, the DNN model reported as a superior algorithm with the lowest prediction error and uncertainty. In terms of accuracy, DNN yielded a mean absolute error of 59 percent, a root mean squared error of 75 percent, a coefficient of determination of 0.65, and Lins concordance correlation coefficient of 0.83. The SOC content was the highest in udic soil moisture regime class with mean values of 4 percent, followed by the aquic and xeric classes, respectively. Soils in dense forestlands had the highest SOC contents, whereas soils of younger geological age and alluvial fans had lower SOC. The proposed DNN is a promising algorithm for handling large numbers of auxiliary data at a province scale, and due to its flexible structure and the ability to extract more information from the auxiliary data surrounding the sampled observations, it had high accuracy for the prediction of the SOC baseline map and minimal uncertainty.
Inside: Logistic Regression
This is a part of a series of blogs where I'll be demonstrating different aspects and the theory of Machine Learning Algorithms by using math and code. This includes the usual modeling structure of the algorithm and the intuition on why and how it works, using Python code. Logistic Regression is one of the first algorithms that is introduced when someone learns about classification. You probably would have read about Regression and the continuous nature of the predictor variable. Classification is done on discrete variables, which means your predictions are finite and class-based like a Yes/No, True/False for binary outcomes.