AITopics | Regression

Collaborating Authors

Regression

News Overviews Instructional Materials AI-Alerts Classics

A Framework for Democratizing AI

Ahmed, Shakkeel, Mula, Ravi S., Dhavala, Soma S.

arXiv.org Artificial IntelligenceJan-1-2020

Machine Learning and Artificial Intelligence are considered an integral part of the Fourth Industrial Revolution. Their impact, and far-reaching consequences, while acknowledged, are yet to be comprehended. These technologies are very specialized, and few organizations and select highly trained professionals have the wherewithal, in terms of money, manpower, and might, to chart the future. However, concentration of power can lead to marginalization, causing severe inequalities. Regulatory agencies and governments across the globe are creating national policies, and laws around these technologies to protect the rights of the digital citizens, as well as to empower them. Even private, not-for-profit organizations are also contributing to democratizing the technologies by making them \emph{accessible} and \emph{affordable}. However, accessibility and affordability are all but a few of the facets of democratizing the field. Others include, but not limited to, \emph{portability}, \emph{explainability}, \emph{credibility}, \emph{fairness}, among others. As one can imagine, democratizing AI is a multi-faceted problem, and it requires advancements in science, technology and policy. At \texttt{mlsquare}, we are developing scientific tools in this space. Specifically, we introduce an opinionated, extensible, \texttt{Python} framework that provides a single point of interface to a variety of solutions in each of the categories mentioned above. We present the design details, APIs of the framework, reference implementations, road map for development, and guidelines for contributions.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2001.00818

Country:

North America > United States (0.14)
Europe > Austria > Vienna (0.14)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.88)
Government (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining (0.93)
(2 more...)

Add feedback

Sample Complexity of Learning Mixture of Sparse Linear Regressions

Krishnamurthy, Akshay, Mazumdar, Arya, McGregor, Andrew, Pal, Soumyabrata

Neural Information Processing SystemsDec-31-2019

In the problem of learning mixtures of linear regressions, the goal is to learn a collection of signal vectors from a sequence of (possibly noisy) linear measurements, where each measurement is evaluated on an unknown signal drawn uniformly from this collection. This setting is quite expressive and has been studied both in terms of practical applications and for the sake of establishing theoretical guarantees. In this paper, we consider the case where the signal vectors are sparse; this generalizes the popular compressed sensing paradigm. We improve upon the state-of-the-art results as follows: In the noisy case, we resolve an open question of Yin et al. (IEEE Transactions on Information Theory, 2019) by showing how to handle collections of more than two vectors and present the first robust reconstruction algorithm, i.e., if the signals are not perfectly sparse, we still learn a good sparse approximation of the signals. In the noiseless case, as well as in the noisy case, we show how to circumvent the need for a restrictive assumption required in the previous work. Our techniques are quite different from those in the previous work: for the noiseless case, we rely on a property of sparse polynomials and for the noisy case, we provide new connections to learning Gaussian mixtures and use ideas from the theory of error correcting codes.

matrix, query, vector, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Indiana (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.72)

Add feedback

TOP 20 Machine Learning Course on Mighty Machine Learning Bundle Deals

#artificialintelligenceDec-30-2019, 22:24:43 GMT

In this python machine learning course, learn both supervised and unsupervised learning in python from scratch. Enroll in this course and boost your career now In this age of big data, companies across the globe use Python to sift through the avalanche of information at their disposal. By becoming proficient in unsupervised & supervised learning in Python, you can give your company a competitive edge - and boost your career to the next level.

artificial intelligence, deep learning, machine learning, (11 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Online (0.71)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.53)

Add feedback

Differentially Private M-band Wavelet-Based Mechanisms in Machine Learning Environments

Choi, Kenneth, Lee, Tony

arXiv.org Machine LearningDec-30-2019

In the post-industrial world, data science and analytics have gained paramount importance regarding digital data privacy. Improper methods of establishing privacy for accessible datasets can compromise large amounts of user data even if the adversary has a small amount of preliminary knowledge of a user. Many researchers have been developing high-level privacy-preserving mechanisms that also retain the statistical integrity of the data to apply to machine learning. Recent developments of differential privacy, such as the Laplace and Privelet mechanisms, drastically decrease the probability that an adversary can distinguish the elements in a data set and thus extract user information. In this paper, we develop three privacy-preserving mechanisms with the discrete M-band wavelet transform that embed noise into data. The first two methods (LS and LS+) add noise through a Laplace-Sigmoid distribution that multiplies Laplace-distributed values with the sigmoid function, and the third method utilizes pseudo-quantum steganography to embed noise into the data. We then show that our mechanisms successfully retain both differential privacy and learnability through statistical analysis in various machine learning environments.

dataset, mechanism, noise, (14 more...)

arXiv.org Machine Learning

2001.00012

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre: Research Report > New Finding (0.69)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)
(2 more...)

Add feedback

Localized Debiased Machine Learning: Efficient Estimation of Quantile Treatment Effects, Conditional Value at Risk, and Beyond

Kallus, Nathan, Mao, Xiaojie, Uehara, Masatoshi

arXiv.org Machine LearningDec-30-2019

We consider the efficient estimation of a low-dimensional parameter in the presence of very high-dimensional nuisances that may depend on the parameter of interest. An important example is the quantile treatment effect (QTE) in causal inference, where the efficient estimation equation involves as a nuisance the conditional cumulative distribution evaluated at the quantile to be estimated. Debiased machine learning (DML) is a data-splitting approach to address the need to estimate nuisances using flexible machine learning methods that may not satisfy strong metric entropy conditions, but applying it to problems with estimand-dependent nuisances would require estimating too many nuisances to be practical. For the QTE estimation, DML requires we learn the whole conditional cumulative distribution function, which may be challenging in practice and stands in contrast to only needing to estimate just two regression functions as in the efficient estimation of average treatment effects. Instead, we propose localized debiased machine learning (LDML), a new three-way data-splitting approach that avoids this burdensome step and needs only estimate the nuisances at a single initial bad guess for the parameters. In particular, under a Frechet-derivative orthogonality condition, we show the oracle estimation equation is asymptotically equivalent to one where the nuisance is evaluated at the true parameter value and we provide a strategy to target this alternative formulation. In the case of QTE estimation, this involves only learning two binary regression models, for which many standard, time-tested machine learning methods exist. We prove that under certain lax rate conditions, our estimator has the same favorable asymptotic behavior as the infeasible oracle estimator that solves the estimating equation with the true nuisance functions.

equation, estimation, init, (15 more...)

arXiv.org Machine Learning

1912.12945

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > New York (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.54)

Add feedback

Smell Pittsburgh: Engaging Community Citizen Science for Air Quality

Hsu, Yen-Chia, Cross, Jennifer, Dille, Paul, Tasota, Michael, Dias, Beatrice, Sargent, Randy, Huang, Ting-Hao 'Kenneth', Nourbakhsh, Illah

arXiv.org Artificial IntelligenceDec-30-2019

Urban air pollution has been linked to various human health concerns, including cardiopulmonary diseases. Communities who suffer from poor air quality often rely on experts to identify pollution sources due to the lack of accessible tools. Taking this into account, we developed Smell Pittsburgh, a system that enables community members to report odors and track where these odors are frequently concentrated. All smell report data are publicly accessible online. These reports are also sent to the local health department and visualized on a map along with air quality data from monitoring stations. This visualization provides a comprehensive overview of the local pollution landscape. Additionally, with these reports and air quality data, we developed a model to predict upcoming smell events and send push notifications to inform communities. We also applied regression analysis to identify statistically significant effects of push notifications on user engagement. Our evaluation of this system demonstrates that engaging residents in documenting their experiences with pollution odors can help identify local air pollution patterns, and can empower communities to advocate for better air quality. All citizen-contributed smell data are publicly accessible and can be downloaded from https://smellpgh.org.

interactive intelligent system, notification, pittsburgh, (11 more...)

arXiv.org Artificial Intelligence

1912.11936

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Law > Environmental Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Public Health (0.88)
(2 more...)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (1.00)
(4 more...)

Add feedback

Linear Regression in Python

#artificialintelligenceDec-28-2019, 13:51:44 GMT

In Linear Regression (LR) we used one independent variable for prediction. In Multiple Linear Regression (MLR) we used multiple independent variable for prediction. So, with the help of above code we can implement Linear Regression on our data.

independent variable, linear regression, python, (3 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Gradient descent for linear regression using Golang - Backlog

#artificialintelligenceDec-28-2019, 03:37:52 GMT

I recently decided to dive into machine learning, a field I have wanted to understand for a long time but have never had the time to pursue. I've been taking the free (and amazing!) course from Stanford University's Andrew Ng on Coursera. The first two weeks are dedicated to the Linear Gradient algorithm. In this post, I'll provide an overview of how it works and share how I implemented the vectorized version and parts of the non-vectorized version in Golang using the gonum library. Linear regression is a technique used in modeling the linear relationship between an input and its output.

algorithm, cost function, hypothesis, (13 more...)

#artificialintelligence

Country: Europe > Netherlands > North Holland > Amsterdam (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.46)

Add feedback

Logistic Regression With A Real-World Example in Python MarkTechPost

#artificialintelligenceDec-27-2019, 05:55:51 GMT

In this tutorial, You'll learn Logistic Regression. Here you'll know what exactly is Logistic Regression and you'll also see an Example with Python. Logistic Regression is an important topic of Machine Learning and I'll try to make it as simple as possible. In the early twentieth century, Logistic regression was mainly used in Biology after this, it was used in some social science applications. If you are curious, You may ask where we should use logistic regression? So we use Logistic Regression when our independent variable is categorical.

accuracy, classifier, logistic regression, (8 more...)

#artificialintelligence

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)

Add feedback

Predicting Attributes of Nodes Using Network Structure

Ali, Sarwan, Shakeel, Muhammad Haroon, Khan, Imdadullah, Faizullah, Safiullah, Khan, Muhammad Asad

arXiv.org Machine LearningDec-27-2019

In many graphs such as social networks, nodes have associated attributes representing their behavior. Predicting node attributes in such graphs is an important problem with applications in many domains like recommendation systems, privacy preservation, and targeted advertisement. Attributes values can be predicted by analyzing patterns and correlations among attributes and employing classification/regression algorithms. However, these approaches do not utilize readily available network topology information. In this regard, interconnections between different attributes of nodes can be exploited to improve the prediction accuracy. In this paper, we propose an approach to represent a node by a feature map with respect to an attribute $a_i$ (which is used as input for machine learning algorithms) using all attributes of neighbors to predict attributes values for $a_i$. We perform extensive experimentation on ten real-world datasets and show that the proposed feature map significantly improves the prediction accuracy as compared to baseline approaches on these datasets.

classifier, dataset, node, (15 more...)

arXiv.org Machine Learning

1912.12264

Country:

Asia > Pakistan > Punjab > Lahore Division > Lahore (0.05)
North America > United States > Mississippi (0.05)
North America > United States > New York (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.93)
Health & Medicine > Therapeutic Area > Immunology (0.93)
Health & Medicine > Epidemiology (0.93)
(2 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.49)

Add feedback