Goto

Collaborating Authors

 Regression


An Absolute Guide to Take Off in Machine Learning – Good Audience

#artificialintelligence

Whenever we look at any online course, they take off with linear regression and this is a concept that most of us know, that is, an equation of a line initially and then gradually fitting of the best fit line. The application of this algorithm is used in machine learning as a way to predict results in the future given the feature vectors, x. So, why is the cost function a squared cost function? Why not have an absolute cost function? Well, there are plenty of reasons as to why we consider this, but when we derive this mathematically, we come across the concept of exponential families under general linear models, which generalize the notion of loss functions for any given model, and thus the square function is actually an exponential family curve.


Integrative Multi-View Reduced-Rank Regression: Bridging Group-Sparse and Low-Rank Models

arXiv.org Machine Learning

Multi-view data have been routinely collected in various fields of science and engineering. A general problem is to study the predictive association between multivariate responses and multi-view predictor sets, all of which can be of high dimensionality. It is likely that only a few views are relevant to prediction, and the predictors within each relevant view contribute to the prediction collectively rather than sparsely. We cast this new problem under the familiar multivariate regression framework and propose an integrative reduced-rank regression (iRRR), where each view has its own low-rank coefficient matrix. As such, latent features are extracted from each view in a supervised fashion. For model estimation, we develop a convex composite nuclear norm penalization approach, which admits an efficient algorithm via alternating direction method of multipliers. Extensions to non-Gaussian and incomplete data are discussed. Theoretically, we derive non-asymptotic oracle bounds of iRRR under a restricted eigenvalue condition. Our results recover oracle bounds of several special cases of iRRR including Lasso, group Lasso and nuclear norm penalized regression. Therefore, iRRR seamlessly bridges group-sparse and low-rank methods and can achieve substantially faster convergence rate under realistic settings of multi-view learning. Simulation studies and an application in the Longitudinal Studies of Aging further showcase the efficacy of the proposed methods.


Machine Learning, Data Science, and Statistics

#artificialintelligence

There are no universally agreed-upon definitions for the terms "machine learning", "data science", and "statistics". In my mind, classical statistics consists of traditional techniques that were developed from the 1920s through the 1970s. Statistics techniques include things like correlation, linear regression, and the t-test for hypothesis testing. In my mind, machine learning consists of techniques that make predictions based on data and usually require computer analysis. Examples include logistic regression classification, neural network classification, and k-means clustering.


An Absolute Guide to Take Off in Machine Learning – DataTurks: Data Annotations Made Super Easy – Medium

#artificialintelligence

Whenever we look at any online course, they take off with linear regression and this is a concept that most of us study write from our 8th grades, that is, an equation of a line initially and then gradually fitting of the best fit line. The application of this algorithm is used in machine learning as a way to predict results in the future given the feature vectors, x. So, why is the cost function a squared cost function? Why not have an absolute cost function? Well, there are plenty of reasons as to why we consider this, but when we derive this mathematically, we come across the concept of exponential families under general linear models, which generalize the notion of loss functions for any given model, and thus the square function is actually an exponential family curve.


From shallow to deep learning in fraud – Lyft Engineering

#artificialintelligence

One week into my Research Science role at Lyft, I merged my first pull request into the Fraud team's code repository and deployed our fraud decision service. No, it wasn't to launch a groundbreaking user behavior activity-based convolutional recurrent neural network trained in a semi-supervised, adversarial fashion that challenges a user to prove her identity -- it would be a couple of years before that. Embarrassingly, it was to remove a duplicate line of feature coefficients in a hand-coded logistic regression model rolled out a little less than a year before. This small bug exposed a number of limitations of a system built primarily for a different type of usage -- that of business rules that encapsulate simple, human-readable handcrafted logic. In our old worldview, models were simply extensions of business rules.


Graphs and ML: Linear Regression – Towards Data Science

#artificialintelligence

To kick off a series of Neo4j extensions for machine learning, I implemented a set of user-defined procedures that create a linear regression model in the graph database. In this post, I demonstrate use of linear regression from the Neo4j browser to suggest prices for short term rentals in Austin, Texas. Let's check out the use case: The most popular area in Austin, Texas is identified by the last two digits of its zip code: "04". With the trendiest clubs, restaurants, shops, and parks, "04" is a frequent destination for tourists. Suppose you're an Austin local who's going on vacation.


Using AI to Optimize Marketing across Multiple Platforms

#artificialintelligence

A key aspect behind the success of KAYAK lies in the way we do marketing. Today, our company portfolio consists of 6 brands operating in 60 countries around the world, and successful marketing strategies are vital to ensure further global expansion. To aid our strategic decisions, we apply a range of advanced analytics tools to measure and compare the performance of different marketing activities. One challenging problem in particular is to ensure that we provide a fair comparison between offline (TV) and online marketing (Facebook, YouTube, etc.) for use in high level budget allocation. To resolve this problem, we developed a customized machine learning framework that measures the individual contribution of each of our activities and uses the evaluation to recommend an optimal media mix.


Jensen: An Easily-Extensible C++ Toolkit for Production-Level Machine Learning and Convex Optimization

arXiv.org Machine Learning

This paper introduces Jensen, an easily extensible and scalable toolkit for production-level machine learning and convex optimization. Jensen implements a framework of convex (or loss) functions, convex optimization algorithms (including Gradient Descent, L-BFGS, Stochastic Gradient Descent, Conjugate Gradient, etc.), and a family of machine learning classifiers and regressors (Logistic Regression, SVMs, Least Square Regression, etc.). This framework makes it possible to deploy and train models with a few lines of code, and also extend and build upon this by integrating new loss functions and optimization algorithms.


Home – LearnDataSci

#artificialintelligence

Pulled from the web, here is a our collection of the best, free books on Data Science, Big Data, Data Mining, Machine Learning, Python, R, SQL, NoSQL and more. Let's use the Reddit API to grab news headlines and perform Sentiment Analysis Expanding on the previous article, we'll be looking at how to incorporate recent price behaviors into our strategy In this post, we'll walk through building linear regression models to predict housing prices resulting from economic activity.


Teaching machines to understand data science code by semantic enrichment of dataflow graphs

arXiv.org Artificial Intelligence

Your computer is continuously executing programs, but does it really understand them? Not in any meaningful sense. That burden falls upon human knowledge workers, who are increasingly asked to write and understand code. They would benefit greatly from intelligent tools that reveal the connections between their code and its subject matter. Towards this prospect, we develop an AI system that forms semantic representations of computer programs, using techniques from knowledge representation and program analysis. We focus on code written for data science, although our method is more generally applicable. The semantic representations are created through a novel algorithm for the semantic enrichment of dataflow graphs. This algorithm is undergirded by a new ontology language for modeling computer programs and a new ontology about data science, written in this language.