Goto

Collaborating Authors

 Instructional Material


[P]I wrote a tutorial about Inverse Reinforcement Learning and three basic algorithms. More to follow. • r/MachineLearning

@machinelearnbot

This idea is really interesting. Sadly I don't have nearly enough linear algebra experience to understand the details though. Would IRL still be feasible if the state was not explicit? It seems like this technique depends on prior knowledge of the state machine, but from what I understand about deep reinforcement learning, the state may be very complex, and the value function could actually be a deep neural network.


A Gentle Introduction to Matrix Factorization for Machine Learning - Machine Learning Mastery

#artificialintelligence

The LU decomposition is found using an iterative numerical process and can fail for those matrices that cannot be decomposed or decomposed easily. A variation of this decomposition that is numerically more stable to solve in practice is called the LUP decomposition, or the LU decomposition with partial pivoting. The rows of the parent matrix are re-ordered to simplify the decomposition process and the additional P matrix specifies a way to permute the result or return the result to the original order. There are also other variations of the LU. The LU decomposition is often used to simplify the solving of systems of linear equations, such as finding the coefficients in a linear regression, as well as in calculating the determinant and inverse of a matrix.


From 0 to 1:Machine Learning Techniques, NLP & Python-Cut to the Chase

#artificialintelligence

Prerequisites: No prerequisites, knowledge of some undergraduate level mathematics would help but is not mandatory. Working knowledge of Python would be helpful if you want to run the source code that is provided. Taught by a Stanford-educated, ex-Googler and an IIT, IIM – educated ex-Flipkart lead analyst. This team has decades of practical experience in quant trading, analytics and e-commerce. The course is shy but confident: It is authoritative, drawn from decades of practical experience -but shies away from needlessly complicating stuff.


Datasets for Natural Language Processing - Machine Learning Mastery

#artificialintelligence

You need datasets to practice on when getting started with deep learning for natural language processing tasks. It is better to use small datasets that you can download quickly and do not take too long to fit models. Further, it is also helpful to use standard datasets that are well understood and widely used so that you can compare your results to see if you are making progress. In this post, you will discover a suite of standard datasets for natural language processing tasks that you can use when getting started with deep learning. I have tried to provide a mixture of datasets that are popular for use in academic papers that are modest in size.


Machine Learning Puts New Lens on #IoT. A Step-by-Step Guide to #Azure #MachineLearning

#artificialintelligence

Healthcare organizations need predictive analytics for providing quality healthcare and population health management. Building predictive models by applying machine learning algorithms is complex in the infrastructure-as-a-service or platform-as-as-a-service environment as it involves distributed computing. The emergence of predictive analytics in the healthcare industry has offered enormous opportunity to be able to predict the events in healthcare organization and other industries as well such as aerospace industry. Predictive analytics is a subfield of data science that deploys several multi-disciplinary fields such as statistical inference, machine learning, clustering, data visualization, and machine learning iteratively through the lifecycle of the data analytics. The stages can be defined as defining the problem statement for the organization, scope of the data analytics project, collection of big data, exploratory data analysis, data preparation, deployment of predictive models leveraging machine learning algorithms.


What is the future of work?

#artificialintelligence

A new podcast series from the McKinsey Global Institute explores how technologies like automation, robotics, and artificial intelligence are shaping how we work, where we work, and the skills we need to work. The future of work is one of the hottest topics in 2017, with conflicting information from various experts leaving plenty of room for debate around what impact automation technology like artificial intelligence (AI) and robotics will have on jobs, skills, and wages. In the first episode of the New World of Work podcast from the McKinsey Global Institute--which is being featured in the McKinsey Podcast series--MGI chairman and director James Manyika speaks with senior editor Peter Gumbel about what these technologies are, how they will change work, and what new research says we can expect. This is our new series on work, the world of work, and the changing world of work. Today, for our first podcast on this issue, I'm with James Manyika, who is the chairman and director of the McKinsey Global Institute; he's also a senior partner at McKinsey and is based in the San Francisco office. James, this issue of work and the future of work is one that you have been looking at for some time, with work on automation and with the latest report on jobs, Jobs lost, jobs gained. Perhaps, you can start off by telling us about the broader issues, and which ones you're focusing on. James Manyika: Well, I think we're having an interesting time in our history and our economy around the future of work. It comes up in almost every conversation with students, workers, CEOs, and policymakers.


Unsupervised Deep Learning in Python Udemy

@machinelearnbot

This course is the next logical step in my deep learning, data science, and machine learning series. I've done a lot of courses about deep learning, and I just released a course about unsupervised learning, where I talked about clustering and density estimation. So what do you get when you put these 2 together? In these course we'll start with some very basic stuff - principal components analysis (PCA), and a popular nonlinear dimensionality reduction technique known as t-SNE (t-distributed stochastic neighbor embedding). Next, we'll look at a special type of unsupervised neural network called the autoencoder.


Dropout Model Evaluation in MOOCs

arXiv.org Machine Learning

The field of learning analytics needs to adopt a more rigorous approach for predictive model evaluation that matches the complex practice of model-building. In this work, we present a procedure to statistically test hypotheses about model performance which goes beyond the state-of-the-practice in the community to analyze both algorithms and feature extraction methods from raw data. We apply this method to a series of algorithms and feature sets derived from a large sample of Massive Open Online Courses (MOOCs). While a complete comparison of all potential modeling approaches is beyond the scope of this paper, we show that this approach reveals a large gap in dropout prediction performance between forum-, assignment-, and clickstream-based feature extraction methods, where the latter is significantly better than the former two, which are in turn indistinguishable from one another. This work has methodological implications for evaluating predictive or AI-based models of student success, and practical implications for the design and targeting of at-risk student models and interventions.


Online Machine Learning in Big Data Streams

arXiv.org Machine Learning

The area of online machine learning in big data streams covers algorithms that are (1) distributed and (2) work from data streams with only a limited possibility to store past data. The first requirement mostly concerns software architectures and efficient algorithms. The second one also imposes nontrivial theoretical restrictions on the modeling methods: In the data stream model, older data is no longer available to revise earlier suboptimal modeling decisions as the fresh data arrives. In this article, we provide an overview of distributed software architectures and libraries as well as machine learning models for online learning. We highlight the most important ideas for classification, regression, recommendation, and unsupervised modeling from streaming data, and we show how they are implemented in various distributed data stream processing systems. This article is a reference material and not a survey. We do not attempt to be comprehensive in describing all existing methods and solutions; rather, we give pointers to the most important resources in the field. All related sub-fields, online algorithms, online learning, and distributed data processing are hugely dominant in current research and development with conceptually new research results and software components emerging at the time of writing. In this article, we refer to several survey results, both for distributed data processing and for online machine learning. Compared to past surveys, our article is different because we discuss recommender systems in extended detail.


SpectralLeader: Online Spectral Learning for Single Topic Models

arXiv.org Machine Learning

We study the problem of learning a latent variable model from a stream of data. Latent variable models are popular in practice because they can explain observed data in terms of unobserved concepts. These models have been traditionally studied in the offline setting. The online EM is arguably the most popular algorithm for learning latent variable models online. Although it is computationally efficient, it typically converges to a local optimum. In this work, we develop a new online learning algorithm for latent variable models, which we call SpectralLeader. SpectralLeader always converges to the global optimum, and we derive a $O(\sqrt{n})$ upper bound up to log factors on its $n$-step regret in the bag-of-words model. We show that SpectralLeader performs similarly to or better than the online EM with tuned hyper-parameters, in both synthetic and real-world experiments.