Education
Applied Machine Learning in Python Coursera
About this course: This course will introduce the learner to applied machine learning, focusing more on the techniques and methods than on the statistics behind these methods. The course will start with a discussion of how machine learning is different than descriptive statistics, and introduce the scikit learn toolkit through a tutorial. The issue of dimensionality of data will be discussed, and the task of clustering data, as well as evaluating those clusters, will be tackled. Supervised approaches for creating predictive models will be described, and learners will be able to apply the scikit learn predictive modelling methods while understanding process issues related to data generalizability (e.g. The course will end with a look at more advanced techniques, such as building ensembles, and practical limitations of predictive models.
Decoupled Learning for Factorial Marked Temporal Point Processes
Wu, Weichang, Yan, Junchi, Yang, Xiaokang, Zha, Hongyuan
This paper introduces the factorial marked temporal point process model and presents efficient learning methods. In conventional (multi-dimensional) marked temporal point process models, event is often encoded by a single discrete variable i.e. a marker. In this paper, we describe the factorial marked point processes whereby time-stamped event is factored into multiple markers. Accordingly the size of the infectivity matrix modeling the effect between pairwise markers is in power order w.r.t. the number of the discrete marker space. We propose a decoupled learning method with two learning procedures: i) directly solving the model based on two techniques: Alternating Direction Method of Multipliers and Fast Iterative Shrinkage-Thresholding Algorithm; ii) involving a reformulation that transforms the original problem into a Logistic Regression model for more efficient learning. Moreover, a sparse group regularizer is added to identify the key profile features and event labels. Empirical results on real world datasets demonstrate the efficiency of our decoupled and reformulated method. The source code is available online.
Visual Analytics in Deep Learning: An Interrogative Survey for the Next Frontiers
Hohman, Fred, Kahng, Minsuk, Pienta, Robert, Chau, Duen Horng
Deep learning has recently seen rapid development and significant attention due to its state-of-the-art performance on previously-thought hard problems. However, because of the innate complexity and nonlinear structure of deep neural networks, the underlying decision making processes for why these models are achieving such high performance are challenging and sometimes mystifying to interpret. As deep learning spreads across domains, it is of paramount importance that we equip users of deep learning with tools for understanding when a model works correctly, when it fails, and ultimately how to improve its performance. Standardized toolkits for building neural networks have helped democratize deep learning; visual analytics systems have now been developed to support model explanation, interpretation, debugging, and improvement. We present a survey of the role of visual analytics in deep learning research, noting its short yet impactful history and summarize the state-of-the-art using a human-centered interrogative framework, focusing on the Five W's and How (Why, Who, What, How, When, and Where), to thoroughly summarize deep learning visual analytics research. We conclude by highlighting research directions and open research problems. This survey helps new researchers and practitioners in both visual analytics and deep learning to quickly learn key aspects of this young and rapidly growing body of research, whose impact spans a diverse range of domains.
Learning Path: R: Real-World Data Science Solutions with R
R is an open source programming language and software environment for statistical computing and graphics. R language is widely used among statisticians and data miners for developing statistical software and data analysis. R is open source and allows integration with other applications and systems. Compared to other data analysis platforms, R has an extensive set of data products. Problems faced with data like optimization and analyzation are cleared with R's excellent data visualization feature.
Decision Tree - Theory, Application and Modeling using R
Decision Tree Model building is one of the most applied technique in analytics vertical. The decision tree model is quick to develop and easy to understand. The technique is simple to learn. A number of business scenarios in lending business / telecom / automobile etc. require decision tree model building. How long the course should take?
Deep Learning & Computer Vision in the Microsoft Azure Cloud
This is the first in a multi-part series by guest blogger Adrian Rosebrock. Adrian writes at PyImageSearch.com about computer vision and deep learning using Python, and he recently finished authoring a new book on deep learning for computer vision and image recognition. I had two goals when I set out to write my new book, Deep Learning for Computer Vision with Python. The first was to create a book/self-study program that was accessible to both novices and experienced researchers and practitioners -- we start off with the fundamentals of neural networks and machine learning and by the end of the program you're training state-of-the-art networks on the ImageNet dataset from scratch. Along the way I quickly realized that a stumbling block for many readers is configuring their development environment -- especially true for those wanted to utilize their GPU(s) and train deep neural networks on massive image datasets (such as ImageNet).
10 Artificial Intelligence Trends to Watch in 2018
Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc... ... Nand Kishor is the Product Manager of House of Bots. After finishing his studies in computer science, he ideated & re-launched Real Estate Business Intelligence Tool, where he created one of the leading Business Intelligence Tool for property price analysis in 2012. He also writes, research and sharing knowledge about Artificial Intelligence (AI), Machine Learning (ML), Data Science, Big Data, Python Language etc... Microsoft's Latest AI Creation Reveals Just How Much Computers Can Imagine today Google announces scholarship program to train 1.3 lakh Indian developers in emerging technologies 43890 views Want to be a millionaire before you turn 25? Study artificial intelligence or machine learning 43215 views
Programming Statistical Applications in R Udemy
Programming Statistical Applications in R is an introductory course teaching the basics of programming mathematical and statistical applications using the R language. The course makes extensive use of the Introduction to Scientific Programming and Simulation using R (spuRs) package from the Comprehensive R Archive Network (CRAN). The course is a scientific-programming foundations course and is a useful complement and precursor to the more simulation-application oriented R Programming for Simulation and Monte-Carlo Methods Udemy course. The two courses were originally developed as a two-course sequence (although they do share some exercises in common). Together, both courses provide a powerful set of unique and useful instruction about how to create your own mathematical and statistical functions and applications using R software.
PyTorch, a year in....
Today marks 1 year since PyTorch was released publicly. It's been a wild ride -- our quest to build a flexible deep learning research platform. Over the last year, we've seen an amazing community of people using, contributing to and evangelizing PyTorch -- thank you for the love. Looking back, we wanted to summarize PyTorch over the past year: the progress, the news and highlights from the community. We've been blessed with a strong organic community of researchers and engineers who fell in love with PyTorch.
These robots beat humans in the Stanford reading test
In less than 20 years, Alibaba has become one of the top ten largest companies in the world, primarily due to its success as an online retailer. The internet has changed the way that we shop, and as such the company is pumping money into research projects that will help ensure that it can keep up with the next game-changing advance in e-commerce.