Inductive Learning
Physiognomy's New Clothes – Blaise Aguera y Arcas – Medium
In 1844, a laborer from a small town in southern Italy was put on trial for stealing "five ricottas, a hard cheese, two loaves of bread […] and two kid goats". The laborer, Giuseppe Villella, was reportedly convicted of being a brigante (bandit), at a time when brigandage -- banditry and state insurrection -- was seen as endemic. Villella died in prison in Pavia, northern Italy, in 1864. Villella's death led to the birth of modern criminology. Nearby lived a scientist and surgeon named Cesare Lombroso, who believed that brigantes were a primitive type of people, prone to crime.
Marketing Machines: Is Machine Learning Helping Marketers or Making Us Obsolete?
Supervised learning systems rely upon humans to label the incoming data -- at least to begin with -- in order for the systems to better predict how to classify future input data. Gmail's spam filter is a great example of this. When you label incoming mail as either spam or not spam, you're not only cleaning up your inbox, you're also training Gmail's filter (a machine learning system) to identify what you consider to be spam (or not spam) in the future. According to Tommy, this type of machine learning can be likened to the relationship between a parent and a young child. When a child does something positive they're rewarded.
People on Drugs: Credibility of User Statements in Health Communities
Mukherjee, Subhabrata, Weikum, Gerhard, Danescu-Niculescu-Mizil, Cristian
Online health communities are a valuable source of information for patients and physicians. However, such user-generated resources are often plagued by inaccuracies and misinformation. In this work we propose a method for automatically establishing the credibility of user-generated medical statements and the trustworthiness of their authors by exploiting linguistic cues and distant supervision from expert sources. To this end we introduce a probabilistic graphical model that jointly learns user trustworthiness, statement credibility, and language objectivity. We apply this methodology to the task of extracting rare or unknown side-effects of medical drugs --- this being one of the problems where large scale non-expert data has the potential to complement expert medical knowledge. We show that our method can reliably extract side-effects and filter out false statements, while identifying trustworthy users that are likely to contribute valuable medical information.
One-Class Semi-Supervised Learning: Detecting Linearly Separable Class by its Mean
Bauman, Evgeny, Bauman, Konstantin
In this paper, we presented a novel semi-supervised one-class classification algorithm which assumes that class is linearly separable from other elements. We proved theoretically that class is linearly separable if and only if it is maximal by probability within the sets with the same mean. Furthermore, we presented an algorithm for identifying such linearly separable class utilizing linear programming. We described three application cases including an assumption of linear separability, Gaussian distribution, and the case of linear separability in transformed space of kernel functions. Finally, we demonstrated the work of the proposed algorithm on the USPS dataset and analyzed the relationship of the performance of the algorithm and the size of the initially labeled sample.
Handling imbalanced dataset in supervised learning using family of SMOTE algorithm.
The algorithm adaptively updates the distribution and there are no assumptions made for the underlying distribution of the data. The algorithm uses Euclidean distance for KNN Algorithm. The key difference between ADASYN and SMOTE is that the former uses a density distribution, as a criterion to automatically decide the number of synthetic samples that must be generated for each minority sample by adaptively changing the weights of the different minority samples to compensate for the skewed distributions. The latter generates the same number of synthetic samples for each original minority sample.
Google Research Blog posts Bridge and Tunnel Investor
October 06, 2016 Posted by Sujith Ravi, Staff Research Scientist, Google Research Recently, there have been significant advances in Machine Learning that enable computer systems to solve complex real-world problems. One of those advances is Google's large scale, graph-based machine learning platform, built by the Expander team in Google Research. A technology that is behind many of the Google products and features you may use everyday, graph-based machine learning is a powerful tool that can be used to power useful features such as reminders in Inbox and smart messaging in Allo, or used in conjunction with deep neural networks to power the latest image recognition system in Google Photos. Learning with Minimal Supervision Much of the recent success in deep learning, and machine learning in general, can be attributed to models that demonstrate high predictive capacity when trained on large amounts of labeled data -- often millions of training examples. This is commonly referred to as "supervised learning" since it requires supervision, in the form of labeled data, to train the machine learning systems.
Sergey Levine: Deep Robotic Learning CMU RI Seminar
Abstract: "Deep learning methods have provided us with remarkably powerful, flexible, and robust solutions in a wide range of passive perception areas: computer vision, speech recognition, and natural language processing. However, active decision making domains such as robotic control present a number of additional challenges, standard supervised learning methods do not extend readily to robotic decision making, where supervision is difficult to obtain. In this talk, I will discuss experimental results that hint at the potential of deep learning to transform robotic decision making and control, present a number of algorithms and models that can allow us to combine expressive, high-capacity deep models with reinforcement learning and optimal control, and describe some of our recent work on scaling up robotic learning through collective learning with multiple robots."
A Neural Probabilistic Structured-Prediction Method for Transition-Based Natural Language Processing
Zhou, Hao, Zhang, Yue, Cheng, Chuan, Huang, Shujian, Dai, Xinyu, Chen, Jiajun
We propose a neural probabilistic structured-prediction method for transition-based natural language processing, which integrates beam search and contrastive learning. The method uses a global optimization model, which can leverage arbitrary features over non-local context. Beam search is used for efficient heuristic decoding, and contrastive learning is performed for adjusting the model according to search errors. When evaluated on both chunking and dependency parsing tasks, the proposed method achieves significant accuracy improvements over the locally normalized greedy baseline on the two tasks, respectively.
Just What Is Deep Learning, and What Does It Solve In Marketing?
How is a network trained? When given input data with a labeled answer (meaning it's already been classified), the deviation between the network's predictions and the actual answer produces an error signal. The error signal and non-linearity of each neuron's decision function tells us whether to increase or decrease each weight. The error signal gets propagated backwards all the way to the lowest layer. Over many training examples, the network weights are repeatedly tuned until finally reaching some satisfactory benchmark, such as accuracy level.