Education
[P] A reinforcement learning environment for the protein folding problem (2-D HP Lattice Model) • r/MachineLearning
This project is cool, but it's an extreme oversimplification of protein folding. And that's fine for what it is. Folding@home attempts to simulate real biomolecular systems in atomic detail, using 3D physics-based molecular models (usually based on approximations to quantum mechanics) and Markov chain Monte Carlo sampling to search geometric/folding configurations with a simulated set of thermodynamic conditions. One could think of Folding@home simulations as a reinforcement learning problem where the reward function is the minimization of Gibbs free energy with respect to the protein's geometry/folding.
AI technology helps students who are deaf learn
As stragglers settle into their seats for general biology class, real-time captions of the professor's banter about general and special senses – "Which receptor picks up pain? An interpreter stands a few feet away and interprets the professor's spoken words into American Sign Language, the primary language used by the deaf in the US. Except for the real-time captions on the screens in front of the room, this is a typical class at the Rochester Institute of Technology in upstate New York. About 1,500 students who are deaf and hard of hearing are an integral part of campus life at the sprawling university, which has 15,000 undergraduates. Nearly 700 of the students who are deaf and hard of hearing take courses with students who are hearing, including several dozen in Sandra Connelly's general biology class of 250 students. The captions on the screens behind Connelly, who wears a headset, are generated by Microsoft Translator, an AI-powered communication technology. The system uses an ...
Artificial intelligence gives HR an opportunity to transform the enterprise
Technological innovations have impacted almost every aspect of life over the past century. The steam engine and electricity allowed total labor productivity to grow at more than 2 percent per year and the total number of weekly work hours to drop from 60 to 40. The concept of artificial intelligence is not new but has recently come into view as a technology that is capable of revolutionizing the world and bringing a new industrial revolution. A significant amount of research and discussion among scientists and economists has focused on the critical topic of disruption to labor markets and potential productivity gains from AI. However, there is a more nuanced picture of the way in which AI will reshape how we work – in many cases augmenting our abilities and supporting organizations to redefine their operating models for improved performance and agility.
Microsoft Professional Program Artificial Intelligence track
Reinforcement Learning (RL) is an area of machine learning, where an agent learns by interacting with its environment to achieve a goal.In this course, you will be introduced to the world of reinforcement learning. You will learn how to frame reinforcement learning problems and start tackling classic examples like news recommendation, learning to navigate in a grid-world, and balancing a cart-pole. You will explore the basic algorithms from multi-armed bandits, dynamic programming, TD (temporal difference) learning, and progress towards larger state space using function approximation, in particular using deep learning. You will also learn about algorithms that focus on searching the best policy with policy gradient and actor critic methods. Along the way, you will get introduced to Project Malmo, a platform for Artificial Intelligence experimentation and research built on top of the Minecraft game.
Cross-Language Learning for Program Classification Using Bilateral Tree-Based Convolutional Neural Networks
Bui, Nghi D. Q. (Singapore Management University) | Jiang, Lingxiao (Singapore Management University) | Yu, Yijun (The Open University)
Towards the vision of translating code that implements an algorithm from one programming language into another, this paper proposes an approach for automated program classification using bilateral tree-based convolutional neural networks (BiTBCNNs). It is layered on top of two tree-based convolutional neural networks (TBCNNs), each of which recognizes the algorithm of code written in an individual programming language. The combination layer of the networks recognizes the similarities and differences among code in different programming languages. The BiTBCNNs are trained using the source code in different languages but known to implement the same algorithms and/or functionalities. For a preliminary evaluation, we use 3591 Java and 3534 C++ code snippets from 6 algorithms we crawled systematically from GitHub. We obtained over 90% accuracy in the cross-language binary classification task to tell whether any given two code snippets implement a same algorithm. Also, for the algorithm classification task, i.e., to predict which one of the six algorithm labels is implemented by an arbitrary C++ code snippet, we achieved over 80% precision.
Multiple-Implementation Testing of Supervised Learning Software
Srisakaokul, Siwakorn (University of Illinois at Urbana-Champaign) | Wu, Zhengkai (University of Illinois at Urbana-Champaign) | Astorga, Angello (University of Illinois at Urbana-Champaign) | Alebiosu, Oreoluwa (University of Illinois at Urbana-Champaign) | Xie, Tao (University of Illinois at Urbana-Champaign)
Machine Learning (ML) algorithms are now used in a wide range of application domains in society. Naturally, software implementations of these algorithms have become ubiquitous. Faults in ML software can cause substantial losses in these application domains. Thus, it is very critical to conduct effective testing of ML software to detect and eliminate its faults. However, testing ML software is difficult, partly because producing test oracles used for checking behavior correctness (such as using expected properties or expected test outputs) is challenging. In this paper, we propose an approach of multiple-implementation testing to test supervised learning software, a major type of ML software. In particular, our approach derives a test input's proxy oracle from the majority-voted output running the test input of multiple implementations of the same algorithm (based on a pre-defined percentage threshold). Our approach reports likely those test inputs whose outputs (produced by an implementation under test) are different from the majority-voted outputs as failing tests. We evaluate our approach on two highly popular supervised learning algorithms: k-Nearest Neighbor (kNN) and Naive Bayes (NB). Our results show that our approach is highly effective in detecting faults in real-world supervised learning software. In particular, our approach detects 13 real faults and 1 potential fault from 19 kNN implementations and 16 real faults from 7 NB implementations. Our approach can even detect 7 real faults and 1 potential fault among the three popularly used open-source ML projects (Weka, RapidMiner, and KNIME).
Adaptive Cost-sensitive Online Classification
Zhao, Peilin, Zhang, Yifan, Wu, Min, Hoi, Steven C. H., Tan, Mingkui, Huang, Junzhou
Cost-Sensitive Online Classification has drawn extensive attention in recent years, where the main approach is to directly online optimize two well-known cost-sensitive metrics: (i) weighted sum of sensitivity and specificity; (ii) weighted misclassification cost. However, previous existing methods only considered first-order information of data stream. It is insufficient in practice, since many recent studies have proved that incorporating second-order information enhances the prediction performance of classification models. Thus, we propose a family of cost-sensitive online classification algorithms with adaptive regularization in this paper. We theoretically analyze the proposed algorithms and empirically validate their effectiveness and properties in extensive experiments. Then, for better trade off between the performance and efficiency, we further introduce the sketching technique into our algorithms, which significantly accelerates the computational speed with quite slight performance loss. Finally, we apply our algorithms to tackle several online anomaly detection tasks from real world. Promising results prove that the proposed algorithms are effective and efficient in solving cost-sensitive online classification problems in various real-world domains.
Differentiable plasticity: training plastic neural networks with backpropagation
Miconi, Thomas, Clune, Jeff, Stanley, Kenneth O.
How can we build agents that keep learning from experience, quickly and efficiently, after their initial training? Here we take inspiration from the main mechanism of learning in biological brains: synaptic plasticity, carefully tuned by evolution to produce efficient lifelong learning. We show that plasticity, just like connection weights, can be optimized by gradient descent in large (millions of parameters) recurrent networks with Hebbian plastic connections. First, recurrent plastic networks with more than two million parameters can be trained to memorize and reconstruct sets of novel, high-dimensional (1,000 pixels) natural images not seen during training. Crucially, traditional non-plastic recurrent networks fail to solve this task. Furthermore, trained plastic networks can also solve generic meta-learning tasks such as the Omniglot task, with competitive results and little parameter overhead. Finally, in reinforcement learning settings, plastic networks outperform a non-plastic equivalent in a maze exploration task. We conclude that differentiable plasticity may provide a powerful novel approach to the learning-to-learn problem.
Machine Learning in a Year – Learning New Stuff – Medium
During the christmas vacation of 2015, I got a motivational boost again and decided try out Kaggle. So I spent quite some time experimenting with various algorithms for their Homesite Quote Conversion, Otto Group Product Classification and Bike Sharing Demand contests. The main takeaway from this was the experience of iteratively improving the results by experimenting with the algorithms and the data. I learned to trust my logic when doing machine learning. If tweaking a parameter or engineering a new feature seems like a good idea logically, it's quite likely that it actually will help.