Goto

Collaborating Authors

 Education



Scientists slash computations for deep learning: 'Hashing' can eliminate more than 95 percent of computations

#artificialintelligence

"This applies to any deep-learning architecture, and the technique scales sublinearly, which means that the larger the deep neural network to which this is applied, the more the savings in computations there will be," said lead researcher Anshumali Shrivastava, an assistant professor of computer science at Rice. The research will be presented in August at the KDD 2017 conference in Halifax, Nova Scotia. It addresses one of the biggest issues facing tech giants like Google, Facebook and Microsoft as they race to build, train and deploy massive deep-learning networks for a growing body of products as diverse as self-driving cars, language translators and intelligent replies to emails. Shrivastava and Rice graduate student Ryan Spring have shown that techniques from "hashing," a tried-and-true data-indexing method, can be adapted to dramatically reduce the computational overhead for deep learning. Hashing involves the use of smart hash functions that convert data into manageable small numbers called hashes.


So You Want to be a Data Scientist

@machinelearnbot

Summary: In which we attempt to answer the question, how does someone in school or recently out enter the exciting world of data science. There is no question that comes up more frequently than'how do I become a data scientist'. I've actually written several articles on this topic (and will reference them liberally in this post) but they lacked the global perspective that potential new entrants to data science want. I'm going to try to resolve here. I thought about changing the title to "Doing Data Science" instead of becoming a Data Scientist to focus on the activity and not just the job title. There are two good reasons. First, not everyone doing data science is necessarily a data scientist.


College Students Had An AI Teaching Assistant, And ...

#artificialintelligence

Goel got the idea to recruit an AI teaching assistant because of the sheer workload he and his graduate students faced every semester. Knowledge Based Artificial Intelligence is a core requirement of Georgia Tech's online master's of computer science program, and as a result, 300 students take it each semester--and rack up around 10,000 messages in the class's online forums. To develop the AI, Goel and his students gathered up every message that had ever been posted in the forums since the class first started, amassing about 40,000 messages in all. Then, they fed the postings to their robo-TA to help her learn the kinds of questions that might be asked and the kind of answers that would be helpful. At first, Jill Watson wasn't able to answer enough questions to be a viable force for good on the message boards.


Programming for Data Science the Polyglot approach: Python R SQL

@machinelearnbot

Outside of Data science, I also co-founded a social enterprise to teach Computer Science to kids Feynlabs. At Feynlabs, we have been working with ways to accelerate learning to Code. One way to do this is to compare and contrast multiple programming languages. This approach makes sense for Data Science also because a learner can potentially approach Data science from many directions. To learn programming for Data Science, it would thus help to build up from an existing foundation they are already familiar with and then co-relate new ideas to this foundation through other approaches. From a pedagogical standpoint, this approach is similar to David Asubel who stressed the importance of prior knowledge in being able to learn new concepts: "The most important single factor influencing learning is what the learner already knows."


Text Mining course taught by Anurag Bhardwaj

@machinelearnbot

In this online course, you will be introduced to the essential techniques of text mining, understood here as the extension of data mining's standard predictive methods to unstructured text. This course will discuss these standard techniques, and will devote considerable attention to the data preparation and handling methods that are required to transform unstructured text into a form in which it can be mined.


Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis

arXiv.org Machine Learning

Stochastic Gradient Langevin Dynamics (SGLD) is a popular variant of Stochastic Gradient Descent, where properly scaled isotropic Gaussian noise is added to an unbiased estimate of the gradient at each iteration. This modest change allows SGLD to escape local minima and suffices to guarantee asymptotic convergence to global minimizers for sufficiently regular non-convex objectives (Gelfand and Mitter, 1991). The present work provides a nonasymptotic analysis in the context of non-convex learning problems, giving finite-time guarantees for SGLD to find approximate minimizers of both empirical and population risks. As in the asymptotic setting, our analysis relates the discrete-time SGLD Markov chain to a continuous-time diffusion process. A new tool that drives the results is the use of weighted transportation cost inequalities to quantify the rate of convergence of SGLD to a stationary distribution in the Euclidean $2$-Wasserstein distance.


How Artificial Intelligence enhances education

#artificialintelligence

In the past years, a collection of hardware, software and online service have managed to bring changes and reforms to classrooms and teaching methods. But the true disruption of education is yet to arrive. Artificial Intelligence has proven its role as a game changing factor in an increasing number of fields, causing transformations unimaginable in the past. It's now showing glimmers of how it might forever change the learning process, one of the oldest skills that mankind has mastered. Here's how AI and its derivatives are gradually finding their way into the classroom, and beyond.


50 Shades of Grey – The Psychology of a Data Scientist

@machinelearnbot

Unless you've recently graduated from one of the new Data Science courses that have been popping up online and in various universities around the world, then becoming a Data Scientist was most likely slightly accidental and was more about the journey than the destination. I started out as a physicist and had a strong mathematical grounding, but I had a passion for medicine. After completing my bachelor's degree I took a master's degree in medical physics. This is where I gained an appreciation for the importance of image analysis and the role that data plays in medicine. I created a virtual model of a human torso by segmenting images from the Visible Human Project.


Scientists slash computations for deep learning

#artificialintelligence

Rice University computer scientists have adapted a widely used technique for rapid data lookup to slash the amount of computation--and thus energy and time--required for deep learning, a computationally intense form of machine learning. "This applies to any deep-learning architecture, and the technique scales sublinearly, which means that the larger the deep neural network to which this is applied, the more the savings in computations there will be," said lead researcher Anshumali Shrivastava, an assistant professor of computer science at Rice. The research will be presented in August at the KDD 2017 conference in Halifax, Nova Scotia. It addresses one of the biggest issues facing tech giants like Google, Facebook and Microsoft as they race to build, train and deploy massive deep-learning networks for a growing body of products as diverse as self-driving cars, language translators and intelligent replies to emails. Shrivastava and Rice graduate student Ryan Spring have shown that techniques from "hashing," a tried-and-true data-indexing method, can be adapted to dramatically reduce the computational overhead for deep learning.