Goto

Collaborating Authors

 Education


Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow, 2nd Edition: Sebastian Raschka, Vahid Mirjalili: 9781787125933: Amazon.com: Books

@machinelearnbot

Sebastian Raschka, author of the bestselling book, Python Machine Learning, has many years of experience with coding in Python, and he has given several seminars on the practical applications of data science, machine learning, and deep learning, including a machine learning tutorial at SciPy - the leading conference for scientific computing in Python. While Sebastian's academic research projects are mainly centered around problem-solving in computational biology, he loves to write and talk about data science, machine learning, and Python in general, and he is motivated to help people develop data-driven solutions without necessarily requiring a machine learning background. His work and contributions have recently been recognized by the departmental outstanding graduate student award 2016-2017, as well as the ACM Computing Reviews' Best of 2016 award. In his free time, Sebastian loves to contribute to open source projects, and the methods that he has implemented are now successfully used in machine learning competitions, such as Kaggle. Vahid Mirjalili obtained his PhD in mechanical engineering working on novel methods for large-scale, computational simulations of molecular structures.


Dibakar Saha Talks About His Image Processing and Machine Learning Projects. - Cool Python Codes

#artificialintelligence

Do you know OpenCV, Machine Learning and Image Processing and you find it difficult to come up with cool amazing projects? Basically, he is a beginner in Python with experience in Image Processing and a little bit in machine learning. He has designed a very simple classification programs like spam detection and sentiment analysis using machine learning in Python. Using image processing he has also designed a very simple gesture recognition system. He has also designed a gesture-driven keyboard. And presently he is working on an app that he calls NFS Most Wanted 2013 Remote, that can control the cars in the game using your phone's accelerometer. He also revealed some tips that will help a lot of programmers out there, especially the newbies.


How to Write the Perfect Data Scientist Resume

#artificialintelligence

A job search is just a numbers game with plenty of conversion rates. Today, we'll look at how you can improve your rate of Applications Interviews by writing a winning data scientist resume. We've compiled our favorite tips for writing the perfect data scientist CV, and they're broken into 3 sections: Resumes are often misused as a "credential dump," a hodge-podge of skills and experiences. Instead, your resume should tell a persuasive story with YOU as the protagonist. Each section should work in harmony and each bullet point should add colorful details.


Scalable Generalized Linear Bandits: Online Computation and Hashing

arXiv.org Machine Learning

Generalized Linear Bandits (GLBs), a natural extension of the stochastic linear bandits, has been popular and successful in recent years. However, existing GLBs scale poorly with the number of rounds and the number of arms, limiting their utility in practice. This paper proposes new, scalable solutions to the GLB problem in two respects. First, unlike existing GLBs, whose per-time-step space and time complexity grow at least linearly with time $t$, we propose a new algorithm that performs online computations to enjoy a constant space and time complexity. At its heart is a novel Generalized Linear extension of the Online-to-confidence-set Conversion (GLOC method) that takes \emph{any} online learning algorithm and turns it into a GLB algorithm. As a special case, we apply GLOC to the online Newton step algorithm, which results in a low-regret GLB algorithm with much lower time and memory complexity than prior work. Second, for the case where the number $N$ of arms is very large, we propose new algorithms in which each next arm is selected via an inner product search. Such methods can be implemented via hashing algorithms (i.e., "hash-amenable") and result in a time complexity sublinear in $N$. While a Thompson sampling extension of GLOC is hash-amenable, its regret bound for $d$-dimensional arm sets scales with $d^{3/2}$, whereas GLOC's regret bound scales with $d$. Towards closing this gap, we propose a new hash-amenable algorithm whose regret bound scales with $d^{5/4}$. Finally, we propose a fast approximate hash-key computation (inner product) with a better accuracy than the state-of-the-art, which can be of independent interest. We conclude the paper with preliminary experimental results confirming the merits of our methods.


Optimal Rates for Multi-pass Stochastic Gradient Methods

arXiv.org Machine Learning

We analyze the learning properties of the stochastic gradient method when multiple passes over the data and mini-batches are allowed. We study how regularization properties are controlled by the step-size, the number of passes and the mini-batch size. In particular, we consider the square loss and show that for a universal step-size choice, the number of passes acts as a regularization parameter, and optimal finite sample bounds can be achieved by early-stopping. Moreover, we show that larger step-sizes are allowed when considering mini-batches. Our analysis is based on a unifying approach, encompassing both batch and stochastic gradient methods as special cases. As a byproduct, we derive optimal convergence results for batch gradient methods (even in the non-attainable cases).


How the judge on Oracle v. Google taught himself to code

#artificialintelligence

On May 18th, 2012, attorneys for Oracle and Google were battling over nine lines of code in a hearing before Judge William H. Alsup of the northern district of California. The first jury trial in Oracle v. Google, the fight over whether Google had hijacked code from Oracle for its Android system, was wrapping up. The argument centered on a function called rangeCheck. Of all the lines of code that Oracle had tested -- 15 million in total -- these were the only ones that were "literally" copied. Every keystroke, a perfect duplicate. It was in Oracle's interest to play up the significance of rangeCheck as much as possible, and David Boies, Oracle's lawyer, began to argue that Google had copied rangeCheck so that it could take Android to market more quickly. Judge Alsup was not buying it. "I couldn't have told you the first thing about Java before this trial," said the judge. "But, I have done and still do a lot of programming myself in other languages. I have written blocks of code like ...


How to Spot a Machine Learning Opportunity, Even If You Aren't a Data Scientist

@machinelearnbot

Artificial intelligence is no longer just a niche subfield of computer science. Tech giants have been using AI for years: Machine learning algorithms power Amazon product recommendations, Google Maps, and the content that Facebook, Instagram, and Twitter display in social media feeds. But William Gibson's adage applies well to AI adoption: The future is already here, it's just not evenly distributed. The average company faces many challenges in getting started with machine learning, including a shortage of data scientists. But just as important is a shortage of executives and nontechnical employees able to spot AI opportunities.


which is the best book for python machine learning ? • r/Python

@machinelearnbot

I would recommend that you start with Introduction to Statistical Learning with R (usually shortened as ISLR). A lot of people have adapted the examples to Python if you google a bit and it's an excellent book that hides just enough complexity to not be overwhelming. Plus, once you have a good understanding of all of it, you can either graduate to the more extensive version (Elements of Statistical Learning, usually shortened as ESL) for a more rigorous treatment of the same thing, or choose to go for something different like Bishop's Pattern Recognition and Machine Learning. ISLR is free as a pdf and has a corresponding MOOC. ESL doesn't, but is also free on the author's website.


Introduction to Discrete Mathematics for Computer Science Coursera

@machinelearnbot

The programme has been created based on the experience of leading American and European universities, such as Stanford University (U.S.) and EPFL (Switzerland). Also taken into consideration when creating the faculty was the School of Data Analysis, which is one of the strongest postgraduate schools in the field of computer science in Russia. In the faculty, learning is based on practice and projects. National Research University - Higher School of Economics (HSE) is one of the top research universities in Russia. Established in 1992 to promote new research and teaching in economics and related disciplines, it now offers programs at all levels of university education across an extraordinary range of fields of study including business, sociology, cultural studies, philosophy, political science, international relations, law, Asian studies, media and communications, IT, mathematics, engineering, and more.


We Need Computers with Empathy

MIT Technology Review

I was rehearsing a speech for an AI conference recently when I happened to mention Amazon Alexa. At which point Alexa woke up and announced: "Playing Selena Gomez." I had to yell "Alexa, stop!" a few times before she even heard me. But Alexa was oblivious to my annoyance. We're now surrounded by hyper-connected smart devices that are autonomous, conversational, and relational, but they're completely devoid of any ability to tell how annoyed or happy or depressed we are.