Goto

Collaborating Authors

probability


A Complete Anomaly Detection Algorithm From Scratch in Python: Step by Step Guide

#artificialintelligence

Anomaly detection can be treated as a statistical task as an outlier analysis. But if we develop a machine learning model, it can be automated and as usual, can save a lot of time. There are so many use cases of anomaly detection. Credit card fraud detection, detection of faulty machines, or hardware systems detection based on their anomalous features, disease detection based on medical records are some good examples. There are many more use cases.


Train Your GAN With 1/10th of the Data! NVIDIA ADA Explained

#artificialintelligence

This new paper covers a technique for training a GAN architecture. They are used in many applications related to computer vision, where we want to generate a realistic transformation of an image following a specific style. If you are not familiar with how GANs work, I definitely recommend you to watch the video I made explaining it before continuing this one. As you know, GANs architecture trains in an adversarial way. Meaning that there are two networks training at the same time, one training to generate a transformed image from the input, the generator, and the other one training to differentiate the generated images from the training images' ground truths.


Introduction to Markov Chain Programming

#artificialintelligence

Now that you are familiar with how a Markov Chain works, you can deep dive into more complex stochastic modelling techniques, such as the Hidden Markov Process or MCMC. Stay tuned for more of these follow-up contents!


Are Crossword Solvers About to Go the Way of Chess Players?

#artificialintelligence

Nearly 1,300 people spent this past weekend racing to fill little boxes inside larger boxes, ever mindful of spelling, trivia, wordplay, and a ticking clock. They were competitors--newcomers, ardent hobbyists, and elite speed solvers--in the American Crossword Puzzle Tournament, the pastime's most prestigious competition. And most of them got creamed by some software. The annual event, normally set in a packed hotel ballroom with solvers separated by yellow dividers, was virtual this year, pencils swapped for keyboards. After millions of little boxes had been filled, a computer program topped the leaderboard for the first time.


Genetic Algorithm (GA) Introduction with Example Code

#artificialintelligence

This tutorial will be diving into genetic algorithms in detail and explaining their implementation in Python. We will also explore the different methods involved in each step diagrammatically. As always, we are including code for reproducibility purposes. We have split the code when required while exploring the different steps involved during our implementation. Make sure to check the full implementation from this tutorial on either Google Colab or Github.


Data Science Quiz

#artificialintelligence

Interviews are the most challenging part of getting any job especially for Data Scientist and Machine Learning Engineer roles where you are tested on Machine Learning and Deep Learning concepts. So, Given below is a short quiz that consists of 25 Questions consisting of MCQs(One or more correct), True-False, and Integer Type Questions to check your knowledge. Explanation: The derivative of Leaky RELU activation function h(z) is 1 only for z 0, while for z 0, it has a very small value. Explanation: Residuals are vertical offset and the sum of residuals is always zero. Explanation: For deciding class w1, the conditional Risk for w1 is smaller than w2.


Bayesian and frequentist results are not the same, ever

#artificialintelligence

I often hear people say that the results from Bayesian methods are the same as the results from frequentist methods, at least under certain conditions. And sometimes it even comes from people who understand Bayesian methods. Today I saw this tweet from Julia Rohrer: "Running a Bayesian multi-membership multi-level probit model with a custom function to generate average marginal effects only to find that the estimate is precisely the same as the one generated by linear regression with dummy-coded group membership." Which elicited what I interpret as good-natured teasing, like this tweet from Daniël Lakens: "I always love it when people realize that the main difference between a frequentist and Bayesian analysis is that for the latter approach you first need to wait 24 hours for the results." Ok, that's funny, but there is a serious point here I want to respond to because both of these comments are based on the premise that we can compare the results from Bayesian and frequentist methods.


Wolfram JavaScript Character-Level Language Model V1 - Wolfram Neural Net Repository

#artificialintelligence

A built-in option for temperature sampling is available in Wolfram Language 12.0, while it has to be implemented explicitly in earlier versions. Generate for 100 steps using "alert" as an initial string: The third optional argument is a "temperature" parameter that scales the input to the final softmax. Very low temperature settings are equivalent to always picking the character with maximum probability. It is typical for sampling to "get stuck in a loop":


10 Best Data Science Books for Beginners and Advanced Data Scientist [Updated]

#artificialintelligence

Apart from the fact that Data Science is one of the highest-paid and most popular fields of date, it is also important to note that it will continue to be more innovative and challenging for another decade or more. There will be enough data science jobs that can fetch you a handsome salary as well as opportunities to grow. That said, there is nothing better than reading data science books to get the ball rolling. Learning data science through books will help you get a holistic view of Data Science as data science is not just about computing, it also includes mathematics, probability, statistics, programming, machine learning, and much more. Just like other books of Headfirst, the tone of this book is friendly and conversational and the best book for data science to start with. The book covers a lot of statistics starting with descriptive statistics – mean, median, mode, standard deviation – and then go on to probability and inferential statistics like correlation, regression, etc… If you were a science or commerce student in school, you may have studied all of it, and the book is a great start to refresh everything you have already learned in a detailed manner. There are a lot of pictures and graphics and bits on the sides that are easy to remember. You can find some good real-life examples to keep you hooked on to the book.


10 Must-Know Statistical Concepts for Data Scientists - KDnuggets

#artificialintelligence

Data science is an interdisciplinary field. One of the building blocks of data science is statistics. Without a decent level of statistics knowledge, it would be highly difficult to understand or interpret the data. Statistics helps us explain the data. We use statistics to infer results about a population based on a sample drawn from that population.