Goto

Collaborating Authors

probability


The Word "AI" Has Become a Marketing Ploy

#artificialintelligence

It turns out that Scry is a "social forecasting platform." Users join for free and can enter their personal estimates of the probabilities that certain events will happen, with Scry calculating the average probability. For example, one question is, "Will Apple launch a commercial self-driving electric vehicle before the end of 2024?" As I write this, there are 18 responses, entered up to six months ago. Eight answers are 50-50 and two are 100% yes.


N-gram language models -- Part 1

#artificialintelligence

Language modeling -- that is, predicting the probability of a word in a sentence -- is a fundamental task in natural language processing. It is used in many NLP applications such as autocomplete, spelling correction, or text generation. Currently, language models based on neural networks, especially transformers, are the state of the art: they predict very accurately a word in a sentence based on surrounding words. However, in this project, I will revisit the most classic language model: the n-gram models. In this project, my training data set -- appropriately called train -- is "A Game of Thrones", the first book in the George R. R. Martin fantasy series that inspired the popular TV show of the same name.


Machine Learning Testing for Data Scientists

#artificialintelligence

In one software development project after another, it has been proven that testing saves time. Does this hold true for machine learning projects? Should data scientists write tests? Will it make their work better and/or faster? We believe the answer is YES! In this post we describe a full development and deployment flow that speeds The post Machine Learning Testing for Data Scientists appeared first on Blog.


Top 70+ Data Science Interview Questions and Answers for 2021

#artificialintelligence

We can see Pr value here, and there are three stars associated with this Pr value. This basically means that we can reject the null hypothesis which states that there is no relationship between the age and the target columns. But since we have three stars over here, this null hypothesis can be rejected. There is a strong relationship between the age column and the target column. Now, we have other parameters like null deviance and residual deviance.


No More "What" Without the "Why"

#artificialintelligence

Throughout the last months, I had the chance to enable various organizations and leaders leveraging their large databases with machine learning. I was particularly engaging with member organisations which struggle with rising dropout rates (churns) -- an issue that became even more serious throughout the pandemic when individual income has been on a declining and the fear of job loss on a rising path. With machine learning, we used very large membership databases with individual-level information (e.g. Machine Learning tells us the "What", Causal Inference the "Why" Despite the overall good performance of the machine learning models, our clients were always interested in one obvious question: Why does an individual member leave? Unfortunately, machine learning models are not suited to identify the causes of things but rather they are built to predict things.


Effect of Unbalanced and Mixed Dataset on ML Model

#artificialintelligence

Machine learning requires creating a robust training dataset since the training performs as the seed for subsequent model evaluation. If the training data is corrupted, the model will perform badly…


When Smart Cars make Bad Choices

#artificialintelligence

Its 2030 and a SUV driven by an Autonomous Driving System (ADS) is heading west on a highway. The SUV contains two parents in the front seats and two small children in the back seat. The SUV is going the speed limit of 100 km/hour. The SUV drives through a tight corner and as the SUV makes the final turn a large bull moose weighing over six hundred kilograms shambles onto the road. The autonomous driving system driving the SUV was trained to select the best alternative out of as set of possible outcomes and so the SUV abruptly swerves into the left lane currently occupied by a small sedan going the same speed as the SUV. The SUV ADS had determined that saving the lives of two adults and two children was the greater good even though there was a significant risk that the small sedan would be forced into oncoming traffic travelling East putting the two adult occupants at mortal risk.


Conversing with chatbots: DialoGPT

#artificialintelligence

In a previous module, we examined language models and explored n-gram and neural approaches. We found that the n-gram approach is generally better for higher values of N but this may be constrained by available compute resources. There was also the concern about the lack of representation for n-grams not present in the training corpus. On the other hand, applying subword tokenization methods such as Byte Pair Encoding and Wordpiece, recent neural approaches are able to resolve the issues with n-gram language models and show impressive results. We also traced the development of neural language models from feedforward networks that rely on word embeddings and fixed input length to recurrent neural networks which allowed for variable length input but struggled to capture long term dependencies.


Five Reasoning Methods To Rule Them All.

#artificialintelligence

The five reasoning methods are also called the five tribes. They help to solve the Master Algorithm. Each of the five tribes has a different technique and strategy for solving problems that result in unique algorithms. If we are successful to combine these algorithms, then it will lead us to (theoretically) the master algorithm. These are defined by the Portugues author, Pedro Domingos in his book The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World.


Skip-Gram Neural Network for Graphs

#artificialintelligence

This article will go into more details of node embeddings. If you lack intuition and understanding of node embeddings, check out this previous article that covered the intuition of node embeddings. But if you are ready. In the level-1 explanation of node embeddings I motivated why we need embeddings such that we have a vector form of graph data. Embeddings should capture the graph topology, relationships between nodes and further information.