The Humble Logarithm

Forbes - Tech

Every semester I teach calculus, I ask my students if they know why we care about logarithms. And every semester I am met with a sea of blank faces and shoulder shrugs. "What is the most important property logs have? What is the logarithm of a product?" The answer: logarithms turn products into sums.
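That product-to-sum identity, log(ab) = log(a) + log(b), is easy to check numerically. A minimal Python sketch (the values of a and b are arbitrary):

import math

# The product-to-sum identity: log(a * b) == log(a) + log(b)
a, b = 123.0, 456.0
lhs = math.log(a * b)
rhs = math.log(a) + math.log(b)
assert math.isclose(lhs, rhs)
print(lhs, rhs)  # both roughly 10.9346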


Log Base 2 Calculator

#artificialintelligence

The online Log Base 2 Calculator computes the log base 2 of a number x, generally written as lb(x) or log2(x). Log base 2, also known as the binary logarithm, is the logarithm to base 2: the binary logarithm of x is the power to which 2 must be raised to obtain x. For example, the binary logarithm of 1 is 0, of 2 is 1, and of 4 is 2. It is often used in computer science and information theory.
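For illustration, the same computation takes a few lines of Python, where math.log2 is the standard-library binary logarithm:

import math

# The binary logarithm: the power to which 2 must be raised to obtain x
for x in (1, 2, 4, 1024):
    print(f"log2({x}) = {math.log2(x)}")
# prints 0.0, 1.0, 2.0 and 10.0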


The 17 equations that changed the course of history

@machinelearnbot

Mathematics is all around us, and it has shaped our understanding of the world in countless ways. In 2013, mathematician and science author Ian Stewart published a book on 17 Equations That Changed The World. We recently came across a convenient table, shared on Dr. Paul Coxon's Twitter account by mathematics tutor and blogger Larry Phillips, that summarizes the equations. For example, a right triangle drawn on the surface of a sphere need not obey the Pythagorean theorem. A logarithm for a particular base tells you what power you need to raise that base to in order to get a number.


The Naive Bayes Classifier explained

@machinelearnbot

With the Naive Bayes model, we do not take only a small set of positive and negative words into account, but all words the NB classifier was trained with, i.e. all words present in the training set. If a word has not appeared in the training set, we have no data available and apply Laplace smoothing (use 1 instead of the conditional probability of the word). The probability that a document belongs to a class C is given by the class probability P(C) multiplied by the product of the conditional probabilities of each word for that class. In theory we want a training set as large as possible, since that will increase accuracy; but multiplying so many small conditional probabilities together quickly underflows floating-point arithmetic, so in practice the product is computed as a sum of logarithms.
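A minimal sketch of this log-space trick in Python. The toy corpus and the train/classify helpers below are ours, and we use standard add-one (Laplace) smoothing for unseen words rather than the shortcut of substituting 1:

import math
from collections import Counter

def train(docs_by_class):
    """docs_by_class maps a class label to a list of tokenized documents.
    Returns per-class log priors and smoothed word log probabilities."""
    vocab = {w for docs in docs_by_class.values() for doc in docs for w in doc}
    total_docs = sum(len(docs) for docs in docs_by_class.values())
    model = {}
    for c, docs in docs_by_class.items():
        counts = Counter(w for doc in docs for w in doc)
        total = sum(counts.values())
        log_prior = math.log(len(docs) / total_docs)
        # Add-one smoothing: unseen words get a small nonzero probability
        log_probs = {w: math.log((counts[w] + 1) / (total + len(vocab)))
                     for w in vocab}
        log_unseen = math.log(1 / (total + len(vocab)))
        model[c] = (log_prior, log_probs, log_unseen)
    return model

def classify(model, doc):
    # Score in log space: log P(C) + sum of log P(w|C); the product of
    # probabilities becomes a sum of logarithms, which cannot underflow.
    scores = {}
    for c, (log_prior, log_probs, log_unseen) in model.items():
        scores[c] = log_prior + sum(log_probs.get(w, log_unseen) for w in doc)
    return max(scores, key=scores.get)

docs = {"pos": [["good", "great"], ["great", "fun"]],
        "neg": [["bad", "boring"]]}
model = train(docs)
print(classify(model, ["good", "fun"]))  # -> pos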


GloVe: Global Vectors for Word Representation

@machinelearnbot

GloVe is essentially a log-bilinear model with a weighted least-squares objective. The main intuition underlying the model is the simple observation that ratios of word-word co-occurrence probabilities have the potential for encoding some form of meaning. For example, consider the co-occurrence probabilities for target words ice and steam with various probe words from the vocabulary. As one might expect, ice co-occurs more frequently with solid than it does with gas, whereas steam co-occurs more frequently with gas than it does with solid. Both words co-occur with their shared property water frequently, and both co-occur with the unrelated word fashion infrequently.
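To make "log-bilinear model with a weighted least-squares objective" concrete, here is a sketch of the GloVe cost function in Python/NumPy. The weighting function with x_max = 100 and alpha = 0.75 follows the GloVe paper (Pennington et al., 2014); the variable names and the toy data are ours:

import numpy as np

def glove_loss(W, W_tilde, b, b_tilde, X, x_max=100.0, alpha=0.75):
    """Weighted least-squares GloVe objective:
    sum over nonzero counts X[i, j] of
        f(X[i, j]) * (w_i . w~_j + b_i + b~_j - log X[i, j])**2
    where f(x) = min((x / x_max)**alpha, 1) down-weights rare pairs."""
    loss = 0.0
    rows, cols = np.nonzero(X)  # log X[i, j] is only defined where X[i, j] > 0
    for i, j in zip(rows, cols):
        weight = min((X[i, j] / x_max) ** alpha, 1.0)
        diff = W[i] @ W_tilde[j] + b[i] + b_tilde[j] - np.log(X[i, j])
        loss += weight * diff ** 2
    return loss

# Toy example: vocabulary of 4 words, 8-dimensional vectors
rng = np.random.default_rng(0)
V, d = 4, 8
X = rng.integers(0, 50, size=(V, V)).astype(float)  # toy co-occurrence counts
W, W_tilde = rng.normal(size=(V, d)), rng.normal(size=(V, d))
b, b_tilde = rng.normal(size=V), rng.normal(size=V)
print(glove_loss(W, W_tilde, b, b_tilde, X))

Note how the logarithm appears once again: the model fits dot products of word vectors to the log of co-occurrence counts, so ratios of probabilities become differences of vectors.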