AITopics | Nearest Neighbor Methods

Collaborating Authors

Nearest Neighbor Methods

News Overviews Instructional Materials AI-Alerts Classics

KNNImputer

#artificialintelligenceJul-21-2020, 14:51:15 GMT

The idea in kNN methods is to identify'k' samples in the dataset that are similar or close in the space. Then we use these'k' samples to estimate the value of the missing data points. Each sample's missing values are imputed using the mean value of the'k'-neighbors found in the dataset. Let's look at an example to understand this. Consider a pair of observations in a two-dimensional space (2,0), (2,2), (3,3).

artificial intelligence, machine learning, neighbor, (18 more...)

#artificialintelligence

Technology:

Information Technology > Data Science (0.91)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.32)

Add feedback

Every Machine Learning Algorithm Can Be Represented as a Neural Network

#artificialintelligenceJul-18-2020, 04:05:21 GMT

It seems that all of the work in machine learning -- starting from early research in the 1950s -- cumulated with the creation of the neural network. Successively, algorithm after new algorithm were proposed, from logistic regression to support vector machines, but the neural network is, very literally, the algorithm of algorithms and the pinnacle of machine learning. It's a universal generalization of what machine learning is, instead of one attempt of doing it. In this sense, it is more of a framework and a concept than simply an algorithm, and this is evident given the massive amount of freedom in constructing neural networks -- hidden layer & node counts, activation functions, optimizers, loss functions, network types (convolutional, recurrent, etc.), and specialized layers (batch norm, dropout, etc.), to name a few. From this perspective of neural networks being a concept rather than a rigid algorithm comes a very interesting corollary: any machine learning algorithm, be it decision trees or k-nearest neighbors, can be represented using a neural network.

artificial intelligence, machine learning, neural network, (18 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.50)

Add feedback

Recommender Systems from Learned Embeddings

#artificialintelligenceJul-14-2020, 20:01:03 GMT

We will use Movie ID and User ID to generate their corresponding embeddings. These embeddings are generated through the model training process along with other parameters. Once we have the embeddings, we build a K-Nearest Neighbor (KNN) model. Then whenever there is a user, we can get that user's embedding from our Neural Network model. We use this embedding to lookup in the KNN database and recommend top -- K movies to this user.

artificial intelligence, movie, neural network, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.74)

Add feedback

Overcoming an Imbalanced Dataset using Oversampling.

#artificialintelligenceJul-13-2020, 02:57:34 GMT

How oversampling yielded great results for classifying cases of Sexual Harassment. When it comes to data science, sexual harassment is an imbalanced data problem, meaning there are few (known) instances of harassment in the entire dataset. An imbalanced problem is defined as a dataset which has disproportional class counts. Oversampling is one way to combat this by creating synthetic minority samples. SMOTE -- Synthetic Minority Over-sampling Technique -- is a common oversampling method widely used in machine learning with imbalanced high-dimensional datasets using Oversampling.

artificial intelligence, dataset, machine learning, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.36)

Add feedback

9 Key Machine Learning Algorithms Explained in Plain English

#artificialintelligenceJul-12-2020, 02:49:34 GMT

Machine learning [https://gum.co/pGjwd] is changing the world. Google uses machine learning to suggest search results to users. Netflix uses it to recommend movies for you to watch. Facebook uses machine learning to suggest people you may know. Machine learning has never been more important. At the same time, understanding machine learning is hard. The field is full of jargon. And the number of different ML algorithms grows each year. This article will introduce you to the fundamental concepts

algorithm, artificial intelligence, information technology services, (18 more...)

#artificialintelligence

Industry:

Media > Television (0.48)
Information Technology > Services (0.34)
Leisure & Entertainment > Sports (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.34)

Add feedback

K-Nearest Neighbors (KNN) Algorithm In Machine Learning

#artificialintelligenceJul-10-2020, 19:41:46 GMT

K-Nearest Neighbors (KNN) Algoritm in Machine Learning will help you to master all the concepts of KNN. KNN algorithm is one of the simplest, non-parametric, lazy classification learning algorithm. Its purpose is to use a dataset in which the data points are separated into several classes to predict the classification of a new sample point. This tutorial on "KNN Algoritm in Machine Learning" will help you to master all the concepts of K-nearest neighbors. Its purpose is to use a dataset in which the data points are separated into several classes to predict the classification of a new sample point.

artificial intelligence, k-nearest neighbor, machine learning, (6 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (1.00)

Add feedback

An Efficient Data Imputation Technique for Human Activity Recognition

Pires, Ivan Miguel, Hussain, Faisal, Garcia, Nuno M., Zdravevski, Eftim

arXiv.org Artificial IntelligenceJul-8-2020

The tremendous applications of human activity recognition are surging its span from health monitoring systems to virtual reality applications. Thus, the automatic recognition of daily life activities has become significant for numerous applications. In recent years, many datasets have been proposed to train the machine learning models for efficient monitoring and recognition of human daily living activities. However, the performance of machine learning models in activity recognition is crucially affected when there are incomplete activities in a dataset, i.e., having missing samples in dataset captures. Therefore, in this work, we propose a methodology for extrapolating the missing samples of a dataset to better recognize the human daily living activities. The proposed method efficiently pre-processes the data captures and utilizes the k-Nearest Neighbors (KNN) imputation technique to extrapolate the missing samples in dataset captures. The proposed methodology elegantly extrapolated a similar pattern of activities as they were in the real dataset.

artificial intelligence, dataset, health & medicine, (18 more...)

arXiv.org Artificial Intelligence

2007.04456

Country: Europe > Portugal (0.47)

Genre: Research Report (0.50)

Industry: Health & Medicine > Consumer Health (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.54)

Add feedback

Understanding Machine learning

#artificialintelligenceJul-5-2020, 18:31:19 GMT

Artificial intelligence is everywhere in the press. Machine learning is everywhere in companies. What is the link between these three disciplines, and above all, what differentiates them? Indeed, in the common imagination, when we talk about artificial intelligence, we mean by this a program that can perform human tasks, learning on its own. However, AI as defined in the industry is rather "more or less advanced algorithms which imitate human actions".

health & medicine, immunology, neural network, (11 more...)

#artificialintelligence

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.30)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.30)

Add feedback

Missing Features Reconstruction Using a Wasserstein Generative Adversarial Imputation Network

Friedjungová, Magda, Vašata, Daniel, Balatsko, Maksym, Jiřina, Marcel

arXiv.org Machine LearningJun-21-2020

Missing data is one of the most common preprocessing problems. In this paper, we experimentally research the use of generative and non-generative models for feature reconstruction. Variational Autoencoder with Arbitrary Conditioning (VAEAC) and Generative Adversarial Imputation Network (GAIN) were researched as representatives of generative models, while the denoising autoencoder (DAE) represented non-generative models. Performance of the models is compared to traditional methods k-nearest neighbors (k-NN) and Multiple Imputation by Chained Equations (MICE). Moreover, we introduce WGAIN as the Wasserstein modification of GAIN, which turns out to be the best imputation model when the degree of missingness is less than or equal to 30%. Experiments were performed on real-world and artificial datasets with continuous features where different percentages of features, varying from 10% to 50%, were missing. Evaluation of algorithms was done by measuring the accuracy of the classification model previously trained on the uncorrupted dataset. The results show that GAIN and especially WGAIN are the best imputers regardless of the conditions. In general, they outperform or are comparative to MICE, k-NN, DAE, and VAEAC.

artificial intelligence, machine learning, neural network, (15 more...)

arXiv.org Machine Learning

doi: 10.1007/978-3-030-50423-6_17

2006.11783

Country: North America > Canada > Quebec (0.14)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.55)

Add feedback

Dex-Net AR uses Apple's ARKit to train robots to grasp objects

#artificialintelligenceJun-14-2020, 18:52:23 GMT

UC Berkeley AI researchers are using an iPhone X and Apple's ARKit to train a robotic arm how to grasp an object. ARKit creates point clouds from data generated by moving an RGB camera around an object for two minutes. Robotic grasping is a particular robotics subfield focused on the challenge of teaching a robot to pick up, move, manipulate, or grasp an object. The Dexterity Network, or Dex-Net, research project at UC Berkeley's Autolab dates back to 2017 and includes open source training data sets and pretrained models for robotic grasping in an ecommerce bin-picking scenario. The ability for robots to quickly learn how to grasp objects has a big impact on how automated warehouses like Amazon fulfillment centers can become.

artificial intelligence, dex-net ar, machine learning, (7 more...)

#artificialintelligence

AI-Alerts: 2020 > 2020-06 > AAAI AI-Alert for Jun 16, 2020 (1.00)

Industry: Education > Educational Setting (0.80)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.34)

Add feedback