AITopics

Genre: Instructional Material (0.33)

Industry:

Automobiles & Trucks (0.78)
Information Technology (0.72)
Leisure & Entertainment > Games (0.56)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.54)
(2 more...)

Lorena, Ana C., Garcia, Luís P. F., Lehmann, Jens, Souto, Marcilio C. P., Ho, Tin K.

How Complex is your classification problem? A survey on measuring classification complexity

arXiv.org Machine LearningAug-10-2018

Extracting characteristics from the training datasets of classification problems has proven effective in a number of meta-analyses. Among them, measures of classification complexity can estimate the difficulty in separating the data points into their expected classes. Descriptors of the spatial distribution of the data and estimates of the shape and size of the decision boundary are among the existent measures for this characterization. This information can support the formulation of new data-driven pre-processing and pattern recognition techniques, which can in turn be focused on challenging characteristics of the problems. This paper surveys and analyzes measures which can be extracted from the training datasets in order to characterize the complexity of the respective classification problems. Their use in recent literature is also reviewed and discussed, allowing to prospect opportunities for future work in the area. Finally, descriptions are given on an R package named Extended Complexity Library (ECoL) that implements a set of complexity measures and is made publicly available.

artificial intelligence, fuzzy logic, machine learning, (19 more...)

1808.03591

Country:

South America > Brazil > São Paulo (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California (0.04)
(5 more...)

Genre:

Overview (1.00)
Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.67)
(2 more...)

Poulos, Jason, Valle, Rafael

Missing Data Imputation for Supervised Learning

arXiv.org Machine LearningAug-6-2018

Missing data imputation can help improve the performance of prediction models in situations where missing data hide useful information. This paper compares methods for imputing missing categorical data for supervised classification tasks. We experiment on two machine learning benchmark datasets with missing categorical data, comparing classifiers trained on non-imputed (i.e., one-hot encoded) or imputed data with different levels of additional missing-data perturbation. We show imputation methods can increase predictive accuracy in the presence of missing-data perturbation, which can actually improve prediction accuracy by regularizing the classifier. We achieve the state-of-the-art on the Adult dataset with missing-data perturbation and k-nearest-neighbors (k-NN) imputation.

artificial intelligence, data quality, machine learning, (17 more...)

1610.09075

Country:

North America > United States > California > Alameda County > Berkeley (0.14)
Africa > South Africa (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Government > Regional Government (0.47)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

#artificialintelligenceJul-23-2018, 00:35:04 GMT

7 Machine Learning Algorithms To Start Learning.... MarkTechPost

It is a simple algorithm which can be used as a performance baseline. This algorithm methodology is used mostly for forecasting and finding out cause and effect relationship between data variables. Its purpose from a database is to read the data points which are separated into several classes and then predict the new sample point classification. It gives great results when used for textual data analysis. It is an unsupervised learning used in unlabelled data sources. However, it is mostly used in classification cases.

algorithm, artificial intelligence, machine learning, (9 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.43)

#artificialintelligenceJul-21-2018, 08:54:00 GMT

Machine Learning Training Bootcamp : Tonex.Com

Machine Learning training bootcamp is a 3-day specialized training course that covers the essentials of machine learning, a shape and utilization of man-made reasoning (AI). Machine learning computerizes the information investigation process by empowering PCs, machines and IoT to learn and adjust through experience connected to particular undertakings without unequivocal programming. Learning Objectives: Learn about Artificial Intelligence and Machine Learning List similarities and differences between AI, Machine Learning and Data Mining Learn how Artificial Intelligence uses data to offer solutions to existing problems Explore how Machine Learning goes beyond AI to offer data necessary for a machine to learn, adapt and optimize / Clarify how Data Mining can serve as foundation for AI and machine learning to use existing information to highlight patterns List the various applications of machine learning and related algorithms Learn how to classify the types of learning such as supervised and unsupervised learning Implement supervised learning techniques such as linear and logistic regression Use unsupervised learning algorithms including deep learning, clustering and recommender systems (RS) used to help users find new items or services, such as books, music, transportation, people and jobs based on information about the user or the recommended item Learn about classification data and Machine Learning models Select the best algorithms applied to Machine Learning Make accurate predictions and analysis to effectively solve potential problems List Machine Learning concepts, principles, algorithms, tools and applications Learn the concepts and operation of support neural networks, vector machines, kernel SVM, naive bayes, decision tree classifier, random forest classifier, logistic regression, K-nearest neighbors, K-means and clustering Comprehend the theoretical concepts and how they relate to the practical aspects of machine learning / Be able to model a wide variety of robust machine learning algorithms including deep learning, clustering and recommendation systems Course Agenda and Topics: The Basics of Machine Learning Machine Learning Techniques, Tools and Algorithms Data and Data Science Review of Terminology and Principles Applied Artificial Intelligence (AI) and Machine Learning Popular Machine Learning Methods Learning Applied to Machine Learning Principal component Analysis Principles of Supervised Machine Learning Algorithms Principles of Unsupervised Machine Learning Regression Applied to Machines Learning Principles of Neural Networks Large Scale Machine Learning Introduction to Deep Learning Applying Machine Learning Overview of Algorithms Overview of Tools and Processes Request More Information .

artificial intelligence, deep learning, machine learning training bootcamp, (6 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.97)

Industry: Education (0.61)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.58)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.58)

#artificialintelligenceJul-13-2018, 00:39:41 GMT

AI threatens yet more jobs – now, lab rats: Animal testing could be on the way out, thanks to machine learning

Machine learning algorithms can help scientists predict chemical toxicity to a similar degree of accuracy as animal testing, according to a paper published this week in Toxicological Sciences. A whopping €3bn (over $3.5bn) is spent every year to study how the negative impacts of chemicals on animals like rats, rabbits or monkeys. The top nine most frequently tested safety experiments resulted in the death of the poor critters 57 per cent of the time in Europe in 2011. By using software, chemists may be able to spend less on animal testing and save more creatures. To demonstrate this, first, a team of researchers scoured through a range of databases to label 80,908 different chemicals.

animal testing, artificial intelligence, machine learning, (3 more...)

Country: Europe (0.26)

Genre: Research Report (0.43)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.35)

Sarker, Md. Kamruzzaman, Alam, Kazi Md. Rokibul, Arifuzzaman, Md.

Emotion Recognition from Speech based on Relevant Feature and Majority Voting

arXiv.org Machine LearningJul-10-2018

This paper proposes an approach to detect emotion from human speech employing majority voting technique over several machine learning techniques. The contribution of this work is in two folds: firstly it selects those features of speech which is most promising for classification and secondly it uses the majority voting technique that selects the exact class of emotion. Here, majority voting technique has been applied over Neural Network (NN), Decision Tree (DT), Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). Input vector of NN, DT, SVM and KNN consists of various acoustic and prosodic features like Pitch, Mel-Frequency Cepstral coefficients etc. From speech signal many feature have been extracted and only promising features have been selected. To consider a feature as promising, Fast Correlation based feature selection (FCBF) and Fisher score algorithms have been used and only those features are selected which are highly ranked by both of them. The proposed approach has been tested on Berlin dataset of emotional speech [3] and Electromagnetic Articulography (EMA) dataset [4]. The experimental result shows that majority voting technique attains better accuracy over individual machine learning techniques. The employment of the proposed approach can effectively recognize the emotion of human beings in case of social robot, intelligent chat client, call-center of a company etc.

artificial intelligence, emotion, machine learning, (15 more...)

doi: 10.1109/ICIEV.2014.6850685

1807.03909

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > New York > New York County > New York City (0.04)
(7 more...)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.56)

arXiv.org Machine LearningJul-9-2018

Towards Non-Parametric Learning to Rank

Liu, Ao, Wu, Qiong, Liu, Zhenming, Xia, Lirong

This paper studies a stylized, yet natural, learning-to-rank problem and points out the critical incorrectness of a widely used nearest neighbor algorithm. We consider a model with $n$ agents (users) $\{x_i\}_{i \in [n]}$ and $m$ alternatives (items) $\{y_j\}_{j \in [m]}$, each of which is associated with a latent feature vector. Agents rank items nondeterministically according to the Plackett-Luce model, where the higher the utility of an item to the agent, the more likely this item will be ranked high by the agent. Our goal is to find neighbors of an arbitrary agent or alternative in the latent space. We first show that the Kendall-tau distance based kNN produces incorrect results in our model. Next, we fix the problem by introducing a new algorithm with features constructed from "global information" of the data matrix. Our approach is in sharp contrast to most existing feature engineering methods. Finally, we design another new algorithm identifying similar alternatives. The construction of alternative features can be done using "local information," highlighting the algorithmic difference between finding similar agents and similar alternatives.

artificial intelligence, machine learning, proceedings, (13 more...)

1807.03395

Country:

North America > United States > Washington > King County > Bellevue (0.04)
North America > United States > Virginia > Williamsburg (0.04)
North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
(5 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning > Representation Of Examples (0.34)

arXiv.org Machine LearningJul-4-2018

An Unsupervised Learning Classifier with Competitive Error Performance

Nissani, Daniel N.

An unsupervised learning classification model is described. It achieves classification error probability competitive with that of popular supervised learning classifiers such as SVM or kNN. The model is based on the incremental execution of small step shift and rotation operations upon selected discriminative hyperplanes at the arrival of input samples. When applied, in conjunction with a selected feature extractor, to a subset of the ImageNet dataset benchmark, it yields 6.2 % Top 3 probability of error; this exceeds by merely about 2 % the result achieved by (supervised) k-Nearest Neighbor, both using same feature extractor. This result may also be contrasted with popular unsupervised learning schemes such as k-Means which is shown to be practically useless on same dataset.

artificial intelligence, hyperplane, machine learning, (15 more...)

1806.09385

Country:

North America > United States > California (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

arXiv.org Machine LearningJul-3-2018

Breast Cancer Diagnosis via Classification Algorithms

Entezari, Reihaneh

In this paper, we analyze the Wisconsin Diagnostic Breast Cancer Data using Machine Learning classification techniques, such as the SVM, Bayesian Logistic Regression (Variational Approximation), and K-Nearest-Neighbors. We describe each model, and compare their performance through different measures. We conclude that SVM has the best performance among all other classifiers, while it competes closely with the Bayesian Logistic Regression that is ranked second best method for this dataset.

artificial intelligence, logistic regression, machine learning, (14 more...)

1807.01334

Country:

North America > United States > Wisconsin (0.25)
North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre: Research Report > New Finding (0.59)

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.71)