AITopics

Industry: Health & Medicine > Health Care Technology (0.51)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.69)

arXiv.org Machine LearningAug-25-2017

Joint Structured Learning and Predictions under Logical Constraints in Conditional Random Fields

Meunier, Jean-Luc

This paper is concerned with structured machine learning, in a supervised machine learning context. It discusses how to make joint structured learning on interdependent objects of different nature, as well as how to enforce logical constraints when predicting labels. We explain how this need arose in a Document Understanding task. We then discuss a general extension to Conditional Random Fields (CRF) for this purpose and present the contributed open source implementation on top of the open source PyStruct library. We evaluate its performance on a publicly available dataset. Keywords: supervised machine learning, structured prediction, conditional random fields.

artificial intelligence, machine learning, node, (15 more...)

1708.07644

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.84)

@machinelearnbotAug-24-2017, 04:20:05 GMT

Understanding overfitting: an inaccurate meme in Machine Learning

This post was inspired by a recent post by Andrew Gelman, who defined'overfitting' as follows: Overfitting is when you have a complicated model that gives worse predictions, on average, than a simpler model. Preamble There is a lot of confusion among practitioners regarding the concept of overfitting. Applying cross-validation prevents overfitting and a good out-of-sample performance, low generalisation error in unseen data, indicates not an overfit. This statement is of course not true: cross-validation does not prevent your model to overfit and good out-of-sample performance does not guarantee not-overfitted model. What actually people refer to in one aspect of this statement is called overtraining.

artificial intelligence, inductive learning, machine learning, (15 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.42)

Communications of the ACMAug-23-2017, 17:55:30 GMT

Moving Beyond the Turing Test with the Allen AI Science Challenge

The field of artificial intelligence has made great strides recently, as in AlphaGo's victories in the game of Go over world champion South Korean Lee Sedol in March 2016 and top-ranked Chinese Go player Ke Jie in May 2017, leading to great optimism for the field. But are we really moving toward smarter machines, or are these successes restricted to certain classes of problems, leaving others untouched? In 2015, the Allen Institute for Artificial Intelligence (AI2) ran its first Allen AI Science Challenge, a competition to test machines on an ostensibly difficult task--answering eighth-grade science questions. Our motivations were to encourage the field to set its sights more broadly by exploring a problem that appears to require modeling, reasoning, language understanding, and commonsense knowledge in order to probe the state of the art while sowing the seeds for possible future breakthroughs. Challenge problems have historically played an important role in motivating and driving progress in research.

allen ai science challenge, competition, turing test, (12 more...)

Communications of the ACM

Country:

Asia > South Korea (0.24)
North America > United States > Washington > King County > Seattle (0.06)
North America > United States > New York (0.05)
(7 more...)

Industry:

Leisure & Entertainment > Games > Go (0.88)
Education > Educational Setting > K-12 Education (0.56)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)

#artificialintelligenceAug-19-2017, 19:36:57 GMT

deeplearnjs-machine-learning-library-136309.html?utm_content=buffer3a7f6&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer

It's a machine learning's world! Thorat and Smilkov, both software engineers in the Big Picture team at Google revealed in a blog post announcing deeplearn.js And if nothing else, the browser is one of the world's most popular programming platforms." Web machine learning libraries are hardly a novelty but one of the biggest disadvantages is that they have been either limited by the speed of Javascript or restricted to inference. The software engineers explained that the API imitates the structure of TensorFlow and NumPy, with a delayed execution model for training (like TensorFlow), and an immediate execution model for inference (like NumPy).

artificial intelligence, inductive learning, library, (14 more...)

Industry: Information Technology (0.98)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.64)

#artificialintelligenceAug-17-2017, 17:30:31 GMT

Understanding overfitting: an inaccurate meme in supervised learning

Preamble There is a lot of confusion among practitioners regarding the concept of overfitting. It seems like, a kind of an urban legend or a meme, a folklore is circulating in data science or allied fields with the following statement: Applying cross-validation prevents overfitting and a good out-of-sample performance, low generalisation error in unseen data, indicates not an overfit. This statement is of course not true: cross-validation does not prevent your model to overfit and good out-of-sample performance does not guarantee not-overfitted model. What actually people refer to in one aspect of this statement is called overtraining. Unfortunately, this meme is not only propagated in industry but in some academic papers as well. This might be at best a confusion on jargon.

artificial intelligence, inductive learning, machine learning, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.42)

arXiv.org Machine LearningAug-15-2017

Theoretical Foundation of Co-Training and Disagreement-Based Algorithms

Wang, Wei, Zhou, Zhi-Hua

Disagreement-based approaches generate multiple classifiers and exploit the disagreement among them with unlabeled data to improve learning performance. Co-training is a representative paradigm of them, which trains two classifiers separately on two sufficient and redundant views; while for the applications where there is only one view, several successful variants of co-training with two different classifiers on single-view data instead of two views have been proposed. For these disagreement-based approaches, there are several important issues which still are unsolved, in this article we present theoretical analyses to address these issues, which provides a theoretical foundation of co-training and disagreement-based approaches. Keywords: machine learning, semi-supervised learning, disagreement-based learning, co-training, multi-view classification, combination 1. Introduction Learning from labeled training data is well-established in traditional machine learning, but labeling the data is time-consuming, sometimes may be very expensive since it requires human efforts. In many practical applications, unlabeled data can be obtained abundantly and cheaply.

artificial intelligence, classifier, machine learning, (18 more...)

1708.04403

Country:

Europe (1.00)
North America > United States > California (0.67)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.76)

Ciliberto, Carlo, Rudi, Alessandro, Rosasco, Lorenzo, Pontil, Massimiliano

Consistent Multitask Learning with Nonlinear Output Relations

arXiv.org Machine LearningAug-10-2017

Key to multitask learning is exploiting relationships between different tasks to improve prediction performance. If the relations are linear, regularization approaches can be used successfully. However, in practice assuming the tasks to be linearly related might be restrictive, and allowing for nonlinear structures is a challenge. In this paper, we tackle this issue by casting the problem within the framework of structured prediction. Our main contribution is a novel algorithm for learning multiple tasks which are related by a system of nonlinear equations that their joint outputs need to satisfy. We show that the algorithm is consistent and can be efficiently implemented. Experimental results show the potential of the proposed method.

artificial intelligence, estimator, machine learning, (17 more...)

1705.08118

Country: North America > United States > Massachusetts (0.28)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.85)

Northcutt, Curtis G., Wu, Tailin, Chuang, Isaac L.

Learning with Confident Examples: Rank Pruning for Robust Classification with Noisy Labels

arXiv.org Machine LearningAug-9-2017

Noisy PN learning is the problem of binary classification when training examples may be mislabeled (flipped) uniformly with noise rate rho1 for positive examples and rho0 for negative examples. We propose Rank Pruning (RP) to solve noisy PN learning and the open problem of estimating the noise rates, i.e. the fraction of wrong positive and negative labels. Unlike prior solutions, RP is time-efficient and general, requiring O(T) for any unrestricted choice of probabilistic classifier with T fitting time. We prove RP has consistent noise estimation and equivalent expected risk as learning with uncorrupted labels in ideal conditions, and derive closed-form solutions when conditions are non-ideal. RP achieves state-of-the-art noise estimation and F1, error, and AUC-PR for both MNIST and CIFAR datasets, regardless of the amount of noise and performs similarly impressively when a large portion of training examples are noise drawn from a third distribution. To highlight, RP with a CNN classifier can predict if an MNIST digit is a "one"or "not" with only 0.25% error, and 0.46 error across all digits, even when 50% of positive examples are mislabeled and 50% of observed positive labels are mislabeled negative examples.

artificial intelligence, inductive learning, machine learning, (17 more...)

1705.01936

Country: North America > United States > Massachusetts (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.93)

#artificialintelligenceAug-7-2017, 20:50:21 GMT

K - Nearest Neighbors - KNN Fun and Easy Machine Learning

In pattern recognition, the KNN algorithm is a method for classifying objects based on closest training examples in the feature space. KNN is a type of instance-based learning, or lazy learning where the function is only approximated locally and all computation is delayed until classification. The KNN is the fundamental and simplest classification technique when there is little or no prior knowledge about the distribution of the data. The K in KNN refers to number of nearest neighbors that the classifier will use to make its predication. In this video we use Game of Thrones example to explain kNN.

artificial intelligence, fun and easy machine learning, inductive learning, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.85)
Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (0.67)