AITopics

Country: Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.52)

#artificialintelligence Jul-5-2016, 14:30:30 GMT

Overfitting In Machine Learning (IT Best Kept Secret Is Optimization)

artificial intelligence, machine learning, training example, (16 more...)

Country: Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.53)

Los Angeles Times Jul-4-2016, 16:25:11 GMT

Roger Federer ties a Wimbledon record set by Jimmy Connors

Looking in fine form after two days of rest, Roger Federer equaled Jimmy Connors' Open-era record by reaching his 14th Wimbledon quarterfinal and added to his own mark by making it at least that far at a Grand Slam tournament for the 48th time. Federer, a seven-time champion at the All England Club, has not dropped a set in the tournament through four matches after beating unseeded American Steve Johnson 6-2, 6-3, 7-5 at Centre Court on Monday. Johnson was making his debut in the fourth round of a major. The No. 3-seeded Federer hadn't played since Friday, when he was the only man to finish a third-round match. He next faces No. 9 Marin Cilic, the 2014 US Open champion, who advanced when Kei Nishikori retired from their fourth-round match.

artificial intelligence, jimmy connor, machine learning, (9 more...)

Los Angeles Times

Country:

Europe > United Kingdom > England > Greater London > London > Wimbledon (0.76)
Europe > Germany (0.08)
Europe > France (0.08)

Industry: Leisure & Entertainment > Sports > Tennis (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.40)

#artificialintelligence Jul-3-2016, 09:20:21 GMT

A small and easy introduction to Transductive Learning

Input: a) A set of labelled examples, each pairing an input vector with its corresponding output label. Output: The set of expected labels for all instances whose labels are to be predicted. There are two ways (or rather, two philosophies) you could use to solve this problem. Induction, in the context of learning, is the attempted discovery of rules/generalizations based on analysis of collected data. 'Attempted discovery' is the key term here: the generalizations are not facts, but approximations based on evidence you have gathered.
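As a rough illustration of induction as described in this snippet, the sketch below derives a simple rule from a handful of labelled examples and then applies it to new instances. The data, the one-dimensional inputs, and the helper name `induce_threshold` are all assumptions made for this example; they do not come from the linked article.

```python
# Hypothetical labelled examples: (input value, label).
labelled = [(1.0, 0), (2.0, 0), (4.0, 1), (5.0, 1)]

def induce_threshold(examples):
    """Induce a rule 'label is 1 when x > threshold' from the examples.

    The threshold is the midpoint between the largest class-0 input and the
    smallest class-1 input. It is an approximation supported by the evidence,
    not a fact about the underlying process.
    """
    highest_zero = max(x for x, label in examples if label == 0)
    lowest_one = min(x for x, label in examples if label == 1)
    return (highest_zero + lowest_one) / 2

threshold = induce_threshold(labelled)

# Apply the induced rule to instances that were not in the collected data.
unlabelled = [1.5, 3.5, 4.2]
predicted = [int(x > threshold) for x in unlabelled]
print(threshold, predicted)
```

Adding a new labelled example such as `(2.8, 1)` would move the threshold, which is the sense in which the induced rule is only an approximation based on the evidence gathered so far.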

artificial intelligence, inductive learning, machine learning, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.79)

Wongchaisuwat, Papis, Klabjan, Diego, Jonnalagadda, Siddhartha R.

A Semi-supervised learning approach to enhance health care Community-based Question Answering: A case study in alcoholism

arXiv.org Machine Learning Jul-3-2016

Community-based Question Answering (CQA) sites play an important role in addressing health information needs. However, a significant number of posted questions remain unanswered. Automatically answering the posted questions can provide a useful source of information for online health communities. In this study, we developed an algorithm to automatically answer health-related questions based on past questions and answers (QA). We also aimed to understand which information embedded within online health content serves as good features for identifying valid answers. Our proposed algorithm uses information retrieval techniques to identify candidate answers from resolved QA. In order to rank these candidates, we implemented a semi-supervised learning algorithm that extracts the best answer to a question. We assessed this approach on a curated corpus from Yahoo! Answers and compared against a rule-based string similarity baseline. On our dataset, the semi-supervised learning algorithm has an accuracy of 86.2%. UMLS-based (health-related) features used in the model enhance the algorithm's performance by approximately 8%. A reasonably high rate of accuracy is obtained given that the data is considerably noisy. Important features distinguishing a valid answer from an invalid answer include text length, the number of stop words contained in a test question, the distance between the test question and other questions in the corpus, and the number of overlapping health-related terms between questions. Overall, our automated QA system based on historical QA pairs is shown to be effective according to the data set in this case study. It is developed for general use in the health care domain and can also be applied to other CQA sites.

information retrieval, machine learning, question answering, (23 more...)

1607.00706

Country:

Europe (1.00)
North America > United States > Massachusetts (0.28)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine > Consumer Health (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.87)
(4 more...)

arXiv.org Machine Learning Jun-30-2016

Ballpark Learning: Estimating Labels from Rough Group Comparisons

Hope, Tom, Shahaf, Dafna

We are interested in estimating individual labels given only coarse, aggregated signal over the data points. In our setting, we receive sets ("bags") of unlabeled instances with constraints on label proportions. We relax the unrealistic assumption of known label proportions, made in previous work; instead, we assume only upper and lower bounds, and constraints on bag differences. We motivate the problem, propose an intuitive formulation and algorithm, and apply our methods to real-world scenarios. Across several domains, we show how, using only proportion constraints and no labeled examples, we can achieve surprisingly high accuracy. In particular, we demonstrate how to predict income level using rough stereotypes and how to perform sentiment analysis using very little information. We also apply our method to guide exploratory analysis, recovering geographical differences in Twitter dialect.

constraint, data mining, machine learning, (22 more...)

1607.00034

Country: North America > United States > Texas (0.28)

Genre: Research Report (1.00)

Industry:

Information Technology (0.68)
Leisure & Entertainment (0.47)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

#artificialintelligence Jun-24-2016, 14:13:20 GMT

Computer vision system studies word use to recognize objects it has never seen before

Computer vision systems typically learn how to recognize an object by analyzing images of thousands of examples. But scientists at Disney Research have shown that computers also can learn to recognize objects they have never seen before, based in part on studying vocabulary. People, after all, can get an idea of what things might look like based on reading a book. Similarly, a computer that already has been taught to recognize certain objects - apples, for instance - can analyze word use to get hints about the existence of fruits such as pears and peaches, and how they might differ from apples, said Leonid Sigal, senior research scientist at Disney Research. The knowledge that other fruits exist also is helpful in teaching the computer about important characteristics of apples themselves, he added.

artificial intelligence, inductive learning, machine learning, (9 more...)

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.06)
Africa (0.06)

Technology:

Information Technology > Artificial Intelligence > Vision (0.79)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.32)

Belanger, David, McCallum, Andrew

Structured Prediction Energy Networks

arXiv.org Machine Learning Jun-23-2016

We introduce structured prediction energy networks (SPENs), a flexible framework for structured prediction. A deep architecture is used to define an energy function of candidate labels, and then predictions are produced by using back-propagation to iteratively optimize the energy with respect to the labels. This deep architecture captures dependencies between labels that would lead to intractable graphical models, and performs structure learning by automatically learning discriminative features of the structured output. One natural application of our technique is multi-label classification, which traditionally has required strict prior assumptions about the interactions between labels to ensure tractable learning and prediction. We are able to apply SPENs to multi-label problems with substantially larger label sets than previous applications of structured prediction, while modeling high-order interactions using minimal structural assumptions. Overall, deep learning provides remarkable tools for learning features of the inputs to a prediction problem, and this work extends these techniques to learning features of structured outputs. Our experiments provide impressive performance on a variety of benchmark multi-label classification tasks, demonstrate that our technique can be used to provide interpretable structure learning, and illuminate fundamental trade-offs between feed-forward and iterative structured prediction.
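The prediction step described in this abstract, iteratively optimizing an energy with respect to the labels, can be sketched on a toy problem. This is not the authors' SPEN: the abstract's energy is a deep network differentiated by back-propagation, whereas the example below assumes a small hand-written quadratic energy with an analytic gradient. The scores, the penalty weight, the step size, and the function names are all hypothetical.

```python
def energy_gradient(y, scores, penalty):
    """Gradient of E(y) = 0.5 * sum((y_i - s_i)^2) + penalty * y_0 * y_1.

    The first term pulls each relaxed label towards its local score; the
    second term discourages labels 0 and 1 from being active together.
    """
    grad = [yi - si for yi, si in zip(y, scores)]
    grad[0] += penalty * y[1]
    grad[1] += penalty * y[0]
    return grad

def predict(scores, penalty=0.5, step=0.5, iterations=100):
    """Minimise the energy over relaxed labels in [0, 1] by projected gradient descent."""
    y = [0.5] * len(scores)
    for _ in range(iterations):
        grad = energy_gradient(y, scores, penalty)
        # Take a gradient step, then clip back into the [0, 1] box.
        y = [min(1.0, max(0.0, yi - step * gi)) for yi, gi in zip(y, grad)]
    return y

relaxed = predict([0.9, 0.6, 0.2])
labels = [int(v > 0.5) for v in relaxed]
print(relaxed, labels)
```

Label 1 has a local score above 0.5 on its own, but the pairwise penalty with the stronger label 0 pushes it below the threshold. That is the kind of label interaction an energy over the whole output can express.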

artificial intelligence, machine learning, prediction, (18 more...)

1511.0635

Country: North America > United States (0.46)

Genre: Research Report > Experimental Study (0.34)

Industry: Energy > Power Industry (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

#artificialintelligence Jun-21-2016, 15:31:18 GMT

The first steps with Machine learning -- learning-ai

The learning that is being done is always based on some sort of observations or data, such as examples, direct experience, or instruction. For instance, you might wish to predict how much a user Bob will like a movie that he hasn't seen, based on his ratings of movies that he has seen. This means making informed guesses about some unobserved property of some object, based on observed properties of that object. Supervised learning is a type of machine learning algorithm that uses a known dataset (called the training dataset) to make predictions. The training dataset includes input data and response values.
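A minimal sketch of the movie-rating example, assuming a nearest-neighbour predictor, which is one simple supervised method among many and is not named in the article. The training dataset pairs input data (made-up genre scores for each movie) with response values (Bob's ratings). All numbers and names are hypothetical.

```python
from math import dist

# Hypothetical training dataset: (input features, response value).
# Features are made-up (action, romance) scores; the response is Bob's rating.
training = [
    ((0.9, 0.1), 5.0),
    ((0.8, 0.2), 4.0),
    ((0.1, 0.9), 1.0),
    ((0.2, 0.8), 2.0),
]

def predict_rating(features, data, k=2):
    """Average the ratings of the k training movies closest to `features`."""
    nearest = sorted(data, key=lambda pair: dist(pair[0], features))[:k]
    return sum(rating for _, rating in nearest) / k

# A movie Bob has not seen: its observed properties are known, its rating is not.
unseen_movie = (0.85, 0.15)
predicted = predict_rating(unseen_movie, training)
print(predicted)
```

The prediction is an informed guess about an unobserved property (the rating) based on observed properties (the genre scores), as the paragraph describes.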

artificial intelligence, classifier, machine learning, (9 more...)

Genre: Workflow (0.42)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.38)

arXiv.org Machine Learning Jun-20-2016

Quantifying and Reducing Stereotypes in Word Embeddings

Bolukbasi, Tolga, Chang, Kai-Wei, Zou, James, Saligrama, Venkatesh, Kalai, Adam

Machine learning algorithms are optimized to model statistical properties of the training data. If the input data reflects stereotypes and biases of the broader society, then the output of the learning algorithm also captures these stereotypes. In this paper, we initiate the study of gender stereotypes in word embedding, a popular framework to represent text data. As their use becomes increasingly common, applications can inadvertently amplify unwanted stereotypes. We show across multiple datasets that the embeddings contain significant gender stereotypes, especially with regard to professions. We created a novel gender analogy task and combined it with crowdsourcing to systematically quantify the gender bias in a given embedding. We developed an efficient algorithm that reduces gender stereotype using just a handful of training examples while preserving the useful geometric properties of the embedding. We evaluated our algorithm on several metrics. While we focus on male/female stereotypes, our framework may be applicable to other types of embedding biases.
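The abstract does not spell out how bias is quantified or reduced, so the following is only an assumed illustration, not the authors' algorithm. It measures how far a word vector leans along a direction defined by a gendered word pair, then removes that component by projection. The three-dimensional vectors are invented for the example; real embeddings have hundreds of dimensions.

```python
def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a vector to unit length."""
    norm = dot(v, v) ** 0.5
    return [a / norm for a in v]

# Hypothetical toy embeddings.
he = [1.0, 0.0, 0.2]
she = [-1.0, 0.0, 0.2]
nurse = [-0.6, 0.8, 0.1]

# Assumed gender direction: the normalized difference of one word pair.
direction = normalize([a - b for a, b in zip(he, she)])

# Signed component of the profession word along that direction.
bias_before = dot(nurse, direction)

# Remove the component; the other coordinates are left untouched.
debiased = [a - bias_before * d for a, d in zip(nurse, direction)]
bias_after = dot(debiased, direction)
print(bias_before, debiased, bias_after)
```

Because only the component along the chosen direction is changed, distances measured in the remaining directions are unaffected. This is one simple reading of "preserving the useful geometric properties of the embedding".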

artificial intelligence, machine learning, stereotype, (17 more...)

1606.06121

Country:

North America > United States > Massachusetts (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report (0.40)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.55)
Information Technology > Communications > Social Media > Crowdsourcing (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.34)