AITopics

2014 AAAI Fall Symposium Series

Country:

North America > United States > California > San Diego County > San Diego (0.06)
North America > United States > Washington > Grant County > Moses Lake (0.05)
North America > United States > Texas (0.05)
(5 more...)

Industry:

Government > Space Agency (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Humanoid Robots (0.43)

AAAI ConferencesNov-1-2014

Affordance Templates for Shared Robot Control

Hart, Stephen (General Motors) | Dinh, Paul (Oceaneering Space Systems) | Hambuchen, Kimberly A. (NASA Johnson Space Center)

This paper introduces the Affordance Template framework used to supervise task behaviors on the NASA-JSC Valkyrie robot at the 2013 DARPA Robotics Challenge (DRC) Trials. This framework provides graphical interfaces to human supervisors that are adjustable based on the run-time environmental context (e.g., size, location, and shape of objects that the robot must interact with, etc.). Additional improvements, described below, inject degrees of autonomy into instantiations of affordance templates at run-time in order to enable efficient human supervision of the robot for accomplishing tasks.

affordance template, artificial intelligence, template, (12 more...)

2014 AAAI Fall Symposium Series

Country:

North America > United States > Texas > Harris County > Houston (0.05)
North America > United States > Michigan > Macomb County > Warren (0.05)

Industry:

Government > Space Agency (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

AAAI ConferencesOct-31-2014

Crowdsourcing for Participatory Democracies: Efficient Elicitation of Social Choice Functions

Lee, David Timothy (Stanford University) | Goel, Ashish (Stanford University) | Aitamurto, Tanja (Stanford University) | Landemore, Helene (Yale University)

We present theoretical and empirical results demonstrating the usefulness of social choice functions in crowdsourcing for participatory democracies. First, we demonstrate the scalability of social choice functions by defining a natural notion of epsilon-approximation, and giving algorithms which efficiently elicit such approximations for two prominent social choice functions: the Borda rule and the Condorcet winner. This result circumvents previous prohibitive lower bounds and is surprisingly strong: even if the number of ideas is as large as the number of participants, each participant will only have to make a logarithmic number of comparisons, an exponential improvement over the linear number of comparisons previously needed. Second, we apply these ideas to Finland's recent off-road traffic law reform, an experiment on participatory democracy in real life. This allows us to verify the scaling predicted in our theory and show that the constant involved is also not large. In addition, by collecting data on the time that users take to complete rankings of varying sizes, we observe that eliciting partial rankings can further decrease elicitation time as compared to the common method of eliciting pairwise comparisons.

artificial intelligence, participant, social media, (18 more...)

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

Europe > Finland (0.25)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(4 more...)

Genre: Research Report > New Finding (0.48)

Industry: Government > Voting & Elections (0.39)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Communications > Social Media > Crowdsourcing (0.72)

AAAI ConferencesOct-31-2014

Predicting Next Label Quality: A Time-Series Model of Crowdwork

Jung, Hyun Joon (University of Texas at Austin) | Park, Yubin (University of Texas at Austin) | Lease, Matthew (University of Texas at Austin)

While temporal behavioral patterns can be discerned to underlie real crowd work, prior studies have typically modeled worker performance under a simplified i.i.d. assumption. To better model such temporal worker behavior, we propose a time-series label prediction model for crowd work. This latent variable model captures and summarizes past worker behavior, enabling us to better predict the quality of each worker's next label. Given inherent uncertainty in prediction, we also investigate a decision reject option to balance the tradeoff between prediction accuracy vs. coverage. Results show our model improves accuracy of both label prediction on real crowd worker data, as well as data quality overall.

artificial intelligence, machine learning, prediction, (18 more...)

Second AAAI Conference on Human Computation and Crowdsourcing

Country:

North America > United States > Texas > Travis County > Austin (0.05)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report > New Finding (0.88)

Industry:

Government (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
(2 more...)

Miller, Benjamin A., Beard, Michelle S., Wolfe, Patrick J., Bliss, Nadya T.

A Spectral Framework for Anomalous Subgraph Detection

arXiv.org Machine LearningOct-22-2014

A wide variety of application domains are concerned with data consisting of entities and their relationships or connections, formally represented as graphs. Within these diverse application areas, a common problem of interest is the detection of a subset of entities whose connectivity is anomalous with respect to the rest of the data. While the detection of such anomalous subgraphs has received a substantial amount of attention, no application-agnostic framework exists for analysis of signal detectability in graph-based data. In this paper, we describe a framework that enables such analysis using the principal eigenspace of a graph's residuals matrix, commonly called the modularity matrix in community detection. Leveraging this analytical tool, we show that the framework has a natural power metric in the spectral norm of the anomalous subgraph's adjacency matrix (signal power) and of the background graph's residuals matrix (noise power). We propose several algorithms based on spectral properties of the residuals matrix, with more computationally expensive techniques providing greater detection power. Detection and identification performance are presented for a number of signal and noise models, including clusters and bipartite foregrounds embedded into simple random backgrounds as well as graphs with community structure and realistic degree distributions. The trends observed verify intuition gleaned from other signal processing areas, such as greater detection power when the signal is embedded within a less active portion of the background. We demonstrate the utility of the proposed techniques in detecting small, highly anomalous subgraphs in real graphs derived from Internet traffic and product co-purchases.

data mining, machine learning, subgraph, (19 more...)

doi: 10.1109/TSP.2015.2437841

1401.7702

Country: North America > United States > Massachusetts > Middlesex County (0.28)

Genre: Research Report (0.81)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Communications > Networks (0.93)

Claesen, Marc, De Smet, Frank, Suykens, Johan A. K., De Moor, Bart

A Robust Ensemble Approach to Learn From Positive and Unlabeled Data Using SVM Base Models

arXiv.org Machine LearningOct-21-2014

We present a novel approach to learn binary classifiers when only positive and unlabeled instances are available (PU learning). This problem is routinely cast as a supervised task with label noise in the negative set. We use an ensemble of SVM models trained on bootstrap resamples of the training data for increased robustness against label noise. The approach can be considered in a bagging framework which provides an intuitive explanation for its mechanics in a semi-supervised setting. We compared our method to state-of-the-art approaches in simulations using multiple public benchmark data sets. The included benchmark comprises three settings with increasing label noise: (i) fully supervised, (ii) PU learning and (iii) PU learning with false positives. Our approach shows a marginal improvement over existing methods in the second setting and a significant improvement in the third. Frank De Smet is a member of the medical management department of the National Alliance of Christian Mutualities. Accepted at Neurocomputing: SI on Advances in Learning with Label Noise 20/10/2014 1. Introduction Training binary classifiers on positive and unlabeled data is referred to as PU learning [31]. The absence of known negative training instances warrants appropriate learning methods. Inaccurate label information can be more problematic than attribute noise [45]. Specialised PU learning approaches are recommended when (i) negative labels cannot be acquired, (ii) the training data contains a large amount of false negatives or (iii) the positive set has many outliers. Practical applications of PU learning typically feature large, imbalanced training sets with a small amount of labeled (positive) and a large amount of unlabeled training instances. The PU learning problem arises in various settings, including web page classification [44], intrusion detection [26] and bioinformatics tasks such as variant prioritization [42], gene prioritization [1, 35] and virtual screening of drug compounds [41]. Though these applications share a common underlying learning problem, the final evaluation criteria may be fundamentally different.

artificial intelligence, inductive learning, machine learning, (18 more...)

doi: 10.1016/j.neucom.2014.10.081

1402.3144

Country:

Europe (0.95)
North America > United States > California (0.46)

Genre:

Personal (1.00)
Research Report > Experimental Study (0.94)
Research Report > New Finding (0.93)

Industry:

Government (0.93)
Health & Medicine > Therapeutic Area > Oncology (0.46)
Education > Focused Education > Special Education (0.44)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)

Yu, Yaoliang, Zhang, Xinhua, Schuurmans, Dale

Generalized Conditional Gradient for Sparse Estimation

arXiv.org Machine LearningOct-17-2014

Structured sparsity is an important modeling tool that expands the applicability of convex formulations for data analysis, however it also creates significant challenges for efficient algorithm design. In this paper we investigate the generalized conditional gradient (GCG) algorithm for solving structured sparse optimization problems---demonstrating that, with some enhancements, it can provide a more efficient alternative to current state of the art approaches. After providing a comprehensive overview of the convergence properties of GCG, we develop efficient methods for evaluating polar operators, a subroutine that is required in each GCG iteration. In particular, we show how the polar operator can be efficiently evaluated in two important scenarios: dictionary learning and structured sparse estimation. A further improvement is achieved by interleaving GCG with fixed-rank local subspace optimization. A series of experiments on matrix completion, multi-class classification, multi-view dictionary learning and overlapping group lasso shows that the proposed method can significantly reduce the training cost of current alternatives.

artificial intelligence, generalized conditional gradient, machine learning, (13 more...)

1410.4828

Country:

North America > United States (0.45)
North America > Canada > Alberta (0.28)
Oceania > Australia (0.27)

Genre: Research Report (1.00)

Industry: Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Broderick, Tamara, Steorts, Rebecca C.

Variational Bayes for Merging Noisy Databases

arXiv.org Machine LearningOct-17-2014

Bayesian entity resolution merges together multiple, noisy databases and returns the minimal collection of unique individuals represented, together with their true, latent record values. Bayesian methods allow flexible generative models that share power across databases as well as principled quantification of uncertainty for queries of the final, resolved database. However, existing Bayesian methods for entity resolution use Markov monte Carlo method (MCMC) approximations and are too slow to run on modern databases containing millions or billions of records. Instead, we propose applying variational approximations to allow scalable Bayesian inference in these models. We derive a coordinate-ascent approximation for mean-field variational Bayes, qualitatively compare our algorithm to existing methods, note unique challenges for inference that arise from the expected distribution of cluster sizes in entity resolution, and discuss directions for future work in this domain.

artificial intelligence, machine learning, natural language, (19 more...)

1410.4792

Country: North America > United States (1.00)

Genre: Research Report (0.51)

Industry:

Health & Medicine (0.46)
Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Frandi, Emanuele, Nanculef, Ricardo, Suykens, Johan

Complexity Issues and Randomization Strategies in Frank-Wolfe Algorithms for Machine Learning

arXiv.org Machine LearningOct-15-2014

Frank-Wolfe algorithms for convex minimization have recently gained considerable attention from the Optimization and Machine Learning communities, as their properties make them a suitable choice in a variety of applications. However, as each iteration requires to optimize a linear model, a clever implementation is crucial to make such algorithms viable on large-scale datasets. For this purpose, approximation strategies based on a random sampling have been proposed by several researchers. In this work, we perform an experimental study on the effectiveness of these techniques, analyze possible alternatives and provide some guidelines based on our results.

algorithm, artificial intelligence, machine learning, (12 more...)

1410.4062

Country:

North America > United States (0.49)
Europe (0.47)

Genre: Research Report > New Finding (1.00)

Industry: Government (0.33)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.49)

Cho, Kyunghyun, van Merrienboer, Bart, Bahdanau, Dzmitry, Bengio, Yoshua

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches

arXiv.org Machine LearningOct-7-2014

On the Properties of Neural Machine Translation: Encoder-Decoder Approaches Kyunghyun Cho Bart van Merri enboer Universit e de Montr eal Dzmitry Bahdanau Jacobs University, Germany Yoshua Bengio Universit e de Montr eal, CIFAR Senior Fellow Abstract Neural machine translation is a relatively new approach to statistical machine translation based purely on neural networks. The neural machine translation models often consist of an encoder and a decoder. The encoder extracts a fixed-length representation from a variable-length input sentence, and the decoder generates a correct translation from this representation. In this paper, we focus on analyzing the properties of the neural machine translation using two models; RNN Encoder-Decoder and a newly proposed gated recursive con-volutional neural network. We show that the neural machine translation performs relatively well on short sentences without unknown words, but its performance degrades rapidly as the length of the sentence and the number of unknown words increase. Furthermore, we find that the proposed gated recursive convolutional network learns a grammatical structure of a sentence automatically. 1 Introduction A new approach for statistical machine translation based purely on neural networks has recently been proposed (Kalchbrenner and Blunsom, 2013; Sutskever et al., 2014). This new approach, which we refer to as neural machine translation, is inspired by the recent trend of deep representational learning. All the neural network models used in (Kalchbrenner and Blunsom, 2013; Sutskever et al., 2014; Cho et al., 2014) consist of an encoder and a decoder.

machine learning, natural language, translation, (20 more...)

1409.1259

Country: North America > United States (0.47)

Genre: Research Report (0.40)

Industry: Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)