AITopics | Education

Collaborating Authors

Education

Adaptive Mixtures of Factor Analyzers

arXiv.org Machine LearningOct-22-2015

A mixture of factor analyzers is a semi-parametric density estimator that generalizes the well-known mixtures of Gaussians model by allowing each Gaussian in the mixture to be represented in a different lower-dimensional manifold. This paper presents a robust and parsimonious model selection algorithm for training a mixture of factor analyzers, carrying out simultaneous clustering and locally linear, globally nonlinear dimensionality reduction. Permitting different number of factors per mixture component, the algorithm adapts the model complexity to the data complexity. We compare the proposed algorithm with related automatic model selection algorithms on a number of benchmarks. The results indicate the effectiveness of this fast and robust approach in clustering, manifold learning and class-conditional modeling.

algorithm, artificial intelligence, machine learning, (13 more...)

arXiv.org Machine Learning

1507.02801

Country: Europe (0.68)

Genre: Research Report (1.00)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Piecewise-Linear Approximation for Feature Subset Selection in a Sequential Logit Model

Sato, Toshiki, Takano, Yuichi, Miyashiro, Ryuhei

arXiv.org Machine LearningOct-19-2015

This paper concerns a method of selecting a subset of features for a sequential logit model. Tanaka and Nakagawa (2014) proposed a mixed integer quadratic optimization formulation for solving the problem based on a quadratic approximation of the logistic loss function. However, since there is a significant gap between the logistic loss function and its quadratic approximation, their formulation may fail to find a good subset of features. To overcome this drawback, we apply a piecewise-linear approximation to the logistic loss function. Accordingly, we frame the feature subset selection problem of minimizing an information criterion as a mixed integer linear optimization problem. The computational results demonstrate that our piecewise-linear approximation approach found a better subset of features than the quadratic approximation approach.

artificial intelligence, logistic loss function, machine learning, (13 more...)

arXiv.org Machine Learning

1510.05417

Genre: Research Report > New Finding (0.34)

Industry:

Leisure & Entertainment (0.50)
Education (0.46)
Banking & Finance (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.34)

Add feedback

The Limitations of Standardized Science Tests as Benchmarks for Artificial Intelligence Research: Position Paper

Davis, Ernest

arXiv.org Artificial IntelligenceOct-16-2015

In this position paper, I argue that standardized tests for elementary science such as SAT or Regents tests are not very good benchmarks for measuring the progress of artificial intelligence systems in understanding basic science. The primary problem is that these tests are designed to test aspects of knowledge and ability that are challenging for people; the aspects that are challenging for AI systems are very different. In particular, standardized tests do not test knowledge that is obvious for people; none of this knowledge can be assumed in AI systems. Individual standardized tests also have specific features that are not necessarily appropriate for an AI benchmark. I analyze the Physics subject SAT in some detail and the New York State Regents Science test more briefly. I also argue that the apparent advantages offered by using standardized tests are mostly either minor or illusory. The one major real advantage is that the significance is easily explained to the public; but I argue that even this is a somewhat mixed blessing. I conclude by arguing that, first, more appropriate collections of exam style problems could be assembled, and second, that there are better kinds of benchmarks than exam-style problems. In an appendix I present a collection of sample exam-style problems that test kinds of knowledge missing from the standardized tests.

artificial intelligence, expert system, natural language, (20 more...)

arXiv.org Artificial Intelligence

1411.1629

Country: North America > United States > New York (0.34)

Genre: Research Report (0.50)

Industry:

Education > Assessment & Standards (1.00)
Education > Curriculum > Subject-Specific Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.67)

Add feedback

On Equivalence of Martingale Tail Bounds and Deterministic Regret Inequalities

Rakhlin, Alexander, Sridharan, Karthik

arXiv.org Machine LearningOct-13-2015

We study an equivalence of (i) deterministic pathwise statements appearing in the online learning literature (termed \emph{regret bounds}), (ii) high-probability tail bounds for the supremum of a collection of martingales (of a specific form arising from uniform laws of large numbers for martingales), and (iii) in-expectation bounds for the supremum. By virtue of the equivalence, we prove exponential tail bounds for norms of Banach space valued martingales via deterministic regret bounds for the online mirror descent algorithm with an adaptive step size. We extend these results beyond the linear structure of the Banach space: we define a notion of \emph{martingale type} for general classes of real-valued functions and show its equivalence (up to a logarithmic factor) to various sequential complexities of the class (in particular, the sequential Rademacher complexity and its offset version). For classes with the general martingale type 2, we exhibit a finer notion of variation that allows partial adaptation to the function indexing the martingale. Our proof technique rests on sequential symmetrization and on certifying the \emph{existence} of regret minimization strategies for certain online prediction problems.

artificial intelligence, inequality, machine learning, (16 more...)

arXiv.org Machine Learning

1510.03925

Genre: Research Report (0.50)

Industry: Education (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.88)

Add feedback

Use of the Triangular Fuzzy Numbers for Student Assessment

Voskoglou, Michael

arXiv.org Artificial IntelligenceOct-12-2015

In an earlier work we have used the Triangular Fuzzy Numbers (TFNs)as an assessment tool of student skills.This approach led to an approximate linguistic characterization of the students' overall performance, but it was not proved to be sufficient in all cases for comparing the performance of two different student groups, since tywo TFNs are not always comparable. In the present paper we complete the above fuzzy assessment approach by presenting a defuzzification method of TFNS based on the Center of Gravity (COG) technique, which enables the required comparison. In addition we extend our results by using the Trapezoidal Fuzzy Numbers (TpFNs) too, which are a generalization of the TFNs, for student assessment and we present suitable examples illustrating our new results in practice.

artificial intelligence, fuzzy logic, tpfn, (18 more...)

arXiv.org Artificial Intelligence

1507.03257

Country: Europe > Greece (0.29)

Genre: Research Report > New Finding (0.68)

Industry: Education (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.69)

Add feedback

p-Markov Gaussian Processes for Scalable and Expressive Online Bayesian Nonparametric Time Series Forecasting

Samo, Yves-Laurent Kom, Roberts, Stephen J.

arXiv.org Machine LearningOct-9-2015

In this paper we introduce a novel online time series forecasting model we refer to as the pM-GP filter. We show that our model is equivalent to Gaussian process regression, with the advantage that both online forecasting and online learning of the hyper-parameters have a constant (rather than cubic) time complexity and a constant (rather than squared) memory requirement in the number of observations, without resorting to approximations. Moreover, the proposed model is expressive in that the family of covariance functions of the implied latent process, namely the spectral Matern kernels, have recently been proven to be capable of approximating arbitrarily well any translation-invariant covariance function. The benefit of our approach compared to competing models is demonstrated using experiments on several real-life datasets.

artificial intelligence, data mining, machine learning, (15 more...)

arXiv.org Machine Learning

1510.0283

Genre: Research Report (0.50)

Industry: Education (0.49)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Data Science > Data Mining (0.70)

Add feedback

Distributed Multitask Learning

Wang, Jialei, Kolar, Mladen, Srebro, Nathan

arXiv.org Machine LearningOct-2-2015

We consider the problem of distributed multi-task learning, where each machine learns a separate, but related, task. Specifically, each machine learns a linear predictor in high-dimensional space,where all tasks share the same small support. We present a communication-efficient estimator based on the debiased lasso and show that it is comparable with the optimal centralized method.

artificial intelligence, group lasso, machine learning, (18 more...)

arXiv.org Machine Learning

1510.00633

Country: North America > United States (0.46)

Genre: Research Report (0.64)

Industry:

Education (0.46)
Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)

Add feedback

Symbol Emergence in Robotics: A Survey

Taniguchi, Tadahiro, Nagai, Takayuki, Nakamura, Tomoaki, Iwahashi, Naoto, Ogata, Tetsuya, Asoh, Hideki

arXiv.org Artificial IntelligenceSep-29-2015

Humans can learn the use of language through physical interaction with their environment and semiotic communication with other people. It is very important to obtain a computational understanding of how humans can form a symbol system and obtain semiotic skills through their autonomous mental development. Recently, many studies have been conducted on the construction of robotic systems and machine-learning methods that can learn the use of language through embodied multimodal interaction with their environment and other systems. Understanding human social interactions and developing a robot that can smoothly communicate with human users in the long term, requires an understanding of the dynamics of symbol systems and is crucially important. The embodied cognition and social interaction of participants gradually change a symbol system in a constructive manner. In this paper, we introduce a field of research called symbol emergence in robotics (SER). SER is a constructive approach towards an emergent symbol system. The emergent symbol system is socially self-organized through both semiotic communications and physical interactions with autonomous cognitive developmental agents, i.e., humans and developmental robots. Specifically, we describe some state-of-art research topics concerning SER, e.g., multimodal categorization, word discovery, and a double articulation analysis, that enable a robot to obtain words and their embodied meanings from raw sensory--motor information, including visual information, haptic information, auditory information, and acoustic speech signals, in a totally unsupervised manner. Finally, we suggest future directions of research in SER.

artificial intelligence, machine learning, symbol system, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1080/01691864.2016.1164622

1509.08973

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Asia > Middle East > Jordan (0.04)
(9 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Education (0.67)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(4 more...)

Add feedback

Semantics, Representations and Grammars for Deep Learning

Balduzzi, David

arXiv.org Machine LearningSep-29-2015

Deep learning is currently the subject of intensive study. However, fundamental concepts such as representations are not formally defined -- researchers "know them when they see them" -- and there is no common language for describing and analyzing algorithms. This essay proposes an abstract framework that identifies the essential features of current practice and may provide a foundation for future developments. The backbone of almost all deep learning algorithms is backpropagation, which is simply a gradient computation distributed over a neural network. The main ingredients of the framework are thus, unsurprisingly: (i) game theory, to formalize distributed optimization; and (ii) communication protocols, to track the flow of zeroth and first-order information. The framework allows natural definitions of semantics (as the meaning encoded in functions), representations (as functions whose semantics is chosen to optimized a criterion) and grammars (as communication protocols equipped with first-order convergence guarantees). Much of the essay is spent discussing examples taken from the literature. The ultimate aim is to develop a graphical language for describing the structure of deep learning algorithms that backgrounds the details of the optimization procedure and foregrounds how the components interact. Inspiration is taken from probabilistic graphical models and factor graphs, which capture the essential structural features of multivariate distributions.

artificial intelligence, machine learning, representation, (14 more...)

arXiv.org Machine Learning

1509.08627

Genre: Research Report (0.40)

Industry:

Education (0.67)
Leisure & Entertainment > Games (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

AAAI News

Hamilton, Carol (AAAI)

AI MagazineSep-28-2015

Park and the Heard Museum, sports and discuss their work during its early AAAI-16 Workshop Program -- Submissions stadiums, restaurants, and shopping.

artificial intelligence, information, machine learning, (17 more...)

AI Magazine

Country:

Europe (0.94)
Asia (0.94)
North America > United States > California > Santa Cruz County > Santa Cruz (0.14)

Genre:

Contests & Prizes (0.69)
Personal > Honors (0.46)

Industry: Education > Educational Setting (0.69)

Technology:

Information Technology > Human Computer Interaction (0.94)
Information Technology > Artificial Intelligence > Robots (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.47)

Add feedback