AITopics | Education

Collaborating Authors

Education

A Tutorial on Probabilistic Latent Semantic Analysis

arXiv.org Machine LearningDec-21-2012

In this tutorial, I will discuss the details about how Probabilistic Latent Semantic Analysis (PLSA) is formalized and how different learning algorithms are proposed to learn the model.

likelihood, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1212.39

Country: North America > United States (0.47)

Genre: Instructional Material > Course Syllabus & Notes (0.85)

Industry: Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Safe Exploration of State and Action Spaces in Reinforcement Learning

Garcia, J., Fernandez, F.

Journal of Artificial Intelligence ResearchDec-19-2012

In this paper, we consider the important problem of safe exploration in reinforcement learning. While reinforcement learning is well-suited to domains with complex transition dynamics and high-dimensional state-action spaces, an additional challenge is posed by the need for safe and efficient exploration. Traditional exploration techniques are not particularly useful for solving dangerous tasks, where the trial and error process may lead to the selection of actions whose execution in some states may result in damage to the learning system (or any other system). Consequently, when an agent begins an interaction with a dangerous and high-dimensional state-action space, an important question arises; namely, that of how to avoid (or at least minimize) damage caused by the exploration of the state-action space. We introduce the PI-SRL algorithm which safely improves suboptimal albeit robust behaviors for continuous state and action control tasks and which efficiently learns from the experience gained from the environment. We evaluate the proposed method in four complex tasks: automatic car parking, pole-balancing, helicopter hovering, and business management.

algorithm, cumulative reward, pi-srl, (12 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.3761

AI Access Foundation

10789

Journal of Artificial Intelligence Research

Country:

Europe > Spain > Galicia > Madrid (0.04)
North America > United States > New York > New York County > New York City (0.04)
Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre:

Workflow (0.93)
Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment (1.00)
Transportation > Air (0.92)
Education (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Online Learning for Ground Trajectory Prediction

Hadjaz, Areski, Marceau, Gaétan, Savéant, Pierre, Schoenauer, Marc

arXiv.org Artificial IntelligenceDec-17-2012

This paper presents a model based on an hybrid system to numerically simulate the climbing phase of an aircraft. This model is then used within a trajectory prediction tool. Finally, the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) optimization algorithm is used to tune five selected parameters, and thus improve the accuracy of the model. Incorporated within a trajectory prediction tool, this model can be used to derive the order of magnitude of the prediction error over time, and thus the domain of validity of the trajectory prediction. A first validation experiment of the proposed model is based on the errors along time for a one-time trajectory prediction at the take off of the flight with respect to the default values of the theoretical BADA model. This experiment, assuming complete information, also shows the limit of the model. A second experiment part presents an on-line trajectory prediction, in which the prediction is continuously updated based on the current aircraft position. This approach raises several issues, for which improvements of the basic model are proposed, and the resulting trajectory prediction tool shows statistically significantly more accurate results than those of the default model.

evolutionary algorithm, machine learning, trajectory, (16 more...)

arXiv.org Artificial Intelligence

1212.3998

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Air (1.00)
Aerospace & Defense > Aircraft (0.95)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Inductive Policy Selection for First-Order MDPs

Yoon, Sung Wook, Fern, Alan, Givan, Robert

arXiv.org Artificial IntelligenceDec-12-2012

We select policies for large Markov Decision Processes (MDPs) with compact first-order representations. We find policies that generalize well as the number of objects in the domain grows, potentially without bound. Existing dynamic-programming approaches based on flat, propositional, or first-order representations either are impractical here or do not naturally scale as the number of objects grows without bound. We implement and evaluate an alternative approach that induces first-order policies using training data constructed by solving small problem instances using PGraphplan (Blum & Langford, 1999). Our policies are represented as ensembles of decision lists, using a taxonomic concept language. This approach extends the work of Martin and Geffner (2000) to stochastic domains, ensemble learning, and a wider variety of problems. Empirically, we find "good" policies for several stochastic first-order MDPs that are beyond the scope of previous approaches. We also discuss the application of this work to the relational reinforcement-learning problem.

artificial intelligence, inductive learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

1301.0614

Genre: Research Report (0.50)

Industry: Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.50)
(2 more...)

Add feedback

A Study on Fuzzy Systems

Voskoglou, Michael Gr.

arXiv.org Artificial IntelligenceDec-11-2012

In the present paper we use principles of fuzzy logic to develop a general model representing several processes in a system's operation characterized by a degree of vagueness and/or uncertainty. For this, the main stages of the corresponding process are represented as fuzzy subsets of a set of linguistic labels characterizing the system's performance at each stage. We also introduce three alternative measures of a fuzzy system's effectiveness connected to our general model. These measures include the system's total possibilistic uncertainty, the Shannon's entropy properly modified for use in a fuzzy environment and the "centroid" method in which the coordinates of the center of mass of the graph of the membership function involved provide an alternative measure of the system's performance. The advantages and disadvantages of the above measures are discussed and a combined use of them is suggested for achieving a worthy of credit mathematical analysis of the corresponding situation. An application is also developed for the Mathematical Modelling process illustrating the use of our results in practice.

artificial intelligence, fuzzy logic, mm process, (18 more...)

arXiv.org Artificial Intelligence

1212.2614

Country: Europe (0.68)

Genre: Research Report (0.70)

Industry: Education (0.94)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (1.00)

Add feedback

Multiscale Markov Decision Problems: Compression, Solution, and Transfer Learning

Bouvrie, Jake, Maggioni, Mauro

arXiv.org Artificial IntelligenceDec-5-2012

Many problems in sequential decision making and stochastic control often have natural multiscale structure: sub-tasks are assembled together to accomplish complex goals. Systematically inferring and leveraging hierarchical structure, particularly beyond a single level of abstraction, has remained a longstanding challenge. We describe a fast multiscale procedure for repeatedly compressing, or homogenizing, Markov decision processes (MDPs), wherein a hierarchy of sub-problems at different scales is automatically determined. Coarsened MDPs are themselves independent, deterministic MDPs, and may be solved using existing algorithms. The multiscale representation delivered by this procedure decouples sub-tasks from each other and can lead to substantial improvements in convergence rates both locally within sub-problems and globally across sub-problems, yielding significant computational savings. A second fundamental aspect of this work is that these multiscale decompositions yield new transfer opportunities across different problems, where solutions of sub-tasks at different levels of the hierarchy may be amenable to transfer to new problems. Localized transfer of policies and potential operators at arbitrary scales is emphasized. Finally, we demonstrate compression and transfer in a collection of illustrative domains, including examples involving discrete and continuous statespaces. Keywords: Markov decision processes, hierarchical reinforcement learning, transfer, multiscale analysis.

algorithm, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

1212.1143

Country:

Asia (0.67)
North America > United States > Massachusetts (0.45)

Genre:

Workflow (0.93)
Overview > Growing Problem (0.34)

Industry: Education (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.92)

Add feedback

Training Support Vector Machines Using Frank-Wolfe Optimization Methods

Frandi, Emanuele, Nanculef, Ricardo, Gasparo, Maria Grazia, Lodi, Stefano, Sartori, Claudio

arXiv.org Machine LearningDec-4-2012

Training a Support Vector Machine (SVM) requires the solution of a quadratic programming problem (QP) whose computational complexity becomes prohibitively expensive for large scale datasets. Traditional optimization methods cannot be directly applied in these cases, mainly due to memory restrictions. By adopting a slightly different objective function and under mild conditions on the kernel used within the model, efficient algorithms to train SVMs have been devised under the name of Core Vector Machines (CVMs). This framework exploits the equivalence of the resulting learning problem with the task of building a Minimal Enclosing Ball (MEB) problem in a feature space, where data is implicitly embedded by a kernel function. In this paper, we improve on the CVM approach by proposing two novel methods to build SVMs based on the Frank-Wolfe algorithm, recently revisited as a fast method to approximate the solution of a MEB problem. In contrast to CVMs, our algorithms do not require to compute the solutions of a sequence of increasingly complex QPs and are defined by using only analytic optimization steps. Experiments on a large collection of datasets show that our methods scale better than CVMs in most cases, sometimes at the price of a slightly lower accuracy. As CVMs, the proposed methods can be easily extended to machine learning problems other than binary classification. However, effective classifiers are also obtained using kernels which do not satisfy the condition required by CVMs and can thus be used for a wider set of problems.

algorithm, artificial intelligence, machine learning, (17 more...)

arXiv.org Machine Learning

doi: 10.1142/S0218001413600033

1212.0695

Country:

North America > United States (1.00)
Europe (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (0.46)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)
Education > Focused Education > Special Education (0.44)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

Add feedback

Problem Solving and Computational Thinking in a Learning Environment

Voskoglou, Michael Gr., Buckley, Sheryl

arXiv.org Artificial IntelligenceDec-2-2012

Computational thinking is a new problem solving method named for its extensive use of computer science techniques. It synthesizes critical thinking and existing knowledge and applies them to solve complex technological problems. The term was coined by J. Wing [1], but the relationship between computational and critical thinking, the two modes of thinking in solving problems, has not been yet clearly established. This paper aims in shedding some light into this relationship. We also present two classroom experiments performed recently at the Graduate Technological Educational Institute (TEI) of Patras, Greece. The result of these experiment give a strong indication that the use of computers as a tool for problem solving enhances the students‟ abilities in solving real world problems involving mathematical modelling. This is crossed by earlier findings of other researchers for the problem solving process in general (not only for mathematical problems).

artificial intelligence, egyptian computer science journal, knowledge management, (14 more...)

arXiv.org Artificial Intelligence

1212.075

Country:

North America > United States > New Jersey (0.28)
Europe > Greece > West Greece > Patra (0.24)

Genre:

Research Report > Experimental Study (0.67)
Research Report > New Finding (0.48)

Industry:

Health & Medicine (0.93)
Education > Educational Setting (0.93)
Education > Curriculum > Subject-Specific Education (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
Information Technology > Knowledge Management (0.93)

Add feedback

The Interplay Between Stability and Regret in Online Learning

Saha, Ankan, Jain, Prateek, Tewari, Ambuj

arXiv.org Machine LearningNov-26-2012

This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called {\em forward regret} that intuitively measures how good an online learning algorithm is if it is allowed a one-step look-ahead into the future. We show that given stability, bounded forward regret is equivalent to bounded regret. We also show that the existence of an algorithm with bounded regret implies the existence of a stable algorithm with bounded regret and bounded forward regret. The equivalence results apply to general, possibly non-convex problems. To the best of our knowledge, our analysis provides the first general connection between stability and regret in the online setting that is not restricted to a particular class of algorithms. Our stability-regret connection provides a simple recipe for analyzing regret incurred by any online learning algorithm. Using our framework, we analyze several existing online learning algorithms as well as the "approximate" versions of algorithms like RDA that solve an optimization problem at each iteration. Our proofs are simpler than existing analysis for the respective algorithms, show a clear trade-off between stability and forward regret, and provide tighter regret bounds in some cases. Furthermore, using our recipe, we analyze "approximate" versions of several algorithms such as follow-the-regularized-leader (FTRL) that requires solving an optimization problem at each step.

algorithm, artificial intelligence, machine learning, (15 more...)

arXiv.org Machine Learning

1211.6158

Country: North America > United States (0.67)

Genre:

Workflow (0.67)
Research Report (0.50)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Automated Feedback Generation for Introductory Programming Assignments

Singh, Rishabh, Gulwani, Sumit, Solar-Lezama, Armando

arXiv.org Artificial IntelligenceNov-16-2012

We present a new method for automatically providing feedback for introductory programming problems. In order to use this method, we need a reference implementation of the assignment, and an error model consisting of potential corrections to errors that students might make. Using this information, the system automatically derives minimal corrections to student's incorrect solutions, providing them with a quantifiable measure of exactly how incorrect a given solution was, as well as feedback about what they did wrong. We introduce a simple language for describing error models in terms of correction rules, and formally define a rule-directed translation strategy that reduces the problem of finding minimal corrections in an incorrect program to the problem of synthesizing a correct program from a sketch. We have evaluated our system on thousands of real student attempts obtained from 6.00 and 6.00x. Our results show that relatively simple error models can correct on average 65% of all incorrect submissions.

machine learning, natural language, programming language, (19 more...)

arXiv.org Artificial Intelligence

1204.1751

Country: North America > United States (0.68)

Genre:

Instructional Material > Course Syllabus & Notes (0.93)
Research Report > New Finding (0.86)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
(2 more...)

Add feedback