Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes
Weinstein, Ari (Rutgers University) | Littman, Michael L. (Rutgers University)
Recent research leverages results from the continuous-armed bandit literature to create a reinforcement-learning algorithm for continuous state and action spaces. Initially proposed in a theoretical setting, we provide the first examination of the empirical properties of the algorithm. Through experimentation, we demonstrate the effectiveness of this planning method when coupled with exploration and model learning and show that, in addition to its formal guarantees, the approach is very competitive with other continuous-action reinforcement learners.
Optimizing Plans through Analysis of Action Dependencies and Independencies
Chrpa, Lukáš (University of Huddersfield) | McCluskey, Thomas Leo (University of Huddersfield) | Osborne, Hugh (University of Huddersfield)
The problem of automated planning is known to be intractable in general. Moreover, it has been proven that in some cases finding an optimal solution is much harder than finding any solution. Existing techniques therefore have to compromise between the speed of the planning process and the quality of solutions. For example, techniques based on greedy search are often able to obtain solutions quickly, but the quality of those solutions is usually low. Similarly, adding macro-operators to planning domains often speeds up planning, but the resulting solution sequences are typically longer. In this paper, we propose a method for optimizing plans with respect to their length through post-planning analysis. The method is based on analyzing action dependencies and independencies, by which we are able to identify redundant actions or non-optimal sub-plans. To evaluate the process, we provide preliminary empirical evidence using benchmark domains.
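As a rough illustration of redundant-action removal (not the authors' dependency-based algorithm), the following Python sketch post-processes a STRIPS-style plan by greedily deleting any action whose removal leaves the plan valid; the action encoding and the example domain are hypothetical:

```python
def apply_action(state, action):
    """Apply a STRIPS action if its preconditions hold; return None otherwise."""
    name, pre, add, dele = action
    if not pre <= state:
        return None
    return (state - dele) | add

def is_valid(plan, init, goal):
    """Check that every action is applicable in sequence and the goal holds."""
    state = set(init)
    for a in plan:
        state = apply_action(state, a)
        if state is None:
            return False
    return goal <= state

def drop_redundant(plan, init, goal):
    """Greedily remove actions whose deletion keeps the plan valid."""
    plan = list(plan)
    i = 0
    while i < len(plan):
        if is_valid(plan[:i] + plan[i+1:], init, goal):
            del plan[i]          # action i was redundant
        else:
            i += 1
    return plan

# Hypothetical example: the 'wave' action contributes nothing to the goal.
pick = ("pick", {"clear"}, {"held"}, {"clear"})
wave = ("wave", set(), {"waved"}, set())
print([a[0] for a in drop_redundant([wave, pick], {"clear"}, {"held"})])  # ['pick']
```

Unlike this brute-force re-validation, the paper's analysis of action dependencies and independencies identifies redundant actions and non-optimal sub-plans without replaying the whole plan for every candidate deletion.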
Making Hybrid Plans More Clear to Human Users - A Formal Approach for Generating Sound Explanations
Seegebarth, Bastian (Ulm University) | Müller, Felix (Ulm University) | Schattenberg, Bernd (Ulm University) | Biundo, Susanne (Ulm University)
Human users who execute an automatically generated plan want to understand the rationale behind it. Knowledge-rich plans are particularly suitable for this purpose, because they provide the means to justify the causal, temporal, and hierarchical relationships between actions. Based on this information, focused arguments can be generated that constitute explanations at an appropriate level of abstraction. In this paper, we present a formal approach to plan explanation. Information about plans is represented as first-order logic formulae, and explanations are constructed as proofs in the resulting axiomatic system. Consequently, plan explanations are provably correct w.r.t. the planning system that produced the plan. A prototype plan explanation system implements our approach, and first experiments give evidence that finding plan explanations is feasible in real time.
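As a minimal sketch of the idea (the paper builds full first-order proofs; this toy version merely reads off causal links), one might record which action establishes which fact for which consumer, and justify an action by the links it supports. The link format and the plan are hypothetical:

```python
# Hypothetical causal links: (producer action, consumer, established fact).
links = [("pickup(A)", "stack(A,B)", "holding(A)"),
         ("stack(A,B)", "GOAL", "on(A,B)")]

def explain(action):
    """Justify an action by the causal links in which it is the producer."""
    reasons = [f"{producer} establishes {fact}, which {consumer} requires"
               for (producer, consumer, fact) in links if producer == action]
    return reasons or [f"no recorded purpose for {action}"]

print(explain("pickup(A)"))
# ['pickup(A) establishes holding(A), which stack(A,B) requires']
```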
A New Greedy Algorithm for Multiple Sparse Regression
This paper proposes a new algorithm for multiple sparse regression in high dimensions, where the task is to estimate the support and values of several (typically related) sparse vectors from a few noisy linear measurements. Our algorithm is a "forward-backward" greedy procedure that -- uniquely -- operates on two distinct classes of objects. In particular, we organize our target sparse vectors as a matrix; our algorithm involves iterative addition and removal of both (a) individual elements and (b) entire rows (corresponding to shared features) of this matrix. Analytically, we establish that our algorithm recovers the supports (exactly) and values (approximately) of the sparse vectors under assumptions similar to those of existing approaches based on convex optimization. However, our algorithm has a much smaller computational complexity. Perhaps most interestingly, it is seen empirically to require noticeably fewer samples. Ours represents the first attempt to extend greedy algorithms to the class of models that are only, or best, represented by a combination of component structural assumptions (sparse and group-sparse, in our case).
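To make the two candidate classes concrete, here is a hedged sketch of a single forward step: it scores every (feature, task) entry and every whole feature row by squared correlation with the task residuals, and compares per-parameter gains. It illustrates the element-vs-row idea only, not the paper's exact procedure (which also refits on the active set and interleaves backward removal steps):

```python
import numpy as np

def forward_step(Xs, Ys, B):
    """One illustrative forward step of an element/row greedy procedure.
    Xs[k]: (n_k, p) design for task k; Ys[k]: (n_k,) responses; B: (p, r)."""
    p, r = B.shape
    scores = np.zeros((p, r))
    for k in range(r):
        resid = Ys[k] - Xs[k] @ B[:, k]
        scores[:, k] = (Xs[k].T @ resid) ** 2 / len(Ys[k])
    j, k = np.unravel_index(np.argmax(scores), scores.shape)
    entry_gain = scores[j, k]
    row_gain = scores.sum(axis=1)       # gain from activating a whole row
    jr = int(np.argmax(row_gain))
    # A row adds r parameters, an entry adds one: compare gain per parameter.
    if row_gain[jr] / r > entry_gain:
        return ("row", jr)              # shared feature across all tasks
    return ("entry", (int(j), int(k)))  # task-specific feature

# Hypothetical data: feature 0 drives all three tasks.
rng = np.random.default_rng(0)
Xs = [rng.normal(size=(50, 10)) for _ in range(3)]
Ys = [X[:, 0] + 0.1 * rng.normal(size=50) for X in Xs]
print(forward_step(Xs, Ys, np.zeros((10, 3))))  # selects feature 0
```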
Multiple Kernel Learning: A Unifying Probabilistic Viewpoint
Nickisch, Hannes | Seeger, Matthias
We present a probabilistic viewpoint on multiple kernel learning that unifies well-known regularised risk approaches and recent advances in approximate Bayesian inference relaxations. The framework proposes a general objective function, suitable for regression, robust regression, and classification, that is a lower bound of the marginal likelihood and contains many regularised risk approaches as special cases. Furthermore, we derive an efficient and provably convergent optimisation algorithm.
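Schematically, the regularised risk approaches the framework subsumes share the following shape (a generic MKL objective, not the paper's exact variational bound; $\ell$ is the loss and $h$ a regulariser on the kernel weights):

```latex
\min_{\alpha,\;\theta \ge 0}\;
  \sum_{n=1}^{N} \ell\!\big([K(\theta)\,\alpha]_n,\; y_n\big)
  \;+\; \tfrac{1}{2}\,\alpha^{\top} K(\theta)\,\alpha
  \;+\; h(\theta),
\qquad
K(\theta) \;=\; \sum_{m=1}^{M} \theta_m K_m .
```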
Soil Data Analysis Using Classification Techniques and Soil Attribute Prediction
Gholap, Jay | Ingole, Anurag | Gohil, Jayesh | Gargade, Shailesh | Attar, Vahida
Agricultural research has profited from technical advances such as automation and data mining. Today, data mining is used in a vast range of areas, and many off-the-shelf data mining systems and domain-specific data mining applications are available, but data mining on agricultural soil datasets is a relatively young research field. The large amounts of data that are nowadays virtually harvested along with the crops have to be analyzed and should be used to their full extent. This research aims at the analysis of a soil dataset using data mining techniques. It focuses on the classification of soil using various available algorithms. Another important purpose is to predict untested attributes using regression techniques, and to implement automated soil sample classification.
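A minimal sketch of the two tasks the abstract names, written against scikit-learn with entirely synthetic stand-in data (the actual work uses real soil attributes such as pH and nutrient levels):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Synthetic stand-in for soil samples: five numeric attributes per sample.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
soil_class = (X[:, 0] + X[:, 2] > 0).astype(int)   # synthetic class labels

# Task 1: classification of soil samples.
Xtr, Xte, ytr, yte = train_test_split(X, soil_class, random_state=0)
clf = RandomForestClassifier(random_state=0).fit(Xtr, ytr)
print("classification accuracy:", clf.score(Xte, yte))

# Task 2: regression to predict an untested attribute from the tested ones.
reg = LinearRegression().fit(X[:, :4], X[:, 4])
print("R^2 for attribute prediction:", reg.score(X[:, :4], X[:, 4]))
```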
The Generalization Ability of Online Algorithms for Dependent Data
Agarwal, Alekh | Duchi, John C.
We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data. We show that the generalization error of any stable online algorithm concentrates around its regret--an easily computable statistic of the online performance of the algorithm--when the underlying ergodic process is $\beta$- or $\phi$-mixing. We show high probability error bounds assuming the loss function is convex, and we also establish sharp convergence rates and deviation bounds for strongly convex losses and several linear prediction problems such as linear and logistic regression, least-squares SVM, and boosting on dependent data. In addition, our results have straightforward applications to stochastic optimization with dependent data, and our analysis requires only martingale convergence arguments; we need not rely on more powerful statistical tools such as empirical process theory.
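The concentration result the abstract describes is, schematically, of the following shape (constants and mixing-dependent factors omitted; under $\beta$- or $\phi$-mixing the effective sample size is further discounted by the mixing time):

```latex
\underbrace{\mathrm{risk}(\bar{w}_T) - \mathrm{risk}(w^{*})}_{\text{generalization error}}
\;\le\; \frac{\mathrm{Regret}_T}{T}
\;+\; O\!\left(\sqrt{\frac{\log(1/\delta)}{T}}\right)
\quad \text{with probability at least } 1-\delta .
```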
Kullback-Leibler aggregation and misspecified generalized linear models
In a regression setup with deterministic design, we study the pure aggregation problem and introduce a natural extension from the Gaussian distribution to distributions in the exponential family. While this extension bears strong connections with generalized linear models, it does not require identifiability of the parameter or even that the model on the systematic component is true. It is shown that this problem can be solved by constrained and/or penalized likelihood maximization, and we derive sharp oracle inequalities that hold both in expectation and with high probability. Finally, all the bounds are proved to be optimal in a minimax sense.
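Sharp oracle inequalities of the kind derived in the paper are, schematically, of the following form (the constant $C$, the remainder rate, and the reference set $\Lambda$ depend on the aggregation problem considered; this is a generic shape, not the paper's exact statement):

```latex
\mathbb{E}\,\mathrm{KL}\big(f^{*},\, \hat{f}\big)
\;\le\; \min_{\lambda \in \Lambda} \mathrm{KL}\big(f^{*},\, f_{\lambda}\big)
\;+\; C\,\frac{\log M}{n},
```

where $M$ is the number of functions being aggregated and $n$ is the sample size.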
Inverse-Category-Frequency based supervised term weighting scheme for text categorization
Term weighting schemes often dominate the performance of many classifiers, such as kNN, centroid-based classifiers, and SVMs. The term weighting scheme most widely used in text categorization, tf.idf, originated in the information retrieval (IR) field. The intuition behind idf seems less reasonable for text categorization than it is for IR. In this paper, we introduce inverse category frequency (icf) into term weighting and propose two novel approaches: the tf.icf and icf-based supervised term weighting schemes. The tf.icf scheme substitutes icf for the idf factor and favors terms occurring in fewer categories, rather than in fewer documents. The icf-based approach combines icf and relevance frequency (rf) to weight terms in a supervised way. Our cross-classifier and cross-corpus experiments show that our proposed approaches are superior or comparable to six supervised term weighting schemes and three traditional schemes in terms of macro-F1 and micro-F1.
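One common form of icf is $\log(|C|/\mathrm{cf}(t))$, where $\mathrm{cf}(t)$ counts the categories in which term $t$ occurs; the sketch below computes tf.icf weights under that assumption (an illustration, not necessarily the paper's exact definition; the toy corpus is hypothetical):

```python
import math
from collections import defaultdict

def tf_icf(docs, labels):
    """tf.icf weights: tf(t, d) * log(|C| / cf(t)); terms occurring in
    every category receive weight 0, so terms rare across categories win."""
    cat_terms = defaultdict(set)            # terms seen in each category
    for doc, lab in zip(docs, labels):
        cat_terms[lab].update(doc)
    n_cats = len(cat_terms)
    cf = defaultdict(int)                   # category frequency per term
    for terms in cat_terms.values():
        for t in terms:
            cf[t] += 1
    weighted = []
    for doc in docs:
        tf = defaultdict(int)
        for t in doc:
            tf[t] += 1
        weighted.append({t: n * math.log(n_cats / cf[t]) for t, n in tf.items()})
    return weighted

docs = [["cheap", "buy", "now"], ["meeting", "agenda"], ["buy", "agenda"]]
print(tf_icf(docs, ["spam", "work", "work"]))  # 'buy' gets weight 0
```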