AITopics

We discuss a novel approach for dealing with single-stage stochastic constraint satisfaction problems (SCSPs) that include random variables over a continuous or large discrete support. Our approach is based on two novel tools: sampled SCSPs and (α,ϑ)-solutions. Instead of explicitly enumerating a very large or infinite set of future scenarios, we employ statistical estimation to determine whether a given assignment is consistent for an SCSP. As in statistical estimation, the quality of our estimate is determined via confidence interval analysis. In contrast to existing approaches based on sampling, we provide likelihood guarantees for the quality of the solutions found. Our approach can be used in concert with existing strategies for solving SCSPs.
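
The sampling idea in this abstract can be illustrated with a minimal sketch. It is not the paper's procedure: the abstract does not define sampled SCSPs or (α,ϑ)-solutions, so the code below only shows the generic pattern of estimating, from random scenarios, the probability that a fixed assignment satisfies a constraint, with a Hoeffding confidence interval. The function `estimate_satisfaction`, the ordering example, and the acceptance threshold are all assumptions made for illustration.

```python
import math
import random


def estimate_satisfaction(assignment, constraint, sample_scenario,
                          n_samples, confidence=0.95, seed=0):
    """Estimate P(constraint holds) for a fixed assignment by sampling.

    Returns (p_hat, low, high), where [low, high] is a two-sided
    Hoeffding confidence interval at the requested confidence level.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_samples):
        scenario = sample_scenario(rng)  # one draw of the random variables
        if constraint(assignment, scenario):
            hits += 1
    p_hat = hits / n_samples
    # Hoeffding: P(|p_hat - p| >= eps) <= 2 * exp(-2 * n * eps^2)
    eps = math.sqrt(math.log(2.0 / (1.0 - confidence)) / (2.0 * n_samples))
    return p_hat, max(0.0, p_hat - eps), min(1.0, p_hat + eps)


# Hypothetical example: does an order quantity of 12 cover a demand
# drawn from Normal(mean=10, sd=2)? The true probability is about 0.841.
def draw_demand(rng):
    return rng.gauss(10.0, 2.0)


def covers_demand(order_qty, demand):
    return order_qty >= demand


p_hat, low, high = estimate_satisfaction(12, covers_demand, draw_demand, 20000)
threshold = 0.8
# Accept the assignment only if the whole interval clears the threshold.
accepted = low >= threshold
```

Accepting on the lower end of the interval, rather than on the point estimate, is one conservative way to turn a sampled estimate into a decision. The interval width shrinks as `1/sqrt(n_samples)`, so tighter guarantees cost more samples.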

policy tree, probability, random variable, (16 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

North America > United States > New York (0.04)
Europe > Netherlands (0.04)
Europe > Ireland > Munster > County Cork > Cork (0.04)

Genre: Overview (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)

Celiberto Jr., Luiz A. (Technological Institute of Aeronautics) | Matsuura, Jackson P. (Technological Institute of Aeronautics) | Lopez de Mantaras, Ramon (Artificial Intelligence Research Institute (IIIA-CSIC)) | Bianchi, Reinaldo A. C. (Centro Universitario da FEI)

Using Cases as Heuristics in Reinforcement Learning: A Transfer Learning Application

In this paper we propose to combine three AI techniques to speed up a Reinforcement Learning algorithm in a Transfer Learning problem: Case-based Reasoning, Heuristically Accelerated Reinforcement Learning and Neural Networks. To do so, we propose a new algorithm, called L3, which works in 3 stages: in the first stage, it uses Reinforcement Learning to learn how to perform one task, and stores the optimal policy for this problem as a case-base; in the second stage, it uses a Neural Network to map actions from one domain to actions in the other domain; and in the third stage, it uses the case-base learned in the first stage as heuristics to speed up the learning performance in a related, but different, task. The RL algorithm used in the first phase is Q-learning, and in the third phase it is the recently proposed Case-based Heuristically ...

Another way to speed up an RL algorithm is by using Transfer Learning, a paradigm of machine learning that reuses knowledge accumulated in a previous task to speed up the learning of a novel, but related, target task [Taylor and Stone, 2009]. This paper investigates the use of the Case-Based Heuristically Accelerated Reinforcement Learning (CB-HARL) algorithm [Bianchi et al., 2009] as a means to transfer learning acquired by one agent during its training in one problem to another agent that has to learn how to solve a similar, but more complex, problem. To do so, we propose a new algorithm, called L3, which works in 3 stages: in the first stage, it uses the Q-learning algorithm [Watkins, 1989] to learn how to perform one task, and stores the optimal policy for this problem as a case-base; in the second stage, it uses a Neural Network to map actions from one domain to actions in the other domain; and in the third stage, it uses the case-base learned in the first stage as heuristics in the CB-HARL algorithm, speeding up the learning process.
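
As a rough illustration of the third stage, the sketch below shows tabular Q-learning in which a heuristic term biases action selection. It is not the L3 or CB-HARL algorithm: the excerpt does not give their update rules, so the additive form `Q + xi * H`, the weight `xi`, and the dictionary standing in for a case-base are assumptions chosen for illustration. The neural-network action mapping of the second stage is omitted entirely.

```python
import random
from collections import defaultdict


def choose_action(q, heuristic, state, actions, epsilon, xi, rng):
    """Epsilon-greedy selection over Q-values plus a weighted heuristic bonus."""
    if rng.random() < epsilon:
        return rng.choice(actions)  # explore
    # Exploit: the heuristic only biases which action is picked;
    # it does not change the stored Q-values.
    return max(actions,
               key=lambda a: q[(state, a)] + xi * heuristic.get((state, a), 0.0))


def q_update(q, state, action, reward, next_state, actions, alpha, gamma):
    """Standard one-step Q-learning update."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


# Hypothetical usage: a policy stored from a source task suggests "right" in state 0.
actions = ["left", "right"]
q = defaultdict(float)
case_base_heuristic = {(0, "right"): 1.0}
rng = random.Random(0)

first_action = choose_action(q, case_base_heuristic, 0, actions,
                             epsilon=0.0, xi=1.0, rng=rng)
q_update(q, 0, first_action, reward=1.0, next_state=1,
         actions=actions, alpha=0.5, gamma=0.9)
```

Keeping the heuristic out of the update rule means a misleading hint from the source task can still be overridden once the learned Q-values grow large enough.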

agent, algorithm, learning, (14 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

South America > Brazil (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Spain (0.04)

Genre:

Research Report (0.86)
Instructional Material > Course Syllabus & Notes (0.56)
Overview (0.55)

Industry: Leisure & Entertainment > Sports (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Revising Horn Theories

Delgrande, James P. (Simon Fraser University) | Peppas, Pavlos (University of Patras)

This paper investigates belief revision where the underlying logic is that governing Horn clauses. It proves to be the case that classical (AGM) belief revision doesn’t immediately generalise to the Horn case. In particular, a standard construction based on a total preorder over possible worlds may violate the accepted (AGM) postulates. Conversely, Horn revision functions in the obvious extension to the AGM approach are not captured by total preorders over possible worlds. We address these difficulties by first restricting the semantic construction to "well behaved" orderings; and second, by augmenting the revision postulates by an additional postulate. This additional postulate is redundant in the AGM approach but not in the Horn case. In a representation result we show that these two approaches coincide. Arguably this work is interesting for several reasons. It extends AGM revision to inferentially-weaker Horn theories; hence it sheds light on the theoretical underpinnings of belief change, as well as generalising the AGM paradigm. Thus, this work is relevant to revision in areas that employ Horn clauses, such as deductive databases and logic programming, as well as areas in which inference is weaker than classical logic, such as in description logic.

formula, postulate, revision, (17 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report (0.86)
Overview (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)

Morales, Javier (Universitat de Barcelona (UB)) | López-Sánchez, Maite (Universitat de Barcelona) | Esteva, Marc (Artificial Intelligence Research Institute (IIIA - CSIC))

Using Experience to Generate New Regulations

Humans have developed jurisprudence as a mechanism for resolving conflicts by drawing on past experience. Following this principle, we propose an approach to enhance a multi-agent system by adding an authority that is able to generate new regulations whenever conflicts arise. Regulations are generated by learning from previous similar situations, using a machine learning technique (based on Case-Based Reasoning) that solves new problems using previous experiences. This approach requires that experiences can be gathered and evaluated, and that they are described in such a way that similar social situations require similar regulations. To evaluate our proposal, we use a simplified traffic scenario in which the agents are traveling cars. Our goals are to avoid collisions between cars and to avoid heavy traffic. When these situations occur, they lead to the synthesis of new regulations. At each simulation step, applicable regulations are evaluated in terms of their effectiveness and necessity. Over time, the system generates a set of regulations that, if followed, improve system performance (i.e., goal achievement).

collision, regulation, scenario, (15 more...)

Twenty-Second International Joint Conference on Artificial Intelligence

Country: Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Overview (0.49)

Industry: Law > Statutes (0.90)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Richtárik, Peter | Takáč, Martin

Iteration Complexity of Randomized Block-Coordinate Descent Methods for Minimizing a Composite Function

arXiv.org Machine Learning, Jul-14-2011

In this paper we develop a randomized block-coordinate descent method for minimizing the sum of a smooth and a simple nonsmooth block-separable convex function and prove that it obtains an $\epsilon$-accurate solution with probability at least $1-\rho$ in at most $O(\tfrac{n}{\epsilon} \log \tfrac{1}{\rho})$ iterations, where $n$ is the number of blocks. For strongly convex functions the method converges linearly. This extends recent results of Nesterov [Efficiency of coordinate descent methods on huge-scale optimization problems, CORE Discussion Paper #2010/2], which cover the smooth case, to composite minimization, while at the same time improving the complexity by a factor of 4 and removing $\epsilon$ from the logarithmic term. More importantly, in contrast with the aforementioned work, in which the author achieves the results by applying the method to a regularized version of the objective function with an unknown scaling factor, we show that this is not necessary, thus achieving true iteration complexity bounds. In the smooth case we also allow for arbitrary probability vectors and non-Euclidean norms. Finally, we demonstrate numerically that the algorithm is able to solve huge-scale $\ell_1$-regularized least squares and support vector machine problems with a billion variables.
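
To make the composite setting concrete, here is a minimal sketch of randomized coordinate descent for $\ell_1$-regularized least squares, i.e. minimizing $\tfrac{1}{2}\|Ax-b\|^2 + \lambda\|x\|_1$. It is a simplification written under several assumptions: blocks of size one, uniform sampling, coordinate-wise Lipschitz constants $L_j = \|A_{:,j}\|^2$, and dense pure-Python lists. The function names are illustrative and are not taken from the paper.

```python
import random


def soft_threshold(v, t):
    """Proximal operator of t * |.|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def random_cd_lasso(A, b, lam, iters, seed=0):
    """Randomized coordinate descent for 0.5 * ||Ax - b||^2 + lam * ||x||_1."""
    rng = random.Random(seed)
    m, n = len(A), len(A[0])
    x = [0.0] * n
    residual = [-bi for bi in b]  # A x - b at x = 0
    # Coordinate-wise Lipschitz constants: squared column norms of A.
    lipschitz = [sum(A[k][j] ** 2 for k in range(m)) for j in range(n)]
    for _ in range(iters):
        j = rng.randrange(n)  # pick one coordinate uniformly at random
        if lipschitz[j] == 0.0:
            continue
        grad_j = sum(A[k][j] * residual[k] for k in range(m))
        new_xj = soft_threshold(x[j] - grad_j / lipschitz[j], lam / lipschitz[j])
        delta = new_xj - x[j]
        if delta != 0.0:
            # Keep the residual in sync so each step costs O(m), not O(m * n).
            for k in range(m):
                residual[k] += A[k][j] * delta
            x[j] = new_xj
    return x


# With A = I the minimizer is the soft-thresholded b.
x_star = random_cd_lasso([[1.0, 0.0], [0.0, 1.0]], [3.0, 0.5], lam=1.0, iters=50)
```

Each iteration touches a single column of `A`, which is what makes this family of methods attractive when the number of variables is very large.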

artificial intelligence, iteration, machine learning, (19 more...)

arXiv.org Machine Learning

1107.2848

Country: Europe > United Kingdom (0.28)

Genre:

Research Report (0.64)
Overview (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.69)

AAAI Conferences, Jul-12-2011

Scalable Event-Based Clustering of Social Media Via Record Linkage Techniques

Reuter, Timo (CITEC, University of Bielefeld) | Cimiano, Philipp (CITEC, University of Bielefeld) | Drumond, Lucas (University of Hildesheim) | Buza, Krisztian (University of Hildesheim) | Schmidt-Thieme, Lars (University of Hildesheim)

We tackle the problem of grouping content available in social media applications such as Flickr, YouTube, Panoramino etc. into clusters of documents describing the same event. This task has been referred to as event identification before. We present a new formalization of the event identification task as a record linkage problem and show that this formulation leads to a principled and highly efficient solution to the problem. We present results on two datasets derived from Flickr (last.fm and upcoming), comparing the results in terms of Normalized Mutual Information and F-Measure with respect to several baselines, showing that a record linkage approach outperforms all baselines as well as a state-of-the-art system. We demonstrate that our approach can scale to large amounts of data, reducing the processing time considerably compared to a state-of-the-art approach. The scalability is achieved by applying an appropriate blocking strategy and relying on a Single Linkage clustering algorithm which avoids the exhaustive computation of pairwise similarities.
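
The combination of blocking and single-linkage clustering can be sketched as follows. This is only an illustration of the general pattern, not the authors' system: the abstract does not specify the blocking key or the similarity measure, so the sliding window over timestamps, the Jaccard similarity on tags, and the fixed threshold are all assumptions.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_events(docs, window, threshold):
    """Group documents into events.

    docs: list of (timestamp, set_of_tags).
    Blocking: each document is compared only with the next `window`
    documents in timestamp order, not with every other document.
    Single linkage: any pair above `threshold` merges two clusters.
    """
    parent = list(range(len(docs)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    order = sorted(range(len(docs)), key=lambda i: docs[i][0])
    for pos, i in enumerate(order):
        for j in order[pos + 1: pos + 1 + window]:
            if jaccard(docs[i][1], docs[j][1]) >= threshold:
                parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(docs)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Hypothetical documents: (timestamp, tags).
docs = [
    (1, {"concert", "berlin"}),
    (2, {"concert", "berlin", "rock"}),
    (50, {"football", "match"}),
    (51, {"football", "match", "goal"}),
]
events = cluster_events(docs, window=2, threshold=0.5)
```

With `n` documents, the window limits the work to roughly `n * window` comparisons instead of `n * (n - 1) / 2`, which is the point of blocking.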

artificial intelligence, data mining, machine learning, (17 more...)

Fifth International AAAI Conference on Weblogs and Social Media

Country:

Europe > Germany (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre:

Research Report > Promising Solution (0.48)
Overview > Innovation (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.48)

Oblinger, Daniel (Defense Advanced Research Projects Agency)

Toward a Computational Model of Transfer

AI Magazine, Jul-9-2011

TLP and the field as a whole made great strides in each of these dimensions. Indeed, the program has helped TL become a recognized subdiscipline of machine learning. Other articles in this special issue detail the work accomplished in TLP; this article focuses on a broad framing of the research conducted and an assessment of its progress, limitations, and challenges, from an admittedly personal but DARPA-influenced perspective. Traditionally, every DARPA program has focused its research by requiring a precise measure of progress. The DARPA TLP decided to measure transfer by comparing the learning of tasks A and B versus the learning of B alone. In figure 1 the curve labeled B represents a traditional learning curve of the performance on target task B as a function of the number of training instances.

artificial intelligence, knowledge, machine learning, (17 more...)

AI Magazine

Country: North America > United States (0.58)

Genre:

Collection > Journal > Special Issue (0.54)
Overview (0.34)

Industry:

Leisure & Entertainment > Sports > Football (0.70)
Government > Regional Government > North America Government > United States Government (0.58)
Government > Military (0.58)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.70)

An Application of Transfer to American Football: From Observation of Raw Video to Control in a Simulated Environment

Stracuzzi, David J. (Sandia National Laboratories) | Fern, Alan (Oregon State University) | Ali, Kamal (Stanford University) | Hess, Robin (Oregon State University) | Pinto, Jervis (Oregon State University) | Li, Nan (Carnegie Mellon University) | Konik, Tolga (Stanford University) | Shapiro, Daniel G. (Institute for the Study of Learning and Expertise)

AI Magazine, Jul-9-2011

Automatic transfer of learned knowledge from one task or domain to another offers great potential to simplify and expedite the construction and deployment of intelligent systems. In practice however, there are many barriers to achieving this goal. In this article, we present a prototype system for the real-world context of transferring knowledge of American football from video observation to control in a game simulator. We trace an example play from the raw video through execution and adaptation in the simulator, highlighting the system's component algorithms along with issues of complexity, generality, and scale. We then conclude with a discussion of the implications of this work for other applications, along with several possible improvements.

artificial intelligence, knowledge management, machine learning, (18 more...)

AI Magazine

Country: North America > United States (1.00)

Genre:

Workflow (1.00)
Overview (0.67)
Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Sports > Football (1.00)
Leisure & Entertainment > Games (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

AAAI Conferences, Jul-5-2011

A Novel Technique for Compressing Pattern Databases in the Pancake Sorting Problems

Keshtkaran, Morteza (Shiraz University) | Taghizadeh, Roohollah (Shiraz University) | Ziarati, Koorush (Shiraz University)

In this paper we present a lossless technique to compress pattern databases (PDBs) in Pancake Sorting problems. This compression technique, together with the choice of zero-cost operators in the construction of additive PDBs, greatly reduces the memory requirement for PDBs in these problems, making otherwise intractable problems efficiently solvable. Using this method, we can also construct some PDBs that are independent of problem size, which removes the need to construct new PDBs for new problems with different numbers of pancakes. In addition to our compression technique, by maximizing over the heuristic value of additive PDBs and the modified version of the gap heuristic, we have obtained powerful heuristics for the burnt pancake problem.
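
For readers unfamiliar with the gap heuristic mentioned here, the sketch below shows its plain form for the ordinary (unburnt) pancake problem as it is commonly described: count adjacent positions whose sizes differ by more than one, treating the plate as a pancake of size n + 1. It does not reproduce the paper's modified version for burnt pancakes or its PDB compression. The helper `combined_heuristic` only illustrates taking the maximum of two heuristic values, and its `pdb_value` argument is a placeholder for a value that would come from a pattern database lookup.

```python
def gap_heuristic(stack):
    """Plain gap heuristic for the unburnt pancake problem.

    stack: pancake sizes 1..n listed from top to bottom.
    A gap is a pair of neighbours whose sizes differ by more than 1;
    the plate below the stack counts as a pancake of size n + 1.
    """
    extended = list(stack) + [len(stack) + 1]
    return sum(1 for a, b in zip(extended, extended[1:]) if abs(a - b) > 1)


def combined_heuristic(stack, pdb_value):
    """Take the maximum of a (hypothetical) PDB value and the gap heuristic."""
    return max(pdb_value, gap_heuristic(stack))


sorted_gaps = gap_heuristic([1, 2, 3, 4])
mixed_gaps = gap_heuristic([3, 1, 2, 4])
```

Taking the maximum of two admissible heuristics yields a heuristic that is still admissible and at least as informed as either one.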

gap space, pattern tile, pdb, (14 more...)

Fourth Annual Symposium on Combinatorial Search

Country: Asia > Middle East > Iran > Fars Province > Shiraz (0.04)

Genre: Overview (0.34)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)

Law, Edith | Ahn, Luis von

Human Computation

Morgan & Claypool Publishers, Jun-30-2011

Human computation is a new and evolving research area that centers around harnessing human intelligence to solve computational problems that are beyond the scope of existing Artificial Intelligence (AI) algorithms. With the growth of the Web, human computation systems can now leverage the abilities of an unprecedented number of people via the Web to perform complex computation. There are various genres of human computation applications that exist today. Games with a purpose (e.g., the ESP Game) specifically target online gamers who generate useful data (e.g., image tags) while playing an enjoyable game. Crowdsourcing marketplaces (e.g., Amazon Mechanical Turk) are human computation systems that coordinate workers to perform tasks in exchange for monetary rewards.

artificial intelligence, optical character recognition, top description table, (13 more...)

Morgan & Claypool Publishers

Genre:

Overview (0.32)
Personal > Honors (0.31)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Communications > Social Media > Crowdsourcing (0.73)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.72)