AITopics | Search

Collaborating Authors

Search

"Search is a problem-solving technique that systematically explores a space of problem states, i.e., successive and alternative stages in the problem-solving process. Examples of problem states might include the different board configurations in a game or intermediate steps in a reasoning process. This space of alternative solutions is then searched to find an answer. Newell and Simon (1976) have argued that this is the essential basis of human problem solving. Indeed, when a chess player examines the effects of different moves or a doctor considers a number of alternative diagnoses, they are searching among alternatives."
– from Section 1.2 of Chapter One of George F. Luger's textbook, Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 5th Edition (Addison-Wesley; 2005).

News Overviews Instructional Materials AI-Alerts Classics

Leveraging vague prior information in general models via iteratively constructed Gamma-minimax estimators

Qiu, Hongxiang, Luedtke, Alex

arXiv.org Machine LearningDec-10-2020

Gamma-minimax estimation is an approach to incorporate prior information into an estimation procedure when it is implausible to specify one particular prior distribution. In this approach, we aim for an estimator that minimizes the worst-case Bayes risk over a set $\Gamma$ of prior distributions. Traditionally, Gamma-minimax estimation is defined for parametric models. In this paper, we define Gamma-minimaxity for general models and propose iterative algorithms with convergence guarantees to compute Gamma-minimax estimators for a general model space and a set of prior distributions constrained by generalized moments. We also propose encoding the space of candidate estimators by neural networks to enable flexible estimation. We illustrate our method in two settings, namely entropy estimation and a problem that arises in biodiversity studies.

bayes risk, estimator, sup, (17 more...)

arXiv.org Machine Learning

2012.05465

Country:

North America > United States > New York > New York County > New York City (0.14)
Europe > Poland (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Hybrid Quantum Computing -- Tabu Search Algorithm for Partitioning Problems: preliminary study on the Traveling Salesman Problem

Osaba, Eneko, Villar-Rodriguez, Esther, Oregi, Izaskun, Moreno-Fernandez-de-Leceta, Aitor

arXiv.org Artificial IntelligenceDec-9-2020

Quantum Computing is considered as the next frontier in computing, and it is attracting a lot of attention from the current scientific community. This kind of computation provides to researchers with a revolutionary paradigm for addressing complex optimization problems, offering a significant speed advantage and an efficient search ability. Anyway, despite hopes placed in this field are high, Quantum Computing is still in an incipient stage of development. For this reason, present architectures show certain limitations in terms of computational capabilities and performance. These limitations have motivated the carrying out of this paper. With this paper, we contribute to the field introducing a novel solving scheme coined as hybrid Quantum Computing - Tabu Search Algorithm. Main pillars of operation of the proposed method are a greater control over the access to quantum resources, and a considerable reduction of non-profitable accesses. For assessing the quality of our method, we have used the well-known TSP as benchmarking problem. Furthermore, the performance of QTA has been compared with QBSolv -- a state-of-the-art decomposing solver -- on a set of 7 different TSP instances. The obtained experimental outcomes support the preliminary conclusion that QTA is an approach which offers promising results for solving partitioning problems, while it drastically reduces the access to QC resources. Furthermore, we also contribute in this paper to the field of Transfer Optimization by developing and using a evolutionary multiform multitasking algorithm as initialization method for the introduced hybrid Quantum Computing - Tabu Search Algorithm. Concretely, the evolutionary multitasking algorithm implemented is a multiform variant of the recently published Coevolutionary Variable Neighborhood Search Algorithm for Discrete Multitasking.

algorithm, optimization, quantum computing, (14 more...)

arXiv.org Artificial Intelligence

2012.04984

Country:

Europe > Spain (0.04)
North America > Canada (0.04)
Asia (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

Ensemble Squared: A Meta AutoML System

Yoo, Jason, Joseph, Tony, Yung, Dylan, Nasseri, S. Ali, Wood, Frank

arXiv.org Artificial IntelligenceDec-9-2020

The continuing rise in the number of problems amenable to machine learning solutions, coupled with simultaneous growth in both computing power and variety of machine learning techniques has led to an explosion of interest in automated machine learning (AutoML). This paper presents Ensemble Squared (Ensemble$^2$), a "meta" AutoML system that ensembles at the level of AutoML systems. Ensemble$^2$ exploits the diversity of existing, competing AutoML systems by ensembling the top-performing models simultaneously generated by a set of them. Our work shows that diversity in AutoML systems is sufficient to justify ensembling at the AutoML system level. In demonstrating this, we also establish a new state of the art AutoML result on the OpenML classification challenge.

automl system, ensemble 2, pipeline, (11 more...)

arXiv.org Artificial Intelligence

2012.0539

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.68)
Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.94)

Add feedback

Minimax Regret Optimisation for Robust Planning in Uncertain Markov Decision Processes

Rigter, Marc, Lacerda, Bruno, Hawes, Nick

arXiv.org Artificial IntelligenceDec-8-2020

The parameters for a Markov Decision Process (MDP) often cannot be specified exactly. Uncertain MDPs (UMDPs) capture this model ambiguity by defining sets which the parameters belong to. Minimax regret has been proposed as an objective for planning in UMDPs to find robust policies which are not overly conservative. In this work, we focus on planning for Stochastic Shortest Path (SSP) UMDPs with uncertain cost and transition functions. We introduce a Bellman equation to compute the regret for a policy. We propose a dynamic programming algorithm that utilises the regret Bellman equation, and show that it optimises minimax regret exactly for UMDPs with independent uncertainties. For coupled uncertainties, we extend our approach to use options to enable a trade off between computation and solution quality. We evaluate our approach on both synthetic and real-world domains, showing that it significantly outperforms existing baselines.

adversary, reg, umdp, (13 more...)

arXiv.org Artificial Intelligence

2012.04626

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Atlantic Ocean > North Atlantic Ocean > Norwegian Sea (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.84)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.84)

Add feedback

MIT Develops A New System That Creates Different Kind Of Robots

#artificialintelligenceDec-7-2020, 10:25:14 GMT

"Robot design is still a very manual process. RoboGrammar is a way to come up with new, more inventive robot designs." The way humans move effortlessly can obscure them of the complex motions of the joints and limbs. One would be shocked if they were to build a robot considering all the degrees of freedom, the weight of the payload, 3D geometry and more. The sophisticated designs of robots have been more or less the same.

artificial intelligence, create different kind, robot design, (13 more...)

#artificialintelligence

Genre: Research Report (0.75)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.33)

Add feedback

Mapping Network States Using Connectivity Queries

Rodríguez, Alexander, Adhikari, Bijaya, González, Andrés D., Nicholson, Charles, Vullikanti, Anil, Prakash, B. Aditya

arXiv.org Artificial IntelligenceDec-7-2020

Can we infer all the failed components of an infrastructure network, given a sample of reachable nodes from supply nodes? One of the most critical post-disruption processes after a natural disaster is to quickly determine the damage or failure states of critical infrastructure components. However, this is non-trivial, considering that often only a fraction of components may be accessible or observable after a disruptive event. Past work has looked into inferring failed components given point probes, i.e. with a direct sample of failed components. In contrast, we study the harder problem of inferring failed components given partial information of some `serviceable' reachable nodes and a small sample of point probes, being the first often more practical to obtain. We formulate this novel problem using the Minimum Description Length (MDL) principle, and then present a greedy algorithm that minimizes MDL cost effectively. We evaluate our algorithm on domain-expert simulations of real networks in the aftermath of an earthquake. Our algorithm successfully identify failed components, especially the critical ones affecting the overall system performance.

connectivity query, demand node, node, (16 more...)

arXiv.org Artificial Intelligence

2012.03413

Country:

North America > United States > California (0.14)
North America > United States > Virginia (0.04)
North America > United States > Tennessee > Shelby County (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry: Energy > Power Industry (1.00)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory > Minimum Complexity Machines (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.48)

Add feedback

Adaptive Local Bayesian Optimization Over Multiple Discrete Variables

Kim, Taehyeon, Ahn, Jaeyeon, Kim, Nakyil, Yun, Seyoung

arXiv.org Machine LearningDec-7-2020

In the machine learning algorithms, the choice of the hyperparameter is often an art more than a science, requiring labor-intensive search with expert experience. Therefore, automation on hyperparameter optimization to exclude human intervention is a great appeal, especially for the black-box functions. Recently, there have been increasing demands of solving such concealed tasks for better generalization, though the task-dependent issue is not easy to solve. The Black-Box Optimization challenge (NeurIPS 2020) required competitors to build a robust black-box optimizer across different domains of standard machine learning problems. This paper describes the approach of team KAIST OSI in a step-wise manner, which outperforms the baseline algorithms by up to +20.39%. We first strengthen the local Bayesian search under the concept of region reliability. Then, we design a combinatorial kernel for a Gaussian process kernel. In a similar vein, we combine the methodology of Bayesian and multi-armed bandit,(MAB) approach to select the values with the consideration of the variable types; the real and integer variables are with Bayesian, while the boolean and categorical variables are with MAB. Empirical evaluations demonstrate that our method outperforms the existing methods across different tasks.

kernel, optimization, trust region, (13 more...)

arXiv.org Machine Learning

2012.03501

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)

Genre: Research Report (0.82)

Industry: Education (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.71)

Add feedback

Adaptive Stress Testing: Finding Likely Failure Events with Reinforcement Learning

Lee, Ritchie (Stinger Ghaffarian Technologies) | Mengshoel, Ole J. (Norwegian University of Science and Technology) | Saksena, Anshu (Johns Hopkins University Applied Physics Laboratory) | Gardner, Ryan W. (Johns Hopkins University Applied Physics Laboratory) | Genin, Daniel (Johns Hopkins University Applied Physics Laboratory) | Silbermann, Joshua | Owen, Michael (MIT Lincoln Laboratory) | Kochenderfer, Mykel J. (Stanford University)

Journal of Artificial Intelligence ResearchDec-6-2020

Finding the most likely path to a set of failure states is important to the analysis of safety-critical systems that operate over a sequence of time steps, such as aircraft collision avoidance systems and autonomous cars. In many applications such as autonomous driving, failures cannot be completely eliminated due to the complex stochastic environment in which the system operates. As a result, safety validation is not only concerned about whether a failure can occur, but also discovering which failures are most likely to occur. This article presents adaptive stress testing (AST), a framework for finding the most likely path to a failure event in simulation. We consider a general black box setting for partially observable and continuous-valued systems operating in an environment with stochastic disturbances. We formulate the problem as a Markov decision process and use reinforcement learning to optimize it. The approach is simulation-based and does not require internal knowledge of the system, making it suitable for black-box testing of large systems. We present different formulations depending on whether the state is fully observable or partially observable. In the latter case, we present a modified Monte Carlo tree search algorithm that only requires access to the pseudorandom number generator of the simulator to overcome partial observability. We also present an extension of the framework, called differential adaptive stress testing (DAST), that can find failures that occur in one system but not in another. This type of differential analysis is useful in applications such as regression testing, where we are concerned with finding areas of relative weakness compared to a baseline. We demonstrate the effectiveness of the approach on an aircraft collision avoidance application, where a prototype aircraft collision avoidance system is stress tested to find the most likely scenarios of near mid-air collision.

aircraft, collision avoidance system, simulator, (15 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.12190

AI Access Foundation

12190

Journal of Artificial Intelligence Research

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Lexington (0.04)
(4 more...)

Industry:

Transportation > Air (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Biclustering and Boolean Matrix Factorization in Data Streams

Neumann, Stefan, Miettinen, Pauli

arXiv.org Artificial IntelligenceDec-5-2020

We study the clustering of bipartite graphs and Boolean matrix factorization in data streams. We consider a streaming setting in which the vertices from the left side of the graph arrive one by one together with all of their incident edges. We provide an algorithm that, after one pass over the stream, recovers the set of clusters on the right side of the graph using sublinear space; to the best of our knowledge, this is the first algorithm with this property. We also show that after a second pass over the stream, the left clusters of the bipartite graph can be recovered and we show how to extend our algorithm to solve the Boolean matrix factorization problem (by exploiting the correspondence of Boolean matrices and bipartite graphs). We evaluate an implementation of the algorithm on synthetic data and on real-world data. On real-world datasets the algorithm is orders of magnitudes faster than a static baseline algorithm while providing quality results within a factor 2 of the baseline algorithm. Our algorithm scales linearly in the number of edges in the graph. Finally, we analyze the algorithm theoretically and provide sufficient conditions under which the algorithm recovers a set of planted clusters under a standard random graph model.

algorithm, data structure, vertex, (15 more...)

arXiv.org Artificial Intelligence

2012.03138

Country:

Europe > Austria > Vienna (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Europe > Finland > Northern Savo > Kuopio (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.46)

Add feedback

Online learning with dynamics: A minimax perspective

Bhatia, Kush, Sridharan, Karthik

arXiv.org Machine LearningDec-3-2020

We study the problem of online learning with dynamics, where a learner interacts with a stateful environment over multiple rounds. In each round of the interaction, the learner selects a policy to deploy and incurs a cost that depends on both the chosen policy and current state of the world. The state-evolution dynamics and the costs are allowed to be time-varying, in a possibly adversarial way. In this setting, we study the problem of minimizing policy regret and provide non-constructive upper bounds on the minimax rate for the problem. Our main results provide sufficient conditions for online learnability for this setup with corresponding rates. The rates are characterized by 1) a complexity term capturing the expressiveness of the underlying policy class under the dynamics of state change, and 2) a dynamics stability term measuring the deviation of the instantaneous loss from a certain counterfactual loss. Further, we provide matching lower bounds which show that both the complexity terms are indeed necessary. Our approach provides a unifying analysis that recovers regret bounds for several well studied problems including online learning with memory, online control of linear quadratic regulators, online Markov decision processes, and tracking adversarial targets. In addition, we show how our tools help obtain tight regret bounds for a new problems (with non-linear dynamics and non-convex losses) for which such bounds were not known prior to our work.

adversary, online, stability parameter, (16 more...)

arXiv.org Machine Learning

2012.01705

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.49)

Industry: Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.61)

Add feedback