AITopics | Search


Search

"Search is a problem-solving technique that systematically explores a space of problem states, i.e., successive and alternative stages in the problem-solving process. Examples of problem states might include the different board configurations in a game or intermediate steps in a reasoning process. This space of alternative solutions is then searched to find an answer. Newell and Simon (1976) have argued that this is the essential basis of human problem solving. Indeed, when a chess player examines the effects of different moves or a doctor considers a number of alternative diagnoses, they are searching among alternatives."
– from Section 1.2 of Chapter One of George F. Luger's textbook, Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 5th Edition (Addison-Wesley; 2005).
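As a small illustration of the definition quoted above, the sketch below runs breadth-first search over a toy state space. The puzzle (reach a target integer from 1 using the moves "add one" and "double") and the names `breadth_first_search`, `toy_successors`, and `TARGET` are assumptions made for this example and do not come from Luger's text.

```python
from collections import deque


def breadth_first_search(start, is_goal, successors):
    """Return a shortest list of states from start to a goal, or None."""
    frontier = deque([start])
    parent = {start: None}  # doubles as the set of visited states
    while frontier:
        state = frontier.popleft()
        if is_goal(state):
            # Walk the parent links back to the start to rebuild the path.
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1]
        for nxt in successors(state):
            if nxt not in parent:
                parent[nxt] = state
                frontier.append(nxt)
    return None  # the reachable state space held no goal state


# Hypothetical toy problem: states are integers, moves are +1 and *2.
TARGET = 10


def toy_successors(n):
    # Pruning states above the target keeps the state space finite.
    return [m for m in (n + 1, n * 2) if m <= TARGET]


path = breadth_first_search(1, lambda n: n == TARGET, toy_successors)
print(path)  # [1, 2, 4, 5, 10]
```

Breadth-first order is only one way to explore the space; it is used here because it returns a path with the fewest moves.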


Solving Rubik's Cube

#artificialintelligence | Oct-15-2019, 17:54:08 GMT

We've trained a pair of neural networks to solve the Rubik's Cube with a human-like robot hand.

cube, rubik

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (0.89)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.89)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.33)


Extremely dexterous robot can solve a Rubik's cube one-handed

New Scientist | Oct-15-2019, 17:05:15 GMT

Artificial intelligence can now solve a Rubik's cube one-handed. The task requires so much dexterity that even humans find the movements difficult. The system was developed by researchers at OpenAI, a technology firm that has previously created an AI that could outplay humans at the video game Dota 2. The team taught an AI to control a commercially available robotic hand developed by the Shadow Robot Company. The AI learned using a technique called reinforcement learning, which involves trial and error. "It starts from not knowing anything about how to move a hand or how a cube would react if you push on the sides or on the faces," says Peter Welinder, part of the team.

cube one-handed, dexterous robot, rubik, (1 more...)

New Scientist

AI-Alerts: 2019 > 2019-10 > AAAI AI-Alert for Oct 22, 2019 (1.00)

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.75)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.75)


Watch OpenAI's 'human-like' robot solve a Rubik's Cube one-handed – TechCrunch

#artificialintelligence | Oct-15-2019, 16:14:38 GMT

There's always been something so annoying about people who found the need to stack additional challenges onto solving a Rubik's Cube quickly, whether it was doing it blindfolded or while juggling or one-handed. While it might have just been a challenge for them, it also seemed like a need to show off. OpenAI is clearly interested in showing off what its Dactyl robotic hand can do with a Rubik's Cube. The organization announced that the robot has learned to solve a Rubik's Cube one-handed, an accomplishment that speaks to the robot's dexterity in handling and manipulating the cube more than anything. Previously, we had seen the robot interact with unknown objects without any real-world training, only virtual simulations.

openai, rubik, watch openai, (2 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(3 more...)


OpenAI teaches a robotic hand to solve a Rubik's cube

#artificialintelligence | Oct-15-2019, 16:12:25 GMT

Robots with truly humanlike dexterity are far from becoming reality, but progress accelerated by AI has brought us closer to achieving this vision than ever before. In a research paper published in September, a team of scientists at Google detailed their tests with a robotic hand that enabled it to rotate Baoding balls with minimal training data. And at a computer vision conference in June, MIT researchers presented their work on an AI model capable of predicting the tactility of physical things from snippets of visual data alone. Now, OpenAI -- the San Francisco-based AI research firm cofounded by Elon Musk and others, with backing from luminaries like LinkedIn cofounder Reid Hoffman and former Y Combinator president Sam Altman -- says it's on the cusp of solving something of a grand challenge in robotics and AI systems: solving a Rubik's cube. Unlike breakthroughs achieved by teams at the University of California, Irvine and elsewhere, which leveraged machines tailor-built to manipulate Rubik's cubes with speed, the approach devised by OpenAI researchers uses a five-fingered humanoid hand guided by an AI model with 13,000 years of cumulative experience -- on the same order of magnitude as the 40,000 years used by OpenAI's Dota-playing bot.

cube, robotic hand, rubik, (14 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.24)
North America > United States > California > Orange County > Irvine (0.24)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.35)

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(3 more...)


Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions

Buesing, Lars, Heess, Nicolas, Weber, Theophane

arXiv.org Artificial Intelligence | Oct-15-2019

A plethora of problems in AI, engineering and the sciences are naturally formalized as inference in discrete probabilistic models. Exact inference is often prohibitively expensive, as it may require evaluating the (unnormalized) target density on its entire domain. Here we consider the setting where only a limited budget of calls to the unnormalized density oracle is available, raising the challenge of where in the domain to allocate these function calls in order to construct a good approximate solution. We formulate this problem as an instance of sequential decision-making under uncertainty and leverage methods from reinforcement learning for probabilistic inference with budget constraints. In particular, we propose the TreeSample algorithm, an adaptation of Monte Carlo Tree Search to approximate inference. This algorithm caches all previous queries to the density oracle in an explicit search tree, and dynamically allocates new queries based on a "best-first" heuristic for exploration, using existing upper confidence bound methods. Our non-parametric inference method can be effectively combined with neural networks that compile approximate conditionals of the target, which are then used to guide the inference search and enable generalization across multiple target distributions. We show empirically that TreeSample outperforms standard approximate inference methods on synthetic factor graphs.

artificial intelligence, bayesian inference, inference, (18 more...)

arXiv.org Artificial Intelligence

1910.06862

Country: Europe > United Kingdom > England (0.14)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)


Negatively Correlated Search as a Parallel Exploration Search Strategy

Yang, Peng, Tang, Ke, Yao, Xin

arXiv.org Artificial Intelligence | Oct-15-2019

Parallel exploration is a key to a successful search. The recently proposed Negatively Correlated Search (NCS) achieved this ability by constructing a set of negatively correlated search processes and has been applied to many real-world problems. In NCS, the key technique is to explicitly model and maximize the diversity among search processes in parallel. However, the original diversity model was mostly devised by intuition, which introduced several drawbacks to NCS. In this paper, a mathematically principled diversity model is proposed to solve the existing drawbacks of NCS, resulting in a new NCS framework. A new instantiation of NCS is also derived and its effectiveness is verified on a set of multi-modal continuous optimization problems.

diversity model, fitness value, sub, (15 more...)

arXiv.org Artificial Intelligence

1910.07151

Country:

North America > United States (0.14)
Asia > China > Guangdong Province > Shenzhen (0.04)
Oceania > New Zealand (0.04)
(4 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.87)


Evolving Gaussian Process kernels from elementary mathematical expressions

Roman, Ibai, Santana, Roberto, Mendiburu, Alexander, Lozano, Jose A.

arXiv.org Machine Learning | Oct-14-2019

Choosing the most adequate kernel is crucial in many Machine Learning applications. Gaussian Process is a state-of-the-art technique for regression and classification that heavily relies on a kernel function. However, in the Gaussian Process literature, kernels have usually been either ad hoc designed, selected from a predefined set, or searched for in a space of compositions of kernels which have been defined a priori. In this paper, we propose a Genetic-Programming algorithm that represents a kernel function as a tree of elementary mathematical expressions. By means of this representation, a wider set of kernels can be modeled, where potentially better solutions can be found, although new challenges also arise. The proposed algorithm is able to overcome these difficulties and find kernels that accurately model the characteristics of the data. This method has been tested in several real-world time-series extrapolation problems, improving the state-of-the-art results while reducing the complexity of the kernels.

algorithm, hyperparameter, kernel, (16 more...)

arXiv.org Machine Learning

1910.05173

Country:

North America > Canada > Quebec (0.05)
Oceania > Australia (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
(5 more...)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
(3 more...)


Old Dog Learns New Tricks: Randomized UCB for Bandit Problems

Vaswani, Sharan, Mehrabian, Abbas, Durand, Audrey, Kveton, Branislav

arXiv.org Machine Learning | Oct-10-2019

We propose $\tt RandUCB$, a bandit strategy that uses theoretically derived confidence intervals similar to upper confidence bound (UCB) algorithms, but, akin to Thompson sampling (TS), uses randomization to trade off exploration and exploitation. In the $K$-armed bandit setting, we show that there are infinitely many variants of $\tt RandUCB$, all of which achieve the minimax-optimal $\widetilde{O}(\sqrt{K T})$ regret after $T$ rounds. Moreover, in a specific multi-armed bandit setting, we show that both UCB and TS can be recovered as special cases of $\tt RandUCB$. For structured bandits, where each arm is associated with a $d$-dimensional feature vector and rewards are distributed according to a linear or generalized linear model, we prove that $\tt RandUCB$ achieves the minimax-optimal $\widetilde{O}(d \sqrt{T})$ regret even in the case of infinite arms. We demonstrate the practical effectiveness of $\tt RandUCB$ with experiments in both the multi-armed and structured bandit settings. Our results illustrate that $\tt RandUCB$ matches the empirical performance of TS while obtaining the theoretically optimal regret bounds of UCB algorithms, thus achieving the best of both worlds.
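The general idea of randomizing a UCB-style index can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the Bernoulli rewards, the uniform sampling over `z_grid`, the `sqrt(log t / n)` confidence width, and the function name `randomized_ucb` are all assumptions made for this example; the abstract does not specify the sampling distribution or the constants.

```python
import math
import random


def randomized_ucb(arm_means, horizon, z_grid=(0.0, 0.5, 1.0, 1.5, 2.0), seed=0):
    """Play a Bernoulli bandit with a randomized confidence multiplier.

    Assumes horizon >= len(arm_means). Returns the pull count of each arm.
    """
    rng = random.Random(seed)
    k = len(arm_means)
    pulls = [0] * k
    totals = [0.0] * k

    def pull(arm):
        # Simulated Bernoulli reward (an assumption of this sketch).
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        pulls[arm] += 1
        totals[arm] += reward

    for arm in range(k):  # pull every arm once so the means are defined
        pull(arm)

    for t in range(k + 1, horizon + 1):
        # One multiplier per round, shared by all arms (assumed coupling).
        z = rng.choice(z_grid)
        index = [
            totals[a] / pulls[a] + z * math.sqrt(math.log(t) / pulls[a])
            for a in range(k)
        ]
        pull(max(range(k), key=index.__getitem__))
    return pulls


pulls = randomized_ucb([0.1, 0.5, 0.9], horizon=3000)
print(pulls)
```

In this sketch, passing a single-element grid such as `z_grid=(2.0,)` removes the randomness and leaves a deterministic UCB-style index.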

health & medicine, randucb, upstream oil & gas, (22 more...)

arXiv.org Machine Learning

1910.04928

Country:

North America > United States > New York (0.14)
Oceania > Australia (0.14)

Genre: Research Report > New Finding (0.66)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy > Oil & Gas > Upstream (0.48)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)


A simple and effective hybrid genetic search for the job sequencing and tool switching problem

Mecler, Jordana, Subramanian, Anand, Vidal, Thibaut

arXiv.org Artificial Intelligence | Oct-10-2019

The job sequencing and tool switching problem (SSP) has been extensively studied in the field of operations research, due to its practical relevance and methodological interest. Given a machine that can load a limited number of tools simultaneously and a number of jobs that each require a subset of the available tools, the SSP seeks a job sequence that minimizes the number of tool switches in the machine. To solve this problem, we propose a simple and efficient hybrid genetic search based on a generic solution representation, a tailored decoding operator, efficient local searches and diversity management techniques. To guide the search, we introduce a secondary objective designed to break ties. These techniques make it possible to explore structurally different solutions and escape local optima. As shown in our computational experiments on classical benchmark instances, our algorithm significantly outperforms all previous approaches while remaining simple to understand and easy to implement. Finally, we report results on a new set of larger instances to stimulate future research and comparative analyses.
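To make the objective concrete, the sketch below counts tool switches for a fixed job sequence using the "keep tool needed soonest" eviction rule known from the SSP literature, then brute-forces the best sequence on a tiny instance. It is not the hybrid genetic search from the abstract, and the abstract does not say how its decoding operator works. The function name `tool_switches`, the example jobs, and the convention that loading into an empty slot is free are assumptions of this example.

```python
from itertools import permutations


def tool_switches(sequence, capacity):
    """Count tool switches for a fixed sequence of jobs (sets of tools)."""
    n = len(sequence)

    def next_use(tool, start):
        # Position of the first job at or after `start` that needs the tool.
        for k in range(start, n):
            if tool in sequence[k]:
                return k
        return n  # never needed again

    magazine = set()
    switches = 0
    for i, job in enumerate(sequence):
        if len(job) > capacity:
            raise ValueError("job needs more tools than the magazine holds")
        for tool in sorted(job - magazine):
            if len(magazine) >= capacity:
                # Evict the tool, not needed by this job, whose next use is
                # furthest away (keep the tools needed soonest).
                removable = sorted(magazine - job)
                victim = max(removable, key=lambda t: next_use(t, i + 1))
                magazine.remove(victim)
                switches += 1
            magazine.add(tool)
    return switches


# Hypothetical instance: four jobs, a magazine with three slots.
jobs = [{1, 2}, {3, 4}, {1, 2}, {3, 5}]
print(tool_switches(jobs, capacity=3))  # 3

# Exhaustive search is only feasible for tiny instances like this one.
best = min(tool_switches(order, 3) for order in permutations(jobs))
print(best)  # 2
```

Exhaustive enumeration grows factorially with the number of jobs, which is why heuristics are used on realistic instances.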

algorithm, objective, tool switch, (16 more...)

arXiv.org Artificial Intelligence

1910.10021

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.05)
South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.04)
North America > United States (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)


Kernels over Sets of Finite Sets using RKHS Embeddings, with Application to Bayesian (Combinatorial) Optimization

Buathong, Poompol, Ginsbourger, David, Krityakierne, Tipaluck

arXiv.org Machine Learning | Oct-9-2019

We focus on kernel methods for set-valued inputs and their application to Bayesian set optimization, notably combinatorial optimization. We introduce a class of (strictly) positive definite kernels that relies on Reproducing Kernel Hilbert Space embeddings, and successfully generalizes "double sum" set kernels recently considered in Bayesian set optimization, which turn out to be unsuitable for combinatorial optimization. The proposed class of kernels, for which we provide theoretical guarantees, essentially consists in applying an outer kernel on top of the canonical distance induced by a double sum kernel. Proofs of theoretical results about considered kernels are complemented by a few practicalities regarding hyperparameter fitting. We furthermore demonstrate the applicability of our approach in prediction and optimization tasks, relying both on toy examples and on two test cases from mechanical engineering and hydrogeology, respectively. Experimental results illustrate the added value of the approach and open new perspectives in prediction and sequential design with set inputs.
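The construction described in the abstract (an outer kernel applied to the distance induced by a double sum kernel) can be sketched as below. The choice of a Gaussian base kernel, a Gaussian outer kernel, the unnormalized double sum, and the names `rbf`, `double_sum`, `squared_distance`, and `set_kernel` are assumptions made for this example; the abstract does not fix these choices.

```python
import math


def rbf(x, y, lengthscale=1.0):
    """Gaussian base kernel between two points given as tuples of floats."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq / (2.0 * lengthscale ** 2))


def double_sum(A, B, base=rbf):
    """Double sum kernel: sum of base-kernel values over all pairs."""
    return sum(base(a, b) for a in A for b in B)


def squared_distance(A, B, base=rbf):
    """Squared distance induced by the double sum kernel."""
    d2 = double_sum(A, A, base) + double_sum(B, B, base) - 2.0 * double_sum(A, B, base)
    return max(d2, 0.0)  # guard against tiny negative rounding errors


def set_kernel(A, B, outer_lengthscale=1.0, base=rbf):
    """Outer Gaussian kernel applied to the induced distance (assumed form)."""
    return math.exp(-squared_distance(A, B, base) / (2.0 * outer_lengthscale ** 2))


# Two small finite sets of 2-D points (hypothetical data).
A = [(0.0, 0.0), (1.0, 0.0)]
B = [(0.0, 1.0)]
print(set_kernel(A, A), set_kernel(A, B))
```

With these choices a set compared with itself gives a kernel value of 1, and the value decays as the induced distance between two sets grows.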

double sum kernel, kernel, optimization, (13 more...)

arXiv.org Machine Learning

1910.04086

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Switzerland > Bern > Bern (0.04)
Europe > France (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
(2 more...)
