AITopics | Search

Collaborating Authors

Search

"Search is a problem-solving technique that systematically explores a space of problem states, i.e., successive and alternative stages in the problem-solving process. Examples of problem states might include the different board configurations in a game or intermediate steps in a reasoning process. This space of alternative solutions is then searched to find an answer. Newell and Simon (1976) have argued that this is the essential basis of human problem solving. Indeed, when a chess player examines the effects of different moves or a doctor considers a number of alternative diagnoses, they are searching among alternatives."
– from Section 1.2 of Chapter One of George F. Luger's textbook, Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 5th Edition (Addison-Wesley; 2005).

News Overviews Instructional Materials AI-Alerts Classics

Why a robot that can 'solve' Rubik's Cube one-handed has the AI community at war

#artificialintelligenceOct-24-2019, 14:59:03 GMT

OpenAI, a non-profit co-founded by Elon Musk, recently unveiled its newest trick: A robot hand that can'solve' Rubik's Cube. Whether this is a feat of science or mere prestidigitation is a matter of some debate in the AI community right now. In case you missed it, OpenAI posted an article on its blog last week titled "Solving Rubik's Cube With a Robot Hand." Based on this title, you'd be forgiven if you thought the research discussed in said article was about solving Rubik's Cube with a robot hand. Don't get me wrong, OpenAI created a software and machine learning pipeline by which a robot hand can physically manipulate a Rubik's Cube from an'unsolved' state to a solved one. But the truly impressive bit here is that a robot hand can hold an object and move it around (to accomplish a goal) without dropping it.

openai, robot hand, rubik, (14 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(3 more...)

Add feedback

This robot can now solve a Rubik's cube with one hand

#artificialintelligenceOct-24-2019, 06:11:10 GMT

Once again, a robot can do something I cannot do. Researchers at the artificial intelligence lab OpenAI just revealed that its humanoid robotic hand can solve a Rubik's cube. The researchers utilized a pair of neural networks to make it happen. The team has been working on this project, named Dactyl, since the middle of 2017, and they felt showing their robotic hand could solve a Rubik's cube would show it had adequate dexterity. It can now solve the cube about 60 percent of the time.

cube, robot, rubik, (7 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.90)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.90)
(3 more...)

Add feedback

OpenAI's AI-powered robot learned how to solve a Rubik's cube one-handed

#artificialintelligenceOct-23-2019, 07:34:05 GMT

Artificial intelligence research organization OpenAI has achieved a new milestone in its quest to build general purpose, self-learning robots. The group's robotics division says Dactyl, its humanoid robotic hand first developed last year, has learned to solve a Rubik's cube one-handed. OpenAI sees the feat as a leap forward both for the dexterity of robotic appendages and its own AI software, which allows Dactyl to learn new tasks using virtual simulations before it is presented with a real, physical challenge to overcome. In a demonstration video showcasing Dactyl's new talent, we can see the robotic hand fumble its way toward a complete cube solve with clumsy yet accurate maneuvers. It takes many minutes, but Dactyl is eventually able to solve the puzzle.

dactyl, openai, robot, (15 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots > Robots in the Workplace (0.92)
Information Technology > Artificial Intelligence > Robots > Manipulation (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
(4 more...)

Add feedback

Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition

Chen, Lin, Yu, Qian, Lawrence, Hannah, Karbasi, Amin

arXiv.org Machine LearningOct-23-2019

We study the problem of switching-constrained online convex optimization (OCO), where the player has a limited number of opportunities to change her action. While the discrete analog of this online learning task has been studied extensively, previous work in the continuous setting has neither established the minimax rate nor algorithmically achieved it. We here show that $ T $-round switching-constrained OCO with fewer than $ K $ switches has a minimax regret of $ \Theta(\frac{T}{\sqrt{K}}) $. In particular, it is at least $ \frac{T}{\sqrt{2K}} $ for one dimension and at least $ \frac{T}{\sqrt{K}} $ for higher dimensions. The lower bound in higher dimensions is attained by an orthogonal subspace argument. The minimax analysis in one dimension is more involved. To establish the one-dimensional result, we introduce the fugal game relaxation, whose minimax regret lower bounds that of switching-constrained OCO. We show that the minimax regret of the fugal game is at least $ \frac{T}{\sqrt{2K}} $ and thereby establish the minimax lower bound in one dimension. We next show that a mini-batching algorithm provides an $ O(\frac{T}{\sqrt{K}}) $ upper bound, and therefore we conclude that the minimax regret of switching-constrained OCO is $ \Theta(\frac{T}{\sqrt{K}}) $ for any $K$. This is in sharp contrast to its discrete counterpart, the switching-constrained prediction-from-experts problem, which exhibits a phase transition in minimax regret between the low-switching and high-switching regimes. In the case of bandit feedback, we first determine a novel linear (in $T$) minimax regret for bandit linear optimization against the strongly adaptive adversary of OCO, implying that a slightly weaker adversary is appropriate. We also establish the minimax regret of switching-constrained bandit convex optimization in dimension $n>2$ to be $\tilde{\Theta}(\frac{T}{\sqrt{K}})$.

adversary, minimax regret, oco, (17 more...)

arXiv.org Machine Learning

1910.10873

Country:

North America > United States > California (0.14)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Czechia > Prague (0.04)

Genre: Research Report (0.49)

Industry:

Leisure & Entertainment (0.46)
Education > Educational Setting (0.34)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)

Add feedback

Efficient Decoupled Neural Architecture Search by Structure and Operation Sampling

Lee, Heung-Chang, Kim, Do-Guk, Han, Bohyung

arXiv.org Machine LearningOct-23-2019

We propose a novel neural architecture search algorithm via reinforcement learning by decoupling structure and operation search processes. Our approach samples candidate models from the multinomial distribution on the policy vectors defined on the two search spaces independently. The proposed technique improves the efficiency of architecture search process significantly compared to the conventional methods based on reinforcement learning with the RNN controllers while achieving competitive accuracy and model size in target tasks. Our policy vectors are easily interpretable throughout the training procedure, which allows to analyze the search progress and the discovered architectures; the black-box characteristics of the RNN controllers hamper understanding training progress in terms of policy parameter updates. Our experiments demonstrate outstanding performance compared to the state-of-the-art methods with a fraction of search cost.

architecture, architecture search, policy vector, (14 more...)

arXiv.org Machine Learning

1910.10397

Country: Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Auto-Model: Utilizing Research Papers and HPO Techniques to Deal with the CASH problem

Wang, Chunnan, Wang, Hongzhi, Mu, Tianyu, Li, Jianzhong, Gao, Hong

arXiv.org Artificial IntelligenceOct-23-2019

Auto-Model: Utilizing Research Papers and HPO Techniques to Deal with the CASH problem Chunnan Wang, Hongzhi Wang, Tianyu Mu, Jianzhong Li, Hong Gao Department of Computer Science Harbin Institute of T echnology Harbin, China {WangChunnan, wangzh, mutianyu, lijzh, honggao }@hit.edu.cn Abstract --In many fields, a mass of algorithms with completely different hyperparameters have been developed to address the same type of problems. Choosing the algorithm and hyperpa-rameter setting correctly can promote the overall performance greatly, but users often fail to do so due to the absence of knowledge. How to help users to effectively and quickly select the suitable algorithm and hyperparameter settings for the given task instance is an important research topic nowadays, which is known as the CASH problem. In this paper, we design the Auto-Model approach, which makes full use of known information in the related research paper and introduces hyperparameter optimization techniques, to solve the CASH problem effectively. Auto-Model tremendously reduces the cost of algorithm implementations and hyperparameter configuration space, and thus capable of dealing with the CASH problem efficiently and easily. T o demonstrate the benefit of Auto-Model, we compare it with classical Auto-Weka approach. The experimental results show that our proposed approach can provide superior results and achieves better performance in a short time. Index T erms--Algorithm selection, Hyperparameter optimization, Combined algorithm selection and hyperparameter optimization problem, Auto-Weka, Classification algorithms I. I NTRODUCTION In many fields, such as machine learning, data mining, artificial intelligence and constraint satisfaction, a variety of algorithms and heuristics have been developed to address the same type of problem [1], [2]. Each of these algorithms has its own advantages and disadvantages, and often they are complementary in the sense that one algorithm works well when others fail and vice versa [2]. If we are capable of selecting the algorithm and hyperparameter setting best suited to the task instance, any particular task instance will be well solved, and our ability of dealing with the problem will be improved considerably [3]. However, it is not trivial to achieve this goal. There are a mass of powerful and different algorithms to deal with a certain problem, and these algorithms have completely different hyperparameters, which have great effect on their performance. Even domain experts cannot easily and correctly select the appropriate algorithm with corresponding optimal hyperparameters from such a huge and complex choice space.

algorithm, classification algorithm, crelation, (17 more...)

arXiv.org Artificial Intelligence

1910.10902

Country:

Asia > China > Heilongjiang Province > Harbin (0.44)
South America > Argentina > Patagonia > Río Negro Province > Viedma (0.04)
North America > United States > Nevada (0.04)
(14 more...)

Genre: Research Report > New Finding (0.48)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)

Add feedback

Minimax Rate Optimal Adaptive Nearest Neighbor Classification and Regression

Zhao, Puning, Lai, Lifeng

arXiv.org Machine LearningOct-22-2019

For both classification and regression problems, existing works have shown that, if the distribution of the feature vector has bounded support and the probability density function is bounded away from zero in its support, the convergence rate of the standard kNN method, in which k is the same for all test samples, is minimax optimal. On the contrary, if the distribution has unbounded support, we show that there is a gap between the convergence rate achieved by the standard kNN method and the minimax bound. To close this gap, we propose an adaptive kNN method, in which different k is selected for different samples. Our selection rule does not require precise knowledge of the underlying distribution of features. The new proposed method significantly outperforms the standard one. We characterize the convergence rate of the proposed adaptive method, and show that it matches the minimax lower bound.

assumption 1, convergence rate, regression, (14 more...)

arXiv.org Machine Learning

1910.10513

Country:

North America > United States > California > Yolo County > Davis (0.14)
Europe > France > Île-de-France > Paris > Paris (0.04)
North America > Canada > Quebec > Montreal (0.04)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback

New Robot Can Solve a Rubik's Cube with Just One Hand Lwin Htut Kyaw Digital Creator Mandalay Myanmar

#artificialintelligenceOct-20-2019, 19:54:34 GMT

OpenAI has come up with a new robot capable of solving a Rubik's Cube with a single hand. The AI-based company trained neural networks in simulation using reinforcement learning to make this achievement possible. The company has been working on this project since May 2017 and has now achieved its goal marking this as a milestone towards its progress in the field of AI. The time taken by the robotic hand varies depending on how the cube is shuffled but on average, it takes about four minutes to solve the puzzle. However, it is worth noting that this is not the first-ever robot that managed to solve the Rubik's cube.

kyaw digital creator mandalay myanmar, openai, rubik, (5 more...)

#artificialintelligence

Country: Asia > Myanmar > Mandalay Region > Mandalay (0.40)

Industry: Leisure & Entertainment > Games > Rubik's Cube (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.92)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.42)

Add feedback

This Week In AI: Algolia Raises $110M, OpenAI Debuts Rubik's Cube Solving Bot, Kleiner Perkins Backs Cell Therapy Startup - CB Insights Research

#artificialintelligenceOct-20-2019, 07:35:51 GMT

Medical data analytics startup Healx raised $56M from Atomico and others. Standard Cognition patented an inventory management system. Here's what went down in artificial intelligence this week. Become a CB Insights customer. If you're already a customer, log in here.

cb insight research, openai debut rubik, perkin back cell therapy startup, (4 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Rubik's Cube (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
(2 more...)

Add feedback

Research Guide: Data Augmentation for Deep Learning

#artificialintelligenceOct-19-2019, 06:01:25 GMT

AutoAugment is an augmentation strategy that employs a search algorithm to find an augmentation policy that will yield the best results on the model. Each policy has several sub-policies. One sub-policy is randomly chosen for each image. Each sub-policy consists of an image processing function and the probability that the functions are applied with. The image processing operations could be translation, shearing or rotation.

data augmentation, opération, search algorithm, (10 more...)

#artificialintelligence

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback