AITopics | Search

Search

"Search is a problem-solving technique that systematically explores a space of problem states, i.e., successive and alternative stages in the problem-solving process. Examples of problem states might include the different board configurations in a game or intermediate steps in a reasoning process. This space of alternative solutions is then searched to find an answer. Newell and Simon (1976) have argued that this is the essential basis of human problem solving. Indeed, when a chess player examines the effects of different moves or a doctor considers a number of alternative diagnoses, they are searching among alternatives."
– from Section 1.2 of Chapter One of George F. Luger's textbook, Artificial Intelligence: Structures and Strategies for Complex Problem Solving, 5th Edition (Addison-Wesley; 2005).
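As a small illustration of searching a space of problem states, the sketch below runs a breadth-first search over a toy puzzle. The puzzle (reach 10 from 1 using only "add one" and "double") and the names `breadth_first_search` and `toy_successors` are assumptions made for this example; they do not come from Luger's text.

```python
from collections import deque


def breadth_first_search(start, is_goal, successors):
    """Explore states level by level; return a shortest path or None."""
    frontier = deque([start])
    parent = {start: None}  # also serves as the set of visited states
    while frontier:
        state = frontier.popleft()
        if is_goal(state):
            # Walk the parent links back to the start to rebuild the path.
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1]
        for nxt in successors(state):
            if nxt not in parent:
                parent[nxt] = state
                frontier.append(nxt)
    return None


def toy_successors(n):
    """Hypothetical move generator: from n you may go to n + 1 or 2 * n."""
    return [n + 1, n * 2]


path = breadth_first_search(1, lambda n: n == 10, toy_successors)
print(path)
```

Each integer plays the role of a problem state, and the two moves generate the alternatives that the search considers at every step.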

Welcome BERT: Google's latest search algorithm to better understand natural language - Search Engine Land

#artificialintelligence | Nov-1-2019, 22:18:08 GMT

Google is making the largest change to its search system since the company introduced RankBrain almost five years ago. The company said this will impact 1 in 10 queries in terms of changing the results that rank for those queries. BERT started rolling out this week and will be fully live shortly. It is rolling out for English language queries now and will expand to other languages in the future.

bert, google, query, (12 more...)

#artificialintelligence

Country:

South America > Brazil (0.06)
North America > United States (0.05)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.40)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.40)

Generalized Mean Estimation in Monte-Carlo Tree Search

Dam, Tuan, Klink, Pascal, D'Eramo, Carlo, Peters, Jan, Pajarinen, Joni

arXiv.org Artificial Intelligence | Nov-1-2019

We consider Monte-Carlo Tree Search (MCTS) applied to Markov Decision Processes (MDPs) and Partially Observable MDPs (POMDPs), and the well-known Upper Confidence bound for Trees (UCT) algorithm. In UCT, a tree with nodes (states) and edges (actions) is incrementally built by the expansion of nodes, and the values of nodes are updated through a backup strategy based on the average value of child nodes. However, it has been shown that with enough samples the maximum operator yields more accurate node value estimates than averaging. Instead of settling for one of these value estimates, we go a step further proposing a novel backup strategy which uses the power mean operator, which computes a value between the average and maximum value. We call our new approach Power-UCT and argue how the use of the power mean operator helps to speed up the learning in MCTS. We theoretically analyze our method providing guarantees of convergence to the optimum. Moreover, we discuss a heuristic approach to balance the greediness of backups by tuning the power mean operator according to the number of visits to each node. Finally, we empirically demonstrate the effectiveness of our method in well-known MDP and POMDP benchmarks, showing significant improvement in performance and convergence speed w.r.t. UCT.
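The power mean operator mentioned in this abstract can be sketched in a few lines. This is only an illustration of the operator itself, not the authors' Power-UCT implementation: the function name `power_mean`, the use of visit counts as weights, and the sample numbers are assumptions made for the example. Values are assumed to be non-negative.

```python
def power_mean(values, weights, p):
    """Weighted power mean: (sum_i w_i * x_i**p / sum_i w_i) ** (1 / p).

    With p = 1 this is the weighted average; as p grows it moves
    toward the maximum of the values.
    """
    if p < 1:
        raise ValueError("this sketch only covers p >= 1")
    if any(v < 0 for v in values):
        raise ValueError("values are assumed to be non-negative")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")
    return (sum(w * v ** p for v, w in zip(values, weights)) / total) ** (1.0 / p)


# Hypothetical child value estimates and visit counts for one node.
child_values = [0.2, 0.5, 0.9]
child_visits = [10, 5, 1]

for p in (1, 2, 8, 100):
    print(p, power_mean(child_values, child_visits, p))
```

Raising `p` makes the backup greedier, which is the knob the abstract describes tuning between average-style and max-style value estimates.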

algorithm, node, power-uct, (15 more...)

arXiv.org Artificial Intelligence

1911.00384

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.76)

Learning Algorithmic Solutions to Symbolic Planning Tasks with a Neural Computer

Tanneberg, Daniel, Rueckert, Elmar, Peters, Jan

arXiv.org Artificial Intelligence | Oct-30-2019

A key feature of intelligent behavior is the ability to learn abstract strategies that transfer to unfamiliar problems. Therefore, we present a novel architecture, based on memory-augmented networks, that is inspired by the von Neumann and Harvard architectures of modern computers. This architecture enables the learning of abstract algorithmic solutions via Evolution Strategies in a reinforcement learning setting. Applied to Sokoban, sliding block puzzle and robotic manipulation tasks, we show that the architecture can learn algorithmic solutions with strong generalization and abstraction: scaling to arbitrary task configurations and complexities, and being independent of both the data representation and the task domain.

algorithmic solution, architecture, module, (15 more...)

arXiv.org Artificial Intelligence

1911.00926

Country: Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Bayesian Optimization with Unknown Search Space

Ha, Huong, Rana, Santu, Gupta, Sunil, Nguyen, Thanh, Tran-The, Hung, Venkatesh, Svetha

arXiv.org Machine Learning | Oct-29-2019

Applying Bayesian optimization to problems in which the search space is unknown is challenging. To address this problem, we propose a systematic volume expansion strategy for Bayesian optimization. We devise a strategy to guarantee that, in iterative expansions of the search space, our method can find a point whose function value is within epsilon of the objective function maximum. Without the need to specify any parameters, our algorithm automatically and iteratively triggers the minimal expansion required. We derive analytic expressions for when to trigger the expansion and by how much to expand. We also provide theoretical analysis to show that our method achieves epsilon-accuracy after a finite number of iterations. We demonstrate our method on both benchmark test functions and machine learning hyper-parameter tuning tasks and show that it outperforms baselines.

acquisition function, algorithm, search space, (15 more...)

arXiv.org Machine Learning

1910.13092

Country:

Oceania > Australia (0.14)
North America > United States (0.04)
North America > Canada (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.87)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.87)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

GLIMPS: A Greedy Mixed Integer Approach for Super Robust Matched Subspace Detection

Rahman, Md Mahfuzur, Pimentel-Alarcon, Daniel

arXiv.org Machine Learning | Oct-29-2019

Due to the diverse nature of data acquisition and modern applications, many contemporary problems involve a high-dimensional datum $\mathbf{x} \in \mathbb{R}^{d}$ whose entries often lie in a union of subspaces, and the goal is to find out which entries of $\mathbf{x}$ match a particular subspace $\mathcal{U}$, classically called matched subspace detection. Consequently, entries that match one subspace are considered inliers w.r.t. that subspace, while all other entries are considered outliers. The proportion of outliers relative to each subspace varies based on the degree of coordinates from subspaces. This problem is combinatorial and NP-hard in nature and has been studied extensively in recent years. Existing approaches can solve the problem when outliers are sparse. However, if outliers are abundant, or in other words if $\mathbf{x}$ contains coordinates from a fair number of subspaces, this problem can't be solved with acceptable accuracy or within a reasonable amount of time. This paper proposes a two-stage approach called Greedy Linear Integer Mixed Programmed Selector (GLIMPS) for this abundant-outliers setting, which combines a greedy algorithm and a mixed integer formulation and can tolerate over 80% outliers, outperforming the state of the art.

algorithm, outlier, subspace, (15 more...)

arXiv.org Machine Learning

1910.13089

Country: North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Data Science (0.94)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

A robot hand taught itself to solve a Rubik's Cube after creating its own training regime

#artificialintelligence | Oct-27-2019, 16:07:54 GMT

Over a year ago, OpenAI, the San Francisco–based for-profit AI research lab, announced that it had trained a robotic hand to manipulate a cube with remarkable dexterity. That might not sound earth-shattering. But in the AI world, it was impressive for two reasons. First, the hand had taught itself how to fidget with the cube using a reinforcement-learning algorithm, a technique modeled on the way animals learn. Second, all the training had been done in simulation, but it managed to successfully translate to the real world.

algorithm, robot, rubik, (14 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.25)
North America > United States > Michigan (0.05)

Industry: Leisure & Entertainment > Games > Rubik's Cube (0.89)

Technology:

Information Technology > Artificial Intelligence > Robots > Manipulation (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.44)
(2 more...)

Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment

Yasuda, Yusuke, Wang, Xin, Yamagishi, Junichi

arXiv.org Machine Learning | Oct-27-2019

Sequence-to-sequence text-to-speech (TTS) is dominated by soft-attention-based methods. Recently, hard-attention-based methods have been proposed to prevent fatal alignment errors, but their sampling method of discrete alignment is poorly investigated. This research investigates various combinations of sampling methods and probability distributions for alignment transition modeling in a hard-alignment-based sequence-to-sequence TTS method called SSNT-TTS. We clarify the common sampling methods of discrete variables including greedy search, beam search, and random sampling from a Bernoulli distribution in a more general way. Furthermore, we introduce the binary Concrete distribution to model discrete variables more properly. The results of a listening test show that deterministic search is preferable to stochastic search, and the binary Concrete distribution is robust with stochastic search for natural alignment transition.
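To make the contrast between deterministic and stochastic choices of a binary variable concrete, here is a minimal sketch. It is not the SSNT-TTS code: the function names, the single-logit setup, and the temperature value are assumptions for illustration. The binary Concrete sample follows the standard construction of adding logistic noise to the logit and applying a tempered sigmoid.

```python
import math
import random


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def greedy_decision(logit):
    """Deterministic choice: pick 1 when its probability exceeds 0.5."""
    return 1 if logit > 0 else 0


def bernoulli_sample(logit, rng):
    """Stochastic choice: draw 1 with probability sigmoid(logit)."""
    return 1 if rng.random() < sigmoid(logit) else 0


def binary_concrete_sample(logit, temperature, rng):
    """Relaxed sample in (0, 1): sigmoid((logit + logistic noise) / temperature)."""
    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)  # keep the logs finite
    noise = math.log(u) - math.log(1.0 - u)  # logistic noise
    return sigmoid((logit + noise) / temperature)


rng = random.Random(0)
logit = 0.8  # hypothetical transition logit
print(greedy_decision(logit))
print([bernoulli_sample(logit, rng) for _ in range(10)])
print([round(binary_concrete_sample(logit, 0.5, rng), 3) for _ in range(5)])
```

Thresholding a binary Concrete sample at 0.5 yields a hard 0/1 decision with the same probability as the Bernoulli draw, while the temperature controls how close the relaxed samples sit to 0 or 1.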

alignment, concrete distribution, logistic noise, (12 more...)

arXiv.org Machine Learning

1910.12383

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Synthesis (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.75)

Deep Reinforcement Learning in HOL4

Gauthier, Thibault

arXiv.org Artificial Intelligence | Oct-25-2019

The paper describes an implementation of deep reinforcement learning through self-supervised learning within the proof assistant HOL4. A close interaction between the machine learning modules and the HOL4 library is achieved by the choice of tree neural networks (TNNs) as machine learning models and the internal use of HOL4 terms to represent tree structures of TNNs. Recursive improvement is possible when a given task is expressed as a search problem. In this case, a Monte Carlo Tree Search (MCTS) algorithm guided by a TNN can be used to explore the search space and produce better examples for training the next TNN. As an illustration, tasks over propositional and arithmetical terms, representative of fundamental theorem proving techniques, are specified and learned: truth estimation, end-to-end computation, term rewriting and term synthesis.

algorithm, deep reinforcement learning, tnn, (13 more...)

arXiv.org Artificial Intelligence

1910.11797

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Czechia > Prague (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(9 more...)

Genre:

Research Report (0.50)
Overview (0.46)
Instructional Material > Course Syllabus & Notes (0.34)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Rubik's Cube owner loses EU trademark for iconic puzzle's shape

FOX News | Oct-24-2019, 18:37:18 GMT

The owner of the Rubik's Cube has lost an appeal to regain the European Union trademark rights to the classic puzzle's iconic shape in a new twist to the ongoing legal drama. Rubik's Brand Ltd. lost the protection rights to the puzzle's shape in 2017, after the EU's top court ruled that law prevents the firm from having "a monopoly on technical solutions or functional characteristics of a product," Bloomberg reported. The EU General Court in Luxembourg upheld that decision on Thursday.

puzzle, rubik, trademark, (6 more...)

FOX News

Country:

North America > United States > New York (0.20)
Europe > Hungary > Budapest > Budapest (0.07)

Industry:

Leisure & Entertainment > Games > Rubik's Cube (1.00)
Law > Intellectual Property & Technology Law (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.66)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.66)


Google Lifts Veil, a Little, Into Secretive Search Algorithm Changes
