AITopics

arXiv.org Artificial Intelligence May-6-2025

RNA design consists of discovering a nucleotide sequence that folds into a target secondary structure. It is useful for synthetic biology, medicine, and nanotechnology. We propose Montparnasse, a Multi Objective Generalized Nested Rollout Policy Adaptation with Limited Repetition (MOGNRPALR) RNA design algorithm. It solves the Eterna benchmark.

artificial intelligence, evaluation, machine learning

2505.0211

Genre: Research Report (0.40)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Driss, Brahim, Arjonilla, Jérôme, Wang, Hui, Saffidine, Abdallah, Cazenave, Tristan

Deep Reinforcement Learning for 5*5 Multiplayer Go

arXiv.org Artificial Intelligence May-23-2024

In recent years, much progress has been made in computer Go and most of the results have been obtained thanks to search algorithms (Monte Carlo Tree Search) and Deep Reinforcement Learning (DRL). In this paper, we propose to use and analyze the latest algorithms that use search and DRL (AlphaZero and Descent algorithms) to automatically learn to play an extended version of the game of Go with more than two players. We show that using search and DRL we were able to improve the level of play, even though there are more than two players.

algorithm, alphazero, deep reinforcement learning

doi: 10.1007/978-3-031-30229-9_48

2405.14265

Country:

South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)

Genre: Research Report (0.65)

Industry: Leisure & Entertainment > Games > Go (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms

arXiv.org Artificial Intelligence Apr-14-2024

Monte Carlo Tree Search and Monte Carlo Search have good results for many combinatorial problems. In this paper we propose to use Monte Carlo Search to design mathematical expressions that are used as exploration terms for Monte Carlo Tree Search algorithms. The optimized Monte Carlo Tree Search algorithms are PUCT and SHUSS. We automatically design the PUCT and the SHUSS root exploration terms. For small search budgets of 32 evaluations the discovered root exploration terms make both algorithms competitive with usual PUCT.

algorithm, exploration term, expression

2404.09304

Country:

South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games > Go (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Sagri, Amani, Cazenave, Tristan, Arjonilla, Jérôme, Saffidine, Abdallah

Vision Transformers for Computer Go

arXiv.org Artificial Intelligence Sep-22-2023

Motivated by the success of transformers in various fields, such as language understanding and image analysis, this investigation explores their application in the context of the game of Go. In particular, our study focuses on the analysis of the Transformer in Vision. Through a detailed analysis of numerous points such as prediction accuracy, win rates, memory, speed, size, or even learning rate, we have been able to highlight the substantial role that transformers can play in the game of Go. This study was carried out by comparing them to the usual Residual Networks.

residual, residual network, transformer

2309.12675

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Leisure & Entertainment > Games > Go (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Games > Go (1.00)

Nested Search versus Limited Discrepancy Search

arXiv.org Artificial Intelligence Oct-1-2022

Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions. Nested Search (NS) is another algorithm to search a state space with the same heuristic. NS spends more time on the move associated with the best heuristic playout, while LDS spends more time on the best heuristic move. Both take similar time for the same level of search. We argue in this paper that it is often better to follow the best heuristic playout, as in NS, than to follow the heuristic, as in LDS.

algorithm, artificial intelligence, machine learning

2210.00216

Country:

Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > Austria (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Roucairol, Milo, Cazenave, Tristan

Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

arXiv.org Artificial Intelligence Aug-3-2022

We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Monte Carlo Search (NMCS) and Nested Rollout Policy Adaptation (NRPA), can be used to build graphs and find counter-examples to spectral graph theory conjectures in minutes.

algorithm, conjecture, graph

2207.03343

Country: North America > United States > Hawaii > Honolulu County > Honolulu (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.71)

AAAI Conferences Feb-8-2022, 11:40:58 GMT

Cazenave

Monte Carlo Tree Search (MCTS) is the state-of-the-art algorithm for many games, including the game of Go and General Game Playing (GGP). The standard algorithm for MCTS is Upper Confidence bounds applied to Trees (UCT). For games such as Go, a big improvement over UCT is the Rapid Action Value Estimation (RAVE) heuristic. We propose to generalize the RAVE heuristic so as to have more accurate estimates near the leaves. We test the resulting algorithm, named GRAVE, on Atarigo, Knightthrough, Domineering and Go.

algorithm, cazenave

Industry: Leisure & Entertainment > Games (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.76)

Sentuc, Julien, Cazenave, Tristan, Lucas, Jean-Yves

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

arXiv.org Artificial Intelligence Dec-29-2021

In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instance set, which is a conventional benchmark for the Capacitated Vehicle Routing Problem with Time Windows (CVRPTW). We show that on all instances, GNRPA performs better than NRPA. On some instances, it performs better than the Google OR-Tools module dedicated to VRP.

algorithm, generalized nested rollout policy adaptation, vehicle

2111.06928

Country:

Europe > France (0.04)
North America > United States > Florida > Miami-Dade County > Miami Beach (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Freight & Logistics Services (0.94)
Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.95)

Improving Model and Search for Computer Go

arXiv.org Artificial Intelligence Feb-5-2021

The standard for Deep Reinforcement Learning in games, following AlphaZero, is to use residual networks and to increase the depth of the network to get better results. We propose to improve mobile networks as an alternative to residual networks and experimentally show the playing strength of the networks according to both their width and their depth. We also propose a generalization of the PUCT search algorithm that improves on PUCT.

accuracy, model and search, residual network

2102.03467

Country:

Europe > France (0.14)
Europe > Italy > Piedmont > Turin Province > Turin (0.04)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games > Go (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Games > Go (1.00)

Cazenave, Tristan, Sevestre, Jean-Baptiste, Toulemont, Matthieu

Stabilized Nested Rollout Policy Adaptation

arXiv.org Artificial Intelligence Jan-10-2021

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to modify NRPA in order to improve the stability of the algorithm. Experiments show it improves the algorithm for different application domains: SameGame, Traveling Salesman with Time Windows and Expression Discovery.

algorithm, nrpa, sequence

2101.03563

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Arizona > Maricopa County > Phoenix (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.50)

Industry:

Transportation (0.47)
Leisure & Entertainment > Games (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.67)