Improving continuous Monte Carlo Tree Search

Monte-Carlo Tree Search (MCTS) is largely responsible for the improvement not only of many computer games, including Go and General Game Playing (GPP), but also of real-world continuous Markov decision process problems. MCTS initially uses the Upper Confidence bounds applied to Trees (UCT), but the Rapid Action Value Estimation (RAVE) heuristic has rapidly taken over in the discrete and continuous domains. Recently, generalized RAVE (GRAVE) outperformed such heuristics in the discrete domain. This paper is concerned with extending the GRAVE heuristic to continuous action and state spaces. To enhance its performances, we suggest an action decomposition strategy to break down multidimensional actions into multiple unidimensional actions, and we propose a selective policy based on constraints that can be used to bias the playouts and in the tree to select promising actions. The approach is experimentally validated on a real-world biological problem.

Mots clés

Monte Carlo Tree Search continuous Markov Decision Process hybrid Gene Regulatory Network

Domaines

Intelligence artificielle [cs.AI] Bio-Informatique, Biologie Systémique [q-bio.QM]

Fichier principal

Improving_continuous_MCTS_hGRN.pdf (945.14 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Denis Pallez : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04557914

Soumis le : mercredi 17 juillet 2024-14:16:10

Dernière modification le : jeudi 18 juillet 2024-08:41:45

Dates et versions

hal-04557914 , version 1 (17-07-2024)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

HAL Id : hal-04557914 , version 1

Citer

Romain Michelucci, Denis Pallez, Tristan Cazenave, Jean-Paul Comet. Improving continuous Monte Carlo Tree Search. Parallel Problem Solving From Nature, Sep 2024, Hagenberg Castle, Austria. ⟨hal-04557914⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-DAUPHINE I3S LAMSADE-DAUPHINE PSL UNIV-COTEDAZUR

18 Consultations

0 Téléchargements