Improving continuous Monte Carlo Tree Search - Université Côte d'Azur
Conference paper. Year: 2024

Improving continuous Monte Carlo Tree Search

Abstract

Monte Carlo Tree Search (MCTS) is largely responsible for progress not only in many computer games, including Go and General Game Playing (GGP), but also in real-world continuous Markov decision process problems. MCTS originally relied on Upper Confidence bounds applied to Trees (UCT), but the Rapid Action Value Estimation (RAVE) heuristic rapidly took over in both the discrete and continuous domains. More recently, Generalized RAVE (GRAVE) outperformed these heuristics in the discrete domain. This paper extends the GRAVE heuristic to continuous action and state spaces. To enhance its performance, we suggest an action decomposition strategy that breaks multidimensional actions down into multiple unidimensional actions, and we propose a selective policy based on constraints that can be used both to bias the playouts and, in the tree, to select promising actions. The approach is experimentally validated on a real-world biological problem.
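As a rough illustration of two ideas named in the abstract, the sketch below shows the standard UCB1 score that UCT uses to pick a child node, together with one possible way to split a multidimensional action into unidimensional moves. It is not taken from the paper: the function names, the (total_reward, visits) representation of a child, and the (dimension, value) encoding of a move are all assumptions made for this example, and the RAVE/GRAVE statistics and the constraint-based selective policy are not covered.

```python
import math


def uct_score(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """UCB1 score used by UCT: mean reward plus an exploration bonus."""
    if visits == 0:
        return math.inf  # unvisited children are tried first
    mean = total_reward / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)


def select_child(children, c=math.sqrt(2)):
    """Return the index of the child with the highest UCT score.

    `children` is a list of (total_reward, visits) pairs (assumed layout).
    The parent visit count is taken as the sum of the children's visits.
    """
    parent_visits = sum(visits for _, visits in children)
    scores = [uct_score(r, v, parent_visits, c) for r, v in children]
    return max(range(len(children)), key=scores.__getitem__)


def decompose_action(action):
    """Split one d-dimensional action into d unidimensional moves.

    Each move is a hypothetical (dimension, value) pair, so a tree can
    choose one coordinate per level instead of a full vector at once.
    """
    return [(dim, value) for dim, value in enumerate(action)]


def recompose_action(moves):
    """Rebuild the multidimensional action from its unidimensional moves."""
    return tuple(value for _, value in sorted(moves))
```

With this layout, `select_child([(5.0, 10), (0.0, 0)])` picks the unvisited second child, and `recompose_action(decompose_action(a))` returns the original action `a` as a tuple. Choosing one coordinate per tree level keeps the branching at each node one-dimensional, at the cost of a deeper tree.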
Main file
Improving_continuous_MCTS_hGRN.pdf (945.14 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04557914, version 1 (17-07-2024)

Identifiers

  • HAL Id: hal-04557914, version 1

Cite

Romain Michelucci, Denis Pallez, Tristan Cazenave, Jean-Paul Comet. Improving continuous Monte Carlo Tree Search. Parallel Problem Solving From Nature, Sep 2024, Hagenberg Castle, Austria. ⟨hal-04557914⟩