GPI-tree search : algorithms for decision-time planning with the general policy improvement theorem

Bron
Neural computing and applications - ISSN 0941-0643-37:23 (2025) p. 18989-19007