Structure and Randomness in Planning and Reinforcement Learning - Publication - Bridge of Knowledge

Search

Structure and Randomness in Planning and Reinforcement Learning

Abstract

Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random shooting. It also lets the user control the bias-variance trade-off, akin to TD(n), but in the tree search context. In experiments on challenging domains, we show that STS can get the best of both worlds consistently achieving higher scores.

Citations

  • 0

    CrossRef

  • 0

    Web of Science

  • 1

    Scopus

Authors (5)

  • Photo of magister Konrad Czechowski

    Konrad Czechowski magister

    • Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
  • Photo of magister Piotr Kozakowski

    Piotr Kozakowski magister

    • Faculty of Mathematics, Informatics and Mechanics, University of Warsaw
  • Photo of doktor Łukasz Kuciński

    Łukasz Kuciński doktor

    • Institute of Mathematics, Polish Academy of Science
  • Photo of doktor habilitowany Piotr Miłoś

    Piotr Miłoś doktor habilitowany

    • Institute of Mathematics, Polish Academy of Science

Cite as

Full text

full text is not available in portal

Keywords

Details

Category:
Conference activity
Type:
materiały konferencyjne indeksowane w Web of Science
Language:
English
Publication year:
2021
Bibliographic description:
Czechowski K., Januszewski P., Kozakowski P., Kuciński Ł., Miłoś P..: Structure and Randomness in Planning and Reinforcement Learning, W: , 2021, ,.
DOI:
Digital Object Identifier (open in new tab) 10.1109/ijcnn52387.2021.9533317
Sources of funding:
  • Polish National Science Center grant UMO-2017/26/E/ST6/00622
Verified by:
Gdańsk University of Technology

seen 102 times

Recommended for you

Meta Tags