Piotr Januszewski

Zatrudnienie

Brak danych

Słowa kluczowe Pomoc

Kontakt dla biznesu

Centrum Transferu Wiedzy i Technologii

Lokalizacja: Al. Zwycięstwa 27, 80-219 Gdańsk
Telefon: +48 58 348 62 62
E-mail: biznes@pg.edu.pl

Media społecznościowe

Kontakt

E-mail: piotr.januszewski@pg.edu.pl

Wybrane publikacje

Structure and Randomness in Planning and Reinforcement Learning
- K. Czechowski
- P. Januszewski
- P. Kozakowski
- Ł. Kuciński
- P. Miłoś
- Rok 2021
Planning in large state spaces inevitably needs to balance the depth and breadth of the search. It has a crucial impact on the performance of a planner and most manage this interplay implicitly. We present a novel method \textit{Shoot Tree Search (STS)}, which makes it possible to control this trade-off more explicitly. Our algorithm can be understood as an interpolation between two celebrated search mechanisms: MCTS and random...

Pełny tekst do pobrania w serwisie zewnętrznym
Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits
- Rok 2024
The Contextual Multi-Armed Bandits (CMAB) framework is pivotal for learning to make decisions. However, due to challenges in deploying online algorithms, there is a shift towards offline policy learning, which relies on pre-existing datasets. This study examines the relationship between the quality of these datasets and the performance of offline policy learning algorithms, specifically, Neural Greedy and NeuraLCB. Our results...

Pełny tekst do pobrania w portalu
Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning
- P. Januszewski
- Rok 2022
My doctoral dissertation is intended as the compound of four publications considering: structure and randomness in planning and reinforcement learning, continuous control with ensemble deep deterministic policy gradients, toddler-inspired active representation learning, and large-scale deep reinforcement learning costs.

Pełny tekst do pobrania w serwisie zewnętrznym

wyświetlono 829 razy

Wyszukiwarka

Piotr Januszewski

Zatrudnienie

Słowa kluczowe Pomoc

Kontakt dla biznesu

Media społecznościowe

Kontakt

Wybrane publikacje

Structure and Randomness in Planning and Reinforcement Learning

Dataset Characteristics and Their Impact on Offline Policy Learning of Contextual Multi-Armed Bandits

Model-free and Model-based Reinforcement Learning, the Intersection of Learning and Planning