×
Jan 30, 2023 · In this paper, we propose the novel model-based EnVI method. In contrast to traditional dynamic programming methods, the proposed method uses a ...
To solve this unbalanced reward problem, in this paper, we propose a novel model-based reinforcement learning algorithm called the expected n-step value ...
A novel model-based reinforcement learning algorithm called the expected n-step value iteration (EnVI) is proposed, which uses a new return function that ...
People also ask
Oct 9, 2023 · This work proposes a novel reptile search algorithm (RSA) to solve optimization problems called reinforcement reptile search algorithm (RLRSA).
Sep 28, 2022 · Model-based reinforcement learning is like planning your actions by understanding the rules of a game, while model-free learns by trying things ...
Missing: novel unbalanced
A new multi-step state-action value algorithm is proposed to solve the problem of reward hacking by using a new return function, which alters the discount ...
This study integrates supervisory control theory and a model-based deep reinforcement learning method to synthesize a nonblocking coordinator for the modular ...
Missing: novel | Show results with:novel
Feb 9, 2019 · Reinforcement learning with appropriately designed reward signal could be used to solve many sequential learning problems.
Aug 7, 2024 · We introduce SafeDreamer, a safe model-based RL algorithm that utilizes safety-reward planning of world models and the Lagrangian methods to ...