RL
-
Multi-Arm Bandits
9/18/2024Note for Chapter 2 of RL: an introduction
-
Paper Notes - Diffusion Policies creating a Trust Region for Offline Reinforcement Learning
9/16/2024This paper proposed to train a one-step policy guided by a diffusion model via a trust region loss, enabling fast inference without iterative sampling.
-
Paper Notes - Efficient Diffusion Policies for Offline Reinforcement Learning
9/12/2024Notes for this paper.
-
Paper Notes - Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
9/10/2024Notes for this paper.
-
Paper Notes - Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
9/8/2024Notes for this paper.
-
Paper Notes - S2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
6/16/2024Notes for this paper.