June 07, 2024
Offline Reinforcement Learning without Regularization and Pessimism
Longyang Huang, Botao Dong, Ning Pang, et al.