Homepage of Yingru Li
Homepage of Yingru Li
Home
Posts
Talks
Publications
Teaching
Contact
RL-Seminar
Light
Dark
Automatic
Policy Optimization
Divergence-Augmented Policy Optimization
In deep reinforcement learning, policy optimization methods need to deal with issues such as function approximation and the reuse of off-policy data. Standard policy gradient methods do not handle off-policy data well, leading to premature …
Cite
×