Reinforcement Learning
Information Loss Bounded Policy Optimization
Transferring the KL-divergence constraint in policy search into a bounded penalty.