Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution
Chou, Po-Wei
and
Maturana, Daniel
and
Scherer, Sebastian A.
International Conference on Machine Learning - 2017 via Bibsonomy
Keywords:
dblp