Login
Deep Reinforcement Learning from Human Preferences.
1 Summary
Paul F. Christiano and
Jan Leike and
Tom Brown and
Miljan Martic and
Shane Legg and
Dario Amodei
Neural Information Processing Systems Conference - 2017
via Local dblp
Keywords:
ShortScience.org allows researchers to publish paper summaries that are voted on and ranked!
About
Sponsored by: