Deep RL Bootcamp Lecture 4B Policy Gradients Revisited

Posted 2020-11-02 ecoflex

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了Deep RL Bootcamp Lecture 4B Policy Gradients Revisited相关的知识，希望对你有一定的参考价值。

https://drive.google.com/file/d/0BxXI_RttTZAhTUpqUFdEZ3BXNFE/view

game of Pong is a MDP.

终于一睹AK真容了，很有想法，很幽默

http://karpathy.github.io/

以上是关于Deep RL Bootcamp Lecture 4B Policy Gradients Revisited的主要内容，如果未能解决你的问题，请参考以下文章

Deep RL Bootcamp Lecture 8 Derivative Free Methods

Deep RL Bootcamp Lecture 4B Policy Gradients Revisited

Deep RL Bootcamp Lecture 5: Natural Policy Gradients, TRPO, PPO

Deep RL Bootcamp Lecture 7: SVG, DDPG, and Stochastic Computation Graphs

Deep RL Bootcamp Lecture 2: Sampling-based Approximations and Function Fitting

Deep RL Bootcamp TAs Research Overview