To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le

Posted 2020-10-09 rsapaper

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le相关的知识，希望对你有一定的参考价值。

https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node26.html

【平均-打折奖励】

Schwartz [106] examined the problem of adapting Q-learning to an average-reward framework. Although his R-learning algorithm seems to exhibit convergence problems for some MDPs, several researchers have found the average-reward criterion closer to the true problem they wish to solve than a discounted criterion and therefore prefer R-learning to Q-learning [69].

以上是关于To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le的主要内容，如果未能解决你的问题，请参考以下文章

To be or not to be

1092. To Buy or Not to Buy (20)

1033 To Fill or Not to Fill

1092. To Buy or Not to Buy (20)