To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le
Posted rsapaper
https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node26.html
[Average reward vs. discounted reward]
Schwartz [106] examined the problem of adapting Q-learning to an average-reward framework. Although his R-learning algorithm seems to exhibit convergence problems for some MDPs, several researchers have found the average-reward criterion closer to the true problem they wish to solve than a discounted criterion and therefore prefer R-learning to Q-learning [69].
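To make the contrast concrete, here is a minimal tabular sketch of the two update rules. It is not taken from the quoted survey: the function names, the list-of-lists tables, and the step sizes `alpha`, `beta`, and `gamma` are all illustrative assumptions, and the R-learning rule follows one commonly cited formulation in which the average-reward estimate `rho` is adjusted only after greedy actions.

```python
# Hypothetical helper names; tables are lists indexed as table[state][action].

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Discounted criterion: move Q[s][a] toward r + gamma * max_a' Q[s'][a']."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def r_learning_update(R, rho, s, a, r, s_next, beta=0.1, alpha=0.01):
    """Average-reward criterion: no discount factor; rewards are measured
    relative to rho, a running estimate of the average reward per step.
    Returns the (possibly updated) rho."""
    # Check greediness before R[s][a] is modified.
    was_greedy = R[s][a] == max(R[s])
    delta = r - rho + max(R[s_next]) - R[s][a]
    R[s][a] += beta * delta
    # Assumption: rho is updated only when the chosen action was greedy,
    # so exploratory actions do not bias the average-reward estimate.
    # For a greedy action, R[s][a] equals max(R[s]), so delta is the
    # same error term that the rho update uses.
    if was_greedy:
        rho += alpha * delta
    return rho


if __name__ == "__main__":
    # Two states, two actions, all estimates starting at zero.
    Q = [[0.0, 0.0], [0.0, 0.0]]
    R = [[0.0, 0.0], [0.0, 0.0]]
    rho = 0.0
    q_learning_update(Q, s=0, a=1, r=1.0, s_next=1)
    rho = r_learning_update(R, rho, s=0, a=1, r=1.0, s_next=1)
    print(Q[0][1], R[0][1], rho)
```

The only structural differences are that R-learning drops `gamma` and subtracts `rho` from each reward, which is why it targets long-run average reward rather than a discounted sum.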