To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le

Posted rsapaper

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le相关的知识,希望对你有一定的参考价值。

 

 

https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node26.html

【平均-打折奖励】

Schwartz [106] examined the problem of adapting Q-learning to an average-reward framework. Although his R-learning algorithm seems to exhibit convergence problems for some MDPs, several researchers have found the average-reward criterion closer to the true problem they wish to solve than a discounted criterion and therefore prefer R-learning to Q-learning [69].

以上是关于To discount or not to discount in reinforcement learning: A case study comparing R learning and Q le的主要内容,如果未能解决你的问题,请参考以下文章

To be or not to be

1092. To Buy or Not to Buy (20)

1033 To Fill or Not to Fill

1033 To Fill or Not to Fill

1092. To Buy or Not to Buy (20)

1092. To Buy or Not to Buy (20)