读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

Posted 2020-10-21 想想你应该干什么

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈相关的知识，希望对你有一定的参考价值。

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

贝叶斯博弈(Bayesian Games)

本文是Game Theory An Introduction (by Steven Tadelis) 的学习笔记。

不完整信息的静态博弈(Incomplete information static games)

不完整信息博弈意味着玩家之间缺乏共识(common knowledge)，具体指的是其它对手的行动集、结果集和收益函数等信息。
对不完整信息博弈的处理方法来自于Harsanyi。
他引进了两个概念来解决这个问题。
type space: 将对手隐藏的信息(行动集、结果集和收益函数等)转化为多个types，每个type中的信息都是可知的。
belief: 由于不知道对手的具体type是什么，因此使用分布概率表示对手选择某个type的可能性。
这样就可以通过概率统计来计算可能的收益。

静态不完整信息贝叶斯博弈(static Bayesian game of incomplete information)的normal-form描述

\\[\\left \\langle N, \\{ A_i \\}_{i=1}^n, \\{ \\Theta_i \\}_{i=1}^n, \\{ v_i(\\cdot; \\theta_i), \\theta_i \\in \\Theta_i \\}_{i=1}^n, \\{ \\phi_i \\}_{i=1}^n \\right \\rangle \\\\ where \\\\ N = \\{ 1,2,\\cdots, n\\} \\text{ : is the set of players} \\\\ A_i \\text{ : the action set of player i} \\\\ \\Theta_i \\text{ : the type space of player i} \\\\ v_i : A \\times \\Theta_i \\to \\mathbb{R} \\text{ : type dependent pay of function of player i} \\\\ \\phi \\text{ : the belief of player i with respect to the uncertainty over the other players\' types} \\\\ \\phi(\\theta_{-i} | \\theta_i) \\text{ : the posterior conditional distribution on } \\theta_{-i} \\]

静态不完整信息贝叶斯博弈处理流程：
1. 自然选择一个类型组合(profile of types)\\(\\theta_1, \\theta_2, \\cdots, \\theta_n\\)。
2. 每个玩家知道自己\\(\\theta_i\\)，使用先前的\\(\\phi_i\\)来形成对对手type的分布概率。
3. 玩家选择行动。
4. 根据玩家们的行动\\(a = (a_i, a_2, \\cdots, a_n)\\)，可以或者收益\\(v_i(a; \\theta)\\).
条件概率(conditional probability)
当事件S发生时，事件H发生的条件概率为：

\\[\\Pr{H|S} = \\frac{\\phi(S \\land H)}{\\phi(S)} \\]

静态不完整信息贝叶斯博弈 - 纯策略

\\[\\left \\langle N, \\{ A_i \\}_{i=1}^n, \\{ \\Theta_i \\}_{i=1}^n, \\{ v_i(\\cdot; \\theta_i), \\theta_i \\in \\Theta_i \\}_{i=1}^n, \\{ \\phi_i \\}_{i=1}^n \\right \\rangle \\\\ \\]

玩家i的一个纯策略\\(s_i(\\theta_i) \\to a_i\\)

静态不完整信息贝叶斯博弈 - 混合策略
玩家i的一个混合策略是一个在纯策略之上的概率分布。
静态不完整信息贝叶斯博弈 - 纯策略贝叶斯纳什均衡(pure-strategy Bayesian Nash equilibrium)
一个纯策略贝叶斯纳什均衡\\(s^* = (s_1^*, \\cdots, s_n^*)\\)，如果对于每个玩家i，每个玩家的类型\\(\\theta_i \\in \\Theta_i\\)，每个行动\\(a_i \\in A_i\\)，满足：

\\[\\sum_{\\theta_{-i} \\in \\Theta_{-i}} \\phi_i(\\theta_{-i}|\\theta_i) v_i(s_i^*(\\theta_i), s_{-i}^*(\\theta_{-i});\\theta_i) \\geq \\sum_{\\theta_{-i} \\in \\Theta_{-i}} \\phi_i(\\theta_{-i}|\\theta_i) v_i(a_i, s_{-i}^*(\\theta_{-i});\\theta_i) \\\\ where \\\\ v_i(a_i, s_{-i}^*(\\theta_{-i});\\theta_i) \\text{ : only on type } \\theta_i \\text{, the player i\'s payoff function} \\]

其含义：对于每个玩家，其行动\\(s_i^*(\\theta_i)\\)的分布概率收益总和总是最大的。

关于这章（甚至整本书），重要的是学会如何使用这些理论，书中提供了很好的示例。但这里就不介绍了。

参照

以上是关于读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈的主要内容，如果未能解决你的问题，请参考以下文章

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈 贝叶斯博弈

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈 贝叶斯博弈

贝叶斯博弈(Bayesian Games)

不完整信息的静态博弈(Incomplete information static games)

参照

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈

读书笔记: 博弈论导论 - 12 - 不完整信息的静态博弈贝叶斯博弈