Automated Design of Affine Maximizer Mechanisms in Dynamic Settings

Dynamic mechanism design is a challenging extension to ordinary mechanism design in which the mechanism designer must make a sequence of decisions over time in the face of possibly untruthful reports of participating agents. Optimizing dynamic mechanisms for welfare is relatively well understood. However, there has been less work on optimizing for other goals (e.g. revenue), and without restrictive assumptions on valuations, it is remarkably challenging to characterize good mechanisms. Instead, we turn to automated mechanism design to find mechanisms with good performance in specific problem instances. In fact, the situation is similar even in static mechanism design. However, in the static case, optimization/machine learning-based automated mechanism design techniques have been successful in finding high-revenue mechanisms in cases beyond the reach of analytical results. We extend the class of affine maximizer mechanisms to MDPs where agents may untruthfully report their rewards. This extension results in a challenging bilevel optimization problem in which the upper problem involves choosing optimal mechanism parameters, and the lower problem involves solving the resulting MDP. Our approach can find truthful dynamic mechanisms that achieve strong performance on goals other than welfare, and can be applied to essentially any problem setting-without restrictions on valuations-for which RL can learn optimal policies.

翻译：动态机制设计是普通机制设计的一个具有挑战性的扩展，其中机制设计者必须在一段时间内面对可能虚假报告参与代理人信息的情况下，进行一系列决策。优化福利导向的动态机制已较为清晰。然而，针对其他目标（如收益）的优化研究较少，且在没有对估值施加严格假设的情况下，刻画优质机制极为困难。为此，我们转向自动化机制设计，以在特定问题实例中寻找性能良好的机制。事实上，在静态机制设计中情况类似。然而，在静态场景中，基于优化/机器学习的自动化机制设计技术已成功在超出分析结果范围的情况下找到高收益机制。我们将仿射最大化机制类扩展到马尔可夫决策过程（MDP）中，其中代理人可能虚假报告其奖励。这一扩展引出了一个具有挑战性的双层优化问题：上层问题涉及选择最优机制参数，下层问题则需求解相应的MDP。我们的方法能够找到除福利外其他目标上表现优异的可信动态机制，且可应用于几乎任何问题场景——无需对估值施加限制——只要强化学习（RL）能为其学习到最优策略。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

Query2box: 使用盒嵌入对向量空间中的知识图谱进行推理，Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings

专知会员服务

46+阅读 · 2020年5月11日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日