We build a computational model of how humans actively infer hidden rules by doing experiments. The basic principles behind the model is that, even if the rule is deterministic, the learner considers a broader space of fuzzy probabilistic rules, which it represents in natural language, and updates its hypotheses online after each experiment according to approximately Bayesian principles. In the same framework we also model experiment design according to information-theoretic criteria. We find that the combination of these three principles -- explicit hypotheses, probabilistic rules, and online updates -- can explain human performance on a Zendo-style task, and that removing any of these components leaves the model unable to account for the data.
翻译:我们构建了一个计算模型,模拟人类如何通过实验主动推断隐藏规则。该模型的基本原理在于,即使规则是确定性的,学习者仍会考虑一个由模糊概率规则构成的更广阔空间,这些规则以自然语言表示,并依据近似贝叶斯原理在每次实验后在线更新假设。在同一框架下,我们还根据信息论标准对实验设计进行建模。我们发现,这三个原则——显式假设、概率规则和在线更新——的结合能够解释人类在Zendo式任务中的表现,而移除其中任何一个组件都会使模型无法解释相关数据。