Towards a Path Dependent Account of Category Fluency

Category fluency is a widely studied cognitive phenomenon, yet two conflicting accounts have been proposed as the underlying retrieval mechanism -- an optimal foraging process deliberately searching through memory (Hills et al., 2012) and a random walk sampling from a semantic network (Abbott et al., 2015). Evidence for both accounts has centered around predicting human patch switches, where both existing models of category fluency produce paradoxically identical results. We begin by peeling back the assumptions made by existing models, namely that each named example only depends on the previous example, by (i) adding an additional bias to model the category transition probability directly and (ii) relying on a large language model to predict based on the entire existing sequence. Then, we present evidence towards resolving the disagreement between each account of foraging by reformulating models as sequence generators. To evaluate, we compare generated category fluency runs to a bank of human-written sequences by proposing a metric based on n-gram overlap. We find category switch predictors do not necessarily produce human-like sequences, in fact the additional biases used by the Hills et al. (2012) model are required to improve generation quality, which are later improved by our category modification. Even generating exclusively with an LLM requires an additional global cue to trigger the patch switching behavior during production. Further tests on only the search process on top of the semantic network highlight the importance of deterministic search to replicate human behavior.

翻译：类别流畅性是一种被广泛研究的认知现象，然而其潜在检索机制存在两种相互矛盾的解释——一种是有意识搜索记忆的最优觅食过程（Hills等，2012），另一种是从语义网络中随机游走采样（Abbott等，2015）。两种解释的证据均集中于预测人类“补丁切换”行为，而现有类别流畅性模型却产生了看似一致的矛盾结果。我们首先通过剥离现有模型的假设（即每个命名示例仅依赖于前一个示例），采取两种改进方式：（i）直接添加额外偏置以建模类别转移概率，（ii）依赖大型语言模型基于整个现有序列进行预测。随后，我们将觅食模型重新表述为序列生成器，为解决两种解释间的分歧提供证据。为了评估模型，我们提出基于n-gram重叠的度量指标，将生成的类别流畅性序列与人类撰写的序列库进行比较。研究发现：类别切换预测器不一定能生成类人序列，实际上需采用Hills等（2012）模型中的额外偏置来提升生成质量，而我们的类别修正进一步优化了该效果。即使完全使用大型语言模型生成序列，也需额外全局线索触发生产过程中的补丁切换行为。仅对语义网络上的搜索过程进行测试进一步凸显了确定性搜索在复现人类行为中的重要性。

相关内容

MoDELS

关注 45

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日