The Thematic Apperception Test (TAT) is a psychometrically grounded, multidimensional assessment framework that systematically differentiates between cognitive-representational and affective-relational components of personality-like functioning. It is a projective psychological test designed to uncover unconscious aspects of personality. This study examines whether the personality traits of Large Multimodal Models (LMMs) can be assessed through non-language-based modalities, using the Social Cognition and Object Relations Scale - Global (SCORS-G). LMMs are employed in two distinct roles: as subject models (SMs), which generate stories in response to TAT images, and as evaluator models (EMs), which assess these narratives using the SCORS-G framework. The EMs demonstrated an excellent ability to understand and analyze TAT responses, and their interpretations were highly consistent with those of human experts. The assessment results indicate that all models understand interpersonal dynamics very well and have a good grasp of the concept of self, but consistently fail to perceive and regulate aggression. Performance varied systematically across model families, with larger and more recent models consistently outperforming smaller and earlier ones across SCORS-G dimensions.