Automatic Authorship Attribution (AAA) is the result of applying tools and techniques from Digital Humanities to authorship attribution studies. Through a quantitative and statistical approach this discipline can draw further conclusions about renowned authorship issues which traditional critics have been dealing with for centuries, opening a new door to style comparison. The aim of this paper is to prove the potential of these tools and techniques by testing the authorship of five comedies traditionally attributed to Spanish playwright Tirso de Molina (1579-1648): La ninfa del cielo, El burlador de Sevilla, Tan largo me lo fiais, La mujer por fuerza and El condenado por desconfiado. To accomplish this purpose some experiments concerning clustering analysis by Stylo package from R and four distance measures are carried out on a corpus built with plays by Tirso, Andres de Claramonte (c. 1560-1626), Antonio Mira de Amescua (1577-1644) and Luis Velez de Guevara (1579-1644). The results obtained point to the denial of all the attributions to Tirso except for the case of La mujer por fuerza.
翻译:自动作者归属(AAA)是数字人文领域工具与技术应用于作者归属研究的结果。通过定量与统计方法,该学科能够对传统评论家数百年来探讨的著名作者归属问题得出进一步结论,为风格比较开辟了新途径。本文旨在通过检验西班牙剧作家蒂尔索·德·莫利纳(1579-1648)传统上被归为其所作的五部喜剧——《天堂仙女》《塞维利亚的骗子》《你让我等这么久》《被迫的女人》和《因不信任而被定罪》——的作者身份,证明这些工具与技术的潜力。为此,我们基于蒂尔索、安德烈斯·德·克拉拉蒙特(约1560-1626)、安东尼奥·米拉·德·阿梅斯夸(1577-1644)和路易斯·贝莱斯·德·格瓦拉(1579-1644)的剧作构建语料库,运用R语言Stylo包的聚类分析及四种距离度量进行实验。结果显示,除《被迫的女人》外,其余剧作均否认归因于蒂尔索。