Sarcasm is a pragmatic phenomenon in which speakers convey meanings that diverge from literal content, relying on an interaction between semantics and prosodic expression. However, how these cues jointly contribute to the recognition of sarcasm remains poorly understood. We propose a computational framework that models sarcasm as the integration of semantic interpretation and prosodic realization. Semantic cues are derived from an LLaMA 3 model fine-tuned to capture discourse-level markers of sarcastic intent, while prosodic cues are extracted through semantically aligned utterances drawn from a database of sarcastic speech, providing prosodic exemplars of sarcastic delivery. Using a speech synthesis testbed, perceptual evaluations demonstrate that both semantic and prosodic cues independently enhance listeners' perception of sarcasm, with the strongest effects emerging when the two are combined. These findings highlight the complementary roles of semantics and prosody in pragmatic interpretation and illustrate how modeling can shed light on the mechanisms underlying sarcastic communication.
翻译:讽刺是一种语用现象,说话者通过语义与韵律表达的交互作用传递与字面内容相悖的含义。然而,这些线索如何共同促成讽刺识别仍不甚明晰。本文提出一个计算框架,将讽刺建模为语义解读与韵律实现的整合过程。语义线索源自经过微调的LLaMA 3模型,该模型专门捕捉话语层面的讽刺意图标记;韵律线索则通过从讽刺语音数据库中提取语义对齐的语句获得,从而提供讽刺表达的韵律范例。在语音合成测试平台上进行的感知评估表明,语义与韵律线索均能独立增强听者对讽刺的感知,且二者结合时产生最显著的效果。这些发现揭示了语义与韵律在语用解读中的互补作用,并展示了建模方法如何揭示讽刺交际的内在机制。