Towards Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results

from arxiv, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible (Major Revision)

Humour is a substantial element of human affect and cognition. Its automatic understanding can facilitate a more naturalistic human-device interaction and the humanisation of artificial intelligence. Current methods of humour detection are solely based on staged data making them inadequate for 'real-world' applications. We address this deficiency by introducing the novel Passau-Spontaneous Football Coach Humour (Passau-SFCH) dataset, comprising of about 11 hours of recordings. The Passau-SFCH dataset is annotated for the presence of humour and its dimensions (sentiment and direction) as proposed in Martin's Humor Style Questionnaire. We conduct a series of experiments, employing pretrained Transformers, convolutional neural networks, and expert-designed features. The performance of each modality (text, audio, video) for spontaneous humour recognition is analysed and their complementarity is investigated. Our findings suggest that for the automatic analysis of humour and its sentiment, facial expressions are most promising, while humour direction can be best modelled via text-based features. The results reveal considerable differences among various subjects, highlighting the individuality of humour usage and style. Further, we observe that a decision-level fusion yields the best recognition result. Finally, we make our code publicly available at https://www.github.com/EIHW/passau-sfch. The Passau-SFCH dataset is available upon request.

翻译：幽默是人类情感与认知的重要组成元素。对其自动理解有助于实现更自然的人机交互以及人工智能的人性化。当前幽默检测方法完全基于编排式数据，这使其难以适用于"真实世界"场景。为弥补这一缺陷，我们引入了全新的Passau-Spontaneous Football Coach Humour（帕绍自发性足球教练幽默，简称Passau-SFCH）数据集，包含约11小时的录音。该数据集基于Martin幽默风格问卷中提出的维度（情感倾向与指向性）进行了幽默存在性标注。我们开展了一系列实验，采用预训练Transformer、卷积神经网络及专家设计特征。分析了各模态（文本、音频、视频）在自发性幽默识别中的表现，并探究了其互补性。研究表明，在幽默及其情感的自动分析中，面部表情最具潜力，而幽默指向性则可通过文本特征进行最佳建模。结果揭示了不同受试者间的显著差异，突出了幽默使用方式与风格的个体性。此外，我们发现决策级融合能获得最佳识别效果。最后，我们已将代码公开于https://www.github.com/EIHW/passau-sfch，Passau-SFCH数据集可按需获取。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

《生成式模型: 变分自编码器与扩散模型》，75页ppt，Google DeepMind科学家Ruiqi Gao

专知会员服务

66+阅读 · 2023年6月10日

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

35+阅读 · 2019年10月18日