Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, thus it is crucial to investigate in details. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes while prior works in the area were limited to only the English memes. We describe the Memotion task, the data collection and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0
翻译:模因是社交媒体上用于幽默表达的新兴传播机制。模因通常包含图像和文本。模可能被用来传播虚假信息或仇恨言论,因此对其进行详细研究至关重要。我们推出了Memotion 3,这是一个包含10,000个标注模因的新数据集。与领域中其他流行数据集(包括Memotion的先前版本)不同,Memotion 3引入了印地语-英语混合编码模因,而该领域的先前工作仅限于英语模因。我们描述了Memotion任务、数据收集和数据集创建方法。我们还为该任务提供了基线模型。基线代码和数据集将在https://github.com/Shreyashm16/Memotion-3.0 上公开。