IMPACTeen is a dataset of textual social influence scenarios spanning interpersonal, media-based, and digital settings in an adolescent context. It contains 1,021 texts, 5,100 individual annotation records, and gold labels for social influence techniques, with each text annotated from five distinct perspectives: teenagers, parents, psychologists, communication experts, and teachers. The resource was constructed through constrained LLM generation, followed by a two-step human editing and validation phase aimed at ensuring youth-context realism. A multi-dimensional annotation covered influence presence, techniques, intentions, consequences, resistance, reactions, and annotation confidence. The dataset supports research on social influence detection, annotator disagreement, cross-lingual modeling, and the training and evaluation of language models. The dataset was created in Polish and is accompanied by a corresponding English version.
翻译:IMPACTeen是一个面向青少年情境的文本社会影响场景数据集,涵盖人际交往、媒介传播及数字环境等多种情境。该数据集包含1021篇文本、5100条独立标注记录及社会影响技巧的黄金标签,每篇文本从青少年、家长、心理学家、传播学专家及教师五个不同视角进行标注。该资源通过受限的大语言模型生成构建,随后经过人工编辑与验证两阶段处理,以确保青少年情境的真实性。多维标注涵盖影响存在性、技巧、意图、后果、抵抗、反应及标注置信度。该数据集支持社会影响检测、标注者分歧分析、跨语言建模以及语言模型的训练与评估。数据集原始语言为波兰语,并配有对应的英文版本。