What ethical concerns, if any, do LLM researchers have? We introduce EthiCon, a corpus of 1,580 ethical concern statements extracted from scientific papers published in the ACL Anthology. We extract ethical concern keywords from the statements and show promising results in automating the concern identification process. Through a survey, we compare the ethical concerns in the corpus with the concerns listed by the general public and professionals in the field. Finally, we compare our retrieved ethical concerns with existing taxonomies, pointing to gaps and future research directions.