The Automated Verification of Textual Claims (AVeriTeC) shared task asks participants to retrieve evidence and predict veracity for real-world claims checked by fact-checkers. Evidence can be found either via a search engine, or via a knowledge store provided by the organisers. Submissions are evaluated using AVeriTeC score, which considers a claim to be accurately verified if and only if both the verdict is correct and retrieved evidence is considered to meet a certain quality threshold. The shared task received 21 submissions, 18 of which surpassed our baseline. The winning team was TUDA_MAI with an AVeriTeC score of 63%. In this paper we describe the shared task, present the full results, and highlight key takeaways from the shared task.
翻译:文本声明自动验证(AVeriTeC)共享任务要求参与者为事实核查机构验证过的真实世界声明检索证据并预测真实性。证据可通过搜索引擎获取,或通过组织者提供的知识库查找。提交结果采用AVeriTeC评分进行评估,该评分认为当且仅当判定结果正确且检索到的证据达到特定质量阈值时,声明才被视为被准确验证。本次共享任务共收到21份提交结果,其中18份超越基线水平。获胜团队TUDA_MAI的AVeriTeC得分为63%。本文详细描述了该共享任务,呈现完整结果,并重点总结了任务中的关键发现。