Position: The ML Community Must Build an AI-Augmented Peer-Review Ecosystem

Peer review, the bedrock of scientific advancement in machine learning (ML), is strained by a crisis of scale. Exponential growth in manuscript submissions to premier ML venues such as NeurIPS, ICML, and ICLR is outpacing the finite capacity of qualified reviewers, raising concerns about review quality, consistency, and reviewer fatigue. This position paper argues that AI-assisted peer review must become an urgent research and infrastructure priority. We advocate for a comprehensive AI-augmented ecosystem that leverages Large Language Models (LLMs) not as replacements for human judgment, but as sophisticated collaborators for authors, reviewers, and Area Chairs (ACs). We propose specific roles for AI in enhancing factual verification, guiding reviewer performance, assisting authors in quality improvement, and supporting ACs in decision-making. Crucially, we contend that the development of such systems hinges on access to more granular, structured, and ethically sourced data about the peer-review process. We outline a research agenda, including illustrative experiments, to develop and validate these AI assistants, and we discuss the significant technical and ethical challenges involved. We call upon the ML community to proactively build this AI-assisted future, ensuring the continued integrity and scalability of scientific validation while maintaining high standards of peer review.

