Programming is essential to modern scientific research, yet most scientists report inadequate training for the software development their work demands. Generative AI tools capable of code generation may support scientific programmers, but user studies indicate risks of over-reliance, particularly among inexperienced users. We surveyed 868 scientists who program, examining adoption patterns, tool preferences, and factors associated with perceived productivity. Adoption is highest among students and less experienced programmers, with variation across fields. Scientific programmers overwhelmingly prefer general-purpose conversational interfaces like ChatGPT over developer-specific tools. Both inexperience and limited use of development practices (such as testing, code review, and version control) are associated with greater perceived productivity, but these factors interact, suggesting formal practices may partially compensate for inexperience. The strongest predictor of perceived productivity is the number of lines of generated code typically accepted at once. These findings suggest scientific programmers using generative AI may gauge productivity by code generation rather than validation, raising concerns about research code integrity.