ReasonScaffold: A Scaffolded Reasoning-based Annotation Protocol for Human-AI Co-Annotation


Smitha Muthya Sudheendra, Jaideep Srivastava

Human annotation is central to NLP evaluation, yet subjective tasks often exhibit substantial variability across annotators. While large language models (LLMs) can provide structured reasoning to support annotation, their influence on human annotation behavior remains underexplored. We introduce ReasonScaffold, a scaffolded reasoning annotation protocol that exposes LLM-generated explanations while withholding predicted labels. Rather than evaluating annotation accuracy, we study how reasoning affects human annotation behavior in a controlled setting. Using a two-pass protocol inspired by Delphi-style revision, annotators first label instances independently and then revise their decisions after viewing model-generated reasoning. We evaluate the approach on sentiment classification and opinion detection tasks, analyzing changes in inter-annotator agreement and revision behavior. To quantify these effects, we introduce the Annotator Effort Proxy (AEP), a metric capturing the proportion of labels revised after exposure to reasoning. Our results show that exposure to reasoning is associated with increased agreement and only minimal revision, suggesting that reasoning helps resolve ambiguous cases without inducing widespread changes. These findings provide insight into how reasoning explanations shape annotation consistency and highlight reasoning-based scaffolds as a practical mechanism for human-AI co-annotation workflows.
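The two quantities the abstract tracks, revision rate and agreement, can be illustrated with a minimal Python sketch. It rests on assumptions the abstract does not confirm: AEP is taken literally as revised labels divided by total labels across all annotators, and mean pairwise percent agreement stands in for whichever inter-annotator agreement statistic the authors actually report. The function names and the toy labels are hypothetical.

```python
from itertools import combinations


def annotator_effort_proxy(first_pass, second_pass):
    """Fraction of (annotator, item) labels that changed between the two passes.

    Assumption: AEP = revised labels / total labels, pooled over annotators.
    """
    total = 0
    revised = 0
    for annotator, labels in first_pass.items():
        for before, after in zip(labels, second_pass[annotator]):
            total += 1
            revised += before != after
    return revised / total if total else 0.0


def mean_pairwise_agreement(labels_by_annotator):
    """Average share of items on which each pair of annotators agrees.

    Assumption: a simple stand-in for the paper's agreement statistic.
    """
    pairs = list(combinations(labels_by_annotator.values(), 2))
    if not pairs:
        return 1.0
    scores = [sum(a == b for a, b in zip(x, y)) / len(x) for x, y in pairs]
    return sum(scores) / len(scores)


# Hypothetical toy data: 3 annotators, 4 items, sentiment labels.
pass_one = {
    "ann_a": ["pos", "neg", "neu", "pos"],
    "ann_b": ["pos", "neg", "pos", "neg"],
    "ann_c": ["pos", "neu", "neu", "pos"],
}
# Labels after the annotators have seen model reasoning (no predicted labels).
pass_two = {
    "ann_a": ["pos", "neg", "neu", "pos"],
    "ann_b": ["pos", "neg", "neu", "pos"],
    "ann_c": ["pos", "neg", "neu", "pos"],
}

aep = annotator_effort_proxy(pass_one, pass_two)
agreement_before = mean_pairwise_agreement(pass_one)
agreement_after = mean_pairwise_agreement(pass_two)
print(f"AEP={aep:.2f} agreement: {agreement_before:.2f} -> {agreement_after:.2f}")
```

In this toy case 3 of 12 labels change, so the sketch's AEP is 0.25, while pairwise agreement moves from 0.50 to 1.00. The numbers are illustrative only and say nothing about the paper's reported results.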
