Dialogue Act (DA) annotation typically treats communicative or pedagogical intent as localized to individual utterances or turns. As a result, annotators often agree on the underlying action while disagreeing on segment boundaries, reducing apparent reliability. We propose codebook-injected segmentation, which conditions boundary decisions on downstream annotation criteria, and evaluate LLM-based segmenters against standard and retrieval-augmented baselines. To assess these segmenters without gold labels, we introduce evaluation metrics for span consistency, distinctiveness, and human-AI distributional agreement. We find that DA-awareness produces segments that are internally more consistent than those from text-only baselines. While LLMs excel at creating construct-consistent spans, coherence-based baselines remain superior at detecting global shifts in dialogue flow. Across two datasets, no single segmenter dominates: improvements in within-segment coherence frequently trade off against boundary distinctiveness and human-AI distributional agreement. These results highlight segmentation as a consequential design choice that should be optimized for downstream objectives rather than for a single performance score.