Readability Controllable Biomedical Document Summarization - 专知论文

会员服务 ·

0

Readability · 控制器 · 语言模型化 · MoDELS · 掩码语言模型化 ·

2023 年 5 月 1 日

Readability Controllable Biomedical Document Summarization

翻译：可读性可控的生物医学文档摘要

Zheheng Luo,Qianqian Xie,Sophia Ananiadou

from arxiv, accepted to the Findings of EMNLP 2022

Different from general documents, it is recognised that the ease with which people can understand a biomedical text is eminently varied, owing to the highly technical nature of biomedical documents and the variance of readers' domain knowledge. However, existing biomedical document summarization systems have paid little attention to readability control, leaving users with summaries that are incompatible with their levels of expertise. In recognition of this urgent demand, we introduce a new task of readability controllable summarization for biomedical documents, which aims to recognise users' readability demands and generate summaries that better suit their needs: technical summaries for experts and plain language summaries (PLS) for laymen. To establish this task, we construct a corpus consisting of biomedical papers with technical summaries and PLSs written by the authors, and benchmark multiple advanced controllable abstractive and extractive summarization models based on pre-trained language models (PLMs) with prevalent controlling and generation techniques. Moreover, we propose a novel masked language model (MLM) based metric and its variant to effectively evaluate the readability discrepancy between lay and technical summaries. Experimental results from automated and human evaluations show that though current control techniques allow for a certain degree of readability adjustment during generation, the performance of existing controllable summarization methods is far from desirable in this task.

翻译：与普通文档不同，由于生物医学文档的高度技术性以及读者领域知识的差异，人们理解生物医学文本的难易程度存在显著差异。然而，现有的生物医学文档摘要系统很少关注可读性控制，导致生成的摘要与用户的专业水平不匹配。鉴于这一迫切需求，我们提出了一项新的任务——生物医学文档的可读性可控摘要。该任务旨在识别用户的可读性需求，并生成更符合其需求的摘要：为专家生成技术性摘要，为普通人生成通俗语言摘要。为建立该任务，我们构建了一个语料库，包含生物医学论文及其由作者撰写的技术摘要和通俗语言摘要，并基于预训练语言模型（PLMs）结合主流控制与生成技术，对多种先进的可控抽象式和抽取式摘要模型进行了基准测试。此外，我们提出了一种基于掩码语言模型（MLM）的度量指标及其变体，以有效评估通俗摘要与技术摘要之间的可读性差异。自动化和人工评估的实验结果表明，尽管当前的控制技术允许在生成过程中进行一定程度的可读性调整，但现有可控摘要方法在该任务上的表现远未达到理想水平。

0

相关内容

Readability

一个旨在提升互联网阅读体验的工具。 http://readability.com/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

分数阶偏微分方程的不变流形

国家自然科学基金

0+阅读 · 2015年12月31日

基于3D/2D内在约束的复杂形状描述及抽象特征研究

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β/β-catenin信号通路参与ARDS后认知功能障碍发生的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

E3泛素连接酶CUL7修饰Caspase-8调节乳腺癌细胞生存的研究

国家自然科学基金

0+阅读 · 2013年12月31日

MicroRNA-10a/b靶向调控ABCA1和ABCG1对胆固醇流出的影响

国家自然科学基金

0+阅读 · 2013年12月31日

电磁辐射与阿尔茨海默病的相关性及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ABCA1甲基化在动脉粥样硬化中的作用及miR-155靶向调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

Fucí意义下的跨共振的Sturm-Liouville问题

国家自然科学基金

0+阅读 · 2012年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer

Arxiv

0+阅读 · 2023年6月13日

Understand Legal Documents with Contextualized Large Language Models

Arxiv

0+阅读 · 2023年6月13日

HELP ME THINK: A Simple Prompting Strategy for Non-experts to Create Customized Content with Models

Arxiv

0+阅读 · 2023年6月12日

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images

Arxiv

0+阅读 · 2023年6月12日

On the Amplification of Linguistic Bias through Unintentional Self-reinforcement Learning by Generative Language Models -- A Perspective

Arxiv

0+阅读 · 2023年6月12日

Weakly supervised information extraction from inscrutable handwritten document images

Arxiv

0+阅读 · 2023年6月12日

ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Arxiv

0+阅读 · 2023年6月11日

Embodied Executable Policy Learning with Language-based Scene Summarization

Arxiv

0+阅读 · 2023年6月9日

Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization

Arxiv

0+阅读 · 2023年6月8日

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

Arxiv

10+阅读 · 2018年4月28日

VIP会员

文章信息

相关主题

语言模型化

掩码语言模型化

最新内容

AUTOLAB：86亿Token实测前沿模型的长程自动科研能力

AUTOLAB：86亿Token实测前沿模型的长程自动科研能力

专知会员服务

3+阅读 · 6月12日

CVPR 2026趋势报告：视觉AI正在走向世界模型与物理智能，165页ppt

CVPR 2026趋势报告：视觉AI正在走向世界模型与物理智能，165页ppt

专知会员服务

13+阅读 · 6月12日

乌克兰战场背后的新武器

乌克兰战场背后的新武器

专知会员服务

5+阅读 · 6月12日

《信任但需验证：军事决策背景下的大型语言模型品格、能力与控制》2026最新59页报告

《信任但需验证：军事决策背景下的大型语言模型品格、能力与控制》2026最新59页报告

专知会员服务

11+阅读 · 6月12日

未来战争：乌克兰2026年反攻中的作战经验教训 - 新军事战略之“后勤封锁”（中文下载）

未来战争：乌克兰2026年反攻中的作战经验教训 - 新军事战略之“后勤封锁”（中文下载）

专知会员服务

7+阅读 · 6月12日

基于博弈论的陆军人机协同（长文报告）

基于博弈论的陆军人机协同（长文报告）

专知会员服务

11+阅读 · 6月12日

《天气对反无人机系统“探测-跟踪-识别-失效”链路的影响：俄乌战场分析》

《天气对反无人机系统“探测-跟踪-识别-失效”链路的影响：俄乌战场分析》

专知会员服务

10+阅读 · 6月12日

美国陆军航空兵：以愿景引领转型

美国陆军航空兵：以愿景引领转型

专知会员服务

6+阅读 · 6月12日

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

CVPR 2026教程｜扩散模型原理：连续、离散与实时生成

专知会员服务

5+阅读 · 6月11日

重磅综述｜大模型智能体环境工程：建模、合成、评估与协同演化

重磅综述｜大模型智能体环境工程：建模、合成、评估与协同演化

专知会员服务

6+阅读 · 6月11日

面向特种部队的、以操作员为中心的人工智能决策支持系统框架

面向特种部队的、以操作员为中心的人工智能决策支持系统框架

专知会员服务

8+阅读 · 6月11日

《多域战场上反制小型无人机系统》150页

《多域战场上反制小型无人机系统》150页

专知会员服务

17+阅读 · 6月11日

《基于成果军事教育框架下的军官联合职业军事教育认证程序》2026最新170页

《基于成果军事教育框架下的军官联合职业军事教育认证程序》2026最新170页

专知会员服务

5+阅读 · 6月11日

战场人工智能：增强陆地作战能力的发现与要求

战场人工智能：增强陆地作战能力的发现与要求

专知会员服务

3+阅读 · 6月11日

人工智能赋能指挥所：以人工智能为中心的指挥控制的核心要素

人工智能赋能指挥所：以人工智能为中心的指挥控制的核心要素

专知会员服务

16+阅读 · 6月11日

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

CVPR 2026趋势报告：视觉AI正在走向世界模型与物理智能，165页ppt

《信任但需验证：军事决策背景下的大型语言模型品格、能力与控制》2026最新59页报告

AUTOLAB：86亿Token实测前沿模型的长程自动科研能力

乌克兰战场背后的新武器

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer

Arxiv

0+阅读 · 2023年6月13日

Understand Legal Documents with Contextualized Large Language Models

Arxiv

0+阅读 · 2023年6月13日

HELP ME THINK: A Simple Prompting Strategy for Non-experts to Create Customized Content with Models

Arxiv

0+阅读 · 2023年6月12日

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images

Arxiv

0+阅读 · 2023年6月12日

On the Amplification of Linguistic Bias through Unintentional Self-reinforcement Learning by Generative Language Models -- A Perspective

Arxiv

0+阅读 · 2023年6月12日

Weakly supervised information extraction from inscrutable handwritten document images

Arxiv

0+阅读 · 2023年6月12日

ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Arxiv

0+阅读 · 2023年6月11日

Embodied Executable Policy Learning with Language-based Scene Summarization

Arxiv

0+阅读 · 2023年6月9日

Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization

Arxiv

0+阅读 · 2023年6月8日

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

Arxiv

10+阅读 · 2018年4月28日

相关基金

分数阶偏微分方程的不变流形

国家自然科学基金

0+阅读 · 2015年12月31日

基于3D/2D内在约束的复杂形状描述及抽象特征研究

国家自然科学基金

0+阅读 · 2013年12月31日

GSK-3β/β-catenin信号通路参与ARDS后认知功能障碍发生的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

E3泛素连接酶CUL7修饰Caspase-8调节乳腺癌细胞生存的研究

国家自然科学基金

0+阅读 · 2013年12月31日

MicroRNA-10a/b靶向调控ABCA1和ABCG1对胆固醇流出的影响

国家自然科学基金

0+阅读 · 2013年12月31日

电磁辐射与阿尔茨海默病的相关性及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

ABCA1甲基化在动脉粥样硬化中的作用及miR-155靶向调控机制

国家自然科学基金

0+阅读 · 2012年12月31日

Fucí意义下的跨共振的Sturm-Liouville问题

国家自然科学基金

0+阅读 · 2012年12月31日

Curcumin双向调控HO-1/HO-2协同抑制Aβeme复合物防治AD的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

GmMADS1在大豆花发育中的调控机理研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员