Multilingual Simplification of Medical Texts

Automated text simplification aims to produce simple versions of complex texts. This task is especially useful in the medical domain, where the latest medical findings are typically communicated via complex and technical articles. This creates barriers for laypeople seeking access to up-to-date medical findings, consequently impeding progress on health literacy. Most existing work on medical text simplification has focused on monolingual settings, with the result that such evidence would be available only in just one language (most often, English). This work addresses this limitation via multilingual simplification, i.e., directly simplifying complex texts into simplified texts in multiple languages. We introduce MultiCochrane, the first sentence-aligned multilingual text simplification dataset for the medical domain in four languages: English, Spanish, French, and Farsi. We evaluate fine-tuned and zero-shot models across these languages, with extensive human assessments and analyses. Although models can now generate viable simplified texts, we identify outstanding challenges that this dataset might be used to address.

翻译：自动文本简化旨在将复杂文本转化为简洁版本。该任务在医学领域尤为重要，因为最新医学发现通常通过复杂且专业的技术文章传播，这为普通民众获取前沿医疗信息设置了障碍，进而影响健康素养的提升。现有医学文本简化研究多聚焦于单语言场景，导致相关证据只能以单一语言（通常为英语）呈现。本研究通过多语言简化突破这一局限，即直接将复杂文本简化为多语言简化版本。我们构建了MultiCochrane数据集——首个面向医学领域（涵盖英语、西班牙语、法语和波斯语四种语言）的句子对齐多语言文本简化数据集。通过大量人工评估与分析，我们对这些语言的微调模型与零样本模型进行了评测。尽管现有模型已能生成可行的简化文本，我们仍识别出该数据集可用于解决的一系列突出挑战。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【NeurIPS2021】用于文本图表示学习的 GNN 嵌套 Transformer 模型：GraphFormers

专知会员服务

46+阅读 · 2021年11月24日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日