TESS: Text-to-Text Self-Conditioned Simplex Diffusion - Zhuanzhi Papers

May 15, 2023


Rabeeh Karimi Mahabadi, Jaesung Tae, Hamish Ivison, James Henderson, Iz Beltagy, Matthew E. Peters, Arman Cohan

From arXiv; 9 pages, 4 figures; preprint.

Diffusion models have emerged as a powerful paradigm for generation, obtaining strong performance in various domains with continuous-valued inputs. Despite the promises of fully non-autoregressive text generation, applying diffusion models to natural language remains challenging due to its discrete nature. In this work, we propose Text-to-text Self-conditioned Simplex Diffusion (TESS), a text diffusion model that is fully non-autoregressive, employs a new form of self-conditioning, and applies the diffusion process on the logit simplex space rather than the typical learned embedding space. Through extensive experiments on natural language understanding and generation tasks including summarization, text simplification, paraphrase generation, and question generation, we demonstrate that TESS outperforms state-of-the-art non-autoregressive models and is competitive with pretrained autoregressive sequence-to-sequence models.
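The abstract does not spell out how tokens are placed on the logit simplex, so the following is only a minimal sketch of one common setup for simplex-based text diffusion and not necessarily the exact formulation TESS uses. It assumes each token maps to an almost-one-hot logit vector (+k at the token index, -k elsewhere), that the forward process adds Gaussian noise with standard deviation k, and that a softmax projects noisy logits back onto the probability simplex. The names `tokens_to_logits`, `add_noise`, `k`, and `alpha_bar` are hypothetical and chosen for illustration.

```python
import math
import random


def tokens_to_logits(token_ids, vocab_size, k=5.0):
    """Assumed mapping: +k at the token's index, -k at every other index."""
    return [[k if j == tok else -k for j in range(vocab_size)] for tok in token_ids]


def add_noise(logits, alpha_bar, k=5.0, rng=None):
    """Assumed forward step: sqrt(alpha_bar) * s0 + sqrt(1 - alpha_bar) * eps,
    with eps drawn from a Gaussian of standard deviation k."""
    rng = rng or random.Random(0)
    a = math.sqrt(alpha_bar)
    b = math.sqrt(1.0 - alpha_bar)
    return [[a * x + b * rng.gauss(0.0, k) for x in row] for row in logits]


def softmax(row):
    """Project a logit vector onto the probability simplex."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def decode(logits):
    """Read tokens back by taking the argmax at every position."""
    return [max(range(len(row)), key=row.__getitem__) for row in logits]


tokens = [2, 0, 3]
clean = tokens_to_logits(tokens, vocab_size=4)
noisy = add_noise(clean, alpha_bar=0.9)
probs = [softmax(row) for row in noisy]
```

Because every position is noised and denoised in parallel, a model trained to recover `clean` from `noisy` can generate all tokens at once, which is what makes this family of models fully non-autoregressive.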

