Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events

June 21, 2023

Matthew B. A. McDermott, Bret Nestor, Peniel Argaw, Isaac Kohane

Generative, pre-trained transformers (GPTs, a.k.a. "Foundation Models") have reshaped natural language processing (NLP) through their versatility across diverse downstream tasks. However, their potential extends far beyond NLP. This paper provides a software utility to help realize this potential, extending the applicability of GPTs to continuous-time sequences of complex events with internal dependencies, such as medical record datasets. Adoption of foundation models in these domains has so far been hampered by the lack of suitable tools for model construction and evaluation. To bridge this gap, we introduce Event Stream GPT (ESGPT), an open-source library designed to streamline the end-to-end process of building GPTs for continuous-time event sequences. ESGPT allows users to (1) build flexible, foundation-model-scale input datasets by specifying only a minimal configuration file, (2) leverage a Hugging Face-compatible modeling API for GPTs over this modality that incorporates intra-event causal dependency structures and autoregressive generation capabilities, and (3) evaluate models via standardized processes that can assess few-shot and even zero-shot performance of pre-trained models on user-specified fine-tuning tasks.
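To make the data modality concrete, the following is a minimal sketch of what a "continuous-time sequence of complex events" might look like in plain Python. It does not use the ESGPT API; the `Event` class, the `to_time_deltas` helper, and the example measurement names are all hypothetical and chosen only for illustration. The sketch assumes each event carries a timestamp plus several measurements, and that a model would consume the irregular gaps between events rather than a fixed time grid.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Event:
    """One event: a timestamp plus any number of named measurements (hypothetical)."""
    time: datetime
    measurements: dict


def to_time_deltas(events):
    """Sort events by time and pair each with minutes elapsed since the previous one.

    The first event gets a delta of 0.0. This is an illustrative encoding,
    not necessarily the one ESGPT uses internally.
    """
    ordered = sorted(events, key=lambda e: e.time)
    result = []
    previous_time = None
    for event in ordered:
        if previous_time is None:
            delta_minutes = 0.0
        else:
            delta_minutes = (event.time - previous_time).total_seconds() / 60.0
        result.append((delta_minutes, event.measurements))
        previous_time = event.time
    return result


# Example: three irregularly spaced events for a single subject (made-up values).
events = [
    Event(datetime(2023, 1, 1, 10, 0), {"lab_test": "glucose", "lab_value": 5.4}),
    Event(datetime(2023, 1, 1, 8, 0), {"event_type": "admission"}),
    Event(datetime(2023, 1, 1, 8, 30), {"heart_rate": 88.0, "event_type": "vitals"}),
]
sequence = to_time_deltas(events)
```

Each element of `sequence` pairs an inter-event gap with that event's measurements, which is the kind of irregular, multi-field input the abstract contrasts with ordinary token sequences in NLP.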
