Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events

June 20, 2023

Matthew B. A. McDermott, Bret Nestor, Peniel Argaw, Isaac Kohane

Generative, pre-trained transformers (GPTs, a.k.a. "Foundation Models") have reshaped natural language processing (NLP) through their versatility in diverse downstream tasks. However, their potential extends far beyond NLP. This paper provides a software utility to help realize this potential, extending the applicability of GPTs to continuous-time sequences of complex events with internal dependencies, such as medical record datasets. Despite this potential, the adoption of foundation models in these domains has been hampered by the lack of suitable tools for model construction and evaluation. To bridge this gap, we introduce Event Stream GPT (ESGPT), an open-source library designed to streamline the end-to-end process for building GPTs for continuous-time event sequences. ESGPT allows users to (1) build flexible, foundation-model-scale input datasets by specifying only a minimal configuration file, (2) leverage a Hugging Face-compatible modeling API for GPTs over this modality that incorporates intra-event causal dependency structures and autoregressive generation capabilities, and (3) evaluate models via standardized processes that can assess few-shot and even zero-shot performance of pre-trained models on user-specified fine-tuning tasks.
