The pervasive deployment of Large Language Models (LLMs) across sectors often neglects the nuanced requirements of individuals and small organizations, who benefit more from models precisely tailored to their specific business contexts than from models with broadly superior general capabilities. This work introduces \textbf{AnyTaskTune}, a novel fine-tuning methodology we term \textbf{Task-Fine-Tune}, developed to elevate model performance on a diverse array of domain-specific tasks. The method first identifies and defines targeted sub-tasks within a domain, then constructs specialized enhancement datasets for fine-tuning, thereby optimizing task-specific model performance. We conducted comprehensive fine-tuning experiments not only in the legal domain, on tasks such as keyword extraction and sentence prediction, but across more than twenty sub-tasks drawn from finance, healthcare, law, psychology, consumer services, and human resources. To substantiate our approach and facilitate community engagement, we will open-source these bilingual task datasets. Our findings demonstrate that models fine-tuned with the \textbf{Task-Fine-Tune} methodology not only achieve superior performance on their specific tasks but also significantly outperform models with higher general capabilities within the corresponding domains. Our work is publicly available at \url{https://github.com/PandaVT/DataTager}.
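As a hypothetical illustration of the dataset-construction step (the abstract does not specify a record schema; the field names and the helper below follow common instruction-tuning conventions and are assumptions, not the paper's actual format), a training example for the legal keyword-extraction sub-task might be an instruction–input–output triple serialized as one JSONL line for supervised fine-tuning:

```python
import json

# Hypothetical record for the legal keyword-extraction sub-task.
# The "instruction"/"input"/"output" fields are illustrative only.
record = {
    "instruction": "Extract the legal keywords from the following sentence.",
    "input": "The defendant was charged with breach of contract and fraud.",
    "output": "breach of contract; fraud",
}

def to_training_text(rec: dict) -> str:
    """Flatten a record into a single prompt/response string for SFT."""
    return (
        f"### Instruction:\n{rec['instruction']}\n"
        f"### Input:\n{rec['input']}\n"
        f"### Response:\n{rec['output']}"
    )

# One JSON object per line is a common on-disk format for such datasets.
jsonl_line = json.dumps(record, ensure_ascii=False)
print(to_training_text(record))
```

Collecting many such triples per sub-task, in both languages, would yield the kind of bilingual task-specific dataset the abstract describes.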