Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

Chen Ling,Xujiang Zhao,Jiaying Lu,Chengyuan Deng,Can Zheng,Junxiang Wang,Tanmoy Chowdhury,Yun Li,Hejie Cui,Xuchao Zhang,Tianjiao Zhao,Amit Panalkar,Wei Cheng,Haoyu Wang,Yanchi Liu,Zhengzhang Chen,Haifeng Chen,Chris White,Quanquan Gu,Jian Pei,Liang Zhao

Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of domain objectives, and the diversity of the constraints (e.g., various social norms, cultural conformity, religious beliefs, and ethical standards in the domain applications). Domain specification techniques are key to make large language models disruptive in many applications. Specifically, to solve these hurdles, there has been a notable increase in research and practices conducted in recent years on the domain specialization of LLMs. This emerging field of study, with its substantial potential for impact, necessitates a comprehensive and systematic review to better summarize and guide ongoing work in this area. In this article, we present a comprehensive survey on domain specification techniques for large language models, an emerging direction critical for large language model applications. First, we propose a systematic taxonomy that categorizes the LLM domain-specialization techniques based on the accessibility to LLMs and summarizes the framework for all the subcategories as well as their relations and differences to each other. Second, we present an extensive taxonomy of critical application domains that can benefit dramatically from specialized LLMs, discussing their practical significance and open challenges. Last, we offer our insights into the current research status and future trends in this area.

翻译：大型语言模型（LLMs）显著推动了自然语言处理领域的发展，为各类应用提供了高度实用、任务无关的基础能力。然而，将LLMs直接应用于解决特定领域的复杂问题面临诸多挑战，这些挑战源于领域数据的异质性、领域知识的复杂性、领域目标的独特性以及约束条件的多样性（例如领域应用中不同的社会规范、文化遵从性、宗教信仰和伦理标准）。领域专业化技术是使大型语言模型在众多应用中发挥颠覆性作用的关键。具体而言，为应对这些挑战，近年来针对LLMs领域专业化的研究与实践显著增加。这一新兴研究领域具有巨大的影响潜力，亟需全面系统的综述来总结并指导该领域的持续工作。本文对大型语言模型的领域专业化技术进行了全面综述——这一新兴方向对LLM应用至关重要。首先，我们提出一个系统化的分类体系，基于对LLMs的可访问性对领域专业化技术进行归类，并总结各子类别的框架及其相互关系与差异。其次，我们构建了能显著受益于专业化LLM的关键应用领域的广泛分类体系，探讨其实际意义与开放挑战。最后，我们对该领域的研究现状及未来趋势提出见解。