Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research

Low-resource languages serve as invaluable repositories of human history, embodying cultural evolution and intellectual diversity. Despite their significance, these languages face critical challenges, including data scarcity and technological limitations, which hinder their comprehensive study and preservation. Recent advancements in large language models (LLMs) offer transformative opportunities for addressing these challenges, enabling innovative methodologies in linguistic, historical, and cultural research. This study systematically evaluates the applications of LLMs in low-resource language research, encompassing linguistic variation, historical documentation, cultural expressions, and literary analysis. By analyzing technical frameworks, current methodologies, and ethical considerations, this paper identifies key challenges such as data accessibility, model adaptability, and cultural sensitivity. Given the cultural, historical, and linguistic richness inherent in low-resource languages, this work emphasizes interdisciplinary collaboration and the development of customized models as promising avenues for advancing research in this domain. By underscoring the potential of integrating artificial intelligence with the humanities to preserve and study humanity's linguistic and cultural heritage, this study fosters global efforts towards safeguarding intellectual diversity.

