In recent years, the utilization of large language models for natural language dialogue has gained momentum, leading to their widespread adoption across various domains. However, their universal competence in addressing challenges specific to specialized fields such as law remains a subject of scrutiny. The incorporation of legal ethics into the model has been overlooked by researchers. We asserts that rigorous ethic evaluation is essential to ensure the effective integration of large language models in legal domains, emphasizing the need to assess domain-specific proficiency and domain-specific ethic. To address this, we propose a novelty evaluation methodology, utilizing authentic legal cases to evaluate the fundamental language abilities, specialized legal knowledge and legal robustness of large language models (LLMs). The findings from our comprehensive evaluation contribute significantly to the academic discourse surrounding the suitability and performance of large language models in legal domains.
翻译:近年来,利用大语言模型进行自然语言对话的趋势日益增强,使其在各个领域得到广泛应用。然而,它们在处理法律等专业领域特有挑战方面的通用能力仍受到质疑。研究人员忽视了将法律伦理融入模型的问题。我们主张,严格的伦理评估对于确保大语言模型在法律领域的有效整合至关重要,强调需要评估领域专业能力和领域特定伦理。为此,我们提出了一种新颖的评估方法,利用真实法律案例来评估大语言模型的基本语言能力、专业法律知识以及法律鲁棒性。我们的综合评估结果为关于大语言模型在法律领域适用性与性能的学术讨论提供了重要贡献。