For multilingual language models, the choice of training languages matters because of the curse of multilinguality. Languages with similar structures are known to be effective for cross-lingual transfer learning. However, we demonstrate that agglutinative languages such as Korean are even more effective for cross-lingual transfer learning. This finding has the potential to change how training languages are selected for cross-lingual transfer learning.