Language documentation is a critical aspect of language preservation, often including the creation of Interlinear Glossed Text (IGT). Creating IGT is time-consuming and tedious, and automating the process can save valuable annotator effort. This paper describes the baseline system for the SIGMORPHON 2023 Shared Task of Interlinear Glossing. In our system, we utilize a transformer architecture and treat gloss generation as a sequence labelling task.
翻译:语言记录是语言保护的关键环节,通常包含行间注释文本(IGT)的创建。制作IGT耗时且繁琐,而自动化该过程可节省宝贵的标注人力。本文介绍了SIGMORPHON 2023行间注释共享任务的基线系统。在该系统中,我们采用Transformer架构,并将注释生成视为序列标注任务。