We propose a simplified human-in-the-loop workflow for second language (L2) Korean morphosyntactic annotation by leveraging agreement between two domain-adapted parsers. We first evaluate whether parser agreement can serve as a proxy for annotation correctness by comparing it with independent human judgments. The results show strong correspondence between parser and human judgments, supporting the feasibility of semi-automatic L2-Korean UD annotation. Further analysis demonstrates that parser disagreements cluster in linguistically predictable domains such as grammatical-relation distinctions and clause-boundary ambiguity. While many disagreement cases are tractable for iterative model refinement, others reflect deeper representational challenges inherent in parsing and tagging L2-Korean corpora.
翻译:我们提出一种简化的人在回路工作流程,用于第二语言(L2)韩语形态句法标注,该方法利用两个领域自适应解析器之间的一致性。首先,通过将解析器一致性与独立人工判断进行对比,评估其能否作为标注正确性的代理指标。结果表明,解析器判断与人工判断之间存在高度对应关系,支持了半自动L2韩语UD标注的可行性。进一步分析显示,解析器分歧集中在语法关系区分和从句边界模糊等语言可预测领域。尽管许多分歧案例可通过迭代模型优化解决,但其他案例则反映了标注和解析L2韩语语料库时固有的深层表征挑战。