The Sejong dictionary dataset offers a valuable resource, providing extensive coverage of morphology, syntax, and semantic representation. This dataset can be utilized to explore linguistic information in greater depth. The labeled linguistic structures within this dataset form the basis for uncovering relationships between words and phrases and their associations with target verbs. This paper introduces a user-friendly web interface designed for the collection and consolidation of verb-related information, with a particular focus on subcategorization frames. Additionally, it outlines our efforts in mapping this information by aligning subcategorization frames with corresponding illustrative sentence examples. Furthermore, we provide a Python library that would simplify syntactic parsing and semantic role labeling. These tools are intended to assist individuals interested in harnessing the Sejong dictionary dataset to develop applications for Korean language processing.
翻译:世宗词典数据集提供了宝贵的资源,涵盖丰富的形态学、句法学及语义表征信息。该数据集可用于深入探索语言信息。其中标注的语言结构构成了揭示词语与短语之间关系及其与目标动词关联的基础。本文介绍了一个用户友好的网络界面,专为收集和整合动词相关信息而设计,尤其侧重于子语类框架。此外,本文概述了通过将子语类框架与对应示例句对齐来实现信息映射的工作。我们还提供了一个Python库,以简化句法分析和语义角色标注。这些工具旨在帮助有意利用世宗词典数据集开发韩语处理应用的研究者。