We present GR-NLP-TOOLKIT, an open-source natural language processing (NLP) toolkit developed specifically for modern Greek. The toolkit provides state-of-the-art performance in five core NLP tasks, namely part-of-speech tagging, morphological tagging, dependency parsing, named entity recognition, and Greeklishto-Greek transliteration. The toolkit is based on pre-trained Transformers, it is freely available, and can be easily installed in Python (pip install gr-nlp-toolkit). It is also accessible through a demonstration platform on HuggingFace, along with a publicly available API for non-commercial use. We discuss the functionality provided for each task, the underlying methods, experiments against comparable open-source toolkits, and future possible enhancements. The toolkit is available at: https://github.com/nlpaueb/gr-nlp-toolkit
翻译:本文介绍GR-NLP-TOOLKIT,这是一个专门为现代希腊语开发的开源自然语言处理工具包。该工具包在五个核心自然语言处理任务上提供了最先进的性能,即词性标注、形态标注、依存句法分析、命名实体识别以及希腊语到希腊语的音译转换。该工具包基于预训练的Transformer模型,可免费获取,并能通过Python轻松安装(pip install gr-nlp-toolkit)。用户还可通过HuggingFace上的演示平台以及一个面向非商业用途的公开API进行访问。我们讨论了为每项任务提供的功能、所采用的基础方法、与同类开源工具包的对比实验,以及未来可能的改进方向。该工具包可通过以下链接获取:https://github.com/nlpaueb/gr-nlp-toolkit