We present Charles Translator, a machine translation system between Ukrainian and Czech, developed as part of a society-wide effort to mitigate the impact of the Russian-Ukrainian war on individuals and society. The system was developed in the spring of 2022 with the help of many language data providers in order to quickly meet the demand for such a service, which was not available at the time in the required quality. The translator was later implemented as an online web interface and as an Android app with speech input, both featuring Cyrillic-Latin script transliteration. The system translates directly, compared to other available systems that use English as a pivot, and thus take advantage of the typological similarity of the two languages. It uses the block back-translation method, which allows for efficient use of monolingual training data. The paper describes the development process, including data collection and implementation, evaluation, mentions several use cases, and outlines possibilities for the further development of the system for educational purposes.
翻译:我们提出 Charles 翻译器,这是一个乌克兰语与捷克语之间的机器翻译系统,该系统是全社会共同努力以减轻俄乌战争对个人和社会影响的一部分。该系统于 2022 年春季开发,得益于众多语言数据提供商的帮助,以快速满足当时在所需质量下尚不可用的此类服务需求。该翻译器随后以在线网页界面和配备语音输入的 Android 应用程序形式实现,两者均支持西里尔-拉丁字母转写。与其他利用英语作为中介的现有系统相比,该系统直接进行翻译,从而利用这两种语言在类型学上的相似性。它采用块回译方法,能够高效利用单语训练数据。本文描述了开发过程,包括数据收集与实现、评估,提及了若干使用案例,并概述了该系统在教育目的上进一步发展的可能性。