The grammatical gender of Swedish nouns is a mystery. While there are few rules that can indicate the gender with some certainty, it does in general not depend on either meaning or the structure of the word. In this paper we demonstrate the surprising fact that grammatical gender for Swedish nouns can be predicted with high accuracy using a recurrent neural network (RNN) working on the raw character sequence of the word, without using any contextual information.
翻译:瑞典语名词的语法性别是一个未解之谜。尽管存在少数规则可在一定程度上指示性别,但通常既与词义无关,也与词形结构无必然联系。本文揭示了一个令人惊讶的事实:通过使用循环神经网络(RNN)处理单词的原始字符序列,无需任何上下文信息即可高精度预测瑞典语名词的语法性别。