We apply active learning to help with data scarcity problems in sign languages. In particular, we perform a novel analysis of the effect of pre-training. Since many sign languages are linguistic descendants of French sign language, they share hand configurations, which pre-training can hopefully exploit. We test this hypothesis on American, Chinese, German, and Irish fingerspelling corpora. We do observe a benefit from pre-training, but this may be due to visual rather than linguistic similarities
翻译:我们将主动学习应用于手语数据稀缺问题的缓解。具体而言,我们对预训练的效果进行了创新性分析。由于许多手语在语言学上是法语手语的衍生语种,它们共享手形构型,而预训练有望利用这一特性。我们在美国、中国、德国和爱尔兰的指拼语料库上验证了这一假设。尽管我们确实观察到预训练带来的益处,但这种优势可能源于视觉相似性而非语言学相似性。