We introduce the first highly multilingual speech and American Sign Language (ASL) comprehension dataset by extending BELEBELE. Our dataset covers 74 spoken languages at the intersection of BELEBELE and FLEURS, and one sign language (ASL). We evaluate 2M-BELEBELE dataset for both 5-shot and zero-shot settings and across languages, the speech comprehension accuracy is ~ 8% average lower compared to reading comprehension.
翻译:我们通过扩展BELEBELE,引入了首个高度多语言语音与美国手语(ASL)理解数据集。我们的数据集涵盖了BELEBELE与FLEURS交集处的74种口语,以及一种手语(ASL)。我们在5样本和零样本设置下评估了2M-BELEBELE数据集,跨语言结果显示,语音理解准确率平均比阅读理解低约8%。