We introduce the \`{I}r\`{o}y\`{i}nSpeech corpus, a new dataset motivated by the desire to increase the amount of high-quality, freely available, contemporary Yor\`{u}b\'{a} speech data. We release a multi-purpose dataset that can be used for both text-to-speech (TTS) and automatic speech recognition (ASR) tasks. We curated text sentences from the news and creative writing domains under an open license (CC-BY-4.0) and had multiple speakers record each sentence. We provide 5000 of our utterances to the Common Voice platform to crowdsource transcriptions online. The dataset contains 38.5 hours of data in total, recorded by 80 volunteers.