To achieve successful deployment of AI research, it is crucial to understand the demands of the industry. In this paper, we present the results of a survey conducted with professional audio engineers, in order to determine research priorities and define various research tasks. We also summarize the current challenges in audio quality and controllability based on the survey. Our analysis emphasizes that the availability of datasets is currently the main bottleneck for achieving high-quality audio generation. Finally, we suggest potential solutions for some revealed issues with empirical evidence.
翻译:为了成功部署人工智能研究,理解行业需求至关重要。本文通过一项针对专业音频工程师的调查,确定了研究优先级并界定了各类研究任务。基于调查结果,我们总结了当前音频质量与可控性方面的挑战。分析表明,数据集的可用性是当前实现高质量音频生成的主要瓶颈。最后,我们针对所揭示的问题,结合实证证据提出了潜在的解决方案。