The Inspirational and Convincing Audio Generation Challenge 2024 (ICAGC 2024) is part of the ISCSLP 2024 Competitions and Challenges track. While current text-to-speech (TTS) technology can generate high-quality audio, its ability to convey complex emotions and controlled detail content remains limited. This constraint leads to a discrepancy between the generated audio and human subjective perception in practical applications like companion robots for children and marketing bots. The core issue lies in the inconsistency between high-quality audio generation and the ultimate human subjective experience. Therefore, this challenge aims to enhance the persuasiveness and acceptability of synthesized audio, focusing on human alignment convincing and inspirational audio generation. A total of 19 teams have registered for the challenge, and the results of the competition and the competition are described in this paper.
翻译:激励性与说服性音频生成挑战赛 2024(ICAGC 2024)是 ISCSLP 2024 竞赛与挑战赛道的一部分。尽管当前文本转语音(TTS)技术能够生成高质量音频,但其在传达复杂情感和控制细节内容方面的能力仍然有限。这一限制导致在儿童陪伴机器人、营销机器人等实际应用中,生成的音频与人类主观感知之间存在差异。核心问题在于高质量音频生成与最终人类主观体验之间的不一致性。因此,本挑战赛旨在提升合成音频的说服力与可接受度,重点关注符合人类感知、具有说服力与激励性的音频生成。共有 19 支队伍报名参赛,本文描述了竞赛结果与赛事情况。