Simultaneous Speech-to-Text translation serves a critical role in real-time crosslingual communication. Despite the advancements in recent years, challenges remain in achieving stability in the translation process, a concern primarily manifested in the flickering of partial results. In this paper, we propose a novel revision-controllable method designed to address this issue. Our method introduces an allowed revision window within the beam search pruning process to screen out candidate translations likely to cause extensive revisions, leading to a substantial reduction in flickering and, crucially, providing the capability to completely eliminate flickering. The experiments demonstrate the proposed method can significantly improve the decoding stability without compromising substantially on the translation quality.
翻译:同声语音翻译在实时跨语言交流中发挥着关键作用。尽管近年来取得了进展,但在翻译过程中实现稳定性仍面临挑战,这一问题主要体现为部分结果的闪烁现象。本文提出了一种新颖的可控制修订方法以解决该问题。该方法在束搜索剪枝过程中引入允许修订窗口,用于筛除可能引发大规模修订的候选翻译,从而显著减少闪烁现象,并且至关重要的是,具备彻底消除闪烁的能力。实验表明,所提方法能够在不过度损害翻译质量的前提下,显著提升解码稳定性。