Simultaneous speech translation (SimulST) is a demanding task that involves generating translations in real-time while continuously processing speech input. This paper offers a comprehensive overview of the recent developments in SimulST research, focusing on four major challenges. Firstly, the complexities associated with processing lengthy and continuous speech streams pose significant hurdles. Secondly, satisfying real-time requirements presents inherent difficulties due to the need for immediate translation output. Thirdly, striking a balance between translation quality and latency constraints remains a critical challenge. Finally, the scarcity of annotated data adds another layer of complexity to the task. Through our exploration of these challenges and the proposed solutions, we aim to provide valuable insights into the current landscape of SimulST research and suggest promising directions for future exploration.
翻译:同步语音翻译(SimulST)是一项要求苛刻的任务,需要在持续处理语音输入的同时实时生成译文。本文全面综述了SimulST研究的最新进展,重点关注四大挑战。首先,处理长时连续语音流带来的复杂性构成了显著障碍。其次,由于需要即时输出翻译结果,满足实时性要求存在固有困难。第三,在翻译质量与延迟约束之间取得平衡仍是关键挑战。最后,标注数据的稀缺性进一步增加了任务的复杂性。通过对这些挑战及现有解决方案的探讨,我们旨在为当前SimulST研究现状提供有价值的见解,并为未来探索指明有前景的方向。