This work introduces a novel method for binaural reproduction from arbitrary microphone arrays, based on array-aware optimization of Ambisonics encoding through Head-Related Transfer Function (HRTF) pre-processing. The proposed approach integrates array-specific information into the HRTF processing pipeline, leading to improved spatial accuracy in binaural rendering. Objective evaluations demonstrate superior performance under simulated wearable-array and head rotations compared to conventional Ambisonics encoding method. A listening experiment further confirms that the method achieves significantly higher perceptual ratings in both timbre and spatial quality. Fully compatible with standard Ambisonics, the proposed method offers a practical solution for spatial audio rendering in applications such as virtual reality, augmented reality, and wearable audio capture.
翻译:本研究提出了一种基于头部相关传输函数预处理、通过阵列感知优化Ambisonics编码的新方法,用于从任意麦克风阵列实现双耳重放。该方法将阵列特定信息整合到HRTF处理流程中,从而提升了双耳渲染的空间精度。客观评估表明,在模拟可穿戴阵列及头部旋转条件下,该方法较传统Ambisonics编码方法具有更优性能。听觉实验进一步证实,该方法在音色与空间质量方面均获得显著更高的感知评分。本方法与标准Ambisonics完全兼容,为虚拟现实、增强现实及可穿戴音频采集等应用中的空间音频渲染提供了实用解决方案。