Orientation and mobility (O&M) instruction for blind and low-vision learners is effective but difficult to standardize and repeat at scale due to the reliance on instructor availability, physical mock-ups, and variable real-world outdoor conditions. This Technical Note presents a sound-first immersive training flow that uses spatial audio and sonification as the primary channel for action and feedback in pre-street O&M and daily-living practice. The approach specifies parameterized scenario templates (e.g., signalized street crossing, public transport boarding, and kitchen tasks), a compact and consistent cue vocabulary with clear spectral placement and timing to mitigate masking, and a lightweight safety protocol enabling graded exposure, content warnings, seated starts, opt-outs, and structured debriefs. The system assumes a head-mounted device with high-quality binaural rendering and head tracking; 3D scene geometry is used as an invisible scaffold to anchor sources, trigger events, define risk/guidance volumes, and govern physically plausible motion without visuals. Session difficulty is shaped via cue density, event tempo, and task complexity while preserving cue consistency to promote transfer across scenarios. The specification aims to enable safe repetition, reduce instructor burden, and support clearer standards across rehabilitation centers, aligning with evidence that audio-first interaction is essential for blind and visually impaired users and addressing gaps in HRTF personalization, evaluation standards, and accessibility integration. Although no behavioral outcomes are reported here, this implementable flow consolidates auditory science with center-ready design, offering a pragmatic foundation for standardized evaluation and future comparative studies.
翻译:针对盲人与低视力学习者的定向行走(O&M)教学虽具成效,但由于依赖指导员的可用性、实体模拟环境以及多变的真实户外条件,难以实现标准化和大规模重复训练。本技术报告提出一种以声音为先的沉浸式训练流程,将空间音频与可听化作为街道预行走训练及日常生活技能练习中动作与反馈的主要通道。该方法明确了参数化场景模板(如信号灯控制路口通行、公共交通搭乘、厨房任务等),构建了一套紧凑且一致的提示音词汇库,通过清晰的频谱布局与时序设计以降低掩蔽效应,并制定了轻量级安全协议,支持分级暴露、内容预警、坐姿启动、自主退出及结构化任务复盘。该系统假设采用配备高质量双耳渲染与头部追踪的头戴设备;三维场景几何被用作不可见的支架,用于锚定声源、触发事件、定义风险/引导区域,并在无视觉信息的情况下控制符合物理规律的运动。训练难度通过提示音密度、事件节奏和任务复杂度进行调节,同时保持提示音的一致性以促进不同场景间的技能迁移。该规范旨在实现安全重复训练、减轻指导员负担,并支持康复中心间更清晰的标准统一,既符合“听觉优先交互对盲人与视障用户至关重要”的实证依据,也针对头相关传输函数个性化、评估标准及无障碍功能整合等方面的现有不足作出回应。尽管本文未报告行为学结果,但这一可实施的流程将听觉科学与康复中心适用设计相结合,为标准化评估及未来对比研究提供了实用基础。