This research paper addresses the limitations of current mobile accessibility services like TalkBack, which provide manual gesture-based sequential feedback to BVI users. Motivated by the promise of large language models (LLMs), this paper introduces Insight, an Android accessibility service that provides natural language interaction and real-time summarization of the screen. The paper performs a within-subject experimental study with users to compare Insight and TalkBack on usability factors. Results show Insight reduced mental effort and task time, and was preferred because of its dialogue interface, but users felt the need for interruption management. Results show LLM-based interfaces can significantly improve mobile accessibility, and describe the potential of hybrid solutions combining gesture and dialogue modalities towards more inclusive design.
翻译:本学术论文针对当前移动无障碍服务(如TalkBack)的局限性——即仅能为盲人与视障用户提供基于手势的序列性反馈。受大语言模型发展潜力的启发,本文提出Insight这一Android无障碍服务,该服务支持自然语言交互并实时生成屏幕内容摘要。论文采用被试内实验设计,围绕可用性因素对Insight与TalkBack进行了用户对比研究。实验结果表明:Insight显著降低了用户的心智负担与任务完成时间,其对话式交互界面更受用户青睐,但用户认为需要引入中断管理机制。研究证实基于大语言模型的界面能显著改善移动无障碍体验,并揭示了融合手势与对话模态的混合解决方案在推动包容性设计方面的潜力。