Enabling gaze interaction in real-time on handheld mobile devices has attracted significant attention in recent years. An increasing number of research projects have focused on sophisticated appearance-based deep learning models to enhance the precision of gaze estimation on smartphones. This inspires important research questions, including how the gaze can be used in a real-time application, and what type of gaze interaction methods are preferable under dynamic conditions in terms of both user acceptance and delivering reliable performance. To address these questions, we design four types of gaze scrolling techniques: three explicit technique based on Gaze Gesture, Dwell time, and Pursuit; and one implicit technique based on reading speed to support touch-free, page-scrolling on a reading application. We conduct a 20-participant user study under both sitting and walking settings and our results reveal that Gaze Gesture and Dwell time-based interfaces are more robust while walking and Gaze Gesture has achieved consistently good scores on usability while not causing high cognitive workload.
翻译:在手持移动设备上实现实时的注视交互近年来引起了广泛关注。越来越多的研究项目聚焦于基于外观的复杂深度学习模型,以提升智能手机视线估计的精度。这激发了重要的研究问题,包括如何将视线应用于实时应用,以及在动态条件下,从用户接受度和提供可靠性能两方面来看,何种注视交互方法更受青睐。为解决这些问题,我们设计了四种类型的视线滚动技术:三种基于显式方法的技术,包括视线手势、驻留时间和追踪;一种基于阅读速度的隐式技术,用于在阅读应用中实现免触控的页面滚动。我们在坐姿和行走两种环境下开展了20名参与者的用户研究,结果显示,基于视线手势和驻留时间的界面在行走时具有更强的鲁棒性,且视线手势在可用性方面持续获得较高评分,同时未引发较高的认知负荷。