Beyond-Voice: Towards Continuous 3D Hand Pose Tracking on Commercial Home Assistant Devices

Increasingly popular home assistants are widely utilized as the central controller for smart home devices. However, current designs heavily rely on voice interfaces with accessibility and usability issues; some latest ones are equipped with additional cameras and displays, which are costly and raise privacy concerns. These concerns jointly motivate Beyond-Voice, a novel deep-learning-driven acoustic sensing system that allows commodity home assistant devices to track and reconstruct hand poses continuously. It transforms the home assistant into an active sonar system using its existing onboard microphones and speakers. We feed a high-resolution range profile to the deep learning model that can analyze the motions of multiple body parts and predict the 3D positions of 21 finger joints, bringing the granularity for acoustic hand tracking to the next level. It operates across different environments and users without the need for personalized training data. A user study with 11 participants in 3 different environments shows that Beyond-Voice can track joints with an average mean absolute error of 16.47mm without any training data provided by the testing subject.

翻译：日益普及的家居助手被广泛用作智能家居设备的中央控制器。然而，当前的设计严重依赖语音接口，存在可访问性和可用性问题；部分最新产品配备了额外的摄像头和显示屏，这不仅成本高昂，还引发了隐私担忧。这些问题共同推动了Beyond-Voice这一新型深度学习驱动声学感知系统的诞生，该系统使商用家居助手设备能够持续追踪和重建手部姿态。它利用设备现有的内置麦克风和扬声器，将家居助手转变为主动声呐系统。我们将高分辨率距离剖面输入深度学习模型，该模型能够分析多个身体部位的运动，并预测21个手指关节的三维位置，从而将声学手部追踪的精细度提升至新水平。该系统无需个性化训练数据，即可在不同环境和用户间运行。针对3种不同环境中11名参与者的用户研究表明，Beyond-Voice能够在测试对象未提供任何训练数据的情况下，以平均绝对误差16.47毫米追踪关节。

相关内容

Continuity

关注 4

让 iOS 8 和 OS X Yosemite 无缝切换的一个新特性。 > Apple products have always been designed to work together beautifully. But now they may really surprise you. With iOS 8 and OS X Yosemite, you’ll be able to do more wonderful things than ever before.

Source: Apple - iOS 8

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日