This research paper describes a realtime system for identifying American Sign Language (ASL) movements that employs modern computer vision and machine learning approaches. The suggested method makes use of the Mediapipe library for feature extraction and a Convolutional Neural Network (CNN) for ASL gesture classification. The testing results show that the suggested system can detect all ASL alphabets with an accuracy of 99.95%, indicating its potential for use in communication devices for people with hearing impairments. The proposed approach can also be applied to additional sign languages with similar hand motions, potentially increasing the quality of life for people with hearing loss. Overall, the study demonstrates the effectiveness of using Mediapipe and CNN for real-time sign language recognition, making a significant contribution to the field of computer vision and machine learning.
翻译:本研究论文描述了一种利用现代计算机视觉与机器学习方法识别美国手语(ASL)动作的实时系统。所提出的方法采用Mediapipe库进行特征提取,并利用卷积神经网络(CNN)进行ASL手势分类。测试结果表明,该系统能够以99.95%的准确率识别全部ASL字母,彰显了其在听力障碍人士通信设备中的应用潜力。此外,该方案也可扩展至其他具有相似手部动作的手语系统,有望提升听力损失人群的生活质量。总体而言,本研究验证了Mediapipe与CNN在实时手语识别中的有效性,为计算机视觉与机器学习领域做出了重要贡献。