Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases

Compared with invasive examinations that require tissue sampling, respiratory sound testing is a non-invasive examination method that is safer and easier for patients to accept. In this study, we introduce Rene, a pioneering large-scale model tailored for respiratory sound recognition. Rene has been rigorously fine-tuned on an extensive dataset covering a broad array of respiratory audio samples, targeting disease detection, sound pattern classification, and event identification. Our approach applies a pre-trained speech recognition model to respiratory sounds and augments it with patient medical records. The resulting multi-modal deep-learning framework addresses the interpretability and real-time diagnostic challenges that have hindered previous respiratory-focused models. Benchmark comparisons show that Rene significantly outperforms existing models, achieving improvements of 10.27%, 16.15%, 15.29%, and 18.90% in respiratory event detection and audio classification on the SPRSound database. Disease prediction accuracy on the ICBHI database improved by 23% over the baseline in both mean average and harmonic scores. Moreover, we have developed a real-time respiratory sound discrimination system based on the Rene architecture. Employing state-of-the-art Edge AI technology, this system enables rapid and accurate responses for respiratory sound auscultation (https://github.com/zpforlove/Rene).
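The abstract reports ICBHI results as "mean average and harmonic scores" without defining them. The sketch below assumes these are the scores conventionally used with the ICBHI benchmark: the arithmetic mean and the harmonic mean of sensitivity and specificity. The function name `icbhi_scores`, the `normal_label` parameter, and the toy labels are hypothetical and are not taken from the Rene repository.

```python
def icbhi_scores(y_true, y_pred, normal_label=0):
    """Return (sensitivity, specificity, average score, harmonic score).

    Assumed convention: specificity is the fraction of normal samples
    predicted as normal; sensitivity is the fraction of abnormal samples
    predicted with their exact abnormal class.
    """
    pairs = list(zip(y_true, y_pred))
    normal = [(t, p) for t, p in pairs if t == normal_label]
    abnormal = [(t, p) for t, p in pairs if t != normal_label]
    if not normal or not abnormal:
        raise ValueError("need at least one normal and one abnormal sample")

    specificity = sum(t == p for t, p in normal) / len(normal)
    sensitivity = sum(t == p for t, p in abnormal) / len(abnormal)

    # Arithmetic mean of the two rates.
    average = (sensitivity + specificity) / 2
    # Harmonic mean; defined as 0 when both rates are 0.
    total = sensitivity + specificity
    harmonic = 2 * sensitivity * specificity / total if total else 0.0
    return sensitivity, specificity, average, harmonic


# Toy labels for illustration only: 0 = normal, 1 and 2 = abnormal classes.
y_true = [0, 0, 0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 1, 1, 0, 2, 0]
se, sp, avg, hs = icbhi_scores(y_true, y_pred)
print(f"Se={se:.3f} Sp={sp:.3f} AS={avg:.3f} HS={hs:.3f}")
```

The harmonic score penalizes an imbalance between sensitivity and specificity more strongly than the average does, which is why the two are usually reported together.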

