The prevalence of dementia has increased over time as global life expectancy improves and populations age. An individual's risk of developing dementia is influenced by various genetic, lifestyle, and environmental factors, among others. Predicting dementia risk may enable individuals to employ mitigation strategies or lifestyle changes to delay dementia onset. Current computational approaches to dementia prediction only return risk upon narrow categories of variables and do not account for interactions between different risk variables. The proposed framework utilizes a novel holistic approach to dementia risk prediction and is the first to incorporate various sources of tabular environmental pollution and lifestyle factor data with network systems biology-based genetic data. LightGBM gradient boosting was employed to ensure validity of included factors. This approach successfully models interactions between variables through an original weighted integration method coined Sysable. Multiple machine learning models trained the algorithm to reduce reliance on a single model. The developed approach surpassed all existing dementia risk prediction approaches, with a sensitivity of 85%, specificity of 99%, geometric accuracy of 92%, and AUROC of 91.7%. A transfer learning model was implemented as well. De-biasing algorithms were run on the model via the AI Fairness 360 Library. Effects of demographic disparities on dementia prevalence were analyzed to potentially highlight areas in need and promote equitable and accessible care. The resulting model was additionally integrated into a user-friendly app providing holistic predictions and personalized risk mitigation strategies. The developed model successfully employs holistic computational dementia risk prediction for clinical use.
翻译:随着全球预期寿命延长和人口老龄化,痴呆症患病率随时间持续上升。个体罹患痴呆症的风险受遗传、生活方式和环境等多重因素影响。预测痴呆风险可使人们采取干预策略或调整生活方式以延缓疾病发作。当前基于计算方法的痴呆预测仅针对狭窄变量类别返回风险值,且未考虑不同风险变量间的相互作用。本研究提出的框架采用新型整体性痴呆风险预测方法,首次将各种表格化的环境污染因素与生活方式数据,与基于网络系统生物学的遗传数据相整合。采用LightGBM梯度提升算法确保纳入因素的有效性。通过原创的加权整合方法(命名为Sysable)成功建模变量间的相互作用。多个机器学习模型协同训练算法以降低对单一模型的依赖。本方法超越现有所有痴呆风险预测方法,灵敏度达85%,特异性达99%,几何准确率达92%,AUROC达91.7%。同时实施了迁移学习模型。通过AI公平性360库对模型执行去偏算法,分析人口统计学差异对痴呆患病率的影响,以潜在揭示需重点关注的领域并促进公平可及的治疗。最终将模型整合至用户友好型应用程序中,提供整体预测与个性化风险缓解策略。本模型成功实现了面向临床使用的整体计算性痴呆风险预测。