SepsisAI Orchestrator: A Containerized and Scalable Platform for Deploying AI Models and Real-Time Monitoring in Early Sepsis Detection

Despite strong predictive results in the clinical machine learning literature, the translation of these models into bedside use remains limited by systems-level barriers: heterogeneous data representations, the absence of standardized deployment workflows, and a mismatch between research prototypes and the concurrency and latency requirements of hospital environments. We present the SepsisAI-Orchestrator, an open-source modular platform that addresses this deployment gap for early sepsis detection. The platform integrates HL7 FHIR-inspired Clinical Document Architecture (CDA) preprocessing, NoSQL storage, a containerized LightGBM classifier served via REST APIs, and a Streamlit clinical dashboard, orchestrated with Docker and Kubernetes. A previously validated LightGBM model (F1 0.87-0.94 on PhysioNet 2019) is reused without modification; the contribution lies in the surrounding infrastructure and its empirical characterization under load. Using k6 with 50-1000 concurrent virtual users, we find that replica count must be matched to the physical CPU thread count of the host: scaling from 3 to 12 replicas on a 12-thread CPU reduces p95 latency from 3.3s to 1.41s (57.3% reduction) and eliminates all request failures, while over-provisioning to 24 or 48 replicas degrades performance due to scheduler contention. To our knowledge this U-shaped scaling behavior has not been quantified previously for clinical AI inference workloads. We do not claim prospective clinical validation. Source code and deployment manifests are available at https://github.com/nucleusai/sepsisai-orchestrator.

翻译：尽管临床机器学习文献已展现出卓越的预测性能，但将此类模型转化为临床床旁应用仍面临系统层面的多重障碍：异构数据表征、标准化部署工作流的缺失，以及研究原型与医院环境并发性及延迟要求之间的不匹配。本文提出开源模块化平台SepsisAI-Orchestrator，旨在填补早期脓毒症检测的部署鸿沟。该平台整合了基于HL7 FHIR（健康信息交换第七层框架的临床文档架构）的预处理模块、NoSQL存储、通过REST API服务的容器化LightGBM分类器、以及基于Streamlit的临床仪表盘，并通过Docker与Kubernetes进行编排。研究中直接复用了先前经PhysioNet 2019验证的LightGBM模型（F1值0.87-0.94），核心贡献在于构建配套基础架构并开展负载下的实证特征分析。通过使用k6模拟50-1000个并发虚拟用户的测试，本研究发现：副本数量必须与宿主机物理CPU线程数匹配——在12线程CPU上将副本数从3扩展至12，可使p95延迟从3.3秒降至1.41秒（降幅57.3%），并消除全部请求失败；而当过度配置至24或48副本时，调度争用导致性能退化。据我们所知，这种U型缩放行为在临床AI推理工作负载中此前尚未被量化。本平台不构成前瞻性临床验证。源代码及部署清单请参见https://github.com/nucleusai/sepsisai-orchestrator。

相关内容

关注 7110

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

【ICML spotlight 2026】HELIX：通过可学习特征身份嵌入实现时间序列插补的混合编码框架

专知会员服务

8+阅读 · 5月6日

PaperOrchestra：一种面向自动化 AI 学术论文撰写的多智能体框架

专知会员服务

13+阅读 · 4月9日

【博士论文】数据驱动决策：通过数据集成与预测性决策支持优化重症监护

专知会员服务

20+阅读 · 2月10日

超越生成式人工智能：用于临床预测、反事实推断与规划的世界模型

专知会员服务

22+阅读 · 2025年11月23日