StreamBed is a capacity planning system for stream processing. It predicts, ahead of any production deployment, the resources that a query will require to process an incoming data rate sustainably, and the appropriate configuration of these resources. StreamBed builds a capacity planning model by piloting a series of runs of the target query in a small-scale, controlled testbed. We implement StreamBed for the popular Flink DSP engine. Our evaluation with large-scale queries of the Nexmark benchmark demonstrates that StreamBed can effectively and accurately predict capacity requirements for jobs spanning more than 1,000 cores using a testbed of only 48 cores.
翻译:StreamBed是一个面向流处理的容量规划系统。它能在任何生产部署之前,预测查询为实现可持续处理传入数据速率所需的资源量及适当的资源配置方案。StreamBed通过在小型受控测试平台中运行目标查询的一系列试点任务,构建容量规划模型。我们针对流行的Flink DSP引擎实现了StreamBed系统。基于Nexmark基准测试中大规模查询的评估表明,StreamBed能够仅使用48核的测试平台,有效且准确地预测涵盖1000核以上作业的容量需求。