StreamBed is a capacity planning system for stream processing. It predicts, ahead of any production deployment, the resources that a query will require to process an incoming data rate sustainably, and the appropriate configuration of these resources. StreamBed builds a capacity planning model by piloting a series of runs of the target query in a small-scale, controlled testbed. We implement StreamBed for the popular Flink DSP engine. Our evaluation with large-scale queries of the Nexmark benchmark demonstrates that StreamBed can effectively and accurately predict capacity requirements for jobs spanning more than 1,000 cores using a testbed of only 48 cores.
翻译:StreamBed是一个面向流处理的容量规划系统。该系统能够在生产部署之前,预先预测查询在处理持续输入数据速率时所需的资源量,以及这些资源的合理配置方案。StreamBed通过在小型受控测试平台上执行一系列目标查询的试点运行,构建容量规划模型。我们针对流行的Flink DSP引擎实现了StreamBed。基于Nexmark基准测试的大规模查询评估表明,StreamBed仅需使用48个核心的测试平台,即可有效且精准地预测跨越1000余个核心的作业所需的容量需求。