The peculiar congestion patterns in data centers are caused by the bursty and composite nature of traffic, the small bandwidth-delay product, and the tiny switch buffers. It is not practical to modify TCP to adapt to data centers, especially in public clouds where multiple congestion control protocols coexist. In this work, we design a switch-based method to address such congestion issues; our approach does not require any modification to TCP, which enables easy and seamless deployment in public data centers via switch software update. We first present a simple analysis to demonstrate the stability and effectiveness of the scheme, and then we discuss a hardware NetFPGA switch-based prototype. The experimental results from real deployments in a small testbed cluster show the effectiveness of our approach.
翻译:摘要:数据中心中特殊的拥塞模式是由流量的突发性和复合性、小的带宽-时延积以及微小的交换机缓冲区共同造成的。修改TCP以适应数据中心并不切实际,尤其是在多个拥塞控制协议共存的公共云环境中。本研究设计了一种基于交换机的方案来解决此类拥塞问题;该方法无需对TCP进行任何修改,从而可通过交换机软件更新在公共数据中心中实现简单且无缝的部署。我们首先通过简要分析论证了该方案的稳定性与有效性,随后讨论了基于硬件NetFPGA交换机的原型实现。在小型测试平台集群中的实际部署实验结果表明了该方法的有效性。