Throughput-Optimal Multiresource-Job Scheduling with Continuous Requirement Distribution

Modern computing systems process jobs that require multiple resources, such as CPU and memory; such systems are described by multiresource-job (MRJ) queueing models. In practice, job resource requirements are spread over so many values that it is rare to see the same value twice. This pattern is best modeled by a continuous distribution of requirement values. However, existing theoretical work on stability or throughput-optimality focuses on queueing models with class-based resource requirements. Class-based models demonstrate strong empirical performance only when the number of distinct resource requirements is small, which makes them a poor match for these practical systems. We introduce the first throughput-optimal family of scheduling policies for the continuous MRJ model, with both preemptive and nonpreemptive variants. Under some distributional assumptions, we further introduce several policy families that remain throughput-optimal while considerably improving computational efficiency. We use a discretization approach in which the discretization granularity is chosen based on the system load and the distribution of resource requirements. We validate the real-world applicability of our policies by comparing them against existing index-based policies on parametrized distributions and on datacenter trace data from the Google Borg scheduler, demonstrating state-of-the-art performance.

