Large Language Models (LLMs) have shown remarkable capabilities across various domains, but their potential for solving combinatorial optimization problems remains largely unexplored. In this paper, we investigate the applicability of LLMs to the Job Shop Scheduling Problem (JSSP), a classic challenge in combinatorial optimization that requires efficiently allocating jobs to machines to minimize makespan. To this end, we introduce Starjob, the first supervised dataset for JSSP, comprising 130k instances specifically designed for training LLMs. Leveraging this dataset, we fine-tune a 4-bit quantized LLaMA 8B model with the LoRA method to develop an end-to-end scheduling approach. Our evaluation on standard benchmarks demonstrates that the proposed LLM-based method not only surpasses traditional Priority Dispatching Rules (PDRs) but also achieves notable improvements over state-of-the-art neural approaches such as L2D, with an average improvement of 15.36% on the DMU benchmarks and 7.85% on the Taillard benchmarks. These results highlight the untapped potential of LLMs for tackling combinatorial optimization problems, paving the way for future advancements in this area.