A Reinforcement Learning-assisted Genetic Programming Algorithm for Team Formation Problem Considering Person-Job Matching

Programming · Surrogate models · Algorithms · Reinforcement learning · Planning models

April 8, 2023

Yangyang Guo, Hao Wang, Lei He, Witold Pedrycz, P. N. Suganthan, Yanjie Song

From arXiv, 16 pages

An efficient team is essential for a company to complete new projects successfully. To solve the team formation problem considering person-job matching (TFP-PJM), a 0-1 integer programming model is constructed that accounts for the effects of both person-job matching and team members' willingness to communicate on team efficiency, with the person-job matching score calculated using intuitionistic fuzzy numbers. A reinforcement learning-assisted genetic programming algorithm (RL-GP) is then proposed to improve solution quality. RL-GP adopts ensemble population strategies: before the population evolves at each generation, the agent selects one of four population search modes according to the information obtained, thereby balancing exploration and exploitation. In addition, surrogate models are used to evaluate the formation plans generated by individuals, which speeds up the algorithm's learning process. A series of comparison experiments is then conducted to verify the overall performance of RL-GP and the effectiveness of the improved strategies within the algorithm. The hyper-heuristic rules obtained through efficient learning can serve as decision-making aids when forming project teams. This study reveals the advantages of applying reinforcement learning methods, ensemble strategies, and surrogate models to the GP framework. The diversity and intelligent selection of search patterns, along with fast adaptation evaluation, are distinct features that enable RL-GP to be deployed in real-world enterprise environments.
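The per-generation mode selection described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the reinforcement learning method, the four search modes, the state information, or the reward, so everything below is an assumption made for illustration: a stateless epsilon-greedy value update stands in for the agent, the four modes are reduced to four mutation step sizes, individuals are plain floats rather than GP trees, and the reward is the improvement in best fitness. The names `ModeSelector`, `evolve`, and `run` are hypothetical and do not come from the paper.

```python
import random


class ModeSelector:
    """Epsilon-greedy agent that picks one of n_modes search modes per generation."""

    def __init__(self, n_modes=4, epsilon=0.1, alpha=0.5, seed=0):
        self.q = [0.0] * n_modes      # estimated value of each search mode
        self.epsilon = epsilon        # probability of trying a random mode
        self.alpha = alpha            # learning rate for the value update
        self.rng = random.Random(seed)

    def select(self):
        # Explore with probability epsilon, otherwise exploit the best-known mode.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.q))
        return max(range(len(self.q)), key=self.q.__getitem__)

    def update(self, mode, reward):
        # Move the estimate for the chosen mode toward the observed reward.
        self.q[mode] += self.alpha * (reward - self.q[mode])


def evolve(population, mode, rng):
    """Stand-in for one generation of evolution under the chosen mode.

    Assumption: each individual is a float and the modes differ only in
    mutation step size. The toy objective is to maximise -x**2.
    """
    step = (1.0, 0.5, 0.1, 0.01)[mode]
    children = [x + rng.uniform(-step, step) for x in population]
    # Keep whichever of parent and child has the higher fitness.
    return [c if -c * c > -p * p else p for p, c in zip(population, children)]


def run(generations=50, seed=0):
    rng = random.Random(seed)
    agent = ModeSelector(seed=seed)
    population = [rng.uniform(-5, 5) for _ in range(20)]
    best = max(-x * x for x in population)
    for _ in range(generations):
        mode = agent.select()                  # choose the search mode before evolving
        population = evolve(population, mode, rng)
        new_best = max(-x * x for x in population)
        agent.update(mode, new_best - best)    # reward: improvement in best fitness
        best = new_best
    return best, agent.q
```

Calling `run()` returns the best fitness found and the learned value of each mode. In a fuller version, a surrogate model would replace the direct fitness call inside `evolve` to make evaluation cheaper; that part is omitted here because the abstract gives no detail on the surrogate.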
