Matching job descriptions (JDs) with suitable talent requires models capable of understanding not only textual similarities between JDs and candidate resumes but also contextual factors such as geographical location and academic seniority. To address this challenge, we propose a two-stage training framework for large language models (LLMs). In the first stage, a contrastive learning approach is used to train the model on a dataset constructed from real-world matching rules, such as geographical alignment and research area overlap. While effective, this model primarily learns the patterns defined by the matching rules. In the second stage, we introduce a novel preference-based fine-tuning method inspired by Direct Preference Optimization (DPO), termed Rank Preference Optimization (RankPO), to align the model with AI-curated pairwise preferences emphasizing textual understanding. Our experiments show that while the first-stage model achieves strong performance on rule-based data (nDCG@20 = 0.706), it lacks robust textual understanding (alignment with AI annotations = 0.46). By fine-tuning with RankPO, we achieve a balanced model that retains relatively good performance on the original task while significantly improving alignment with AI preferences. The code and data are available at https://github.com/yflyzhang/RankPO.
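To make the second stage concrete, the following is a minimal, hypothetical sketch of a DPO-style pairwise preference loss applied to ranker scores. The exact RankPO formulation is not given in this abstract and may differ; the function name, the use of raw similarity scores in place of log-probabilities, and the `beta` temperature are illustrative assumptions only.

```python
import math

def dpo_style_pairwise_loss(score_chosen: float, score_rejected: float,
                            ref_chosen: float, ref_rejected: float,
                            beta: float = 0.1) -> float:
    """Illustrative DPO-style preference loss (not the exact RankPO objective).

    Encourages the fine-tuned model to score the AI-preferred (chosen)
    candidate above the rejected one, measured relative to a frozen
    reference model so the policy does not drift too far from stage one.
    """
    # Implicit reward margin: how much more the policy prefers the chosen
    # candidate than the reference model does, minus the same for the rejected.
    margin = beta * ((score_chosen - ref_chosen) - (score_rejected - ref_rejected))
    # Negative log-sigmoid of the margin: small when the preference is satisfied.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

As in DPO, the loss vanishes as the margin grows and equals log 2 when the policy and reference agree, so gradient pressure concentrates on pairs where the model still ranks candidates contrary to the AI-curated preference.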