大语言模型对齐论文 - 专知

会员服务 ·

大语言模型对齐

大语言模型对齐

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Arxiv

0+阅读 · 5月11日

Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph

Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph

Arxiv

0+阅读 · 3月16日

f-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Arxiv

0+阅读 · 2月9日

Less is More: Improving LLM Alignment via Preference Data Selection

Arxiv

0+阅读 · 2月15日

$f$-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

$f$-GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment

Arxiv

0+阅读 · 2月5日

OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment

Arxiv

0+阅读 · 2月3日

One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment

Arxiv

0+阅读 · 1月26日

A Survey of LLM Alignment: Instruction Understanding, Intention Reasoning, and Reliable Generation

Arxiv

0+阅读 · 1月29日

Control Barrier Function for Aligning Large Language Models

Arxiv

0+阅读 · 2025年11月6日

参考链接

微信扫码咨询专知VIP会员