大型推理模型论文 - 专知

会员服务 ·

大型推理模型

大型推理模型

Adaptive and Explicit safe: Triggering Latent Safety Awareness in Large Reasoning Models

Arxiv

0+阅读 · 6月15日

Stop When Further Reasoning Won't Help: Attention-State Adaptive Generation in Reasoning Models

Arxiv

0+阅读 · 6月13日

Entropy-Gradient Inversion: Moving Toward Internal Mechanism of Large Reasoning Models

Arxiv

0+阅读 · 6月11日

Quantifying Faithful Confidence Expression in Large Reasoning Models

Arxiv

0+阅读 · 6月2日

Reasoning Models Will Sometimes Lie About Their Reasoning

Arxiv

0+阅读 · 4月21日

AutoRAN: Automated Hijacking of Safety Reasoning in Large Reasoning Models

Arxiv

0+阅读 · 4月16日

Understanding Performance Gap Between Parallel and Sequential Sampling in Large Reasoning Models

Arxiv

0+阅读 · 4月7日

When to Retrieve During Reasoning: Adaptive Retrieval for Large Reasoning Models

Arxiv

0+阅读 · 4月29日

Towards Safe Reasoning in Large Reasoning Models via Corrective Intervention

Arxiv

0+阅读 · 2月28日

ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection

Arxiv

0+阅读 · 2月25日

Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models

Arxiv

0+阅读 · 3月11日

Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models

Arxiv

0+阅读 · 3月3日

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models

Arxiv

0+阅读 · 2月20日

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models

Arxiv

0+阅读 · 2月23日

REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Reasoning

Arxiv

0+阅读 · 2月27日

参考链接

微信扫码咨询专知VIP会员