检查点论文 - 专知

会员服务 ·

检查点

Scrutinizing Variables for Checkpoint Using Automatic Differentiation

Arxiv

0+阅读 · 2月17日

Stop Testing Attacks, Start Diagnosing Defenses: The Four-Checkpoint Framework Reveals Where LLM Safety Breaks

Arxiv

0+阅读 · 2月10日

The SJTU X-LANCE Lab System for MSR Challenge 2025

Arxiv

0+阅读 · 2月4日

Constraint-Rectified Training for Efficient Chain-of-Thought

Arxiv

0+阅读 · 2月13日

Model soups need only one ingredient

Arxiv

0+阅读 · 2月10日

Architectural Foundations for Checkpointing and Restoration in Quantum HPC Systems

Arxiv

0+阅读 · 2月10日

Correct Reasoning Paths Visit Shared Decision Pivots

Arxiv

0+阅读 · 2月7日

$C$-$ΔΘ$: Circuit-Restricted Weight Arithmetic for Selective Refusal

Arxiv

0+阅读 · 2月4日

Position: Explaining Behavioral Shifts in Large Language Models Requires a Comparative Approach

Arxiv

0+阅读 · 2月2日

Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations

Arxiv

0+阅读 · 2月3日

Preferences for Idiomatic Language are Acquired Slowly -- and Forgotten Quickly: A Case Study on Swedish

Arxiv

0+阅读 · 2月3日

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Arxiv

0+阅读 · 2月3日

Discovering Hidden Gems in Model Repositories

Arxiv

0+阅读 · 1月29日

LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR

Arxiv

0+阅读 · 1月20日

No Validation, No Problem: Predicting Model Performance from a Single Gradient

Arxiv

0+阅读 · 1月23日

参考链接

微信扫码咨询专知VIP会员