困惑度论文 - 专知

会员服务 ·

困惑度

Greedy Coordinate Diffusion: Effective and Semantically Coherent Adversarial Attacks via Diffusion Guidance

Arxiv

0+阅读 · 6月16日

Tying the Loop -- Tied Expert Layers in Mixture-of-Experts Language Models

Arxiv

0+阅读 · 6月15日

Beyond Perplexity: UTF-8 Validity in Byte-aware Language Models

Arxiv

0+阅读 · 6月12日

BioMamba: Domain-Adaptive Biomedical Language Models

Arxiv

0+阅读 · 6月10日

Influence-Inspired Spectral Rotations for Extreme Low-Bit LLM Quantization

Arxiv

0+阅读 · 5月24日

A Quantitative Experimental Repeated Measures Study of Training Dynamics in a Small Llama Style Language Model Under a Compute-Aware Token Budget

Arxiv

0+阅读 · 6月11日

LLMs Can Better Capture Human Judgments--With the Right Prompts

Arxiv

0+阅读 · 6月10日

HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench

Arxiv

0+阅读 · 5月28日

Only relative ranks matter in weight-clustered large language models

Arxiv

0+阅读 · 3月18日

HubRouter: A Pluggable Sub-Quadratic Routing Primitive for Hybrid Sequence Models

Arxiv

0+阅读 · 4月24日

TuneShift-KD: Knowledge Distillation and Transfer for Fine-tuned Models

Arxiv

0+阅读 · 3月25日

PolyKV: A Shared Asymmetrically-Compressed KV Cache Pool for Multi-Agent LLM Inference

Arxiv

0+阅读 · 4月27日

RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference

Arxiv

0+阅读 · 3月18日

Aligning Dense Retrievers with LLM Utility via DistillationAligning Dense Retrievers with LLM Utility via Distillation

Arxiv

0+阅读 · 4月24日

Luminol-AIDetect: Fast Zero-shot Machine-Generated Text Detection based on Perplexity under Text Shuffling

Arxiv

0+阅读 · 4月28日

参考链接

微信扫码咨询专知VIP会员