GPT-2 Papers - Zhuanzhi (专知)

GPT-2

The Anxiety of Influence: Bloom Filters in Transformer Attention Heads

Arxiv

0+ reads · February 19

Large Language Models and Impossible Language Acquisition: "False Promise" or an Overturn of our Current Perspective towards AI

Arxiv

0+ reads · February 17

Context-Emotion Aware Therapeutic Dialogue Generation: A Multi-component Reinforcement Learning Approach to Language Models for Mental Health Support

Arxiv

0+ reads · February 16

Large Language Models and Impossible Language Acquisition: "False Promise" or an Overturn of our Current Perspective towards AI

Arxiv

0+ reads · February 9

Large Language Models and Impossible Language Acquisition: "False Promise" or an Overturn of our Current Perspective towards AI

Arxiv

0+ reads · February 13

Large Language Models and Impossible Language Acquisition: "False Promise" or an Overturn of our Current Perspective towards AI

Arxiv

0+ reads · February 11

$\infty$-MoE: Generalizing Mixture of Experts to Infinite Experts

Arxiv

0+ reads · January 25

Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs)

Arxiv

0+ reads · January 5

Modeling Language as a Sequence of Thoughts

Arxiv

0+ reads · December 31, 2025

SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance

Arxiv

0+ reads · December 24, 2025

Context-Emotion Aware Therapeutic Dialogue Generation: A Multi-component Reinforcement Learning Approach to Language Models for Mental Health Support

Arxiv

0+ reads · November 14, 2025

Dissecting the Ledger: Locating and Suppressing "Liar Circuits" in Financial Large Language Models

Arxiv

0+ reads · November 24, 2025

Universal Neurons in GPT-2: Emergence, Persistence, and Functional Impact

Arxiv

0+ reads · November 9, 2025

Weak-to-Strong Generalization Even in Random Feature Networks, Provably

Arxiv

0+ reads · November 9, 2025

RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning

Arxiv

0+ reads · October 28, 2025
