Anthropic Papers - Zhuanzhi

Anthropic

A Red-Team Study of Anthropic Fable 5 & Opus 4.8 Models

Arxiv

0+ reads · June 16

Do Large Language Models Have Emotions?

Arxiv

0+ reads · June 3

AMEL: Accumulated Message Effects on LLM Judgments

Arxiv

0+ reads · June 9

AMEL: Accumulated Message Effects on LLM Judgments

Arxiv

0+ reads · May 21

Can AI Make Conflicts Worse? An Alignment Failure in LLM Deployment Across Conflict Contexts

Arxiv

0+ reads · May 21

Does Claude's Constitution Have a Culture?

Arxiv

0+ reads · March 30

Corporations Constitute Intelligence

Arxiv

0+ reads · April 3

Hidden Topics: Measuring Sensitive AI Beliefs with List Experiments

Arxiv

0+ reads · February 25

Benchmarking Political Persuasion Risks Across Frontier Large Language Models

Arxiv

0+ reads · March 10

Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities

Arxiv

0+ reads · February 5

Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset

Arxiv

0+ reads · January 9

Strategic Intelligence in Large Language Models: Evidence from Evolutionary Game Theory

Arxiv

0+ reads · July 3, 2025

An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models

Arxiv

0+ reads · March 15, 2025

An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models

Arxiv

0+ reads · January 21, 2025

Toward Democracy Levels for AI

Arxiv

0+ reads · December 8, 2024
