Claude论文 - 专知

会员服务 ·

Claude

When Helpfulness Overrides Causal Caution: Context-Dependent Suppression and Recovery in LLMs

Arxiv

0+阅读 · 6月23日

Detecting AI Coding Agents in Open Source: A Validated Multi-Method Census of 180 Million Repositories

Arxiv

0+阅读 · 6月23日

Configuration Smells in AGENTS.md Files: Common Mistakes in Configuring Coding Agents

Arxiv

0+阅读 · 6月19日

Evaluating LLMs for Real-World Web Vulnerability Detection

Arxiv

0+阅读 · 6月19日

Can LLMs Reason About Brand Ownership? An Empirical Study of Domain Attribution Intelligence

Arxiv

0+阅读 · 6月18日

Beyond the GUI Paradigm: Do Mobile Agents Need the Phone Screen?

Arxiv

0+阅读 · 6月16日

VISUALSKILL: Multimodal Skills for Computer-Use Agents

Arxiv

0+阅读 · 6月16日

Your AI Travel Agent Would Book You a Bullfight: An Agentic Benchmark for Implicit Animal Welfare in Frontier AI Models

Arxiv

0+阅读 · 6月17日

Evaluating Prompting-Based Defenses Against Domain-Camouflaged Injection Attacks

Arxiv

0+阅读 · 6月16日

Your AI Travel Agent Would Book You a Bullfight: An Agentic Benchmark for Implicit Animal Welfare in Frontier AI Models

Arxiv

0+阅读 · 6月16日

Trust Between AI Agents: Measuring Formation, Breakage, and Recovery, with Implications for Governing Multi-Agent Systems

Arxiv

0+阅读 · 6月12日

Configuration Smells in AGENTS.md Files: Common Mistakes in Configuring Coding Agents

Arxiv

0+阅读 · 6月14日

Evolutionary Dynamics of Cooperation in Next-Generation LLM Agent Systems: A Cross-Provider Empirical Extension

Arxiv

0+阅读 · 6月14日

Do Large Language Models Have Emotions?

Arxiv

0+阅读 · 6月3日

HyDRA: Hybrid Dynamic Routing Architecture for Heterogeneous LLM Pools

Arxiv

0+阅读 · 6月12日

参考链接

微信扫码咨询专知VIP会员