Amplify, Don't Create: Temporal Accumulation for Slow-Burn Prompt Injection

Most prompt-injection detectors score a single event or message. Control-plane attacks against tool-using agents can instead distribute weak directives across a trajectory while keeping each event below the detection threshold. We test whether a proxy-side temporal accumulator recovers this slow-burn signal by reducing frozen per-event scores to peak and CUSUM persistence statistics. To avoid circularity, grafts are generated against a held-out autoregressive cloaking target and then re-scored under a detector of record: a frozen char-ngram SVM plus an embedding-contrastive head. Only floor-met grafts that are bound to executed action edges and remain sub-threshold under the detector of record enter the slow-burn endpoint. This is a boundary result, not a deployable detector. On concentrated attacks, trajectory-level accumulation beats the per-event foil under a clustered bootstrap (gap +0.092, 95% CI [+0.025, +0.155]), while persistence and peak are statistically tied. On git repo-exfil, density-four floor-met sub-threshold grafts add persistence mass that matched benign shams do not (persistence-delta AUC 0.708 over four attack survivors and six benign shams), while the matched peak-delta control does not separate attack from sham (AUC 0.417). This localizes the effect to accumulated persistence rather than a single hot graft. The effect fails on broader clean-path actions (persistence-delta AUC 0.167), where the detector assigns attack and benign actions indistinguishable per-event scores, leaving no margin for CUSUM to bank. Only three to four independent tasks are available, which blocks an independently powered test. Temporal accumulation is therefore a narrow-band margin amplifier: it can bank elevated sub-threshold signal but cannot create margin where the per-event detector has none. As byproducts, we contribute a pseudo-replication warning and an independence-audit standard for agent-benchmark evaluation.
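
The distinction between the peak and persistence statistics can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact CUSUM formulation, so the one-sided recursion, the `REFERENCE` level, the `THRESHOLD` value, and all per-event scores below are assumptions chosen for illustration. The `auc` helper is the usual pairwise ranking estimate, included only to show how a statistic might be compared between attack and sham trajectories.

```python
from typing import Sequence


def peak_stat(scores: Sequence[float]) -> float:
    """Largest per-event score in the trajectory."""
    return max(scores)


def cusum_persistence(scores: Sequence[float], reference: float) -> float:
    """Maximum of a one-sided CUSUM: S_t = max(0, S_{t-1} + score_t - reference)."""
    running = 0.0
    best = 0.0
    for score in scores:
        running = max(0.0, running + score - reference)
        best = max(best, running)
    return best


def auc(positives: Sequence[float], negatives: Sequence[float]) -> float:
    """Probability that a positive outranks a negative, counting ties as one half."""
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical values: neither the threshold nor the reference level comes from the paper.
THRESHOLD = 0.5
REFERENCE = 0.2

# Hypothetical per-event detector scores for three trajectories.
slow_burn = [0.1, 0.35, 0.4, 0.35, 0.4, 0.1]  # every event stays below THRESHOLD
benign = [0.1, 0.15, 0.45, 0.1, 0.2, 0.1]     # one isolated warm event, higher peak
no_margin = [0.1, 0.15, 0.2, 0.1, 0.15, 0.1]  # attack scored like benign traffic

for name, traj in [("slow_burn", slow_burn), ("benign", benign), ("no_margin", no_margin)]:
    print(name, peak_stat(traj), round(cusum_persistence(traj, REFERENCE), 3))
```

In this toy setup the slow-burn trajectory never crosses the threshold and has a lower peak than the benign one, yet it accumulates more persistence mass because its elevated scores are sustained. The `no_margin` trajectory never rises above the reference level, so the accumulator stays at zero, which mirrors the abstract's point that accumulation can amplify an existing margin but cannot create one.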
