用于探索的噪声脉冲执行器网络 (Noisy Spiking Actor Network for Exploration) - 专知论文

会员服务 ·

0

噪声 · 脉冲 · 执行器 · 鲁棒 · 扰动 ·

2025 年 12 月 11 日

Noisy Spiking Actor Network for Exploration

翻译：用于探索的噪声脉冲执行器网络

Ding Chen,Peixi Peng,Tiejun Huang,Yonghong Tian

from arxiv, There are some issues with the method and it needs to be withdrawn

As a general method for exploration in deep reinforcement learning (RL), NoisyNet can produce problem-specific exploration strategies. Spiking neural networks (SNNs), due to their binary firing mechanism, have strong robustness to noise, making it difficult to realize efficient exploration with local disturbances. To solve this exploration problem, we propose a noisy spiking actor network (NoisySAN) that introduces time-correlated noise during charging and transmission. Moreover, a noise reduction method is proposed to find a stable policy for the agent. Extensive experimental results demonstrate that our method outperforms the state-of-the-art performance on a wide range of continuous control tasks from OpenAI gym.

翻译：作为深度强化学习（RL）中一种通用的探索方法，NoisyNet能够生成针对特定问题的探索策略。脉冲神经网络（SNNs）由于其二元发放机制，对噪声具有较强的鲁棒性，这使得通过局部扰动实现高效探索变得困难。为解决这一探索问题，我们提出了一种噪声脉冲执行器网络（NoisySAN），该网络在充电和传输过程中引入时间相关噪声。此外，我们还提出了一种降噪方法，旨在为智能体寻找稳定的策略。大量的实验结果表明，我们的方法在OpenAI gym的一系列连续控制任务上超越了现有最先进方法的性能。

0

相关内容

【CVPR2024】VideoMAC: 视频掩码自编码器与卷积神经网络

【CVPR2024】VideoMAC: 视频掩码自编码器与卷积神经网络

专知会员服务

17+阅读 · 2024年3月4日

【CIKM2023】GiGaMAE: 通过协同潜在空间重建的可泛化图掩码自编码器

【CIKM2023】GiGaMAE: 通过协同潜在空间重建的可泛化图掩码自编码器

专知会员服务

23+阅读 · 2023年8月22日

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知会员服务

112+阅读 · 2019年11月25日

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

【MIT】最优传输图神经网络，Optimal Transport Graph Neural Networks

【MIT】最优传输图神经网络，Optimal Transport Graph Neural Networks

专知

18+阅读 · 2020年6月22日

【KDD2020】XGNN-可解释图神经网络，从模型级解释构建可信赖GNN

【KDD2020】XGNN-可解释图神经网络，从模型级解释构建可信赖GNN

专知

17+阅读 · 2020年6月7日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于二维电子气等离极化激元共振的高速太赫兹调制器

国家自然科学基金

0+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

带有输入饱和的非线性控制系统的量化反馈控制

国家自然科学基金

0+阅读 · 2015年12月31日

基于决策模型和预备电位的运动想象BCI研究

国家自然科学基金

3+阅读 · 2015年12月31日

Schwarz Information Criterion Aided Multi-Armed Bandit for Decentralized Resource Allocation in Dynamic LoRa Networks

Arxiv

0+阅读 · 1月2日

Depth-Synergized Mamba Meets Memory Experts for All-Day Image Reflection Separation

Arxiv

0+阅读 · 1月1日

Feature Slice Matching for Precise Bug Detection

Arxiv

0+阅读 · 2025年12月31日

Projection-based Adversarial Attack using Physics-in-the-Loop Optimization for Monocular Depth Estimation

Arxiv

0+阅读 · 2025年12月31日

Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification

Arxiv

0+阅读 · 2025年12月31日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR2024】VideoMAC: 视频掩码自编码器与卷积神经网络

【CVPR2024】VideoMAC: 视频掩码自编码器与卷积神经网络

专知会员服务

17+阅读 · 2024年3月4日

【CIKM2023】GiGaMAE: 通过协同潜在空间重建的可泛化图掩码自编码器

【CIKM2023】GiGaMAE: 通过协同潜在空间重建的可泛化图掩码自编码器

专知会员服务

23+阅读 · 2023年8月22日

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

【超越消息传递:图神经网络的物理启发范式】Beyond Message Passing: a Physics-Inspired Paradigm for Graph Neural Networks

专知会员服务

17+阅读 · 2022年5月10日

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

【CVPR 2021】变换器跟踪TransT: Transformer Tracking

专知会员服务

22+阅读 · 2021年4月20日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知会员服务

112+阅读 · 2019年11月25日

热门VIP内容

开通专知VIP会员享更多权益服务

智能体评判者（Agent-as-a-Judge）研究综述

《空战中心自动化持续训练》报告

区块链自主智能体：标准规范、执行模型与信任边界研究

面向无人机战场调整作战训练中心

相关资讯

AAAI 2022 | ProtGNN：自解释图神经网络

AAAI 2022 | ProtGNN：自解释图神经网络

专知

10+阅读 · 2022年2月28日

【MIT】最优传输图神经网络，Optimal Transport Graph Neural Networks

【MIT】最优传输图神经网络，Optimal Transport Graph Neural Networks

专知

18+阅读 · 2020年6月22日

【KDD2020】XGNN-可解释图神经网络，从模型级解释构建可信赖GNN

【KDD2020】XGNN-可解释图神经网络，从模型级解释构建可信赖GNN

专知

17+阅读 · 2020年6月7日

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

【CVPR2020-旷视】DPGN：分布传播图网络的小样本学习

专知

13+阅读 · 2020年4月1日

【NeurIPS2019】图变换网络：Graph Transformer Network

【NeurIPS2019】图变换网络：Graph Transformer Network

专知

245+阅读 · 2019年11月18日

相关论文

Schwarz Information Criterion Aided Multi-Armed Bandit for Decentralized Resource Allocation in Dynamic LoRa Networks

Arxiv

0+阅读 · 1月2日

Depth-Synergized Mamba Meets Memory Experts for All-Day Image Reflection Separation

Arxiv

0+阅读 · 1月1日

Feature Slice Matching for Precise Bug Detection

Arxiv

0+阅读 · 2025年12月31日

Projection-based Adversarial Attack using Physics-in-the-Loop Optimization for Monocular Depth Estimation

Arxiv

0+阅读 · 2025年12月31日

Hybrid Convolution and Vision Transformer NAS Search Space for TinyML Image Classification

Arxiv

0+阅读 · 2025年12月31日

相关基金

软件定义网络（SDN）环境下基于机器学习的路由预规划研究

国家自然科学基金

5+阅读 · 2015年12月31日

基于二维电子气等离极化激元共振的高速太赫兹调制器

国家自然科学基金

0+阅读 · 2015年12月31日

基于自主学习的Ad hoc Agent序贯决策研究

国家自然科学基金

46+阅读 · 2015年12月31日

带有输入饱和的非线性控制系统的量化反馈控制

国家自然科学基金

0+阅读 · 2015年12月31日

基于决策模型和预备电位的运动想象BCI研究

国家自然科学基金

3+阅读 · 2015年12月31日

微信扫码咨询专知VIP会员