Resilient Control of Networked Microgrids using Vertical Federated Reinforcement Learning: Designs and Real-Time Test-Bed Validations

Improving system-level resiliency of networked microgrids is an important aspect with increased population of inverter-based resources (IBRs). This paper (1) presents resilient control design in presence of adversarial cyber-events, and proposes a novel federated reinforcement learning (Fed-RL) approach to tackle (a) model complexities, unknown dynamical behaviors of IBR devices, (b) privacy issues regarding data sharing in multi-party-owned networked grids, and (2) transfers learned controls from simulation to hardware-in-the-loop test-bed, thereby bridging the gap between simulation and real world. With these multi-prong objectives, first, we formulate a reinforcement learning (RL) training setup generating episodic trajectories with adversaries (attack signal) injected at the primary controllers of the grid forming (GFM) inverters where RL agents (or controllers) are being trained to mitigate the injected attacks. For networked microgrids, the horizontal Fed-RL method involving distinct independent environments is not appropriate, leading us to develop vertical variant Federated Soft Actor-Critic (FedSAC) algorithm to grasp the interconnected dynamics of networked microgrid. Next, utilizing OpenAI Gym interface, we built a custom simulation set-up in GridLAB-D/HELICS co-simulation platform, named Resilient RL Co-simulation (ResRLCoSIM), to train the RL agents with IEEE 123-bus benchmark test systems comprising 3 interconnected microgrids. Finally, the learned policies in simulation world are transferred to the real-time hardware-in-the-loop test-bed set-up developed using high-fidelity Hypersim platform. Experiments show that the simulator-trained RL controllers produce convincing results with the real-time test-bed set-up, validating the minimization of sim-to-real gap.

翻译：提升网络微电网系统级弹性是伴随逆变器资源（IBR）日益普及的重要课题。本文（1）提出了对抗网络攻击事件下的弹性控制设计，并创新性地提出联邦强化学习（Fed-RL）方法，以应对（a）模型复杂度与IBR设备未知动态行为、（b）多方共有网络电网数据共享中的隐私问题；（2）将学习到的控制策略从仿真迁移至硬件在环实验平台，从而弥合仿真与现实世界的差距。基于这些多维度目标，首先构建强化学习（RL）训练框架，生成在主控制器（电网形成逆变器）中注入攻击信号的情景轨迹，RL智能体（或控制器）通过训练消除注入攻击。针对网络微电网，水平Fed-RL方法因涉及独立异构环境而不适用，因此我们开发了垂直变体联邦软演员-评论家（FedSAC）算法来捕捉网络微电网的互联动态特性。其次，利用OpenAI Gym接口，在GridLAB-D/HELICS联合仿真平台上构建了自定义仿真环境（弹性RL联合仿真平台ResRLCoSIM），基于包含三个互联微电网的IEEE 123节点基准测试系统训练RL智能体。最后，将仿真环境学习到的策略迁移至基于高保真Hypersim平台开发的实时硬件在环实验平台。实验表明，仿真训练的RL控制器在实时实验平台上产生可靠结果，验证了仿真到现实差距的最小化。

相关内容

Networking

关注 23

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

O’Reilly报告：知识图谱崛起——面向现代数据集成和数据结构体系，“The Rise of the Knowledge Graph——Toward Modern Data Integration and the Data Fabric Architecture”

专知会员服务

49+阅读 · 2022年2月18日

UCM《机器学习导论笔记》，80页pdf CSE176 Introduction to Machine Learning

专知会员服务

32+阅读 · 2021年9月29日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日