Local-remote systems allow robots to execute complex tasks in hazardous environments such as space and nuclear power stations. However, establishing accurate positional mapping between local and remote devices can be difficult due to time delays that can compromise system performance and stability. Enhancing the synchronicity and stability of local-remote systems is vital for enabling robots to interact with environments at greater distances and under highly challenging network conditions, including time delays. We introduce an adaptive control method employing reinforcement learning to tackle the time-delayed control problem. By adjusting controller parameters in real-time, this adaptive controller compensates for stochastic delays and improves synchronicity between local and remote robotic manipulators. To improve the adaptive PD controller's performance, we devise a model-based reinforcement learning approach that effectively incorporates multi-step delays into the learning framework. Utilizing this proposed technique, the local-remote system's performance is stabilized for stochastic communication time-delays of up to 290ms. Our results demonstrate that the suggested model-based reinforcement learning method surpasses the Soft-Actor Critic and augmented state Soft-Actor Critic techniques. Access the code at: https://github.com/CAV-Research-Lab/Predictive-Model-Delay-Correction
翻译:本地-远程系统使机器人能够在太空和核电站等危险环境中执行复杂任务。然而,由于时延会降低系统性能和稳定性,在本地设备与远程设备之间建立精确的位置映射往往较为困难。增强本地-远程系统的同步性与稳定性对于实现机器人在更大距离和更具挑战性的网络条件(包括时延)下与环境交互至关重要。我们提出了一种采用强化学习的自适应控制方法来解决时延控制问题。通过实时调整控制器参数,该自适应控制器能够补偿随机时延并提升本地与远程机械臂之间的同步性。为改进自适应PD控制器的性能,我们设计了一种基于模型的强化学习方法,该方法有效地将多步时延融入学习框架。利用所提出的技术,本地-远程系统在通信随机时延高达290毫秒的情况下仍能保持稳定性能。实验结果表明,所提出的基于模型的强化学习方法优于软演员-评论家算法和增强状态软演员-评论家技术。代码访问地址:https://github.com/CAV-Research-Lab/Predictive-Model-Delay-Correction