Chrome Extension
WeChat Mini Program
Use on ChatGLM

Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making.

IEEE TRANSACTIONS ON CYBERNETICS(2024)

Northwestern Polytech Univ | Singapore Univ Technol & Design | South China Univ Technol

Cited 10|Views151
Abstract
In this article, an expert system-based multiagent deep deterministic policy gradient (ESB-MADDPG) is proposed to realize the decision making for swarm robots. Multiagent deep deterministic policy gradient (MADDPG) is a multiagent reinforcement learning algorithm proposed to utilize a centralized critic within the actor-critic learning framework, which can reduce policy gradient variance. However, it is difficult to apply traditional MADDPG to swarm robots directly as it is time consuming during the path planning, rendering it necessary to propose a faster method to gather the trajectories. Besides, the trajectories obtained by the MADDPG are continuous by straight lines, which is not smooth and will be difficult for the swarm robots to track. This article aims to solve these problems by closing the above gaps. First, the ESB-MADDPG method is proposed to improve the training speed. The smooth processing of the trajectory is designed in the ESB-MADDPG. Furthermore, the expert system also provides us with many trained offline trajectories, which avoid the retraining each time we use the swarm robots. Considering the gathered trajectories, the model predictive control (MPC) algorithm is introduced to realize the optimal tracking of the offline trajectories. Simulation results show that combining ESB-MADDPG and MPC can realize swarm robot decision making efficiently.
More
Translated text
Key words
Swarm robotics,Robots,Trajectory,Path planning,Expert systems,Training,Optimization,Model prediction control,multiagent deep deterministic policy gradient (MADDPG),swarm robot decision making
PDF
Bibtex
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种基于专家系统的多智能体深度确定性策略梯度算法(ESB-MADDPG),以优化群机器人决策过程,通过引入模型预测控制(MPC)实现轨迹的最优跟踪。

方法】:通过结合专家系统与多智能体深度确定性策略梯度(MADDPG)算法,提高了训练速度并优化了轨迹平滑性。

实验】:在仿真实验中,使用ESB-MADDPG算法与MPC相结合,验证了算法在群机器人决策中的有效性,具体数据集名称未提及。