Chrome Extension

WeChat Mini Program

Use on ChatGLM

Log in

Academic Profile User Profile

My Following Paper Collections Browse History

Expert System-Based Multiagent Deep Deterministic Policy Gradient for Swarm Robot Decision Making.

Zhen Wang,Xiaoyue Jin,Tao Zhang, Jiahao Li,Dengxiu Yu,Kang Hao Cheong,C. L. Philip Chen

IEEE TRANSACTIONS ON CYBERNETICS（2024）

Northwestern Polytech Univ | Singapore Univ Technol & Design | South China Univ Technol

Cited 10|Views151

Abstract

In this article, an expert system-based multiagent deep deterministic policy gradient (ESB-MADDPG) is proposed to realize the decision making for swarm robots. Multiagent deep deterministic policy gradient (MADDPG) is a multiagent reinforcement learning algorithm proposed to utilize a centralized critic within the actor-critic learning framework, which can reduce policy gradient variance. However, it is difficult to apply traditional MADDPG to swarm robots directly as it is time consuming during the path planning, rendering it necessary to propose a faster method to gather the trajectories. Besides, the trajectories obtained by the MADDPG are continuous by straight lines, which is not smooth and will be difficult for the swarm robots to track. This article aims to solve these problems by closing the above gaps. First, the ESB-MADDPG method is proposed to improve the training speed. The smooth processing of the trajectory is designed in the ESB-MADDPG. Furthermore, the expert system also provides us with many trained offline trajectories, which avoid the retraining each time we use the swarm robots. Considering the gathered trajectories, the model predictive control (MPC) algorithm is introduced to realize the optimal tracking of the offline trajectories. Simulation results show that combining ESB-MADDPG and MPC can realize swarm robot decision making efficiently.

More

Translated text

Key words

Swarm robotics,Robots,Trajectory,Path planning,Expert systems,Training,Optimization,Model prediction control,multiagent deep deterministic policy gradient (MADDPG),swarm robot decision making

Bibtex

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Related Papers

Reference papers

Cited Papers

Understanding the Stochastic Dynamics of Sequential Decision-Making Processes: A Path-Integral Analysis of Multi-Armed Bandits

Bo Li,Chi Ho Yeung

CHAOS 2023

被引用0

Multi-agent Continual Coordination Via Progressive Task Contextualization

Lei Yuan,Lihe Li,Ziqian Zhang,Fuxiang Zhang,Cong Guan,Yang Yu

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024

被引用2

A Multi-Agent Adaptive Co-Evolution Method in Dynamic Environments

Yan Li,Huazhi Zhang, Weiming Xu,Jianan Wang, Jialu Wang,Suyu Wang

MATHEMATICS 2023

被引用0

Optimal Navigation for AGVs: A Soft Actor–critic-Based Reinforcement Learning Approach with Composite Auxiliary Rewards

Haisen Guo,Zhigang Ren,Jialun Lai,Zongze Wu,Shengli Xie

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2023

被引用10

A Fractal-based Complex Belief Entropy for Uncertainty Measure in Complex Evidence Theory

Keming Wu,Fuyuan Xiao,Yi Zhang

IEEE Trans Syst Man Cybern Syst 2025

被引用0

A Novel Approach for Target Attraction and Obstacle Avoidance of a Mobile Robot in Unknown Environments Using a Customized Spiking Neural Network

Brwa Abdulrahman Abubaker,Jafar Razmara,Jaber Karimpour

APPLIED SCIENCES-BASEL 2023

被引用2

Human Skill Knowledge Guided Global Trajectory Policy Reinforcement Learning Method

Yajing Zang,Pengfei Wang,Fusheng Zha,Wei Guo, Chuanfeng Li,Lining Sun

FRONTIERS IN NEUROROBOTICS 2024

被引用0

Neural Network-Based Distributed Intelligent Supervisory Control for Multi-Agent Systems with Unknown Input Powers

Deyang Jiang,Jiyu Zhu,Qikun Shen

Int J Syst Sci 2024

被引用0

Machining Parameter Optimization for a Batch Milling System Using Multi-Task Deep Reinforcement Learning

Pei Wang, Yixin Cui, Haizhen Tao,Xun Xu,Sheng Yang

JOURNAL OF MANUFACTURING SYSTEMS 2025

被引用1

A Swarm-Independent Behaviors-based Orbit Maneuvering Approach for Target-attacker-defender Games of Satellites

Hanyu Qian,Zhaoyue Chen, Xin Wang,Bing Xiao, Ling Meng, Yanan Ma

INFORMATION SCIENCES 2025

被引用0

Data Disclaimer

The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn

Chat Paper

【要点】：本文提出了一种基于专家系统的多智能体深度确定性策略梯度算法（ESB-MADDPG），以优化群机器人决策过程，通过引入模型预测控制（MPC）实现轨迹的最优跟踪。

【方法】：通过结合专家系统与多智能体深度确定性策略梯度（MADDPG）算法，提高了训练速度并优化了轨迹平滑性。

【实验】：在仿真实验中，使用ESB-MADDPG算法与MPC相结合，验证了算法在群机器人决策中的有效性，具体数据集名称未提及。

去 AI 文献库对话