Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

来源 :上海交通大学学报(英文版) | 被引量 : 0次 | 上传用户:shuo19871108
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
To solve the problems of difficult control law design,poor portability,and poor stability of traditional multi-agent formation obstacle avoidance algorithms,a multi-agent formation obstacle avoidance method based on deep reinforcement learning (DRL) is proposed.This method combines the perception ability of convolutional neural networks (CNNs) with the decision-making ability of reinforcement learning in a general form and realizes direct output control from the visual perception input of the environment to the action through an end-to-end learning method.The multi-agent system (MAS) model of the follow-leader formation method was designed with the wheelbarrow as the control object.An improved deep Q netwrok (DQN) algorithm (we improved its discount factor and learning efficiency and designed a reward value function that considers the distance relationship between the agent and the obstacle and the coordination factor between the multi-agents) was designed to achieve obstacle avoidance and collision avoidance in the process of multi-agent formation into the desired formation.The simulation results show that the proposed method achieves the expected goal of multi-agent formation obstacle avoidance and has stronger portability compared with the traditional algorithm.
其他文献
High-definition (HD) maps are key components that provide rich topologic and semantic information for decision-making in vehicle autonomous driving systems.A complete ground orthophoto is usually used as the base image to construct the HD map.The ground o
As an emerging visual task,vehicle re-identification refers to the identification of the same vehicle across multiple cameras.Herein,we propose a novel vehicle re-identification method that uses an improved ResNet-50 architecture and utilizes the topology
In this study,a multi-object tracking (MOT) scheme based on a light detection and ranging sensor was proposed to overcome imprecise velocity observations in object occlusion scenarios.By applying real-time velocity estimation,a modified unscented Kalman f
Analyzing a vehicle\'s abnormal behavior in surveillance videos is a challenging field,mainly due to the wide variety of anomaly cases and the complexity of surveillance videos.In this study,a novel intelligent vehicle behavior analysis framework based
Multi-object tracking is a vital problem as many applications require better tracking approaches.Although learning-based detectors are becoming extremely powerful,there are few tracking methods designed to work with them in real time.We explored an effici
Contemporary autonomous-driving technology relies on good environmental-perception systems and high-precision maps.For unknown environments or scenarios where perception fails,a human-in-the-loop remote-driving system can effectively complement common sol
In real-world scenarios,the uncertainty of measurements cannot be handled efficiently by traditional model predictive control (MPC).A stochastic MPC (SMPC) method for handling the uncertainty of states in autonomous driving lane-keeping scenarios is prese
The magic formula (MF) tire model is a semi-empirical tire model that can precisely simulate tire behavior.The heuristic optimization algorithm is typically used for parameter identification of the MF tire model.To avoid the defect of the traditional heur
This study proposes two speed controllers based on a robust adaptive non-singular terminal sliding mode control approach for the cooperative adaptive cruise control problem in a connected and automated vehicular platoon.The delay-based spacing policy is a
A high-precision map (HPM) is the key infrastructure to realizing the function of automated driving(AD) and ensuring its safety.However,the current laws and regulations on HPMs in China can lead to serious legal compliance problems.Thus,proper measures sh