Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

Source: Journal of Shanghai Jiao Tong University (English Edition)

Authors: JI Xiukun, HAI Jintao, LUO Wenguang, LIN Cuixia, XIONG Yu, OU Zengkai, WEN Jiayan

Affiliations: Guangxi Key Laboratory of Auto Parts and Vehicle Technology; School of Electrical and Information Eng

Published in: Journal of Shanghai Jiao Tong University (English Edition)

Issue: 2021, No. 5

Keywords: wheelbarrow, multi-agent, deep reinforcement learning (DRL), formation, obstacle avo

Excerpt from the paper

To address the difficult control-law design, poor portability, and poor stability of traditional multi-agent formation obstacle avoidance algorithms, a multi-agent formation obstacle avoidance method based on deep reinforcement learning (DRL) is proposed. The method combines the perception ability of convolutional neural networks (CNNs) with the decision-making ability of reinforcement learning in a general form and, through end-to-end learning, maps the visual perception input of the environment directly to control actions. A multi-agent system (MAS) model of the leader-follower formation method was designed with the wheelbarrow as the control object. An improved deep Q network (DQN) algorithm was designed to achieve obstacle avoidance and collision avoidance while the agents move into the desired formation. The improvements concern the discount factor, the learning efficiency, and a reward function that considers both the distance between each agent and the obstacle and a coordination factor among the agents. The simulation results show that the proposed method achieves the expected goal of multi-agent formation obstacle avoidance and has stronger portability than the traditional algorithm.
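The excerpt does not give the paper's reward function or hyperparameters, so the following is only a minimal sketch of the general idea: a reward that combines an agent-to-obstacle distance term with a formation coordination term, fed into the standard DQN temporal-difference target. Every name, constant, and weight here (`SAFE_DIST`, `COLLISION_DIST`, `GAMMA`, `w_coord`, the linear penalty shape) is a hypothetical choice made for illustration, not the authors' design.

```python
import math

# All constants are hypothetical; the excerpt does not state the paper's values.
SAFE_DIST = 1.0        # assumed radius inside which proximity is penalised
COLLISION_DIST = 0.2   # assumed distance treated as a collision
GAMMA = 0.9            # assumed discount factor


def obstacle_term(agent_pos, obstacle_pos):
    """Penalty that grows linearly as the agent closes in on the obstacle."""
    d = math.dist(agent_pos, obstacle_pos)
    if d <= COLLISION_DIST:
        return -10.0  # assumed fixed collision penalty
    if d < SAFE_DIST:
        return -(SAFE_DIST - d) / (SAFE_DIST - COLLISION_DIST)
    return 0.0


def coordination_term(follower_pos, leader_pos, desired_offset):
    """Negative distance between the follower and its slot relative to the leader."""
    slot = (leader_pos[0] + desired_offset[0], leader_pos[1] + desired_offset[1])
    return -math.dist(follower_pos, slot)


def reward(follower_pos, leader_pos, desired_offset, obstacle_pos, w_coord=0.5):
    """Weighted sum of the obstacle term and the coordination term."""
    return (obstacle_term(follower_pos, obstacle_pos)
            + w_coord * coordination_term(follower_pos, leader_pos, desired_offset))


def td_target(r, next_q_values, done, gamma=GAMMA):
    """Standard DQN target: r + gamma * max Q(s', a'), no bootstrap at terminal states."""
    return r if done else r + gamma * max(next_q_values)
```

In this sketch a follower that sits exactly in its formation slot and far from the obstacle receives zero reward, and any deviation or close approach makes the reward negative. A larger `gamma` would weight long-horizon formation keeping more heavily against immediate penalties.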

Other articles

Wavelet Transform-Based High-Definition Map Construction From a Panoramic Camera

High-definition (HD) maps are key components that provide rich topologic and semantic information for decision-making in vehicle autonomous driving systems. A complete ground orthophoto is usually used as the base image to construct the HD map. The ground o

Journal

high-definition (HD) map, wavelet transform, image registration, panorama

Lightweight Method for Vehicle Re-identification Using Reranking Algorithm Based on Topology Informa

As an emerging visual task, vehicle re-identification refers to the identification of the same vehicle across multiple cameras. Herein, we propose a novel vehicle re-identification method that uses an improved ResNet-50 architecture and utilizes the topology

Journal

intelligent transportation system, vehicle re-identification, deep learning

Multi-Object Tracking Strategy of Autonomous Vehicle Using Modified Unscented Kalman Filter and Refe

In this study, a multi-object tracking (MOT) scheme based on a light detection and ranging sensor was proposed to overcome imprecise velocity observations in object occlusion scenarios. By applying real-time velocity estimation, a modified unscented Kalman f

Journal

multi-object tracking (MOT), light detection and ranging (LiDAR) sensor, unscented

Intelligent Analysis of Abnormal Vehicle Behavior Based on a Digital Twin

Analyzing a vehicle's abnormal behavior in surveillance videos is a challenging field, mainly due to the wide variety of anomaly cases and the complexity of surveillance videos. In this study, a novel intelligent vehicle behavior analysis framework based

Journal

digital twin, deep learning, vehicle detection, abnormal behavior

Efficient Online Vehicle Tracking for Real-Virtual Mapping Systems

Multi-object tracking is a vital problem as many applications require better tracking approaches. Although learning-based detectors are becoming extremely powerful, there are few tracking methods designed to work with them in real time. We explored an effici

Journal

multi-object tracking, vehicle tracking, tracklet association, real virtual mapping

Intelligent-Assist Algorithm for Remote Shared-Control Driving Based on Game Theory

Contemporary autonomous-driving technology relies on good environmental-perception systems and high-precision maps. For unknown environments or scenarios where perception fails, a human-in-the-loop remote-driving system can effectively complement common sol

Journal

remote driving, shared control, game theory

Stochastic Model Predictive Control Approach to Autonomous Vehicle Lane Keeping

In real-world scenarios, the uncertainty of measurements cannot be handled efficiently by traditional model predictive control (MPC). A stochastic MPC (SMPC) method for handling the uncertainty of states in autonomous driving lane-keeping scenarios is prese

Journal

stochastic model predictive control (SMPC), autonomous driving, lane keeping

Parameter Identification of Magic Formula Tire Model Based on Fibonacci Tree Optimization Algorithm

The magic formula (MF) tire model is a semi-empirical tire model that can precisely simulate tire behavior. The heuristic optimization algorithm is typically used for parameter identification of the MF tire model. To avoid the defect of the traditional heur

Journal

magic formula, tire model, parameter identification, Fibonacci tree optimization (F

Cooperative Adaptive Cruise Control Using Delay-Based Spacing Policy: A Robust Adaptive Non-Singular

This study proposes two speed controllers based on a robust adaptive non-singular terminal sliding mode control approach for the cooperative adaptive cruise control problem in a connected and automated vehicular platoon. The delay-based spacing policy is a

Journal

cooperative adaptive cruise control, delay-based spacing policy, adaptive non-sing

Developing High-Precision Maps for Automated Driving in China: Legal Obstacles and the Way to Overcom

A high-precision map (HPM) is the key infrastructure to realizing the function of automated driving (AD) and ensuring its safety. However, the current laws and regulations on HPMs in China can lead to serious legal compliance problems. Thus, proper measures sh

Journal

automated driving (AD), navigation electronic map (NEM), high-precision map (HPM)
