Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning

Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring...

Full description

Bibliographic Details
Main Authors: Chen, Yu Fan, Liu, Miao, Everett, Michael, How, Jonathan P.
Format: Text
Language:unknown
Published: 2016
Subjects:
Online Access:http://arxiv.org/abs/1609.07845
id ftarxivpreprints:oai:arXiv.org:1609.07845
record_format openpolar
spelling ftarxivpreprints:oai:arXiv.org:1609.07845 2023-09-05T13:22:22+02:00 Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning Chen, Yu Fan Liu, Miao Everett, Michael How, Jonathan P. 2016-09-26 http://arxiv.org/abs/1609.07845 unknown http://arxiv.org/abs/1609.07845 Computer Science - Multiagent Systems text 2016 ftarxivpreprints 2023-08-16T14:08:26Z Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26 percent improvement in paths quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy. Comment: 8 pages, 10 figures Text Orca ArXiv.org (Cornell University Library)
institution Open Polar
collection ArXiv.org (Cornell University Library)
op_collection_id ftarxivpreprints
language unknown
topic Computer Science - Multiagent Systems
spellingShingle Computer Science - Multiagent Systems
Chen, Yu Fan
Liu, Miao
Everett, Michael
How, Jonathan P.
Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
topic_facet Computer Science - Multiagent Systems
description Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized multiagent collision avoidance algorithm based on a novel application of deep reinforcement learning, which effectively offloads the online computation (for predicting interaction patterns) to an offline learning procedure. Specifically, the proposed approach develops a value network that encodes the estimated time to the goal given an agent's joint configuration (positions and velocities) with its neighbors. Use of the value network not only admits efficient (i.e., real-time implementable) queries for finding a collision-free velocity vector, but also considers the uncertainty in the other agents' motion. Simulation results show more than 26 percent improvement in paths quality (i.e., time to reach the goal) when compared with optimal reciprocal collision avoidance (ORCA), a state-of-the-art collision avoidance strategy. Comment: 8 pages, 10 figures
format Text
author Chen, Yu Fan
Liu, Miao
Everett, Michael
How, Jonathan P.
author_facet Chen, Yu Fan
Liu, Miao
Everett, Michael
How, Jonathan P.
author_sort Chen, Yu Fan
title Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
title_short Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
title_full Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
title_fullStr Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
title_full_unstemmed Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning
title_sort decentralized non-communicating multiagent collision avoidance with deep reinforcement learning
publishDate 2016
url http://arxiv.org/abs/1609.07845
genre Orca
genre_facet Orca
op_relation http://arxiv.org/abs/1609.07845
_version_ 1776202886139609088