research-article

Public Access

Generalized Path Planning for Collaborative UAVs using Reinforcement and Imitation Learning

Authors:

Amirahmad Chapnevis,

Eyuphan BulutAuthors Info & Claims

MobiHoc '23: Proceedings of the Twenty-fourth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing

Pages 457 - 462

https://doi.org/10.1145/3565287.3617622

Published: 16 October 2023 Publication History

Abstract

Cellular-connected Unmanned Aerial Vehicles (UAVs) need consistent cellular network connectivity to effectively accomplish their designated missions. However, when navigating through regions with partial coverage, such as rural areas, the task of planning the flight paths for these UAV missions becomes notably intricate. Algorithms designed to solve this issue require significant computational resources, making them infeasible for active deployment where an algorithm must run in real time using small compute power. Furthermore, these algorithms exponentially scale in run-time with respect to the number of UAVs being considered. To tackle this problem, we model the parameter space as a discrete grid-world, enable collaboration between drones, and gather supervised data from nonlinear programming and unsupervised data from a simulated version of the environment with associated rewards. We then train a Deep Neural Network (DNN) on this data and approximate optimal results by combining imitation and reinforcement learning methods. This DNN can successfully be deployed at fast speeds using relatively small computational power and can generalize to unseen maps where drone collaboration can be used to reduce mission time. By using the results of a network trained on supervised data as a guiding hand during training, our reinforcement learning approach achieves results better than either method in isolation.

References

[1]

Stuart M Adams and Carol J Friedland. 2011. A survey of unmanned aerial vehicle (UAV) usage for imagery collection in disaster research and management. In 9th international workshop on remote sensing for disaster response, Vol. 8. 1--8.

[2]

Barto and Sutton. 1992. reinforcement Learning: An Introduction. http://incompleteideas.net/book/RLbook2020.pdf

[3]

Eyuphan Bulut and Ismail Guevenc. 2018. Trajectory optimization for cellular-connected UAVs with disconnectivity constraint. In IEEE International Conference on Communications Workshops (ICC Workshops). 1--6.

[4]

Amirahmad Chapnevis, Ismail Güvenç, Laurent Njilla, and Eyuphan Bulut. 2021. Collaborative trajectory optimization for outage-aware cellular-enabled UAVs. In IEEE 93rd Vehicular Technology Conference (VTC2021-Spring). 1--6.

[5]

Yu-Jia Chen and Da-Yu Huang. 2020. Trajectory optimization for cellular-enabled UAV with connectivity outage constraint. IEEE Access 8 (2020), 29205--29218.

[6]

AS Danilov, Ur D Smirnov, and MA Pashkevich. 2015. The system of the ecological monitoring of environment which is based on the usage of UAV. Russian journal of ecology 46, 1 (2015), 14--19.

[7]

Omid Esrafilian, Rajeev Gangula, and David Gesbert. 2020. 3D-map assisted UAV trajectory design under cellular connectivity constraints. In IEEE International Conference on Communications (ICC). 1--6.

[8]

Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, and Shimon Whiteson. 2018. Counterfactual multi-agent policy gradients. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32.

[9]

Kaan Gokcesu and Hakan Gokcesu. 2021. Generalized huber loss for robust learning and its efficient minimization for a robust statistics. arXiv preprint arXiv:2108.12627 (2021).

[10]

Nir Greshler, Ofir Gordon, Oren Salzman, and Nahum Shimkin. 2021. Cooperative multi-agent path finding: Beyond path planning and collision avoidance. In 2021 International Symposium on Multi-Robot and Multi-Agent Systems (MRS). IEEE, 20--28.

[11]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013).

[12]

Francesco Nex and Fabio Remondino. 2014. UAV for 3D mapping applications: a review. Applied geomatics 6 (2014), 1--15.

[13]

Guillaume Sartoretti, Justin Kerr, Yunfei Shi, Glenn Wagner, TK Satish Kumar, Sven Koenig, and Howie Choset. 2019. Primal: Pathfinding via reinforcement and imitation multi-agent learning. IEEE Robotics and Automation Letters 4, 3 (2019), 2378--2385.

[14]

Hazim Shakhatreh, Ahmad H Sawalmeh, Ala Al-Fuqaha, Zuochao Dou, Eyad Almaita, Issa Khalil, Noor Shamsiah Othman, Abdallah Khreishah, and Mohsen Guizani. 2019. Unmanned aerial vehicles (UAVs): A survey on civil applications and key research challenges. Ieee Access 7 (2019), 48572--48634.

[15]

Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).

[16]

Shuowen Zhang and Rui Zhang. 2019. Trajectory design for cellular-connected UAV under outage duration constraint. In IEEE International Conference on Communications (ICC). 1--6.

Cited By

Index Terms

Generalized Path Planning for Collaborative UAVs using Reinforcement and Imitation Learning
1. Computing methodologies
  1. Artificial intelligence
    1. Planning and scheduling
      1. Multi-agent planning
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks
2. Networks
  1. Network types
    1. Mobile networks

Recommendations

A reinforcement learning-based path planning for collaborative UAVs
SAC '22: Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing

Unmanned Aerial Vehicles (UAVs) are widely used in search and rescue missions for unknown environments, where maximized coverage for unknown devices is required. This paper considers using collaborative UAVs (Col-UAV) to execute such tasks. It proposes ...
Learning to Perform a Perched Landing on the Ground Using Deep Reinforcement Learning

A UAV with a variable sweep wing has the potential to perform a perched landing on the ground by achieving high pitch rates to take advantage of dynamic stall. This study focuses on the generation and evaluation of a trajectory to perform a perched ...
Reinforcement Learning for UAV Attitude Control

Autopilot systems are typically composed of an “inner loop” providing stability and control, whereas an “outer loop” is responsible for mission-level objectives, such as way-point navigation. Autopilot systems for unmanned aerial vehicles are ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MobiHoc '23: Proceedings of the Twenty-fourth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing

October 2023

621 pages

ISBN:9781450399265

DOI:10.1145/3565287

General Chairs:
Jie Wu,
Suresh Subramaniam,
Program Chairs:
Bo Ji,
Carla Fabiana Chiasserini

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMOBILE: ACM Special Interest Group on Mobility of Systems, Users, Data and Computing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 October 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

NSF (National Science Foundation)

Conference

MobiHoc '23

Sponsor:

SIGMOBILE

MobiHoc '23: Twenty-fourth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing

October 23 - 26, 2023

DC, Washington, USA

Acceptance Rates

Overall Acceptance Rate 296 of 1,843 submissions, 16%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
117
Total Downloads

Downloads (Last 12 months)90
Downloads (Last 6 weeks)14

Reflects downloads up to 25 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents