skip to main content
10.1145/3565287.3617622acmconferencesArticle/Chapter ViewAbstractPublication PagesmobihocConference Proceedingsconference-collections
research-article
Public Access

Generalized Path Planning for Collaborative UAVs using Reinforcement and Imitation Learning

Published: 16 October 2023 Publication History

Abstract

Cellular-connected Unmanned Aerial Vehicles (UAVs) need consistent cellular network connectivity to effectively accomplish their designated missions. However, when navigating through regions with partial coverage, such as rural areas, the task of planning the flight paths for these UAV missions becomes notably intricate. Algorithms designed to solve this issue require significant computational resources, making them infeasible for active deployment where an algorithm must run in real time using small compute power. Furthermore, these algorithms exponentially scale in run-time with respect to the number of UAVs being considered. To tackle this problem, we model the parameter space as a discrete grid-world, enable collaboration between drones, and gather supervised data from nonlinear programming and unsupervised data from a simulated version of the environment with associated rewards. We then train a Deep Neural Network (DNN) on this data and approximate optimal results by combining imitation and reinforcement learning methods. This DNN can successfully be deployed at fast speeds using relatively small computational power and can generalize to unseen maps where drone collaboration can be used to reduce mission time. By using the results of a network trained on supervised data as a guiding hand during training, our reinforcement learning approach achieves results better than either method in isolation.

References

[1]
Stuart M Adams and Carol J Friedland. 2011. A survey of unmanned aerial vehicle (UAV) usage for imagery collection in disaster research and management. In 9th international workshop on remote sensing for disaster response, Vol. 8. 1--8.
[2]
Barto and Sutton. 1992. reinforcement Learning: An Introduction. http://incompleteideas.net/book/RLbook2020.pdf
[3]
Eyuphan Bulut and Ismail Guevenc. 2018. Trajectory optimization for cellular-connected UAVs with disconnectivity constraint. In IEEE International Conference on Communications Workshops (ICC Workshops). 1--6.
[4]
Amirahmad Chapnevis, Ismail Güvenç, Laurent Njilla, and Eyuphan Bulut. 2021. Collaborative trajectory optimization for outage-aware cellular-enabled UAVs. In IEEE 93rd Vehicular Technology Conference (VTC2021-Spring). 1--6.
[5]
Yu-Jia Chen and Da-Yu Huang. 2020. Trajectory optimization for cellular-enabled UAV with connectivity outage constraint. IEEE Access 8 (2020), 29205--29218.
[6]
AS Danilov, Ur D Smirnov, and MA Pashkevich. 2015. The system of the ecological monitoring of environment which is based on the usage of UAV. Russian journal of ecology 46, 1 (2015), 14--19.
[7]
Omid Esrafilian, Rajeev Gangula, and David Gesbert. 2020. 3D-map assisted UAV trajectory design under cellular connectivity constraints. In IEEE International Conference on Communications (ICC). 1--6.
[8]
Jakob Foerster, Gregory Farquhar, Triantafyllos Afouras, Nantas Nardelli, and Shimon Whiteson. 2018. Counterfactual multi-agent policy gradients. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32.
[9]
Kaan Gokcesu and Hakan Gokcesu. 2021. Generalized huber loss for robust learning and its efficient minimization for a robust statistics. arXiv preprint arXiv:2108.12627 (2021).
[10]
Nir Greshler, Ofir Gordon, Oren Salzman, and Nahum Shimkin. 2021. Cooperative multi-agent path finding: Beyond path planning and collision avoidance. In 2021 International Symposium on Multi-Robot and Multi-Agent Systems (MRS). IEEE, 20--28.
[11]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602 (2013).
[12]
Francesco Nex and Fabio Remondino. 2014. UAV for 3D mapping applications: a review. Applied geomatics 6 (2014), 1--15.
[13]
Guillaume Sartoretti, Justin Kerr, Yunfei Shi, Glenn Wagner, TK Satish Kumar, Sven Koenig, and Howie Choset. 2019. Primal: Pathfinding via reinforcement and imitation multi-agent learning. IEEE Robotics and Automation Letters 4, 3 (2019), 2378--2385.
[14]
Hazim Shakhatreh, Ahmad H Sawalmeh, Ala Al-Fuqaha, Zuochao Dou, Eyad Almaita, Issa Khalil, Noor Shamsiah Othman, Abdallah Khreishah, and Mohsen Guizani. 2019. Unmanned aerial vehicles (UAVs): A survey on civil applications and key research challenges. Ieee Access 7 (2019), 48572--48634.
[15]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[16]
Shuowen Zhang and Rui Zhang. 2019. Trajectory design for cellular-connected UAV under outage duration constraint. In IEEE International Conference on Communications (ICC). 1--6.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MobiHoc '23: Proceedings of the Twenty-fourth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing
October 2023
621 pages
ISBN:9781450399265
DOI:10.1145/3565287
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 October 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. UAV
  2. trajectory optimization
  3. cellular network
  4. reinforcement learning

Qualifiers

  • Research-article

Funding Sources

Conference

MobiHoc '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 296 of 1,843 submissions, 16%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 117
    Total Downloads
  • Downloads (Last 12 months)90
  • Downloads (Last 6 weeks)14
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media