Persistent aerial monitoring under unknown stochastic dynamics in pick-up and delivery missions

Ahmet Semi Asarkaya, Derya Aksaray, Yasin Yazıcıoğlu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

We consider a delivery drone that is supposed to achieve pick-up and delivery tasks that arrive stochastically during a mission. Since a delivery drone is often equipped with a camera, it can also gather useful information by monitoring the environment during the pick-up and delivery task. Motivated by the multi-use of drones, we address a persistent monitoring problem where a drone’s high-level decision making is modeled as a Markov decision process (MDP) with unknown transition probabilities. The reward function is designed based on the valuable information over the environment, and the pick-up and delivery tasks are defined by bounded time temporal logic specifications. We use a reinforcement learning (RL) algorithm that maximizes the expected sum of rewards while various dynamically arriving temporal logic specifications are satisfied with a desired probability in every episode during learning. We demonstrate the simulation results and discuss the quality of the proposed method.

Original languageEnglish (US)
Title of host publicationAIAA Scitech 2021 Forum
PublisherAmerican Institute of Aeronautics and Astronautics Inc, AIAA
Pages1-13
Number of pages13
ISBN (Print)9781624106095
DOIs
StatePublished - Jan 4 2021
EventAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2021 - Virtual, Online
Duration: Jan 11 2021Jan 15 2021

Publication series

NameAIAA Scitech 2021 Forum

Conference

ConferenceAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2021
CityVirtual, Online
Period1/11/211/15/21

Bibliographical note

Publisher Copyright:
© 2021, American Institute of Aeronautics and Astronautics Inc, AIAA. All Rights Reserved.

Fingerprint

Dive into the research topics of 'Persistent aerial monitoring under unknown stochastic dynamics in pick-up and delivery missions'. Together they form a unique fingerprint.

Cite this