Distributed and energy-efficient mobile crowdsensing with charging stations by deep reinforcement learning

Chi Harold Liu*, Zipeng Dai, Yinuo Zhao, Jon Crowcroft, Dapeng Wu, Kin K. Leung

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

101 Citations (Scopus)

Abstract

Mobile crowdsensing (MCS) represents a new sensing paradigm that utilizes the smart mobile devices to collect and share data. Traditional MCS systems mainly leverages the people carried smartphones and other wearable devices which are constrained by the limited sensing capability and battery power. With the popularity of unmanned vehicles like unmanned aerial vehicles (UAVs) and driverless cars, they can provide much more reliable, accurate and cost-efficient sensing services due to to their equipped more powerful sensors. In this paper, we propose a distributed control framework for energy-efficient and DIstributed VEhicle navigation with chaRging sTations, called 'e-Divert'. It is a distributed multi-agent deep reinforcement learning (DRL) solution, which uses a convolutional neural network (CNN) to extract useful spatial features as the input to the actor-critic network to produce a real-time action. Also, e-Divert incorporates a distributed prioritized experience replay for better exploration and exploitation, and a long short-term memory (LSTM) enabled N-step temporal sequence modeling module. The solution fully explores the spatiotemporal nature of the considered scenario for better vehicle cooperation and competition between themselves and charging stations, to maximize the energy efficiency, data collection ratio, geographic fairness, and minimize the energy consumption simultaneously. Through extensive simulations, we find an appropriate set of hyperparameters that achieve the best performance, i.e., 5 actors in Ape-X architecture, priority exponent 0.5, and LSTM sequence length 3. Finally, we compare with four baselines including one state-of-the-art approach MADDPG. Results show that our proposed e-Divert significantly improves the energy efficiency, as compared to MADDPG, by 3.62 and 2.36 times on average when varying different numbers of vehicles and charging stations, respectively.

Original languageEnglish
Article number8821415
Pages (from-to)130-146
Number of pages17
JournalIEEE Transactions on Mobile Computing
Volume20
Issue number1
DOIs
Publication statusPublished - 1 Jan 2021

Keywords

  • Mobile crowdsensing
  • charging stations
  • deep reinforcement learning

Fingerprint

Dive into the research topics of 'Distributed and energy-efficient mobile crowdsensing with charging stations by deep reinforcement learning'. Together they form a unique fingerprint.

Cite this