Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing

Shuran Sheng; Peng Chen; Zhimin Chen; Lenan Wu; Yuxuan Yao

doi:10.3390/s21051666

Deep Reinforcement Learning-Based Task Scheduling in IoT Edge Computing

Sensors (Basel). 2021 Feb 28;21(5):1666. doi: 10.3390/s21051666.

Authors

Shuran Sheng¹, Peng Chen², Zhimin Chen³, Lenan Wu¹, Yuxuan Yao⁴

Affiliations

¹ School of Information Science and Engineering, Southeast University, Nanjing 210096, China.
² State Key Laboratory of Millimeter Waves, Southeast University, Nanjing 210096, China.
³ School of Electronic and Information, Shanghai Dianji University, Shanghai 201306, China.
⁴ Shannxi Key Laboratory of Integrated and Intelligent Navigation, Xi'an 710068, China.

Abstract

Edge computing (EC) has recently emerged as a promising paradigm that supports resource-hungry Internet of Things (IoT) applications with low latency services at the network edge. However, the limited capacity of computing resources at the edge server poses great challenges for scheduling application tasks. In this paper, a task scheduling problem is studied in the EC scenario, and multiple tasks are scheduled to virtual machines (VMs) configured at the edge server by maximizing the long-term task satisfaction degree (LTSD). The problem is formulated as a Markov decision process (MDP) for which the state, action, state transition, and reward are designed. We leverage deep reinforcement learning (DRL) to solve both time scheduling (i.e., the task execution order) and resource allocation (i.e., which VM the task is assigned to), considering the diversity of the tasks and the heterogeneity of available resources. A policy-based REINFORCE algorithm is proposed for the task scheduling problem, and a fully-connected neural network (FCN) is utilized to extract the features. Simulation results show that the proposed DRL-based task scheduling algorithm outperforms the existing methods in the literature in terms of the average task satisfaction degree and success ratio.

Keywords: Internet of Things (IoT); deep reinforcement learning (DRL); edge computing; markov decision process (MDP); task scheduling.