Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning

In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, whic...

Full description

Bibliographic Details
Main Authors: Hyun-Suk Lee, Jang-Won Lee
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9256306/
id doaj-9777204bb40348549b4a38b17969f019
record_format Article
spelling doaj-9777204bb40348549b4a38b17969f0192021-03-30T04:17:59ZengIEEEIEEE Access2169-35362020-01-01820665920667310.1109/ACCESS.2020.30373239256306Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement LearningHyun-Suk Lee0https://orcid.org/0000-0001-5885-1711Jang-Won Lee1https://orcid.org/0000-0002-5627-5914School of Intelligent Mechatronics Engineering, Sejong University, Seoul, South KoreaDepartment of Electrical and Electronic Engineering, Yonsei University, Seoul, South KoreaIn this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices.https://ieeexplore.ieee.org/document/9256306/Deep learningIoT devicesmobilitynon-staticpolicy-gradientswitched-beam
collection DOAJ
language English
format Article
sources DOAJ
author Hyun-Suk Lee
Jang-Won Lee
spellingShingle Hyun-Suk Lee
Jang-Won Lee
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
IEEE Access
Deep learning
IoT devices
mobility
non-static
policy-gradient
switched-beam
author_facet Hyun-Suk Lee
Jang-Won Lee
author_sort Hyun-Suk Lee
title Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_short Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_full Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_fullStr Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_full_unstemmed Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_sort adaptive wireless power transfer beam scheduling for non-static iot devices using deep reinforcement learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices.
topic Deep learning
IoT devices
mobility
non-static
policy-gradient
switched-beam
url https://ieeexplore.ieee.org/document/9256306/
work_keys_str_mv AT hyunsuklee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning
AT jangwonlee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning
_version_ 1724182000018915328