Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning

In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, whic...

Full description

Bibliographic Details
Main Authors:	Hyun-Suk Lee, Jang-Won Lee
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Deep learning IoT devices mobility non-static policy-gradient switched-beam
Online Access:	https://ieeexplore.ieee.org/document/9256306/

id	doaj-9777204bb40348549b4a38b17969f019
record_format	Article
spelling	doaj-9777204bb40348549b4a38b17969f0192021-03-30T04:17:59ZengIEEEIEEE Access2169-35362020-01-01820665920667310.1109/ACCESS.2020.30373239256306Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement LearningHyun-Suk Lee0https://orcid.org/0000-0001-5885-1711Jang-Won Lee1https://orcid.org/0000-0002-5627-5914School of Intelligent Mechatronics Engineering, Sejong University, Seoul, South KoreaDepartment of Electrical and Electronic Engineering, Yonsei University, Seoul, South KoreaIn this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices.https://ieeexplore.ieee.org/document/9256306/Deep learningIoT devicesmobilitynon-staticpolicy-gradientswitched-beam
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Hyun-Suk Lee Jang-Won Lee
spellingShingle	Hyun-Suk Lee Jang-Won Lee Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning IEEE Access Deep learning IoT devices mobility non-static policy-gradient switched-beam
author_facet	Hyun-Suk Lee Jang-Won Lee
author_sort	Hyun-Suk Lee
title	Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_short	Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_full	Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_fullStr	Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_full_unstemmed	Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
title_sort	adaptive wireless power transfer beam scheduling for non-static iot devices using deep reinforcement learning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices.
topic	Deep learning IoT devices mobility non-static policy-gradient switched-beam
url	https://ieeexplore.ieee.org/document/9256306/
work_keys_str_mv	AT hyunsuklee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning AT jangwonlee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning
_version_	1724182000018915328

Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning

Similar Items