Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning
In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, whic...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9256306/ |
id |
doaj-9777204bb40348549b4a38b17969f019 |
---|---|
record_format |
Article |
spelling |
doaj-9777204bb40348549b4a38b17969f0192021-03-30T04:17:59ZengIEEEIEEE Access2169-35362020-01-01820665920667310.1109/ACCESS.2020.30373239256306Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement LearningHyun-Suk Lee0https://orcid.org/0000-0001-5885-1711Jang-Won Lee1https://orcid.org/0000-0002-5627-5914School of Intelligent Mechatronics Engineering, Sejong University, Seoul, South KoreaDepartment of Electrical and Electronic Engineering, Yonsei University, Seoul, South KoreaIn this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices.https://ieeexplore.ieee.org/document/9256306/Deep learningIoT devicesmobilitynon-staticpolicy-gradientswitched-beam |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hyun-Suk Lee Jang-Won Lee |
spellingShingle |
Hyun-Suk Lee Jang-Won Lee Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning IEEE Access Deep learning IoT devices mobility non-static policy-gradient switched-beam |
author_facet |
Hyun-Suk Lee Jang-Won Lee |
author_sort |
Hyun-Suk Lee |
title |
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning |
title_short |
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning |
title_full |
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning |
title_fullStr |
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning |
title_full_unstemmed |
Adaptive Wireless Power Transfer Beam Scheduling for Non-Static IoT Devices Using Deep Reinforcement Learning |
title_sort |
adaptive wireless power transfer beam scheduling for non-static iot devices using deep reinforcement learning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In this article, we study wireless power transfer (WPT) beam scheduling for a system which consists of IoT devices and a power beacon (PB) using switched beamforming. In such a system, the IoT devices have a non-static behavior (e.g., their location and power requests keep changing) in general, which conventional WPT beam scheduling algorithms are not capable of adaptively dealing with. To address the non-static behavior, we propose a procedure of deep neural network (DNN)-based WPT beam scheduling. In the procedure, the power-deficient IoT devices transmit a common pilot signal simultaneously. Then, the PB effectively provides power to them with a DNN-based WPT beam scheduling policy. In the DNN-based policy, an estimation of the non-static behavior from the received pilot signals and an adaptive beam generation considering the estimated non-static behavior are integrated thanks to the powerful representational capability of DNNs. To allow the DNN-based policy to learn the optimal policy, we propose a Deep WPT Beam scheduling policy Gradient (DWBG) algorithm using deep reinforcement learning. Through the simulation, we show that DWBG achieves a close performance to the optimal policy. This demonstrates that our algorithm can be applied for practical WPT IoT systems with non-static IoT devices. |
topic |
Deep learning IoT devices mobility non-static policy-gradient switched-beam |
url |
https://ieeexplore.ieee.org/document/9256306/ |
work_keys_str_mv |
AT hyunsuklee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning AT jangwonlee adaptivewirelesspowertransferbeamschedulingfornonstaticiotdevicesusingdeepreinforcementlearning |
_version_ |
1724182000018915328 |