Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challe...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2021-01-01
|
Series: | Security and Communication Networks |
Online Access: | http://dx.doi.org/10.1155/2021/5530023 |
id |
doaj-882fec3a62f14da98bb9b96bdfa1b9d3 |
---|---|
record_format |
Article |
spelling |
doaj-882fec3a62f14da98bb9b96bdfa1b9d32021-05-03T00:01:14ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/5530023Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of ThingsJian Zhou0Xiaotian Gong1Lijuan Sun2Yong Xie3Xiaoyong Yan4College of ComputerCollege of ComputerCollege of ComputerCollege of ComputerCollege of ComputerSatellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies.http://dx.doi.org/10.1155/2021/5530023 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan |
spellingShingle |
Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things Security and Communication Networks |
author_facet |
Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan |
author_sort |
Jian Zhou |
title |
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things |
title_short |
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things |
title_full |
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things |
title_fullStr |
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things |
title_full_unstemmed |
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things |
title_sort |
adaptive routing strategy based on improved double q-learning for satellite internet of things |
publisher |
Hindawi-Wiley |
series |
Security and Communication Networks |
issn |
1939-0122 |
publishDate |
2021-01-01 |
description |
Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies. |
url |
http://dx.doi.org/10.1155/2021/5530023 |
work_keys_str_mv |
AT jianzhou adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT xiaotiangong adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT lijuansun adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT yongxie adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT xiaoyongyan adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings |
_version_ |
1714634937582223360 |