Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things

Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challe...

Full description

Bibliographic Details
Main Authors: Jian Zhou, Xiaotian Gong, Lijuan Sun, Yong Xie, Xiaoyong Yan
Format: Article
Language:English
Published: Hindawi-Wiley 2021-01-01
Series:Security and Communication Networks
Online Access:http://dx.doi.org/10.1155/2021/5530023
id doaj-882fec3a62f14da98bb9b96bdfa1b9d3
record_format Article
spelling doaj-882fec3a62f14da98bb9b96bdfa1b9d32021-05-03T00:01:14ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/5530023Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of ThingsJian Zhou0Xiaotian Gong1Lijuan Sun2Yong Xie3Xiaoyong Yan4College of ComputerCollege of ComputerCollege of ComputerCollege of ComputerCollege of ComputerSatellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies.http://dx.doi.org/10.1155/2021/5530023
collection DOAJ
language English
format Article
sources DOAJ
author Jian Zhou
Xiaotian Gong
Lijuan Sun
Yong Xie
Xiaoyong Yan
spellingShingle Jian Zhou
Xiaotian Gong
Lijuan Sun
Yong Xie
Xiaoyong Yan
Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
Security and Communication Networks
author_facet Jian Zhou
Xiaotian Gong
Lijuan Sun
Yong Xie
Xiaoyong Yan
author_sort Jian Zhou
title Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_short Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_full Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_fullStr Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_full_unstemmed Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_sort adaptive routing strategy based on improved double q-learning for satellite internet of things
publisher Hindawi-Wiley
series Security and Communication Networks
issn 1939-0122
publishDate 2021-01-01
description Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies.
url http://dx.doi.org/10.1155/2021/5530023
work_keys_str_mv AT jianzhou adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
AT xiaotiangong adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
AT lijuansun adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
AT yongxie adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
AT xiaoyongyan adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
_version_ 1714634937582223360