Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things

Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challe...

Full description

Bibliographic Details
Main Authors:	Jian Zhou, Xiaotian Gong, Lijuan Sun, Yong Xie, Xiaoyong Yan
Format:	Article
Language:	English
Published:	Hindawi-Wiley 2021-01-01
Series:	Security and Communication Networks
Online Access:	http://dx.doi.org/10.1155/2021/5530023

id	doaj-882fec3a62f14da98bb9b96bdfa1b9d3
record_format	Article
spelling	doaj-882fec3a62f14da98bb9b96bdfa1b9d32021-05-03T00:01:14ZengHindawi-WileySecurity and Communication Networks1939-01222021-01-01202110.1155/2021/5530023Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of ThingsJian Zhou0Xiaotian Gong1Lijuan Sun2Yong Xie3Xiaoyong Yan4College of ComputerCollege of ComputerCollege of ComputerCollege of ComputerCollege of ComputerSatellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies.http://dx.doi.org/10.1155/2021/5530023
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan
spellingShingle	Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things Security and Communication Networks
author_facet	Jian Zhou Xiaotian Gong Lijuan Sun Yong Xie Xiaoyong Yan
author_sort	Jian Zhou
title	Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_short	Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_full	Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_fullStr	Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_full_unstemmed	Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things
title_sort	adaptive routing strategy based on improved double q-learning for satellite internet of things
publisher	Hindawi-Wiley
series	Security and Communication Networks
issn	1939-0122
publishDate	2021-01-01
description	Satellite Internet of Things (S-IoT), which integrates satellite networks with IoT, is a new mobile Internet to provide services for social networks. However, affected by the dynamic changes of topology structure and node status, the efficient and secure forwarding of data packets in S-IoT is challenging. In view of the abovementioned problem, this paper proposes an adaptive routing strategy based on improved double Q-learning for S-IoT. First, the whole S-IoT is regarded as a reinforcement learning environment, and satellite nodes and ground nodes in S-IoT are both regarded as intelligent agents. Each node in the S-IoT maintains two Q tables, which are used for selecting the forwarding node and for evaluating the forwarding value, respectively. In addition, the next hop node of data packets is determined depending on the mixed Q value. Second, in order to optimize the Q value, this paper makes improvements on the mixed Q value, the reward value, and the discount factor, respectively, based on the congestion degree, the hop count, and the node status. Finally, we perform extensive simulations to evaluate the performance of this adaptive routing strategy in terms of delivery rate, average delay, and overhead ratio. Evaluation results demonstrate that the proposed strategy can achieve more efficient and secure routing in the highly dynamic environment compared with the state-of-the-art strategies.
url	http://dx.doi.org/10.1155/2021/5530023
work_keys_str_mv	AT jianzhou adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT xiaotiangong adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT lijuansun adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT yongxie adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings AT xiaoyongyan adaptiveroutingstrategybasedonimproveddoubleqlearningforsatelliteinternetofthings
_version_	1714634937582223360

Adaptive Routing Strategy Based on Improved Double Q-Learning for Satellite Internet of Things

Similar Items