Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning

The false data injection (FDI) attack is a potential threat to the security of smart grids, and therefore, such threats should be assessed carefully. This paper proposes a self-governing FDI attack method with exploitation and exploration mechanisms and then evaluates its threat to power systems. Th...

Full description

Bibliographic Details
Main Authors: Zhisheng Wang, Ying Chen, Feng Liu, Yue Xia, Xuemin Zhang
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8429074/
id doaj-1c917898555441ebb2a5cc286386ed1b
record_format Article
spelling doaj-1c917898555441ebb2a5cc286386ed1b2021-03-29T21:11:17ZengIEEEIEEE Access2169-35362018-01-016487854879610.1109/ACCESS.2018.28565208429074Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement LearningZhisheng Wang0https://orcid.org/0000-0002-2907-755XYing Chen1Feng Liu2Yue Xia3Xuemin Zhang4Department of Electrical Engineering, Tsinghua University, Beijing, ChinaDepartment of Electrical Engineering, Tsinghua University, Beijing, ChinaDepartment of Electrical Engineering, Tsinghua University, Beijing, ChinaDepartment of Electrical Engineering, Tsinghua University, Beijing, ChinaDepartment of Electrical Engineering, Tsinghua University, Beijing, ChinaThe false data injection (FDI) attack is a potential threat to the security of smart grids, and therefore, such threats should be assessed carefully. This paper proposes a self-governing FDI attack method with exploitation and exploration mechanisms and then evaluates its threat to power systems. The attack is executed by viruses embedded in monitoring systems in substations. First, the FDI attack is formulated as a partially observable Markov decision process. Then, an improved online reinforcement learning method called nearest sequence memory Q-learning is adopted to make the attack more effective. Finally, propagation, an inherent property of viruses, is described using a propagation-evolution model that serves as the exploration mechanism for the proposed FDI attack. To validate the proposed attack method, cosimulations of daily operations of the IEEE 39-bus system are performed in which both the automatic voltage control system and the proposed FDI attack are modeled. Test results show that the proposed FDI method can cause voltage collapse even if only a few substations are infected.https://ieeexplore.ieee.org/document/8429074/Cybersecurityfalse data injectionpartially observable Markov decision processnearest sequence memory Q-learning
collection DOAJ
language English
format Article
sources DOAJ
author Zhisheng Wang
Ying Chen
Feng Liu
Yue Xia
Xuemin Zhang
spellingShingle Zhisheng Wang
Ying Chen
Feng Liu
Yue Xia
Xuemin Zhang
Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
IEEE Access
Cybersecurity
false data injection
partially observable Markov decision process
nearest sequence memory Q-learning
author_facet Zhisheng Wang
Ying Chen
Feng Liu
Yue Xia
Xuemin Zhang
author_sort Zhisheng Wang
title Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
title_short Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
title_full Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
title_fullStr Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
title_full_unstemmed Power System Security Under False Data Injection Attacks With Exploitation and Exploration Based on Reinforcement Learning
title_sort power system security under false data injection attacks with exploitation and exploration based on reinforcement learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description The false data injection (FDI) attack is a potential threat to the security of smart grids, and therefore, such threats should be assessed carefully. This paper proposes a self-governing FDI attack method with exploitation and exploration mechanisms and then evaluates its threat to power systems. The attack is executed by viruses embedded in monitoring systems in substations. First, the FDI attack is formulated as a partially observable Markov decision process. Then, an improved online reinforcement learning method called nearest sequence memory Q-learning is adopted to make the attack more effective. Finally, propagation, an inherent property of viruses, is described using a propagation-evolution model that serves as the exploration mechanism for the proposed FDI attack. To validate the proposed attack method, cosimulations of daily operations of the IEEE 39-bus system are performed in which both the automatic voltage control system and the proposed FDI attack are modeled. Test results show that the proposed FDI method can cause voltage collapse even if only a few substations are infected.
topic Cybersecurity
false data injection
partially observable Markov decision process
nearest sequence memory Q-learning
url https://ieeexplore.ieee.org/document/8429074/
work_keys_str_mv AT zhishengwang powersystemsecurityunderfalsedatainjectionattackswithexploitationandexplorationbasedonreinforcementlearning
AT yingchen powersystemsecurityunderfalsedatainjectionattackswithexploitationandexplorationbasedonreinforcementlearning
AT fengliu powersystemsecurityunderfalsedatainjectionattackswithexploitationandexplorationbasedonreinforcementlearning
AT yuexia powersystemsecurityunderfalsedatainjectionattackswithexploitationandexplorationbasedonreinforcementlearning
AT xueminzhang powersystemsecurityunderfalsedatainjectionattackswithexploitationandexplorationbasedonreinforcementlearning
_version_ 1724193381524963328