Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
The phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alar...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8701426/ |
id |
doaj-9565fbc655f043eba1413a4ec0dfd272 |
---|---|
record_format |
Article |
spelling |
doaj-9565fbc655f043eba1413a4ec0dfd2722021-03-29T22:41:05ZengIEEEIEEE Access2169-35362019-01-017563295634010.1109/ACCESS.2019.29137058701426Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention MechanismYong Fang0Cheng Zhang1Cheng Huang2https://orcid.org/0000-0002-5871-946XLiang Liu3Yue Yang4College of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaThe phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alarming rate in recent years. Therefore, more effective phishing detection technology is needed to curb the threat of phishing emails. In this paper, we first analyzed the email structure. Then, based on an improved recurrent convolutional neural networks (RCNN) model with multilevel vectors and attention mechanism, we proposed a new phishing email detection model named THEMIS, which is used to model emails at the email header, the email body, the character level, and the word level simultaneously. To evaluate the effectiveness of THEMIS, we use an unbalanced dataset that has realistic ratios of phishing and legitimate emails. The experimental results show that the overall accuracy of THEMIS reaches 99.848%. Meanwhile, the false positive rate (FPR) is 0.043%. High accuracy and low FPR ensure that the filter can identify phishing emails with high probability and filter out legitimate emails as little as possible. This promising result is superior to the existing detection methods and verifies the effectiveness of THEMIS in detecting phishing emails.https://ieeexplore.ieee.org/document/8701426/Emailphishing detectionclassificationRCNNattention |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yong Fang Cheng Zhang Cheng Huang Liang Liu Yue Yang |
spellingShingle |
Yong Fang Cheng Zhang Cheng Huang Liang Liu Yue Yang Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism IEEE Access phishing detection classification RCNN attention |
author_facet |
Yong Fang Cheng Zhang Cheng Huang Liang Liu Yue Yang |
author_sort |
Yong Fang |
title |
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism |
title_short |
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism |
title_full |
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism |
title_fullStr |
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism |
title_full_unstemmed |
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism |
title_sort |
phishing email detection using improved rcnn model with multilevel vectors and attention mechanism |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alarming rate in recent years. Therefore, more effective phishing detection technology is needed to curb the threat of phishing emails. In this paper, we first analyzed the email structure. Then, based on an improved recurrent convolutional neural networks (RCNN) model with multilevel vectors and attention mechanism, we proposed a new phishing email detection model named THEMIS, which is used to model emails at the email header, the email body, the character level, and the word level simultaneously. To evaluate the effectiveness of THEMIS, we use an unbalanced dataset that has realistic ratios of phishing and legitimate emails. The experimental results show that the overall accuracy of THEMIS reaches 99.848%. Meanwhile, the false positive rate (FPR) is 0.043%. High accuracy and low FPR ensure that the filter can identify phishing emails with high probability and filter out legitimate emails as little as possible. This promising result is superior to the existing detection methods and verifies the effectiveness of THEMIS in detecting phishing emails. |
topic |
Email phishing detection classification RCNN attention |
url |
https://ieeexplore.ieee.org/document/8701426/ |
work_keys_str_mv |
AT yongfang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism AT chengzhang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism AT chenghuang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism AT liangliu phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism AT yueyang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism |
_version_ |
1724191061070315520 |