Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism

The phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alar...

Full description

Bibliographic Details
Main Authors: Yong Fang, Cheng Zhang, Cheng Huang, Liang Liu, Yue Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8701426/
id doaj-9565fbc655f043eba1413a4ec0dfd272
record_format Article
spelling doaj-9565fbc655f043eba1413a4ec0dfd2722021-03-29T22:41:05ZengIEEEIEEE Access2169-35362019-01-017563295634010.1109/ACCESS.2019.29137058701426Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention MechanismYong Fang0Cheng Zhang1Cheng Huang2https://orcid.org/0000-0002-5871-946XLiang Liu3Yue Yang4College of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaCollege of Cybersecurity, Sichuan University, Chengdu, ChinaThe phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alarming rate in recent years. Therefore, more effective phishing detection technology is needed to curb the threat of phishing emails. In this paper, we first analyzed the email structure. Then, based on an improved recurrent convolutional neural networks (RCNN) model with multilevel vectors and attention mechanism, we proposed a new phishing email detection model named THEMIS, which is used to model emails at the email header, the email body, the character level, and the word level simultaneously. To evaluate the effectiveness of THEMIS, we use an unbalanced dataset that has realistic ratios of phishing and legitimate emails. The experimental results show that the overall accuracy of THEMIS reaches 99.848%. Meanwhile, the false positive rate (FPR) is 0.043%. High accuracy and low FPR ensure that the filter can identify phishing emails with high probability and filter out legitimate emails as little as possible. This promising result is superior to the existing detection methods and verifies the effectiveness of THEMIS in detecting phishing emails.https://ieeexplore.ieee.org/document/8701426/Emailphishing detectionclassificationRCNNattention
collection DOAJ
language English
format Article
sources DOAJ
author Yong Fang
Cheng Zhang
Cheng Huang
Liang Liu
Yue Yang
spellingShingle Yong Fang
Cheng Zhang
Cheng Huang
Liang Liu
Yue Yang
Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
IEEE Access
Email
phishing detection
classification
RCNN
attention
author_facet Yong Fang
Cheng Zhang
Cheng Huang
Liang Liu
Yue Yang
author_sort Yong Fang
title Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
title_short Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
title_full Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
title_fullStr Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
title_full_unstemmed Phishing Email Detection Using Improved RCNN Model With Multilevel Vectors and Attention Mechanism
title_sort phishing email detection using improved rcnn model with multilevel vectors and attention mechanism
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The phishing email is one of the significant threats in the world today and has caused tremendous financial losses. Although the methods of confrontation are continually being updated, the results of those methods are not very satisfactory at present. Moreover, phishing emails are growing at an alarming rate in recent years. Therefore, more effective phishing detection technology is needed to curb the threat of phishing emails. In this paper, we first analyzed the email structure. Then, based on an improved recurrent convolutional neural networks (RCNN) model with multilevel vectors and attention mechanism, we proposed a new phishing email detection model named THEMIS, which is used to model emails at the email header, the email body, the character level, and the word level simultaneously. To evaluate the effectiveness of THEMIS, we use an unbalanced dataset that has realistic ratios of phishing and legitimate emails. The experimental results show that the overall accuracy of THEMIS reaches 99.848%. Meanwhile, the false positive rate (FPR) is 0.043%. High accuracy and low FPR ensure that the filter can identify phishing emails with high probability and filter out legitimate emails as little as possible. This promising result is superior to the existing detection methods and verifies the effectiveness of THEMIS in detecting phishing emails.
topic Email
phishing detection
classification
RCNN
attention
url https://ieeexplore.ieee.org/document/8701426/
work_keys_str_mv AT yongfang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism
AT chengzhang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism
AT chenghuang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism
AT liangliu phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism
AT yueyang phishingemaildetectionusingimprovedrcnnmodelwithmultilevelvectorsandattentionmechanism
_version_ 1724191061070315520