Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks

In multimedia forensics, many efforts have been made to detect whether an image is pristine or manipulated with high enough accuracies based on specially designed features and classifiers in the past decade. However, the important task for localizing the tampering regions in a fake image still faces...

Full description

Bibliographic Details
Main Authors: Zenan Shi, Xuanjing Shen, Hui Kang, Yingda Lv
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8554259/
id doaj-fef93410d6f1476bb491750d3a02ac0f
record_format Article
spelling doaj-fef93410d6f1476bb491750d3a02ac0f2021-03-29T21:38:52ZengIEEEIEEE Access2169-35362018-01-016764377645310.1109/ACCESS.2018.28835888554259Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural NetworksZenan Shi0https://orcid.org/0000-0001-8554-4127Xuanjing Shen1Hui Kang2Yingda Lv3College of Computer Science and Technology, Jilin University, Changchun, ChinaCollege of Computer Science and Technology, Jilin University, Changchun, ChinaCollege of Computer Science and Technology, Jilin University, Changchun, ChinaCenter for Computer Fundamental Education, Jilin University, Changchun, ChinaIn multimedia forensics, many efforts have been made to detect whether an image is pristine or manipulated with high enough accuracies based on specially designed features and classifiers in the past decade. However, the important task for localizing the tampering regions in a fake image still faces more challenges compared with the manipulation detection and relatively a few algorithms attempt to tackle it. With this in mind, a technique that utilizes the dual-domain-based convolutional neural networks (D-CNNs) taking different kinds of input into consideration is proposed in this paper. In the proposed framework, two sub-networks, named the spatial-domain CNN model (Sub-SCNN) and the frequency-domain-based CNN model (Sub-FCNN), are designed and trained, respectively. With the well-trained parameters, a transfer policy is applied to the training process of the D-CNN. While CNNs are capable of learning classification features directly from data, in their standard form they tend to learn features related to the image's content. To overcome this issue in image forensics tasks, a new image pre-processing layer is proposed to jointly suppress image's content and adaptively learn manipulation detection and localization features. After investigating the properties of datasets, two post-processing operations are finally proposed and compared to obtain the final results of the pixel-wise manipulation region localization. The D-CNNs is trained and validated using 75 percent of images in the CASIA v2.0 and tested using the remaining images in the CASIA v2.0, all images in Columbia Uncompressed and Carvalho datasets. The extensive experiments show that the proposed post-processing operations optimize the final tamper probability map, and our framework with the combination of Sub-SCNN and Sub-FCNN significantly outperforms the state-of-art techniques with the best F1 scores on the datasets.https://ieeexplore.ieee.org/document/8554259/Convolutional neural networksmultimedia forensicspost-processing operationtransfer policy
collection DOAJ
language English
format Article
sources DOAJ
author Zenan Shi
Xuanjing Shen
Hui Kang
Yingda Lv
spellingShingle Zenan Shi
Xuanjing Shen
Hui Kang
Yingda Lv
Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
IEEE Access
Convolutional neural networks
multimedia forensics
post-processing operation
transfer policy
author_facet Zenan Shi
Xuanjing Shen
Hui Kang
Yingda Lv
author_sort Zenan Shi
title Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
title_short Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
title_full Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
title_fullStr Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
title_full_unstemmed Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
title_sort image manipulation detection and localization based on the dual-domain convolutional neural networks
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description In multimedia forensics, many efforts have been made to detect whether an image is pristine or manipulated with high enough accuracies based on specially designed features and classifiers in the past decade. However, the important task for localizing the tampering regions in a fake image still faces more challenges compared with the manipulation detection and relatively a few algorithms attempt to tackle it. With this in mind, a technique that utilizes the dual-domain-based convolutional neural networks (D-CNNs) taking different kinds of input into consideration is proposed in this paper. In the proposed framework, two sub-networks, named the spatial-domain CNN model (Sub-SCNN) and the frequency-domain-based CNN model (Sub-FCNN), are designed and trained, respectively. With the well-trained parameters, a transfer policy is applied to the training process of the D-CNN. While CNNs are capable of learning classification features directly from data, in their standard form they tend to learn features related to the image's content. To overcome this issue in image forensics tasks, a new image pre-processing layer is proposed to jointly suppress image's content and adaptively learn manipulation detection and localization features. After investigating the properties of datasets, two post-processing operations are finally proposed and compared to obtain the final results of the pixel-wise manipulation region localization. The D-CNNs is trained and validated using 75 percent of images in the CASIA v2.0 and tested using the remaining images in the CASIA v2.0, all images in Columbia Uncompressed and Carvalho datasets. The extensive experiments show that the proposed post-processing operations optimize the final tamper probability map, and our framework with the combination of Sub-SCNN and Sub-FCNN significantly outperforms the state-of-art techniques with the best F1 scores on the datasets.
topic Convolutional neural networks
multimedia forensics
post-processing operation
transfer policy
url https://ieeexplore.ieee.org/document/8554259/
work_keys_str_mv AT zenanshi imagemanipulationdetectionandlocalizationbasedonthedualdomainconvolutionalneuralnetworks
AT xuanjingshen imagemanipulationdetectionandlocalizationbasedonthedualdomainconvolutionalneuralnetworks
AT huikang imagemanipulationdetectionandlocalizationbasedonthedualdomainconvolutionalneuralnetworks
AT yingdalv imagemanipulationdetectionandlocalizationbasedonthedualdomainconvolutionalneuralnetworks
_version_ 1724192586747346944