Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks
In multimedia forensics, many efforts have been made over the past decade to detect, with high accuracy, whether an image is pristine or manipulated, based on specially designed features and classifiers. However, the important task of localizing the tampered regions in a fake image still faces...
Main Authors: | Zenan Shi, Xuanjing Shen, Hui Kang, Yingda Lv |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2018-01-01 |
Series: | IEEE Access |
Subjects: | Convolutional neural networks, multimedia forensics, post-processing operation, transfer policy |
Online Access: | https://ieeexplore.ieee.org/document/8554259/ |
id |
doaj-fef93410d6f1476bb491750d3a02ac0f |
---|---|
record_format |
Article |
spelling |
id: doaj-fef93410d6f1476bb491750d3a02ac0f; timestamp: 2021-03-29T21:38:52Z; language: eng; publisher: IEEE; series: IEEE Access; ISSN: 2169-3536; published: 2018-01-01; volume: 6; pages: 76437-76453; DOI: 10.1109/ACCESS.2018.2883588; article number: 8554259; title: Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks; authors: Zenan Shi (https://orcid.org/0000-0001-8554-4127; College of Computer Science and Technology, Jilin University, Changchun, China), Xuanjing Shen (College of Computer Science and Technology, Jilin University, Changchun, China), Hui Kang (College of Computer Science and Technology, Jilin University, Changchun, China), Yingda Lv (Center for Computer Fundamental Education, Jilin University, Changchun, China); url: https://ieeexplore.ieee.org/document/8554259/; keywords: Convolutional neural networks, multimedia forensics, post-processing operation, transfer policy |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zenan Shi Xuanjing Shen Hui Kang Yingda Lv |
author_sort |
Zenan Shi |
title |
Image Manipulation Detection and Localization Based on the Dual-Domain Convolutional Neural Networks |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
In multimedia forensics, many efforts have been made over the past decade to detect, with high accuracy, whether an image is pristine or manipulated, based on specially designed features and classifiers. However, the important task of localizing the tampered regions in a fake image still faces more challenges than manipulation detection, and relatively few algorithms attempt to tackle it. With this in mind, this paper proposes a technique that uses dual-domain-based convolutional neural networks (D-CNNs), which take different kinds of input into consideration. In the proposed framework, two sub-networks, named the spatial-domain CNN model (Sub-SCNN) and the frequency-domain-based CNN model (Sub-FCNN), are designed and trained separately. With the well-trained parameters, a transfer policy is applied to the training process of the D-CNN. While CNNs are capable of learning classification features directly from data, in their standard form they tend to learn features related to the image's content. To overcome this issue in image forensics tasks, a new image pre-processing layer is proposed to jointly suppress an image's content and adaptively learn manipulation detection and localization features. After investigating the properties of the datasets, two post-processing operations are proposed and compared to obtain the final results of the pixel-wise manipulation region localization. The D-CNN is trained and validated using 75 percent of the images in CASIA v2.0 and tested using the remaining images in CASIA v2.0 and all images in the Columbia Uncompressed and Carvalho datasets. Extensive experiments show that the proposed post-processing operations improve the final tamper probability map, and that the framework combining Sub-SCNN and Sub-FCNN significantly outperforms state-of-the-art techniques, achieving the best F1 scores on these datasets. |
topic |
Convolutional neural networks multimedia forensics post-processing operation transfer policy |
url |
https://ieeexplore.ieee.org/document/8554259/ |
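
The abstract reports pixel-wise localization quality as F1 scores computed from a tamper probability map. The record does not say how the authors binarize the map or compute the score, so the following is only a minimal sketch of the standard pixel-wise F1 metric. The function names `binarize` and `pixel_f1`, the 0.5 threshold, and the handling of empty masks are all assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence


def binarize(prob_map: Sequence[Sequence[float]], threshold: float = 0.5) -> List[List[int]]:
    """Turn a tamper probability map into a 0/1 mask.

    The 0.5 threshold is an assumed value, not one stated in the record.
    """
    return [[1 if p >= threshold else 0 for p in row] for row in prob_map]


def pixel_f1(pred_mask: Sequence[Sequence[int]], true_mask: Sequence[Sequence[int]]) -> float:
    """Pixel-wise F1 = 2*TP / (2*TP + FP + FN), with 'tampered' as the positive class."""
    tp = fp = fn = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            if p and t:
                tp += 1          # tampered pixel correctly flagged
            elif p and not t:
                fp += 1          # pristine pixel wrongly flagged
            elif t and not p:
                fn += 1          # tampered pixel missed
    denom = 2 * tp + fp + fn
    # Assumed convention: two empty masks count as a perfect match.
    return 2 * tp / denom if denom else 1.0


# Hypothetical 2x2 example: one hit, one false alarm, one miss.
prob_map = [[0.9, 0.2], [0.7, 0.4]]
ground_truth = [[1, 0], [0, 1]]
predicted = binarize(prob_map)
score = pixel_f1(predicted, ground_truth)
```

In this toy example one tampered pixel is found, one pristine pixel is falsely flagged, and one tampered pixel is missed, so precision and recall are both 0.5 and so is the F1 score.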