Convolutional Edge Constraint-Based U-Net for Salient Object Detection
The salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is th...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8689037/ |
id |
doaj-3480b61f11354975bf8bea5bc6e9ac83 |
---|---|
record_format |
Article |
spelling |
doaj-3480b61f11354975bf8bea5bc6e9ac832021-03-29T22:32:27ZengIEEEIEEE Access2169-35362019-01-017488904890010.1109/ACCESS.2019.29105728689037Convolutional Edge Constraint-Based U-Net for Salient Object DetectionLe Han0Xuelong Li1Yongsheng Dong2https://orcid.org/0000-0002-6281-9658Key Laboratory of Spectral Imaging Technology CAS, Xi’an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an, ChinaSchool of Computer Science and Center for OPTical IMagery Analysis and Learning (OPTIMAL), Northwestern Polytechnical University, Xi’an, ChinaKey Laboratory of Spectral Imaging Technology CAS, Xi’an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an, ChinaThe salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is that the hand-crafted features are the main basis for existing traditional methods to predict salient objects, which results in different pixels belonging to the same object often being predicted different saliency scores. Besides, the convolutional neural network (CNN)-based models predict saliency maps at patch scale, which causes the objects edges of the output to be fuzzy. In this paper, we attempt to add an edge convolution constraint to a modified U-Net to predict the saliency map of the image. The network structure we adopt can fuse the features of different layers to reduce the loss of information. Our SalNet predicts the saliency map pixel-by-pixel, rather than at the patch scale as the CNN-based models do. Moreover, in order to better guide the network mining the information of objects edges, we design a new loss function based on image convolution, which adds an L1 constraint to the edge information of saliency map and ground-truth. Finally, experimental results reveal that our SalNet is effective in salient object detection task and is also competitive when compared with 11 state-of-the-art models.https://ieeexplore.ieee.org/document/8689037/Encoder-decoder architectureimage convolutionedge extractionsalient object detectionskip connectionU-Net |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Le Han Xuelong Li Yongsheng Dong |
spellingShingle |
Le Han Xuelong Li Yongsheng Dong Convolutional Edge Constraint-Based U-Net for Salient Object Detection IEEE Access Encoder-decoder architecture image convolution edge extraction salient object detection skip connection U-Net |
author_facet |
Le Han Xuelong Li Yongsheng Dong |
author_sort |
Le Han |
title |
Convolutional Edge Constraint-Based U-Net for Salient Object Detection |
title_short |
Convolutional Edge Constraint-Based U-Net for Salient Object Detection |
title_full |
Convolutional Edge Constraint-Based U-Net for Salient Object Detection |
title_fullStr |
Convolutional Edge Constraint-Based U-Net for Salient Object Detection |
title_full_unstemmed |
Convolutional Edge Constraint-Based U-Net for Salient Object Detection |
title_sort |
convolutional edge constraint-based u-net for salient object detection |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is that the hand-crafted features are the main basis for existing traditional methods to predict salient objects, which results in different pixels belonging to the same object often being predicted different saliency scores. Besides, the convolutional neural network (CNN)-based models predict saliency maps at patch scale, which causes the objects edges of the output to be fuzzy. In this paper, we attempt to add an edge convolution constraint to a modified U-Net to predict the saliency map of the image. The network structure we adopt can fuse the features of different layers to reduce the loss of information. Our SalNet predicts the saliency map pixel-by-pixel, rather than at the patch scale as the CNN-based models do. Moreover, in order to better guide the network mining the information of objects edges, we design a new loss function based on image convolution, which adds an L1 constraint to the edge information of saliency map and ground-truth. Finally, experimental results reveal that our SalNet is effective in salient object detection task and is also competitive when compared with 11 state-of-the-art models. |
topic |
Encoder-decoder architecture image convolution edge extraction salient object detection skip connection U-Net |
url |
https://ieeexplore.ieee.org/document/8689037/ |
work_keys_str_mv |
AT lehan convolutionaledgeconstraintbasedunetforsalientobjectdetection AT xuelongli convolutionaledgeconstraintbasedunetforsalientobjectdetection AT yongshengdong convolutionaledgeconstraintbasedunetforsalientobjectdetection |
_version_ |
1724191358960271360 |