Convolutional Edge Constraint-Based U-Net for Salient Object Detection

The salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is th...

Full description

Bibliographic Details
Main Authors: Le Han, Xuelong Li, Yongsheng Dong
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8689037/
id doaj-3480b61f11354975bf8bea5bc6e9ac83
record_format Article
spelling doaj-3480b61f11354975bf8bea5bc6e9ac832021-03-29T22:32:27ZengIEEEIEEE Access2169-35362019-01-017488904890010.1109/ACCESS.2019.29105728689037Convolutional Edge Constraint-Based U-Net for Salient Object DetectionLe Han0Xuelong Li1Yongsheng Dong2https://orcid.org/0000-0002-6281-9658Key Laboratory of Spectral Imaging Technology CAS, Xi’an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an, ChinaSchool of Computer Science and Center for OPTical IMagery Analysis and Learning (OPTIMAL), Northwestern Polytechnical University, Xi’an, ChinaKey Laboratory of Spectral Imaging Technology CAS, Xi’an Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xi’an, ChinaThe salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is that the hand-crafted features are the main basis for existing traditional methods to predict salient objects, which results in different pixels belonging to the same object often being predicted different saliency scores. Besides, the convolutional neural network (CNN)-based models predict saliency maps at patch scale, which causes the objects edges of the output to be fuzzy. In this paper, we attempt to add an edge convolution constraint to a modified U-Net to predict the saliency map of the image. The network structure we adopt can fuse the features of different layers to reduce the loss of information. Our SalNet predicts the saliency map pixel-by-pixel, rather than at the patch scale as the CNN-based models do. Moreover, in order to better guide the network mining the information of objects edges, we design a new loss function based on image convolution, which adds an L1 constraint to the edge information of saliency map and ground-truth. Finally, experimental results reveal that our SalNet is effective in salient object detection task and is also competitive when compared with 11 state-of-the-art models.https://ieeexplore.ieee.org/document/8689037/Encoder-decoder architectureimage convolutionedge extractionsalient object detectionskip connectionU-Net
collection DOAJ
language English
format Article
sources DOAJ
author Le Han
Xuelong Li
Yongsheng Dong
spellingShingle Le Han
Xuelong Li
Yongsheng Dong
Convolutional Edge Constraint-Based U-Net for Salient Object Detection
IEEE Access
Encoder-decoder architecture
image convolution
edge extraction
salient object detection
skip connection
U-Net
author_facet Le Han
Xuelong Li
Yongsheng Dong
author_sort Le Han
title Convolutional Edge Constraint-Based U-Net for Salient Object Detection
title_short Convolutional Edge Constraint-Based U-Net for Salient Object Detection
title_full Convolutional Edge Constraint-Based U-Net for Salient Object Detection
title_fullStr Convolutional Edge Constraint-Based U-Net for Salient Object Detection
title_full_unstemmed Convolutional Edge Constraint-Based U-Net for Salient Object Detection
title_sort convolutional edge constraint-based u-net for salient object detection
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The salient object detection is receiving more and more attention from researchers. An accurate saliency map will be useful for subsequent tasks. However, in most saliency maps predicted by existing models, the objects regions are very blurred and the edges of objects are irregular. The reason is that the hand-crafted features are the main basis for existing traditional methods to predict salient objects, which results in different pixels belonging to the same object often being predicted different saliency scores. Besides, the convolutional neural network (CNN)-based models predict saliency maps at patch scale, which causes the objects edges of the output to be fuzzy. In this paper, we attempt to add an edge convolution constraint to a modified U-Net to predict the saliency map of the image. The network structure we adopt can fuse the features of different layers to reduce the loss of information. Our SalNet predicts the saliency map pixel-by-pixel, rather than at the patch scale as the CNN-based models do. Moreover, in order to better guide the network mining the information of objects edges, we design a new loss function based on image convolution, which adds an L1 constraint to the edge information of saliency map and ground-truth. Finally, experimental results reveal that our SalNet is effective in salient object detection task and is also competitive when compared with 11 state-of-the-art models.
topic Encoder-decoder architecture
image convolution
edge extraction
salient object detection
skip connection
U-Net
url https://ieeexplore.ieee.org/document/8689037/
work_keys_str_mv AT lehan convolutionaledgeconstraintbasedunetforsalientobjectdetection
AT xuelongli convolutionaledgeconstraintbasedunetforsalientobjectdetection
AT yongshengdong convolutionaledgeconstraintbasedunetforsalientobjectdetection
_version_ 1724191358960271360