A Multiscale Instance Segmentation Method Based on Cleaning Rubber Ball Images

The identification of wear rubber balls in the rubber ball cleaning system in heat exchange equipment directly affects the descaling efficiency. For the problem that the rubber ball image contains impurities and bubbles and the segmentation is low in real time, a multi-scale feature fusion real-time...

Full description

Bibliographic Details
Main Authors: Liang, E. (Author), Su, E. (Author), Tian, Y. (Author), Wang, J. (Author), Zhang, Y. (Author)
Format: Article
Language:English
Published: MDPI 2023
Subjects:
Online Access:View Fulltext in Publisher
View in Scopus
LEADER 03514nam a2200469Ia 4500
001 10.3390-s23094261
008 230529s2023 CNT 000 0 und d
020 |a 14248220 (ISSN) 
245 1 0 |a A Multiscale Instance Segmentation Method Based on Cleaning Rubber Ball Images 
260 0 |b MDPI  |c 2023 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3390/s23094261 
856 |z View in Scopus  |u https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159217257&doi=10.3390%2fs23094261&partnerID=40&md5=b8488e08b3ef49e878b111f93f718d2a 
520 3 |a The identification of wear rubber balls in the rubber ball cleaning system in heat exchange equipment directly affects the descaling efficiency. For the problem that the rubber ball image contains impurities and bubbles and the segmentation is low in real time, a multi-scale feature fusion real-time instance segmentation model based on the attention mechanism is proposed for the object segmentation of the rubber ball images. First, we introduce the Pyramid Vision Transformer instead of the convolution module in the backbone network and use the spatial-reduction attention layer of the transformer to improve the feature extraction ability across scales and spatial reduction to reduce computational cost; Second, we improve the feature fusion module to fuse image features across scales, combined with an attention mechanism to enhance the output feature representation; Third, the prediction head separates the mask branches separately. Combined with dynamic convolution, it improves the accuracy of the mask coefficients and increases the number of upsampling layers. It also connects the penultimate layer with the second layer feature map to achieve detection of smaller images with larger feature maps to improve the accuracy. Through the validation of the produced rubber ball dataset, the Dice score, Jaccard coefficient, and mAP of the actual segmented region of this network with the rubber ball dataset are improved by 4.5%, 4.7%, and 7.73%, respectively, and our model achieves 33.6 fps segmentation speed and 79.3% segmentation accuracy. Meanwhile, the average precision of Box and Mask can also meet the requirements under different IOU thresholds. We compared the DeepMask, Mask R-CNN, BlendMask, SOLOv1 and SOLOv2 instance segmentation networks with this model in terms of training accuracy and segmentation speed and obtained good results. The proposed modules can work together to better handle object details and achieve better segmentation performance. © 2023 by the authors. 
650 0 4 |a attention mechanism 
650 0 4 |a Attention mechanisms 
650 0 4 |a Cleaning 
650 0 4 |a Cleaning system 
650 0 4 |a Convolution 
650 0 4 |a Feature extraction 
650 0 4 |a feature fusion 
650 0 4 |a Features fusions 
650 0 4 |a Image enhancement 
650 0 4 |a Image fusion 
650 0 4 |a image segmentation 
650 0 4 |a Image segmentation 
650 0 4 |a Images segmentations 
650 0 4 |a polarized self-attention 
650 0 4 |a Polarized self-attention 
650 0 4 |a Pyramid vision transformer 
650 0 4 |a Pyramid Vision Transformer 
650 0 4 |a Real- time 
650 0 4 |a Rubber 
650 0 4 |a rubber ball cleaning system 
650 0 4 |a Rubber ball cleaning system 
650 0 4 |a Rubber balls 
650 0 4 |a Spatial reduction 
700 1 0 |a Liang, E.  |e author 
700 1 0 |a Su, E.  |e author 
700 1 0 |a Tian, Y.  |e author 
700 1 0 |a Wang, J.  |e author 
700 1 0 |a Zhang, Y.  |e author 
773 |t Sensors