3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing

Since single sensor and high-density point cloud data processing have certain direct processing limitations in urban traffic scenarios, this paper proposes a 3D instance segmentation and object detection framework for urban transportation scenes based on the fusion of Lidar remote sensing technology...

Full description

Bibliographic Details
Main Authors: Ling Bai, Yinguo Li, Ming Cen, Fangchao Hu
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/13/16/3288
id doaj-3636134ad0664d159cc5377582228af0
record_format Article
spelling doaj-3636134ad0664d159cc5377582228af02021-08-26T14:17:55ZengMDPI AGRemote Sensing2072-42922021-08-01133288328810.3390/rs131632883D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image SensingLing Bai0Yinguo Li1Ming Cen2Fangchao Hu3Department of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaDepartment of Automation, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaDepartment of Automation, Chongqing University of Posts and Telecommunications, Chongqing 400065, ChinaDepartment of Mechanical Engineering, Chongqing University of Technology, Chongqing 400054, ChinaSince single sensor and high-density point cloud data processing have certain direct processing limitations in urban traffic scenarios, this paper proposes a 3D instance segmentation and object detection framework for urban transportation scenes based on the fusion of Lidar remote sensing technology and optical image sensing technology. Firstly, multi-source and multi-mode data pre-fusion and alignment of Lidar and camera sensor data are effectively carried out, and then a unique and innovative network of stereo regional proposal selective search-driven DAGNN is constructed. Finally, using the multi-dimensional information interaction, three-dimensional point clouds with multi-features and unique concave-convex geometric characteristics are instance over-segmented and clustered by the hypervoxel storage in the remarkable octree and growing voxels. Finally, the positioning and semantic information of significant 3D object detection in this paper are visualized by multi-dimensional mapping of the boundary box. The experimental results validate the effectiveness of the proposed framework with excellent feedback for small objects, object stacking, and object occlusion. It can be a remediable or alternative plan to a single sensor and provide an essential theoretical and application basis for remote sensing, autonomous driving, environment modeling, autonomous navigation, and path planning under the V2X intelligent network space– ground integration in the future.https://www.mdpi.com/2072-4292/13/16/3288urban transportationlidar and camera sensors fusion3D object detectionstereo regional proposal networkoctree-based hypervoxels
collection DOAJ
language English
format Article
sources DOAJ
author Ling Bai
Yinguo Li
Ming Cen
Fangchao Hu
spellingShingle Ling Bai
Yinguo Li
Ming Cen
Fangchao Hu
3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
Remote Sensing
urban transportation
lidar and camera sensors fusion
3D object detection
stereo regional proposal network
octree-based hypervoxels
author_facet Ling Bai
Yinguo Li
Ming Cen
Fangchao Hu
author_sort Ling Bai
title 3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
title_short 3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
title_full 3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
title_fullStr 3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
title_full_unstemmed 3D Instance Segmentation and Object Detection Framework Based on the Fusion of Lidar Remote Sensing and Optical Image Sensing
title_sort 3d instance segmentation and object detection framework based on the fusion of lidar remote sensing and optical image sensing
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2021-08-01
description Since single sensor and high-density point cloud data processing have certain direct processing limitations in urban traffic scenarios, this paper proposes a 3D instance segmentation and object detection framework for urban transportation scenes based on the fusion of Lidar remote sensing technology and optical image sensing technology. Firstly, multi-source and multi-mode data pre-fusion and alignment of Lidar and camera sensor data are effectively carried out, and then a unique and innovative network of stereo regional proposal selective search-driven DAGNN is constructed. Finally, using the multi-dimensional information interaction, three-dimensional point clouds with multi-features and unique concave-convex geometric characteristics are instance over-segmented and clustered by the hypervoxel storage in the remarkable octree and growing voxels. Finally, the positioning and semantic information of significant 3D object detection in this paper are visualized by multi-dimensional mapping of the boundary box. The experimental results validate the effectiveness of the proposed framework with excellent feedback for small objects, object stacking, and object occlusion. It can be a remediable or alternative plan to a single sensor and provide an essential theoretical and application basis for remote sensing, autonomous driving, environment modeling, autonomous navigation, and path planning under the V2X intelligent network space– ground integration in the future.
topic urban transportation
lidar and camera sensors fusion
3D object detection
stereo regional proposal network
octree-based hypervoxels
url https://www.mdpi.com/2072-4292/13/16/3288
work_keys_str_mv AT lingbai 3dinstancesegmentationandobjectdetectionframeworkbasedonthefusionoflidarremotesensingandopticalimagesensing
AT yinguoli 3dinstancesegmentationandobjectdetectionframeworkbasedonthefusionoflidarremotesensingandopticalimagesensing
AT mingcen 3dinstancesegmentationandobjectdetectionframeworkbasedonthefusionoflidarremotesensingandopticalimagesensing
AT fangchaohu 3dinstancesegmentationandobjectdetectionframeworkbasedonthefusionoflidarremotesensingandopticalimagesensing
_version_ 1721190253489815552