Co-Training for Deep Object Detection: Comparing Single-Modal and Multi-Modal Approaches

Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale...

Bibliographic Details
Main Authors: Jose L. Gómez, Gabriel Villalonga, Antonio M. López
Format: Article
Language: English
Published: MDPI AG 2021-05-01
Series: Sensors
Online Access: https://www.mdpi.com/1424-8220/21/9/3185