YOLOrs: Object Detection in Multimodal Remote Sensing Imagery

Deep-learning object detection methods that are designed for computer vision applications tend to underperform when applied to remote sensing data. This is because contrary to computer vision, in remote sensing, training data are harder to collect and targets can be very small, occupying only a few...

Full description

Bibliographic Details
Main Authors: Manish Sharma, Mayur Dhanaraj, Srivallabha Karnam, Dimitris G. Chachlakis, Raymond Ptucha, Panos P. Markopoulos, Eli Saber
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9273212/
id doaj-0f0551c073f049a3bdaaa03800c5a3e0
record_format Article
spelling doaj-0f0551c073f049a3bdaaa03800c5a3e02021-06-03T23:07:45ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing2151-15352021-01-01141497150810.1109/JSTARS.2020.30413169273212YOLOrs: Object Detection in Multimodal Remote Sensing ImageryManish Sharma0Mayur Dhanaraj1https://orcid.org/0000-0001-5710-0561Srivallabha Karnam2Dimitris G. Chachlakis3https://orcid.org/0000-0002-6464-0723Raymond Ptucha4Panos P. Markopoulos5https://orcid.org/0000-0001-9686-779XEli Saber6Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology, Rochester, NY, USADepartment of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USADepartment of Computer Engineering, Rochester Institute of Technology, Rochester, NY, USADepartment of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USADepartment of Computer Engineering, Rochester Institute of Technology, Rochester, NY, USADepartment of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USADepartment of Electrical and Microelectronic Engineering, Rochester Institute of Technology, Rochester, NY, USADeep-learning object detection methods that are designed for computer vision applications tend to underperform when applied to remote sensing data. This is because contrary to computer vision, in remote sensing, training data are harder to collect and targets can be very small, occupying only a few pixels in the entire image, and exhibit arbitrary perspective transformations. Detection performance can improve by fusing data from multiple remote sensing modalities, including red, green, blue, infrared, hyperspectral, multispectral, synthetic aperture radar, and light detection and ranging, to name a few. In this article, we propose YOLOrs: a new convolutional neural network, specifically designed for real-time object detection in multimodal remote sensing imagery. YOLOrs can detect objects at multiple scales, with smaller receptive fields to account for small targets, as well as predict target orientations. In addition, YOLOrs introduces a novel mid-level fusion architecture that renders it applicable to multimodal aerial imagery. Our experimental studies compare YOLOrs with contemporary alternatives and corroborate its merits.https://ieeexplore.ieee.org/document/9273212/Aerial imageryfusionmultimodalobject detectionremote sensing (RS)
collection DOAJ
language English
format Article
sources DOAJ
author Manish Sharma
Mayur Dhanaraj
Srivallabha Karnam
Dimitris G. Chachlakis
Raymond Ptucha
Panos P. Markopoulos
Eli Saber
spellingShingle Manish Sharma
Mayur Dhanaraj
Srivallabha Karnam
Dimitris G. Chachlakis
Raymond Ptucha
Panos P. Markopoulos
Eli Saber
YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Aerial imagery
fusion
multimodal
object detection
remote sensing (RS)
author_facet Manish Sharma
Mayur Dhanaraj
Srivallabha Karnam
Dimitris G. Chachlakis
Raymond Ptucha
Panos P. Markopoulos
Eli Saber
author_sort Manish Sharma
title YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
title_short YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
title_full YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
title_fullStr YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
title_full_unstemmed YOLOrs: Object Detection in Multimodal Remote Sensing Imagery
title_sort yolors: object detection in multimodal remote sensing imagery
publisher IEEE
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
issn 2151-1535
publishDate 2021-01-01
description Deep-learning object detection methods that are designed for computer vision applications tend to underperform when applied to remote sensing data. This is because contrary to computer vision, in remote sensing, training data are harder to collect and targets can be very small, occupying only a few pixels in the entire image, and exhibit arbitrary perspective transformations. Detection performance can improve by fusing data from multiple remote sensing modalities, including red, green, blue, infrared, hyperspectral, multispectral, synthetic aperture radar, and light detection and ranging, to name a few. In this article, we propose YOLOrs: a new convolutional neural network, specifically designed for real-time object detection in multimodal remote sensing imagery. YOLOrs can detect objects at multiple scales, with smaller receptive fields to account for small targets, as well as predict target orientations. In addition, YOLOrs introduces a novel mid-level fusion architecture that renders it applicable to multimodal aerial imagery. Our experimental studies compare YOLOrs with contemporary alternatives and corroborate its merits.
topic Aerial imagery
fusion
multimodal
object detection
remote sensing (RS)
url https://ieeexplore.ieee.org/document/9273212/
work_keys_str_mv AT manishsharma yolorsobjectdetectioninmultimodalremotesensingimagery
AT mayurdhanaraj yolorsobjectdetectioninmultimodalremotesensingimagery
AT srivallabhakarnam yolorsobjectdetectioninmultimodalremotesensingimagery
AT dimitrisgchachlakis yolorsobjectdetectioninmultimodalremotesensingimagery
AT raymondptucha yolorsobjectdetectioninmultimodalremotesensingimagery
AT panospmarkopoulos yolorsobjectdetectioninmultimodalremotesensingimagery
AT elisaber yolorsobjectdetectioninmultimodalremotesensingimagery
_version_ 1721398530430468096