Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles

Efficient and robust visual localization is important for autonomous vehicles. By achieving impressive localization accuracy under conditions of significant changes, ConvNet landmark-based approach has attracted the attention of people in several research communities including autonomous vehicles. S...

Full description

Bibliographic Details
Main Authors: Yi Hou, Hong Zhang, Shilin Zhou, Huanxin Zou
Format: Article
Language:English
Published: Hindawi Limited 2017-01-01
Series:Mobile Information Systems
Online Access:http://dx.doi.org/10.1155/2017/8104386
id doaj-e28b0ac2cd5c451187b856459ec09e6b
record_format Article
spelling doaj-e28b0ac2cd5c451187b856459ec09e6b2021-07-02T09:48:32ZengHindawi LimitedMobile Information Systems1574-017X1875-905X2017-01-01201710.1155/2017/81043868104386Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous VehiclesYi Hou0Hong Zhang1Shilin Zhou2Huanxin Zou3College of Electronic Science and Engineering, National University of Defense Technology, Changsha, Hunan, ChinaDepartment of Computing Science, University of Alberta, Edmonton, AB, T6G 2E8, CanadaCollege of Electronic Science and Engineering, National University of Defense Technology, Changsha, Hunan, ChinaCollege of Electronic Science and Engineering, National University of Defense Technology, Changsha, Hunan, ChinaEfficient and robust visual localization is important for autonomous vehicles. By achieving impressive localization accuracy under conditions of significant changes, ConvNet landmark-based approach has attracted the attention of people in several research communities including autonomous vehicles. Such an approach relies heavily on the outstanding discrimination power of ConvNet features to match detected landmarks between images. However, a major challenge of this approach is how to extract discriminative ConvNet features efficiently. To address this challenging, inspired by the high efficiency of the region of interest (RoI) pooling layer, we propose a Multiple RoI (MRoI) pooling technique, an enhancement of RoI, and a simple yet efficient ConvNet feature extraction method. Our idea is to leverage MRoI pooling to exploit multilevel and multiresolution information from multiple convolutional layers and then fuse them to improve the discrimination capacity of the final ConvNet features. The main advantages of our method are (a) high computational efficiency for real-time applications; (b) GPU memory efficiency for mobile applications; and (c) use of pretrained model without fine-tuning or retraining for easy implementation. Experimental results on four datasets have demonstrated not only the above advantages but also the high discriminating power of the extracted ConvNet features with state-of-the-art localization accuracy.http://dx.doi.org/10.1155/2017/8104386
collection DOAJ
language English
format Article
sources DOAJ
author Yi Hou
Hong Zhang
Shilin Zhou
Huanxin Zou
spellingShingle Yi Hou
Hong Zhang
Shilin Zhou
Huanxin Zou
Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
Mobile Information Systems
author_facet Yi Hou
Hong Zhang
Shilin Zhou
Huanxin Zou
author_sort Yi Hou
title Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
title_short Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
title_full Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
title_fullStr Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
title_full_unstemmed Efficient ConvNet Feature Extraction with Multiple RoI Pooling for Landmark-Based Visual Localization of Autonomous Vehicles
title_sort efficient convnet feature extraction with multiple roi pooling for landmark-based visual localization of autonomous vehicles
publisher Hindawi Limited
series Mobile Information Systems
issn 1574-017X
1875-905X
publishDate 2017-01-01
description Efficient and robust visual localization is important for autonomous vehicles. By achieving impressive localization accuracy under conditions of significant changes, ConvNet landmark-based approach has attracted the attention of people in several research communities including autonomous vehicles. Such an approach relies heavily on the outstanding discrimination power of ConvNet features to match detected landmarks between images. However, a major challenge of this approach is how to extract discriminative ConvNet features efficiently. To address this challenging, inspired by the high efficiency of the region of interest (RoI) pooling layer, we propose a Multiple RoI (MRoI) pooling technique, an enhancement of RoI, and a simple yet efficient ConvNet feature extraction method. Our idea is to leverage MRoI pooling to exploit multilevel and multiresolution information from multiple convolutional layers and then fuse them to improve the discrimination capacity of the final ConvNet features. The main advantages of our method are (a) high computational efficiency for real-time applications; (b) GPU memory efficiency for mobile applications; and (c) use of pretrained model without fine-tuning or retraining for easy implementation. Experimental results on four datasets have demonstrated not only the above advantages but also the high discriminating power of the extracted ConvNet features with state-of-the-art localization accuracy.
url http://dx.doi.org/10.1155/2017/8104386
work_keys_str_mv AT yihou efficientconvnetfeatureextractionwithmultipleroipoolingforlandmarkbasedvisuallocalizationofautonomousvehicles
AT hongzhang efficientconvnetfeatureextractionwithmultipleroipoolingforlandmarkbasedvisuallocalizationofautonomousvehicles
AT shilinzhou efficientconvnetfeatureextractionwithmultipleroipoolingforlandmarkbasedvisuallocalizationofautonomousvehicles
AT huanxinzou efficientconvnetfeatureextractionwithmultipleroipoolingforlandmarkbasedvisuallocalizationofautonomousvehicles
_version_ 1721332813711540224