Unsupervised Multi-Scale-Stage Content-Aware Homography Estimation
Homography estimation is a critical component in many computer-vision tasks. However, most deep homography methods focus on extracting local features and ignore global features or the corresponding relationship between features from two images or video frames. These methods are effective for alignme...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2023
|
Subjects: | |
Online Access: | View Fulltext in Publisher View in Scopus |
LEADER | 01817nam a2200229Ia 4500 | ||
---|---|---|---|
001 | 10.3390-electronics12091976 | ||
008 | 230529s2023 CNT 000 0 und d | ||
020 | |a 20799292 (ISSN) | ||
245 | 1 | 0 | |a Unsupervised Multi-Scale-Stage Content-Aware Homography Estimation |
260 | 0 | |b MDPI |c 2023 | |
856 | |z View Fulltext in Publisher |u https://doi.org/10.3390/electronics12091976 | ||
856 | |z View in Scopus |u https://www.scopus.com/inward/record.uri?eid=2-s2.0-85159186358&doi=10.3390%2felectronics12091976&partnerID=40&md5=7a8a1c203b18442094b322831d5c52f0 | ||
520 | 3 | |a Homography estimation is a critical component in many computer-vision tasks. However, most deep homography methods focus on extracting local features and ignore global features or the corresponding relationship between features from two images or video frames. These methods are effective for alignment of image pairs with small displacement. In this paper, we propose an unsupervised Multi-Scale-Stage Content-Aware Homography Estimation Network (MS2CA-HENet). In the framework, we use multi-scale input images for different stages to cope with different scales of transformations. In each stage, we consider local and global features via our Self-Attention-augmented ConvNet (SAC). Furthermore, feature matching is explicitly enhanced using feature-matching modules. By shrinking the error residual of each stage, our network achieves coarse-to-fine results. Experiments show that our MS2CA-HENet achieves better results than other methods. © 2023 by the authors. | |
650 | 0 | 4 | |a feature matching |
650 | 0 | 4 | |a multi-scale |
650 | 0 | 4 | |a multi-stage |
650 | 0 | 4 | |a self-attention-augmented ConvNet |
650 | 0 | 4 | |a unsupervised |
700 | 1 | 0 | |a Hou, B. |e author |
700 | 1 | 0 | |a Ren, J. |e author |
700 | 1 | 0 | |a Yan, W. |e author |
773 | |t Electronics (Switzerland) |