Using a Vertical-Stream Variational Auto-Encoder to Generate Segment-Based Images and Its Biological Plausibility for Modelling the Visual Pathways

Human beings have a strong capability to identify objects in different viewpoints. Unlike computer vision that requires sufficient training samples in various scales and rotations, biological visual systems can efficiently recognize objects in diverse spatial states. To achieve this objective, image...

Full description

Bibliographic Details
Main Authors: Xue-Song Tang, Hui Wei, Kuangrong Hao
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8558493/
Description
Summary:Human beings have a strong capability to identify objects in different viewpoints. Unlike computer vision that requires sufficient training samples in various scales and rotations, biological visual systems can efficiently recognize objects in diverse spatial states. To achieve this objective, images are processed into a segment-based representation and then a vertical stream variational auto-encoder (VSVAE) is utilized to generate images based on the preprocessed segments in this study. The novel structure of the two vertical streams can be also considered as a computational model for the interaction between the ventral pathway and the dorsal pathway in the visual cortex. The reconstructive capability of the VSVAE is testified by using a series of geometric information sets to enhance the segment-based representation. By visualizing the learnt features in the hidden layers of VSVAE, the biological plausibility of the model is discussed. In addition, the proposed methodology is able to facilitate the classification accuracy, especially when the images are severely transformed.
ISSN:2169-3536