Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning

Zero Shot Learning (ZSL) aims to solve the classification problem with no training sample, and it is realized by transferring knowledge from source classes to target classes through the semantic embeddings bridging. Generalized ZSL (GZSL) enlarges the search scope of ZSL from only the seen classes t...

Full description

Bibliographic Details
Main Authors:	Pengzhen Du, Haofeng Zhang, Jianfeng Lu
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Generalized zero-shot learning linear discriminative analysis visual semantic alignment prototype synthesis
Online Access:	https://ieeexplore.ieee.org/document/9187726/

id	doaj-955ac205c421419cb0fb6f9872d8b005
record_format	Article
spelling	doaj-955ac205c421419cb0fb6f9872d8b0052021-03-30T03:33:29ZengIEEEIEEE Access2169-35362020-01-01816627316628210.1109/ACCESS.2020.30228059187726Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot LearningPengzhen Du0https://orcid.org/0000-0003-2089-5617Haofeng Zhang1https://orcid.org/0000-0002-4039-7618Jianfeng Lu2https://orcid.org/0000-0002-9190-507XSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaZero Shot Learning (ZSL) aims to solve the classification problem with no training sample, and it is realized by transferring knowledge from source classes to target classes through the semantic embeddings bridging. Generalized ZSL (GZSL) enlarges the search scope of ZSL from only the seen classes to all classes. A large number of methods are proposed for these two settings, and achieve competing performance. However, most of them still suffer from the domain shift problem due to the existence of the domain gap between the seen classes and unseen classes. In this article, we propose a novel method to learn discriminative features with visual-semantic alignment for GZSL. We define a latent space, where the visual features and semantic attributes are aligned, and assume that each prototype is the linear combination of others, where the coefficients are constrained to be the same in all three spaces. To make the latent space more discriminative, a linear discriminative analysis strategy is employed to learn the projection matrix from visual space to latent space. Five popular datasets are exploited to evaluate the proposed method, and the results demonstrate the superiority of our approach compared with the state-of-the-art methods. Beside, extensive ablation studies also show the effectiveness of each module in our method.https://ieeexplore.ieee.org/document/9187726/Generalized zero-shot learninglinear discriminative analysisvisual semantic alignmentprototype synthesis
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Pengzhen Du Haofeng Zhang Jianfeng Lu
spellingShingle	Pengzhen Du Haofeng Zhang Jianfeng Lu Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning IEEE Access Generalized zero-shot learning linear discriminative analysis visual semantic alignment prototype synthesis
author_facet	Pengzhen Du Haofeng Zhang Jianfeng Lu
author_sort	Pengzhen Du
title	Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning
title_short	Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning
title_full	Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning
title_fullStr	Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning
title_full_unstemmed	Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning
title_sort	learning discriminative projection with visual semantic alignment for generalized zero shot learning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	Zero Shot Learning (ZSL) aims to solve the classification problem with no training sample, and it is realized by transferring knowledge from source classes to target classes through the semantic embeddings bridging. Generalized ZSL (GZSL) enlarges the search scope of ZSL from only the seen classes to all classes. A large number of methods are proposed for these two settings, and achieve competing performance. However, most of them still suffer from the domain shift problem due to the existence of the domain gap between the seen classes and unseen classes. In this article, we propose a novel method to learn discriminative features with visual-semantic alignment for GZSL. We define a latent space, where the visual features and semantic attributes are aligned, and assume that each prototype is the linear combination of others, where the coefficients are constrained to be the same in all three spaces. To make the latent space more discriminative, a linear discriminative analysis strategy is employed to learn the projection matrix from visual space to latent space. Five popular datasets are exploited to evaluate the proposed method, and the results demonstrate the superiority of our approach compared with the state-of-the-art methods. Beside, extensive ablation studies also show the effectiveness of each module in our method.
topic	Generalized zero-shot learning linear discriminative analysis visual semantic alignment prototype synthesis
url	https://ieeexplore.ieee.org/document/9187726/
work_keys_str_mv	AT pengzhendu learningdiscriminativeprojectionwithvisualsemanticalignmentforgeneralizedzeroshotlearning AT haofengzhang learningdiscriminativeprojectionwithvisualsemanticalignmentforgeneralizedzeroshotlearning AT jianfenglu learningdiscriminativeprojectionwithvisualsemanticalignmentforgeneralizedzeroshotlearning
_version_	1724183314087018496

Learning Discriminative Projection With Visual Semantic Alignment for Generalized Zero Shot Learning

Similar Items