Compact video fingerprinting via an improved capsule net

Robustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture vi...

Full description

Bibliographic Details
Main Authors: Li Xinwei, Xu Lianghao, Yang Yi
Format: Article
Language:English
Published: Taylor & Francis Group 2021-04-01
Series:Systems Science & Control Engineering
Subjects:
Online Access:http://dx.doi.org/10.1080/21642583.2020.1833782
id doaj-92253237539746dd9d83f212dd9afb6f
record_format Article
spelling doaj-92253237539746dd9d83f212dd9afb6f2021-05-06T16:05:14ZengTaylor & Francis GroupSystems Science & Control Engineering2164-25832021-04-019S112213010.1080/21642583.2020.18337821833782Compact video fingerprinting via an improved capsule netLi Xinwei0Xu Lianghao1Yang Yi2Henan Polytechnic UniversityHenan Polytechnic UniversityHenan Polytechnic UniversityRobustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture video features, a capsule net, based on a 3D/2D mixed convolution module, is designed, which maps raw data to compact real vector directly. A new designed adaptive margin triplet loss function is introduced, and it can automatically adjust the loss according to the sample distance. It is beneficial for reducing training difficulty and improving performance. Three open access video datasets FCVID, TRECVID and You Tube are composed to train and test, large experimental results have shown that the proposed fingerprinting achieves better performance than traditional and deep learning methods.http://dx.doi.org/10.1080/21642583.2020.1833782mixed convolution moduleend-to-endadaptive margin triplet losscapsule net
collection DOAJ
language English
format Article
sources DOAJ
author Li Xinwei
Xu Lianghao
Yang Yi
spellingShingle Li Xinwei
Xu Lianghao
Yang Yi
Compact video fingerprinting via an improved capsule net
Systems Science & Control Engineering
mixed convolution module
end-to-end
adaptive margin triplet loss
capsule net
author_facet Li Xinwei
Xu Lianghao
Yang Yi
author_sort Li Xinwei
title Compact video fingerprinting via an improved capsule net
title_short Compact video fingerprinting via an improved capsule net
title_full Compact video fingerprinting via an improved capsule net
title_fullStr Compact video fingerprinting via an improved capsule net
title_full_unstemmed Compact video fingerprinting via an improved capsule net
title_sort compact video fingerprinting via an improved capsule net
publisher Taylor & Francis Group
series Systems Science & Control Engineering
issn 2164-2583
publishDate 2021-04-01
description Robustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture video features, a capsule net, based on a 3D/2D mixed convolution module, is designed, which maps raw data to compact real vector directly. A new designed adaptive margin triplet loss function is introduced, and it can automatically adjust the loss according to the sample distance. It is beneficial for reducing training difficulty and improving performance. Three open access video datasets FCVID, TRECVID and You Tube are composed to train and test, large experimental results have shown that the proposed fingerprinting achieves better performance than traditional and deep learning methods.
topic mixed convolution module
end-to-end
adaptive margin triplet loss
capsule net
url http://dx.doi.org/10.1080/21642583.2020.1833782
work_keys_str_mv AT lixinwei compactvideofingerprintingviaanimprovedcapsulenet
AT xulianghao compactvideofingerprintingviaanimprovedcapsulenet
AT yangyi compactvideofingerprintingviaanimprovedcapsulenet
_version_ 1721456515450142720