Compact video fingerprinting via an improved capsule net
Robustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture vi...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2021-04-01
|
Series: | Systems Science & Control Engineering |
Subjects: | |
Online Access: | http://dx.doi.org/10.1080/21642583.2020.1833782 |
id |
doaj-92253237539746dd9d83f212dd9afb6f |
---|---|
record_format |
Article |
spelling |
doaj-92253237539746dd9d83f212dd9afb6f2021-05-06T16:05:14ZengTaylor & Francis GroupSystems Science & Control Engineering2164-25832021-04-019S112213010.1080/21642583.2020.18337821833782Compact video fingerprinting via an improved capsule netLi Xinwei0Xu Lianghao1Yang Yi2Henan Polytechnic UniversityHenan Polytechnic UniversityHenan Polytechnic UniversityRobustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture video features, a capsule net, based on a 3D/2D mixed convolution module, is designed, which maps raw data to compact real vector directly. A new designed adaptive margin triplet loss function is introduced, and it can automatically adjust the loss according to the sample distance. It is beneficial for reducing training difficulty and improving performance. Three open access video datasets FCVID, TRECVID and You Tube are composed to train and test, large experimental results have shown that the proposed fingerprinting achieves better performance than traditional and deep learning methods.http://dx.doi.org/10.1080/21642583.2020.1833782mixed convolution moduleend-to-endadaptive margin triplet losscapsule net |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Li Xinwei Xu Lianghao Yang Yi |
spellingShingle |
Li Xinwei Xu Lianghao Yang Yi Compact video fingerprinting via an improved capsule net Systems Science & Control Engineering mixed convolution module end-to-end adaptive margin triplet loss capsule net |
author_facet |
Li Xinwei Xu Lianghao Yang Yi |
author_sort |
Li Xinwei |
title |
Compact video fingerprinting via an improved capsule net |
title_short |
Compact video fingerprinting via an improved capsule net |
title_full |
Compact video fingerprinting via an improved capsule net |
title_fullStr |
Compact video fingerprinting via an improved capsule net |
title_full_unstemmed |
Compact video fingerprinting via an improved capsule net |
title_sort |
compact video fingerprinting via an improved capsule net |
publisher |
Taylor & Francis Group |
series |
Systems Science & Control Engineering |
issn |
2164-2583 |
publishDate |
2021-04-01 |
description |
Robustness, distinctiveness and compactness are the three basic performance metrics for video fingerprinting, and the three factors affect each other. It is challenging to improve them simultaneously. For this reason, an end-to-end fingerprinting via a capsule net is proposed. In order to capture video features, a capsule net, based on a 3D/2D mixed convolution module, is designed, which maps raw data to compact real vector directly. A new designed adaptive margin triplet loss function is introduced, and it can automatically adjust the loss according to the sample distance. It is beneficial for reducing training difficulty and improving performance. Three open access video datasets FCVID, TRECVID and You Tube are composed to train and test, large experimental results have shown that the proposed fingerprinting achieves better performance than traditional and deep learning methods. |
topic |
mixed convolution module end-to-end adaptive margin triplet loss capsule net |
url |
http://dx.doi.org/10.1080/21642583.2020.1833782 |
work_keys_str_mv |
AT lixinwei compactvideofingerprintingviaanimprovedcapsulenet AT xulianghao compactvideofingerprintingviaanimprovedcapsulenet AT yangyi compactvideofingerprintingviaanimprovedcapsulenet |
_version_ |
1721456515450142720 |