Efficient Multiple Path Search for Action Tube Detection in Videos

碩士 === 國立臺灣科技大學 === 電子工程系 === 105 === This thesis presents an efficient convolutional neural network (CNN)-based approach to detect multiple spatial-temporal action tubes in videos. First, a new fusion strategy is employed, which combines the appearance and the flow information out of the two-strea...

Full description

Bibliographic Details
Main Author: Erick Hendra Putra Alwando
Other Authors: Wen-Hsien Fang
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/08603454653555086253
id ndltd-TW-105NTUS5428080
record_format oai_dc
spelling ndltd-TW-105NTUS54280802017-10-31T04:58:52Z http://ndltd.ncl.edu.tw/handle/08603454653555086253 Efficient Multiple Path Search for Action Tube Detection in Videos 高效多路徑搜尋之影片動作偵測 Erick Hendra Putra Alwando Erick Hendra Putra Alwando 碩士 國立臺灣科技大學 電子工程系 105 This thesis presents an efficient convolutional neural network (CNN)-based approach to detect multiple spatial-temporal action tubes in videos. First, a new fusion strategy is employed, which combines the appearance and the flow information out of the two-stream CNN-based networks along with motion saliency to generate the action detection scores. Thereafter, an efficient multiple path search (MPS) algorithm, is developed to simultaneously find multiple paths in a single run. In the forward message passing of MPS, each node stores information of a prescribed number of paths based on the accumulated scores determined in the previous stages. A backward path tracing is invoked afterward to find all multiple paths at the same time by fully reusing the information generated in the forward pass without repeating the search process. Thereby, the complexity incurred can be reduced. Moreover, to rectify the potentially inaccurate bounding boxes, a video localization refinement (VLR) scheme is also addressed to further boost the detection accuracy. Simulations show that the proposed MPS provides superior performance compared with the main state-of-the-art works on the widespread UCF-101 and J-HMDB datasets. Together with VLR, the performance of MPS can be further bolstered. Wen-Hsien Fang Yie-Tarng Chen 方文賢 陳郁堂 2017 學位論文 ; thesis 49 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 電子工程系 === 105 === This thesis presents an efficient convolutional neural network (CNN)-based approach to detect multiple spatial-temporal action tubes in videos. First, a new fusion strategy is employed, which combines the appearance and the flow information out of the two-stream CNN-based networks along with motion saliency to generate the action detection scores. Thereafter, an efficient multiple path search (MPS) algorithm, is developed to simultaneously find multiple paths in a single run. In the forward message passing of MPS, each node stores information of a prescribed number of paths based on the accumulated scores determined in the previous stages. A backward path tracing is invoked afterward to find all multiple paths at the same time by fully reusing the information generated in the forward pass without repeating the search process. Thereby, the complexity incurred can be reduced. Moreover, to rectify the potentially inaccurate bounding boxes, a video localization refinement (VLR) scheme is also addressed to further boost the detection accuracy. Simulations show that the proposed MPS provides superior performance compared with the main state-of-the-art works on the widespread UCF-101 and J-HMDB datasets. Together with VLR, the performance of MPS can be further bolstered.
author2 Wen-Hsien Fang
author_facet Wen-Hsien Fang
Erick Hendra Putra Alwando
Erick Hendra Putra Alwando
author Erick Hendra Putra Alwando
Erick Hendra Putra Alwando
spellingShingle Erick Hendra Putra Alwando
Erick Hendra Putra Alwando
Efficient Multiple Path Search for Action Tube Detection in Videos
author_sort Erick Hendra Putra Alwando
title Efficient Multiple Path Search for Action Tube Detection in Videos
title_short Efficient Multiple Path Search for Action Tube Detection in Videos
title_full Efficient Multiple Path Search for Action Tube Detection in Videos
title_fullStr Efficient Multiple Path Search for Action Tube Detection in Videos
title_full_unstemmed Efficient Multiple Path Search for Action Tube Detection in Videos
title_sort efficient multiple path search for action tube detection in videos
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/08603454653555086253
work_keys_str_mv AT erickhendraputraalwando efficientmultiplepathsearchforactiontubedetectioninvideos
AT erickhendraputraalwando efficientmultiplepathsearchforactiontubedetectioninvideos
AT erickhendraputraalwando gāoxiàoduōlùjìngsōuxúnzhīyǐngpiàndòngzuòzhēncè
AT erickhendraputraalwando gāoxiàoduōlùjìngsōuxúnzhīyǐngpiàndòngzuòzhēncè
_version_ 1718558723060793344