Activity Recognition in First-Person Camera View Based onTemporal Pyramid

碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 101 === We present a simple but effective online recognition system for detecting interleaved activities of daily life (ADLs) in first-person-view videos. The two major difficulties in detecting ADLs are interleaving and variability in duration. We use temporal pyra...

Full description

Bibliographic Details
Main Authors: Hsuan-Ming Liu, 劉軒銘
Other Authors: Ming Ouhyoung
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/92962830683022916719
id ndltd-TW-101NTU05641012
record_format oai_dc
spelling ndltd-TW-101NTU056410122016-03-16T04:15:17Z http://ndltd.ncl.edu.tw/handle/92962830683022916719 Activity Recognition in First-Person Camera View Based onTemporal Pyramid 基於時序金字塔之第一人稱影像行為辨識 Hsuan-Ming Liu 劉軒銘 碩士 國立臺灣大學 資訊網路與多媒體研究所 101 We present a simple but effective online recognition system for detecting interleaved activities of daily life (ADLs) in first-person-view videos. The two major difficulties in detecting ADLs are interleaving and variability in duration. We use temporal pyramid in our system to attack these difficulties, and this means we can use relatively simple models instead of time dependent probability ones such as Hidden semi-Markov model or nested models. The proposed solution includes the combination of conditional random fields (CRF) and an online inference algorithm, which explicitly considers multiple interleaved sequences by inferencing multi-stage activities on temporal pyramid. Although our system only uses linear chain-structured CRF model, which can be easily learned without a large amount of training data, it still recognizes complicated activity sequences. The system is evaluated on a data set provided by the work from state-of-the-art, and the result is comparable to their method. We also provide some experiment result using a customized dataset. Ming Ouhyoung 歐陽明 2013 學位論文 ; thesis 30 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 101 === We present a simple but effective online recognition system for detecting interleaved activities of daily life (ADLs) in first-person-view videos. The two major difficulties in detecting ADLs are interleaving and variability in duration. We use temporal pyramid in our system to attack these difficulties, and this means we can use relatively simple models instead of time dependent probability ones such as Hidden semi-Markov model or nested models. The proposed solution includes the combination of conditional random fields (CRF) and an online inference algorithm, which explicitly considers multiple interleaved sequences by inferencing multi-stage activities on temporal pyramid. Although our system only uses linear chain-structured CRF model, which can be easily learned without a large amount of training data, it still recognizes complicated activity sequences. The system is evaluated on a data set provided by the work from state-of-the-art, and the result is comparable to their method. We also provide some experiment result using a customized dataset.
author2 Ming Ouhyoung
author_facet Ming Ouhyoung
Hsuan-Ming Liu
劉軒銘
author Hsuan-Ming Liu
劉軒銘
spellingShingle Hsuan-Ming Liu
劉軒銘
Activity Recognition in First-Person Camera View Based onTemporal Pyramid
author_sort Hsuan-Ming Liu
title Activity Recognition in First-Person Camera View Based onTemporal Pyramid
title_short Activity Recognition in First-Person Camera View Based onTemporal Pyramid
title_full Activity Recognition in First-Person Camera View Based onTemporal Pyramid
title_fullStr Activity Recognition in First-Person Camera View Based onTemporal Pyramid
title_full_unstemmed Activity Recognition in First-Person Camera View Based onTemporal Pyramid
title_sort activity recognition in first-person camera view based ontemporal pyramid
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/92962830683022916719
work_keys_str_mv AT hsuanmingliu activityrecognitioninfirstpersoncameraviewbasedontemporalpyramid
AT liúxuānmíng activityrecognitioninfirstpersoncameraviewbasedontemporalpyramid
AT hsuanmingliu jīyúshíxùjīnzìtǎzhīdìyīrénchēngyǐngxiàngxíngwèibiànshí
AT liúxuānmíng jīyúshíxùjīnzìtǎzhīdìyīrénchēngyǐngxiàngxíngwèibiànshí
_version_ 1718206966107471872