Activity Recognition in First-Person Camera View Based onTemporal Pyramid
碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 101 === We present a simple but effective online recognition system for detecting interleaved activities of daily life (ADLs) in first-person-view videos. The two major difficulties in detecting ADLs are interleaving and variability in duration. We use temporal pyra...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/92962830683022916719 |
id |
ndltd-TW-101NTU05641012 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NTU056410122016-03-16T04:15:17Z http://ndltd.ncl.edu.tw/handle/92962830683022916719 Activity Recognition in First-Person Camera View Based onTemporal Pyramid 基於時序金字塔之第一人稱影像行為辨識 Hsuan-Ming Liu 劉軒銘 碩士 國立臺灣大學 資訊網路與多媒體研究所 101 We present a simple but effective online recognition system for detecting interleaved activities of daily life (ADLs) in first-person-view videos. The two major difficulties in detecting ADLs are interleaving and variability in duration. We use temporal pyramid in our system to attack these difficulties, and this means we can use relatively simple models instead of time dependent probability ones such as Hidden semi-Markov model or nested models. The proposed solution includes the combination of conditional random fields (CRF) and an online inference algorithm, which explicitly considers multiple interleaved sequences by inferencing multi-stage activities on temporal pyramid. Although our system only uses linear chain-structured CRF model, which can be easily learned without a large amount of training data, it still recognizes complicated activity sequences. The system is evaluated on a data set provided by the work from state-of-the-art, and the result is comparable to their method. We also provide some experiment result using a customized dataset. Ming Ouhyoung 歐陽明 2013 學位論文 ; thesis 30 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 101 === We present a simple but effective online recognition system for detecting
interleaved activities of daily life (ADLs) in first-person-view videos. The
two major difficulties in detecting ADLs are interleaving and variability in
duration. We use temporal pyramid in our system to attack these difficulties,
and this means we can use relatively simple models instead of time dependent
probability ones such as Hidden semi-Markov model or nested models.
The proposed solution includes the combination of conditional random fields
(CRF) and an online inference algorithm, which explicitly considers multiple
interleaved sequences by inferencing multi-stage activities on temporal
pyramid. Although our system only uses linear chain-structured CRF model,
which can be easily learned without a large amount of training data, it still
recognizes complicated activity sequences. The system is evaluated on a data
set provided by the work from state-of-the-art, and the result is comparable
to their method. We also provide some experiment result using a customized
dataset.
|
author2 |
Ming Ouhyoung |
author_facet |
Ming Ouhyoung Hsuan-Ming Liu 劉軒銘 |
author |
Hsuan-Ming Liu 劉軒銘 |
spellingShingle |
Hsuan-Ming Liu 劉軒銘 Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
author_sort |
Hsuan-Ming Liu |
title |
Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
title_short |
Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
title_full |
Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
title_fullStr |
Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
title_full_unstemmed |
Activity Recognition in First-Person Camera View Based onTemporal Pyramid |
title_sort |
activity recognition in first-person camera view based ontemporal pyramid |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/92962830683022916719 |
work_keys_str_mv |
AT hsuanmingliu activityrecognitioninfirstpersoncameraviewbasedontemporalpyramid AT liúxuānmíng activityrecognitioninfirstpersoncameraviewbasedontemporalpyramid AT hsuanmingliu jīyúshíxùjīnzìtǎzhīdìyīrénchēngyǐngxiàngxíngwèibiànshí AT liúxuānmíng jīyúshíxùjīnzìtǎzhīdìyīrénchēngyǐngxiàngxíngwèibiànshí |
_version_ |
1718206966107471872 |