An Iterative Decoding Algorithm for Fusion of Multimodal Information

Human activity analysis in an intelligent space is typically based on multimodal informational cues. Use of multiple modalities gives us a lot of advantages. But information fusion from different sources is a problem that has to be addressed. In this paper, we propose an iterative algorithm to fuse...

Full description

Bibliographic Details
Main Authors: Mohan M. Trivedi, Bhaskar D. Rao, Shankar T. Shivappa
Format: Article
Language:English
Published: SpringerOpen 2007-12-01
Series:EURASIP Journal on Advances in Signal Processing
Online Access:http://dx.doi.org/10.1155/2008/478396
id doaj-ca6c3ea82e564e859361e4e610e0a26c
record_format Article
spelling doaj-ca6c3ea82e564e859361e4e610e0a26c2020-11-25T01:06:23ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61722007-12-01200810.1155/2008/478396An Iterative Decoding Algorithm for Fusion of Multimodal InformationMohan M. TrivediBhaskar D. RaoShankar T. ShivappaHuman activity analysis in an intelligent space is typically based on multimodal informational cues. Use of multiple modalities gives us a lot of advantages. But information fusion from different sources is a problem that has to be addressed. In this paper, we propose an iterative algorithm to fuse information from multimodal sources. We draw inspiration from the theory of turbo codes. We draw an analogy between the redundant parity bits of the constituent codes of a turbo code and the information from different sensors in a multimodal system. A hidden Markov model is used to model the sequence of observations of individual modalities. The decoded state likelihoods from one modality are used as additional information in decoding the states of the other modalities. This procedure is repeated until a certain convergence criterion is met. The resulting iterative algorithm is shown to have lower error rates than the individual models alone. The algorithm is then applied to a real-world problem of speech segmentation using audio and visual cues.http://dx.doi.org/10.1155/2008/478396
collection DOAJ
language English
format Article
sources DOAJ
author Mohan M. Trivedi
Bhaskar D. Rao
Shankar T. Shivappa
spellingShingle Mohan M. Trivedi
Bhaskar D. Rao
Shankar T. Shivappa
An Iterative Decoding Algorithm for Fusion of Multimodal Information
EURASIP Journal on Advances in Signal Processing
author_facet Mohan M. Trivedi
Bhaskar D. Rao
Shankar T. Shivappa
author_sort Mohan M. Trivedi
title An Iterative Decoding Algorithm for Fusion of Multimodal Information
title_short An Iterative Decoding Algorithm for Fusion of Multimodal Information
title_full An Iterative Decoding Algorithm for Fusion of Multimodal Information
title_fullStr An Iterative Decoding Algorithm for Fusion of Multimodal Information
title_full_unstemmed An Iterative Decoding Algorithm for Fusion of Multimodal Information
title_sort iterative decoding algorithm for fusion of multimodal information
publisher SpringerOpen
series EURASIP Journal on Advances in Signal Processing
issn 1687-6172
publishDate 2007-12-01
description Human activity analysis in an intelligent space is typically based on multimodal informational cues. Use of multiple modalities gives us a lot of advantages. But information fusion from different sources is a problem that has to be addressed. In this paper, we propose an iterative algorithm to fuse information from multimodal sources. We draw inspiration from the theory of turbo codes. We draw an analogy between the redundant parity bits of the constituent codes of a turbo code and the information from different sensors in a multimodal system. A hidden Markov model is used to model the sequence of observations of individual modalities. The decoded state likelihoods from one modality are used as additional information in decoding the states of the other modalities. This procedure is repeated until a certain convergence criterion is met. The resulting iterative algorithm is shown to have lower error rates than the individual models alone. The algorithm is then applied to a real-world problem of speech segmentation using audio and visual cues.
url http://dx.doi.org/10.1155/2008/478396
work_keys_str_mv AT mohanmtrivedi aniterativedecodingalgorithmforfusionofmultimodalinformation
AT bhaskardrao aniterativedecodingalgorithmforfusionofmultimodalinformation
AT shankartshivappa aniterativedecodingalgorithmforfusionofmultimodalinformation
AT mohanmtrivedi iterativedecodingalgorithmforfusionofmultimodalinformation
AT bhaskardrao iterativedecodingalgorithmforfusionofmultimodalinformation
AT shankartshivappa iterativedecodingalgorithmforfusionofmultimodalinformation
_version_ 1725190481706483712