Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations

The aim of the paper is to propose a feature fusion based Audio-Visual Speaker Identification (AVSI) system with varied conditions of illumination environments. Among the different fusion strategies, feature level fusion has been used for the proposed AVSI system where Hidden Markov Model (HMM) is u...

Full description

Bibliographic Details
Main Authors:	Md. Rabiul Islam, Md. Abdus Sobhan
Format:	Article
Language:	English
Published:	Hindawi Limited 2014-01-01
Series:	Applied Computational Intelligence and Soft Computing
Online Access:	http://dx.doi.org/10.1155/2014/831830

id	doaj-d1295be51c3e45149c4cc3439b0c2542
record_format	Article
spelling	doaj-d1295be51c3e45149c4cc3439b0c25422020-11-24T22:37:43ZengHindawi LimitedApplied Computational Intelligence and Soft Computing1687-97241687-97322014-01-01201410.1155/2014/831830831830Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting VariationsMd. Rabiul Islam0Md. Abdus Sobhan1Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, BangladeshSchool of Engineering & Computer Science, Independent University, Dhaka 1229, BangladeshThe aim of the paper is to propose a feature fusion based Audio-Visual Speaker Identification (AVSI) system with varied conditions of illumination environments. Among the different fusion strategies, feature level fusion has been used for the proposed AVSI system where Hidden Markov Model (HMM) is used for learning and classification. Since the feature set contains richer information about the raw biometric data than any other levels, integration at feature level is expected to provide better authentication results. In this paper, both Mel Frequency Cepstral Coefficients (MFCCs) and Linear Prediction Cepstral Coefficients (LPCCs) are combined to get the audio feature vectors and Active Shape Model (ASM) based appearance and shape facial features are concatenated to take the visual feature vectors. These combined audio and visual features are used for the feature-fusion. To reduce the dimension of the audio and visual feature vectors, Principal Component Analysis (PCA) method is used. The VALID audio-visual database is used to measure the performance of the proposed system where four different illumination levels of lighting conditions are considered. Experimental results focus on the significance of the proposed audio-visual speaker identification system with various combinations of audio and visual features.http://dx.doi.org/10.1155/2014/831830
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Md. Rabiul Islam Md. Abdus Sobhan
spellingShingle	Md. Rabiul Islam Md. Abdus Sobhan Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations Applied Computational Intelligence and Soft Computing
author_facet	Md. Rabiul Islam Md. Abdus Sobhan
author_sort	Md. Rabiul Islam
title	Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
title_short	Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
title_full	Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
title_fullStr	Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
title_full_unstemmed	Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations
title_sort	feature fusion based audio-visual speaker identification using hidden markov model under different lighting variations
publisher	Hindawi Limited
series	Applied Computational Intelligence and Soft Computing
issn	1687-9724 1687-9732
publishDate	2014-01-01
description	The aim of the paper is to propose a feature fusion based Audio-Visual Speaker Identification (AVSI) system with varied conditions of illumination environments. Among the different fusion strategies, feature level fusion has been used for the proposed AVSI system where Hidden Markov Model (HMM) is used for learning and classification. Since the feature set contains richer information about the raw biometric data than any other levels, integration at feature level is expected to provide better authentication results. In this paper, both Mel Frequency Cepstral Coefficients (MFCCs) and Linear Prediction Cepstral Coefficients (LPCCs) are combined to get the audio feature vectors and Active Shape Model (ASM) based appearance and shape facial features are concatenated to take the visual feature vectors. These combined audio and visual features are used for the feature-fusion. To reduce the dimension of the audio and visual feature vectors, Principal Component Analysis (PCA) method is used. The VALID audio-visual database is used to measure the performance of the proposed system where four different illumination levels of lighting conditions are considered. Experimental results focus on the significance of the proposed audio-visual speaker identification system with various combinations of audio and visual features.
url	http://dx.doi.org/10.1155/2014/831830
work_keys_str_mv	AT mdrabiulislam featurefusionbasedaudiovisualspeakeridentificationusinghiddenmarkovmodelunderdifferentlightingvariations AT mdabdussobhan featurefusionbasedaudiovisualspeakeridentificationusinghiddenmarkovmodelunderdifferentlightingvariations
_version_	1725715783516946432

Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations

Similar Items