The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment

Multimodal biometric systems have been subject of study in recent decades, theirunique characteristic of Anti spoofing and liveness detection plus ability to deal withaudio noise made them technology candidates for improving current systems such asvoice recognition, verification and identification s...

Full description

Bibliographic Details
Main Authors: Jafari Moghadamfard, Ramtin, Payvar, Saeid
Format: Others
Language:English
Published: Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE) 2014
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273
id ndltd-UPSALLA1-oai-DiVA.org-hh-27273
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-hh-272732018-01-12T05:09:43ZThe Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy EnvironmentengJafari Moghadamfard, RamtinPayvar, SaeidHögskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE)Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE)2014voice recognitionlip motionoptical flowComputer SciencesDatavetenskap (datalogi)Multimodal biometric systems have been subject of study in recent decades, theirunique characteristic of Anti spoofing and liveness detection plus ability to deal withaudio noise made them technology candidates for improving current systems such asvoice recognition, verification and identification systems.In this work we studied feasibility of incorporating audio-visual voice recognitionsystem for dealing with audio noise in the truck cab environment. Speech recognitionsystems suffer from excessive noise from the engine and road traffic and cars stereosystem. To deal with this noise different techniques including active and passive noisecancelling have been studied.Our results showed that although audio-only systems are performing better in noisefree environment their performance drops significantly by increase in the level of noisein truck cabins, which by contrast does not affect the performance of visual features.Final fused system comprising both visual and audio cues, proved to be superior toboth audio-only and video-only systems. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273Local IDE1310application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic voice recognition
lip motion
optical flow
Computer Sciences
Datavetenskap (datalogi)
spellingShingle voice recognition
lip motion
optical flow
Computer Sciences
Datavetenskap (datalogi)
Jafari Moghadamfard, Ramtin
Payvar, Saeid
The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
description Multimodal biometric systems have been subject of study in recent decades, theirunique characteristic of Anti spoofing and liveness detection plus ability to deal withaudio noise made them technology candidates for improving current systems such asvoice recognition, verification and identification systems.In this work we studied feasibility of incorporating audio-visual voice recognitionsystem for dealing with audio noise in the truck cab environment. Speech recognitionsystems suffer from excessive noise from the engine and road traffic and cars stereosystem. To deal with this noise different techniques including active and passive noisecancelling have been studied.Our results showed that although audio-only systems are performing better in noisefree environment their performance drops significantly by increase in the level of noisein truck cabins, which by contrast does not affect the performance of visual features.Final fused system comprising both visual and audio cues, proved to be superior toboth audio-only and video-only systems.
author Jafari Moghadamfard, Ramtin
Payvar, Saeid
author_facet Jafari Moghadamfard, Ramtin
Payvar, Saeid
author_sort Jafari Moghadamfard, Ramtin
title The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
title_short The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
title_full The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
title_fullStr The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
title_full_unstemmed The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
title_sort potential of visual features : to improve voice recognition systems in vehicles noisy environment
publisher Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE)
publishDate 2014
url http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273
work_keys_str_mv AT jafarimoghadamfardramtin thepotentialofvisualfeaturestoimprovevoicerecognitionsystemsinvehiclesnoisyenvironment
AT payvarsaeid thepotentialofvisualfeaturestoimprovevoicerecognitionsystemsinvehiclesnoisyenvironment
AT jafarimoghadamfardramtin potentialofvisualfeaturestoimprovevoicerecognitionsystemsinvehiclesnoisyenvironment
AT payvarsaeid potentialofvisualfeaturestoimprovevoicerecognitionsystemsinvehiclesnoisyenvironment
_version_ 1718605182290362368