The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment
Multimodal biometric systems have been a subject of study in recent decades. Their unique anti-spoofing and liveness-detection characteristics, plus their ability to deal with audio noise, make them candidate technologies for improving current systems such as voice recognition, verification and identification s...
Main Authors: | Jafari Moghadamfard, Ramtin; Payvar, Saeid |
---|---|
Format: | Others |
Language: | English |
Published: | Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE), 2014 |
Subjects: | voice recognition; lip motion; optical flow; Computer Sciences; Datavetenskap (datalogi) |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273 |
id |
ndltd-UPSALLA1-oai-DiVA.org-hh-27273 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-hh-27273; 2018-01-12T05:09:43Z; The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment; eng; Jafari Moghadamfard, Ramtin; Payvar, Saeid; Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE); Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE); 2014; voice recognition; lip motion; optical flow; Computer Sciences; Datavetenskap (datalogi); Multimodal biometric systems have been a subject of study in recent decades. Their unique anti-spoofing and liveness-detection characteristics, plus their ability to deal with audio noise, make them candidate technologies for improving current systems such as voice recognition, verification, and identification systems. In this work we studied the feasibility of incorporating an audio-visual voice recognition system to deal with audio noise in the truck cab environment. Speech recognition systems suffer from excessive noise from the engine, road traffic, and the car stereo system. To deal with this noise, different techniques, including active and passive noise cancelling, have been studied. Our results showed that although audio-only systems perform better in a noise-free environment, their performance drops significantly as the level of noise in truck cabins increases, which by contrast does not affect the performance of visual features. The final fused system, comprising both visual and audio cues, proved to be superior to both audio-only and video-only systems.; Student thesis; info:eu-repo/semantics/bachelorThesis; text; http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273; Local IDE1310; application/pdf; info:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others |
sources |
NDLTD |
topic |
voice recognition; lip motion; optical flow; Computer Sciences; Datavetenskap (datalogi) |
spellingShingle |
voice recognition lip motion optical flow Computer Sciences Datavetenskap (datalogi) Jafari Moghadamfard, Ramtin Payvar, Saeid The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment |
description |
Multimodal biometric systems have been a subject of study in recent decades. Their unique anti-spoofing and liveness-detection characteristics, plus their ability to deal with audio noise, make them candidate technologies for improving current systems such as voice recognition, verification, and identification systems. In this work we studied the feasibility of incorporating an audio-visual voice recognition system to deal with audio noise in the truck cab environment. Speech recognition systems suffer from excessive noise from the engine, road traffic, and the car stereo system. To deal with this noise, different techniques, including active and passive noise cancelling, have been studied. Our results showed that although audio-only systems perform better in a noise-free environment, their performance drops significantly as the level of noise in truck cabins increases, which by contrast does not affect the performance of visual features. The final fused system, comprising both visual and audio cues, proved to be superior to both audio-only and video-only systems. |
author |
Jafari Moghadamfard, Ramtin; Payvar, Saeid |
author_facet |
Jafari Moghadamfard, Ramtin; Payvar, Saeid |
author_sort |
Jafari Moghadamfard, Ramtin |
title |
The Potential of Visual Features : to Improve Voice Recognition Systems in Vehicles Noisy Environment |
title_sort |
potential of visual features : to improve voice recognition systems in vehicles noisy environment |
publisher |
Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE) |
publishDate |
2014 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:hh:diva-27273 |