Detection of Human Emotion from Noise Speech

Detection of a human emotion from human speech is always a challenging task. Factors like intonation, pitch, and loudness of signal vary from different human voice. So, it's important to know the exact pitch, intonation and loudness of a speech for making it a challenging task for detection. So...

Full description

Bibliographic Details
Main Authors:	Nallamilli, Sai Chandra Sekhar Reddy, Kandi, Nihanth
Format:	Others
Language:	English
Published:	Blekinge Tekniska Högskola, Institutionen för tillämpad signalbehandling 2020
Subjects:	Neural Network Activation Function Fast Fourier Transform Karhunen-Loeve Transform speech enhancement filtering Wavelet Transform Speech preprocessing signal to noise ratio shallow neural network Signal Processing Signalbehandling
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:bth-19610

id	ndltd-UPSALLA1-oai-DiVA.org-bth-19610
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-bth-196102020-06-10T04:24:30ZDetection of Human Emotion from Noise SpeechengNallamilli, Sai Chandra Sekhar ReddyKandi, NihanthBlekinge Tekniska Högskola, Institutionen för tillämpad signalbehandlingBlekinge Tekniska Högskola, Institutionen för tillämpad signalbehandling2020Neural NetworkActivation FunctionFast Fourier TransformKarhunen-Loeve Transformspeech enhancementfilteringWavelet TransformSpeech preprocessingsignal to noise ratioshallow neural networkSignal ProcessingSignalbehandlingDetection of a human emotion from human speech is always a challenging task. Factors like intonation, pitch, and loudness of signal vary from different human voice. So, it's important to know the exact pitch, intonation and loudness of a speech for making it a challenging task for detection. Some voices exhibit high background noise which will affect the amplitude or pitch of the signal. So, knowing the detailed properties of a speech to detect emotion is mandatory. Detection of emotion in humans from speech signals is a recent research field. One of the scenarios where this field has been applied is in situations where the human integrity and security are at risk In this project we are proposing a set of features based on the decomposition signals from discrete wavelet transform to characterize different types of negative emotions such as anger, happy, sad, and desperation. The features are measured in three different conditions: (1) the original speech signals, (2) the signals that are contaminated with noise or are affected by the presence of a phone channel, and (3) the signals that are obtained after processing using an algorithm for Speech Enhancement Transform. According to the results, when the speech enhancement is applied, the detection of emotion in speech is increased and compared to results obtained when the speech signal is highly contaminated with noise. Our objective is to use Artificial neural network because the brain is the most efficient and best machine to recognize speech. The brain is built with some neural network. At the same time, Artificial neural networks are clearly advanced with respect to several features, such as their nonlinearity and high classification capability. If we use Artificial neural networks to evolve the machine or computer that it can detect the emotion. Here we are using feedforward neural network which is suitable for classification process and using sigmoid function as activation function. The detection of human emotion from speech is achieved by training the neural network with features extracted from the speech. To achieve this, we need proper features from the speech. So, we must remove background noise in the speech. We can remove background noise by using filters. wavelet transform is the filtering technique used to remove the background noise and enhance the required features in the speech. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:bth-19610Blekinge Institute of Technology Research report, 1103-1581application/pdfinfo:eu-repo/semantics/openAccessapplication/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Neural Network Activation Function Fast Fourier Transform Karhunen-Loeve Transform speech enhancement filtering Wavelet Transform Speech preprocessing signal to noise ratio shallow neural network Signal Processing Signalbehandling
spellingShingle	Neural Network Activation Function Fast Fourier Transform Karhunen-Loeve Transform speech enhancement filtering Wavelet Transform Speech preprocessing signal to noise ratio shallow neural network Signal Processing Signalbehandling Nallamilli, Sai Chandra Sekhar Reddy Kandi, Nihanth Detection of Human Emotion from Noise Speech
description	Detection of a human emotion from human speech is always a challenging task. Factors like intonation, pitch, and loudness of signal vary from different human voice. So, it's important to know the exact pitch, intonation and loudness of a speech for making it a challenging task for detection. Some voices exhibit high background noise which will affect the amplitude or pitch of the signal. So, knowing the detailed properties of a speech to detect emotion is mandatory. Detection of emotion in humans from speech signals is a recent research field. One of the scenarios where this field has been applied is in situations where the human integrity and security are at risk In this project we are proposing a set of features based on the decomposition signals from discrete wavelet transform to characterize different types of negative emotions such as anger, happy, sad, and desperation. The features are measured in three different conditions: (1) the original speech signals, (2) the signals that are contaminated with noise or are affected by the presence of a phone channel, and (3) the signals that are obtained after processing using an algorithm for Speech Enhancement Transform. According to the results, when the speech enhancement is applied, the detection of emotion in speech is increased and compared to results obtained when the speech signal is highly contaminated with noise. Our objective is to use Artificial neural network because the brain is the most efficient and best machine to recognize speech. The brain is built with some neural network. At the same time, Artificial neural networks are clearly advanced with respect to several features, such as their nonlinearity and high classification capability. If we use Artificial neural networks to evolve the machine or computer that it can detect the emotion. Here we are using feedforward neural network which is suitable for classification process and using sigmoid function as activation function. The detection of human emotion from speech is achieved by training the neural network with features extracted from the speech. To achieve this, we need proper features from the speech. So, we must remove background noise in the speech. We can remove background noise by using filters. wavelet transform is the filtering technique used to remove the background noise and enhance the required features in the speech.
author	Nallamilli, Sai Chandra Sekhar Reddy Kandi, Nihanth
author_facet	Nallamilli, Sai Chandra Sekhar Reddy Kandi, Nihanth
author_sort	Nallamilli, Sai Chandra Sekhar Reddy
title	Detection of Human Emotion from Noise Speech
title_short	Detection of Human Emotion from Noise Speech
title_full	Detection of Human Emotion from Noise Speech
title_fullStr	Detection of Human Emotion from Noise Speech
title_full_unstemmed	Detection of Human Emotion from Noise Speech
title_sort	detection of human emotion from noise speech
publisher	Blekinge Tekniska Högskola, Institutionen för tillämpad signalbehandling
publishDate	2020
url	http://urn.kb.se/resolve?urn=urn:nbn:se:bth-19610
work_keys_str_mv	AT nallamillisaichandrasekharreddy detectionofhumanemotionfromnoisespeech AT kandinihanth detectionofhumanemotionfromnoisespeech
_version_	1719318738695618560

Detection of Human Emotion from Noise Speech

Similar Items