The Study on Emotion Recognition in Continuous Mandarin Chinese Speech
博士 === 大同大學 === 資訊工程學系(所) === 98 === In this dissertation, 14 emotions, including 6 full-brown emotions: anger, boredom, fear, happiness, neutrality, and sadness, 8 underlying emotions: anxiety, despair, disgust, gratefulness, love, praise, shame, and surprise, are explored for speech emotion recogn...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/8fq8rx |
id |
ndltd-TW-098TTU05392023 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098TTU053920232019-05-15T20:32:55Z http://ndltd.ncl.edu.tw/handle/8fq8rx The Study on Emotion Recognition in Continuous Mandarin Chinese Speech 連續中文語音情緒辨識之研究 Jun-Heng Yeh 葉俊亨 博士 大同大學 資訊工程學系(所) 98 In this dissertation, 14 emotions, including 6 full-brown emotions: anger, boredom, fear, happiness, neutrality, and sadness, 8 underlying emotions: anxiety, despair, disgust, gratefulness, love, praise, shame, and surprise, are explored for speech emotion recognition study using speech processing and pattern recognition methods in the emotional space theory. In this study, a new emotional corpus, Mandarin Chinese Emotional Corpus 2010 (MCEC2010), is built up by 34 professional voice actors/actresses. This corpus consists of 19,136 short utterances recorded by voice actors/actresses and 472,017 emotional annotations labeled by 26 annotators who didn’t participate in the voice collection. Additionally, 209 dialogue scripts were recorded that were expressed freely by voice actors/actresses and annotated by 21 annotators who didn’t take part in recording these scripts. In the past, most of speech emotion recognition researches focused on recognizing emotions based on a single utterance as the recognition unit. This kind of sentence-type emotion recognition method may be inadequate to work in practice without the association of speech recognition mechanism. Moreover, the recognition rate will be affected by the accuracy of speech recognition in a hierarchy recognition scheme. Additionally, it must be noted that the heavy computation required for hierarchy recognition scheme would make it difficult to implement the emotion recognition in real time in a device with low computing ability. To overcome this shortcoming, four different speech segmentation methods for emotion recognition are tested in this dissertation. It appears that few research findings are available concerning to the analysis of mix-emotion in an utterance on emotion recognition. Most of them can only recognize the main emotional expression of the utterance. In this dissertation, the relative perceiving intensity of different emotional expressions of the segment is analyzed. And all emotional expression of segments will be mapped into the PAD (Pleasure-Arousal-Dominance) emotional space. By this visualizing presentation, it can not only quantify different emotional expressions in the utterance, but observe the gradual changes of different emotional expressions during dialogue. Tsang-Long Pao 包蒼龍 2010/08/ 學位論文 ; thesis 165 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 大同大學 === 資訊工程學系(所) === 98 === In this dissertation, 14 emotions, including 6 full-brown emotions: anger, boredom, fear, happiness, neutrality, and sadness, 8 underlying emotions: anxiety, despair, disgust, gratefulness, love, praise, shame, and surprise, are explored for speech emotion recognition study using speech processing and pattern recognition methods in the emotional space theory. In this study, a new emotional corpus, Mandarin Chinese Emotional Corpus 2010 (MCEC2010), is built up by 34 professional voice actors/actresses. This corpus consists of 19,136 short utterances recorded by voice actors/actresses and 472,017 emotional annotations labeled by 26 annotators who didn’t participate in the voice collection. Additionally, 209 dialogue scripts were recorded that were expressed freely by voice actors/actresses and annotated by 21 annotators who didn’t take part in recording these scripts.
In the past, most of speech emotion recognition researches focused on recognizing emotions based on a single utterance as the recognition unit. This kind of sentence-type emotion recognition method may be inadequate to work in practice without the association of speech recognition mechanism. Moreover, the recognition rate will be affected by the accuracy of speech recognition in a hierarchy recognition scheme. Additionally, it must be noted that the heavy computation required for hierarchy recognition scheme would make it difficult to implement the emotion recognition in real time in a device with low computing ability. To overcome this shortcoming, four different speech segmentation methods for emotion recognition are tested in this dissertation.
It appears that few research findings are available concerning to the analysis of mix-emotion in an utterance on emotion recognition. Most of them can only recognize the main emotional expression of the utterance. In this dissertation, the relative perceiving intensity of different emotional expressions of the segment is analyzed. And all emotional expression of segments will be mapped into the PAD (Pleasure-Arousal-Dominance) emotional space. By this visualizing presentation, it can not only quantify different emotional expressions in the utterance, but observe the gradual changes of different emotional expressions during dialogue.
|
author2 |
Tsang-Long Pao |
author_facet |
Tsang-Long Pao Jun-Heng Yeh 葉俊亨 |
author |
Jun-Heng Yeh 葉俊亨 |
spellingShingle |
Jun-Heng Yeh 葉俊亨 The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
author_sort |
Jun-Heng Yeh |
title |
The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
title_short |
The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
title_full |
The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
title_fullStr |
The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
title_full_unstemmed |
The Study on Emotion Recognition in Continuous Mandarin Chinese Speech |
title_sort |
study on emotion recognition in continuous mandarin chinese speech |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/8fq8rx |
work_keys_str_mv |
AT junhengyeh thestudyonemotionrecognitionincontinuousmandarinchinesespeech AT yèjùnhēng thestudyonemotionrecognitionincontinuousmandarinchinesespeech AT junhengyeh liánxùzhōngwényǔyīnqíngxùbiànshízhīyánjiū AT yèjùnhēng liánxùzhōngwényǔyīnqíngxùbiànshízhīyánjiū AT junhengyeh studyonemotionrecognitionincontinuousmandarinchinesespeech AT yèjùnhēng studyonemotionrecognitionincontinuousmandarinchinesespeech |
_version_ |
1719099981607993344 |