Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
<p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, whic...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Muhammadiyah Yogyakarta
2017-11-01
|
Series: | Semesta Teknika |
Subjects: | |
Online Access: | https://journal.umy.ac.id/index.php/st/article/view/2358 |
id |
doaj-36b7defd2e34497899a3a7189b2905b3 |
---|---|
record_format |
Article |
spelling |
doaj-36b7defd2e34497899a3a7189b2905b32020-11-25T03:41:25ZengUniversitas Muhammadiyah YogyakartaSemesta Teknika1411-061X2502-54812017-11-0120175802364Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral CoefficientKarisma Trinanda Putra0Universitas Muhammadiyah Yogyakarta<p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, which depend on the noise and reliability of signal characteristic extraction methods. In fact, variations of pronunciation by different people will result in a diversity of voice signal patterns. This research develops technology that can recognize and translate speech according to data that has been trained and can be modified based on user requirement. The voice signal will be separated from the silent signal using voice activity detection. Then, the voice signal is converted to the frequency domain before it is extracted using mel-frequency cepstral coefficients. Cepstral value from MFCC extraction will be identified as words using artificial neural network. This study utilizes a computer with a microphone as a sound recording device and pascal programming language as the basis for building applications. Based on the experimental results, the accuracy is 87% on the speech recognition process with 28 vocabulary sets. Accuracy decreases with more sets of vocabulary. However, the more pronounced speech variations, the greater the accuracy with an average number around 93%.</p>https://journal.umy.ac.id/index.php/st/article/view/2358voice activity detection, mel-frequency cepstral coefficient, artificial neural network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Karisma Trinanda Putra |
spellingShingle |
Karisma Trinanda Putra Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient Semesta Teknika voice activity detection, mel-frequency cepstral coefficient, artificial neural network |
author_facet |
Karisma Trinanda Putra |
author_sort |
Karisma Trinanda Putra |
title |
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient |
title_short |
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient |
title_full |
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient |
title_fullStr |
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient |
title_full_unstemmed |
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient |
title_sort |
sistem pengenal wicara menggunakan mel-frequency cepstral coefficient |
publisher |
Universitas Muhammadiyah Yogyakarta |
series |
Semesta Teknika |
issn |
1411-061X 2502-5481 |
publishDate |
2017-11-01 |
description |
<p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, which depend on the noise and reliability of signal characteristic extraction methods. In fact, variations of pronunciation by different people will result in a diversity of voice signal patterns. This research develops technology that can recognize and translate speech according to data that has been trained and can be modified based on user requirement. The voice signal will be separated from the silent signal using voice activity detection. Then, the voice signal is converted to the frequency domain before it is extracted using mel-frequency cepstral coefficients. Cepstral value from MFCC extraction will be identified as words using artificial neural network. This study utilizes a computer with a microphone as a sound recording device and pascal programming language as the basis for building applications. Based on the experimental results, the accuracy is 87% on the speech recognition process with 28 vocabulary sets. Accuracy decreases with more sets of vocabulary. However, the more pronounced speech variations, the greater the accuracy with an average number around 93%.</p> |
topic |
voice activity detection, mel-frequency cepstral coefficient, artificial neural network |
url |
https://journal.umy.ac.id/index.php/st/article/view/2358 |
work_keys_str_mv |
AT karismatrinandaputra sistempengenalwicaramenggunakanmelfrequencycepstralcoefficient |
_version_ |
1724529744039378944 |