Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient

<p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, whic...

Full description

Bibliographic Details
Main Author: Karisma Trinanda Putra
Format: Article
Language:English
Published: Universitas Muhammadiyah Yogyakarta 2017-11-01
Series:Semesta Teknika
Subjects:
Online Access:https://journal.umy.ac.id/index.php/st/article/view/2358
id doaj-36b7defd2e34497899a3a7189b2905b3
record_format Article
spelling doaj-36b7defd2e34497899a3a7189b2905b32020-11-25T03:41:25ZengUniversitas Muhammadiyah YogyakartaSemesta Teknika1411-061X2502-54812017-11-0120175802364Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral CoefficientKarisma Trinanda Putra0Universitas Muhammadiyah Yogyakarta<p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, which depend on the noise and reliability of signal characteristic extraction methods. In fact, variations of pronunciation by different people will result in a diversity of voice signal patterns. This research develops technology that can recognize and translate speech according to data that has been trained and can be modified based on user requirement. The voice signal will be separated from the silent signal using voice activity detection. Then, the voice signal is converted to the frequency domain before it is extracted using mel-frequency cepstral coefficients. Cepstral value from MFCC extraction will be identified as words using artificial neural network. This study utilizes a computer with a microphone as a sound recording device and pascal programming language as the basis for building applications. Based on the experimental results, the accuracy is 87% on the speech recognition process with 28 vocabulary sets. Accuracy decreases with more sets of vocabulary. However, the more pronounced speech variations, the greater the accuracy with an average number around 93%.</p>https://journal.umy.ac.id/index.php/st/article/view/2358voice activity detection, mel-frequency cepstral coefficient, artificial neural network
collection DOAJ
language English
format Article
sources DOAJ
author Karisma Trinanda Putra
spellingShingle Karisma Trinanda Putra
Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
Semesta Teknika
voice activity detection, mel-frequency cepstral coefficient, artificial neural network
author_facet Karisma Trinanda Putra
author_sort Karisma Trinanda Putra
title Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
title_short Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
title_full Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
title_fullStr Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
title_full_unstemmed Sistem Pengenal Wicara Menggunakan Mel-Frequency Cepstral Coefficient
title_sort sistem pengenal wicara menggunakan mel-frequency cepstral coefficient
publisher Universitas Muhammadiyah Yogyakarta
series Semesta Teknika
issn 1411-061X
2502-5481
publishDate 2017-11-01
description <p class="Abstract">Human-machine interaction evolves toward a more adaptive and interactive system. There are several media that can be used in human-machine interaction systems, such as voice signals. The process includes converting analog signals into the appropriate meaning, which depend on the noise and reliability of signal characteristic extraction methods. In fact, variations of pronunciation by different people will result in a diversity of voice signal patterns. This research develops technology that can recognize and translate speech according to data that has been trained and can be modified based on user requirement. The voice signal will be separated from the silent signal using voice activity detection. Then, the voice signal is converted to the frequency domain before it is extracted using mel-frequency cepstral coefficients. Cepstral value from MFCC extraction will be identified as words using artificial neural network. This study utilizes a computer with a microphone as a sound recording device and pascal programming language as the basis for building applications. Based on the experimental results, the accuracy is 87% on the speech recognition process with 28 vocabulary sets. Accuracy decreases with more sets of vocabulary. However, the more pronounced speech variations, the greater the accuracy with an average number around 93%.</p>
topic voice activity detection, mel-frequency cepstral coefficient, artificial neural network
url https://journal.umy.ac.id/index.php/st/article/view/2358
work_keys_str_mv AT karismatrinandaputra sistempengenalwicaramenggunakanmelfrequencycepstralcoefficient
_version_ 1724529744039378944