Normalisasi Kata Tidak Baku yang Tidak Disingkat dengan Jarak Perubahan (Normalization of Unabbreviated Non-Standard Words Using Edit Distance)
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: | Universitas Gadjah Mada, 2019-08-01 |
Series: | Jurnal Nasional Teknik Elektro dan Teknologi Informasi |
Subjects: | |
Online Access: | http://ejnteti.jteti.ugm.ac.id/index.php/JNTETI/article/view/516 |
Summary: | Voice assistant technology is growing rapidly and is becoming part of daily life. However, voice assistants are still limited to standard conversational language, whereas Indonesian speakers are accustomed to informal language in daily conversation. This research offers a solution to the problem voice assistants have with informal words, that is, words that are not found in a dictionary of formal words. We propose text normalization using Levenshtein distance. Test results show that normalization using Levenshtein distance outperforms normalization using Longest Common Subsequence (LCS) distance, with an accuracy difference of 8.34%. |
---|---|
ISSN: | 2301-4156 2460-5719 |
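
The summary compares dictionary-based normalization under two string distances. The following is a minimal Python sketch of that idea, not the authors' implementation: the function names, the tiny `FORMAL_WORDS` list, and the nearest-word selection rule (first dictionary entry with the smallest distance) are all assumptions made for illustration. The LCS distance is taken here as `len(a) + len(b) - 2 * LCS(a, b)`, one common definition; the article may define it differently.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance allowing insertion, deletion, and substitution."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        curr = [0]
        for j, cb in enumerate(b, start=1):
            if ca == cb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_distance(a: str, b: str) -> int:
    """Assumed definition: edit distance with insertions and deletions only."""
    return len(a) + len(b) - 2 * lcs_length(a, b)


# Hypothetical formal-word dictionary, for illustration only.
FORMAL_WORDS = ["apotek", "nasihat", "aktivitas", "praktik"]


def normalize(word: str, dictionary=FORMAL_WORDS, distance=levenshtein) -> str:
    """Return the word itself if it is formal, else the nearest formal word.

    Ties are resolved in favour of the earlier dictionary entry.
    """
    if word in dictionary:
        return word
    return min(dictionary, key=lambda candidate: distance(word, candidate))


if __name__ == "__main__":
    for informal in ["apotik", "nasehat", "aktifitas"]:
        print(informal, "->", normalize(informal))
```

Under these assumptions, `normalize("apotik")` returns `"apotek"`: the Levenshtein distance between the two is 1 (one substitution), while the LCS distance is 2, because a substitution must be expressed as a deletion plus an insertion. Passing `distance=lcs_distance` swaps in the alternative measure for a side-by-side comparison.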