Normalisasi Kata Tidak Baku yang Tidak Disingkat dengan Jarak Perubahan

Bibliographic Details
Main Authors:	I Gusti Bagus Baskara Nugraha, Rafi Dwi Rizqullah
Format:	Article
Language:	English
Published:	Universitas Gadjah Mada 2019-08-01
Series:	Jurnal Nasional Teknik Elektro dan Teknologi Informasi
Subjects:	voice assistant; dictionary; non-standard words; normalization; Levenshtein distance; Jaro-Winkler distance
Online Access:	http://ejnteti.jteti.ugm.ac.id/index.php/JNTETI/article/view/516

Description
Summary:	Voice assistant technology is growing rapidly, and its use has begun to spread into daily life. However, voice assistants are still limited to standard conversational language, while Indonesian speakers are accustomed to informal language in daily conversation. This research proposes a solution to the problem voice assistants have with informal words, or words that are not found in a formal word dictionary. We propose text normalization using Levenshtein distance. Test results show that normalization using Levenshtein distance outperforms normalization using Longest Common Subsequence (LCS) distance, with an accuracy difference of 8.34%.
ISSN:	2301-4156 2460-5719
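
Illustrative sketch (not taken from the article): the summary describes normalizing informal words by Levenshtein distance against a formal word dictionary. The Python below shows one plausible reading of that idea, assuming the nearest dictionary entry is chosen as the normalized form. The function names `levenshtein` and `normalize` and the four-word list `FORMAL_WORDS` are hypothetical and exist only for this example; the article's actual dictionary, thresholds, and tie-breaking rules are not stated in this record.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance counting single-character insertions, deletions, substitutions."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty prefix of b
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or keep) the character
            ))
        previous = current
    return previous[-1]


# Hypothetical mini dictionary of formal Indonesian words (illustration only).
FORMAL_WORDS = ["tidak", "sudah", "benar", "kalau"]


def normalize(word: str, dictionary=FORMAL_WORDS) -> str:
    """Return the word itself if it is formal, else the closest dictionary entry."""
    if word in dictionary:
        return word
    # Assumption: ties are broken by dictionary order, since min() keeps the first minimum.
    return min(dictionary, key=lambda entry: levenshtein(word, entry))


if __name__ == "__main__":
    for informal in ["bener", "udah"]:
        print(informal, "->", normalize(informal))
```

With this toy dictionary, the informal forms "bener" and "udah" each lie one edit away from "benar" and "sudah" respectively, so those entries are selected. A real system would need a full formal dictionary and probably a maximum-distance cutoff so that unrelated words are not forced onto a dictionary entry.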

Similar Items