Turkish Music Genre Classification using Audio and Lyrics Features
<p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. M...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Suleyman Demirel University
2017-05-01
|
Series: | Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi |
Subjects: | |
Online Access: | http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681 |
id |
doaj-0960226bd33248acbdd73b9f4c63358c |
---|---|
record_format |
Article |
spelling |
doaj-0960226bd33248acbdd73b9f4c63358c2020-11-25T00:20:36ZengSuleyman Demirel UniversitySüleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi1308-65292017-05-0121232233110.19113/sdufbed.883035000174261Turkish Music Genre Classification using Audio and Lyrics FeaturesÖnder ÇOBAN0Çukurova Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü<p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. Meta-data information, lyrics, or melodic content of music are used as feature resource in previous works. However, lyrics do not often used in MIR systems and the number of works in this field is not enough especially for Turkish. In this paper, firstly, we have extended our previously created Turkish MIR (TMIR) dataset, which comprises of Turkish lyrics, by including the audio file of each song. Secondly, we have investigated the effect of using audio and textual features together or separately on automatic Music Genre Classification (MGC). We have extracted textual features from lyrics using different feature extraction models such as word2vec and traditional Bag of Words. We have conducted our experiments on Support Vector Machine (SVM) algorithm and analysed the impact of feature selection and different feature groups on MGC. We have considered lyrics based MGC as a text classification task and also investigated the effect of term weighting method. Experimental results show that textual features can also be effective as well as audio features for Turkish MGC, especially when a supervised term weighting method is employed. We have achieved the highest success rate as 99,12\% by using both audio and textual features together.</p>http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681Music genre classificationLyrics analysisWord2vecAudio classificationMachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Önder ÇOBAN |
spellingShingle |
Önder ÇOBAN Turkish Music Genre Classification using Audio and Lyrics Features Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi Music genre classification Lyrics analysis Word2vec Audio classification Machine learning |
author_facet |
Önder ÇOBAN |
author_sort |
Önder ÇOBAN |
title |
Turkish Music Genre Classification using Audio and Lyrics Features |
title_short |
Turkish Music Genre Classification using Audio and Lyrics Features |
title_full |
Turkish Music Genre Classification using Audio and Lyrics Features |
title_fullStr |
Turkish Music Genre Classification using Audio and Lyrics Features |
title_full_unstemmed |
Turkish Music Genre Classification using Audio and Lyrics Features |
title_sort |
turkish music genre classification using audio and lyrics features |
publisher |
Suleyman Demirel University |
series |
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi |
issn |
1308-6529 |
publishDate |
2017-05-01 |
description |
<p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. Meta-data information, lyrics, or melodic content of music are used as feature resource in previous works. However, lyrics do not often used in MIR systems and the number of works in this field is not enough especially for Turkish. In this paper, firstly, we have extended our previously created Turkish MIR (TMIR) dataset, which comprises of Turkish lyrics, by including the audio file of each song. Secondly, we have investigated the effect of using audio and textual features together or separately on automatic Music Genre Classification (MGC). We have extracted textual features from lyrics using different feature extraction models such as word2vec and traditional Bag of Words. We have conducted our experiments on Support Vector Machine (SVM) algorithm and analysed the impact of feature selection and different feature groups on MGC. We have considered lyrics based MGC as a text classification task and also investigated the effect of term weighting method. Experimental results show that textual features can also be effective as well as audio features for Turkish MGC, especially when a supervised term weighting method is employed. We have achieved the highest success rate as 99,12\% by using both audio and textual features together.</p> |
topic |
Music genre classification Lyrics analysis Word2vec Audio classification Machine learning |
url |
http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681 |
work_keys_str_mv |
AT ondercoban turkishmusicgenreclassificationusingaudioandlyricsfeatures |
_version_ |
1725366458773405696 |