Turkish Music Genre Classification using Audio and Lyrics Features

<p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. M...

Full description

Bibliographic Details
Main Author: Önder ÇOBAN
Format: Article
Language:English
Published: Suleyman Demirel University 2017-05-01
Series:Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi
Subjects:
Online Access:http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681
id doaj-0960226bd33248acbdd73b9f4c63358c
record_format Article
spelling doaj-0960226bd33248acbdd73b9f4c63358c2020-11-25T00:20:36ZengSuleyman Demirel UniversitySüleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi1308-65292017-05-0121232233110.19113/sdufbed.883035000174261Turkish Music Genre Classification using Audio and Lyrics FeaturesÖnder ÇOBAN0Çukurova Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü<p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. Meta-data information, lyrics, or melodic content of music are used as feature resource in previous works. However, lyrics do not often used in MIR systems and the number of works in this field is not enough especially for Turkish. In this paper, firstly, we have extended our previously created Turkish MIR (TMIR) dataset, which comprises of Turkish lyrics, by including the audio file of each song. Secondly, we have investigated the effect of using audio and textual features together or separately on automatic Music Genre Classification (MGC). We have extracted textual features from lyrics using different feature extraction models such as word2vec and traditional Bag of Words. We have conducted our experiments on Support Vector Machine (SVM) algorithm and analysed the impact of feature selection and different feature groups on MGC. We have considered lyrics based MGC as a text classification task and also investigated the effect of term weighting method. Experimental results show that textual features can also be effective as well as audio features for Turkish MGC, especially when a supervised term weighting method is employed. We have achieved the highest success rate as 99,12\% by using both audio and textual features together.</p>http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681Music genre classificationLyrics analysisWord2vecAudio classificationMachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Önder ÇOBAN
spellingShingle Önder ÇOBAN
Turkish Music Genre Classification using Audio and Lyrics Features
Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi
Music genre classification
Lyrics analysis
Word2vec
Audio classification
Machine learning
author_facet Önder ÇOBAN
author_sort Önder ÇOBAN
title Turkish Music Genre Classification using Audio and Lyrics Features
title_short Turkish Music Genre Classification using Audio and Lyrics Features
title_full Turkish Music Genre Classification using Audio and Lyrics Features
title_fullStr Turkish Music Genre Classification using Audio and Lyrics Features
title_full_unstemmed Turkish Music Genre Classification using Audio and Lyrics Features
title_sort turkish music genre classification using audio and lyrics features
publisher Suleyman Demirel University
series Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi
issn 1308-6529
publishDate 2017-05-01
description <p>Music Information Retrieval (MIR) has become a popular research area in recent years. In this context, researchers have developed music information systems to find solutions for such major problems as automatic playlist creation, hit song detection, and music genre or mood classification. Meta-data information, lyrics, or melodic content of music are used as feature resource in previous works. However, lyrics do not often used in MIR systems and the number of works in this field is not enough especially for Turkish. In this paper, firstly, we have extended our previously created Turkish MIR (TMIR) dataset, which comprises of Turkish lyrics, by including the audio file of each song. Secondly, we have investigated the effect of using audio and textual features together or separately on automatic Music Genre Classification (MGC). We have extracted textual features from lyrics using different feature extraction models such as word2vec and traditional Bag of Words. We have conducted our experiments on Support Vector Machine (SVM) algorithm and analysed the impact of feature selection and different feature groups on MGC. We have considered lyrics based MGC as a text classification task and also investigated the effect of term weighting method. Experimental results show that textual features can also be effective as well as audio features for Turkish MGC, especially when a supervised term weighting method is employed. We have achieved the highest success rate as 99,12\% by using both audio and textual features together.</p>
topic Music genre classification
Lyrics analysis
Word2vec
Audio classification
Machine learning
url http://dergipark.ulakbim.gov.tr/sdufenbed/article/view/5000210681
work_keys_str_mv AT ondercoban turkishmusicgenreclassificationusingaudioandlyricsfeatures
_version_ 1725366458773405696