Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
Master's === National Chiao Tung University === Master Program of Sound and Music Innovative Technologies, College of Engineering === 103 === Music is the spice of human life. In recent years, the research field of Music Information Retrieval (MIR) has emerged, driven by advances in technology and the needs of listeners. Automatic music genre recognition is one of the classic problems in the field. In...
Main Authors: | Lin, Chih-Shan, 林至善 |
---|---|
Other Authors: | 冀泰石 |
Format: | Others |
Language: | zh-TW |
Published: | 2014 |
Online Access: | http://ndltd.ncl.edu.tw/handle/11633531873609859029 |
id |
ndltd-TW-103NCTU5248009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NCTU5248009 2016-08-28T04:12:40Z http://ndltd.ncl.edu.tw/handle/11633531873609859029 Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations 使用時頻變化調變之稀疏編碼於自動音樂曲風辨識 Lin, Chih-Shan 林至善 Master's National Chiao Tung University Master Program of Sound and Music Innovative Technologies, College of Engineering 103 Music is the spice of human life. In recent years, the research field of Music Information Retrieval (MIR) has emerged, driven by advances in technology and the needs of listeners. Automatic music genre recognition is one of the classic problems in the field. In this thesis, we assume that a specific musical instrument played in a specific style forms a specific spectral pattern on a spectrogram. We then treat a music spectrogram as a composition of many such spectral patterns, and we hypothesize that the proportions of these patterns are discriminative across music genres. We use the short-time Fourier transform (STFT) spectrogram and spectro-temporal modulation features as spectral pattern descriptors. Through dictionary learning and sparse coding, these descriptors are represented as compositions of many specific spectral patterns, and the resulting representations are used for classifier training. In addition, the auditory spectrogram, the constant-Q transform (CQT) spectrogram, and their corresponding spectro-temporal modulation features are also used in the experiments. The results show that systems based on CQT-derived modulation features perform better than conventional systems, which are usually based on the STFT spectrogram. 冀泰石 2014 thesis 41 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Chiao Tung University === Master Program of Sound and Music Innovative Technologies, College of Engineering === 103 === Music is the spice of human life. In recent years, the research field of Music Information Retrieval (MIR) has emerged, driven by advances in technology and the needs of listeners.
Automatic music genre recognition is one of the classic problems in the field. In this thesis, we assume that a specific musical instrument played in a specific style forms a specific spectral pattern on a spectrogram. We then treat a music spectrogram as a composition of many such spectral patterns, and we hypothesize that the proportions of these patterns are discriminative across music genres. We use the short-time Fourier transform (STFT) spectrogram and spectro-temporal modulation features as spectral pattern descriptors. Through dictionary learning and sparse coding, these descriptors are represented as compositions of many specific spectral patterns, and the resulting representations are used for classifier training. In addition, the auditory spectrogram, the constant-Q transform (CQT) spectrogram, and their corresponding spectro-temporal modulation features are also used in the experiments. The results show that systems based on CQT-derived modulation features perform better than conventional systems, which are usually based on the STFT spectrogram.
|
author2 |
冀泰石 |
author_facet |
冀泰石 Lin, Chih-Shan 林至善 |
author |
Lin, Chih-Shan 林至善 |
spellingShingle |
Lin, Chih-Shan 林至善 Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
author_sort |
Lin, Chih-Shan |
title |
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
title_short |
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
title_full |
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
title_fullStr |
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
title_full_unstemmed |
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations |
title_sort |
sparse coding based music genre classification using spectro-temporal modulations |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/11633531873609859029 |
work_keys_str_mv |
AT linchihshan sparsecodingbasedmusicgenreclassificationusingspectrotemporalmodulations AT línzhìshàn sparsecodingbasedmusicgenreclassificationusingspectrotemporalmodulations AT linchihshan shǐyòngshípínbiànhuàdiàobiànzhīxīshūbiānmǎyúzìdòngyīnlèqūfēngbiànshí AT línzhìshàn shǐyòngshípínbiànhuàdiàobiànzhīxīshūbiānmǎyúzìdòngyīnlèqūfēngbiànshí |
_version_ |
1718381048042094592 |
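
The abstract in this record describes representing spectral descriptors as compositions of dictionary atoms and using the proportions of those atoms as genre features. The following is a minimal illustrative sketch of that idea only, not the thesis's implementation. It assumes a dictionary that has already been learned (an orthonormal random matrix stands in for it here), substitutes plain matching pursuit for whatever sparse coder the thesis used, and omits dictionary learning and classifier training. All function names (`normalize_columns`, `matching_pursuit`, `pattern_proportions`) and parameter values are hypothetical.

```python
import numpy as np


def normalize_columns(D):
    """Scale each dictionary atom (column of D) to unit L2 norm."""
    norms = np.linalg.norm(D, axis=0, keepdims=True)
    return D / np.maximum(norms, 1e-12)


def matching_pursuit(D, x, n_nonzero):
    """Greedy sparse coding: approximate x using at most n_nonzero atoms of D.

    Assumes the columns of D have unit norm.
    """
    residual = x.astype(float)
    code = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        correlations = D.T @ residual
        k = int(np.argmax(np.abs(correlations)))  # best-matching atom
        code[k] += correlations[k]
        residual = residual - correlations[k] * D[:, k]
    return code


def pattern_proportions(D, patches, n_nonzero=3):
    """Pool absolute sparse codes over all patches into a normalized histogram.

    The result is one feature vector per recording: the share of total
    activation assigned to each dictionary atom ("spectral pattern").
    """
    codes = np.stack([matching_pursuit(D, p, n_nonzero) for p in patches])
    usage = np.abs(codes).sum(axis=0)
    total = usage.sum()
    return usage / total if total > 0 else usage


# Toy data (assumption): 8 orthonormal atoms of dimension 16 stand in for a
# learned dictionary; each "patch" mixes atom 0 and atom 1.
rng = np.random.default_rng(0)
D = normalize_columns(np.linalg.qr(rng.standard_normal((16, 8)))[0])
patches = [2.0 * D[:, 0] + 1.0 * D[:, 1] for _ in range(5)]
features = pattern_proportions(D, patches, n_nonzero=2)
print(features.round(3))
```

In a full pipeline, one such proportion vector per recording would be passed to a classifier; the toy patches above merely stand in for spectrogram or modulation-feature descriptors.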