Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations

碩士 === 國立交通大學 === 工學院聲音與音樂創意科技碩士學位學程 === 103 === Music is the spice of human life. In recent years, a research field called Music Information Retrieval (MIR) springs up with advances in technology and needs of listener. Automatic music genre recognition is one of the classical issues in the field. In...

Full description

Bibliographic Details
Main Authors: Lin, Chih-Shan, 林至善
Other Authors: 冀泰石
Format: Others
Language:zh-TW
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/11633531873609859029
id ndltd-TW-103NCTU5248009
record_format oai_dc
spelling ndltd-TW-103NCTU52480092016-08-28T04:12:40Z http://ndltd.ncl.edu.tw/handle/11633531873609859029 Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations 使用時頻變化調變之稀疏編碼於自動音樂曲風辨識 Lin, Chih-Shan 林至善 碩士 國立交通大學 工學院聲音與音樂創意科技碩士學位學程 103 Music is the spice of human life. In recent years, a research field called Music Information Retrieval (MIR) springs up with advances in technology and needs of listener. Automatic music genre recognition is one of the classical issues in the field. In this thesis, we assume that a specify music instrument with a specific playing style forms a specific spectral pattern on a spectrogram. Then we consider a music spectrogram as the composition of many specify spectral patterns. We believe that the proportion of spectral patterns can be discriminative among music genre. We use short-time Fourier transform spectrogram and spectral-temporal modulation feature as spectral pattern descriptors. These descriptors are represented as the composition of many specify spectral patterns through dictionary learning and sparse coding and used for classifier training. In addition, auditory spectrogram, constant-Q transform spectrogram and corresponding spectral-temporal modulation feature are also used in the experiments. The result shows that systems based on constant-Q transform-based modulation feature performs better than conventional one which usually based on short-time Fourier transform spectrogram. 冀泰石 2014 學位論文 ; thesis 41 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 工學院聲音與音樂創意科技碩士學位學程 === 103 === Music is the spice of human life. In recent years, a research field called Music Information Retrieval (MIR) springs up with advances in technology and needs of listener. Automatic music genre recognition is one of the classical issues in the field. In this thesis, we assume that a specify music instrument with a specific playing style forms a specific spectral pattern on a spectrogram. Then we consider a music spectrogram as the composition of many specify spectral patterns. We believe that the proportion of spectral patterns can be discriminative among music genre. We use short-time Fourier transform spectrogram and spectral-temporal modulation feature as spectral pattern descriptors. These descriptors are represented as the composition of many specify spectral patterns through dictionary learning and sparse coding and used for classifier training. In addition, auditory spectrogram, constant-Q transform spectrogram and corresponding spectral-temporal modulation feature are also used in the experiments. The result shows that systems based on constant-Q transform-based modulation feature performs better than conventional one which usually based on short-time Fourier transform spectrogram.
author2 冀泰石
author_facet 冀泰石
Lin, Chih-Shan
林至善
author Lin, Chih-Shan
林至善
spellingShingle Lin, Chih-Shan
林至善
Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
author_sort Lin, Chih-Shan
title Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
title_short Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
title_full Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
title_fullStr Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
title_full_unstemmed Sparse Coding based Music Genre Classification using Spectro-Temporal Modulations
title_sort sparse coding based music genre classification using spectro-temporal modulations
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/11633531873609859029
work_keys_str_mv AT linchihshan sparsecodingbasedmusicgenreclassificationusingspectrotemporalmodulations
AT línzhìshàn sparsecodingbasedmusicgenreclassificationusingspectrotemporalmodulations
AT linchihshan shǐyòngshípínbiànhuàdiàobiànzhīxīshūbiānmǎyúzìdòngyīnlèqūfēngbiànshí
AT línzhìshàn shǐyòngshípínbiànhuàdiàobiànzhīxīshūbiānmǎyúzìdòngyīnlèqūfēngbiànshí
_version_ 1718381048042094592