Supervised feature learning via sparse coding for music information rerieval

This thesis explores the ideas of feature learning and sparse coding for Music Information Retrieval (MIR). Sparse coding is an algorithm which aims to learn new feature representations from data automatically. In contrast to previous work which uses sparse coding in an MIR context the concept of su...

Full description

Bibliographic Details
Main Author:	O'Brien, Cian John
Other Authors:	Lerch, Alexander
Format:	Others
Language:	en_US
Published:	Georgia Institute of Technology 2015
Subjects:	Music Sparse coding Music information retrieval Music genre recognition Music emotion recognition
Online Access:	http://hdl.handle.net/1853/53615

id	ndltd-GATECH-oai-smartech.gatech.edu-1853-53615
record_format	oai_dc
spelling	ndltd-GATECH-oai-smartech.gatech.edu-1853-536152015-07-01T03:38:15ZSupervised feature learning via sparse coding for music information rerievalO'Brien, Cian JohnMusicSparse codingMusic information retrievalMusic genre recognitionMusic emotion recognitionThis thesis explores the ideas of feature learning and sparse coding for Music Information Retrieval (MIR). Sparse coding is an algorithm which aims to learn new feature representations from data automatically. In contrast to previous work which uses sparse coding in an MIR context the concept of supervised sparse coding is also investigated, which makes use of the ground-truth labels explicitly during the learning process. Here sparse coding and supervised coding are applied to two MIR problems: classification of musical genre and recognition of the emotional content of music. A variation of Label Consistent K-SVD is used to add supervision during the dictionary learning process. In the case of Music Genre Recognition (MGR) an additional discriminative term is added to encourage tracks from the same genre to have similar sparse codes. For Music Emotion Recognition (MER) a linear regression term is added to learn an optimal classifier and dictionary pair. These results indicate that while sparse coding performs well for MGR, the additional supervision fails to improve the performance. In the case of MER, supervised coding significantly outperforms both standard sparse coding and commonly used designed features, namely MFCC and pitch chroma.Georgia Institute of TechnologyLerch, Alexander2015-06-08T18:40:22Z2015-06-08T18:40:22Z2015-052015-04-24May 20152015-06-08T18:40:22ZThesisapplication/pdfhttp://hdl.handle.net/1853/53615en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
topic	Music Sparse coding Music information retrieval Music genre recognition Music emotion recognition
spellingShingle	Music Sparse coding Music information retrieval Music genre recognition Music emotion recognition O'Brien, Cian John Supervised feature learning via sparse coding for music information rerieval
description	This thesis explores the ideas of feature learning and sparse coding for Music Information Retrieval (MIR). Sparse coding is an algorithm which aims to learn new feature representations from data automatically. In contrast to previous work which uses sparse coding in an MIR context the concept of supervised sparse coding is also investigated, which makes use of the ground-truth labels explicitly during the learning process. Here sparse coding and supervised coding are applied to two MIR problems: classification of musical genre and recognition of the emotional content of music. A variation of Label Consistent K-SVD is used to add supervision during the dictionary learning process. In the case of Music Genre Recognition (MGR) an additional discriminative term is added to encourage tracks from the same genre to have similar sparse codes. For Music Emotion Recognition (MER) a linear regression term is added to learn an optimal classifier and dictionary pair. These results indicate that while sparse coding performs well for MGR, the additional supervision fails to improve the performance. In the case of MER, supervised coding significantly outperforms both standard sparse coding and commonly used designed features, namely MFCC and pitch chroma.
author2	Lerch, Alexander
author_facet	Lerch, Alexander O'Brien, Cian John
author	O'Brien, Cian John
author_sort	O'Brien, Cian John
title	Supervised feature learning via sparse coding for music information rerieval
title_short	Supervised feature learning via sparse coding for music information rerieval
title_full	Supervised feature learning via sparse coding for music information rerieval
title_fullStr	Supervised feature learning via sparse coding for music information rerieval
title_full_unstemmed	Supervised feature learning via sparse coding for music information rerieval
title_sort	supervised feature learning via sparse coding for music information rerieval
publisher	Georgia Institute of Technology
publishDate	2015
url	http://hdl.handle.net/1853/53615
work_keys_str_mv	AT obriencianjohn supervisedfeaturelearningviasparsecodingformusicinformationrerieval
_version_	1716806599645855744

Supervised feature learning via sparse coding for music information rerieval

Similar Items