Automatic transcription of polyphonic music exploiting temporal evolution
Automatic music transcription is the process of converting an audio recording into a symbolic representation using musical notation. It has numerous applications in music information retrieval, computational musicology, and the creation of interactive systems. Even for expert musicians, transcribing polyphonic music is not a trivial task, and while automatic pitch estimation for monophonic signals is considered a solved problem, building an automated system that can transcribe polyphonic music without restrictions on the degree of polyphony or the instrument types remains open. In this thesis, research on automatic transcription is performed by explicitly incorporating information on the temporal evolution of sounds. First efforts address the problem through signal processing techniques, proposing audio features that capture temporal characteristics; techniques for note onset and offset detection are also employed to improve transcription performance. Subsequent approaches propose transcription models based on shift-invariant probabilistic latent component analysis (SI-PLCA), modelling the temporal evolution of notes in a multiple-instrument setting and supporting frequency modulations in produced notes. Datasets and annotations for transcription research have also been created during this work. The proposed systems have been evaluated both privately and publicly within the Music Information Retrieval Evaluation eXchange (MIREX) framework, where they have been shown to outperform several state-of-the-art transcription approaches. The developed techniques have also been applied to other music technology tasks, such as key modulation detection, temperament estimation, and automatic piano tutoring. Finally, the proposed transcription models have been utilised in a wider context, namely for modelling acoustic scenes.
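At the core of the SI-PLCA models mentioned in the abstract is the factorisation of a time-frequency representation into spectral templates and per-component temporal activations. The sketch below is a minimal illustration of plain (non-shift-invariant) PLCA fitted with EM in NumPy, not the thesis's actual shift-invariant, multi-instrument formulation; the function name, component count, and random test input are assumptions introduced here for illustration only.

```python
# Minimal sketch of plain (non-shift-invariant) PLCA on a magnitude spectrogram.
# Illustrative only: the thesis extends this idea to shift-invariant PLCA with
# temporal constraints; names and the toy input below are assumptions.
import numpy as np

def plca(V, n_components=8, n_iter=200, seed=0):
    """Factorise V (freq x time) as P(f,t) = sum_z P(z) P(f|z) P(t|z) via EM."""
    rng = np.random.default_rng(seed)
    V = V / V.sum()                      # treat the spectrogram as a distribution
    F, T = V.shape
    Pf_z = rng.random((F, n_components)); Pf_z /= Pf_z.sum(axis=0, keepdims=True)
    Pt_z = rng.random((T, n_components)); Pt_z /= Pt_z.sum(axis=0, keepdims=True)
    Pz = np.full(n_components, 1.0 / n_components)
    for _ in range(n_iter):
        # E-step: posterior P(z|f,t), shape (F, T, Z)
        joint = Pf_z[:, None, :] * Pt_z[None, :, :] * Pz[None, None, :]
        post = joint / (joint.sum(axis=2, keepdims=True) + 1e-12)
        # M-step: reweight the posterior by the observed distribution V
        w = V[:, :, None] * post
        Pf_z = w.sum(axis=1); Pf_z /= Pf_z.sum(axis=0, keepdims=True) + 1e-12
        Pt_z = w.sum(axis=0); Pt_z /= Pt_z.sum(axis=0, keepdims=True) + 1e-12
        Pz = w.sum(axis=(0, 1)); Pz /= Pz.sum() + 1e-12
    return Pf_z, Pt_z, Pz

# Toy usage on a random "spectrogram"; a real input would be an STFT/CQT magnitude.
V = np.abs(np.random.default_rng(1).standard_normal((100, 50)))
templates, activations, weights = plca(V, n_components=4)
```

In a transcription setting, the columns of P(f|z) act as spectral templates (e.g. one per pitch) and P(t|z) gives their activations over time; the shift-invariant variant additionally lets templates shift along a log-frequency axis to model tuning deviations and frequency modulations.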
Main Author: | Benetos, Emmanouil |
---|---|
Published: | Queen Mary, University of London, 2012 |
Subjects: | 621.382 Electronic Engineering |
Online Access: | https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.566635 |
Record ID: | ndltd-bl.uk-oai-ethos.bl.uk-566635 |
---|---|
Record Format: | oai_dc |
Collection: | NDLTD |
Item Type: | Electronic Thesis or Dissertation |
Repository Record: | http://qmro.qmul.ac.uk/xmlui/handle/123456789/3368 |