Score Following as a Multi-Modal Reinforcement Learning Problem

Score following is the process of tracking a musical performance (audio) in a corresponding symbolic representation (score). While methods using computer-readable score representations as input are able to achieve reliable tracking results, there is little research on score following based on raw sc...

Full description

Bibliographic Details
Main Authors:	Florian Henkel, Stefan Balke, Matthias Dorfer, Gerhard Widmer
Format:	Article
Language:	English
Published:	Ubiquity Press 2019-11-01
Series:	Transactions of the International Society for Music Information Retrieval
Subjects:	reinforcement learning score following sheet music
Online Access:	https://transactions.ismir.net/articles/31

id	doaj-60030e4fd1bf4eaf8f13df2c08f123ac
record_format	Article
spelling	doaj-60030e4fd1bf4eaf8f13df2c08f123ac2020-11-25T03:07:56ZengUbiquity PressTransactions of the International Society for Music Information Retrieval2514-32982019-11-012110.5334/tismir.3117Score Following as a Multi-Modal Reinforcement Learning ProblemFlorian Henkel0Stefan Balke1Matthias Dorfer2Gerhard Widmer3Johannes Kepler University LinzJohannes Kepler University LinzJohannes Kepler University LinzJohannes Kepler University Linz; Austrian Research Inst. for Artificial Intelligence, ViennaScore following is the process of tracking a musical performance (audio) in a corresponding symbolic representation (score). While methods using computer-readable score representations as input are able to achieve reliable tracking results, there is little research on score following based on raw score images. In this paper, we build on previous work that formulates the score following task as a multi-modal Markov Decision Process (MDP). Given this formal definition, one can address the problem of score following with state-of-the-art deep reinforcement learning (RL) algorithms. In particular, we design end-to-end multi-modal RL agents that simultaneously learn to listen to music recordings, read the scores from images of sheet music, and follow the music along in the sheet. Using algorithms such as synchronous Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO), we reproduce and further improve existing results. We also present first experiments indicating that this approach can be extended to track real piano recordings of human performances. These audio recordings are made openly available to the research community, along with precise note-level alignment ground truth.https://transactions.ismir.net/articles/31reinforcement learningscore followingsheet music
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Florian Henkel Stefan Balke Matthias Dorfer Gerhard Widmer
spellingShingle	Florian Henkel Stefan Balke Matthias Dorfer Gerhard Widmer Score Following as a Multi-Modal Reinforcement Learning Problem Transactions of the International Society for Music Information Retrieval reinforcement learning score following sheet music
author_facet	Florian Henkel Stefan Balke Matthias Dorfer Gerhard Widmer
author_sort	Florian Henkel
title	Score Following as a Multi-Modal Reinforcement Learning Problem
title_short	Score Following as a Multi-Modal Reinforcement Learning Problem
title_full	Score Following as a Multi-Modal Reinforcement Learning Problem
title_fullStr	Score Following as a Multi-Modal Reinforcement Learning Problem
title_full_unstemmed	Score Following as a Multi-Modal Reinforcement Learning Problem
title_sort	score following as a multi-modal reinforcement learning problem
publisher	Ubiquity Press
series	Transactions of the International Society for Music Information Retrieval
issn	2514-3298
publishDate	2019-11-01
description	Score following is the process of tracking a musical performance (audio) in a corresponding symbolic representation (score). While methods using computer-readable score representations as input are able to achieve reliable tracking results, there is little research on score following based on raw score images. In this paper, we build on previous work that formulates the score following task as a multi-modal Markov Decision Process (MDP). Given this formal definition, one can address the problem of score following with state-of-the-art deep reinforcement learning (RL) algorithms. In particular, we design end-to-end multi-modal RL agents that simultaneously learn to listen to music recordings, read the scores from images of sheet music, and follow the music along in the sheet. Using algorithms such as synchronous Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO), we reproduce and further improve existing results. We also present first experiments indicating that this approach can be extended to track real piano recordings of human performances. These audio recordings are made openly available to the research community, along with precise note-level alignment ground truth.
topic	reinforcement learning score following sheet music
url	https://transactions.ismir.net/articles/31
work_keys_str_mv	AT florianhenkel scorefollowingasamultimodalreinforcementlearningproblem AT stefanbalke scorefollowingasamultimodalreinforcementlearningproblem AT matthiasdorfer scorefollowingasamultimodalreinforcementlearningproblem AT gerhardwidmer scorefollowingasamultimodalreinforcementlearningproblem
_version_	1724668223480135680

Score Following as a Multi-Modal Reinforcement Learning Problem

Similar Items