Lightweight End-to-End Deep Learning Model for Music Source Separation

Master's === National Central University === Department of Computer Science and Information Engineering === 107 === Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, in order to avoid the i...

Bibliographic Details
Main Authors:	Yao-Ting Wang, 王耀霆
Other Authors:	Jia-Ching Wang
Format:	Others
Language:	zh-TW
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/2gq2x6

id	ndltd-TW-107NCU05392129
record_format	oai_dc
spelling	ndltd-TW-107NCU053921292019-10-22T05:28:14Z http://ndltd.ncl.edu.tw/handle/2gq2x6 Lightweight End-to-End Deep Learning Model for Music Source Separation 端到端輕量化音樂源分離深度學習模型 Yao-Ting Wang 王耀霆 Master's, National Central University, Department of Computer Science and Information Engineering 107 Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, to avoid the information loss caused by ignoring the complex values, deep learning models that separate audio sources end to end in the time domain have gradually been proposed. However, those models are huge, i.e., they have a very large number of parameters, so they are difficult to use on devices with limited computing resources. In addition, they generally need a long input context to separate well, which implies high delay and makes them less useful for applications that require low latency. Building on this previous research, this thesis proposes a lightweight end-to-end deep learning model for music source separation. The model reduces the number of parameters and accelerates computation, and it introduces a novel decoder that further improves the separation result when the input context length is limited. The experimental results show that the proposed method obtains better results than previous work while using 10% or fewer of the parameters. Jia-Ching Wang 王家慶 2019 thesis 61 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	Master's === National Central University === Department of Computer Science and Information Engineering === 107 === Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, to avoid the information loss caused by ignoring the complex values, deep learning models that separate audio sources end to end in the time domain have gradually been proposed. However, those models are huge, i.e., they have a very large number of parameters, so they are difficult to use on devices with limited computing resources. In addition, they generally need a long input context to separate well, which implies high delay and makes them less useful for applications that require low latency. Building on this previous research, this thesis proposes a lightweight end-to-end deep learning model for music source separation. The model reduces the number of parameters and accelerates computation, and it introduces a novel decoder that further improves the separation result when the input context length is limited. The experimental results show that the proposed method obtains better results than previous work while using 10% or fewer of the parameters.
author2	Jia-Ching Wang
author_facet	Jia-Ching Wang Yao-Ting Wang 王耀霆
author	Yao-Ting Wang 王耀霆
spellingShingle	Yao-Ting Wang 王耀霆 Lightweight End-to-End Deep Learning Model for Music Source Separation
author_sort	Yao-Ting Wang
title	Lightweight End-to-End Deep Learning Model for Music Source Separation
title_short	Lightweight End-to-End Deep Learning Model for Music Source Separation
title_full	Lightweight End-to-End Deep Learning Model for Music Source Separation
title_fullStr	Lightweight End-to-End Deep Learning Model for Music Source Separation
title_full_unstemmed	Lightweight End-to-End Deep Learning Model for Music Source Separation
title_sort	lightweight end-to-end deep learning model for music source separation
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/2gq2x6
work_keys_str_mv	AT yaotingwang lightweightendtoenddeeplearningmodelformusicsourceseparation AT wángyàotíng lightweightendtoenddeeplearningmodelformusicsourceseparation AT yaotingwang duāndàoduānqīngliànghuàyīnlèyuánfēnlíshēndùxuéxímóxíng AT wángyàotíng duāndàoduānqīngliànghuàyīnlèyuánfēnlíshēndùxuéxímóxíng
_version_	1719274240746717184

Similar Items