Lightweight End-to-End Deep Learning Model for Music Source Separation

Master's === National Central University === Department of Computer Science and Information Engineering === 107 === Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, in order to avoid the i...

Bibliographic Details
Main Authors: Yao-Ting Wang, 王耀霆
Other Authors: Jia-Ching Wang
Format: Others
Language: zh-TW
Published: 2019
Online Access: http://ndltd.ncl.edu.tw/handle/2gq2x6
id ndltd-TW-107NCU05392129
record_format oai_dc
spelling ndltd-TW-107NCU05392129 2019-10-22T05:28:14Z http://ndltd.ncl.edu.tw/handle/2gq2x6 Lightweight End-to-End Deep Learning Model for Music Source Separation 端到端輕量化音樂源分離深度學習模型 Yao-Ting Wang 王耀霆 Master's National Central University Department of Computer Science and Information Engineering 107 Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, to avoid the information loss caused by ignoring the complex values, deep learning models that perform audio source separation end-to-end in the time domain have gradually been proposed. However, those models are huge, i.e., the number of parameters is very large, so they are difficult to use where the computing resources of the device are limited. In addition, they generally need a long input to obtain a good separation result, which implies high delay and makes them less helpful for applications that require low latency. Building on previous research, this thesis proposes a lightweight end-to-end deep learning model for music source separation that reduces the number of parameters and accelerates the computation, and then proposes a novel decoder that can further enhance the separation result when the input context length is limited. The experimental results show that the proposed method obtains better results than previous work while using only 10% or fewer of the parameters. Jia-Ching Wang 王家慶 2019 學位論文 ; thesis 61 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Central University === Department of Computer Science and Information Engineering === 107 === Deep neural networks (DNNs) have made rapid progress in the field of audio processing. In the past, most of them used spectral information obtained via the STFT (short-time Fourier transform), but they usually dealt only with the real part. In recent years, to avoid the information loss caused by ignoring the complex values, deep learning models that perform audio source separation end-to-end in the time domain have gradually been proposed. However, those models are huge, i.e., the number of parameters is very large, so they are difficult to use where the computing resources of the device are limited. In addition, they generally need a long input to obtain a good separation result, which implies high delay and makes them less helpful for applications that require low latency. Building on previous research, this thesis proposes a lightweight end-to-end deep learning model for music source separation that reduces the number of parameters and accelerates the computation, and then proposes a novel decoder that can further enhance the separation result when the input context length is limited. The experimental results show that the proposed method obtains better results than previous work while using only 10% or fewer of the parameters.
author2 Jia-Ching Wang
author_facet Jia-Ching Wang
Yao-Ting Wang
王耀霆
author Yao-Ting Wang
王耀霆
spellingShingle Yao-Ting Wang
王耀霆
Lightweight End-to-End Deep Learning Model for Music Source Separation
author_sort Yao-Ting Wang
title Lightweight End-to-End Deep Learning Model for Music Source Separation
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/2gq2x6