Lightweight End-to-End Deep Learning Model for Music Source Separation
碩士 === 國立中央大學 === 資訊工程學系 === 107 === DNNs(Deep neural networks) have made rapid progress in the field of audio processing. In the past, most of them used spectrum information via STFT (Short Term Fourier Transform), but them usually only deal with real parts. In recent years, in order to avoid the i...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/2gq2x6 |
id |
ndltd-TW-107NCU05392129 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NCU053921292019-10-22T05:28:14Z http://ndltd.ncl.edu.tw/handle/2gq2x6 Lightweight End-to-End Deep Learning Model for Music Source Separation 端到端輕量化音樂源分離深度學習模型 Yao-Ting Wang 王耀霆 碩士 國立中央大學 資訊工程學系 107 DNNs(Deep neural networks) have made rapid progress in the field of audio processing. In the past, most of them used spectrum information via STFT (Short Term Fourier Transform), but them usually only deal with real parts. In recent years, in order to avoid the information loss caused by the lack of consideration of complex value, deep learning models have gradually been proposed for audio source separation based on time domain for end-to-end processing. However, those models are huge, i.e., the number of parameters is very large. Therefore, it is difficult to use them where the computing resources of the device is limited. On the other hand, it generally takes a long term input to obtain a good result for separation, which represents high delay. It is less helpful for some applications that require low latency. Based on the previous research, this thesis proposes a lightweight end-to-end music source separation deep learning model. To reduce the number of parameters and accelerate the computation, and then propose a novel decoder that can further enhance the result of separation while the input context length is limited. The experimental results show that the method proposed in this paper can obtain better than the previous results by only uses 10% or less parameters. Jia-Ching Wang 王家慶 2019 學位論文 ; thesis 61 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊工程學系 === 107 === DNNs(Deep neural networks) have made rapid progress in the field of audio processing. In the past, most of them used spectrum information via STFT (Short Term Fourier Transform), but them usually only deal with real parts. In recent years, in order to avoid the information loss caused by the lack of consideration of complex value, deep learning models have gradually been proposed for audio source separation based on time domain for end-to-end processing. However, those models are huge, i.e., the number of parameters is very large. Therefore, it is difficult to use them where the computing resources of the device is limited. On the other hand, it generally takes a long term input to obtain a good result for separation, which represents high delay. It is less helpful for some applications that require low latency.
Based on the previous research, this thesis proposes a lightweight end-to-end music source separation deep learning model. To reduce the number of parameters and accelerate the computation, and then propose a novel decoder that can further enhance the result of separation while the input context length is limited. The experimental results show that the method proposed in this paper can obtain better than the previous results by only uses 10% or less parameters.
|
author2 |
Jia-Ching Wang |
author_facet |
Jia-Ching Wang Yao-Ting Wang 王耀霆 |
author |
Yao-Ting Wang 王耀霆 |
spellingShingle |
Yao-Ting Wang 王耀霆 Lightweight End-to-End Deep Learning Model for Music Source Separation |
author_sort |
Yao-Ting Wang |
title |
Lightweight End-to-End Deep Learning Model for Music Source Separation |
title_short |
Lightweight End-to-End Deep Learning Model for Music Source Separation |
title_full |
Lightweight End-to-End Deep Learning Model for Music Source Separation |
title_fullStr |
Lightweight End-to-End Deep Learning Model for Music Source Separation |
title_full_unstemmed |
Lightweight End-to-End Deep Learning Model for Music Source Separation |
title_sort |
lightweight end-to-end deep learning model for music source separation |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/2gq2x6 |
work_keys_str_mv |
AT yaotingwang lightweightendtoenddeeplearningmodelformusicsourceseparation AT wángyàotíng lightweightendtoenddeeplearningmodelformusicsourceseparation AT yaotingwang duāndàoduānqīngliànghuàyīnlèyuánfēnlíshēndùxuéxímóxíng AT wángyàotíng duāndàoduānqīngliànghuàyīnlèyuánfēnlíshēndùxuéxímóxíng |
_version_ |
1719274240746717184 |