Radial Basis Function Networks for Conversion of Sound Spectra
In many advanced signal processing tasks, such as pitch shifting, voice conversion or sound synthesis, accurate spectral processing is required. Here, the use of Radial Basis Function Networks (RBFN) is proposed for the modeling of the spectral changes (or conversions) related to the control of impo...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2001-03-01
|
Series: | EURASIP Journal on Advances in Signal Processing |
Subjects: | |
Online Access: | http://dx.doi.org/10.1155/S1110865701000117 |
id |
doaj-3228ab3672ce4eb7868419a5f0f34a3e |
---|---|
record_format |
Article |
spelling |
doaj-3228ab3672ce4eb7868419a5f0f34a3e2020-11-25T01:06:02ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802001-03-0120011364410.1155/S1687617201000117Radial Basis Function Networks for Conversion of Sound SpectraCarlo DrioliIn many advanced signal processing tasks, such as pitch shifting, voice conversion or sound synthesis, accurate spectral processing is required. Here, the use of Radial Basis Function Networks (RBFN) is proposed for the modeling of the spectral changes (or conversions) related to the control of important sound parameters, such as pitch or intensity. The identification of such conversion functions is based on a procedure which learns the shape of the conversion from few couples of target spectra from a data set. The generalization properties of RBFNs provides for interpolation with respect to the pitch range. In the construction of the training set, mel-cepstral encoding of the spectrum is used to catch the perceptually most relevant spectral changes. Moreover, a singular value decomposition (SVD) approach is used to reduce the dimension of conversion functions. The RBFN conversion functions introduced are characterized by a perceptually-based fast training procedure, desirable interpolation properties and computational efficiency.http://dx.doi.org/10.1155/S1110865701000117sound transformationssinusoidal representationRBFNsspectral processing. |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Carlo Drioli |
spellingShingle |
Carlo Drioli Radial Basis Function Networks for Conversion of Sound Spectra EURASIP Journal on Advances in Signal Processing sound transformations sinusoidal representation RBFNs spectral processing. |
author_facet |
Carlo Drioli |
author_sort |
Carlo Drioli |
title |
Radial Basis Function Networks for Conversion of Sound Spectra |
title_short |
Radial Basis Function Networks for Conversion of Sound Spectra |
title_full |
Radial Basis Function Networks for Conversion of Sound Spectra |
title_fullStr |
Radial Basis Function Networks for Conversion of Sound Spectra |
title_full_unstemmed |
Radial Basis Function Networks for Conversion of Sound Spectra |
title_sort |
radial basis function networks for conversion of sound spectra |
publisher |
SpringerOpen |
series |
EURASIP Journal on Advances in Signal Processing |
issn |
1687-6172 1687-6180 |
publishDate |
2001-03-01 |
description |
In many advanced signal processing tasks, such as pitch shifting, voice conversion or sound synthesis, accurate spectral processing is required. Here, the use of Radial Basis Function Networks (RBFN) is proposed for the modeling of the spectral changes (or conversions) related to the control of important sound parameters, such as pitch or intensity. The identification of such conversion functions is based on a procedure which learns the shape of the conversion from few couples of target spectra from a data set. The generalization properties of RBFNs provides for interpolation with respect to the pitch range. In the construction of the training set, mel-cepstral encoding of the spectrum is used to catch the perceptually most relevant spectral changes. Moreover, a singular value decomposition (SVD) approach is used to reduce the dimension of conversion functions. The RBFN conversion functions introduced are characterized by a perceptually-based fast training procedure, desirable interpolation properties and computational efficiency. |
topic |
sound transformations sinusoidal representation RBFNs spectral processing. |
url |
http://dx.doi.org/10.1155/S1110865701000117 |
work_keys_str_mv |
AT carlodrioli radialbasisfunctionnetworksforconversionofsoundspectra |
_version_ |
1725191903258869760 |