Automated spectroscopic modelling with optimised convolutional neural networks

Abstract Convolutional neural networks (CNN) for spectroscopic modelling are currently tuned manually, and the effects of their hyperparameters are not analysed. These can result in sub-optimal models. Here, we propose an approach to tune one-dimensional CNN (1D-CNNs) automatically. It consists of a...

Full description

Bibliographic Details
Main Authors: Zefang Shen, R. A. Viscarra Rossel
Format: Article
Language:English
Published: Nature Publishing Group 2021-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-80486-9
id doaj-9041605288ac4d19a797da78cc01ff0b
record_format Article
spelling doaj-9041605288ac4d19a797da78cc01ff0b2021-01-10T12:47:17ZengNature Publishing GroupScientific Reports2045-23222021-01-0111111210.1038/s41598-020-80486-9Automated spectroscopic modelling with optimised convolutional neural networksZefang Shen0R. A. Viscarra Rossel1Soil and Landscape Science, School of Molecular and Life Sciences, Curtin UniversitySoil and Landscape Science, School of Molecular and Life Sciences, Curtin UniversityAbstract Convolutional neural networks (CNN) for spectroscopic modelling are currently tuned manually, and the effects of their hyperparameters are not analysed. These can result in sub-optimal models. Here, we propose an approach to tune one-dimensional CNN (1D-CNNs) automatically. It consists of a parametric representation of 1D-CNNs and an optimisation of hyperparameters to maximise a model’s performance. We used a large European soil spectroscopic database to demonstrate our approach for estimating soil organic carbon (SOC) contents. To assess the optimisation, we compared it to random search, and to understand the effects of the hyperparameters, we calculated their importance using functional Analysis of Variance. Compared to random search, the optimisation produced better final results and showed faster convergence. The optimal model produced the most accurate estimates of SOC with $$\hbox {RMSE} = 9.67 \pm 0.51$$ RMSE = 9.67 ± 0.51 (s.d.) and $${R}^2 = 0.89 \pm 0.013$$ R 2 = 0.89 ± 0.013 (s.d.). The hyperparameters associated with model training and architecture critically affected the model’s performance, while those related to the spectral preprocessing had little effect. The optimisation searched through a complex hyperparameter space and returned an optimal 1D-CNN. Our approach simplified the development of 1D-CNNs for spectroscopic modelling by automatically selecting hyperparameters and preprocessing methods. Hyperparameter importance analysis shed light on the tuning process and increased the model’s reliability.https://doi.org/10.1038/s41598-020-80486-9
collection DOAJ
language English
format Article
sources DOAJ
author Zefang Shen
R. A. Viscarra Rossel
spellingShingle Zefang Shen
R. A. Viscarra Rossel
Automated spectroscopic modelling with optimised convolutional neural networks
Scientific Reports
author_facet Zefang Shen
R. A. Viscarra Rossel
author_sort Zefang Shen
title Automated spectroscopic modelling with optimised convolutional neural networks
title_short Automated spectroscopic modelling with optimised convolutional neural networks
title_full Automated spectroscopic modelling with optimised convolutional neural networks
title_fullStr Automated spectroscopic modelling with optimised convolutional neural networks
title_full_unstemmed Automated spectroscopic modelling with optimised convolutional neural networks
title_sort automated spectroscopic modelling with optimised convolutional neural networks
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-01-01
description Abstract Convolutional neural networks (CNN) for spectroscopic modelling are currently tuned manually, and the effects of their hyperparameters are not analysed. These can result in sub-optimal models. Here, we propose an approach to tune one-dimensional CNN (1D-CNNs) automatically. It consists of a parametric representation of 1D-CNNs and an optimisation of hyperparameters to maximise a model’s performance. We used a large European soil spectroscopic database to demonstrate our approach for estimating soil organic carbon (SOC) contents. To assess the optimisation, we compared it to random search, and to understand the effects of the hyperparameters, we calculated their importance using functional Analysis of Variance. Compared to random search, the optimisation produced better final results and showed faster convergence. The optimal model produced the most accurate estimates of SOC with $$\hbox {RMSE} = 9.67 \pm 0.51$$ RMSE = 9.67 ± 0.51 (s.d.) and $${R}^2 = 0.89 \pm 0.013$$ R 2 = 0.89 ± 0.013 (s.d.). The hyperparameters associated with model training and architecture critically affected the model’s performance, while those related to the spectral preprocessing had little effect. The optimisation searched through a complex hyperparameter space and returned an optimal 1D-CNN. Our approach simplified the development of 1D-CNNs for spectroscopic modelling by automatically selecting hyperparameters and preprocessing methods. Hyperparameter importance analysis shed light on the tuning process and increased the model’s reliability.
url https://doi.org/10.1038/s41598-020-80486-9
work_keys_str_mv AT zefangshen automatedspectroscopicmodellingwithoptimisedconvolutionalneuralnetworks
AT raviscarrarossel automatedspectroscopicmodellingwithoptimisedconvolutionalneuralnetworks
_version_ 1724342232272601088