SDMtune: An R package to tune and evaluate species distribution models

Abstract Balancing model complexity is a key challenge of modern computational ecology, particularly so since the spread of machine learning algorithms. Species distribution models are often implemented using a wide variety of machine learning algorithms that can be fine‐tuned to achieve the best mo...

Full description

Bibliographic Details
Main Authors: Sergio Vignali, Arnaud G. Barras, Raphaël Arlettaz, Veronika Braunisch
Format: Article
Language:English
Published: Wiley 2020-10-01
Series:Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1002/ece3.6786
id doaj-7b24fea5e8f4439791ce630ff4a6411c
record_format Article
spelling doaj-7b24fea5e8f4439791ce630ff4a6411c2021-04-02T19:24:15ZengWileyEcology and Evolution2045-77582020-10-011020114881150610.1002/ece3.6786SDMtune: An R package to tune and evaluate species distribution modelsSergio Vignali0Arnaud G. Barras1Raphaël Arlettaz2Veronika Braunisch3Division of Conservation Biology Institute of Ecology and Evolution University of Bern Bern SwitzerlandDivision of Conservation Biology Institute of Ecology and Evolution University of Bern Bern SwitzerlandDivision of Conservation Biology Institute of Ecology and Evolution University of Bern Bern SwitzerlandDivision of Conservation Biology Institute of Ecology and Evolution University of Bern Bern SwitzerlandAbstract Balancing model complexity is a key challenge of modern computational ecology, particularly so since the spread of machine learning algorithms. Species distribution models are often implemented using a wide variety of machine learning algorithms that can be fine‐tuned to achieve the best model prediction while avoiding overfitting. We have released SDMtune, a new R package that aims to facilitate training, tuning, and evaluation of species distribution models in a unified framework. The main innovations of this package are its functions to perform data‐driven variable selection, and a novel genetic algorithm to tune model hyperparameters. Real‐time and interactive charts are displayed during the execution of several functions to help users understand the effect of removing a variable or varying model hyperparameters on model performance. SDMtune supports three different metrics to evaluate model performance: the area under the receiver operating characteristic curve, the true skill statistic, and Akaike's information criterion corrected for small sample sizes. It implements four statistical methods: artificial neural networks, boosted regression trees, maximum entropy modeling, and random forest. Moreover, it includes functions to display the outputs and create a final report. SDMtune therefore represents a new, unified and user‐friendly framework for the still‐growing field of species distribution modeling.https://doi.org/10.1002/ece3.6786ecological niche modelfine‐tuninggenetic algorithmmachine learningmodel complexityvariable selection
collection DOAJ
language English
format Article
sources DOAJ
author Sergio Vignali
Arnaud G. Barras
Raphaël Arlettaz
Veronika Braunisch
spellingShingle Sergio Vignali
Arnaud G. Barras
Raphaël Arlettaz
Veronika Braunisch
SDMtune: An R package to tune and evaluate species distribution models
Ecology and Evolution
ecological niche model
fine‐tuning
genetic algorithm
machine learning
model complexity
variable selection
author_facet Sergio Vignali
Arnaud G. Barras
Raphaël Arlettaz
Veronika Braunisch
author_sort Sergio Vignali
title SDMtune: An R package to tune and evaluate species distribution models
title_short SDMtune: An R package to tune and evaluate species distribution models
title_full SDMtune: An R package to tune and evaluate species distribution models
title_fullStr SDMtune: An R package to tune and evaluate species distribution models
title_full_unstemmed SDMtune: An R package to tune and evaluate species distribution models
title_sort sdmtune: an r package to tune and evaluate species distribution models
publisher Wiley
series Ecology and Evolution
issn 2045-7758
publishDate 2020-10-01
description Abstract Balancing model complexity is a key challenge of modern computational ecology, particularly so since the spread of machine learning algorithms. Species distribution models are often implemented using a wide variety of machine learning algorithms that can be fine‐tuned to achieve the best model prediction while avoiding overfitting. We have released SDMtune, a new R package that aims to facilitate training, tuning, and evaluation of species distribution models in a unified framework. The main innovations of this package are its functions to perform data‐driven variable selection, and a novel genetic algorithm to tune model hyperparameters. Real‐time and interactive charts are displayed during the execution of several functions to help users understand the effect of removing a variable or varying model hyperparameters on model performance. SDMtune supports three different metrics to evaluate model performance: the area under the receiver operating characteristic curve, the true skill statistic, and Akaike's information criterion corrected for small sample sizes. It implements four statistical methods: artificial neural networks, boosted regression trees, maximum entropy modeling, and random forest. Moreover, it includes functions to display the outputs and create a final report. SDMtune therefore represents a new, unified and user‐friendly framework for the still‐growing field of species distribution modeling.
topic ecological niche model
fine‐tuning
genetic algorithm
machine learning
model complexity
variable selection
url https://doi.org/10.1002/ece3.6786
work_keys_str_mv AT sergiovignali sdmtuneanrpackagetotuneandevaluatespeciesdistributionmodels
AT arnaudgbarras sdmtuneanrpackagetotuneandevaluatespeciesdistributionmodels
AT raphaelarlettaz sdmtuneanrpackagetotuneandevaluatespeciesdistributionmodels
AT veronikabraunisch sdmtuneanrpackagetotuneandevaluatespeciesdistributionmodels
_version_ 1721548825149046784