Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
We propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic ba...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2018-10-01
|
Series: | Geosciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3263/8/11/395 |
id |
doaj-d763297b71c849519e4e0bd85b0473e2 |
---|---|
record_format |
Article |
spelling |
doaj-d763297b71c849519e4e0bd85b0473e22020-11-25T01:25:57ZengMDPI AGGeosciences2076-32632018-10-0181139510.3390/geosciences8110395geosciences8110395Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative ModelsDaniel Buscombe0Paul E. Grams1School of Earth & Sustainability, Northern Arizona University, Flagstaff, AZ 86011, USAU.S. Geological Survey, Southwest Biological Science Center, Grand Canyon Monitoring & Research Center, Flagstaff, AZ 86001, USAWe propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic backscatter from heterogeneous seafloors in Patricia Bay, British Columbia, and Bedford Basin, Nova Scotia. Unlike previously proposed discriminative algorithms, the CRF model considers both the relative backscatter magnitudes of different substrates and their relative proximities. The model therefore combines the statistical flexibility of a machine learning algorithm with an inherently spatial treatment of the substrate. The CRF model predicts substrates such that nearby locations with similar backscattering characteristics are likely to be in the same substrate class. The degree of allowable proximity and backscatter similarity are controlled by parameters that are learned from the data. CRF model results were evaluated against a popular generative model known as a Gaussian Mixture model (GMM) that doesn’t include spatial dependencies, only covariance between substrate backscattering response over different frequencies. Both models are used in conjunction with sparse bed observations/samples in a supervised classification. A detailed accuracy assessment, including a leave-one-out cross-validation analysis, was performed using both models. Using multispectral backscatter, the GMM model trained on 50% of the bed observations resulted in a 75% and 89% average accuracies in Patricia Bay and Bedford Basin, respectively. The same metrics for the CRF model were 78% and 95%. Further, the CRF model resulted in a 91% mean cross-validation accuracy across four substrate classes at Patricia Bay, and a 99.5% mean accuracy across three substrate classes at Bedford Basin, which suggest that the CRF model generalizes extremely well to new data. This analysis also showed that the CRF model was much less sensitive to the specific number and locations of bed observations than the generative model, owing to its ability to incorporate spatial autocorrelation in substrates. The CRF therefore may prove to be a powerful ‘spatially aware’ alternative to other discriminative classifiers.https://www.mdpi.com/2076-3263/8/11/395multibeam echosounderbackscattermulti-frequencymachine-learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Daniel Buscombe Paul E. Grams |
spellingShingle |
Daniel Buscombe Paul E. Grams Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models Geosciences multibeam echosounder backscatter multi-frequency machine-learning |
author_facet |
Daniel Buscombe Paul E. Grams |
author_sort |
Daniel Buscombe |
title |
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models |
title_short |
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models |
title_full |
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models |
title_fullStr |
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models |
title_full_unstemmed |
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models |
title_sort |
probabilistic substrate classification with multispectral acoustic backscatter: a comparison of discriminative and generative models |
publisher |
MDPI AG |
series |
Geosciences |
issn |
2076-3263 |
publishDate |
2018-10-01 |
description |
We propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic backscatter from heterogeneous seafloors in Patricia Bay, British Columbia, and Bedford Basin, Nova Scotia. Unlike previously proposed discriminative algorithms, the CRF model considers both the relative backscatter magnitudes of different substrates and their relative proximities. The model therefore combines the statistical flexibility of a machine learning algorithm with an inherently spatial treatment of the substrate. The CRF model predicts substrates such that nearby locations with similar backscattering characteristics are likely to be in the same substrate class. The degree of allowable proximity and backscatter similarity are controlled by parameters that are learned from the data. CRF model results were evaluated against a popular generative model known as a Gaussian Mixture model (GMM) that doesn’t include spatial dependencies, only covariance between substrate backscattering response over different frequencies. Both models are used in conjunction with sparse bed observations/samples in a supervised classification. A detailed accuracy assessment, including a leave-one-out cross-validation analysis, was performed using both models. Using multispectral backscatter, the GMM model trained on 50% of the bed observations resulted in a 75% and 89% average accuracies in Patricia Bay and Bedford Basin, respectively. The same metrics for the CRF model were 78% and 95%. Further, the CRF model resulted in a 91% mean cross-validation accuracy across four substrate classes at Patricia Bay, and a 99.5% mean accuracy across three substrate classes at Bedford Basin, which suggest that the CRF model generalizes extremely well to new data. This analysis also showed that the CRF model was much less sensitive to the specific number and locations of bed observations than the generative model, owing to its ability to incorporate spatial autocorrelation in substrates. The CRF therefore may prove to be a powerful ‘spatially aware’ alternative to other discriminative classifiers. |
topic |
multibeam echosounder backscatter multi-frequency machine-learning |
url |
https://www.mdpi.com/2076-3263/8/11/395 |
work_keys_str_mv |
AT danielbuscombe probabilisticsubstrateclassificationwithmultispectralacousticbackscatteracomparisonofdiscriminativeandgenerativemodels AT paulegrams probabilisticsubstrateclassificationwithmultispectralacousticbackscatteracomparisonofdiscriminativeandgenerativemodels |
_version_ |
1725111611511799808 |