Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models

We propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic ba...

Full description

Bibliographic Details
Main Authors: Daniel Buscombe, Paul E. Grams
Format: Article
Language:English
Published: MDPI AG 2018-10-01
Series:Geosciences
Subjects:
Online Access:https://www.mdpi.com/2076-3263/8/11/395
id doaj-d763297b71c849519e4e0bd85b0473e2
record_format Article
spelling doaj-d763297b71c849519e4e0bd85b0473e22020-11-25T01:25:57ZengMDPI AGGeosciences2076-32632018-10-0181139510.3390/geosciences8110395geosciences8110395Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative ModelsDaniel Buscombe0Paul E. Grams1School of Earth & Sustainability, Northern Arizona University, Flagstaff, AZ 86011, USAU.S. Geological Survey, Southwest Biological Science Center, Grand Canyon Monitoring & Research Center, Flagstaff, AZ 86001, USAWe propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic backscatter from heterogeneous seafloors in Patricia Bay, British Columbia, and Bedford Basin, Nova Scotia. Unlike previously proposed discriminative algorithms, the CRF model considers both the relative backscatter magnitudes of different substrates and their relative proximities. The model therefore combines the statistical flexibility of a machine learning algorithm with an inherently spatial treatment of the substrate. The CRF model predicts substrates such that nearby locations with similar backscattering characteristics are likely to be in the same substrate class. The degree of allowable proximity and backscatter similarity are controlled by parameters that are learned from the data. CRF model results were evaluated against a popular generative model known as a Gaussian Mixture model (GMM) that doesn’t include spatial dependencies, only covariance between substrate backscattering response over different frequencies. Both models are used in conjunction with sparse bed observations/samples in a supervised classification. A detailed accuracy assessment, including a leave-one-out cross-validation analysis, was performed using both models. Using multispectral backscatter, the GMM model trained on 50% of the bed observations resulted in a 75% and 89% average accuracies in Patricia Bay and Bedford Basin, respectively. The same metrics for the CRF model were 78% and 95%. Further, the CRF model resulted in a 91% mean cross-validation accuracy across four substrate classes at Patricia Bay, and a 99.5% mean accuracy across three substrate classes at Bedford Basin, which suggest that the CRF model generalizes extremely well to new data. This analysis also showed that the CRF model was much less sensitive to the specific number and locations of bed observations than the generative model, owing to its ability to incorporate spatial autocorrelation in substrates. The CRF therefore may prove to be a powerful ‘spatially aware’ alternative to other discriminative classifiers.https://www.mdpi.com/2076-3263/8/11/395multibeam echosounderbackscattermulti-frequencymachine-learning
collection DOAJ
language English
format Article
sources DOAJ
author Daniel Buscombe
Paul E. Grams
spellingShingle Daniel Buscombe
Paul E. Grams
Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
Geosciences
multibeam echosounder
backscatter
multi-frequency
machine-learning
author_facet Daniel Buscombe
Paul E. Grams
author_sort Daniel Buscombe
title Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
title_short Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
title_full Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
title_fullStr Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
title_full_unstemmed Probabilistic Substrate Classification with Multispectral Acoustic Backscatter: A Comparison of Discriminative and Generative Models
title_sort probabilistic substrate classification with multispectral acoustic backscatter: a comparison of discriminative and generative models
publisher MDPI AG
series Geosciences
issn 2076-3263
publishDate 2018-10-01
description We propose a probabilistic graphical model for discriminative substrate characterization, to support geological and biological habitat mapping in aquatic environments. The model, called a fully-connected conditional random field (CRF), is demonstrated using multispectral and monospectral acoustic backscatter from heterogeneous seafloors in Patricia Bay, British Columbia, and Bedford Basin, Nova Scotia. Unlike previously proposed discriminative algorithms, the CRF model considers both the relative backscatter magnitudes of different substrates and their relative proximities. The model therefore combines the statistical flexibility of a machine learning algorithm with an inherently spatial treatment of the substrate. The CRF model predicts substrates such that nearby locations with similar backscattering characteristics are likely to be in the same substrate class. The degree of allowable proximity and backscatter similarity are controlled by parameters that are learned from the data. CRF model results were evaluated against a popular generative model known as a Gaussian Mixture model (GMM) that doesn’t include spatial dependencies, only covariance between substrate backscattering response over different frequencies. Both models are used in conjunction with sparse bed observations/samples in a supervised classification. A detailed accuracy assessment, including a leave-one-out cross-validation analysis, was performed using both models. Using multispectral backscatter, the GMM model trained on 50% of the bed observations resulted in a 75% and 89% average accuracies in Patricia Bay and Bedford Basin, respectively. The same metrics for the CRF model were 78% and 95%. Further, the CRF model resulted in a 91% mean cross-validation accuracy across four substrate classes at Patricia Bay, and a 99.5% mean accuracy across three substrate classes at Bedford Basin, which suggest that the CRF model generalizes extremely well to new data. This analysis also showed that the CRF model was much less sensitive to the specific number and locations of bed observations than the generative model, owing to its ability to incorporate spatial autocorrelation in substrates. The CRF therefore may prove to be a powerful ‘spatially aware’ alternative to other discriminative classifiers.
topic multibeam echosounder
backscatter
multi-frequency
machine-learning
url https://www.mdpi.com/2076-3263/8/11/395
work_keys_str_mv AT danielbuscombe probabilisticsubstrateclassificationwithmultispectralacousticbackscatteracomparisonofdiscriminativeandgenerativemodels
AT paulegrams probabilisticsubstrateclassificationwithmultispectralacousticbackscatteracomparisonofdiscriminativeandgenerativemodels
_version_ 1725111611511799808