Annotation-free learning of plankton for classification and anomaly detection

Abstract The acquisition of increasingly large plankton digital image datasets requires automatic methods of recognition and classification. As data size and collection speed increases, manual annotation and database representation are often bottlenecks for utilization of machine learning algorithms...

Full description

Bibliographic Details
Main Authors: Vito P. Pastore, Thomas G. Zimmerman, Sujoy K. Biswas, Simone Bianco
Format: Article
Language:English
Published: Nature Publishing Group 2020-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-68662-3
id doaj-165c621872b74325a17ffed8475d41a3
record_format Article
spelling doaj-165c621872b74325a17ffed8475d41a32021-07-25T11:18:05ZengNature Publishing GroupScientific Reports2045-23222020-07-0110111510.1038/s41598-020-68662-3Annotation-free learning of plankton for classification and anomaly detectionVito P. Pastore0Thomas G. Zimmerman1Sujoy K. Biswas2Simone Bianco3Industrial and Applied Genomics, AI and Cognitive Software, IBM Research – AlmadenIndustrial and Applied Genomics, AI and Cognitive Software, IBM Research – AlmadenIndustrial and Applied Genomics, AI and Cognitive Software, IBM Research – AlmadenIndustrial and Applied Genomics, AI and Cognitive Software, IBM Research – AlmadenAbstract The acquisition of increasingly large plankton digital image datasets requires automatic methods of recognition and classification. As data size and collection speed increases, manual annotation and database representation are often bottlenecks for utilization of machine learning algorithms for taxonomic classification of plankton species in field studies. In this paper we present a novel set of algorithms to perform accurate detection and classification of plankton species with minimal supervision. Our algorithms approach the performance of existing supervised machine learning algorithms when tested on a plankton dataset generated from a custom-built lensless digital device. Similar results are obtained on a larger image dataset obtained from the Woods Hole Oceanographic Institution. Additionally, we introduce a new algorithm to perform anomaly detection on unclassified samples. Here an anomaly is defined as a significant deviation from the established classification. Our algorithms are designed to provide a new way to monitor the environment with a class of rapid online intelligent detectors.https://doi.org/10.1038/s41598-020-68662-3
collection DOAJ
language English
format Article
sources DOAJ
author Vito P. Pastore
Thomas G. Zimmerman
Sujoy K. Biswas
Simone Bianco
spellingShingle Vito P. Pastore
Thomas G. Zimmerman
Sujoy K. Biswas
Simone Bianco
Annotation-free learning of plankton for classification and anomaly detection
Scientific Reports
author_facet Vito P. Pastore
Thomas G. Zimmerman
Sujoy K. Biswas
Simone Bianco
author_sort Vito P. Pastore
title Annotation-free learning of plankton for classification and anomaly detection
title_short Annotation-free learning of plankton for classification and anomaly detection
title_full Annotation-free learning of plankton for classification and anomaly detection
title_fullStr Annotation-free learning of plankton for classification and anomaly detection
title_full_unstemmed Annotation-free learning of plankton for classification and anomaly detection
title_sort annotation-free learning of plankton for classification and anomaly detection
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2020-07-01
description Abstract The acquisition of increasingly large plankton digital image datasets requires automatic methods of recognition and classification. As data size and collection speed increases, manual annotation and database representation are often bottlenecks for utilization of machine learning algorithms for taxonomic classification of plankton species in field studies. In this paper we present a novel set of algorithms to perform accurate detection and classification of plankton species with minimal supervision. Our algorithms approach the performance of existing supervised machine learning algorithms when tested on a plankton dataset generated from a custom-built lensless digital device. Similar results are obtained on a larger image dataset obtained from the Woods Hole Oceanographic Institution. Additionally, we introduce a new algorithm to perform anomaly detection on unclassified samples. Here an anomaly is defined as a significant deviation from the established classification. Our algorithms are designed to provide a new way to monitor the environment with a class of rapid online intelligent detectors.
url https://doi.org/10.1038/s41598-020-68662-3
work_keys_str_mv AT vitoppastore annotationfreelearningofplanktonforclassificationandanomalydetection
AT thomasgzimmerman annotationfreelearningofplanktonforclassificationandanomalydetection
AT sujoykbiswas annotationfreelearningofplanktonforclassificationandanomalydetection
AT simonebianco annotationfreelearningofplanktonforclassificationandanomalydetection
_version_ 1721283266897510400