Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time

<p>Currently there is a huge amount of freely available hydrographic data, and it is increasingly important to have easy access to it and to be provided with as much information as possible. Argo is a global collection of around 4000 active autonomous hydrographic profilers. Argo data go thro...

Full description

Bibliographic Details
Main Authors: E. Romero, L. Tenorio-Fernandez, I. Castro, M. Castro
Format: Article
Language:English
Published: Copernicus Publications 2021-09-01
Series:Ocean Science
Online Access:https://os.copernicus.org/articles/17/1273/2021/os-17-1273-2021.pdf
id doaj-48c4872e684043d994b6b98c15aaa6d9
record_format Article
spelling doaj-48c4872e684043d994b6b98c15aaa6d92021-09-17T07:17:16ZengCopernicus PublicationsOcean Science1812-07841812-07922021-09-01171273128410.5194/os-17-1273-2021Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less timeE. Romero0L. Tenorio-Fernandez1I. Castro2M. Castro3Tecnológico Nacional de México/Instituto Tecnológico de La Paz, La Paz, MéxicoCONACyT-Instituto Politécnico Nacional-Centro Interdisciplinario de Ciencias Marinas, La Paz, MéxicoTecnológico Nacional de México/Instituto Tecnológico de La Paz, La Paz, MéxicoTecnológico Nacional de México/Instituto Tecnológico de La Paz, La Paz, México<p>Currently there is a huge amount of freely available hydrographic data, and it is increasingly important to have easy access to it and to be provided with as much information as possible. Argo is a global collection of around 4000 active autonomous hydrographic profilers. Argo data go through two quality processes, real time and delayed mode. This work shows a methodology to filter profiles within a given polygon using the odd–even algorithm; this allows analysis of a study area, regardless of size, shape or location. The aim is to offer two filtering methods and to discard only the real-time quality control data that present salinity drifts. This takes advantage of the largest possible amount of valid data within a given polygon. In the study area selected as an example, it was possible to recover around 80 % in the case of the first filter that uses cluster analysis and 30 % in the case of the second, which discards profilers with salinity drifts, of the total real-time quality control data that are usually discarded by the users due to problems such as salinity drifts. This allows users to use any of the filters or a combination of both to have a greater amount of data within the study area of their interest in a matter of minutes, rather than waiting for the delayed-mode quality control that takes up to 12 months to be completed. This methodology has been tested for its replicability in five selected areas around the world and has obtained good results.</p>https://os.copernicus.org/articles/17/1273/2021/os-17-1273-2021.pdf
collection DOAJ
language English
format Article
sources DOAJ
author E. Romero
L. Tenorio-Fernandez
I. Castro
M. Castro
spellingShingle E. Romero
L. Tenorio-Fernandez
I. Castro
M. Castro
Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
Ocean Science
author_facet E. Romero
L. Tenorio-Fernandez
I. Castro
M. Castro
author_sort E. Romero
title Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
title_short Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
title_full Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
title_fullStr Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
title_full_unstemmed Filtering method based on cluster analysis to avoid salinity drifts and recover Argo data in less time
title_sort filtering method based on cluster analysis to avoid salinity drifts and recover argo data in less time
publisher Copernicus Publications
series Ocean Science
issn 1812-0784
1812-0792
publishDate 2021-09-01
description <p>Currently there is a huge amount of freely available hydrographic data, and it is increasingly important to have easy access to it and to be provided with as much information as possible. Argo is a global collection of around 4000 active autonomous hydrographic profilers. Argo data go through two quality processes, real time and delayed mode. This work shows a methodology to filter profiles within a given polygon using the odd–even algorithm; this allows analysis of a study area, regardless of size, shape or location. The aim is to offer two filtering methods and to discard only the real-time quality control data that present salinity drifts. This takes advantage of the largest possible amount of valid data within a given polygon. In the study area selected as an example, it was possible to recover around 80 % in the case of the first filter that uses cluster analysis and 30 % in the case of the second, which discards profilers with salinity drifts, of the total real-time quality control data that are usually discarded by the users due to problems such as salinity drifts. This allows users to use any of the filters or a combination of both to have a greater amount of data within the study area of their interest in a matter of minutes, rather than waiting for the delayed-mode quality control that takes up to 12 months to be completed. This methodology has been tested for its replicability in five selected areas around the world and has obtained good results.</p>
url https://os.copernicus.org/articles/17/1273/2021/os-17-1273-2021.pdf
work_keys_str_mv AT eromero filteringmethodbasedonclusteranalysistoavoidsalinitydriftsandrecoverargodatainlesstime
AT ltenoriofernandez filteringmethodbasedonclusteranalysistoavoidsalinitydriftsandrecoverargodatainlesstime
AT icastro filteringmethodbasedonclusteranalysistoavoidsalinitydriftsandrecoverargodatainlesstime
AT mcastro filteringmethodbasedonclusteranalysistoavoidsalinitydriftsandrecoverargodatainlesstime
_version_ 1717377474819522560