Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation

Introduction: Hydrologic homogeneous group identification is considered both fundamental and applied research in hydrology. Clustering methods are among conventional methods to assess the hydrological homogeneous regions. Recently, Self Organizing feature Map (SOM) method has been applied in some st...

Full description

Bibliographic Details
Main Authors:	F. Farsadnia, B. Ghahreman
Format:	Article
Language:	fas
Published:	Ferdowsi University of Mashhad 2017-01-01
Series:	مجله آب و خاک
Subjects:	Principal Component Analysis Regional flood frequency analysis Hybrid clustering linear moments
Online Access:	http://jsw.um.ac.ir/index.php/jsw/article/view/34143

id	doaj-c58b6a068e8f4f59968e877d4a89078b
record_format	Article
spelling	doaj-c58b6a068e8f4f59968e877d4a89078b2021-06-02T09:12:48ZfasFerdowsi University of Mashhadمجله آب و خاک2008-47572423-396X2017-01-012951207121810.22067/jsw.v29i5.3414310380Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood EstimationF. Farsadnia0B. Ghahreman1Ferdowsi University of MashhadFerdowsi University of MashhadIntroduction: Hydrologic homogeneous group identification is considered both fundamental and applied research in hydrology. Clustering methods are among conventional methods to assess the hydrological homogeneous regions. Recently, Self Organizing feature Map (SOM) method has been applied in some studies. However, the main problem of this method is the interpretation on the output map of this approach. Therefore, SOM is used as input to other clustering algorithms. The aim of this study is to apply a two-level Self-Organizing feature map and Ward hierarchical clustering method to determine the hydrologic homogenous regions in North and Razavi Khorasan provinces. Materials and Methods: SOM approximates the probability density function of input data through an unsupervised learning algorithm, and is not only an effective method for clustering, but also for the visualization and abstraction of complex data. The algorithm has properties of neighborhood preservation and local resolution of the input space proportional to the data distribution. A SOM consists of two layers: an input layer formed by a set of nodes and an output layer formed by nodes arranged in a two-dimensional grid. In this study we used SOM for visualization and clustering of watersheds based on physiographical data in North and Razavi Khorasan provinces. In the next step, SOM weight vectors were used to classify the units by Ward’s Agglomerative hierarchical clustering (Ward) methods. Ward’s algorithm is a frequently used technique for regionalization studies in hydrology and climatology. It is based on the assumption that if two clusters are merged, the resulting loss of information, or change in the value of objective function, will depend only on the relationship between the two merged clusters and not on the relationships with any other clusters. After the formation of clusters by SOM and Ward, the most frequently applied tests of regional homogeneity based on the theory of L-moments are used to compare and modify the clusters which are formed by clustering algorithms and find the best clustering method to achieve hydrologically homogeneous regions. Two statistical measures are used to form a homogeneous region, (i) discordancy measure and (ii) heterogeneity measure. The discordancy measure, Di, is used to find out unusual sites from the pooling group (i.e., the sites whose at-site sample L moments are markedly different from the other sites). Generally, any site with Di>3 is considered as discordant. The homogeneity of the region is evaluated using homogeneity measures which are based on sample L-moments (LCv, LCs and LCk), respectively. The homogeneity measures are based on the simulation of 500 homogeneous regions with population parameters equal to the regional average sample l-moment ratios. The value of the H-statistic indicates that the region under consideration is acceptably homogeneous when Hhttp://jsw.um.ac.ir/index.php/jsw/article/view/34143Principal Component AnalysisRegional flood frequency analysisHybrid clusteringlinear moments
collection	DOAJ
language	fas
format	Article
sources	DOAJ
author	F. Farsadnia B. Ghahreman
spellingShingle	F. Farsadnia B. Ghahreman Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation مجله آب و خاک Principal Component Analysis Regional flood frequency analysis Hybrid clustering linear moments
author_facet	F. Farsadnia B. Ghahreman
author_sort	F. Farsadnia
title	Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation
title_short	Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation
title_full	Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation
title_fullStr	Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation
title_full_unstemmed	Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation
title_sort	using hierarchical clustering in order to increase efficiency of self-organizing feature map to identify hydrological homogeneous regions for flood estimation
publisher	Ferdowsi University of Mashhad
series	مجله آب و خاک
issn	2008-4757 2423-396X
publishDate	2017-01-01
description	Introduction: Hydrologic homogeneous group identification is considered both fundamental and applied research in hydrology. Clustering methods are among conventional methods to assess the hydrological homogeneous regions. Recently, Self Organizing feature Map (SOM) method has been applied in some studies. However, the main problem of this method is the interpretation on the output map of this approach. Therefore, SOM is used as input to other clustering algorithms. The aim of this study is to apply a two-level Self-Organizing feature map and Ward hierarchical clustering method to determine the hydrologic homogenous regions in North and Razavi Khorasan provinces. Materials and Methods: SOM approximates the probability density function of input data through an unsupervised learning algorithm, and is not only an effective method for clustering, but also for the visualization and abstraction of complex data. The algorithm has properties of neighborhood preservation and local resolution of the input space proportional to the data distribution. A SOM consists of two layers: an input layer formed by a set of nodes and an output layer formed by nodes arranged in a two-dimensional grid. In this study we used SOM for visualization and clustering of watersheds based on physiographical data in North and Razavi Khorasan provinces. In the next step, SOM weight vectors were used to classify the units by Ward’s Agglomerative hierarchical clustering (Ward) methods. Ward’s algorithm is a frequently used technique for regionalization studies in hydrology and climatology. It is based on the assumption that if two clusters are merged, the resulting loss of information, or change in the value of objective function, will depend only on the relationship between the two merged clusters and not on the relationships with any other clusters. After the formation of clusters by SOM and Ward, the most frequently applied tests of regional homogeneity based on the theory of L-moments are used to compare and modify the clusters which are formed by clustering algorithms and find the best clustering method to achieve hydrologically homogeneous regions. Two statistical measures are used to form a homogeneous region, (i) discordancy measure and (ii) heterogeneity measure. The discordancy measure, Di, is used to find out unusual sites from the pooling group (i.e., the sites whose at-site sample L moments are markedly different from the other sites). Generally, any site with Di>3 is considered as discordant. The homogeneity of the region is evaluated using homogeneity measures which are based on sample L-moments (LCv, LCs and LCk), respectively. The homogeneity measures are based on the simulation of 500 homogeneous regions with population parameters equal to the regional average sample l-moment ratios. The value of the H-statistic indicates that the region under consideration is acceptably homogeneous when H
topic	Principal Component Analysis Regional flood frequency analysis Hybrid clustering linear moments
url	http://jsw.um.ac.ir/index.php/jsw/article/view/34143
work_keys_str_mv	AT ffarsadnia usinghierarchicalclusteringinordertoincreaseefficiencyofselforganizingfeaturemaptoidentifyhydrologicalhomogeneousregionsforfloodestimation AT bghahreman usinghierarchicalclusteringinordertoincreaseefficiencyofselforganizingfeaturemaptoidentifyhydrologicalhomogeneousregionsforfloodestimation
_version_	1721406061128187904

Using Hierarchical Clustering in Order to Increase Efficiency of Self-Organizing Feature Map to Identify Hydrological Homogeneous Regions for Flood Estimation

Similar Items