Characterizations of pyramids and their generalizations
Cluster Analysis is a collection of techniques whose goals are to try and suggest possible internal structures of a data set. It is a subfield of exploratory data analysis in which the goal is to find a starting point to investigate some collection of objects. A clustering technique takes a finite d...
Main Author: | |
---|---|
Language: | ENG |
Published: |
ScholarWorks@UMass Amherst
1998
|
Subjects: | |
Online Access: | https://scholarworks.umass.edu/dissertations/AAI9841841 |
id |
ndltd-UMASS-oai-scholarworks.umass.edu-dissertations-3031 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UMASS-oai-scholarworks.umass.edu-dissertations-30312020-12-02T14:29:28Z Characterizations of pyramids and their generalizations Boucher, Catherine Dornback Cluster Analysis is a collection of techniques whose goals are to try and suggest possible internal structures of a data set. It is a subfield of exploratory data analysis in which the goal is to find a starting point to investigate some collection of objects. A clustering technique takes a finite data set E with finitely many attributes or a collection of measurements called a dissimilarity coefficient and produces a single classification or a nested sequence of classifications of E. When one forms a nested sequence of partitions on the given set it is easily visualized as a hierarchy. Pyramids, developed by Diday (12), allow visual representation of output that has some overlap. It is a well known fact that weakly indexed pyramids are in one-to-one correspondence with definite Robinsonian dissimilarity coefficients. Pyramids allow some overlap between clusters. One drawback to pyramidal representations is the requirement that one must impose a linear order on the underlying set to be clustered. It will be shown that by examining a dissimilarity coefficient one is able to determine its compatible linear orders, if any, using the consecutive ones property. A generalization of pyramids, pseudo-pyramids, will be introduced. The concepts of weakly indexed and indexed pseudo-pyramids are constructed. Pyramids and their generalizations will be placed in the ordinal model developed by Janowitz (25). Characterizations of pyramids and their generalizations are given from set-theoretical, graph-theoretical, and lattice-theoretical viewpoints. In particular, a characterization of indexed pseudo-pyramids with respect to a collection of planar lattices will be introduced. Generalizations of dissimilarity coefficients called pseudo-dissimilarity coefficients will be given. A bijection between indexed (weakly indexed) pseudo-pyramids and strongly Robinsonian (Robinsonian) pseudo-dissimilarities is possible. This generalization removes the necessity of the minimal value on a dissimilarity being 0. Also, the output of a clustering technique using a pseudo-dissimilarity need not be reflexive at each level. In other words, it is not necessary to have all singleton subsets in the classifications. 1998-01-01T08:00:00Z text https://scholarworks.umass.edu/dissertations/AAI9841841 Doctoral Dissertations Available from Proquest ENG ScholarWorks@UMass Amherst Mathematics |
collection |
NDLTD |
language |
ENG |
sources |
NDLTD |
topic |
Mathematics |
spellingShingle |
Mathematics Boucher, Catherine Dornback Characterizations of pyramids and their generalizations |
description |
Cluster Analysis is a collection of techniques whose goals are to try and suggest possible internal structures of a data set. It is a subfield of exploratory data analysis in which the goal is to find a starting point to investigate some collection of objects. A clustering technique takes a finite data set E with finitely many attributes or a collection of measurements called a dissimilarity coefficient and produces a single classification or a nested sequence of classifications of E. When one forms a nested sequence of partitions on the given set it is easily visualized as a hierarchy. Pyramids, developed by Diday (12), allow visual representation of output that has some overlap. It is a well known fact that weakly indexed pyramids are in one-to-one correspondence with definite Robinsonian dissimilarity coefficients. Pyramids allow some overlap between clusters. One drawback to pyramidal representations is the requirement that one must impose a linear order on the underlying set to be clustered. It will be shown that by examining a dissimilarity coefficient one is able to determine its compatible linear orders, if any, using the consecutive ones property. A generalization of pyramids, pseudo-pyramids, will be introduced. The concepts of weakly indexed and indexed pseudo-pyramids are constructed. Pyramids and their generalizations will be placed in the ordinal model developed by Janowitz (25). Characterizations of pyramids and their generalizations are given from set-theoretical, graph-theoretical, and lattice-theoretical viewpoints. In particular, a characterization of indexed pseudo-pyramids with respect to a collection of planar lattices will be introduced. Generalizations of dissimilarity coefficients called pseudo-dissimilarity coefficients will be given. A bijection between indexed (weakly indexed) pseudo-pyramids and strongly Robinsonian (Robinsonian) pseudo-dissimilarities is possible. This generalization removes the necessity of the minimal value on a dissimilarity being 0. Also, the output of a clustering technique using a pseudo-dissimilarity need not be reflexive at each level. In other words, it is not necessary to have all singleton subsets in the classifications. |
author |
Boucher, Catherine Dornback |
author_facet |
Boucher, Catherine Dornback |
author_sort |
Boucher, Catherine Dornback |
title |
Characterizations of pyramids and their generalizations |
title_short |
Characterizations of pyramids and their generalizations |
title_full |
Characterizations of pyramids and their generalizations |
title_fullStr |
Characterizations of pyramids and their generalizations |
title_full_unstemmed |
Characterizations of pyramids and their generalizations |
title_sort |
characterizations of pyramids and their generalizations |
publisher |
ScholarWorks@UMass Amherst |
publishDate |
1998 |
url |
https://scholarworks.umass.edu/dissertations/AAI9841841 |
work_keys_str_mv |
AT bouchercatherinedornback characterizationsofpyramidsandtheirgeneralizations |
_version_ |
1719363752598437888 |