Network-Based and Binless Frequency Analyses.

We introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the n...

Full description

Bibliographic Details
Main Authors: Sybil Derrible, Nasir Ahmad
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4631440?pdf=render
id doaj-33f846abaf5e4fbcb7fee1ad1b0a9e70
record_format Article
spelling doaj-33f846abaf5e4fbcb7fee1ad1b0a9e702020-11-25T02:14:07ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011011e014210810.1371/journal.pone.0142108Network-Based and Binless Frequency Analyses.Sybil DerribleNasir AhmadWe introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the number of values within that range, which allows us to compare every single value of a data set with one another. In essence, the methodology is identical to the construction of a network, where two values are connected if they lie within a given a range (±ζ). The value with the highest degree (i.e., most connections) is therefore assimilated to the mode of the distribution. To select an optimal range, we look at the stability of the proportion of nodes in the largest cluster. The methodology is validated by sampling 12 typical distributions, and it is applied to a number of real-world data sets with both spatial and temporal components. The methodology can be applied to any data set and provides a robust means to uncover meaningful patterns and trends. A free python script and a tutorial are also made available to facilitate the application of the method.http://europepmc.org/articles/PMC4631440?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Sybil Derrible
Nasir Ahmad
spellingShingle Sybil Derrible
Nasir Ahmad
Network-Based and Binless Frequency Analyses.
PLoS ONE
author_facet Sybil Derrible
Nasir Ahmad
author_sort Sybil Derrible
title Network-Based and Binless Frequency Analyses.
title_short Network-Based and Binless Frequency Analyses.
title_full Network-Based and Binless Frequency Analyses.
title_fullStr Network-Based and Binless Frequency Analyses.
title_full_unstemmed Network-Based and Binless Frequency Analyses.
title_sort network-based and binless frequency analyses.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description We introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the number of values within that range, which allows us to compare every single value of a data set with one another. In essence, the methodology is identical to the construction of a network, where two values are connected if they lie within a given a range (±ζ). The value with the highest degree (i.e., most connections) is therefore assimilated to the mode of the distribution. To select an optimal range, we look at the stability of the proportion of nodes in the largest cluster. The methodology is validated by sampling 12 typical distributions, and it is applied to a number of real-world data sets with both spatial and temporal components. The methodology can be applied to any data set and provides a robust means to uncover meaningful patterns and trends. A free python script and a tutorial are also made available to facilitate the application of the method.
url http://europepmc.org/articles/PMC4631440?pdf=render
work_keys_str_mv AT sybilderrible networkbasedandbinlessfrequencyanalyses
AT nasirahmad networkbasedandbinlessfrequencyanalyses
_version_ 1724901828482564096