Network-Based and Binless Frequency Analyses.
We introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the n...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2015-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC4631440?pdf=render |
id |
doaj-33f846abaf5e4fbcb7fee1ad1b0a9e70 |
---|---|
record_format |
Article |
spelling |
doaj-33f846abaf5e4fbcb7fee1ad1b0a9e702020-11-25T02:14:07ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011011e014210810.1371/journal.pone.0142108Network-Based and Binless Frequency Analyses.Sybil DerribleNasir AhmadWe introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the number of values within that range, which allows us to compare every single value of a data set with one another. In essence, the methodology is identical to the construction of a network, where two values are connected if they lie within a given a range (±ζ). The value with the highest degree (i.e., most connections) is therefore assimilated to the mode of the distribution. To select an optimal range, we look at the stability of the proportion of nodes in the largest cluster. The methodology is validated by sampling 12 typical distributions, and it is applied to a number of real-world data sets with both spatial and temporal components. The methodology can be applied to any data set and provides a robust means to uncover meaningful patterns and trends. A free python script and a tutorial are also made available to facilitate the application of the method.http://europepmc.org/articles/PMC4631440?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sybil Derrible Nasir Ahmad |
spellingShingle |
Sybil Derrible Nasir Ahmad Network-Based and Binless Frequency Analyses. PLoS ONE |
author_facet |
Sybil Derrible Nasir Ahmad |
author_sort |
Sybil Derrible |
title |
Network-Based and Binless Frequency Analyses. |
title_short |
Network-Based and Binless Frequency Analyses. |
title_full |
Network-Based and Binless Frequency Analyses. |
title_fullStr |
Network-Based and Binless Frequency Analyses. |
title_full_unstemmed |
Network-Based and Binless Frequency Analyses. |
title_sort |
network-based and binless frequency analyses. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2015-01-01 |
description |
We introduce and develop a new network-based and binless methodology to perform frequency analyses and produce histograms. In contrast with traditional frequency analysis techniques that use fixed intervals to bin values, we place a range ±ζ around each individual value in a data set and count the number of values within that range, which allows us to compare every single value of a data set with one another. In essence, the methodology is identical to the construction of a network, where two values are connected if they lie within a given a range (±ζ). The value with the highest degree (i.e., most connections) is therefore assimilated to the mode of the distribution. To select an optimal range, we look at the stability of the proportion of nodes in the largest cluster. The methodology is validated by sampling 12 typical distributions, and it is applied to a number of real-world data sets with both spatial and temporal components. The methodology can be applied to any data set and provides a robust means to uncover meaningful patterns and trends. A free python script and a tutorial are also made available to facilitate the application of the method. |
url |
http://europepmc.org/articles/PMC4631440?pdf=render |
work_keys_str_mv |
AT sybilderrible networkbasedandbinlessfrequencyanalyses AT nasirahmad networkbasedandbinlessfrequencyanalyses |
_version_ |
1724901828482564096 |