The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claud...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
University of North Texas
2019
|
Subjects: | |
Online Access: | https://digital.library.unt.edu/ark:/67531/metadc1538775/ |
id |
ndltd-unt.edu-info-ark-67531-metadc1538775 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-unt.edu-info-ark-67531-metadc15387752021-10-03T05:23:41Z The Fabric of Entropy: A Discussion on the Meaning of Fractional Information Zhang, Yuan information theory entropy data visualization Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claude Shannon's information measure as an analytical tool for behavioral studies. Such matrix emphasizes the role of relative characteristics of individual level data across different collections. The real challenge in the big data era is never the size of the dataset, but how data lead scientists to individuals rather than arbitrarily divided statistical groups. This proposed matrix, when combining with other statistical measures, provides a new and easy-to-do method for identifying pattern in a well-defined system because it is built on the idea that uneven probability distributions lead to decrease in system entropy. Although the matrix is not superior to classical correlation techniques, it allows an interpretation not available with traditional standard statistics. Finally, this matrix connects heterogeneous datasets because it is a frequency-based method and it works on the modes of data rather than the means of values. It also visualizes clustering in data although this type of clustering is not measured by the squared Euclidean distance of the numerical attributes. University of North Texas O’Connor, Brian C. Chang, Hsia-Ching Anderson, Richard L., 1970- 2019-08 Thesis or Dissertation vii, 88 pages Text local-cont-no: submission_1615 https://digital.library.unt.edu/ark:/67531/metadc1538775/ ark: ark:/67531/metadc1538775 English Public Zhang, Yuan Copyright Copyright is held by the author, unless otherwise noted. All rights Reserved. |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
information theory entropy data visualization |
spellingShingle |
information theory entropy data visualization Zhang, Yuan The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
description |
Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claude Shannon's information measure as an analytical tool for behavioral studies. Such matrix emphasizes the role of relative characteristics of individual level data across different collections. The real challenge in the big data era is never the size of the dataset, but how data lead scientists to individuals rather than arbitrarily divided statistical groups. This proposed matrix, when combining with other statistical measures, provides a new and easy-to-do method for identifying pattern in a well-defined system because it is built on the idea that uneven probability distributions lead to decrease in system entropy. Although the matrix is not superior to classical correlation techniques, it allows an interpretation not available with traditional standard statistics. Finally, this matrix connects heterogeneous datasets because it is a frequency-based method and it works on the modes of data rather than the means of values. It also visualizes clustering in data although this type of clustering is not measured by the squared Euclidean distance of the numerical attributes. |
author2 |
O’Connor, Brian C. |
author_facet |
O’Connor, Brian C. Zhang, Yuan |
author |
Zhang, Yuan |
author_sort |
Zhang, Yuan |
title |
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
title_short |
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
title_full |
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
title_fullStr |
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
title_full_unstemmed |
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information |
title_sort |
fabric of entropy: a discussion on the meaning of fractional information |
publisher |
University of North Texas |
publishDate |
2019 |
url |
https://digital.library.unt.edu/ark:/67531/metadc1538775/ |
work_keys_str_mv |
AT zhangyuan thefabricofentropyadiscussiononthemeaningoffractionalinformation AT zhangyuan fabricofentropyadiscussiononthemeaningoffractionalinformation |
_version_ |
1719487126379167744 |