The Fabric of Entropy: A Discussion on the Meaning of Fractional Information

Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claud...

Full description

Bibliographic Details
Main Author: Zhang, Yuan
Other Authors: O’Connor, Brian C.
Format: Others
Language:English
Published: University of North Texas 2019
Subjects:
Online Access:https://digital.library.unt.edu/ark:/67531/metadc1538775/
id ndltd-unt.edu-info-ark-67531-metadc1538775
record_format oai_dc
spelling ndltd-unt.edu-info-ark-67531-metadc15387752021-10-03T05:23:41Z The Fabric of Entropy: A Discussion on the Meaning of Fractional Information Zhang, Yuan information theory entropy data visualization Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claude Shannon's information measure as an analytical tool for behavioral studies. Such matrix emphasizes the role of relative characteristics of individual level data across different collections. The real challenge in the big data era is never the size of the dataset, but how data lead scientists to individuals rather than arbitrarily divided statistical groups. This proposed matrix, when combining with other statistical measures, provides a new and easy-to-do method for identifying pattern in a well-defined system because it is built on the idea that uneven probability distributions lead to decrease in system entropy. Although the matrix is not superior to classical correlation techniques, it allows an interpretation not available with traditional standard statistics. Finally, this matrix connects heterogeneous datasets because it is a frequency-based method and it works on the modes of data rather than the means of values. It also visualizes clustering in data although this type of clustering is not measured by the squared Euclidean distance of the numerical attributes. University of North Texas O’Connor, Brian C. Chang, Hsia-Ching Anderson, Richard L., 1970- 2019-08 Thesis or Dissertation vii, 88 pages Text local-cont-no: submission_1615 https://digital.library.unt.edu/ark:/67531/metadc1538775/ ark: ark:/67531/metadc1538775 English Public Zhang, Yuan Copyright Copyright is held by the author, unless otherwise noted. All rights Reserved.
collection NDLTD
language English
format Others
sources NDLTD
topic information theory
entropy
data visualization
spellingShingle information theory
entropy
data visualization
Zhang, Yuan
The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
description Why is the term information in English an uncountable noun, whereas in information theory it is a well-defined quantity? Since the amount of information can be quantified, what is the meaning of a fraction of that amount? This dissertation introduces a quasi-entropy matrix which developed from Claude Shannon's information measure as an analytical tool for behavioral studies. Such matrix emphasizes the role of relative characteristics of individual level data across different collections. The real challenge in the big data era is never the size of the dataset, but how data lead scientists to individuals rather than arbitrarily divided statistical groups. This proposed matrix, when combining with other statistical measures, provides a new and easy-to-do method for identifying pattern in a well-defined system because it is built on the idea that uneven probability distributions lead to decrease in system entropy. Although the matrix is not superior to classical correlation techniques, it allows an interpretation not available with traditional standard statistics. Finally, this matrix connects heterogeneous datasets because it is a frequency-based method and it works on the modes of data rather than the means of values. It also visualizes clustering in data although this type of clustering is not measured by the squared Euclidean distance of the numerical attributes.
author2 O’Connor, Brian C.
author_facet O’Connor, Brian C.
Zhang, Yuan
author Zhang, Yuan
author_sort Zhang, Yuan
title The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
title_short The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
title_full The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
title_fullStr The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
title_full_unstemmed The Fabric of Entropy: A Discussion on the Meaning of Fractional Information
title_sort fabric of entropy: a discussion on the meaning of fractional information
publisher University of North Texas
publishDate 2019
url https://digital.library.unt.edu/ark:/67531/metadc1538775/
work_keys_str_mv AT zhangyuan thefabricofentropyadiscussiononthemeaningoffractionalinformation
AT zhangyuan fabricofentropyadiscussiononthemeaningoffractionalinformation
_version_ 1719487126379167744