Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model

Sources that generate symbolic sequences with algorithmic nature may differ in statistical complexity because they create structures that follow algorithmic schemes, rather than generating symbols from a probabilistic function assuming independence. In the case of Turing machines, this means that ma...

Full description

Bibliographic Details
Main Authors: Jorge M. Silva, Eduardo Pinho, Sérgio Matos, Diogo Pratas
Format: Article
Language:English
Published: MDPI AG 2020-01-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/1/105
id doaj-41c035dc0f7c45fa88aa4c48523db9a3
record_format Article
spelling doaj-41c035dc0f7c45fa88aa4c48523db9a32020-11-25T01:42:38ZengMDPI AGEntropy1099-43002020-01-0122110510.3390/e22010105e22010105Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov ModelJorge M. Silva0Eduardo Pinho1Sérgio Matos2Diogo Pratas3Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193 Aveiro, PortugalInstitute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193 Aveiro, PortugalInstitute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193 Aveiro, PortugalInstitute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193 Aveiro, PortugalSources that generate symbolic sequences with algorithmic nature may differ in statistical complexity because they create structures that follow algorithmic schemes, rather than generating symbols from a probabilistic function assuming independence. In the case of Turing machines, this means that machines with the same algorithmic complexity can create tapes with different statistical complexity. In this paper, we use a compression-based approach to measure global and local statistical complexity of specific Turing machine tapes with the same number of states and alphabet. Both measures are estimated using the best-order Markov model. For the global measure, we use the Normalized Compression (NC), while, for the local measures, we define and use normal and dynamic complexity profiles to quantify and localize lower and higher regions of statistical complexity. We assessed the validity of our methodology on synthetic and real genomic data showing that it is tolerant to increasing rates of editions and block permutations. Regarding the analysis of the tapes, we localize patterns of higher statistical complexity in two regions, for a different number of machine states. We show that these patterns are generated by a decrease of the tape’s amplitude, given the setting of small rule cycles. Additionally, we performed a comparison with a measure that uses both algorithmic and statistical approaches (BDM) for analysis of the tapes. Naturally, BDM is efficient given the algorithmic nature of the tapes. However, for a higher number of states, BDM is progressively approximated by our methodology. Finally, we provide a simple algorithm to increase the statistical complexity of a Turing machine tape while retaining the same algorithmic complexity. We supply a publicly available implementation of the algorithm in C++ language under the GPLv3 license. All results can be reproduced in full with scripts provided at the repository.https://www.mdpi.com/1099-4300/22/1/105turing machinesinformation theorystatistical complexityalgorithmic complexitycomputational complexitymarkov modelscompression-based analysis
collection DOAJ
language English
format Article
sources DOAJ
author Jorge M. Silva
Eduardo Pinho
Sérgio Matos
Diogo Pratas
spellingShingle Jorge M. Silva
Eduardo Pinho
Sérgio Matos
Diogo Pratas
Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
Entropy
turing machines
information theory
statistical complexity
algorithmic complexity
computational complexity
markov models
compression-based analysis
author_facet Jorge M. Silva
Eduardo Pinho
Sérgio Matos
Diogo Pratas
author_sort Jorge M. Silva
title Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
title_short Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
title_full Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
title_fullStr Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
title_full_unstemmed Statistical Complexity Analysis of Turing Machine tapes with Fixed Algorithmic Complexity Using the Best-Order Markov Model
title_sort statistical complexity analysis of turing machine tapes with fixed algorithmic complexity using the best-order markov model
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2020-01-01
description Sources that generate symbolic sequences with algorithmic nature may differ in statistical complexity because they create structures that follow algorithmic schemes, rather than generating symbols from a probabilistic function assuming independence. In the case of Turing machines, this means that machines with the same algorithmic complexity can create tapes with different statistical complexity. In this paper, we use a compression-based approach to measure global and local statistical complexity of specific Turing machine tapes with the same number of states and alphabet. Both measures are estimated using the best-order Markov model. For the global measure, we use the Normalized Compression (NC), while, for the local measures, we define and use normal and dynamic complexity profiles to quantify and localize lower and higher regions of statistical complexity. We assessed the validity of our methodology on synthetic and real genomic data showing that it is tolerant to increasing rates of editions and block permutations. Regarding the analysis of the tapes, we localize patterns of higher statistical complexity in two regions, for a different number of machine states. We show that these patterns are generated by a decrease of the tape’s amplitude, given the setting of small rule cycles. Additionally, we performed a comparison with a measure that uses both algorithmic and statistical approaches (BDM) for analysis of the tapes. Naturally, BDM is efficient given the algorithmic nature of the tapes. However, for a higher number of states, BDM is progressively approximated by our methodology. Finally, we provide a simple algorithm to increase the statistical complexity of a Turing machine tape while retaining the same algorithmic complexity. We supply a publicly available implementation of the algorithm in C++ language under the GPLv3 license. All results can be reproduced in full with scripts provided at the repository.
topic turing machines
information theory
statistical complexity
algorithmic complexity
computational complexity
markov models
compression-based analysis
url https://www.mdpi.com/1099-4300/22/1/105
work_keys_str_mv AT jorgemsilva statisticalcomplexityanalysisofturingmachinetapeswithfixedalgorithmiccomplexityusingthebestordermarkovmodel
AT eduardopinho statisticalcomplexityanalysisofturingmachinetapeswithfixedalgorithmiccomplexityusingthebestordermarkovmodel
AT sergiomatos statisticalcomplexityanalysisofturingmachinetapeswithfixedalgorithmiccomplexityusingthebestordermarkovmodel
AT diogopratas statisticalcomplexityanalysisofturingmachinetapeswithfixedalgorithmiccomplexityusingthebestordermarkovmodel
_version_ 1725035041065533440