Information Theory in Computational Biology: Where We Stand Today

“A Mathematical Theory of Communication” was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon’s work have formed the basis of information theory,...

Full description

Bibliographic Details
Main Authors: Pritam Chanda, Eduardo Costa, Jie Hu, Shravan Sukumar, John Van Hemert, Rasna Walia
Format: Article
Language:English
Published: MDPI AG 2020-06-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/6/627
id doaj-a059fcf1c50a4a30806e5bf379722d94
record_format Article
spelling doaj-a059fcf1c50a4a30806e5bf379722d942020-11-25T03:48:48ZengMDPI AGEntropy1099-43002020-06-012262762710.3390/e22060627Information Theory in Computational Biology: Where We Stand TodayPritam Chanda0Eduardo Costa1Jie Hu2Shravan Sukumar3John Van Hemert4Rasna Walia5Corteva Agriscience™, Indianapolis, IN 46268, USACorteva Agriscience™, Mogi Mirim, Sao Paulo 13801-540, BrazilCorteva Agriscience™, Indianapolis, IN 46268, USACorteva Agriscience™, Indianapolis, IN 46268, USACorteva Agriscience™, Johnston, IA 50131, USACorteva Agriscience™, Johnston, IA 50131, USA“A Mathematical Theory of Communication” was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon’s work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology—gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.https://www.mdpi.com/1099-4300/22/6/627information theoryentropycomputational biologygene expressiontranscriptomicssequence comparison
collection DOAJ
language English
format Article
sources DOAJ
author Pritam Chanda
Eduardo Costa
Jie Hu
Shravan Sukumar
John Van Hemert
Rasna Walia
spellingShingle Pritam Chanda
Eduardo Costa
Jie Hu
Shravan Sukumar
John Van Hemert
Rasna Walia
Information Theory in Computational Biology: Where We Stand Today
Entropy
information theory
entropy
computational biology
gene expression
transcriptomics
sequence comparison
author_facet Pritam Chanda
Eduardo Costa
Jie Hu
Shravan Sukumar
John Van Hemert
Rasna Walia
author_sort Pritam Chanda
title Information Theory in Computational Biology: Where We Stand Today
title_short Information Theory in Computational Biology: Where We Stand Today
title_full Information Theory in Computational Biology: Where We Stand Today
title_fullStr Information Theory in Computational Biology: Where We Stand Today
title_full_unstemmed Information Theory in Computational Biology: Where We Stand Today
title_sort information theory in computational biology: where we stand today
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2020-06-01
description “A Mathematical Theory of Communication” was published in 1948 by Claude Shannon to address the problems in the field of data compression and communication over (noisy) communication channels. Since then, the concepts and ideas developed in Shannon’s work have formed the basis of information theory, a cornerstone of statistical learning and inference, and has been playing a key role in disciplines such as physics and thermodynamics, probability and statistics, computational sciences and biological sciences. In this article we review the basic information theory based concepts and describe their key applications in multiple major areas of research in computational biology—gene expression and transcriptomics, alignment-free sequence comparison, sequencing and error correction, genome-wide disease-gene association mapping, metabolic networks and metabolomics, and protein sequence, structure and interaction analysis.
topic information theory
entropy
computational biology
gene expression
transcriptomics
sequence comparison
url https://www.mdpi.com/1099-4300/22/6/627
work_keys_str_mv AT pritamchanda informationtheoryincomputationalbiologywherewestandtoday
AT eduardocosta informationtheoryincomputationalbiologywherewestandtoday
AT jiehu informationtheoryincomputationalbiologywherewestandtoday
AT shravansukumar informationtheoryincomputationalbiologywherewestandtoday
AT johnvanhemert informationtheoryincomputationalbiologywherewestandtoday
AT rasnawalia informationtheoryincomputationalbiologywherewestandtoday
_version_ 1724496997108416512