Is a genome a codeword of an error-correcting code?

Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the exi...

Full description

Bibliographic Details
Main Authors: Luzinete C B Faria, Andréa S L Rocha, João H Kleinschmidt, Márcio C Silva-Filho, Edson Bim, Roberto H Herai, Michel E B Yamagishi, Reginaldo Palazzo
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3359345?pdf=render
id doaj-11c5bbea887d42a5900d0122658ab88c
record_format Article
spelling doaj-11c5bbea887d42a5900d0122658ab88c2020-11-25T00:46:33ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0175e3664410.1371/journal.pone.0036644Is a genome a codeword of an error-correcting code?Luzinete C B FariaAndréa S L RochaJoão H KleinschmidtMárcio C Silva-FilhoEdson BimRoberto H HeraiMichel E B YamagishiReginaldo PalazzoSince a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.http://europepmc.org/articles/PMC3359345?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Luzinete C B Faria
Andréa S L Rocha
João H Kleinschmidt
Márcio C Silva-Filho
Edson Bim
Roberto H Herai
Michel E B Yamagishi
Reginaldo Palazzo
spellingShingle Luzinete C B Faria
Andréa S L Rocha
João H Kleinschmidt
Márcio C Silva-Filho
Edson Bim
Roberto H Herai
Michel E B Yamagishi
Reginaldo Palazzo
Is a genome a codeword of an error-correcting code?
PLoS ONE
author_facet Luzinete C B Faria
Andréa S L Rocha
João H Kleinschmidt
Márcio C Silva-Filho
Edson Bim
Roberto H Herai
Michel E B Yamagishi
Reginaldo Palazzo
author_sort Luzinete C B Faria
title Is a genome a codeword of an error-correcting code?
title_short Is a genome a codeword of an error-correcting code?
title_full Is a genome a codeword of an error-correcting code?
title_fullStr Is a genome a codeword of an error-correcting code?
title_full_unstemmed Is a genome a codeword of an error-correcting code?
title_sort is a genome a codeword of an error-correcting code?
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.
url http://europepmc.org/articles/PMC3359345?pdf=render
work_keys_str_mv AT luzinetecbfaria isagenomeacodewordofanerrorcorrectingcode
AT andreaslrocha isagenomeacodewordofanerrorcorrectingcode
AT joaohkleinschmidt isagenomeacodewordofanerrorcorrectingcode
AT marciocsilvafilho isagenomeacodewordofanerrorcorrectingcode
AT edsonbim isagenomeacodewordofanerrorcorrectingcode
AT robertohherai isagenomeacodewordofanerrorcorrectingcode
AT michelebyamagishi isagenomeacodewordofanerrorcorrectingcode
AT reginaldopalazzo isagenomeacodewordofanerrorcorrectingcode
_version_ 1725264588169019392