Swiftly Computing Center Strings

Abstract Background The center string (or closest string) problem is a classic computer science problem with important applications in computational biology. Given <it>k </it>input strings and a distance threshold <it>d</it>, we...

Full description

Bibliographic Details
Main Authors:	Jahn Katharina, Kuchenbecker Léon, Hufsky Franziska, Stoye Jens, Böcker Sebastian
Format:	Article
Language:	English
Published:	BMC 2011-04-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/12/106

id	doaj-ef521c8520174adda8f9fd9a1b4b4ead
record_format	Article
spelling	doaj-ef521c8520174adda8f9fd9a1b4b4ead2020-11-24T21:53:01ZengBMCBMC Bioinformatics1471-21052011-04-0112110610.1186/1471-2105-12-106Swiftly Computing Center StringsJahn KatharinaKuchenbecker LéonHufsky FranziskaStoye JensBöcker Sebastian<p>Abstract</p> <p>Background</p> <p>The center string (or closest string) problem is a classic computer science problem with important applications in computational biology. Given <it>k </it>input strings and a distance threshold <it>d</it>, we search for a string within Hamming distance at most <it>d </it>to each input string. This problem is NP complete.</p> <p>Results</p> <p>In this paper, we focus on exact methods for the problem that are also swift in application. We first introduce data reduction techniques that allow us to infer that certain instances have no solution, or that a center string must satisfy certain conditions. We describe how to use this information to speed up two previously published search tree algorithms. Then, we describe a novel iterative search strategy that is effecient in practice, where some of our reduction techniques can also be applied. Finally, we present results of an evaluation study for two different data sets from a biological application.</p> <p>Conclusions</p> <p>We find that the running time for computing the optimal center string is dominated by the subroutine calls for <it>d </it>= <it>d</it><sub>opt </sub>-1 and <it>d </it>= <it>d</it><sub>opt</sub>. Our data reduction is very effective for both, either rejecting unsolvable instances or solving trivial positions. We find that this speeds up computations considerably.</p> http://www.biomedcentral.com/1471-2105/12/106
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jahn Katharina Kuchenbecker Léon Hufsky Franziska Stoye Jens Böcker Sebastian
spellingShingle	Jahn Katharina Kuchenbecker Léon Hufsky Franziska Stoye Jens Böcker Sebastian Swiftly Computing Center Strings BMC Bioinformatics
author_facet	Jahn Katharina Kuchenbecker Léon Hufsky Franziska Stoye Jens Böcker Sebastian
author_sort	Jahn Katharina
title	Swiftly Computing Center Strings
title_short	Swiftly Computing Center Strings
title_full	Swiftly Computing Center Strings
title_fullStr	Swiftly Computing Center Strings
title_full_unstemmed	Swiftly Computing Center Strings
title_sort	swiftly computing center strings
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2011-04-01
description	<p>Abstract</p> <p>Background</p> <p>The center string (or closest string) problem is a classic computer science problem with important applications in computational biology. Given <it>k </it>input strings and a distance threshold <it>d</it>, we search for a string within Hamming distance at most <it>d </it>to each input string. This problem is NP complete.</p> <p>Results</p> <p>In this paper, we focus on exact methods for the problem that are also swift in application. We first introduce data reduction techniques that allow us to infer that certain instances have no solution, or that a center string must satisfy certain conditions. We describe how to use this information to speed up two previously published search tree algorithms. Then, we describe a novel iterative search strategy that is effecient in practice, where some of our reduction techniques can also be applied. Finally, we present results of an evaluation study for two different data sets from a biological application.</p> <p>Conclusions</p> <p>We find that the running time for computing the optimal center string is dominated by the subroutine calls for <it>d </it>= <it>d</it><sub>opt </sub>-1 and <it>d </it>= <it>d</it><sub>opt</sub>. Our data reduction is very effective for both, either rejecting unsolvable instances or solving trivial positions. We find that this speeds up computations considerably.</p>
url	http://www.biomedcentral.com/1471-2105/12/106
work_keys_str_mv	AT jahnkatharina swiftlycomputingcenterstrings AT kuchenbeckerleon swiftlycomputingcenterstrings AT hufskyfranziska swiftlycomputingcenterstrings AT stoyejens swiftlycomputingcenterstrings AT bockersebastian swiftlycomputingcenterstrings
_version_	1725873346063630336

Swiftly Computing Center Strings

Similar Items