DocumentCode :
3068755
Title :
On reconstructing a string from its substring compositions
Author :
Acharya, Jayadev ; Das, Hirakendu ; Milenkovic, Olgica ; Orlitsky, Alon ; Pan, Shengjun
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
1238
Lastpage :
1242
Abstract :
Motivated by protein sequencing, we consider the problem of reconstructing a string from the compositions of its substrings. We provide several results, including the following. General classes of strings that cannot be distinguished from their substring compositions. An almost complete characterization of the lengths for which reconstruction is possible. Bounds on the number of strings with the same substring compositions in terms of the number of divisors of the string length plus one. A relation to the turnpike problem and a bivariate polynomial formulation of string reconstruction.
Keywords :
biocomputing; combinatorial mathematics; proteins; bivariate polynomial formulation; protein sequencing; string reconstruction; substring compositions; Amino acids; Binary sequences; Ion sources; Mass spectroscopy; Polynomials; Protein sequence; Reconstruction algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Theory Proceedings (ISIT), 2010 IEEE International Symposium on
Conference_Location :
Austin, TX
Print_ISBN :
978-1-4244-7890-3
Electronic_ISBN :
978-1-4244-7891-0
Type :
conf
DOI :
10.1109/ISIT.2010.5513668
Filename :
5513668
Link To Document :
بازگشت