Efficient String Algorithms with Applications in Bioinformatics

Efficient String Algorithms with Applications in Bioinformatics
Author :
Publisher :
Total Pages : 73
Release :
ISBN-10 : OCLC:1293446692
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Efficient String Algorithms with Applications in Bioinformatics by : Sahar Hooshmand

Download or read book Efficient String Algorithms with Applications in Bioinformatics written by Sahar Hooshmand and published by . This book was released on 2020 with total page 73 pages. Available in PDF, EPUB and Kindle. Book excerpt: The work presented in this dissertation deals with establishing efficient methods for solving some algorithmic problems, which have applications to Bioinformatics. After a short introduction in Chapter 1, an algorithm for genome mappability problem is presented in Chapter 2. Genome mappability is a measure for the approximate repeat structure of the genome with respect to substrings of specific length and a tolerance to define the number of mismatches. The similarity between reads is measured by using the Hamming distance function. Genome mappability is computed for each position in the string and has several applications in designing high-throughput short-read sequencing experiments. Chapter 3, presents an algorithm to compute the Average Common Substring of two input sequences in their run-length encoded format. The distance between them based on the Average Common Substring measure can be computed in linearithmic time and linear space proportional to the total length of sequences after run-length encoding. Chapter 4, presents a method that produces a better approximation for Average Common Substring calculations where we are allowed to have mismatches. This method is applicable to the alignmentfree comparison of biological sequences at highly competitive speed. Finally, in Chapter 5, we present two algorithms to efficiently decode the Suffix Array/Inverse Suffix Array of the reveres text, by using the FM-index of the forward text. Additionally, our experimental results are competitive when compared to the standard approach of maintaining the FM-Index for both the forward and the reverse text in approximate string-matching applications.

Efficient String Algorithms with Applications in Bioinformatics Related Books