Patterns and algorithms in high-throughput sequencing count data

Patterns and algorithms in high-throughput sequencing count data

Computational and biological challenges

Scholar's Press ( 2017-01-31 )

€ 67,90

Buy at the MoreBooks! Shop

Proteins interacting with the genome, such as histones and transcription factors, play a major role in the regulation of gene expression. Experimental techniques such as ChIP-seq provide a new type of digital sequences that quantify the presence of a protein along the genome. These are count signals: sequences as long as the genetic code, but with the natural numbers as an alphabet. The computational analysis of these sequences is challenging, as the biological patterns are complex and the datasets are large. This thesis presents 3 efficient algorithms for pattern detection problems in count signals. The first infers the genomic locations of positioned nucleosomes by using an appropriate wavelet and by integrating measurements from multiple ChIP-seq experiments. The second characterizes the regulatory processes acting on the chromatin and is based on an accurate probabilistic model for read counts. The third detects transcription factor binding sites from ChIP-exo data by simultaneously modelling the sequence and the read counts associated to a binding event. Overall, the thesis presents a general computational framework that is likely to be important for future challenges.

Book Details:

ISBN-13:

978-3-659-84622-9

ISBN-10:

3659846228

EAN:

9783659846229

Book language:

English

By (author) :

Alessandro Mammana

Number of pages:

140

Published on:

2017-01-31

Category:

Informatics