msaFragment   
  

introduction to bioinformatics algorithms

CS/ST480 - Spring 2006

Where: Engineering E105

When: 9-10 MWF

Course description

    The past decade has seen an explosion of biological data, including the DNA sequences of a number of organisms. Computer science and statistics play a major role in analyzing these data: from assembling the genomic sequence to predicting the protein-coding regions, their function and the manner in which they are regulated. The course will focus on biological sequence analysis, and analysis of gene expression data, starting with the biological background motivating the various computational problems, and methods for their solution. Computational techniques covered in the course include suffix trees for pattern matching, dynamic programming for sequence alignment, hidden Markov models for gene finding, Gibbs sampling for motif discovery, and Likelihood and Bayesian methods for gene expression estimation.

    The course is aimed at senior undergraduates and graduate students interested in computational biology/bioinformatics. It is expected that students from computer science, statistics, and the life sciences will benefit from such a course. It will provide a broad overview of the computational and statistical techniques currently used in bioinformatics. Students will obtain in-depth understanding of standard bioinformatics tools and learn to use various sources of sequence data. The hands-on lab component of the course will consist of implementing and using methods for analysis of sequence and gene expression data.

Main texts

  • Neil C. Jones and Pavel A. Pevzner. An introduction to bioinformatics algorithms. MIT Press, 2004.
  • Richard Durbin, Sean R. Eddy, Anders. Krogh and Graeme Mitchison. Biological Sequence Analysis. Cambridge UP, 1998.

Additional references

  • David Mount, Bioinformatics: Sequence and genome analysis, second edition. CSHL Press, 2004.
  • Richard C. Deonier, Simon Tavaré, and Michael S. Waterman. Computational Genome Analysis: An Introduction, 2005.

Prerequisites

    (a) ST301 or ST307 or ST309, and (b) CS301 or ST420, OR permission of instructor