Who, Where, When
| Instructor | Asa Ben-Hur Office: 448 Office Hours: By appointment. |
| Lecture | TR 11:00am-12:15pm at CSB 325 |
Course Outline
- A primer in biology; computational problems arising in biology (week 1)
- Sequence alignment for DNA and protein sequences (weeks 2 – 4)
- Concepts: homology, sequence similarity and sequence alignment; dynamic programming algorithms
- Pairwise alignment
- Global and local alignment using dynamic programming
- Heuristic alignment methods: BLAST/FASTA and the statistics of local alignments
- Multiple sequence alignment
- Definition, scoring, techniques
- Aligners for proteins sequences
- Spliced alignment
- Motif finding in DNA and proteins (week 5)
- Hidden Markov models (HMMs) (week 6 – 8)
- The basic HMM algorithms: forward, backward, Viterbi, Baum-Welch
- Applications: CpG islands, gene finding, profile HMMs, pair HMMs
- Genome assembly (week 9)
- Analysis of high-throughput sequencing data (week 10)
- Phylogenetic analysis (week 11 – 12)
- Why phylogeny?
- Neighbor joining, parsimony, and maximum likelihood methods
- Comparative genomics: gene regulation, gene finding, genome rearrangements (week 13)
- High throughput biological data: microarrays, mass spectrometry, and protein-protein interactions (week 14)
- Student project presentations and class summary (week 15)
Prerequisites:
Some background in statistics and ability to program (R and matlab are good enough)
Textbook:
Richard Durbin, Sean R. Eddy, Anders. Krogh and Graeme Mitchison. Biological Sequence Analysis. Cambridge UP, 1998.
Also useful:
Neil C. Jones and Pavel A. Pevzner
An introduction to bioinformatics algorithms
MIT Press, 2004
Grading:
| Assignments | 60% | Programming and written assignments |
| Project | 40% | Includes a written report and class presentation |
