Alternative Splicing Prediction
In this project we intend to develop classifiers that are able to predict alternative splicing in DNA sequences.
In plants, such as A. thaliana, the most common form of alternative splicing is intron retention,
where introns are retained in mature mRNA under some conditions.
We are experimenting with classifiers such as Support Vector Machines (SVMs) based on Hidden
Markov Models (HMMs), Smith-Waterman alignment scores and other sequence-based scoring schemes.
SpliceGrapher
In pursuing these goals we have developed
SpliceGrapher
a python-based scripting tool that can use
annotated gene models, next-generation sequencing data (RNA-Seq) and EST or cDNA alignments to predict
alternative splicing events.
In our tests we have found that SpliceGrapher produces predictions that are more consistent with
annotated gene models than those of other software packages.
Related articles:
- SpliceGrapher: Detecting patterns of alternative splicing
from RNA-seq data in the context of gene models and EST data.
Genome Biology, Vol. 13, 2012
- Comparative analysis of serine/arginine-rich proteins across 27 eukaryotes:
insights into subfamily classification and extent of alternative splicing.
(Richardson et al., 2011)
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation
(Trapnell et al., 2009)
- RASE: recognition of alternatively spliced exons in C. elegans
(Rätsch et al., 2007)
- Nuclear Pre-mRNA Splicing in Plants(Reddy, 2001)