Software
- Twinscan/N-SCAN - Latest Version - 4.1.2
- Twinscan/N-SCAN is our lab's suite of software for gene-structure prediction. We recommend running N-SCAN on our web server (http://mblab.wustl.edu/nscan). Twinscan is currently available for Mammals, Caenorhabditis (worm), Dicot plants, and Cryptococci. N-SCAN is available for human and Drosophila (fruitfly).
- Validate GTF - Latest Version - 1.0
- GTF (gene transfer format) is a simple, precise format for annotating protein coding genes which has been used for a number of multi-group annotation efforts. It is a specialized subset of GFF (general feature format). The specification for GTF can be found here.
If a GTF file contains errors, programs that parse it can yield false conclusions. Validate GTF is a flexible Perl script that checks a GTF file for correctness. It can detect most common syntactic errors, such as including the stop codon within the CDS annotation. It can also detect semantic errors, such as annotated coding sequence that contains stop codons spanning splice sites.
- PPFINDER - Latest Version - 1.0
- PPFINDER is a Perl-based procedure for finding processed pseudogenes in genome annotations (GTF format). PPFINDER identifies pseudogenes using two different methods, and creates a GTF file with pseudogene coordinates that can be used for masking the genome sequence. In gene prediction mode, PPFINDER will mask out the genomic sequence and rerun gene prediction. Rounds of pseudogene masking and gene prediction are iterated until convergence.
- eval - Latest Version - 2.2.8
- Eval is a flexible tool for analyzing the performance of gene-structure prediction programs. It provides summaries and graphical distributions for many statistics describing any set of annotations, regardless of their source. It also compares sets of predictions to standard annotations and to one another.
- primerD - Latest Version - 1.0
- The primerD program implements a novel algorithm for the design of unique degenerate primer pairs. Using primers designed by primerD it is possible to reduce the total number of primers that need to be amplified by over 35% while amplifying the same number of targets. This program has a highly configurable set of parameters that account for both basic PCR and degenerate primer constraints. As a final verification of primer pair efficacy an extensive mis-priming check is done to ensure the uniqueness of each intended amplification.
- Pairagon - Latest Version - 1.1
- Pairagon is a pair-HMM based cDNA-to-genome alignment program.
- Trace Recalling - Latest Version - 0.5
- Trace recalling package described in genome research 17:212-218
- iParameterEstimation - Latest Version - 0.3.1
- iParameterEstimation is a highly configurable maximum likelihood parameter estimation package for generalized Hidden Markov Models. Specific configuration files are included for training Twinscan, N-SCAN and their EST variants.
Note: This software requires the eval package, available here.
- GRENDEL - Latest Version - 0.2
- GRENDEL (Gene Regulatory Network Decoding Evaluations tooL), generates random gene regulatory networks according to user defined constraints on the network topology and kinetics. It then
simulates the state of each regulatory network under various user defined conditions (the experimental design) and produces simulated gene expression data.