Homepage  | Internal Site 

The figure overviews the pipeline used for the annotation of the sequences originated from the gene trap technique. The sequences are obtained from the clones were a successful insertion occurred.
We use Blat to map the sequences onto the mouse genome (assembly NCBI - 33). We do not process sequences with length less that 18 bp or those with very low quality score as calculated by Phred¹.
First we map the sequences to the mouse genome with BLAT². Afterwards we query the Ensembl database³ for the relevant annotation in that mapped region. We start by verifying if a known exon was trapped. If so, we store the relevant information of the corresponding gene. In case no known exon is trapped we try to identify if the trapped region corresponds to a known gene (perhaps a novel exon), a putative exon or gene (predict in the Ensembl pipeline) and finally to the EST's that have been aligned to the genome but are no associated with a Ensembl prediction.

1. Ewing B, Hillier L, Wendl MC, Green P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998 Mar;8(3):175-85.
2. Jim Kent. BLAT--the BLAST-like alignment tool. Genome Res. 2002 Apr;12(4):656-64.
3. Birney et al. An overview of Ensembl. Genome Res. 2004 May;14(5):925-8.

Homepage  | Internal Site  |  Browse  |  Blast  |  Clone Request |  Tigem |  IRBM