Background The proper assembly of the transcriptional initiation machinery is a key regulatory step in the execution of the correct program of mRNA synthesis. splicing variation of the second exon, (iii) extension of the annotated first exon, (iv) shortening of the annotated first exon, (v) confirmation of previously annotated TSS. Conclusion In silico and experimental analysis of the transcripts proved to be essential for the ultimate mapping of TSSs. Our results highlight the necessity of a tissue specific approach to complete the existing gene annotation. The new TSSs and transcribed sequences are essential for further exploration of the promoter and other cis-regulatory sequences at the 5’end of genes. Background The spatial and temporal regulation of gene transcription is usually primarily determined by it’s flanking promoter (cis-regulatory DNA elements) through conversation with trans-acting regulatory proteins (transcription factors) [1,2]. The start of transcription is Mouse monoclonal to CCNB1 accomplished by the formation of a pre-initiation complex around the DNA, yet our knowledge of transcriptional initiation sequences in the human genome is still limited despite the availability of the complete genome sequence [3,4]. Therefore one of the main remaining challenges is usually to locate these gene sequences, defined as the transcription start site (TSS), in order to explore core promoter and Chondroitin sulfate IC50 cis-regulatory elements that direct the start of every transcript. Genomic structure and full length cDNA sequences aligned around the genome provide opportunities to locate TSSs. Conventional methods for determining exact TSSs, such as 5′ RACE or primer extension are laborious and are not selective for the complete transcript. Consequently, many mRNA sequences stored in public databases lack information about their genuine 5′ ends, mainly due to the difficulties in obtaining full-length cDNA. Several bioinformatic and experimental approaches have been developed to explore full-length cDNAs and the human transcriptome [5]. Computational predictions Chondroitin sulfate IC50 may represent a powerful tool to localize first exons and TSSs on an averaged genome-wide scale [6,7], however they may fail at the Chondroitin sulfate IC50 level of individual genes or in genes with complex regulatory patterns (e.g. multiple or tissue-specific TSS). Recently a number of experimental approaches to compile TSSs on a genome-wide scale have been established including the Database of human Transcriptional Start Sites (DBTSS) [8], whole genome tilling array analysis [9], and the exploration of mouse and human CAGE tag libraries [10]. To enable future progress we need to complete and revise these catalogues with an accurate annotation of the 5′ and 3’end, and include splice isoforms of the transcripts. In addition to genome wide approaches, there is a need for more specific studies, which cover tissue specific genes, expressed in a restricted manner. Identification of potential transcription signals that are tissue specific relies on the correct determination of transcriptional start sites. In this work we describe an experimental approach to identify the TSSs of a selected group of genes, which are predominantly expressed in retina. We focused our attention around the human retina, due to its unique and specialised function. This complex tissue, composed of multiple, highly differentiated and specific cell types (e.g. rod and cone photoreceptors, amacrine cells, Mueller glial cells), expresses a large number of specific genes. Mutations in many of these genes result in blinding disorders. Chondroitin sulfate IC50 The subset of genes expressed in human retina has been partially elucidated [11,12], with a number of studies defining genes that are either highly expressed in retina or which pose a crucial target of transcription factors Chondroitin sulfate IC50 in this tissue [13-16]..