The genomic locus (RPII215) encoding the largest subunit of mouse RNA polymerase II has been cloned by low stringency hybridization to a Drosophila RPII215 probe. The mouse gene consists of 28 exons which span 30 kilobases. Analysis of the nucleotide and predicted protein sequences indicates that the protein is comprised of two domains. There is a 1500 residue amino-terminal domain which contains seven regions strikingly similar to those in the beta' subunit of Escherichia coli RNA polymerase, and a carboxyl-terminal domain comprised of 52 repeats of a 7-amino-acid consensus sequence Tyr-Ser-Pro-Thr-Ser-Pro-Ser. Among the seven highly conserved regions are a strongly basic domain consistent with a DNA-binding site and a consensus sequence characteristic of a potential zinc-binding domain. The 5' upstream region contains three tandem sequences similar to binding sites for the transcription factor SP1. Two of the introns in this gene splice at donor GC dinucleotides as opposed to previously described invariant GT sites. The identification of regions which are highly conserved as compared with bacterial and yeast RNA polymerase and other regions which are unique to the mouse protein suggests which domains of RNA polymerase large subunits are involved in aspects of transcription common to both procaryotes and eucaryotes and which are characteristic of transcription in higher organisms.
|Original language||English (US)|
|Number of pages||11|
|Journal||Journal of Biological Chemistry|
|State||Published - Aug 5 1987|