Showing Protein DNA-directed RNA polymerase II subunit RPB1 (CDBP00698)
Identification | ||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
HMDB Protein ID | CDBP00698 | |||||||||||||||||||||||||||||||||||||||||
Secondary Accession Numbers | Not Available | |||||||||||||||||||||||||||||||||||||||||
Name | DNA-directed RNA polymerase II subunit RPB1 | |||||||||||||||||||||||||||||||||||||||||
Description | Not Available | |||||||||||||||||||||||||||||||||||||||||
Synonyms |
|
|||||||||||||||||||||||||||||||||||||||||
Gene Name | POLR2A | |||||||||||||||||||||||||||||||||||||||||
Protein Type | Enzyme | |||||||||||||||||||||||||||||||||||||||||
Biological Properties | ||||||||||||||||||||||||||||||||||||||||||
General Function | Involved in DNA binding | |||||||||||||||||||||||||||||||||||||||||
Specific Function | DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Largest and catalytic component of RNA polymerase II which synthesizes mRNA precursors and many functional non-coding RNAs. Forms the polymerase active center together with the second largest subunit. Pol II is the central component of the basal RNA polymerase II transcription machinery. It is composed of mobile elements that move relative to each other. RPB1 is part of the core element with the central large cleft, the clamp element that moves to open and close the cleft and the jaws that are thought to grab the incoming DNA template. At the start of transcription, a single stranded DNA template strand of the promoter is positioned within the central active site cleft of Pol II. A bridging helix emanates from RPB1 and crosses the cleft near the catalytic site and is thought to promote translocation of Pol II by acting as a ratchet that moves the RNA-DNA hybrid through the active site by switching from straight to bent conformations at each step of nucleotide addition. During transcription elongation, Pol II moves on the template as the transcript elongates. Elongation is influenced by the phosphorylation status of the C-terminal domain (CTD) of Pol II largest subunit (RPB1), which serves as a platform for assembly of factors that regulate transcription initiation, elongation, termination and mRNA processing. Acts as a RNA-dependent RNA polymerase when associated with small delta antigen of Hepatitis delta virus, acting both as a replicate and transcriptase for the viral RNA circular genome. | |||||||||||||||||||||||||||||||||||||||||
GO Classification |
|
|||||||||||||||||||||||||||||||||||||||||
Cellular Location |
|
|||||||||||||||||||||||||||||||||||||||||
Pathways |
|
|||||||||||||||||||||||||||||||||||||||||
Gene Properties | ||||||||||||||||||||||||||||||||||||||||||
Chromosome Location | 17 | |||||||||||||||||||||||||||||||||||||||||
Locus | 17p13.1 | |||||||||||||||||||||||||||||||||||||||||
SNPs | POLR2A | |||||||||||||||||||||||||||||||||||||||||
Gene Sequence |
>5913 bp ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA GTCCAGTTCGGAGTCCTGAGTCCGGATGAACTGAAGCGAATGTCTGTGACGGAGGGTGGC ATCAAATACCCAGAGACGACTGAGGGAGGCCGCCCCAAGCTTGGGGGGCTGATGGACCCG AGGCAGGGGGTGATTGAGCGGACTGGCCGCTGCCAAACATGTGCAGGAAACATGACAGAG TGTCCTGGCCACTTTGGCCACATTGAACTGGCCAAGCCTGTGTTTCACGTGGGCTTCCTG GTGAAGACAATGAAAGTTTTGCGCTGTGTCTGCTTCTTCTGCTCCAAACTGCTTGTGGAC TCTAACAACCCAAAGATCAAGGATATCCTGGCTAAGTCCAAGGGACAGCCCAAGAAGCGG CTCACACATGTCTACGACCTTTGCAAGGGCAAAAACATATGCGAGGGTGGGGAGGAGATG GACAACAAGTTCGGTGTGGAACAACCTGAGGGTGACGAGGATCTGACCAAAGAAAAGGGC CATGGTGGCTGTGGGCGGTACCAGCCCAGGATCCGGCGTTCTGGCCTAGAGCTGTATGCG GAATGGAAGCACGTTAATGAGGACTCTCAGGAGAAGAAGATCCTGCTGAGTCCAGAGCGA GTGCATGAGATCTTCAAACGCATCTCAGATGAGGAGTGTTTTGTGCTGGGCATGGAGCCC CGCTATGCACGGCCAGAGTGGATGATTGTCACAGTGCTGCCTGTGCCCCCGCTCTCCGTG CGGCCTGCTGTTGTGATGCAGGGCTCTGCCCGTAACCAGGATGACCTGACTCACAAACTG GCTGACATCGTGAAGATCAACAATCAGCTGCGGCGCAATGAGCAGAACGGCGCAGCGGCC CATGTCATTGCAGAGGATGTGAAGCTCCTCCAGTTCCATGTGGCCACCATGGTGGACAAT GAGCTGCCTGGCTTGCCCCGTGCCATGCAGAAGTCTGGGCGTCCCCTCAAGTCCCTGAAG CAGCGGTTGAAGGGCAAGGAAGGCCGGGTGCGAGGGAACCTGATGGGCAAAAGAGTGGAC TTCTCGGCCCGTACTGTCATCACCCCCGACCCCAACCTCTCCATTGACCAGGTTGGCGTG CCCCGCTCCATTGCTGCCAACATGACCTTTGCGGAGATTGTCACCCCCTTCAACATTGAC AGACTTCAAGAACTAGTGCGCAGGGGGAACAGTCAGTACCCAGGCGCCAAGTACATCATC CGAGACAATGGTGATCGCATTGACTTGCGTTTCCACCCCAAGCCCAGTGACCTTCACCTG CAGACCGGCTATAAGGTGGAACGGCACATGTGTGATGGGGACATTGTTATCTTCAACCGG CAGCCAACTCTGCACAAAATGTCCATGATGGGGCATCGGGTCCGCATTCTCCCATGGTCT ACCTTTCGCTTGAATCTTAGCGTGACAACTCCGTACAATGCAGACTTTGACGGGGATGAG ATGAACTTGCACCTGCCACAGTCTCTGGAGACGCGAGCAGAGATCCAGGAGCTGGCCATG GTTCCTCGCATGATTGTCACCCCCCAGAGCAATCGGCCTGTCATGGGTATTGTGCAGGAC ACACTCACAGCAGTGCGCAAATTCACCAAGAGAGACGTCTTCCTGGAGCGGGGTGAAGTG ATGAACCTCCTGATGTTCCTGTCGACGTGGGATGGGAAGGTCCCACAGCCGGCCATCCTA AAGCCCCGGCCCCTGTGGACAGGCAAGCAAATCTTCTCCCTCATCATACCTGGTCACATC AATTGTATCCGTACCCACAGCACCCATCCCGATGATGAAGACAGTGGCCCTTACAAGCAC ATCTCTCCTGGGGACACCAAGGTGGTGGTGGAGAATGGGGAGCTGATCATGGGCATCCTG TGTAAGAAGTCTCTGGGCACGTCAGCTGGCTCCCTGGTCCACATCTCCTACCTAGAGATG GGTCATGACATCACTCGCCTCTTCTACTCCAACATTCAGACTGTCATTAACAACTGGCTC CTCATCGAGGGTCATACTATTGGCATTGGGGACTCCATTGCTGATTCTAAGACTTACCAG GACATTCAGAACACTATTAAGAAGGCCAAGCAGGACGTAATAGAGGTCATCGAGAAGGCA CACAACAATGAGCTGGAGCCCACCCCAGGGAACACTCTGCGGCAGACGTTTGAGAATCAG GTGAACCGCATTCTTAACGATGCCCGAGACAAGACTGGCTCCTCTGCTCAGAAATCCCTG TCTGAATACAACAACTTCAAGTCTATGGTCGTGTCCGGAGCTAAAGGTTCCAAGATTAAC ATCTCCCAGGTCATTGCTGTCGTTGGACAGCAGAACGTCGAGGGCAAGCGGATTCCATTT GGCTTCAAGCACCGGACTCTGCCTCACTTCATCAAGGATGACTACGGGCCTGAGAGCCGT GGCTTTGTGGAGAACTCCTACCTAGCCGGCCTCACACCCACTGAGTTCTTTTTCCACGCC ATGGGGGGTCGTGAGGGGCTCATTGACACGGCTGTCAAGACTGCTGAGACTGGATACATC CAGCGGCGGCTGATCAAGTCCATGGAGTCAGTGATGGTGAAGTACGACGCGACTGTGCGG AACTCCATCAACCAGGTGGTGCAGCTGCGCTACGGCGAAGACGGCCTGGCAGGCGAGAGC GTTGAGTTCCAGAACCTGGCTACGCTTAAGCCTTCCAACAAGGCTTTTGAGAAGAAGTTC CGCTTTGATTATACCAATGAGAGGGCCCTGCGGCGCACTCTGCAGGAGGACCTGGTGAAG GACGTGCTGAGCAACGCACACATCCAGAACGAGTTGGAGCGGGAATTTGAGCGGATGCGG GAGGATCGGGAGGTGCTCAGGGTCATCTTCCCAACTGGAGACAGCAAGGTCGTCCTCCCC TGTAACCTGCTGCGGATGATCTGGAATGCTCAGAAAATCTTCCACATCAACCCACGCCTT CCCTCCGACCTGCACCCCATCAAAGTGGTGGAGGGAGTCAAGGAATTGAGCAAGAAGCTG GTGATTGTGAATGGGGATGACCCACTAAGTCGACAGGCCCAGGAAAATGCCACGCTGCTC TTCAACATCCACCTGCGGTCCACGTTGTGTTCCCGCCGCATGGCAGAGGAGTTTCGGCTC AGTGGGGAGGCCTTCGACTGGCTGCTTGGGGAGATTGAGTCCAAGTTCAACCAAGCCATT GCGCATCCCGGGGAAATGGTGGGGGCTCTGGCTGCGCAGTCCCTTGGAGAACCTGCCACC CAGATGACCTTGAATACCTTCCACTATGCTGGTGTGTCTGCCAAGAATGTGACGCTGGGT GTGCCCCGACTTAAGGAGCTCATCAACATTTCCAAGAAGCCAAAGACTCCTTCGCTTACT GTCTTCCTGTTGGGCCAGTCCGCTCGAGATGCTGAGAGAGCCAAGGATATTCTGTGCCGT CTGGAGCATACAACGTTGAGGAAGGTGACTGCCAACACAGCCATCTACTATGACCCCAAC CCCCAGAGCACGGTGGTGGCAGAGGATCAGGAATGGGTGAATGTCTACTATGAAATGCCT GACTTTGATGTGGCCCGAATCTCCCCCTGGCTGTTGCGGGTGGAGCTGGATCGGAAGCAC ATGACTGACCGGAAGCTCACCATGGAGCAGATTGCTGAAAAGATCAATGCTGGTTTTGGT GACGACTTGAACTGCATCTTTAATGATGACAATGCAGAGAAGCTGGTGCTCCGTATTCGC ATCATGAACAGCGATGAGAACAAGATGCAAGAGGAGGAAGAGGTGGTGGACAAGATGGAT GATGATGTCTTCCTGCGCTGCATCGAGTCCAACATGCTGACAGATATGACCCTGCAGGGC ATCGAGCAGATCAGCAAGGTGTACATGCACTTGCCACAGACAGACAACAAGAAGAAGATC ATCATCACGGAGGATGGGGAATTCAAGGCCCTGCAGGAGTGGATCCTGGAGACGGACGGC GTGAGCTTGATGCGGGTGCTGAGTGAGAAGGACGTGGACCCCGTACGCACCACGTCCAAT GACATTGTGGAGATCTTCACGGTGCTGGGCATTGAAGCCGTGCGGAAGGCCCTGGAGCGG GAGCTGTACCACGTCATCTCCTTTGATGGCTCCTATGTCAATTACCGACACTTGGCTCTC TTGTGTGATACCATGACCTGTCGTGGCCACTTGATGGCCATCACCCGACACGGAGTCAAC CGCCAGGACACAGGACCACTCATGAAGTGTTCCTTTGAGGAAACGGTGGACGTGCTTATG GAAGCAGCCGCACACGGTGAGAGTGACCCCATGAAGGGGGTCTCTGAGAATATCATGCTG GGCCAGCTGGCTCCGGCCGGCACTGGCTGCTTTGACCTCCTGCTTGATGCAGAGAAGTGC AAGTATGGCATGGAGATCCCCACCAATATCCCCGGCCTGGGGGCTGCTGGACCCACCGGC ATGTTCTTTGGTTCAGCACCCAGTCCCATGGGTGGAATCTCTCCTGCCATGACACCTTGG AACCAGGGTGCAACCCCTGCCTATGGCGCCTGGTCCCCCAGTGTTGGGAGTGGAATGACC CCAGGGGCAGCCGGTTTCTCTCCCAGTGCTGCGTCAGATGCCAGCGGCTTCAGCCCAGGT TACTCCCCTGCCTGGTCTCCCACACCGGGCTCCCCGGGGTCCCCAGGTCCCTCAAGCCCC TACATCCCTTCACCAGGTGGCGCCATGTCTCCCAGCTACTCGCCAACGTCACCTGCCTAC GAGCCCCGCTCTCCTGGGGGCTACACACCCCAGAGTCCCTCTTATTCCCCCACTTCACCC TCCTACTCCCCTACCTCTCCATCCTATTCTCCAACCAGTCCCAACTATAGTCCCACATCA CCCAGCTATTCGCCAACGTCACCCAGCTACTCACCGACCTCTCCCAGCTACTCACCCACC TCTCCCAGCTACTCGCCCACCTCTCCCAGCTATTCGCCCACCTCTCCCAGCTACTCACCC ACTTCCCCTAGCTATTCGCCCACTTCCCCTAGCTACTCGCCAACGTCTCCCAGCTACTCG CCGACATCTCCCAGCTACTCGCCAACTTCACCCAGCTATTCTCCCACTTCTCCCAGCTAC TCACCTACCTCTCCAAGCTATTCACCCACCTCCCCCAGCTACTCACCCACTTCCCCAAGT TACTCACCCACCAGCCCGAACTATTCTCCAACCAGTCCCAATTACACCCCAACATCACCC AGCTACAGCCCGACATCACCCAGCTATTCCCCTACTAGTCCCAACTACACACCTACCAGC CCTAACTACAGCCCAACCTCTCCAAGCTACTCTCCAACATCACCCAGCTATTCCCCGACC TCACCAAGTTACTCCCCTTCCAGCCCACGATACACACCACAGTCTCCAACCTATACCCCA AGCTCACCCAGCTACAGCCCCAGTTCGCCCAGCTACAGCCCAACCTCACCCAAGTACACC CCAACCAGTCCTTCTTATAGTCCCAGCTCCCCAGAGTATACCCCAACCTCTCCCAAGTAC TCACCTACCAGTCCCAAATATTCACCCACCTCTCCCAAGTACTCGCCTACCAGTCCCACC TATTCACCCACCACCCCAAAATACTCCCCAACATCTCCTACTTATTCCCCAACCTCTCCA GTCTACACCCCAACCTCTCCCAAGTACTCACCTACTAGCCCCACTTACTCGCCCACTTCC CCCAAGTACTCGCCCACCAGCCCCACCTACTCGCCCACCTCCCCCAAAGGCTCAACCTAC TCTCCCACTTCCCCTGGTTACTCGCCCACCAGCCCCACCTACAGTCTCACAAGCCCGGCT ATCAGCCCGGATGACAGTGACGAGGAGAACTGA |
|||||||||||||||||||||||||||||||||||||||||
Protein Properties | ||||||||||||||||||||||||||||||||||||||||||
Number of Residues | 1970 | |||||||||||||||||||||||||||||||||||||||||
Molecular Weight | 217204.265 | |||||||||||||||||||||||||||||||||||||||||
Theoretical pI | 7.365 | |||||||||||||||||||||||||||||||||||||||||
Pfam Domain Function | ||||||||||||||||||||||||||||||||||||||||||
Signals | Not Available | |||||||||||||||||||||||||||||||||||||||||
Transmembrane Regions | Not Available | |||||||||||||||||||||||||||||||||||||||||
Protein Sequence |
>DNA-directed RNA polymerase II subunit RPB1 MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDP RQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVD SNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKG HGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEP RYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAA HVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVD FSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYII RDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWS TFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQD TLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHI NCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEM GHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKA HNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKIN ISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHA MGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGES VEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMR EDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKL VIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAI AHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLT VFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMP DFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIR IMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALER ELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLM EAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTG MFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPG YSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSP SYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSP TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS YSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPT SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPASPKYTPTSPSYSPSSPEYTPTSPKY SPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTS PKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN |
|||||||||||||||||||||||||||||||||||||||||
External Links | ||||||||||||||||||||||||||||||||||||||||||
GenBank ID Protein | 36124 | |||||||||||||||||||||||||||||||||||||||||
UniProtKB/Swiss-Prot ID | P24928 | |||||||||||||||||||||||||||||||||||||||||
UniProtKB/Swiss-Prot Entry Name | RPB1_HUMAN | |||||||||||||||||||||||||||||||||||||||||
PDB IDs | ||||||||||||||||||||||||||||||||||||||||||
GenBank Gene ID | X63564 | |||||||||||||||||||||||||||||||||||||||||
GeneCard ID | POLR2A | |||||||||||||||||||||||||||||||||||||||||
GenAtlas ID | POLR2A | |||||||||||||||||||||||||||||||||||||||||
HGNC ID | HGNC:9187 | |||||||||||||||||||||||||||||||||||||||||
References | ||||||||||||||||||||||||||||||||||||||||||
General References | Not Available |