Identification
HMDB Protein ID CDBP00698
Secondary Accession Numbers Not Available
Name DNA-directed RNA polymerase II subunit RPB1
Description Not Available
Synonyms
  1. DNA-directed RNA polymerase II subunit A
  2. DNA-directed RNA polymerase III largest subunit
  3. RNA polymerase II subunit B1
  4. RNA-directed RNA polymerase II subunit RPB1
Gene Name POLR2A
Protein Type Enzyme
Biological Properties
General Function Involved in DNA binding
Specific Function DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Largest and catalytic component of RNA polymerase II which synthesizes mRNA precursors and many functional non-coding RNAs. Forms the polymerase active center together with the second largest subunit. Pol II is the central component of the basal RNA polymerase II transcription machinery. It is composed of mobile elements that move relative to each other. RPB1 is part of the core element with the central large cleft, the clamp element that moves to open and close the cleft and the jaws that are thought to grab the incoming DNA template. At the start of transcription, a single stranded DNA template strand of the promoter is positioned within the central active site cleft of Pol II. A bridging helix emanates from RPB1 and crosses the cleft near the catalytic site and is thought to promote translocation of Pol II by acting as a ratchet that moves the RNA-DNA hybrid through the active site by switching from straight to bent conformations at each step of nucleotide addition. During transcription elongation, Pol II moves on the template as the transcript elongates. Elongation is influenced by the phosphorylation status of the C-terminal domain (CTD) of Pol II largest subunit (RPB1), which serves as a platform for assembly of factors that regulate transcription initiation, elongation, termination and mRNA processing. Acts as a RNA-dependent RNA polymerase when associated with small delta antigen of Hepatitis delta virus, acting both as a replicate and transcriptase for the viral RNA circular genome.
GO Classification
Biological Process
transcription-coupled nucleotide-excision repair
7-methylguanosine mRNA capping
mRNA splicing, via spliceosome
viral reproduction
protein phosphorylation
regulation of transcription, DNA-dependent
positive regulation of viral transcription
transcription elongation from RNA polymerase II promoter
transcription initiation from RNA polymerase II promoter
Cellular Component
DNA-directed RNA polymerase II, core complex
Component
organelle part
intracellular organelle part
nuclear part
nucleoplasm part
dna-directed rna polymerase ii, core complex
Function
binding
catalytic activity
transferase activity
transferase activity, transferring phosphorus-containing groups
nucleotidyltransferase activity
nucleic acid binding
dna binding
rna polymerase activity
dna-directed rna polymerase activity
Molecular Function
metal ion binding
DNA-directed RNA polymerase activity
DNA binding
RNA-directed RNA polymerase activity
Process
macromolecule biosynthetic process
cellular macromolecule biosynthetic process
metabolic process
biosynthetic process
transcription
transcription, dna-dependent
transcription from rna polymerase ii promoter
Cellular Location
  1. Nucleus
Pathways
Gene Properties
Chromosome Location 17
Locus 17p13.1
SNPs POLR2A
Gene Sequence
>5913 bp
ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA
GTCCAGTTCGGAGTCCTGAGTCCGGATGAACTGAAGCGAATGTCTGTGACGGAGGGTGGC
ATCAAATACCCAGAGACGACTGAGGGAGGCCGCCCCAAGCTTGGGGGGCTGATGGACCCG
AGGCAGGGGGTGATTGAGCGGACTGGCCGCTGCCAAACATGTGCAGGAAACATGACAGAG
TGTCCTGGCCACTTTGGCCACATTGAACTGGCCAAGCCTGTGTTTCACGTGGGCTTCCTG
GTGAAGACAATGAAAGTTTTGCGCTGTGTCTGCTTCTTCTGCTCCAAACTGCTTGTGGAC
TCTAACAACCCAAAGATCAAGGATATCCTGGCTAAGTCCAAGGGACAGCCCAAGAAGCGG
CTCACACATGTCTACGACCTTTGCAAGGGCAAAAACATATGCGAGGGTGGGGAGGAGATG
GACAACAAGTTCGGTGTGGAACAACCTGAGGGTGACGAGGATCTGACCAAAGAAAAGGGC
CATGGTGGCTGTGGGCGGTACCAGCCCAGGATCCGGCGTTCTGGCCTAGAGCTGTATGCG
GAATGGAAGCACGTTAATGAGGACTCTCAGGAGAAGAAGATCCTGCTGAGTCCAGAGCGA
GTGCATGAGATCTTCAAACGCATCTCAGATGAGGAGTGTTTTGTGCTGGGCATGGAGCCC
CGCTATGCACGGCCAGAGTGGATGATTGTCACAGTGCTGCCTGTGCCCCCGCTCTCCGTG
CGGCCTGCTGTTGTGATGCAGGGCTCTGCCCGTAACCAGGATGACCTGACTCACAAACTG
GCTGACATCGTGAAGATCAACAATCAGCTGCGGCGCAATGAGCAGAACGGCGCAGCGGCC
CATGTCATTGCAGAGGATGTGAAGCTCCTCCAGTTCCATGTGGCCACCATGGTGGACAAT
GAGCTGCCTGGCTTGCCCCGTGCCATGCAGAAGTCTGGGCGTCCCCTCAAGTCCCTGAAG
CAGCGGTTGAAGGGCAAGGAAGGCCGGGTGCGAGGGAACCTGATGGGCAAAAGAGTGGAC
TTCTCGGCCCGTACTGTCATCACCCCCGACCCCAACCTCTCCATTGACCAGGTTGGCGTG
CCCCGCTCCATTGCTGCCAACATGACCTTTGCGGAGATTGTCACCCCCTTCAACATTGAC
AGACTTCAAGAACTAGTGCGCAGGGGGAACAGTCAGTACCCAGGCGCCAAGTACATCATC
CGAGACAATGGTGATCGCATTGACTTGCGTTTCCACCCCAAGCCCAGTGACCTTCACCTG
CAGACCGGCTATAAGGTGGAACGGCACATGTGTGATGGGGACATTGTTATCTTCAACCGG
CAGCCAACTCTGCACAAAATGTCCATGATGGGGCATCGGGTCCGCATTCTCCCATGGTCT
ACCTTTCGCTTGAATCTTAGCGTGACAACTCCGTACAATGCAGACTTTGACGGGGATGAG
ATGAACTTGCACCTGCCACAGTCTCTGGAGACGCGAGCAGAGATCCAGGAGCTGGCCATG
GTTCCTCGCATGATTGTCACCCCCCAGAGCAATCGGCCTGTCATGGGTATTGTGCAGGAC
ACACTCACAGCAGTGCGCAAATTCACCAAGAGAGACGTCTTCCTGGAGCGGGGTGAAGTG
ATGAACCTCCTGATGTTCCTGTCGACGTGGGATGGGAAGGTCCCACAGCCGGCCATCCTA
AAGCCCCGGCCCCTGTGGACAGGCAAGCAAATCTTCTCCCTCATCATACCTGGTCACATC
AATTGTATCCGTACCCACAGCACCCATCCCGATGATGAAGACAGTGGCCCTTACAAGCAC
ATCTCTCCTGGGGACACCAAGGTGGTGGTGGAGAATGGGGAGCTGATCATGGGCATCCTG
TGTAAGAAGTCTCTGGGCACGTCAGCTGGCTCCCTGGTCCACATCTCCTACCTAGAGATG
GGTCATGACATCACTCGCCTCTTCTACTCCAACATTCAGACTGTCATTAACAACTGGCTC
CTCATCGAGGGTCATACTATTGGCATTGGGGACTCCATTGCTGATTCTAAGACTTACCAG
GACATTCAGAACACTATTAAGAAGGCCAAGCAGGACGTAATAGAGGTCATCGAGAAGGCA
CACAACAATGAGCTGGAGCCCACCCCAGGGAACACTCTGCGGCAGACGTTTGAGAATCAG
GTGAACCGCATTCTTAACGATGCCCGAGACAAGACTGGCTCCTCTGCTCAGAAATCCCTG
TCTGAATACAACAACTTCAAGTCTATGGTCGTGTCCGGAGCTAAAGGTTCCAAGATTAAC
ATCTCCCAGGTCATTGCTGTCGTTGGACAGCAGAACGTCGAGGGCAAGCGGATTCCATTT
GGCTTCAAGCACCGGACTCTGCCTCACTTCATCAAGGATGACTACGGGCCTGAGAGCCGT
GGCTTTGTGGAGAACTCCTACCTAGCCGGCCTCACACCCACTGAGTTCTTTTTCCACGCC
ATGGGGGGTCGTGAGGGGCTCATTGACACGGCTGTCAAGACTGCTGAGACTGGATACATC
CAGCGGCGGCTGATCAAGTCCATGGAGTCAGTGATGGTGAAGTACGACGCGACTGTGCGG
AACTCCATCAACCAGGTGGTGCAGCTGCGCTACGGCGAAGACGGCCTGGCAGGCGAGAGC
GTTGAGTTCCAGAACCTGGCTACGCTTAAGCCTTCCAACAAGGCTTTTGAGAAGAAGTTC
CGCTTTGATTATACCAATGAGAGGGCCCTGCGGCGCACTCTGCAGGAGGACCTGGTGAAG
GACGTGCTGAGCAACGCACACATCCAGAACGAGTTGGAGCGGGAATTTGAGCGGATGCGG
GAGGATCGGGAGGTGCTCAGGGTCATCTTCCCAACTGGAGACAGCAAGGTCGTCCTCCCC
TGTAACCTGCTGCGGATGATCTGGAATGCTCAGAAAATCTTCCACATCAACCCACGCCTT
CCCTCCGACCTGCACCCCATCAAAGTGGTGGAGGGAGTCAAGGAATTGAGCAAGAAGCTG
GTGATTGTGAATGGGGATGACCCACTAAGTCGACAGGCCCAGGAAAATGCCACGCTGCTC
TTCAACATCCACCTGCGGTCCACGTTGTGTTCCCGCCGCATGGCAGAGGAGTTTCGGCTC
AGTGGGGAGGCCTTCGACTGGCTGCTTGGGGAGATTGAGTCCAAGTTCAACCAAGCCATT
GCGCATCCCGGGGAAATGGTGGGGGCTCTGGCTGCGCAGTCCCTTGGAGAACCTGCCACC
CAGATGACCTTGAATACCTTCCACTATGCTGGTGTGTCTGCCAAGAATGTGACGCTGGGT
GTGCCCCGACTTAAGGAGCTCATCAACATTTCCAAGAAGCCAAAGACTCCTTCGCTTACT
GTCTTCCTGTTGGGCCAGTCCGCTCGAGATGCTGAGAGAGCCAAGGATATTCTGTGCCGT
CTGGAGCATACAACGTTGAGGAAGGTGACTGCCAACACAGCCATCTACTATGACCCCAAC
CCCCAGAGCACGGTGGTGGCAGAGGATCAGGAATGGGTGAATGTCTACTATGAAATGCCT
GACTTTGATGTGGCCCGAATCTCCCCCTGGCTGTTGCGGGTGGAGCTGGATCGGAAGCAC
ATGACTGACCGGAAGCTCACCATGGAGCAGATTGCTGAAAAGATCAATGCTGGTTTTGGT
GACGACTTGAACTGCATCTTTAATGATGACAATGCAGAGAAGCTGGTGCTCCGTATTCGC
ATCATGAACAGCGATGAGAACAAGATGCAAGAGGAGGAAGAGGTGGTGGACAAGATGGAT
GATGATGTCTTCCTGCGCTGCATCGAGTCCAACATGCTGACAGATATGACCCTGCAGGGC
ATCGAGCAGATCAGCAAGGTGTACATGCACTTGCCACAGACAGACAACAAGAAGAAGATC
ATCATCACGGAGGATGGGGAATTCAAGGCCCTGCAGGAGTGGATCCTGGAGACGGACGGC
GTGAGCTTGATGCGGGTGCTGAGTGAGAAGGACGTGGACCCCGTACGCACCACGTCCAAT
GACATTGTGGAGATCTTCACGGTGCTGGGCATTGAAGCCGTGCGGAAGGCCCTGGAGCGG
GAGCTGTACCACGTCATCTCCTTTGATGGCTCCTATGTCAATTACCGACACTTGGCTCTC
TTGTGTGATACCATGACCTGTCGTGGCCACTTGATGGCCATCACCCGACACGGAGTCAAC
CGCCAGGACACAGGACCACTCATGAAGTGTTCCTTTGAGGAAACGGTGGACGTGCTTATG
GAAGCAGCCGCACACGGTGAGAGTGACCCCATGAAGGGGGTCTCTGAGAATATCATGCTG
GGCCAGCTGGCTCCGGCCGGCACTGGCTGCTTTGACCTCCTGCTTGATGCAGAGAAGTGC
AAGTATGGCATGGAGATCCCCACCAATATCCCCGGCCTGGGGGCTGCTGGACCCACCGGC
ATGTTCTTTGGTTCAGCACCCAGTCCCATGGGTGGAATCTCTCCTGCCATGACACCTTGG
AACCAGGGTGCAACCCCTGCCTATGGCGCCTGGTCCCCCAGTGTTGGGAGTGGAATGACC
CCAGGGGCAGCCGGTTTCTCTCCCAGTGCTGCGTCAGATGCCAGCGGCTTCAGCCCAGGT
TACTCCCCTGCCTGGTCTCCCACACCGGGCTCCCCGGGGTCCCCAGGTCCCTCAAGCCCC
TACATCCCTTCACCAGGTGGCGCCATGTCTCCCAGCTACTCGCCAACGTCACCTGCCTAC
GAGCCCCGCTCTCCTGGGGGCTACACACCCCAGAGTCCCTCTTATTCCCCCACTTCACCC
TCCTACTCCCCTACCTCTCCATCCTATTCTCCAACCAGTCCCAACTATAGTCCCACATCA
CCCAGCTATTCGCCAACGTCACCCAGCTACTCACCGACCTCTCCCAGCTACTCACCCACC
TCTCCCAGCTACTCGCCCACCTCTCCCAGCTATTCGCCCACCTCTCCCAGCTACTCACCC
ACTTCCCCTAGCTATTCGCCCACTTCCCCTAGCTACTCGCCAACGTCTCCCAGCTACTCG
CCGACATCTCCCAGCTACTCGCCAACTTCACCCAGCTATTCTCCCACTTCTCCCAGCTAC
TCACCTACCTCTCCAAGCTATTCACCCACCTCCCCCAGCTACTCACCCACTTCCCCAAGT
TACTCACCCACCAGCCCGAACTATTCTCCAACCAGTCCCAATTACACCCCAACATCACCC
AGCTACAGCCCGACATCACCCAGCTATTCCCCTACTAGTCCCAACTACACACCTACCAGC
CCTAACTACAGCCCAACCTCTCCAAGCTACTCTCCAACATCACCCAGCTATTCCCCGACC
TCACCAAGTTACTCCCCTTCCAGCCCACGATACACACCACAGTCTCCAACCTATACCCCA
AGCTCACCCAGCTACAGCCCCAGTTCGCCCAGCTACAGCCCAACCTCACCCAAGTACACC
CCAACCAGTCCTTCTTATAGTCCCAGCTCCCCAGAGTATACCCCAACCTCTCCCAAGTAC
TCACCTACCAGTCCCAAATATTCACCCACCTCTCCCAAGTACTCGCCTACCAGTCCCACC
TATTCACCCACCACCCCAAAATACTCCCCAACATCTCCTACTTATTCCCCAACCTCTCCA
GTCTACACCCCAACCTCTCCCAAGTACTCACCTACTAGCCCCACTTACTCGCCCACTTCC
CCCAAGTACTCGCCCACCAGCCCCACCTACTCGCCCACCTCCCCCAAAGGCTCAACCTAC
TCTCCCACTTCCCCTGGTTACTCGCCCACCAGCCCCACCTACAGTCTCACAAGCCCGGCT
ATCAGCCCGGATGACAGTGACGAGGAGAACTGA
Protein Properties
Number of Residues 1970
Molecular Weight 217204.265
Theoretical pI 7.365
Pfam Domain Function
Signals Not Available
Transmembrane Regions Not Available
Protein Sequence
>DNA-directed RNA polymerase II subunit RPB1
MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDP
RQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVD
SNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKG
HGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEP
RYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAA
HVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVD
FSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYII
RDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWS
TFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQD
TLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHI
NCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEM
GHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKA
HNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKIN
ISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHA
MGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGES
VEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMR
EDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKL
VIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAI
AHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLT
VFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMP
DFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIR
IMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI
IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALER
ELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLM
EAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTG
MFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPG
YSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSP
SYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSP
TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS
YSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPT
SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPASPKYTPTSPSYSPSSPEYTPTSPKY
SPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTS
PKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN
GenBank ID Protein 36124
UniProtKB/Swiss-Prot ID P24928
UniProtKB/Swiss-Prot Entry Name RPB1_HUMAN
PDB IDs
GenBank Gene ID X63564
GeneCard ID POLR2A
GenAtlas ID POLR2A
HGNC ID HGNC:9187
References
General References Not Available