Identification |
HMDB Protein ID
| CDBP00698 |
Secondary Accession Numbers
| Not Available |
Name
| DNA-directed RNA polymerase II subunit RPB1 |
Description
| Not Available |
Synonyms
|
- DNA-directed RNA polymerase II subunit A
- DNA-directed RNA polymerase III largest subunit
- RNA polymerase II subunit B1
- RNA-directed RNA polymerase II subunit RPB1
|
Gene Name
| POLR2A |
Protein Type
| Enzyme |
Biological Properties |
General Function
| Involved in DNA binding |
Specific Function
| DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Largest and catalytic component of RNA polymerase II, which synthesizes mRNA precursors and many functional non-coding RNAs. Forms the polymerase active center together with the second largest subunit. Pol II is the central component of the basal RNA polymerase II transcription machinery. It is composed of mobile elements that move relative to each other. RPB1 is part of the core element with the central large cleft, the clamp element that moves to open and close the cleft, and the jaws that are thought to grab the incoming DNA template. At the start of transcription, a single-stranded DNA template strand of the promoter is positioned within the central active site cleft of Pol II. A bridging helix emanates from RPB1 and crosses the cleft near the catalytic site; it is thought to promote translocation of Pol II by acting as a ratchet that moves the RNA-DNA hybrid through the active site, switching from straight to bent conformations at each step of nucleotide addition. During transcription elongation, Pol II moves on the template as the transcript elongates. Elongation is influenced by the phosphorylation status of the C-terminal domain (CTD) of the Pol II largest subunit (RPB1), which serves as a platform for assembly of factors that regulate transcription initiation, elongation, termination and mRNA processing. Acts as an RNA-dependent RNA polymerase when associated with the small delta antigen of Hepatitis delta virus, acting as both a replicase and a transcriptase for the circular viral RNA genome.
|
GO Classification
|
Biological Process |
transcription-coupled nucleotide-excision repair |
7-methylguanosine mRNA capping |
mRNA splicing, via spliceosome |
viral reproduction |
protein phosphorylation |
regulation of transcription, DNA-dependent |
positive regulation of viral transcription |
transcription elongation from RNA polymerase II promoter |
transcription initiation from RNA polymerase II promoter |
Cellular Component |
DNA-directed RNA polymerase II, core complex |
Component |
organelle part |
intracellular organelle part |
nuclear part |
nucleoplasm part |
dna-directed rna polymerase ii, core complex |
Function |
binding |
catalytic activity |
transferase activity |
transferase activity, transferring phosphorus-containing groups |
nucleotidyltransferase activity |
nucleic acid binding |
dna binding |
rna polymerase activity |
dna-directed rna polymerase activity |
Molecular Function |
metal ion binding |
DNA-directed RNA polymerase activity |
DNA binding |
RNA-directed RNA polymerase activity |
Process |
macromolecule biosynthetic process |
cellular macromolecule biosynthetic process |
metabolic process |
biosynthetic process |
transcription |
transcription, dna-dependent |
transcription from rna polymerase ii promoter |
|
Cellular Location
|
- Nucleus
|
Pathways
|
Name | SMPDB/Pathwhiz | KEGG |
RNA polymerase | Not Available | |
Huntington's disease | Not Available | |
Herpes simplex infection | Not Available | |
Epstein-Barr virus infection | Not Available | |
|
Gene Properties |
Chromosome Location
| 17 |
Locus
| 17p13.1 |
SNPs
| POLR2A |
Gene Sequence
|
>5913 bp
ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA
GTCCAGTTCGGAGTCCTGAGTCCGGATGAACTGAAGCGAATGTCTGTGACGGAGGGTGGC
ATCAAATACCCAGAGACGACTGAGGGAGGCCGCCCCAAGCTTGGGGGGCTGATGGACCCG
AGGCAGGGGGTGATTGAGCGGACTGGCCGCTGCCAAACATGTGCAGGAAACATGACAGAG
TGTCCTGGCCACTTTGGCCACATTGAACTGGCCAAGCCTGTGTTTCACGTGGGCTTCCTG
GTGAAGACAATGAAAGTTTTGCGCTGTGTCTGCTTCTTCTGCTCCAAACTGCTTGTGGAC
TCTAACAACCCAAAGATCAAGGATATCCTGGCTAAGTCCAAGGGACAGCCCAAGAAGCGG
CTCACACATGTCTACGACCTTTGCAAGGGCAAAAACATATGCGAGGGTGGGGAGGAGATG
GACAACAAGTTCGGTGTGGAACAACCTGAGGGTGACGAGGATCTGACCAAAGAAAAGGGC
CATGGTGGCTGTGGGCGGTACCAGCCCAGGATCCGGCGTTCTGGCCTAGAGCTGTATGCG
GAATGGAAGCACGTTAATGAGGACTCTCAGGAGAAGAAGATCCTGCTGAGTCCAGAGCGA
GTGCATGAGATCTTCAAACGCATCTCAGATGAGGAGTGTTTTGTGCTGGGCATGGAGCCC
CGCTATGCACGGCCAGAGTGGATGATTGTCACAGTGCTGCCTGTGCCCCCGCTCTCCGTG
CGGCCTGCTGTTGTGATGCAGGGCTCTGCCCGTAACCAGGATGACCTGACTCACAAACTG
GCTGACATCGTGAAGATCAACAATCAGCTGCGGCGCAATGAGCAGAACGGCGCAGCGGCC
CATGTCATTGCAGAGGATGTGAAGCTCCTCCAGTTCCATGTGGCCACCATGGTGGACAAT
GAGCTGCCTGGCTTGCCCCGTGCCATGCAGAAGTCTGGGCGTCCCCTCAAGTCCCTGAAG
CAGCGGTTGAAGGGCAAGGAAGGCCGGGTGCGAGGGAACCTGATGGGCAAAAGAGTGGAC
TTCTCGGCCCGTACTGTCATCACCCCCGACCCCAACCTCTCCATTGACCAGGTTGGCGTG
CCCCGCTCCATTGCTGCCAACATGACCTTTGCGGAGATTGTCACCCCCTTCAACATTGAC
AGACTTCAAGAACTAGTGCGCAGGGGGAACAGTCAGTACCCAGGCGCCAAGTACATCATC
CGAGACAATGGTGATCGCATTGACTTGCGTTTCCACCCCAAGCCCAGTGACCTTCACCTG
CAGACCGGCTATAAGGTGGAACGGCACATGTGTGATGGGGACATTGTTATCTTCAACCGG
CAGCCAACTCTGCACAAAATGTCCATGATGGGGCATCGGGTCCGCATTCTCCCATGGTCT
ACCTTTCGCTTGAATCTTAGCGTGACAACTCCGTACAATGCAGACTTTGACGGGGATGAG
ATGAACTTGCACCTGCCACAGTCTCTGGAGACGCGAGCAGAGATCCAGGAGCTGGCCATG
GTTCCTCGCATGATTGTCACCCCCCAGAGCAATCGGCCTGTCATGGGTATTGTGCAGGAC
ACACTCACAGCAGTGCGCAAATTCACCAAGAGAGACGTCTTCCTGGAGCGGGGTGAAGTG
ATGAACCTCCTGATGTTCCTGTCGACGTGGGATGGGAAGGTCCCACAGCCGGCCATCCTA
AAGCCCCGGCCCCTGTGGACAGGCAAGCAAATCTTCTCCCTCATCATACCTGGTCACATC
AATTGTATCCGTACCCACAGCACCCATCCCGATGATGAAGACAGTGGCCCTTACAAGCAC
ATCTCTCCTGGGGACACCAAGGTGGTGGTGGAGAATGGGGAGCTGATCATGGGCATCCTG
TGTAAGAAGTCTCTGGGCACGTCAGCTGGCTCCCTGGTCCACATCTCCTACCTAGAGATG
GGTCATGACATCACTCGCCTCTTCTACTCCAACATTCAGACTGTCATTAACAACTGGCTC
CTCATCGAGGGTCATACTATTGGCATTGGGGACTCCATTGCTGATTCTAAGACTTACCAG
GACATTCAGAACACTATTAAGAAGGCCAAGCAGGACGTAATAGAGGTCATCGAGAAGGCA
CACAACAATGAGCTGGAGCCCACCCCAGGGAACACTCTGCGGCAGACGTTTGAGAATCAG
GTGAACCGCATTCTTAACGATGCCCGAGACAAGACTGGCTCCTCTGCTCAGAAATCCCTG
TCTGAATACAACAACTTCAAGTCTATGGTCGTGTCCGGAGCTAAAGGTTCCAAGATTAAC
ATCTCCCAGGTCATTGCTGTCGTTGGACAGCAGAACGTCGAGGGCAAGCGGATTCCATTT
GGCTTCAAGCACCGGACTCTGCCTCACTTCATCAAGGATGACTACGGGCCTGAGAGCCGT
GGCTTTGTGGAGAACTCCTACCTAGCCGGCCTCACACCCACTGAGTTCTTTTTCCACGCC
ATGGGGGGTCGTGAGGGGCTCATTGACACGGCTGTCAAGACTGCTGAGACTGGATACATC
CAGCGGCGGCTGATCAAGTCCATGGAGTCAGTGATGGTGAAGTACGACGCGACTGTGCGG
AACTCCATCAACCAGGTGGTGCAGCTGCGCTACGGCGAAGACGGCCTGGCAGGCGAGAGC
GTTGAGTTCCAGAACCTGGCTACGCTTAAGCCTTCCAACAAGGCTTTTGAGAAGAAGTTC
CGCTTTGATTATACCAATGAGAGGGCCCTGCGGCGCACTCTGCAGGAGGACCTGGTGAAG
GACGTGCTGAGCAACGCACACATCCAGAACGAGTTGGAGCGGGAATTTGAGCGGATGCGG
GAGGATCGGGAGGTGCTCAGGGTCATCTTCCCAACTGGAGACAGCAAGGTCGTCCTCCCC
TGTAACCTGCTGCGGATGATCTGGAATGCTCAGAAAATCTTCCACATCAACCCACGCCTT
CCCTCCGACCTGCACCCCATCAAAGTGGTGGAGGGAGTCAAGGAATTGAGCAAGAAGCTG
GTGATTGTGAATGGGGATGACCCACTAAGTCGACAGGCCCAGGAAAATGCCACGCTGCTC
TTCAACATCCACCTGCGGTCCACGTTGTGTTCCCGCCGCATGGCAGAGGAGTTTCGGCTC
AGTGGGGAGGCCTTCGACTGGCTGCTTGGGGAGATTGAGTCCAAGTTCAACCAAGCCATT
GCGCATCCCGGGGAAATGGTGGGGGCTCTGGCTGCGCAGTCCCTTGGAGAACCTGCCACC
CAGATGACCTTGAATACCTTCCACTATGCTGGTGTGTCTGCCAAGAATGTGACGCTGGGT
GTGCCCCGACTTAAGGAGCTCATCAACATTTCCAAGAAGCCAAAGACTCCTTCGCTTACT
GTCTTCCTGTTGGGCCAGTCCGCTCGAGATGCTGAGAGAGCCAAGGATATTCTGTGCCGT
CTGGAGCATACAACGTTGAGGAAGGTGACTGCCAACACAGCCATCTACTATGACCCCAAC
CCCCAGAGCACGGTGGTGGCAGAGGATCAGGAATGGGTGAATGTCTACTATGAAATGCCT
GACTTTGATGTGGCCCGAATCTCCCCCTGGCTGTTGCGGGTGGAGCTGGATCGGAAGCAC
ATGACTGACCGGAAGCTCACCATGGAGCAGATTGCTGAAAAGATCAATGCTGGTTTTGGT
GACGACTTGAACTGCATCTTTAATGATGACAATGCAGAGAAGCTGGTGCTCCGTATTCGC
ATCATGAACAGCGATGAGAACAAGATGCAAGAGGAGGAAGAGGTGGTGGACAAGATGGAT
GATGATGTCTTCCTGCGCTGCATCGAGTCCAACATGCTGACAGATATGACCCTGCAGGGC
ATCGAGCAGATCAGCAAGGTGTACATGCACTTGCCACAGACAGACAACAAGAAGAAGATC
ATCATCACGGAGGATGGGGAATTCAAGGCCCTGCAGGAGTGGATCCTGGAGACGGACGGC
GTGAGCTTGATGCGGGTGCTGAGTGAGAAGGACGTGGACCCCGTACGCACCACGTCCAAT
GACATTGTGGAGATCTTCACGGTGCTGGGCATTGAAGCCGTGCGGAAGGCCCTGGAGCGG
GAGCTGTACCACGTCATCTCCTTTGATGGCTCCTATGTCAATTACCGACACTTGGCTCTC
TTGTGTGATACCATGACCTGTCGTGGCCACTTGATGGCCATCACCCGACACGGAGTCAAC
CGCCAGGACACAGGACCACTCATGAAGTGTTCCTTTGAGGAAACGGTGGACGTGCTTATG
GAAGCAGCCGCACACGGTGAGAGTGACCCCATGAAGGGGGTCTCTGAGAATATCATGCTG
GGCCAGCTGGCTCCGGCCGGCACTGGCTGCTTTGACCTCCTGCTTGATGCAGAGAAGTGC
AAGTATGGCATGGAGATCCCCACCAATATCCCCGGCCTGGGGGCTGCTGGACCCACCGGC
ATGTTCTTTGGTTCAGCACCCAGTCCCATGGGTGGAATCTCTCCTGCCATGACACCTTGG
AACCAGGGTGCAACCCCTGCCTATGGCGCCTGGTCCCCCAGTGTTGGGAGTGGAATGACC
CCAGGGGCAGCCGGTTTCTCTCCCAGTGCTGCGTCAGATGCCAGCGGCTTCAGCCCAGGT
TACTCCCCTGCCTGGTCTCCCACACCGGGCTCCCCGGGGTCCCCAGGTCCCTCAAGCCCC
TACATCCCTTCACCAGGTGGCGCCATGTCTCCCAGCTACTCGCCAACGTCACCTGCCTAC
GAGCCCCGCTCTCCTGGGGGCTACACACCCCAGAGTCCCTCTTATTCCCCCACTTCACCC
TCCTACTCCCCTACCTCTCCATCCTATTCTCCAACCAGTCCCAACTATAGTCCCACATCA
CCCAGCTATTCGCCAACGTCACCCAGCTACTCACCGACCTCTCCCAGCTACTCACCCACC
TCTCCCAGCTACTCGCCCACCTCTCCCAGCTATTCGCCCACCTCTCCCAGCTACTCACCC
ACTTCCCCTAGCTATTCGCCCACTTCCCCTAGCTACTCGCCAACGTCTCCCAGCTACTCG
CCGACATCTCCCAGCTACTCGCCAACTTCACCCAGCTATTCTCCCACTTCTCCCAGCTAC
TCACCTACCTCTCCAAGCTATTCACCCACCTCCCCCAGCTACTCACCCACTTCCCCAAGT
TACTCACCCACCAGCCCGAACTATTCTCCAACCAGTCCCAATTACACCCCAACATCACCC
AGCTACAGCCCGACATCACCCAGCTATTCCCCTACTAGTCCCAACTACACACCTACCAGC
CCTAACTACAGCCCAACCTCTCCAAGCTACTCTCCAACATCACCCAGCTATTCCCCGACC
TCACCAAGTTACTCCCCTTCCAGCCCACGATACACACCACAGTCTCCAACCTATACCCCA
AGCTCACCCAGCTACAGCCCCAGTTCGCCCAGCTACAGCCCAACCTCACCCAAGTACACC
CCAACCAGTCCTTCTTATAGTCCCAGCTCCCCAGAGTATACCCCAACCTCTCCCAAGTAC
TCACCTACCAGTCCCAAATATTCACCCACCTCTCCCAAGTACTCGCCTACCAGTCCCACC
TATTCACCCACCACCCCAAAATACTCCCCAACATCTCCTACTTATTCCCCAACCTCTCCA
GTCTACACCCCAACCTCTCCCAAGTACTCACCTACTAGCCCCACTTACTCGCCCACTTCC
CCCAAGTACTCGCCCACCAGCCCCACCTACTCGCCCACCTCCCCCAAAGGCTCAACCTAC
TCTCCCACTTCCCCTGGTTACTCGCCCACCAGCCCCACCTACAGTCTCACAAGCCCGGCT
ATCAGCCCGGATGACAGTGACGAGGAGAACTGA
|
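As a consistency check, a 5913 bp coding sequence corresponds to 1970 codons plus one stop codon (3 × 1971 = 5913), which agrees with the residue count listed under Protein Properties. The Python sketch below translates a nucleotide string using the standard genetic code. The function name `translate` and the way the codon table is built are illustrative choices, not part of this record, and only the first and last lines of the sequence above are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (illustrative construction).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the gene sequence listed above.
first_line = "ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA"
last_line = "ATCAGCCCGGATGACAGTGACGAGGAGAACTGA"

print(translate(first_line))  # MHGGGPPSGDSACPLRTIKR
print(translate(last_line))   # ISPDDSDEEN*
print(3 * (1970 + 1))         # 5913
```

The two translated fragments match the first 20 and last 10 residues of the protein sequence in this record, with the trailing `*` corresponding to the TGA stop codon.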
Protein Properties |
Number of Residues
| 1970 |
Molecular Weight (Da)
| 217204.265 |
Theoretical pI
| 7.365 |
Pfam Domain Function
|
|
Signals
|
Not Available
|
Transmembrane Regions
|
Not Available
|
Protein Sequence
|
>DNA-directed RNA polymerase II subunit RPB1
MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDP
RQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVD
SNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKG
HGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEP
RYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAA
HVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVD
FSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYII
RDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWS
TFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQD
TLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHI
NCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEM
GHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKA
HNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKIN
ISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHA
MGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGES
VEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMR
EDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKL
VIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAI
AHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLT
VFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMP
DFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIR
IMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI
IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALER
ELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLM
EAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTG
MFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPG
YSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSP
SYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSP
TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS
YSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPT
SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPASPKYTPTSPSYSPSSPEYTPTSPKY
SPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTS
PKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN
|
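The C-terminal domain (CTD) described under Specific Function is visible at the end of the sequence above as a run of seven-residue repeats. The Python sketch below counts exact matches to the commonly cited consensus heptad YSPTSPS. The consensus is not stated in this record and is brought in here as an assumption for illustration, as is the helper name `count_heptads`. Degenerate repeats such as YSPTSPK are deliberately not counted.

```python
def count_heptads(seq: str, motif: str = "YSPTSPS") -> int:
    """Count non-overlapping exact occurrences of the heptad motif."""
    return seq.upper().count(motif)


# One 60-residue line taken from the CTD region of the sequence above.
ctd_line = "TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS"
print(count_heptads(ctd_line))
```

To count repeats across the whole protein, join the sequence lines into a single string first, so that heptads split across a line break are not missed.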
External Links |
GenBank ID Protein
| 36124 |
UniProtKB/Swiss-Prot ID
| P24928 |
UniProtKB/Swiss-Prot Entry Name
| RPB1_HUMAN |
PDB IDs
|
|
GenBank Gene ID
| X63564 |
GeneCard ID
| POLR2A |
GenAtlas ID
| POLR2A |
HGNC ID
| HGNC:9187 |
References |
General References
| Not Available |