DNA-directed RNA polymerase II subunit RPB1
Name | DNA-directed RNA polymerase II subunit RPB1 |
---|---|
Synonyms |
|
Gene Name | POLR2A |
Organism | Human |
Amino acid sequence | >lcl|BSEQ0037255|DNA-directed RNA polymerase II subunit RPB1 MHGGGPPSGDSACPLRTIKRVQFGVLSPDELKRMSVTEGGIKYPETTEGGRPKLGGLMDP RQGVIERTGRCQTCAGNMTECPGHFGHIELAKPVFHVGFLVKTMKVLRCVCFFCSKLLVD SNNPKIKDILAKSKGQPKKRLTHVYDLCKGKNICEGGEEMDNKFGVEQPEGDEDLTKEKG HGGCGRYQPRIRRSGLELYAEWKHVNEDSQEKKILLSPERVHEIFKRISDEECFVLGMEP RYARPEWMIVTVLPVPPLSVRPAVVMQGSARNQDDLTHKLADIVKINNQLRRNEQNGAAA HVIAEDVKLLQFHVATMVDNELPGLPRAMQKSGRPLKSLKQRLKGKEGRVRGNLMGKRVD FSARTVITPDPNLSIDQVGVPRSIAANMTFAEIVTPFNIDRLQELVRRGNSQYPGAKYII RDNGDRIDLRFHPKPSDLHLQTGYKVERHMCDGDIVIFNRQPTLHKMSMMGHRVRILPWS TFRLNLSVTTPYNADFDGDEMNLHLPQSLETRAEIQELAMVPRMIVTPQSNRPVMGIVQD TLTAVRKFTKRDVFLERGEVMNLLMFLSTWDGKVPQPAILKPRPLWTGKQIFSLIIPGHI NCIRTHSTHPDDEDSGPYKHISPGDTKVVVENGELIMGILCKKSLGTSAGSLVHISYLEM GHDITRLFYSNIQTVINNWLLIEGHTIGIGDSIADSKTYQDIQNTIKKAKQDVIEVIEKA HNNELEPTPGNTLRQTFENQVNRILNDARDKTGSSAQKSLSEYNNFKSMVVSGAKGSKIN ISQVIAVVGQQNVEGKRIPFGFKHRTLPHFIKDDYGPESRGFVENSYLAGLTPTEFFFHA MGGREGLIDTAVKTAETGYIQRRLIKSMESVMVKYDATVRNSINQVVQLRYGEDGLAGES VEFQNLATLKPSNKAFEKKFRFDYTNERALRRTLQEDLVKDVLSNAHIQNELEREFERMR EDREVLRVIFPTGDSKVVLPCNLLRMIWNAQKIFHINPRLPSDLHPIKVVEGVKELSKKL VIVNGDDPLSRQAQENATLLFNIHLRSTLCSRRMAEEFRLSGEAFDWLLGEIESKFNQAI AHPGEMVGALAAQSLGEPATQMTLNTFHYAGVSAKNVTLGVPRLKELINISKKPKTPSLT VFLLGQSARDAERAKDILCRLEHTTLRKVTANTAIYYDPNPQSTVVAEDQEWVNVYYEMP DFDVARISPWLLRVELDRKHMTDRKLTMEQIAEKINAGFGDDLNCIFNDDNAEKLVLRIR IMNSDENKMQEEEEVVDKMDDDVFLRCIESNMLTDMTLQGIEQISKVYMHLPQTDNKKKI IITEDGEFKALQEWILETDGVSLMRVLSEKDVDPVRTTSNDIVEIFTVLGIEAVRKALER ELYHVISFDGSYVNYRHLALLCDTMTCRGHLMAITRHGVNRQDTGPLMKCSFEETVDVLM EAAAHGESDPMKGVSENIMLGQLAPAGTGCFDLLLDAEKCKYGMEIPTNIPGLGAAGPTG MFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGFSPSAASDASGFSPG YSPAWSPTPGSPGSPGPSSPYIPSPGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSP SYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSP TSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS YSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPTSPSYSPT SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPASPKYTPTSPSYSPSSPEYTPTSPKY SPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTSPKYSPTSPTYSPTS PKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAISPDDSDEEN |
Number of residues | 1970 |
Molecular Weight | 217174.235 |
Theoretical pI | 7.38 |
GO Classification | Functions
Processes
Components
|
General Function | Ubiquitin protein ligase binding |
Specific Function | DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Largest and catalytic component of RNA polymerase II which synthesizes mRNA precursors and many functional non-coding RNAs. Forms the polymerase active center together with the second largest subunit. Pol II is the central component of the basal RNA polymerase II transcription machinery. It is composed of mobile elements that move relative to each other. RPB1 is part of the core element with the central large cleft, the clamp element that moves to open and close the cleft and the jaws that are thought to grab the incoming DNA template. At the start of transcription, a single-stranded DNA template strand of the promoter is positioned within the central active site cleft of Pol II. A bridging helix emanates from RPB1 and crosses the cleft near the catalytic site and is thought to promote translocation of Pol II by acting as a ratchet that moves the RNA-DNA hybrid through the active site by switching from straight to bent conformations at each step of nucleotide addition. During transcription elongation, Pol II moves on the template as the transcript elongates. Elongation is influenced by the phosphorylation status of the C-terminal domain (CTD) of Pol II largest subunit (RPB1), which serves as a platform for assembly of factors that regulate transcription initiation, elongation, termination and mRNA processing. Acts as an RNA-dependent RNA polymerase when associated with small delta antigen of Hepatitis delta virus, acting both as a replicate and transcriptase for the viral RNA circular genome. |
Pfam Domain Function | |
Transmembrane Regions | Not Available |
GenBank Protein ID | 36124 |
UniProtKB ID | P24928 |
UniProtKB Entry Name | RPB1_HUMAN |
Cellular Location | Nucleus |
Gene sequence | >lcl|BSEQ0020705|DNA-directed RNA polymerase II subunit RPB1 (POLR2A) ATGCACGGGGGTGGCCCCCCCTCGGGGGACAGCGCATGCCCGCTGCGCACCATCAAGAGA GTCCAGTTCGGAGTCCTGAGTCCGGATGAACTGAAGCGAATGTCTGTGACGGAGGGTGGC ATCAAATACCCAGAGACGACTGAGGGAGGCCGCCCCAAGCTTGGGGGGCTGATGGACCCG AGGCAGGGGGTGATTGAGCGGACTGGCCGCTGCCAAACATGTGCAGGAAACATGACAGAG TGTCCTGGCCACTTTGGCCACATTGAACTGGCCAAGCCTGTGTTTCACGTGGGCTTCCTG GTGAAGACAATGAAAGTTTTGCGCTGTGTCTGCTTCTTCTGCTCCAAACTGCTTGTGGAC TCTAACAACCCAAAGATCAAGGATATCCTGGCTAAGTCCAAGGGACAGCCCAAGAAGCGG CTCACACATGTCTACGACCTTTGCAAGGGCAAAAACATATGCGAGGGTGGGGAGGAGATG GACAACAAGTTCGGTGTGGAACAACCTGAGGGTGACGAGGATCTGACCAAAGAAAAGGGC CATGGTGGCTGTGGGCGGTACCAGCCCAGGATCCGGCGTTCTGGCCTAGAGCTGTATGCG GAATGGAAGCACGTTAATGAGGACTCTCAGGAGAAGAAGATCCTGCTGAGTCCAGAGCGA GTGCATGAGATCTTCAAACGCATCTCAGATGAGGAGTGTTTTGTGCTGGGCATGGAGCCC CGCTATGCACGGCCAGAGTGGATGATTGTCACAGTGCTGCCTGTGCCCCCGCTCTCCGTG CGGCCTGCTGTTGTGATGCAGGGCTCTGCCCGTAACCAGGATGACCTGACTCACAAACTG GCTGACATCGTGAAGATCAACAATCAGCTGCGGCGCAATGAGCAGAACGGCGCAGCGGCC CATGTCATTGCAGAGGATGTGAAGCTCCTCCAGTTCCATGTGGCCACCATGGTGGACAAT GAGCTGCCTGGCTTGCCCCGTGCCATGCAGAAGTCTGGGCGTCCCCTCAAGTCCCTGAAG CAGCGGTTGAAGGGCAAGGAAGGCCGGGTGCGAGGGAACCTGATGGGCAAAAGAGTGGAC TTCTCGGCCCGTACTGTCATCACCCCCGACCCCAACCTCTCCATTGACCAGGTTGGCGTG CCCCGCTCCATTGCTGCCAACATGACCTTTGCGGAGATTGTCACCCCCTTCAACATTGAC AGACTTCAAGAACTAGTGCGCAGGGGGAACAGCCAGTACCCAGGCGCCAAGTACATCATC CGAGACAATGGTGATCGCATTGACTTGCGTTTCCACCCCAAGCCCAGTGACCTTCACCTG CAGACCGGCTATAAGGTGGAACGGCACATGTGTGATGGGGACATTGTTATCTTCAACCGG CAGCCAACTCTGCACAAAATGTCCATGATGGGGCATCGGGTCCGCATTCTCCCATGGTCT ACCTTTCGCTTGAATCTTAGTGTGACAACTCCGTACAATGCAGACTTTGACGGGGATGAG ATGAACTTGCACCTGCCACAGTCTCTGGAGACGCGAGCAGAGATCCAGGAGCTGGCCATG GTTCCTCGCATGATTGTCACCCCCCAGAGCAATCGGCCTGTCATGGGTATTGTGCAGGAC ACACTCACAGCAGTGCGCAAATTCACCAAGAGAGACGTCTTCCTGGAGCGGGGTGAAGTG ATGAACCTCCTGATGTTCCTGTCGACGTGGGATGGGAAGGTCCCACAGCCGGCCATCCTA AAGCCCCGGCCCCTGTGGACAGGCAAGCAAATCTTCTCCCTCATCATACCTGGTCACATC AATTGTATCCGTACCCACAGCACCCATCCCGATGATGAAGACAGTGGCCCTTACAAGCAC ATCTCTCCTGGGGACACCAAGGTGGTGGTGGAGAATGGGGAGCTGATCATGGGCATCCTG TGTAAGAAGTCTCTGGGCACGTCAGCTGGCTCCCTGGTCCACATCTCCTACCTAGAGATG GGTCATGACATCACTCGCCTCTTCTACTCCAACATTCAGACTGTCATTAACAACTGGCTC CTCATCGAGGGTCATACTATTGGCATTGGGGACTCCATTGCTGATTCTAAGACTTACCAG GACATTCAGAACACTATTAAGAAGGCCAAGCAGGACGTAATAGAGGTCATCGAGAAGGCA CACAACAATGAGCTGGAGCCCACCCCAGGGAACACTCTGCGGCAGACGTTTGAGAATCAG GTGAACCGCATTCTTAACGATGCCCGAGACAAGACTGGCTCCTCTGCTCAGAAATCCCTG TCTGAATACAACAACTTCAAGTCTATGGTCGTGTCCGGAGCTAAAGGTTCCAAGATTAAC ATCTCCCAGGTCATTGCTGTCGTTGGACAGCAGAACGTCGAGGGCAAGCGGATTCCATTT GGCTTCAAGCACCGGACTCTGCCTCACTTCATCAAGGATGACTACGGGCCTGAGAGCCGT GGCTTTGTGGAGAACTCCTACCTAGCCGGCCTCACACCCACTGAGTTCTTTTTCCACGCC ATGGGGGGTCGTGAGGGGCTCATTGACACGGCTGTCAAGACTGCTGAGACTGGATACATC CAGCGGCGGCTGATCAAGTCCATGGAGTCAGTGATGGTGAAGTACGACGCGACTGTGCGG AACTCCATCAACCAGGTGGTGCAGCTGCGCTACGGCGAAGACGGCCTGGCAGGCGAGAGC GTTGAGTTCCAGAACCTGGCTACGCTTAAGCCTTCCAACAAGGCTTTTGAGAAGAAGTTC CGCTTTGATTATACCAATGAGAGGGCCCTGCGGCGCACTCTGCAGGAGGACCTGGTGAAG GACGTGCTGAGCAACGCACACATCCAGAACGAGTTGGAGCGGGAATTTGAGCGGATGCGG GAGGATCGGGAGGTGCTCAGGGTCATCTTCCCAACTGGAGACAGCAAGGTCGTCCTCCCC TGTAACCTGCTGCGGATGATCTGGAATGCTCAGAAAATCTTCCACATCAACCCACGCCTT CCCTCCGACCTGCACCCCATCAAAGTGGTGGAGGGAGTCAAGGAATTGAGCAAGAAGCTG GTGATTGTGAATGGGGATGACCCACTAAGTCGACAGGCCCAGGAAAATGCCACGCTGCTC TTCAACATCCACCTGCGGTCCACGTTGTGTTCCCGCCGCATGGCAGAGGAGTTTCGGCTC AGTGGGGAGGCCTTCGACTGGCTGCTTGGGGAGATTGAGTCCAAGTTCAACCAAGCCATT GCGCATCCCGGGGAAATGGTGGGGGCTCTGGCTGCGCAGTCCCTTGGAGAACCTGCCACC CAGATGACCTTGAATACCTTCCACTATGCTGGTGTGTCTGCCAAGAATGTGACGCTGGGT GTGCCCCGACTTAAGGAGCTCATCAACATTTCCAAGAAGCCAAAGACTCCTTCGCTTACT GTCTTCCTGTTGGGCCAGTCCGCTCGAGATGCTGAGAGAGCCAAGGATATTCTGTGCCGT CTGGAGCATACAACGTTGAGGAAGGTGACTGCCAACACAGCCATCTACTATGACCCCAAC CCCCAGAGCACGGTGGTGGCAGAGGATCAGGAATGGGTGAATGTCTACTATGAAATGCCT GACTTTGATGTGGCCCGAATCTCCCCCTGGCTGTTGCGGGTGGAGCTGGATCGGAAGCAC ATGACTGACCGGAAGCTCACCATGGAGCAGATTGCTGAAAAGATCAATGCTGGTTTTGGT GACGACTTGAACTGCATCTTTAATGATGACAATGCAGAGAAGCTGGTGCTCCGTATTCGC ATCATGAACAGCGATGAGAACAAGATGCAAGAGGAGGAAGAGGTGGTGGACAAGATGGAT GATGATGTCTTCCTGCGCTGCATCGAGTCCAACATGCTGACAGATATGACCCTGCAGGGC ATCGAGCAGATCAGCAAGGTGTACATGCACTTGCCACAGACAGACAACAAGAAGAAGATC ATCATCACGGAGGATGGGGAATTCAAGGCCCTGCAGGAGTGGATCCTGGAGACGGACGGC GTGAGCTTGATGCGGGTGCTGAGTGAGAAGGACGTGGACCCCGTACGCACCACGTCCAAT GACATTGTGGAGATCTTCACGGTGCTGGGCATTGAAGCCGTGCGGAAGGCCCTGGAGCGG GAGCTGTACCACGTCATCTCCTTTGATGGCTCCTATGTCAATTACCGACACTTGGCTCTC TTGTGTGATACCATGACCTGTCGTGGCCACTTGATGGCCATCACCCGACACGGAGTCAAC CGCCAGGACACAGGACCACTCATGAAGTGTTCCTTTGAGGAAACGGTGGACGTGCTTATG GAAGCAGCCGCACACGGTGAGAGTGACCCCATGAAGGGGGTCTCTGAGAATATCATGCTG GGCCAGCTGGCTCCGGCCGGCACTGGCTGCTTTGACCTCCTGCTTGATGCAGAGAAGTGC AAGTATGGCATGGAGATCCCCACCAATATCCCCGGCCTGGGGGCTGCTGGACCCACCGGC ATGTTCTTTGGTTCAGCACCCAGTCCCATGGGTGGAATCTCTCCTGCCATGACACCTTGG AACCAGGGTGCAACCCCTGCCTATGGCGCCTGGTCCCCCAGTGTTGGGAGTGGAATGACC CCAGGGGCAGCCGGCTTCTCTCCCAGTGCTGCGTCAGATGCCAGCGGCTTCAGCCCAGGT TACTCCCCTGCCTGGTCTCCCACACCGGGCTCCCCGGGGTCCCCAGGTCCCTCAAGCCCC TACATCCCTTCACCAGGTGGTGCCATGTCTCCCAGCTACTCGCCAACGTCACCTGCCTAC GAGCCCCGCTCTCCTGGGGGCTACACACCCCAGAGTCCCTCTTATTCCCCCACTTCACCC TCCTACTCCCCTACCTCTCCATCCTATTCTCCAACCAGTCCCAACTATAGTCCCACATCA CCCAGCTATTCGCCAACGTCACCCAGCTACTCACCGACCTCTCCCAGCTACTCACCCACC TCTCCCAGCTACTCGCCCACCTCTCCCAGCTACTCGCCCACCTCTCCCAGCTACTCACCC ACTTCCCCTAGCTACTCGCCCACTTCCCCTAGCTACTCGCCAACGTCTCCCAGCTACTCG CCGACATCTCCCAGCTACTCGCCAACTTCACCCAGCTATTCTCCCACTTCTCCCAGCTAC TCACCTACCTCTCCAAGCTATTCACCCACCTCCCCCAGCTACTCACCCACTTCCCCAAGT TACTCACCCACCAGCCCGAACTATTCTCCAACCAGTCCCAATTACACCCCAACATCACCC AGCTACAGCCCGACATCACCCAGCTATTCACCTACTAGTCCCAACTACACACCTACCAGC CCTAACTACAGCCCAACCTCTCCAAGCTACTCTCCAACATCACCCAGCTATTCCCCGACC TCACCAAGTTACTCCCCTTCCAGCCCACGATACACACCACAGTCTCCAACCTATACCCCA AGCTCACCCAGCTACAGCCCCAGCTCGCCCAGCTACAGCCCAACCTCACCCAAGTACACC CCAACCAGTCCTTCTTACAGTCCCAGCTCCCCAGAGTATACCCCAACCTCTCCCAAGTAC TCACCTACCAGTCCCAAATATTCACCCACCTCTCCCAAGTACTCGCCTACCAGTCCCACC TATTCACCCACCACCCCAAAATACTCCCCAACATCTCCTACTTATTCCCCAACCTCTCCA GTCTACACCCCAACCTCTCCCAAGTACTCACCTACTAGCCCCACTTACTCGCCCACTTCC CCCAAGTACTCGCCCACCAGCCCCACCTACTCGCCCACCTCCCCCAAAGGCTCAACCTAC TCTCCCACTTCCCCTGGTTACTCGCCCACCAGCCCCACCTACAGTCTCACAAGCCCGGCT ATCAGCCCGGATGACAGTGACGAGGAGAACTGA |
GenBank Gene ID | X63564 |
GeneCard ID | Not Available |
GenAtlas ID | Not Available |
HGNC ID | HGNC:9187 |
Chromosome Location | 17 |
Locus | 17p13.1 |
References |
|