Methylcytosine dioxygenase TET1
Name | Methylcytosine dioxygenase TET1 |
---|---|
Synonyms | |
Gene Name | TET1 |
Organism | Human |
Amino acid sequence | >lcl|BSEQ0038056|Methylcytosine dioxygenase TET1 MSRSRHARPSRLVRKEDVNKKKKNSQLRKTTKGANKNVASVKTLSPGKLKQLIQERDVKK KTEPKPPVPVRSLLTRAGAARMNLDRTEVLFQNPESLTCNGFTMALRSTSLSRRLSQPPL VVAKSKKVPLSKGLEKQHDCDYKILPALGVKHSENDSVPMQDTQVLPDIETLIGVQNPSL LKGKSQETTQFWSQRVEDSKINIPTHSGPAAEILPGPLEGTRCGEGLFSEETLNDTSGSP KMFAQDTVCAPFPQRATPKVTSQGNPSIQLEELGSRVESLKLSDSYLDPIKSEHDCYPTS SLNKVIPDLNLRNCLALGGSTSPTSVIKFLLAGSKQATLGAKPDHQEAFEATANQQEVSD TTSFLGQAFGAIPHQWELPGADPVHGEALGETPDLPEIPGAIPVQGEVFGTILDQQETLG MSGSVVPDLPVFLPVPPNPIATFNAPSKWPEPQSTVSYGLAVQGAIQILPLGSGHTPQSS SNSEKNSLPPVMAISNVENEKQVHISFLPANTQGFPLAPERGLFHASLGIAQLSQAGPSK SDRGSSQVSVTSTVHVVNTTVVTMPVPMVSTSSSSYTTLLPTLEKKKRKRCGVCEPCQQK TNCGECTYCKNRKNSHQICKKRKCEELKKKPSVVVPLEVIKENKRPQREKKPKVLKADFD NKPVNGPKSESMDYSRCGHGEEQKLELNPHTVENVTKNEDSMTGIEVEKWTQNKKSQLTD HVKGDFSANVPEAEKSKNSEVDKKRTKSPKLFVQTVRNGIKHVHCLPAETNVSFKKFNIE EFGKTLENNSYKFLKDTANHKNAMSSVATDMSCDHLKGRSNVLVFQQPGFNCSSIPHSSH SIINHHASIHNEGDQPKTPENIPSKEPKDGSPVQPSLLSLMKDRRLTLEQVVAIEALTQL SEAPSENSSPSKSEKDEESEQRTASLLNSCKAILYTVRKDLQDPNLQGEPPKLNHCPSLE KQSSCNTVVFNGQTTTLSNSHINSATNQASTKSHEYSKVTNSLSLFIPKSNSSKIDTNKS IAQGIITLDNCSNDLHQLPPRNNEVEYCNQLLDSSKKLDSDDLSCQDATHTQIEEDVATQ LTQLASIIKINYIKPEDKKVESTPTSLVTCNVQQKYNQEKGTIQQKPPSSVHNNHGSSLT KQKNPTQKKTKSTPSRDRRKKKPTVVSYQENDRQKWEKLSYMYGTICDIWIASKFQNFGQ FCPHDFPTVFGKISSSTKIWKPLAQTRSIMQPKTVFPPLTQIKLQRYPESAEEKVKVEPL DSLSLFHLKTESNGKAFTDKAYNSQVQLTVNANQKAHPLTQPSSPPNQCANVMAGDDQIR FQQVVKEQLMHQRLPTLPGISHETPLPESALTLRNVNVVCSGGITVVSTKSEEEVCSSSF GTSEFSTVDSAQKNFNDYAMNFFTNPTKNLVSITKDSELPTCSCLDRVIQKDKGPYYTHL GAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKEGKSSHGCPIAKWVLRRSSDEEKVLC LVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYTELTENLKSYNGHPTDRRCTLNENRT CTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLA TRLAPIYKQYAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHRDIHNMNN GSTVVCTLTREDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGAIEVLAPR RKKRTCFTQPVPRSGKKRAAMMTEVLAHKIRAVEKKPIPRIKRKNNSTTTNNSKPSSLPT LGSNTETVQPEVKSETEPHFILKSSDNTKTYSLMPSAPHPVKEASPGFSWSPKTASATPA 
PLKNDATASCGFSERSSTPHCTMPSGRLSGANAAAADGPGISQLGEVAPLPTLSAPVMEP LINSEPSTGVTEPLTPHQPNHQPSFLTSPQDLASSPMEEDEQHSEADEPPSDEPLSDDPL SPAEEKLPHIDEYWSDSEHIFLDANIGGVAIAPAHGSVLIECARRELHATTPVEHPNRNH PTRLSLVFYQHKNLNKPQHGFELNKIKFEAKEAKNKKMKASEQKDQAANEGPEQSSEVNE LNQIPSHKALTLTHDNVVTVSPYALTHVAGPYNHWV |
Number of residues | 2136 |
Molecular Weight | 235306.965 Da |
Theoretical pI | Not Available |
GO Classification | Functions; Processes; Components |
General Function | Zinc ion binding |
Specific Function | Dioxygenase that catalyzes the conversion of the modified genomic base 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC) and plays a key role in active DNA demethylation. Also mediates subsequent conversion of 5hmC into 5-formylcytosine (5fC), and conversion of 5fC to 5-carboxylcytosine (5caC). Conversion of 5mC into 5hmC, 5fC and 5caC probably constitutes the first step in cytosine demethylation. Methylation at the C5 position of cytosine bases is an epigenetic modification of the mammalian genome which plays an important role in transcriptional regulation. In addition to its role in DNA demethylation, plays a more general role in chromatin regulation. Preferentially binds to CpG-rich sequences at promoters of both transcriptionally active and Polycomb-repressed genes. Involved in the recruitment of the O-GlcNAc transferase OGT to CpG-rich transcription start sites of active genes, thereby promoting histone H2B GlcNAcylation by OGT. Also involved in transcriptional repression of a subset of genes through recruitment of transcriptional repressors to promoters. Involved in the balance between pluripotency and lineage commitment of cells; it plays a role in embryonic stem cell maintenance and inner cell mass cell specification. |
Pfam Domain Function | |
Transmembrane Regions | Not Available |
GenBank Protein ID | Not Available |
UniProtKB ID | Q8NFU7 |
UniProtKB Entry Name | TET1_HUMAN |
Cellular Location | Nucleus |
Gene sequence | >lcl|BSEQ0013961|Methylcytosine dioxygenase TET1 (TET1) ATGTCTCGATCCCGCCATGCAAGGCCTTCCAGATTAGTCAGGAAGGAAGATGTAAACAAA AAAAAGAAAAACAGCCAACTACGAAAGACAACCAAGGGAGCCAACAAAAATGTGGCATCA GTCAAGACTTTAAGCCCTGGAAAATTAAAGCAATTAATTCAAGAAAGAGATGTTAAGAAA AAAACAGAACCTAAACCACCCGTGCCAGTCAGAAGCCTTCTGACAAGAGCTGGAGCAGCA CGCATGAATTTGGATAGGACTGAGGTTCTTTTTCAGAACCCAGAGTCCTTAACCTGCAAT GGGTTTACAATGGCGCTACGAAGCACCTCTCTTAGCAGGCGACTCTCCCAACCCCCACTG GTCGTAGCCAAATCCAAAAAGGTTCCACTTTCTAAGGGTTTAGAAAAGCAACATGATTGT GATTATAAGATACTCCCTGCTTTGGGAGTAAAGCACTCAGAAAATGATTCGGTTCCAATG CAAGACACCCAAGTCCTTCCTGATATAGAGACTCTAATTGGTGTACAAAATCCCTCTTTA CTTAAAGGTAAGAGCCAAGAGACAACTCAGTTTTGGTCCCAAAGAGTTGAGGATTCCAAG ATCAATATCCCTACCCACAGTGGCCCTGCAGCTGAGATCCTTCCTGGGCCACTGGAAGGG ACACGCTGTGGTGAAGGACTATTCTCTGAAGAGACATTGAATGATACCAGTGGTTCCCCA AAAATGTTTGCTCAGGACACAGTGTGTGCTCCTTTTCCCCAAAGAGCAACCCCCAAAGTT ACCTCTCAAGGAAACCCCAGCATTCAGTTAGAAGAGTTGGGTTCACGAGTAGAATCTCTT AAGTTATCTGATTCTTACCTGGATCCCATTAAAAGTGAACATGATTGCTACCCCACCTCC AGTCTTAATAAGGTTATACCTGACTTGAACCTTAGAAACTGCTTGGCTCTTGGTGGGTCT ACGTCTCCTACCTCTGTAATAAAATTCCTCTTGGCAGGCTCAAAACAAGCGACCCTTGGT GCTAAACCAGATCATCAAGAGGCCTTCGAAGCTACTGCAAATCAACAGGAAGTTTCTGAT ACCACCTCTTTCCTAGGACAGGCCTTTGGTGCTATCCCACATCAATGGGAACTTCCTGGT GCTGACCCAGTTCATGGTGAGGCCCTGGGTGAGACCCCAGATCTACCAGAGATTCCTGGT GCTATTCCAGTCCAAGGAGAGGTCTTTGGTACTATTTTAGACCAACAAGAAACTCTTGGT ATGAGTGGGAGTGTTGTCCCAGACTTGCCTGTCTTCCTTCCTGTTCCTCCAAATCCAATT GCTACCTTTAATGCTCCTTCCAAATGGCCTGAGCCCCAAAGCACTGTCTCATATGGACTT GCAGTCCAGGGTGCTATACAGATTTTGCCTTTGGGCTCAGGACACACTCCTCAATCATCA TCAAACTCAGAGAAAAATTCATTACCTCCAGTAATGGCTATAAGCAATGTAGAAAATGAG AAGCAGGTTCATATAAGCTTCCTGCCAGCTAACACTCAGGGGTTCCCATTAGCCCCTGAG AGAGGACTCTTCCATGCTTCACTGGGTATAGCCCAACTCTCTCAGGCTGGTCCTAGCAAA TCAGACAGAGGGAGCTCCCAGGTCAGTGTAACCAGCACAGTTCATGTTGTCAACACCACA GTGGTGACTATGCCAGTGCCAATGGTCAGTACCTCCTCTTCTTCCTATACCACTTTGCTA CCGACTTTGGAAAAGAAGAAAAGAAAGCGATGTGGGGTCTGTGAACCCTGCCAGCAGAAG ACCAACTGTGGTGAATGCACTTACTGCAAGAACAGAAAGAACAGCCATCAGATCTGTAAG 
AAAAGAAAATGTGAGGAGCTGAAAAAGAAACCATCTGTTGTTGTGCCTCTGGAGGTTATA AAGGAAAACAAGAGGCCCCAGAGGGAAAAGAAGCCCAAAGTTTTAAAGGCAGATTTTGAC AACAAACCAGTAAATGGCCCCAAGTCAGAATCCATGGACTACAGTAGATGTGGTCATGGG GAAGAACAAAAATTGGAATTGAACCCACATACTGTTGAAAATGTAACTAAAAATGAAGAC AGCATGACAGGCATCGAGGTGGAGAAGTGGACACAAAACAAGAAATCACAGTTAACTGAT CACGTGAAAGGAGATTTTAGTGCTAATGTCCCAGAAGCTGAAAAATCGAAAAACTCTGAA GTTGACAAGAAACGAACCAAATCTCCAAAATTGTTTGTACAAACCGTAAGAAATGGCATT AAACATGTACACTGTTTACCAGCTGAAACAAATGTTTCATTTAAAAAATTCAATATTGAA GAATTCGGCAAGACATTGGAAAACAATTCTTATAAATTCCTAAAAGACACTGCAAACCAT AAAAACGCTATGAGCTCTGTTGCTACTGATATGAGTTGTGATCATCTCAAGGGGAGAAGT AACGTTTTAGTATTCCAGCAGCCTGGCTTTAACTGCAGTTCCATTCCACATTCTTCACAC TCCATCATAAATCATCATGCTAGTATACACAATGAAGGTGATCAACCAAAAACTCCTGAG AATATACCAAGTAAAGAACCAAAAGATGGATCTCCCGTTCAACCAAGTCTCTTATCGTTA ATGAAAGATAGGAGATTAACATTGGAGCAAGTGGTAGCCATAGAGGCCCTGACTCAACTC TCAGAAGCCCCATCAGAGAATTCCTCCCCATCAAAGTCAGAGAAGGATGAGGAATCAGAG CAGAGAACAGCCAGTTTGCTTAATAGCTGCAAAGCTATCCTCTACACTGTAAGAAAAGAC CTCCAAGACCCAAACTTACAGGGAGAGCCACCAAAACTTAATCACTGTCCATCTTTGGAA AAACAAAGTTCATGCAACACGGTGGTTTTCAATGGGCAAACTACTACCCTTTCCAACTCA CATATCAACTCAGCTACTAACCAAGCATCCACAAAGTCACATGAATATTCAAAAGTCACA AATTCATTATCTCTTTTTATACCAAAATCAAATTCATCCAAGATTGACACCAATAAAAGT ATTGCTCAAGGGATAATTACTCTTGACAATTGTTCCAATGATTTGCATCAGTTGCCACCA AGAAATAATGAAGTGGAGTATTGCAACCAGTTACTGGACAGCAGCAAAAAATTGGACTCA GATGATCTATCATGTCAGGATGCAACCCATACCCAAATTGAGGAAGATGTTGCAACACAG TTGACACAACTTGCTTCGATAATTAAGATCAATTATATAAAACCAGAGGACAAAAAAGTT GAAAGTACACCAACAAGCCTTGTCACATGTAATGTACAGCAAAAATACAATCAGGAGAAG GGCACAATACAACAGAAACCACCTTCAAGTGTACACAATAATCATGGTTCATCATTAACA AAACAAAAGAACCCAACCCAGAAAAAGACAAAATCCACCCCATCAAGAGATCGGCGGAAA AAGAAGCCCACAGTTGTAAGTTATCAAGAAAATGATCGGCAGAAGTGGGAAAAGTTGTCC TATATGTATGGCACAATATGCGACATTTGGATAGCATCGAAATTTCAAAATTTTGGGCAA TTTTGTCCACATGATTTTCCTACTGTATTTGGGAAAATTTCTTCCTCGACCAAAATATGG AAACCACTGGCTCAAACGAGGTCCATTATGCAACCCAAAACAGTATTTCCACCACTCACT CAGATAAAATTACAGAGATATCCTGAATCAGCAGAGGAAAAGGTGAAGGTTGAACCATTG 
GATTCACTCAGCTTATTTCATCTTAAAACGGAATCCAACGGGAAGGCATTCACTGATAAA GCTTATAATTCTCAGGTACAGTTAACGGTGAATGCCAATCAGAAAGCCCATCCTTTGACC CAGCCCTCCTCTCCACCTAACCAGTGTGCTAACGTGATGGCAGGCGATGACCAAATACGG TTTCAGCAGGTTGTTAAGGAGCAACTCATGCATCAGAGACTGCCAACATTGCCTGGTATC TCTCATGAAACACCCTTACCGGAGTCAGCACTAACTCTCAGGAATGTAAATGTAGTGTGT TCAGGTGGAATTACAGTGGTTTCTACCAAAAGTGAAGAGGAAGTCTGTTCATCCAGTTTT GGAACATCAGAATTTTCCACAGTGGACAGTGCACAGAAAAATTTTAATGATTATGCCATG AACTTCTTTACTAACCCTACAAAAAACCTAGTGTCTATAACTAAAGATTCTGAACTGCCC ACCTGCAGCTGTCTTGATCGAGTTATACAAAAAGACAAAGGCCCATATTATACACACCTT GGGGCAGGACCAAGTGTTGCTGCTGTCAGGGAAATCATGGAGAATAGGTATGGTCAAAAA GGAAACGCAATAAGGATAGAAATAGTAGTGTACACCGGTAAAGAAGGGAAAAGCTCTCAT GGGTGTCCAATTGCTAAGTGGGTTTTAAGAAGAAGCAGTGATGAAGAAAAAGTTCTTTGT TTGGTCCGGCAGCGTACAGGCCACCACTGTCCAACTGCTGTGATGGTGGTGCTCATCATG GTGTGGGATGGCATCCCTCTTCCAATGGCCGACCGGCTATACACAGAGCTCACAGAGAAT CTAAAGTCATACAATGGGCACCCTACCGACAGAAGATGCACCCTCAATGAAAATCGTACC TGTACATGTCAAGGAATTGATCCAGAGACTTGTGGAGCTTCATTCTCTTTTGGCTGTTCA TGGAGTATGTACTTTAATGGCTGTAAGTTTGGTAGAAGCCCAAGCCCCAGAAGATTTAGA ATTGATCCAAGCTCTCCCTTACATGAAAAAAACCTTGAAGATAACTTACAGAGTTTGGCT ACACGATTAGCTCCAATTTATAAGCAGTATGCTCCAGTAGCTTACCAAAATCAGGTGGAA TATGAAAATGTTGCCCGAGAATGTCGGCTTGGCAGCAAGGAAGGTCGTCCCTTCTCTGGG GTCACTGCTTGCCTGGACTTCTGTGCTCATCCCCACAGGGACATTCACAACATGAATAAT GGAAGCACTGTGGTTTGTACCTTAACTCGAGAAGATAACCGCTCTTTGGGTGTTATTCCT CAAGATGAGCAGCTCCATGTGCTACCTCTTTATAAGCTTTCAGACACAGATGAGTTTGGC TCCAAGGAAGGAATGGAAGCCAAGATCAAATCTGGGGCCATCGAGGTCCTGGCACCCCGC CGCAAAAAAAGAACGTGTTTCACTCAGCCTGTTCCCCGTTCTGGAAAGAAGAGGGCTGCG ATGATGACAGAGGTTCTTGCACATAAGATAAGGGCAGTGGAAAAGAAACCTATTCCCCGA ATCAAGCGGAAGAATAACTCAACAACAACAAACAACAGTAAGCCTTCGTCACTGCCAACC TTAGGGAGTAACACTGAGACCGTGCAACCTGAAGTAAAAAGTGAAACCGAACCCCATTTT ATCTTAAAAAGTTCAGACAACACTAAAACTTATTCGCTGATGCCATCCGCTCCTCACCCA GTGAAAGAGGCATCTCCAGGCTTCTCCTGGTCCCCGAAGACTGCTTCAGCCACACCAGCT CCACTGAAGAATGACGCAACAGCCTCATGCGGGTTTTCAGAAAGAAGCAGCACTCCCCAC TGTACGATGCCTTCGGGAAGACTCAGTGGTGCCAATGCAGCTGCTGCTGATGGCCCTGGC 
ATTTCACAGCTTGGCGAAGTGGCTCCTCTCCCCACCCTGTCTGCTCCTGTGATGGAGCCC CTCATTAATTCTGAGCCTTCCACTGGTGTGACTGAGCCGCTAACGCCTCATCAGCCAAAC CACCAGCCCTCCTTCCTCACCTCTCCTCAAGACCTTGCCTCTTCTCCAATGGAAGAAGAT GAGCAGCATTCTGAAGCAGATGAGCCTCCATCAGACGAACCCCTATCTGATGACCCCCTG TCACCTGCTGAGGAGAAATTGCCCCACATTGATGAGTATTGGTCAGACAGTGAGCACATC TTTTTGGATGCAAATATTGGTGGGGTGGCCATCGCACCTGCTCACGGCTCGGTTTTGATT GAGTGTGCCCGGCGAGAGCTGCACGCTACCACTCCTGTTGAGCACCCCAACCGTAATCAT CCAACCCGCCTCTCCCTTGTCTTTTACCAGCACAAAAACCTAAATAAGCCCCAACATGGT TTTGAACTAAACAAGATTAAGTTTGAGGCTAAAGAAGCTAAGAATAAGAAAATGAAGGCC TCAGAGCAAAAAGACCAGGCAGCTAATGAAGGTCCAGAACAGTCCTCTGAAGTAAATGAA TTGAACCAAATTCCTTCTCATAAAGCATTAACATTAACCCATGACAATGTTGTCACCGTG TCCCCTTATGCTCTCACACACGTTGCGGGGCCCTATAACCATTGGGTCTGA |
GenBank Gene ID | Not Available |
GeneCard ID | Not Available |
GenAtlas ID | Not Available |
HGNC ID | HGNC:29484 |
Chromosome Location | 10 |
Locus | Not Available |
References | |
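
The Number of residues and Molecular Weight rows can be sanity-checked against the sequence rows of this record. The following is a minimal Python sketch, not the method used to produce the record. The helper names (`split_flattened_fasta`, `average_mass`, `expected_cds_length`) are hypothetical. The header/sequence split is a heuristic that assumes sequence blocks are the trailing all-uppercase tokens of the cell. The mass calculation assumes standard average residue masses, so it may differ slightly from the value listed in the record. Only the first two 60-residue blocks of the amino acid sequence are used as input here, to keep the example short.

```python
import re

# Average residue masses in daltons (amino acid minus one water).
# Assumption: standard average-isotope values; the record may use another table.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the free N and C termini


def split_flattened_fasta(cell):
    """Split a flattened FASTA table cell into (header, sequence).

    Heuristic (an assumption about this record's layout): the sequence is
    the run of trailing tokens made only of uppercase letters; everything
    before that run is the header.
    """
    text = cell.strip().strip("|").strip().lstrip(">")
    tokens = text.split()
    i = len(tokens)
    while i > 0 and re.fullmatch(r"[A-Z]+", tokens[i - 1]):
        i -= 1
    return " ".join(tokens[:i]), "".join(tokens[i:])


def average_mass(sequence):
    """Approximate average molecular mass (Da) of an unmodified peptide."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


def expected_cds_length(n_residues):
    """Nucleotides in a coding sequence: one codon per residue plus a stop codon."""
    return 3 * (n_residues + 1)


# Excerpt only: the first two sequence blocks from the Amino acid sequence row.
excerpt = (
    ">lcl|BSEQ0038056|Methylcytosine dioxygenase TET1 "
    "MSRSRHARPSRLVRKEDVNKKKKNSQLRKTTKGANKNVASVKTLSPGKLKQLIQERDVKK "
    "KTEPKPPVPVRSLLTRAGAARMNLDRTEVLFQNPESLTCNGFTMALRSTSLSRRLSQPPL |"
)
header, seq = split_flattened_fasta(excerpt)

print("header:", header)
print("residues in excerpt:", len(seq))
print("approx. mass of excerpt (Da):", round(average_mass(seq), 3))
print("CDS length implied by 2136 residues:", expected_cds_length(2136))
```

Run on the full Amino acid sequence cell, `len(seq)` should match the Number of residues row if the record is internally consistent. If the Gene sequence row is a complete coding sequence ending in a stop codon, its length should equal `expected_cds_length(2136)`.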