Identification |
HMDB Protein ID
| CDBP01573 |
Secondary Accession Numbers
| Not Available |
Name
| Histone-lysine N-methyltransferase SETD1A |
Description
| Not Available |
Synonyms
|
- Lysine N-methyltransferase 2F
- SET domain-containing protein 1A
- Set1/Ash2 histone methyltransferase complex subunit SET1
- hSET1A
|
Gene Name
| SETD1A |
Protein Type
| Enzyme |
Biological Properties |
General Function
| Involved in nucleotide binding |
Specific Function
| Histone methyltransferase that specifically methylates 'Lys-4' of histone H3, when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. The non-overalpping localization with SETD1B suggests that SETD1A and SETD1B make non-redundant contributions to the epigenetic control of chromatin structure and gene expression.
|
GO Classification
|
Biological Process |
regulation of transcription, DNA-dependent |
transcription, DNA-dependent |
Cellular Component |
chromosome |
nuclear speck |
Set1C/COMPASS complex |
Function |
binding |
nucleotide binding |
nucleic acid binding |
Molecular Function |
RNA binding |
histone methyltransferase activity (H3-K4 specific) |
nucleotide binding |
|
Cellular Location
|
- Nucleus speckle
- Chromosome
|
Pathways
|
Not Available
|
Gene Properties |
Chromosome Location
| 16 |
Locus
| 16p11.2 |
SNPs
| SETD1A |
Gene Sequence
|
>5124 bp
ATGGATCAGGAAGGTGGGGGAGATGGGCAGAAGGCCCCGAGCTTCCAGTGGCGGAACTAC
AAGCTCATCGTGGATCCTGCCTTGGACCCTGCCCTGCGCAGGCCTTCTCAGAAGGTGTAC
CGCTATGATGGAGTCCACTTCAGTGTCAACGACTCAAAGTATATACCAGTCGAAGACCTC
CAAGACCCCCGTTGCCATGTCAGGTCCAAAAACAGAGACTTTTCCCTCCCAGTCCCTAAG
TTTAAGCTGGACGAGTTCTATATTGGACAGATTCCACTGAAGGAAGTGACTTTTGCAAGG
CTGAATGACAACGTGCGGGAGACCTTCCTGAAGGATATGTGCCGTAAGTACGGTGAGGTG
GAAGAGGTAGAGATCCTCCTTCACCCCCGTACGCGCAAGCACCTGGGCCTGGCCCGTGTG
CTCTTCACCAGCACTCGGGGCGCCAAGGAAACGGTCAAAAACCTCCACCTTACCTCCGTC
ATGGGCAACATCATCCATGCCCAGCTTGACATCAAAGGACAACAACGAATGAAATACTAT
GAACTAATTGTCAATGGCTCCTACACCCCTCAGACTGTGCCCACTGGGGGCAAGGCCCTG
AGTGAGAAGTTCCAAGGCTCGGGTGCAGCCACTGAGACGGCCGAATCCCGCCGCCGCTCT
TCCTCTGACACAGCTGCCTACCCAGCAGGCACCACTGCGGTGGGCACTCCTGGCAACGGC
ACCCCCTGCTCCCAGGACACAAGCTTCTCCAGCAGCCGACAAGATACCCCATCTTCCTTT
GGCCAGTTCACACCTCAGTCCTCCCAAGGAACCCCCTACACGTCTCGGGGCAGCACCCCC
TACTCTCAGGACTCTGCCTACTCCAGCAGCACCACTTCAACCTCCTTCAAGCCCCGGCGG
TCAGAGAACAGCTACCAAGATGCCTTTTCCCGCCGCCACTTCTCTGCATCTTCAGCCTCC
ACAACCGCCTCCACGGCCATCGCCGCCACCACTGCAGCCACTGCCTCATCCTCCGCCTCT
TCCTCCTCATTGTCCTCGTCCTCCTCGTCATCCTCTTCCTCCTCGTCCTCTCAGTTTCGT
AGTTCTGATGCAAACTACCCAGCGTATTATGAAAGCTGGAATCGCTACCAGCGCCATACT
TCCTACCCACCACGCCGGGCCACACGGGAGGAACCCCCTGGAGCCCCTTTTGCTGAAAAT
ACAGCTGAGCGCTTCCCACCTTCTTACACCTCCTACCTGCCCCCCGAGCCCAGCCGGCCC
ACCGACCAGGACTACCGGCCTCCTGCCTCAGAGGCTCCACCCCCGGAGCCTCCAGAACCT
GGTGGAGGCGGGGGTGGAGGAGGGCCCAGCCCTGAGAGAGAAGAAGTTCGGACTTCCCCC
CGCCCAGCCTCCCCTGCCCGCTCTGGCTCCCCAGCCCCGGAGACCACCAATGAGAGTGTG
CCCTTCGCCCAGCACAGCAGCCTGGATTCCCGCATCGAGATGCTGCTGAAGGAGCAGCGC
TCCAAGTTTTCCTTCTTGGCCTCTGACACAGAGGAGGAGGAAGAGAACAGCAGCATGGTC
CTTGGGGCCAGAGATACAGGGAGTGAGGTGCCTTCTGGGTCAGGGCATGGGCCCTGCACA
CCCCCTCCGGCCCCAGCTAATTTTGAGGATGTGGCACCTACAGGGAGCGGGGAGCCAGGG
GCTACCCGGGAGTCTCCCAAGGCAAATGGACAGAACCAGGCTTCTCCATGCTCTTCTGGA
GACGACATGGAGATCTCCGACGACGACCGGGGTGGCTCACCCCCTCCGGCCCCGACGCCC
CCTCAGCAGCCTCCGCCACCTCCCCCTCCCCCGCCGCCTCCTCCTCCCTACCTGGCGTCC
CTTCCTCTTGGTTATCCTCCCCACCAACCTGCCTACCTCCTCCCACCCAGACCTGATGGG
CCGCCGCCCCCTGAGTACCCCCCACCTCCTCCACCACCCCCGCACATCTATGACTTTGTG
AACTCCTTGGAGCTCATGGACCGACTTGGGGCTCAGTGGGGAGGGATGCCCATGTCCTTC
CAGATGCAGACCCAGATGTTAACTCGGCTCCATCAGCTGCGGCAGGGCAAGGGATTGATT
GCCGCCTCAGCTGGCCCCCCCGGTGGGGCCTTTGGGGAGGCCTTCCTCCCGTTTCCACCC
CCGCAGGAGGCAGCCTACGGCTTGCCGTATGCTCTATATGCACAGGGGCAGGAGGGCAGA
GGGGCATACTCACGGGAGGCCTACCACCTGCCCATGCCAATGGCAGCCGAGCCCCTGCCC
TCCTCCTCAGTCTCGGGAGAGGAGGCCCGGCTGCCACCCAGGGAAGAAGCAGAGCTGGCA
GAGGGCAAGACCCTCCCGACAGCAGGCACCGTGGGCCGTGTGCTCGCCATGCTGGTCCAG
GAGATGAAGAGCATCATGCAGCGAGACCTCAACCGCAAGATGGTGGAGAACGTGGCCTTC
GGAGCCTTTGACCAGTGGTGGGAGAGCAAGGAGGAGAAGGCCAAGCCATTCCAGAACGCG
GCCAAGCAGCAAGCCAAGGAGGAGGATAAAGAGAAGACGAAGCTGAAGGAGCCTGGCCTG
CTGTCCCTCGTGGACTGGGCCAAGAGCGGGGGCACTACGGGCATCGAGGCTTTCGCCTTT
GGGTCAGGGCTGAGAGGGGCCCTGCGGCTGCCTTCATTCAAGGTAAAGCGGAAAGAGCCA
TCGGAAATTTCCGAGGCCAGTGAGGAAAAGAGGCCTCGTCCCTCCACTCCTGCTGAGGAA
GATGAAGACGACCCTGAACAAGAGAAGGAGGCTGGAGAGCCAGGACGTCCGGGGACCAAG
CCCCCGAAGCGGGACGAAGAGCGAGGCAAGACCCAGGGCAAGCACCGCAAGTCCTTTGCT
CTGGACAGCGAAGGGGAGGAGGCATCCCAGGAGTCCTCCTCGGAGAAGGATGAGGAGGAT
GACGAGGAAGATGAGGAAGATGAAGATCGAGAGGAAGCTGTGGATACCACAAAGAAGGAG
ACAGAGGTGTCGGATGGCGAGGACGAGGAAAGCGATTCGTCTTCCAAATGTTCTCTGTAT
GCTGACTCAGATGGCGAAAATGACAGCACATCAGACTCCGAGAGCAGCAGCTCTTCCAGC
TCCTCATCCTCCTCCTCCTCCTCGTCCTCATCCTCCTCGTCCTCTTCATCCTCTGAGTCC
TCCTCTGAAGATGAAGAGGAAGAGGAGCGGCCAGCAGCCCTTCCCTCAGCCTCCCCGCCC
CCCAGAGAAGTCCCAGTGCCCACGCCAGCACCTGTGGAGGTGCCAGTGCCGGAAAGGGTT
GCAGGCTCCCCAGTCACACCCCTGCCCGAACAGGAGGCGTCTCCAGCAAGGCCTGCAGGC
CCCACGGAGGAGTCACCCCCCAGTGCGCCTCTGCGTCCCCCAGAACCACCTGCTGGGCCC
CCGGCCCCTGCCCCACGCCCCGATGAGCGTCCCTCTTCTCCCATCCCCCTCCTGCCCCCA
CCCAAGAAACGCCGGAAAACTGTCTCCTTCTCTGCCATCGAGGTGGTGCCAGCCCCGGAG
CCCCCTCCAGCCACACCGCCGCAGGCCAAGTTTCCCGGCCCAGCCTCCCGCAAGGCTCCC
CGGGGCGTGGAGCGGACCATCCGCAACCTGCCCCTGGACCACGCATCTCTGGTCAAGAGT
TGGCCCGAGGAGGTGTCCCGAGGAGGCCGGAGCCGGGCTGGAGGCCGAGGCCGCCTCACC
GAGGAAGAGGAGGCTGAGCCAGGGACAGAGGTGGACCTGGCGGTCCTGGCCGACCTGGCC
CTGACCCCTGCCCGGCGCGGGCTGCCTGCCCTGCCTGCTGTTGAAGACTCAGAGGCCACA
GAGACATCGGACGAGGCCGAGCGCCCTAGGCCCCTGCTCAGCCACATCCTCCTGGAGCAC
AACTATGCCCTGGCCGTCAAGCCCACGCCCCCTGCGCCAGCCCTGCGGCCCCCGGAGCCA
GTGCCCGCACCCGCCGCCCTCTTCAGTTCCCCAGCTGATGAGGTCCTGGAGGCCCCCGAG
GTGGTGGTGGCTGAGGCGGAGGAGCCCAAGCCGCAGCAACTGCAGCAGCAGCGGGAGGAG
GGCGAAGAGGAGGGGGAGGAAGAGGGGGAGGAAGAGGAGGAGGAGTCCTCTGACAGCAGC
AGCAGCAGCGATGGGGAGGGCGCCCTCCGGAGGCGCAGCCTCCGCTCCCACGCCCGGCGC
CGCCGCCCTCCGCCCCCACCCCCGCCGCCACCGCCCCGCGCCTACGAGCCACGCAGTGAG
TTTGAACAGATGACCATCCTGTATGACATTTGGAACTCGGGCCTGGACTCAGAGGACATG
AGTTACCTGCGGCTTACGTACGAGCGGCTGCTGCAGCAGACAAGCGGGGCTGACTGGCTC
AACGACACTCACTGGGTCCATCACACAATCACCAACCTGACCACCCCAAAACGCAAGCGG
CGGCCCCAGGATGGGCCCCGGGAGCACCAGACAGGCTCAGCCCGCAGCGAAGGCTACTAC
CCCATCAGCAAGAAGGAGAAGGACAAGTACCTGGACGTGTGCCCAGTCTCGGCCCGGCAG
CTGGAGGGCGTGGACACTCAGGGGACGAACCGCGTGCTGTCCGAGCGCCGGTCCGAGCAG
CGGCGGCTGCTGAGCGCCATCGGTACCTCCGCCATCATGGACAGTGACCTGCTGAAACTC
AACCAGCTCAAGTTCCGGAAGAAGAAGCTCCGATTTGGCCGGAGCCGGATCCACGAGTGG
GGTCTGTTTGCCATGGAACCCATTGCTGCTGACGAGATGGTCATCGAATACGTGGGTCAG
AACATCCGTCAGATGGTGGCCGACATGCGGGAGAAGCGCTACGTGCAGGAGGGCATTGGC
AGCAGCTACCTGTTCCGGGTGGACCACGACACCATCATCGATGCCACCAAGTGTGGCAAC
CTGGCCAGATTCATCAACCACTGCTGCACGCCTAACTGCTACGCCAAGGTCATCACCATC
GAGTCCCAGAAGAAGATCGTGATCTACTCCAAGCAGCCCATTGGCGTGGACGAGGAGATC
ACCTACGACTACAAGTTCCCACTGGAAGACAACAAGATCCCGTGTCTGTGTGGCACAGAG
AGCTGCCGGGGCTCCCTAAACTGA
|
Protein Properties |
Number of Residues
| 1707 |
Molecular Weight
| 186032.16 |
Theoretical pI
| 5.141 |
Pfam Domain Function
|
|
Signals
|
Not Available
|
Transmembrane Regions
|
Not Available
|
Protein Sequence
|
>Histone-lysine N-methyltransferase SETD1A
MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDL
QDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEV
EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIHAQLDIKGQQRMKYY
ELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNG
TPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRR
SENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFR
SSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRP
TDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESV
PFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCT
PPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTP
PQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFV
NSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPP
PQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELA
EGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNA
AKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEP
SEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA
LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLY
ADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPP
PREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGP
PAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAP
RGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLA
LTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEP
VPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSS
SSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDM
SYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYY
PISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKL
NQLKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG
SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEI
TYDYKFPLEDNKIPCLCGTESCRGSLN
|
External Links |
GenBank ID Protein
| 55741677 |
UniProtKB/Swiss-Prot ID
| O15047 |
UniProtKB/Swiss-Prot Entry Name
| SET1A_HUMAN |
PDB IDs
|
|
GenBank Gene ID
| NM_014712.1 |
GeneCard ID
| SETD1A |
GenAtlas ID
| SETD1A |
HGNC ID
| HGNC:29010 |
References |
General References
| Not Available |