NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907140377|ref|XP_036018175|]
View 

MAX gene-associated protein isoform X2 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.01e-136

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


:

Pssm-ID: 410321  Cd Length: 186  Bit Score: 424.92  E-value: 1.01e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2403-2467 1.65e-32

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


:

Pssm-ID: 381481  Cd Length: 65  Bit Score: 121.43  E-value: 1.65e-32
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLS 2467
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
MGA_dom super family cl24582
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1038-1079 1.29e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


The actual alignment was detected with superfamily member pfam16059:

Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.29e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1907140377 1038 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1079
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 super family cl39316
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1457-1783 4.41e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


The actual alignment was detected with superfamily member pfam17823:

Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 64.98  E-value: 4.41e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1457 AYKRKPSSTTSGLIQVASNAKVAASRKPRTLLPSTSNSKMassgPATNRSGKNLKAFVPAKRPIAARPSPGGVFTQFVMS 1536
Cdd:pfam17823  110 AASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAA----CRANASAAPRAAIAAASAPHAASPAPRTAASSTTAA 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1537 KVGALQQKIPGVRTpqpltgpqkfSIRPSPVMVVTPVVSSEQVQV---CSTVAAAVTTSPQVfLENVTAVPSLTANSDMG 1613
Cdd:pfam17823  186 SSTTAASSAPTTAA----------SSAPATLTPARGISTAATATGhpaAGTALAAVGNSSPA-AGTVTAAVGTVTPAALA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1614 AKEATYSSSASTAGVVEISETNNTTLVTSTQS---TATVNLTKTTGITTS-PVASVSFAKPLVAS---PTITlPVASTAS 1686
Cdd:pfam17823  255 TLAAAAGTVASAAGTINMGDPHARRLSPAKHMpsdTMARNPAAPMGAQAQgPIIQVSTDQPVHNTagePTPS-PSNTTLE 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1687 TSIVMVTTAASSSVVTTPTS-----SLSSVPIilsgingsPPVSQRPENAPQIPVTTPQISSNNVKRTGPRLLLIPVQQG 1761
Cdd:pfam17823  334 PNTPKSVASTNLAVVTTTKAqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVA 405
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907140377 1762 S----------PTLRPIQNPQL----------QGQRMVL--QPV 1783
Cdd:pfam17823  406 TeatagtasagPTPRSSGDPKTlamascqlstQGQYLVVttDPL 449
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1670-1974 9.19e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1670 PLVASPTITLPVASTASTSivmvTTAASSsvvTTPTSSLSSVPIilsgiNGSPPVSQrPENAPQIPVTTPQISSNNVKRT 1749
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPG----TTQAAT---AGPTPSAPSVPP-----QGSPATSQ-PPNQTQSTAAPHTLIQQTPTLH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1750 GPRL-----LLIPVQQGSP----TLRPIQNPQLQGQrmvLQPVRGP--SGMNLFRHP----------------------- 1795
Cdd:pfam03154  239 PQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGQ---MPPMPHSlqTGPSHMQHPvppqpfpltpqssqsqvppgpsp 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1796 ----NGQIVQLLPLHQIRGSNAQPSLQPVVFRNPGSMVGIR---------LPAPCKSSETPSSSASSSAFSVMS----PV 1858
Cdd:pfam03154  316 aapgQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlpppPA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1859 IQAVGSSPTVN--------------------------VISQAPSLLSSGSSFVSQAGTLTLRISPPETQNLASKTGSESk 1912
Cdd:pfam03154  396 LKPLSSLSTHHppsahppplqlmpqsqqlppppaqppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPP- 474
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907140377 1913 ITPSTGGQPVGTASLIPLQSGSFALLQLPGqkPIPSSVLQHVASLQIKKEsqSTDQKDETNS 1974
Cdd:pfam03154  475 ITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEE--ALDEAEEPES 532
 
Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.01e-136

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410321  Cd Length: 186  Bit Score: 424.92  E-value: 1.01e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
T-box pfam00907
T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box ...
77-260 4.79e-108

T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.


Pssm-ID: 459990  Cd Length: 182  Bit Score: 342.23  E-value: 4.79e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   77 VTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILGRV 156
Cdd:pfam00907    1 VSLENKELWKKFHELGTEMIITKSGRRMFPTLKVSVSGLDPNAKYSVLLDIVPVDDKRYKFHNGKWVVAGKAEPHSPPRV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  157 FIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaEKATEVIQLNGPGVHTFTFPQTEFFAV 236
Cdd:pfam00907   81 YIHPDSPATGSHWMKQPVSFDKLKLTNNKEDKNGHIILNSMHKYQPRLHIV--RVGGDEPSLPEENVKTFVFPETEFIAV 158
                          170       180
                   ....*....|....*....|....
gi 1907140377  237 TAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:pfam00907  159 TAYQNEEITQLKIDNNPFAKGFRD 182
TBOX smart00425
Domain first found in the mice T locus (Brachyury) protein;
75-264 7.66e-88

Domain first found in the mice T locus (Brachyury) protein;


Pssm-ID: 214656  Cd Length: 190  Bit Score: 284.93  E-value: 7.66e-88
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377    75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:smart00425    1 IKVSLEDKELWRKFHELGTEMIVTKSGRRMFPTLKYKVSGLDPNALYSVLMDLVPVDDKRYKFNNGKWVVAGKAEPHMPS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGH--IILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTE 232
Cdd:smart00425   81 RVYVHPDSPATGAHWMKQPVSFDKVKLTNNQSDKNGHlqIILNSMHKYQPRLHIV---EVDDISKEILSQFKTFVFPETQ 157
                           170       180       190
                    ....*....|....*....|....*....|..
gi 1907140377   233 FFAVTAYQNIQITQLKIDYNPFAKGFRDDGLS 264
Cdd:smart00425  158 FIAVTAYQNQKITKLKIDNNPFAKGFRDQGRR 189
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2403-2467 1.65e-32

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


Pssm-ID: 381481  Cd Length: 65  Bit Score: 121.43  E-value: 1.65e-32
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLS 2467
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
MGA_dom pfam16059
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1038-1079 1.29e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.29e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1907140377 1038 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1079
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1457-1783 4.41e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 64.98  E-value: 4.41e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1457 AYKRKPSSTTSGLIQVASNAKVAASRKPRTLLPSTSNSKMassgPATNRSGKNLKAFVPAKRPIAARPSPGGVFTQFVMS 1536
Cdd:pfam17823  110 AASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAA----CRANASAAPRAAIAAASAPHAASPAPRTAASSTTAA 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1537 KVGALQQKIPGVRTpqpltgpqkfSIRPSPVMVVTPVVSSEQVQV---CSTVAAAVTTSPQVfLENVTAVPSLTANSDMG 1613
Cdd:pfam17823  186 SSTTAASSAPTTAA----------SSAPATLTPARGISTAATATGhpaAGTALAAVGNSSPA-AGTVTAAVGTVTPAALA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1614 AKEATYSSSASTAGVVEISETNNTTLVTSTQS---TATVNLTKTTGITTS-PVASVSFAKPLVAS---PTITlPVASTAS 1686
Cdd:pfam17823  255 TLAAAAGTVASAAGTINMGDPHARRLSPAKHMpsdTMARNPAAPMGAQAQgPIIQVSTDQPVHNTagePTPS-PSNTTLE 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1687 TSIVMVTTAASSSVVTTPTS-----SLSSVPIilsgingsPPVSQRPENAPQIPVTTPQISSNNVKRTGPRLLLIPVQQG 1761
Cdd:pfam17823  334 PNTPKSVASTNLAVVTTTKAqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVA 405
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907140377 1762 S----------PTLRPIQNPQL----------QGQRMVL--QPV 1783
Cdd:pfam17823  406 TeatagtasagPTPRSSGDPKTlamascqlstQGQYLVVttDPL 449
HLH pfam00010
Helix-loop-helix DNA-binding domain;
2402-2453 8.62e-09

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 53.62  E-value: 8.62e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907140377 2402 YRRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLT 2453
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLpTLPPDKKLSKAEILRLAIEYIKHLQ 53
HLH smart00353
helix loop helix domain;
2407-2458 1.10e-05

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 44.90  E-value: 1.10e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1907140377  2407 TANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADK 2458
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLpTLPKNKKLSKAEILRLAIEYIKSLQEELQK 53
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1550-1748 1.36e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1550 TPQPLTGPQKFSIRPSPVMVVTPVVSSEQVQ--VCSTVAAAVTTSPQVflENVTAVPSLTANSDMGAKEATYSSSASTAG 1627
Cdd:COG3469     13 GGASATAVTLLGAAATAASVTLTAATATTVVstTGSVVVAASGSAGSG--TGTTAASSTAATSSTTSTTATATAAAAAAT 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1628 VVEISETNNTTLVTSTQSTATVNLTKTTGITTSPVASVSFAKPLVASPTITLPVASTASTSIVMV----TTAASSSVVTT 1703
Cdd:COG3469     91 STSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGtetaTGGTTTTSTTT 170
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907140377 1704 PTSSLSSVPiILSGINGSPPVSQRPENAPQIPVTTPQISSNNVKR 1748
Cdd:COG3469    171 TTTSASTTP-SATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
PHA03255 PHA03255
BDLF3; Provisional
1620-1757 1.47e-05

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 49.13  E-value: 1.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1620 SSSASTAGVVEISETNNTTLVTSTQSTATVNLTKTTGITTSPV---ASVSFAKPLVASPTITLPVASTAS--TSIVMVTT 1694
Cdd:PHA03255    26 SSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPItttAILSTNTTTVTSTGTTVTPVPTTSnaSTINVTTK 105
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907140377 1695 AASSSVVTTPTSSLSSVPIILSGINGSPPVSQRPENAPQIPVTTPQISS---NNVKRTGPRLLLIP 1757
Cdd:PHA03255   106 VTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSkgtSNATKTTAELPTVP 171
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
1657-1834 4.11e-05

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 49.15  E-value: 4.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1657 ITTSPVASVSFAKPLVASPTITLPVASTA-STSIvmvTTAASSSVVTTPTSS--LSSVPIILSGINGSPPVSQRPENAPQ 1733
Cdd:cd22536    262 LVQPSDGGVSNGNQLVSTPITTASVSTMPeSPSS---STTCTTTASTSLTSSdtLVSSAETGQYASTAASSERTEEEPQT 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1734 IPVTTPQISSNNVKRTGprllLIPVQQGSPTLrpiQNPQLQGQRMVLQPVRGPSGMNLFRHPNGQIVQLLPLHQIRGSNA 1813
Cdd:cd22536    339 SAAESEAQSSSQLQSNG----LQNVQDQSNSL---QQVQIVGQPILQQIQIQQPQQQIIQAIQPQSFQLQSGQTIQTIQQ 411
                          170       180
                   ....*....|....*....|...
gi 1907140377 1814 QP--SLQPVVFRNPgSMVGIRLP 1834
Cdd:cd22536    412 QPlqNVQLQAVQSP-TQVLIRAP 433
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1670-1974 9.19e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1670 PLVASPTITLPVASTASTSivmvTTAASSsvvTTPTSSLSSVPIilsgiNGSPPVSQrPENAPQIPVTTPQISSNNVKRT 1749
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPG----TTQAAT---AGPTPSAPSVPP-----QGSPATSQ-PPNQTQSTAAPHTLIQQTPTLH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1750 GPRL-----LLIPVQQGSP----TLRPIQNPQLQGQrmvLQPVRGP--SGMNLFRHP----------------------- 1795
Cdd:pfam03154  239 PQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGQ---MPPMPHSlqTGPSHMQHPvppqpfpltpqssqsqvppgpsp 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1796 ----NGQIVQLLPLHQIRGSNAQPSLQPVVFRNPGSMVGIR---------LPAPCKSSETPSSSASSSAFSVMS----PV 1858
Cdd:pfam03154  316 aapgQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlpppPA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1859 IQAVGSSPTVN--------------------------VISQAPSLLSSGSSFVSQAGTLTLRISPPETQNLASKTGSESk 1912
Cdd:pfam03154  396 LKPLSSLSTHHppsahppplqlmpqsqqlppppaqppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPP- 474
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907140377 1913 ITPSTGGQPVGTASLIPLQSGSFALLQLPGqkPIPSSVLQHVASLQIKKEsqSTDQKDETNS 1974
Cdd:pfam03154  475 ITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEE--ALDEAEEPES 532
 
Name Accession Description Interval E-value
T-box_MGA-like cd20195
DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known ...
75-260 1.01e-136

DNA-binding domain of MAX gene-associated protein and related T-box proteins; MGA (also known as MGAP, MAX dimerization protein, MAD5, MXD5) is a dual-specificity transcription factor that regulates the expression of both, MAX-network and T-box family target genes. MGA functions as a repressor or an activator; it binds to 5'-AATTTCACACCTAGGTGTGAAATT-3' core sequence. Its function is activated by heterodimerization with MAX. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410321  Cd Length: 186  Bit Score: 424.92  E-value: 1.01e-136
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20195      1 ITVTLENNSMWNEFYRCGTEMILTKQGRRMFPYCRFRISGLDPDRNYILVMDISPVDNFRYRWNGRWWEPSGKAEPHVLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20195     81 RVFIHPESPATGRHWMDQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHVVPADKEVDVIQLNGPDVHTFTFPQTEFF 160
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20195    161 AVTAYQNKQITQLKIDYNPFAKGFRE 186
T-box pfam00907
T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box ...
77-260 4.79e-108

T-box; The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.


Pssm-ID: 459990  Cd Length: 182  Bit Score: 342.23  E-value: 4.79e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   77 VTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILGRV 156
Cdd:pfam00907    1 VSLENKELWKKFHELGTEMIITKSGRRMFPTLKVSVSGLDPNAKYSVLLDIVPVDDKRYKFHNGKWVVAGKAEPHSPPRV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  157 FIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaEKATEVIQLNGPGVHTFTFPQTEFFAV 236
Cdd:pfam00907   81 YIHPDSPATGSHWMKQPVSFDKLKLTNNKEDKNGHIILNSMHKYQPRLHIV--RVGGDEPSLPEENVKTFVFPETEFIAV 158
                          170       180
                   ....*....|....*....|....
gi 1907140377  237 TAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:pfam00907  159 TAYQNEEITQLKIDNNPFAKGFRD 182
T-box_TBX4_5-like cd20189
DNA-binding domain of T-box transcription factor 4 and 5, and related T-box proteins; This ...
75-260 2.47e-90

DNA-binding domain of T-box transcription factor 4 and 5, and related T-box proteins; This subfamily includes the T-box transcription factors TBX4 and TBX5 which play important roles in vertebrate limb and heart development, and in lung and trachea development. TBX4 is needed for normal skeletal and muscular hindlimb development and is involved in super-enhancer-driven transcriptional programs underlying features specific to lung fibroblasts. TBX5 plays a role in regulating cardiac conduction system function, and in coordinating forelimb muscle pattern. Mutations in human TBX5 and TBX4 are associated with Holt-Oram syndrome and Small Patella syndrome, respectively. Both syndromes are characterized by limb defects in addition to other abnormalities. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410315  Cd Length: 185  Bit Score: 292.03  E-value: 2.47e-90
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20189      1 IKVFLENRELWQKFHEVGTEMIITKAGRRMFPSIKVKVTGLNPKTKYILLMDIVPADDHRYKFHDSEWVVAGKAEPAMPG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKaTEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20189     81 RLYVHPDSPATGAHWMRQLVSFQKLKLTNNHLDQFGHIILNSMHKYQPRIHIVQADD-NNAFGSKNTAFSTHVFPETAFI 159
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20189    160 AVTAYQNHQITQLKIENNPFAKGFRG 185
TBOX smart00425
Domain first found in the mice T locus (Brachyury) protein;
75-264 7.66e-88

Domain first found in the mice T locus (Brachyury) protein;


Pssm-ID: 214656  Cd Length: 190  Bit Score: 284.93  E-value: 7.66e-88
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377    75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:smart00425    1 IKVSLEDKELWRKFHELGTEMIVTKSGRRMFPTLKYKVSGLDPNALYSVLMDLVPVDDKRYKFNNGKWVVAGKAEPHMPS 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGH--IILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTE 232
Cdd:smart00425   81 RVYVHPDSPATGAHWMKQPVSFDKVKLTNNQSDKNGHlqIILNSMHKYQPRLHIV---EVDDISKEILSQFKTFVFPETQ 157
                           170       180       190
                    ....*....|....*....|....*....|..
gi 1907140377   233 FFAVTAYQNIQITQLKIDYNPFAKGFRDDGLS 264
Cdd:smart00425  158 FIAVTAYQNQKITKLKIDNNPFAKGFRDQGRR 189
T-box_TBX6_VegT-like cd20190
DNA-binding domain of T-box transcription factor 6, VegT and related T-box proteins; This ...
75-260 2.93e-87

DNA-binding domain of T-box transcription factor 6, VegT and related T-box proteins; This subfamily includes the transcriptional regulators TBX6 and VegT. TBX6 plays an essential role in the fate determination of axial stem to become either neural or mesodermal. It also plays an essential role in the regulation of left/right patterning in mouse embryos through effects on nodal cilia and perinodal signaling. VegT (also known as Antipodean, Brat and Xombi) is required in early Xenopus embryos for the formation of both the mesoderm and endoderm germ layers. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved 1DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410316  Cd Length: 183  Bit Score: 282.93  E-value: 2.93e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20190      1 VSLSLEDRELWKEFSSVGTEMIITKSGRRMFPACKVSVTGLDPEAKYLFLLDVVPVDNARYKWNKRRWEPSGKAEPHLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20190     81 RVYIHPDSPAPGAHWMRQPISFHKLKLTNNTLDPHGHLILHSMHKYQPRIHLV---QSADLCSQHWGGMASFRFPETTFI 157
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20190    158 AVTAYQNPQITKLKIAANPFAKGFRE 183
T-box_VegT-like cd20197
DNA-binding domain of Xenopus VegT and related T-box proteins; VegT, (also known as Antipodean, ...
75-260 1.15e-86

DNA-binding domain of Xenopus VegT and related T-box proteins; VegT, (also known as Antipodean, Brat and Xombi), is a T-box transcription factor required in early Xenopus embryos for the formation of both, the mesoderm and endoderm germ layers. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410323  Cd Length: 183  Bit Score: 281.34  E-value: 1.15e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20197      1 VRASLEDQDLWKKFHQIGTEMIITKSGRRMFPQCKIRVSGLLPYAKYVMLVDFVPVDNFRYKWNKDQWEVAGKAEPQPPC 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20197     81 RTYVHPDSPAPGSHWMKQPISFQKLKLTNNTLDQHGHIILHSMHRYQPRFHIV---QADDLFNVRWSLFQVFSFPETVFT 157
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20197    158 AVTAYQNEKITKLKIDNNPFAKGFRE 183
T-box_TBX6 cd20196
DNA-binding domain of T-box transcription factor 6, and related T-box proteins; TBX6 is a ...
75-260 1.11e-82

DNA-binding domain of T-box transcription factor 6, and related T-box proteins; TBX6 is a T-box transcription factor which plays an essential role in the fate determination of axial stem to become either neural or mesodermal. It also plays an essential role in the regulation of left/right patterning in mouse embryos, through effects on nodal cilia and perinodal signaling. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410322  Cd Length: 182  Bit Score: 269.82  E-value: 1.11e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20196      1 VRMSLENAELWKQFSSVGTEMIITKAGRRMFPQLRVSVSGLDPEARYLLLLDVVPVDGSRYRWQGNSWEASGKAEPRLPD 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKatevIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20196     81 RVYIHPDSPATGAHWMRQPISFHRAKLTNNTLDPHGHIILHSMHRYQPRVHVVRARD----VLSWGGGCASFTFPETQFI 156
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20196    157 TVTAYQNPKITQLKINSNPFAKGFRE 182
T-box_TBX2_3-like cd20188
DNA-binding domain of T-box transcription factor 2 and 3, and related T-box proteins; This ...
75-260 1.36e-82

DNA-binding domain of T-box transcription factor 2 and 3, and related T-box proteins; This subfamily includes the T-box transcription factors TBX2 and TBX3 and similar proteins. TBX2 is an oncogenic transcription factor implicated in developmental processes, including coordinating cell fate, patterning and morphogenesis of a wide range of tissues and organs. It is overexpressed in several cancers, including melanoma and breast, and plays a key role during cardiac development. TBX2 is a negative regulator of promyelocytic leukemia protein (PML) function in cellular senescence, and it interacts with HP1 to recruit a repression complex to EGR1-responsive promoters to drive the proliferation of breast cancer cells. TBX3 has also been implicated in oncogenesis in breast cancer and melanoma. The tbx3 gene is downregulated by PML. TBX3 directly represses TBX2 under the control of the PRC2 complex in skeletal muscle and rhabdomyosarcoma. Also included in this family is the Drosophila melanogaster optomotor-blind protein (Omb, also known as lethal(1)optomotor-blind, or L(1)omb, or protein bifid) which controls many developmental processes such as wing, eye, and abdominal tergites and optic lobes, and induces epithelial cell migration and extrusion in vivo. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410314  Cd Length: 185  Bit Score: 269.69  E-value: 1.36e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG 154
Cdd:cd20188      3 PKVELEAKDLWDQFHKLGTEMVITKSGRRMFPPFKVRVSGLDKKAKYILLMDIVAADDCRYKFHNSRWMVAGKADPEMPK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  155 RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQTEFF 234
Cdd:cd20188     83 RMYIHPDSPSTGEQWMQKVVSFHKLKLTNNISDKHGFTILNSMHKYQPRFHIV---RANDILKLPYSTFRTYVFKETEFI 159
                          170       180
                   ....*....|....*....|....*.
gi 1907140377  235 AVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20188    160 AVTAYQNEKITQLKIDNNPFAKGFRD 185
T-box cd00182
DNA-binding domain of the T-box transcription factor family; The T-box family is an ancient ...
75-252 1.65e-81

DNA-binding domain of the T-box transcription factor family; The T-box family is an ancient family of transcription factors which plays a multitude of diverse functions throughout development. The founding member of the family is Brachyury (also known as TBXT, or T). Members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns. The T-box factors in Caenorhabditis elegans have evolved very differently than those in other organisms; its genome contains 22 T-box genes which encode factors which are diverse in DNA-binding specificity, function and sequence, and only 3 of these factors fall into the conserved T-box subfamilies.


Pssm-ID: 410312  Cd Length: 176  Bit Score: 266.00  E-value: 1.65e-81
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHI-L 153
Cdd:cd00182      1 ITVSLRNEELWKKFHELGTEMIVTKSGRRMFPTLEYSVSGLDPNKLYSVSLHFERVDNKRYKFNNGKWVPSGKAEPPPeP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 GRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLD-QEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPgVHTFTFPQTE 232
Cdd:cd00182     81 SRIYVHPDGPQTGSFWMKKGVSFDKVKITNNKEDkKEGHILLHSMHKYIPVLTIY---EVDDNGLLSKL-VKEFRFPETE 156
                          170       180
                   ....*....|....*....|
gi 1907140377  233 FFAVTAYQNIQITQLKIDYN 252
Cdd:cd00182    157 FIAVTAYQNDEITQLKIDNN 176
T-box_Drosocross-like cd20681
DNA-binding domain of Drosophila Dorsocross and related T-box proteins; Drosophila Dorsocross ...
75-260 1.13e-79

DNA-binding domain of Drosophila Dorsocross and related T-box proteins; Drosophila Dorsocross (Doc) includes three Dorsocross paralogs, Doc1-3. These are key cardiogenic T-box transcription factors during specification and differentiation of heart cells. Drosophila Doc also functions in caudal visceral mesoderm development, and modulates Notch signaling in the developing Drosophila eye by regulating the expression of Delta in the eye imaginal discs. Doc also functions in the morphogenesis of epithelial tissues: in Drosophila, which possesses a single extraembryonic (EE) membrane, it is essential for EE epithelia tissue maintenance while in Tribolium castaneum, which has 2 EE membranes, Doc plays a major role in EE morphogenetic events throughout development without affecting EE tissue specificity or maintenance. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410332  Cd Length: 186  Bit Score: 261.50  E-value: 1.13e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNG-RWWEPSGKAEPHIL 153
Cdd:cd20681      1 VKVTLKNRDLWQQFHREGTEMIITKSGRRMFPSLRLSVSGLEPDARYCVLLEMVLASDCRFKYSGnGGWVPAGGAEPQPP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 G--RVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVpaeKATEVIQLNGPGVHTFTFPQT 231
Cdd:cd20681     81 LprRIYIHPDSPATGDHWMSQPISFSKVKLTNNTLDPQGNIVLTSMHKYQPRIHIV---RCSDTLALPWAPTASFTFPET 157
                          170       180
                   ....*....|....*....|....*....
gi 1907140377  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20681    158 EFIAVTAYQNERITKLKIDNNPFAKGFRE 186
T-box_TBXT_TBX19-like cd20192
DNA-binding domain of T-box transcription factor T, T-box transcription factor 19 and related ...
75-260 5.22e-77

DNA-binding domain of T-box transcription factor T, T-box transcription factor 19 and related T-box proteins; Tbx19 (also known as Tpit) is a T-box factor restricted to two pituitary (pro-opiomelanocortin) POMC-expressing lineages, the corticotrophs and melanotrophs; it controls terminal differentiation of these lineages. TBX19 activates POMC gene transcription with the cooperation of another transcription factor Pitx1. TBXT, also known as Brachyury protein, or protein T, is a transcription factor needed for posterior mesoderm formation and differentiation as well as for the notochord development during embryogenesis. It binds to a 24 base-pair (bp) palindromic site (called the T site) and activates gene transcription when bound to such a site. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. TBXT is the founding member of the T-box family, members of which share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410318  Cd Length: 180  Bit Score: 253.34  E-value: 5.22e-77
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW-NGRWWePSGKAEPHIL 153
Cdd:cd20192      1 IRVTLEDRELWKKFHSLTNEMIVTKSGRRMFPVLKVSVSGLDPNAMYSVLLDFVQVDNHRWKYvNGEWV-PGGKAEPPPP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 GRVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVPAEKateviQLNGPGVHTFTFPQTEF 233
Cdd:cd20192     80 SSVYVHPDSPNFGAHWMKGPVSFSKVKLTNK-PNGEGQIMLNSLHKYEPRVHIVRVGS-----NNHERLVSTFSFPETQF 153
                          170       180
                   ....*....|....*....|....*..
gi 1907140377  234 FAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20192    154 IAVTAYQNEEITALKIKYNPFAKAFLD 180
T-box_TBX1_10-like cd20187
DNA-binding domain of T-box transcription factor 1 and 10, and related T-box proteifactors; ...
75-260 8.74e-76

DNA-binding domain of T-box transcription factor 1 and 10, and related T-box proteifactors; This subfamily includes TBX1 and TBX10. TBX1 is a T-box transcription factor which plays an important role in heart development and has been implicated in DiGeorge or 22q11.2 deletion syndrome. This syndrome is associated with various types of cardiac outflow tract (OFT) and vascular defects. Wnt5a is regulated by TBX1 in the second heart field (SHF). TBX1 is required to maintain the integrity of extracellular matrix-cell interactions in the SHF and this interaction is critical for cardiac (OFT) development. TBX10 is a putative T-box transcription factor. Diseases associated with TBX10 include Isolated Cleft Lip and Cleft Lip/cleft lip with or without cleft palate. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410313  Cd Length: 189  Bit Score: 250.42  E-value: 8.74e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW--NGRWWEPSGKAEPHI 152
Cdd:cd20187      2 VTVQLEMKALWDEFNQLGTEMIVTKAGRRMFPTFQVKIFGMDPMADYMLMMDFVPVDDKRYRYafHSSSWLVAGKADPAM 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFPQTE 232
Cdd:cd20187     82 PGRIHVHPDSPAKGAQWMKQIVSFDKLKLTNNLLDDNGHIILNSMHRYQPRFHVVYVDPRKDSENSAEENFKTFIFPETK 161
                          170       180
                   ....*....|....*....|....*...
gi 1907140377  233 FFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20187    162 FTAVTAYQNHRITQLKIASNPFAKGFRD 189
T-box_TBX20-like cd20193
DNA-binding domain of T-box transcription factor 20 and related T-box proteins; TBX20 is a ...
75-260 8.04e-73

DNA-binding domain of T-box transcription factor 20 and related T-box proteins; TBX20 is a T-box transcriptional factor which functions in embryonic development and its deficiency is associated with congenital heart disease. It acts both as a transcriptional activator and a repressor required for cardiac development, and has key roles in maintaining the functional and structural phenotypes in the adult heart. The TBX20-cardiac transcription factor CASZ1 protein complex is protective against dilated cardiomyopathy and is essential for maintaining cardiac homeostasis. TBX20 has also been shown to regulate angiogenesis through the PROK2-PROKR1 (prokineticin receptor 1) pathway and is involved in both, pathological and developmental, angiogenesis. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410319  Cd Length: 190  Bit Score: 241.95  E-value: 8.04e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW--NGRWWEPSGKAEPHI 152
Cdd:cd20193      1 VQCHLETKELWDKFHELGTEMIITKSGRRMFPTVRVSFSGVDPDAKYIVLMDIVPVDNKRYRYayHRSSWLVAGKADPPL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKAT-EVIQLNGPGVHTFTFPQT 231
Cdd:cd20193     81 PARLYVHPDSPFTGEQLLKQMVSFEKVKLTNNELDKHGHIILNSMHKYQPRVHIVKKKDHTaSLVNLKSEEFRTFIFPET 160
                          170       180
                   ....*....|....*....|....*....
gi 1907140377  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20193    161 VFTAVTAYQNQLITKLKIDSNPFAKGFRD 189
T-box-like cd20682
T-box DNA-binding domain; uncharacterized subfamily; The T-box family is an ancient group that ...
75-260 9.15e-73

T-box DNA-binding domain; uncharacterized subfamily; The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.


Pssm-ID: 410333  Cd Length: 191  Bit Score: 241.91  E-value: 9.15e-73
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW---NGRwWEPSGKAEPH 151
Cdd:cd20682      1 IQVELCSRELWLQFHNLGNEMIITKAGRRMFPALKVKLTGLDPDKLYIVWVDIVPVDSNRYRYvyhSSK-WVVAGSGDVL 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  152 ILGRVFIHPESPSTGHYWMHQPVSFYKLKLTNN-TLDQEGHIILHSMHRYLPRLHLVPAE-KATEVIQLNGPGVH-TFTF 228
Cdd:cd20682     80 PPANRYIHPDSPASGKYWMSQIVSFDKLKLTNNkEPKQKGQISLHSMHKYQPRIHIQPVEdDGRNVEKAINSSKAlSFEF 159
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907140377  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20682    160 PETSFITVTAYQNQQITKLKIASNPFAKGFRD 191
T-box_TBX15_18_22-like cd20191
DNA-binding domain of T-box transcription factor 15, 18 and 22, and related T-box proteins; ...
75-260 5.91e-71

DNA-binding domain of T-box transcription factor 15, 18 and 22, and related T-box proteins; This subfamily includes the transcriptional regulators TBX15, TBX18 and TBX22 which are involved in various developmental processes. TBX15 (also known as TBX14) plays an important role in the development of the skeleton of the limb, vertebral column and head, possibly through its control of the number of mesenchymal precursor cells and chondrocytes; it also plays a role in the differentiation of brown and brite adipocytes. TBX18 is involved in the developmental processes of a variety of tissues and organs, including the ureter, vertebral column. epicardium and coronary vessels; it is important for the development of the head portion of the sino atrial node (SAN). Mutations in the T-box transcription factor gene TBX22 are found in X-linked Cleft Palate with or without Ankyloglossia syndrome (CPX syndrome), and associated with cleft lip and palate, and tooth agenesis. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410317  Cd Length: 194  Bit Score: 236.71  E-value: 5.91e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW--NGRWWEPSGKAEPHI 152
Cdd:cd20191      3 IQVELQGSELWKRFHDIGTEMIITKAGRRMFPAIRVKVSGLDPHAQYIVAMDIVPVDNKRYRYvyHSSKWMVAGNADAPV 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEV----IQLNGPGVHTFTF 228
Cdd:cd20191     83 PPRVYIHPDSPASGETWMRQVVSFDKLKLTNNEMDDQGHIILHSMHKYQPRVHVIRKDSSTDLspkkPVPPGEGVKTFSF 162
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907140377  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20191    163 PETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 194
T-box_TBR1_2_21-like cd20194
DNA-binding domain of T-box brain protein 1 and 2, T-box transcription factor 21 and related ...
74-260 4.55e-69

DNA-binding domain of T-box brain protein 1 and 2, T-box transcription factor 21 and related T-box proteins; TBX21 (also known as T-cell-specific T-box transcription factor T-bet or transcription factor TBLYM) is a lineage-defining transcription factor which directs T helper type 1 (Th1) cell differentiation. This subfamily includes TBR1 (also known as T-brain-1, or TES-56), which is a neuron-specific transcription factor involved in forebrain development, and TBR2 (also known as Eomesodermin, Eomes, or T-brain-2), which is associated with neurogenesis, cardiogenesis and tumor immune response. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410320  Cd Length: 185  Bit Score: 230.83  E-value: 4.55e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEP-HI 152
Cdd:cd20194      1 KASVYLCNRDLWLKFHQHQTEMIITKQGRRMFPTLSFNLSGLDPTAHYNVFVDMVLADPNHWKFQSGKWVPCGKAEGlPQ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHlvpaekateVIQLNGPG------VHTF 226
Cdd:cd20194     81 GNRVYVHPDSPNTGAHWMKQEISFSKLKLTNNKGADQGMIVLNSMHKYQPRIH---------VIEVGGNGpneqrnLQTH 151
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1907140377  227 TFPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20194    152 SFPETQFIAVTAYQNTDITQLKIDHNPFAKGFRD 185
T-box_TBX21 cd20203
DNA-binding domain of T-box transcription factor 21 and related T-box proteins; TBX21 (also ...
74-260 1.31e-66

DNA-binding domain of T-box transcription factor 21 and related T-box proteins; TBX21 (also known as T-cell-specific T-box transcription factor T-bet or transcription factor TBLYM) is a lineage-defining transcription factor which directs T helper type 1 (Th1) cell differentiation. It initiates Th1 lineage development from naive T helper precursor cells both by initiating the Th1 genetic programs and by inhibiting the opposing Th2 and Th17 lineage-commitment programs. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410329  Cd Length: 191  Bit Score: 224.07  E-value: 1.31e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHIL 153
Cdd:cd20203      1 KLQVLLNNHPLWSKFHKHQTEMIITKQGRRMFPFLSFNLTGLDPTAHYNVYVDVVLADQHHWRYQGGKWVQCGKAEGNMP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 G-RVFIHPESPSTGHYWMHQPVSFYKLKLTNN---TLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLNGPGVHTFTFP 229
Cdd:cd20203     81 GnRLYVHPDSPNTGAHWMRQEVSFGKLKLTNNkgaSNNVTQMIVLQSLHKYQPRLHIVEVKEGETEEAYSSSKTHTFTFP 160
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1907140377  230 QTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20203    161 ETQFIAVTAYQNAEITQLKIDHNPFAKGFRD 191
T-box_TBX22-like cd20200
DNA-binding domain of T-box transcription factor 22 and related T-box proteins; TBX22 is a ...
74-260 1.42e-63

DNA-binding domain of T-box transcription factor 22 and related T-box proteins; TBX22 is a transcriptional regulator involved in developmental processes. Mutations in the T-Box transcription factor gene TBX22 are found in X-linked Cleft Palate with or without Ankyloglossia syndrome (CPX syndrome). TBX22 mutation is also associated with cleft lip and palate, and tooth agenesis. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410326  Cd Length: 194  Bit Score: 215.55  E-value: 1.42e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW---NGRWWEPSGKAEP 150
Cdd:cd20200      2 KVQVELQGSELWKRFHEIGTEMIITKAGRRMFPSVRVKVKGLDPLKQYYIAMDVVPVDSKRYRYvyhSSQWMVAGNTDHS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  151 HILGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLV---PAEKATEVIQLNGPGVHTFT 227
Cdd:cd20200     82 CITPRLYVHPDSPCSGETWMRQIISFDRVKLTNNEMDDKGHIILQSMHKYKPRVHVIlqdSRFDLSQIQSLPAEGVKTFS 161
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1907140377  228 FPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20200    162 FPETEFTTVTAYQNQQITKLKIDRNPFAKGFRD 194
T-box_TBX18_like cd20199
DNA-binding domain of T-box transcription factor 18 and related T-box proteins; TBX18 acts as ...
75-260 2.00e-63

DNA-binding domain of T-box transcription factor 18 and related T-box proteins; TBX18 acts as a transcription repressor involved in the developmental processes of a variety of tissues and organs, including the ureter, vertebral column. epicardium and coronary vessels. TBX18 is important for the development of the head portion of the sino atrial node (SAN); SAN is the pacemaker region of the heart that initiates each heartbeat. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410325  Cd Length: 195  Bit Score: 215.30  E-value: 2.00e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   75 ITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW--NGRWWEPSGKAEPHI 152
Cdd:cd20199      4 VRVDLQGADLWKRFHEIGTEMIITKAGRRMFPAMRVKITGLDPHQQYYIAMDIVPVDNKRYRYvyHSSKWMVAGNADSPV 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  153 LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLN----GPGVHTFTF 228
Cdd:cd20199     84 PPRVYIHPDSPASGETWMRQVISFDKLKLTNNELDDQGHIILHSMHKYQPRVHVIRKECGEELSPVKpipsGEGVKAFSF 163
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907140377  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20199    164 PETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 195
T-box_TBX15-like cd20198
DNA-binding domain of T-box transcription factor 15 and related T-box proteins; TBX15 (also ...
74-260 6.99e-61

DNA-binding domain of T-box transcription factor 15 and related T-box proteins; TBX15 (also known as TBX14) plays an important role in the development of the skeleton of the limb, vertebral column and head, possibly through its control of the number of mesenchymal precursor cells and chondrocytes. TBX15 also plays a role in the differentiation of brown and brite adipocytes. This subgroup belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410324  Cd Length: 198  Bit Score: 208.04  E-value: 6.99e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW--NGRWWEPSGKAEPH 151
Cdd:cd20198      6 EIQVELQCADLWKRFHDIGTEMIITKAGRRMFPAMRVKITGLDPHQQYYIAMDIVPVDNKRYRYvyHSSKWMVAGNADSP 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  152 ILGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTLDQEGHIILHSMHRYLPRLHLVPAEKATEVIQLN----GPGVHTFT 227
Cdd:cd20198     86 VPPRVYIHPDSLASGDTWMRQVVSFDKLKLTNNELDDQGHIILHSMHKYQPRVHVIRKDFSSDLSPTKpvptGDGVKTFS 165
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1907140377  228 FPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20198    166 FPETVFTTVTAYQNQQITRLKIDRNPFAKGFRD 198
T-box_TBR1 cd20204
DNA-binding domain of T-box brain protein 1 and related T-box proteins; TBR1 (also known as ...
74-260 5.90e-59

DNA-binding domain of T-box brain protein 1 and related T-box proteins; TBR1 (also known as T-brain-1 or TES-56) is a neuron-specific transcription factor of the T-box family and involved in forebrain development. It has been recognized as a high-confidence risk gene for autism spectrum disorders (ASD); it regulates the expression of ASD-related genes that are critical for cortical development. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410330  Cd Length: 191  Bit Score: 202.27  E-value: 5.90e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHIL 153
Cdd:cd20204      1 KAQVYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNISGLDPTAHYNIFVDVILADPNHWRFQGGKWVPCGKADTNVQ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 G-RVFIHPESPSTGHYWMHQPVSFYKLKLTNN--TLDQEGH-IILHSMHRYLPRLHLVPA-EKATEviQLNGPG-VHTFT 227
Cdd:cd20204     81 GnRVYMHPDSPNTGAHWMRQEISFGKLKLTNNkgASNNNGQmVVLQSLHKYQPRLHVVEVnEDGTE--DTSQPGrVQTFT 158
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1907140377  228 FPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20204    159 FPETQFIAVTAYQNTDITQLKIDHNPFAKGFRD 191
T-box_Fungi_incertae_sedis cd20683
T-box DNA-binding domain; uncharacterized subfamily of fungi classified as Fungi incertae ...
76-261 2.46e-57

T-box DNA-binding domain; uncharacterized subfamily of fungi classified as Fungi incertae sedis; Fungi incertae sedis refers to a fungal taxonomic group where its broader relationships are unknown or undefined. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.


Pssm-ID: 410334  Cd Length: 214  Bit Score: 198.39  E-value: 2.46e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   76 TVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKW-NGRW-----------WE 143
Cdd:cd20683      2 QLLLEDADLWAQFHSVQNEMIITKSGRCLFPLLRFRAVNLDPKALYSIALDIEQVSPNRFRFrNGRWnpidkdqrgddAF 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  144 PSGKAEPHI-LGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNTL------------------DQEGHIILHSMHRYLPRL 204
Cdd:cd20683     82 SSGTADKSVlLPESYIHPDGPQTGAFWMANGISFAKIKLSNRQPnssdrdgpkenitnsisaLPDGHFFLTSFHKYQPRL 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140377  205 HLVPAEKATEVIQLngpgVHTFTFPQTEFFAVTAYQNIQITQLKIDYNPFAKGFRDD 261
Cdd:cd20683    162 HLIQHSAGDHDDIL----STTFTFEETEFIAVTHYQNEKVNILKKDYNPHAKGFKDD 214
T-box_TBX19-like cd20201
DNA-binding domain of T-box transcription factor 19 and related T-box proteins; Tbx19 (also ...
71-260 3.28e-56

DNA-binding domain of T-box transcription factor 19 and related T-box proteins; Tbx19 (also known as Tpit) is a T-box factor restricted to two pituitary (pro-opiomelanocortin) POMC-expressing lineages, the corticotrophs and melanotrophs; it controls terminal differentiation of these lineages. TBX19 activates POMC gene transcription with the cooperation of another transcription factor Pitx1. Mutations of the human TPIT gene cause early onset pituitary adrenocorticotrophic hormone (ACTH) deficiency. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410327  Cd Length: 183  Bit Score: 194.09  E-value: 3.28e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   71 TVGKITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEP 150
Cdd:cd20201      2 TEKQLQVSLEDAELWQRFKEVTNEMIVTKNGRRMFPVLKISVSGLDPNAMYSFLLDFAPADGHRWKYVNGEWVPAGKPEP 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  151 HILGRVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVpaekateviQLNGPG--VHTFTF 228
Cdd:cd20201     82 HSHSCVYIHPDSPNFGAHWMKAPISFSKVKLTNK-LNGGGQIMLNSLHKYEPQIHIV---------RVGGPHrmVTNCSF 151
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907140377  229 PQTEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20201    152 PETQFIAVTAYQNEEITALKIKYNPFAKAFLD 183
T-box_TBR2 cd20205
DNA-binding domain of T-box brain protein 2 and related T-box proteins; TBR2 (also known as ...
77-260 3.84e-56

DNA-binding domain of T-box brain protein 2 and related T-box proteins; TBR2 (also known as Eomesodermin, Eomes, or T-brain-2) is a member of the T-box family of transcription factors and is associated with neurogenesis, cardiogenesis and tumor immune response. This subfamily belongs to the T-box family of transcription factors which plays a multitude of diverse functions throughout development. The founding member of the T-box family is Brachyury (also known as TBXT, or T). T-box family members share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410331  Cd Length: 191  Bit Score: 194.13  E-value: 3.84e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   77 VTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHILG-R 155
Cdd:cd20205      4 VYLCNRPLWLKFHRHQTEMIITKQGRRMFPFLSFNITGLNPTAHYNVFVEVVLADPNHWRFQGGKWVTCGKADNNMQGnK 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  156 VFIHPESPSTGHYWMHQPVSFYKLKLTNN---TLDQEGHIILHSMHRYLPRLHLVP-AEKATEviQLNGPG-VHTFTFPQ 230
Cdd:cd20205     84 VYVHPESPNTGAHWMRQEISFGKLKLTNNkgaNNNNTQMIVLQSLHKYQPRLHIVEvSEDGVE--DLNDSSkTQTFTFPE 161
                          170       180       190
                   ....*....|....*....|....*....|
gi 1907140377  231 TEFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20205    162 NQFIAVTAYQNTDITQLKIDHNPFAKGFRD 191
T-box_TBXT cd20202
DNA-binding domain of T-box transcription factor T and related T-box proteins; TBXT, also ...
74-260 7.36e-56

DNA-binding domain of T-box transcription factor T and related T-box proteins; TBXT, also known as Brachyury protein, or protein T, is a transcription factor needed for posterior mesoderm formation and differentiation as well as for the notochord development during embryogenesis. It binds to a 24 base-pair (bp) palindromic site (called the T site) and activates gene transcription when bound to such a site. This subfamily belongs to the T-box family of transcription factors which play a multitude of diverse functions throughout development. TBXT is the founding member of the T-box family, members of which share a conserved DNA-binding domain (T-box) which binds DNA in a sequence-specific manner. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development, and conserved expression patterns.


Pssm-ID: 410328  Cd Length: 179  Bit Score: 192.95  E-value: 7.36e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377   74 KITVTLDNNSMWNEFHNRSTEMILTKQGRRMFPYCRYWITGLDSNLKYILVMDISPVDSHRYKWNGRWWEPSGKAEPHIL 153
Cdd:cd20202      1 ELKVSLEESELWLRFKELTNEMIVTKNGRRMFPVLKVNVSGLDPNAMYSFLLDFVAADNHRWKYVNGEWVPGGKPEPQAP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377  154 GRVFIHPESPSTGHYWMHQPVSFYKLKLTNNtLDQEGHIILHSMHRYLPRLHLVpaekateviQLNGPG--VHTFTFPQT 231
Cdd:cd20202     81 SCVYIHPDSPNFGAHWMKAPVSFSKVKLTNK-LNGGGQIMLNSLHKYEPRIHIV---------RVGGPQrmITSHSFPET 150
                          170       180
                   ....*....|....*....|....*....
gi 1907140377  232 EFFAVTAYQNIQITQLKIDYNPFAKGFRD 260
Cdd:cd20202    151 QFIAVTAYQNEEITALKIKYNPFAKAFLD 179
bHLHzip_MGA cd18911
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and ...
2403-2467 1.65e-32

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) and similar proteins; MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites.


Pssm-ID: 381481  Cd Length: 65  Bit Score: 121.43  E-value: 1.65e-32
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLS 2467
Cdd:cd18911      1 RRTHTANERRRRNEMRDLFEKLKRTLGLHNLPKVSKYYILKQAFEEIQGLTDQADRLIGQKTLLT 65
bHLHzip_MGA_like cd19682
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) ...
2403-2466 7.84e-21

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MAX gene-associated protein (MGA) family; The MGA family includes MGA, Schizosaccharomyces pombe ESC1 (spESC1) and similar proteins. MGA, also termed MAX dimerization protein 5 (MAD5), is a dual specificity T-box/ bHLHzip transcription factor that regulates the expression of both Max-network and T-box family target genes. It contains a Myc-like bHLHZip motif and requires heterodimerization with Max for binding to the preferred Myc-Max-binding site CACGTG. In addition to the bHLHZip domain, MGA harbors a second DNA-binding domain, the T-box or T-domain. It thus binds the preferred Brachyury-binding sequence and represses transcription of reporter genes containing promoter-proximal Brachyury-binding sites. spESC1 is a bHLHzip protein with homology to human MyoD and Myf-5 myogenic differentiation inducers. It is involved in the sexual differentiation process.


Pssm-ID: 381525 [Multi-domain]  Cd Length: 65  Bit Score: 88.48  E-value: 7.84e-21
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLL 2466
Cdd:cd19682      1 RLRHKKRERERRSELRELFDKLKQLLGLDSDEKASKLAVLTEAIEEIQQLKREEDELQKEKARL 64
MGA_dom pfam16059
MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), ...
1038-1079 1.29e-13

MGA, conserved domain; This domain can be found in the MAX gene-associated protein (Mga), which is a dual-specificity transcription factor that contains both a bHLHZip domain and a T-box domain and is able to bind to and regulate transcriptional targets through both E-box sites as well as T-box-binding elements (TBEs).


Pssm-ID: 464998  Cd Length: 51  Bit Score: 67.51  E-value: 1.29e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1907140377 1038 RKRAPPCNNDFCRLGCVCSSLA-LEKRQPAHCRRPDCMFGCTC 1079
Cdd:pfam16059    2 KDAKKPCDKDYCQLGCVCDSLAgTRPPKREHCGRADCVLGCVC 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1457-1783 4.41e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 64.98  E-value: 4.41e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1457 AYKRKPSSTTSGLIQVASNAKVAASRKPRTLLPSTSNSKMassgPATNRSGKNLKAFVPAKRPIAARPSPGGVFTQFVMS 1536
Cdd:pfam17823  110 AASRALAAAASSSPSSAAQSLPAAIAALPSEAFSAPRAAA----CRANASAAPRAAIAAASAPHAASPAPRTAASSTTAA 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1537 KVGALQQKIPGVRTpqpltgpqkfSIRPSPVMVVTPVVSSEQVQV---CSTVAAAVTTSPQVfLENVTAVPSLTANSDMG 1613
Cdd:pfam17823  186 SSTTAASSAPTTAA----------SSAPATLTPARGISTAATATGhpaAGTALAAVGNSSPA-AGTVTAAVGTVTPAALA 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1614 AKEATYSSSASTAGVVEISETNNTTLVTSTQS---TATVNLTKTTGITTS-PVASVSFAKPLVAS---PTITlPVASTAS 1686
Cdd:pfam17823  255 TLAAAAGTVASAAGTINMGDPHARRLSPAKHMpsdTMARNPAAPMGAQAQgPIIQVSTDQPVHNTagePTPS-PSNTTLE 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1687 TSIVMVTTAASSSVVTTPTS-----SLSSVPIilsgingsPPVSQRPENAPQIPVTTPQISSNNVKRTGPRLLLIPVQQG 1761
Cdd:pfam17823  334 PNTPKSVASTNLAVVTTTKAqakepSASPVPV--------LHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVA 405
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907140377 1762 S----------PTLRPIQNPQL----------QGQRMVL--QPV 1783
Cdd:pfam17823  406 TeatagtasagPTPRSSGDPKTlamascqlstQGQYLVVttDPL 449
bHLHzip_Myc cd11400
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Myc family; The Myc family is a ...
2403-2480 2.50e-09

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Myc family; The Myc family is a member of the bHLHzip family of transcription factors that play important roles in the control of normal cell proliferation, growth, survival and differentiation. All Myc isoforms contain two independently functioning polypeptide chain regions: N-terminal transactivating residues and a C-terminal bHLHzip segment. The bHLHzip family of bHLH transcription factors are characterized by a highly conserved N-terminal basic region that may bind DNA at a consensus hexanucleotide sequence known as the E-box (CANNTG) followed by HLH and leucine zipper motifs that may interact with other proteins to form homo- and heterodimers. Myc heterodimerizes with Max enabling specific binding to E-box DNA sequences in the promoters of target genes. The Myc proto-oncoprotein family includes at least five different functional members: c-, N-, L-, S- and B-Myc (which is lacking the bHLH domain).


Pssm-ID: 381406 [Multi-domain]  Cd Length: 80  Bit Score: 56.02  E-value: 2.50e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKVSSL 2480
Cdd:cd11400      2 RRLHNVLERQRRNDLKNSFEKLRDLVpELADNEKASKVVILKKATEYIKQLQQEEKKLEKEKDKLKARNEQLRKKLERL 80
HLH pfam00010
Helix-loop-helix DNA-binding domain;
2402-2453 8.62e-09

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 53.62  E-value: 8.62e-09
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907140377 2402 YRRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLT 2453
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLpTLPPDKKLSKAEILRLAIEYIKHLQ 53
bHLHzip_spESC1_like cd19690
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Schizosaccharomyces pombe ESC1 (spESC1) ...
2403-2459 1.14e-08

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Schizosaccharomyces pombe ESC1 (spESC1) and similar proteins; spESC1 is a bHLHzip protein with homology to human MyoD and Myf-5 myogenic differentiation inducers. It is involved in the sexual differentiation process.


Pssm-ID: 381533  Cd Length: 65  Bit Score: 53.62  E-value: 1.14e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKL 2459
Cdd:cd19690      1 RVSHKLAERKRRKEMKELFEDLRDALPQERGTKASKWEILTKAISYIQQLKRHIREL 57
bHLH_SF cd00083
basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators ...
2410-2452 1.33e-07

basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. Members of the bHLH superfamily have two highly conserved and functionally distinct regions. The basic part is at the amino end of the bHLH that may bind DNA to a consensus hexanucleotide sequence known as the E box (CANNTG). Different families of bHLH proteins recognize different E-box consensus sequences. At the carboxyl-terminal end of the region is the HLH region that interacts with other proteins to form homo- and heterodimers. bHLH proteins function as a diverse set of regulatory factors because they recognize different DNA sequences and dimerize with different proteins. The bHLH proteins can be divided to cell-type specific and widely expressed proteins. The cell-type specific members of bHLH superfamily are involved in cell-fate determination and act in neurogenesis, cardiogenesis, myogenesis, and hematopoiesis.


Pssm-ID: 381392 [Multi-domain]  Cd Length: 46  Bit Score: 50.21  E-value: 1.33e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1907140377 2410 ERRRRGEMRDLFEKLKITLGLLH-SSKVSKSLILNRAFSEIQGL 2452
Cdd:cd00083      1 ERRRRDKINDAFEELKRLLPELPdSKKLSKASILQKAVEYIREL 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1480-1770 1.84e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 56.89  E-value: 1.84e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1480 ASRKPRTLLPSTSNSKMASSGPATNRSGKNLKAFVPAKRPIAARPSPGGVFTQFVMSKVGALQQKipgvrTPQPLTGPQk 1559
Cdd:pfam17823   65 AAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQS-----LPAAIAALP- 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1560 fsirpspvmvvTPVVSSEQVQVCSTVAAAVTTSP---QVFLENVTAVPSLTANSDMGAKEATYSSSASTAGVVEISETNN 1636
Cdd:pfam17823  139 -----------SEAFSAPRAAACRANASAAPRAAiaaASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLT 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1637 TTLVTSTQSTATVNLTKTTGITTSPVASVSFAKPLVASPTITLPVASTASTSIVMVTTAA-----SSSVVTTPTSSlSSV 1711
Cdd:pfam17823  208 PARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAgtinmGDPHARRLSPA-KHM 286
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1712 PIILSGINGSPPvsQRPE-NAPQIPVTTPQISSNNVKRTGPRLLLIPVQQGSPTLRPIQN 1770
Cdd:pfam17823  287 PSDTMARNPAAP--MGAQaQGPIIQVSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTN 344
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1447-1775 5.07e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 52.61  E-value: 5.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1447 AGLSPSGKLVAYKRKPSSTTSgliqvaSNAKVAASRKPRTLLPSTSNSKMASSGPATNrSGKNLKAFVPAKRPIAARPSP 1526
Cdd:pfam05109  394 SGLGTAPKTLIITRTATNATT------TTHKVIFSKAPESTTTSPTLNTTGFAAPNTT-TGLPSSTHVPTNLTAPASTGP 466
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1527 GgVFTQFVMSKVGALQQKIPGVRTPQPltgpqkfSIRPSPVMVVTPVVSSEQVQVCSTVAAAVTTSPQVflenVTAVPSL 1606
Cdd:pfam05109  467 T-VSTADVTSPTPAGTTSGASPVTPSP-------SPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAV----TTPTPNA 534
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1607 TAnsdmgakeATYSSSASTAGVVeiSETNNTTLVTSTQSTATVNLTKTTGITTSPVASVSFAKPLVASPTI--TLPVAS- 1683
Cdd:pfam05109  535 TS--------PTLGKTSPTSAVT--TPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVgeTSPQANt 604
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1684 -------TASTSIVMVTTAASSSVVTT-----PTSSLSSVPIILSGINGSPPVSQRPENAPQIPVTT---PQISSNNVKR 1748
Cdd:pfam05109  605 tnhtlggTSSTPVVTSPPKNATSAVTTgqhniTSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTsahPTGGENITQV 684
                          330       340
                   ....*....|....*....|....*..
gi 1907140377 1749 TGPRLLLIPVQQGSPTLRPIQNPQLQG 1775
Cdd:pfam05109  685 TPASTSTHHVSTSSPAPRPGTTSQASG 711
bHLHzip_L-Myc cd11457
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in L-Myc and similar proteins; L-Myc, ...
2403-2482 6.07e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in L-Myc and similar proteins; L-Myc, also termed Class E basic helix-loop-helix protein 38 (bHLHe38), or protein L-Myc-1, or V-myc myelocytomatosis viral oncogene homolog, is a bHLHZip oncoprotein belonging to the Myc oncogene protein family. It binds DNA as a heterodimer with MAX. L-Myc is co-expressed with another Myc family member and has weaker transformation/transactivation activities. L-Myc knockout mouse did not exhibit any phenotypic abnormalities.


Pssm-ID: 381463 [Multi-domain]  Cd Length: 89  Bit Score: 46.71  E-value: 6.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKVSSLS 2481
Cdd:cd11457      8 RKNHNFLERKRRNDLRSRFLALRDEVpGLASCSKTPKVVILSKATEYLRGLVSAERRMAAEKRQLKSRQQQLLRRIAQLK 87

                   .
gi 1907140377 2482 G 2482
Cdd:cd11457     88 G 88
bHLHzip_Mlx_like cd11404
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, ...
2403-2470 7.16e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, also termed Class D basic helix-loop-helix protein 13 (bHLHd13), or Max-like bHLHZip protein, or protein BigMax, or transcription factor-like protein 4, is a Max-like bHLHZip transcription regulator that interacts with the Max network of transcription factors. It forms a sequence-specific DNA-binding protein complex with some member of Mad family (Mad1 and Mad4) and Mondo family but not the Myc family and bind the E-box DNA to control transcription. The family also includes Saccharomyces cerevisiae INO4, which is a bHLH transcriptional activator of phospholipid synthetic genes (such as INO1, CHO1/PSS, CHO2/PEM1, OPI3/PEM2, etc.). It is required for de-repression of phospholipid biosynthetic gene expression in response to inositol deprivation in yeast.


Pssm-ID: 381410 [Multi-domain]  Cd Length: 70  Bit Score: 46.14  E-value: 7.16e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLSRKR 2470
Cdd:cd11404      3 RLNHVRSEKKRRELIKKGYDELCALVPGLDPQKRTKADILQKAADWIQELKEENEKLEEQLDELKEAA 70
HLH smart00353
helix loop helix domain;
2407-2458 1.10e-05

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 44.90  E-value: 1.10e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1907140377  2407 TANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADK 2458
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLpTLPKNKKLSKAEILRLAIEYIKSLQEELQK 53
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
1550-1748 1.36e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 50.52  E-value: 1.36e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1550 TPQPLTGPQKFSIRPSPVMVVTPVVSSEQVQ--VCSTVAAAVTTSPQVflENVTAVPSLTANSDMGAKEATYSSSASTAG 1627
Cdd:COG3469     13 GGASATAVTLLGAAATAASVTLTAATATTVVstTGSVVVAASGSAGSG--TGTTAASSTAATSSTTSTTATATAAAAAAT 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1628 VVEISETNNTTLVTSTQSTATVNLTKTTGITTSPVASVSFAKPLVASPTITLPVASTASTSIVMV----TTAASSSVVTT 1703
Cdd:COG3469     91 STSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGtetaTGGTTTTSTTT 170
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907140377 1704 PTSSLSSVPiILSGINGSPPVSQRPENAPQIPVTTPQISSNNVKR 1748
Cdd:COG3469    171 TTTSASTTP-SATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
PHA03255 PHA03255
BDLF3; Provisional
1620-1757 1.47e-05

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 49.13  E-value: 1.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1620 SSSASTAGVVEISETNNTTLVTSTQSTATVNLTKTTGITTSPV---ASVSFAKPLVASPTITLPVASTAS--TSIVMVTT 1694
Cdd:PHA03255    26 SSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPItttAILSTNTTTVTSTGTTVTPVPTTSnaSTINVTTK 105
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907140377 1695 AASSSVVTTPTSSLSSVPIILSGINGSPPVSQRPENAPQIPVTTPQISS---NNVKRTGPRLLLIP 1757
Cdd:PHA03255   106 VTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSkgtSNATKTTAELPTVP 171
bHLHzip_N-Myc_like cd11456
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in N-Myc and similar proteins; N-Myc, ...
2403-2477 2.73e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in N-Myc and similar proteins; N-Myc, also termed Class E basic helix-loop-helix protein 37 (bHLHe37), is a bHLHZip proto-oncogene protein that positively regulates the transcription of MYCNOS in neuroblastoma cells. It is also essential during embryonic development. N-Myc has a critical role in regulating the switch between proliferation and differentiation of progenitor cells. It binds DNA as a heterodimer with MAX. The family also includes S-Myc, encoded by rat or mouse intronless myc gene, which has apoptosis-inducing activity.


Pssm-ID: 381462 [Multi-domain]  Cd Length: 87  Bit Score: 44.89  E-value: 2.73e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKV 2477
Cdd:cd11456      6 RRNHNILERQRRNDLRSSFLTLRDHVpELVKNEKAAKVVILKKATEYVHSLQAEEQKLLLEKEKLQARQQQLLKKI 81
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
1657-1834 4.11e-05

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 49.15  E-value: 4.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1657 ITTSPVASVSFAKPLVASPTITLPVASTA-STSIvmvTTAASSSVVTTPTSS--LSSVPIILSGINGSPPVSQRPENAPQ 1733
Cdd:cd22536    262 LVQPSDGGVSNGNQLVSTPITTASVSTMPeSPSS---STTCTTTASTSLTSSdtLVSSAETGQYASTAASSERTEEEPQT 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1734 IPVTTPQISSNNVKRTGprllLIPVQQGSPTLrpiQNPQLQGQRMVLQPVRGPSGMNLFRHPNGQIVQLLPLHQIRGSNA 1813
Cdd:cd22536    339 SAAESEAQSSSQLQSNG----LQNVQDQSNSL---QQVQIVGQPILQQIQIQQPQQQIIQAIQPQSFQLQSGQTIQTIQQ 411
                          170       180
                   ....*....|....*....|...
gi 1907140377 1814 QP--SLQPVVFRNPgSMVGIRLP 1834
Cdd:cd22536    412 QPlqNVQLQAVQSP-TQVLIRAP 433
bHLHzip_Max cd11406
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, ...
2403-2450 4.78e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, also termed Class D basic helix-loop-helix protein 4 (bHLHd4), or Myc-associated factor X, is a bHLHZip transcription regulator that forms a sequence-specific DNA-binding protein complex with MYC or MAD which recognizes the core sequence 5'-CAC[GA]TG-3'. The MYC:MAX complex is a transcriptional activator, whereas the MAD:MAX complex is a transcriptional repressor. Max homodimer bind DNA but is transcriptionally inactive. Targeted deletion of max results in early embryonic lethality in mice.


Pssm-ID: 381412  Cd Length: 69  Bit Score: 43.49  E-value: 4.78e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQ 2450
Cdd:cd11406      2 RAHHNALERKRRDHIKDSFHSLRDSVPSLQGEKASRAQILKKATEYIQ 49
bHLHzip_Mad4 cd18929
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-associated protein 4 (Mad4) and ...
2402-2481 2.35e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-associated protein 4 (Mad4) and similar proteins; Mad4, also termed Max dimerization protein 4, or Max dimerizer 4 (MXD4), or Class C basic helix-loop-helix protein 12 (bHLHc12), or Max-interacting transcriptional repressor MAD4, is a bHLHZip Max-interacting transcriptional repressor that suppresses c-myc dependent transformation and is expressed during neural and epidermal differentiation. It is regulated by a transcriptional repressor complex that contains Miz-1 and c-Myc.


Pssm-ID: 381499 [Multi-domain]  Cd Length: 88  Bit Score: 42.30  E-value: 2.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 2402 YRRTHTANERRRRGEMRDLFEKLK--ITLGLLHSSKVSKSLiLNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKVSS 2479
Cdd:cd18929      2 NRSSHNELEKHRRAKLRLYLEQLKqlVPLGPDSTRHTTLSL-LKRAKMHIKKLEEQDRKALNIKEQLQREHRYLKRRLEQ 80

                   ..
gi 1907140377 2480 LS 2481
Cdd:cd18929     81 LS 82
SSP160 pfam06933
Special lobe-specific silk protein SSP160; This family consists of several special ...
1570-1685 2.77e-04

Special lobe-specific silk protein SSP160; This family consists of several special lobe-specific silk protein SSP160 sequences which appear to be specific to Chironomus (Midge) species.


Pssm-ID: 115579 [Multi-domain]  Cd Length: 758  Bit Score: 46.69  E-value: 2.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1570 VTPVVSSEQVQVCSTVAAAVTTSPQVFLENVTAVPSLTANSD--MGAKEATYSSSASTAGVVEISETNNTTLVTSTQSTA 1647
Cdd:pfam06933  613 LTAFLASFNATINATIAAASANNSEVQSSEAACIESSLADAAaiLAMFEAAYQNCTAPGSVTVPAAANTTTSSTTTTTTT 692
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1907140377 1648 TVNLTKTTgiTTSPVASVSFAKPL----------VASPTITLPVASTA 1685
Cdd:pfam06933  693 TTTAAPTT--TTTKAANAPFTYPLcnlimsaacsAGGAGCTYPFISSA 738
PHA03247 PHA03247
large tegument protein UL36; Provisional
1503-1883 4.10e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 4.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1503 TNRSGKNLKAFVPAKRPI--AARPSPGgvftqfvmsKVGALQQKIPGVRTPQPltgpqkfsiRPSPVMVVTPVVSSEQVQ 1580
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRrrAARPTVG---------SLTSLADPPPPPPTPEP---------APHALVSATPLPPGPAAA 2728
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1581 VCSTVAAAVTTSPQVfLENVTAVPSLTANSDMGAKEATYSSSASTAGVVEISETNNTTLVTSTQSTATVNLTKTTGITTS 1660
Cdd:PHA03247  2729 RQASPALPAAPAPPA-VPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1661 PVAsvsfakplVASPTITLPVASTASTSIVMVTTAASSSVVTTPTSSLSSVPIILSGINGSpPVSQRP--ENAPQIPVTT 1738
Cdd:PHA03247  2808 PAA--------VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGG-DVRRRPpsRSPAAKPAAP 2878
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1739 PQISSNNVKRTGPRLLLIPVQQGSPTLRPIQNPQLQGQRMVLQPVRGPSGMNLFRHPNGQivQLLPLHQIRGSNAQPSLQ 1818
Cdd:PHA03247  2879 ARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR--PQPPLAPTTDPAGAGEPS 2956
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907140377 1819 PVVfRNP--GSMVGIRLPAPckssetpsssasSSAFSVMSPVIQAVGSSPTVNVISQAPSLLSSGSS 1883
Cdd:PHA03247  2957 GAV-PQPwlGALVPGRVAVP------------RFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASS 3010
bHLHzip_c-Myc cd11458
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in c-Myc and similar proteins; c-Myc, ...
2403-2480 7.37e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in c-Myc and similar proteins; c-Myc, also termed Myc proto-oncogene protein, or Class E basic helix-loop-helix protein 39 (bHLHe39), or transcription factor p64, a bHLHZip proto-oncogene protein that functions as a transcription factor, which binds DNA in a non-specific manner, yet also specifically recognizes the core sequence 5'-CAC[GA]TG-3'. It activates the transcription of growth-related genes.


Pssm-ID: 381464 [Multi-domain]  Cd Length: 84  Bit Score: 40.63  E-value: 7.37e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITL-GLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKVSSL 2480
Cdd:cd11458      6 RRTHNVLERQRRNELKLSFFALRDQIpEVANNEKAPKVVILKKATEYILSMQADEQRLISEKEQLRRRREQLKHRLEQL 84
bHLHzip_USF3 cd18910
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix ...
2400-2460 9.46e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix domain-containing protein USF3 and similar proteins; USF3, also termed upstream transcription factor 3, is a bHLHzip protein that is involved in the negative regulation of epithelial-mesenchymal transition, the process by which epithelial cells lose their polarity and adhesion properties to become mesenchymal cells with enhanced migration and invasive properties.


Pssm-ID: 381480  Cd Length: 65  Bit Score: 39.98  E-value: 9.46e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907140377 2400 AYYRRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLI 2460
Cdd:cd18910      3 EKKRESHNEVERRRKDKINAGINKIGELLPDRDAKKQSKNMILEQAYKYIVELKKKNDKLL 63
bHLHzip_Mad cd11401
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Mad family; Members of the Mad ...
2403-2477 1.39e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the Mad family; Members of the Mad family (Mad1, Mxi, Mad3, and Mad4) bear the bHLHzip domain (also known as basic-helix-loop-helix-leucine-zipper or bHLH-LZ domain), which mediates heterodimerization to Max and the sequence-specific DNA binding ability to E-box DNA. Mad family proteins can repress transcription at the E-box through their interaction with co-repressors. Mad family proteins antagonize Myc function in transactivation and transformation and they are growth/tumor suppressors. The developmental phenotypes of the individual Mad family member knockout mice are relatively mild- all these mice have been shown to be viable and normal.


Pssm-ID: 381407 [Multi-domain]  Cd Length: 76  Bit Score: 39.89  E-value: 1.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLK------------ITLGLlhsskvsksliLNRAFSEIQGLTDQADKLIGQKNLLSRKR 2470
Cdd:cd11401      1 RSTHNELEKNRRAHLRLCLERLKelvplgpdatrhTTLSL-----------LTKAKAYIKNLEDKEKRQRQQKEQLRREQ 69

                   ....*..
gi 1907140377 2471 SILIRKV 2477
Cdd:cd11401     70 RELKRRL 76
bHLHzip_MLXIP_like cd11405
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MLX-interacting protein (MLXIP), ...
2403-2472 1.75e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in MLX-interacting protein (MLXIP), MLX-interacting protein-like (MLXIPL) and similar proteins; The family includes MLXIP and MLXIPL. MLXIP, also termed Class E basic helix-loop-helix protein 36 (bHLHe36), or transcriptional activator MondoA, is a bHLHZip transcriptional activator that binds DNA as a heterodimer with Mlx. It binds to the canonical E box sequence 5'-CACGTG-3' and plays a role in transcriptional activation of glycolytic target genes. MLXIP is most highly expressed in skeletal muscle and functions as an indirect glucose sensor, by sensing glucose 6-phosphate and shuttling between the nucleus and the cytoplasm. MLXIPL, also termed carbohydrate-responsive element-binding protein (ChREBP), or Class D basic helix-loop-helix protein 14 (bHLHd14), or MLX interactor, or WS basic-helix-loop-helix leucine zipper protein (WS-bHLH), or Williams-Beuren syndrome chromosomal region 14 protein (WBSCR14), is a bHLHZip transcriptional factor integral to the regulation of glycolysis and lipogenesis in the liver. It forms heterodimers with the bHLHZip protein Mlx to bind the DNA sequence 5'-CACGTG-3'.


Pssm-ID: 381411 [Multi-domain]  Cd Length: 74  Bit Score: 39.57  E-value: 1.75e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLK---ITLGLLHSSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLsrKRSI 2472
Cdd:cd11405      4 RLSHISAEQKRRFNIKSGFDTLQsliPSLGQNPNQKVSKAAMLQKAAEYIKSLKRERQQMQEEAEQL--RQEI 74
bHLHzip_MXI1 cd18930
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-interacting protein 1 (MXI1) and ...
2402-2480 2.78e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-interacting protein 1 (MXI1) and similar proteins; MXI1, also termed Max interactor 1, or Class C basic helix-loop-helix protein 11 (bHLHc11), is a bHLHZip transcriptional repressor that binds with MAX to form a sequence-specific DNA-binding protein complex which recognizes the core sequence 5'-CAC[GA]TG-3'. It thus antagonizes MYC transcriptional activity by competing for MAX. It plays an important role in the regulation of cell proliferation.


Pssm-ID: 381500 [Multi-domain]  Cd Length: 80  Bit Score: 39.21  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 2402 YRRTHTANERRRRGEMRDLFEKLKITLGLLHSSKVSKSL-ILNRAFSEIQGLTDQADKLIGQKNLLSRKRSILIRKVSSL 2480
Cdd:cd18930      1 NRSTHNELEKNRRAHLRLCLERLKVLIPLGPDCTRHTTLgLLNKAKAHIKKLEEADRKSQHQLENLEREQRFLKRRLEQL 80
bHLHzip_Mlx cd19687
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) and similar ...
2403-2469 4.23e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) and similar proteins; Mlx, also termed Class D basic helix-loop-helix protein 13 (bHLHd13), or Max-like bHLHZip protein, or protein BigMax, or transcription factor-like protein 4, is a Max-like bHLHZip transcription regulator that interacts with the Max network of transcription factors. It forms a sequence-specific DNA-binding protein complex with some member of Mad family (Mad1 and Mad4) and Mondo family but not the Myc family and bind the E-box DNA to control transcription.


Pssm-ID: 381530 [Multi-domain]  Cd Length: 76  Bit Score: 38.56  E-value: 4.23e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLKITLGLLH------SSKVSKSLILNRAFSEIQGLTDQADKLIGQKNLLsRK 2469
Cdd:cd19687      3 REAHTQAEQKRRDAIKKGYDDLQDIVPTCQqqddigSQKLSKATILQRSIDYIQFLHQQKKKQEEELSAL-RK 74
bHLH_ScINO2_like cd11388
basic helix-loop-helix (bHLH) domain found in Saccharomyces cerevisiae protein INO2 and ...
2403-2459 6.32e-03

basic helix-loop-helix (bHLH) domain found in Saccharomyces cerevisiae protein INO2 and similar proteins; INO2 is a positive regulatory factor required for depression of the co-regulated phospholipid biosynthetic enzymes in Saccharomyces cerevisiae. It is also involved in the expression of ITR1.


Pssm-ID: 381394  Cd Length: 68  Bit Score: 37.72  E-value: 6.32e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907140377 2403 RRTHTANERRRRGEMRDLFEKLkitLGLLH------SSKVSKSLILNRAFSEIQGLTDQADKL 2459
Cdd:cd11388      4 KWKHVEAEKKRRNQIKKGFEDL---INLINyprnnnEKRISKSELLNKAVDDIRGLLKANEQL 63
bHLH_AtbHLH_like cd11393
basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana genes coding transcription ...
2406-2459 9.08e-03

basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana genes coding transcription factors and similar proteins; bHLH proteins are the second largest class of plant transcription factors that regulate transcription of genes that are involve in many essential physiological and developmental process. bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. The Arabidopsis bHLH proteins that have been characterized so far have roles in regulation of fruit dehiscence, cell development (carpel, anther and epidermal), phytochrome signaling, flavonoid biosynthesis, hormone signaling and stress responses.


Pssm-ID: 381399 [Multi-domain]  Cd Length: 53  Bit Score: 36.78  E-value: 9.08e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907140377 2406 HTANERRRRGEMRDLFEKLKitlGLL-HSSKVSKSLILNRAFSEIQGLTDQADKL 2459
Cdd:cd11393      1 HSIAERKRREKINERIRALR---SLVpNGGKTDKASILDEAIEYIKFLQEQVKVL 52
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1670-1974 9.19e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 9.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1670 PLVASPTITLPVASTASTSivmvTTAASSsvvTTPTSSLSSVPIilsgiNGSPPVSQrPENAPQIPVTTPQISSNNVKRT 1749
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPG----TTQAAT---AGPTPSAPSVPP-----QGSPATSQ-PPNQTQSTAAPHTLIQQTPTLH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1750 GPRL-----LLIPVQQGSP----TLRPIQNPQLQGQrmvLQPVRGP--SGMNLFRHP----------------------- 1795
Cdd:pfam03154  239 PQRLpsphpPLQPMTQPPPpsqvSPQPLPQPSLHGQ---MPPMPHSlqTGPSHMQHPvppqpfpltpqssqsqvppgpsp 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1796 ----NGQIVQLLPLHQIRGSNAQPSLQPVVFRNPGSMVGIR---------LPAPCKSSETPSSSASSSAFSVMS----PV 1858
Cdd:pfam03154  316 aapgQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKpppttpipqLPNPQSHKHPPHLSGPSPFQMNSNlpppPA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907140377 1859 IQAVGSSPTVN--------------------------VISQAPSLLSSGSSFVSQAGTLTLRISPPETQNLASKTGSESk 1912
Cdd:pfam03154  396 LKPLSSLSTHHppsahppplqlmpqsqqlppppaqppVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPP- 474
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907140377 1913 ITPSTGGQPVGTASLIPLQSGSFALLQLPGqkPIPSSVLQHVASLQIKKEsqSTDQKDETNS 1974
Cdd:pfam03154  475 ITPPSGPPTSTSSAMPGIQPPSSASVSSSG--PVPAAVSCPLPPVQIKEE--ALDEAEEPES 532
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH