NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|460838688|ref|NP_001264078|]
View 

chromodomain-helicase-DNA-binding protein 7 isoform 1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03142 super family cl33647
Probable chromatin-remodeling complex ATPase chain; Provisional
957-1556 8.17e-155

Probable chromatin-remodeling complex ATPase chain; Provisional


The actual alignment was detected with superfamily member PLN03142:

Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 510.88  E-value: 8.17e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVV 1034
Cdd:PLN03142  169 KMRDYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLgYLHEYRGITGPHMVVAPKSTLGNWMNEIRRFCPvLRAV 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLYEMyfkdpqgrvIKGsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMM 1114
Cdd:PLN03142  249 KFHGNPEERAHQREELL---------VAG--KFDVCVTSFEMAIKEKTALKRFSWRYIIIDEAHRIKNENSLLSKTMRLF 317
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1115 DLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF---GDLKTEEQVQKLQAILKPMMLRRLKEDVEKNLAPK 1191
Cdd:PLN03142  318 STNYRLLITGTPLQNNLHELWALLNFLLPEIFSSAETFDEWFqisGENDQQEVVQQLHKVLRPFLLRRLKSDVEKGLPPK 397
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1192 EETIIEVELTNIQKKYYRAILEKNFTFLSKGGGQANvpnLLNTMMELRKCCNHPYLINGAEEKileefkethnaeSPDFQ 1271
Cdd:PLN03142  398 KETILKVGMSQMQKQYYKALLQKDLDVVNAGGERKR---LLNIAMQLRKCCNHPYLFQGAEPG------------PPYTT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1272 LQAMIQAAGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKPDSDRFV 1351
Cdd:PLN03142  463 GEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFV 542
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1352 FLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLITRNSYEREMFDKASLKLGLDKAVLQS 1431
Cdd:PLN03142  543 FLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYKKLALDALVIQQ 622
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1432 MSGRENATngvqqLSKKEIEDLLRkgaYGALMDEEDEGSKFCEEDIDQILLR-RTHTITIESEGKGSTFAKASFVASGNR 1510
Cdd:PLN03142  623 GRLAEQKT-----VNKDELLQMVR---YGAEMVFSSKDSTITDEDIDRIIAKgEEATAELDAKMKKFTEDAIKFKMDDTA 694
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*.
gi 460838688 1511 TDISLDDpnfwQKWAKKAELDIDALNGRNnlVIDTPRvRKQTRLYS 1556
Cdd:PLN03142  695 ELYDFDD----EDDKDENKLDFKKIVSDN--WIDPPK-RERKRNYS 733
CD2_tandem_CHD5-9_like cd18663
repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
869-926 4.85e-35

repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


:

Pssm-ID: 349310 [Multi-domain]  Cd Length: 59  Bit Score: 128.56  E-value: 4.85e-35
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688  869 PDYVEVDRIMDFARSTD-DRGEPVIHYLVKWCSLPYEDSTWELKQDIDQTKIEEFEKLM 926
Cdd:cd18663     1 PDYVEVDRILDVSVSTDpNTGEPVTHYLVKWCSLPYEDSTWELEEDVDPAKIEEFEKLR 59
CD1_tandem_CHD5-9_like cd18668
repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
786-850 1.06e-24

repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 1 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


:

Pssm-ID: 349315 [Multi-domain]  Cd Length: 68  Bit Score: 99.33  E-value: 1.06e-24
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688  786 DAEGPVVEKIMSSRLVKKQKESGEEVEV-EEFYVKYKNFSYLHCQWASVEDLEK-DKRIQQKIKRFK 850
Cdd:cd18668     1 EEDTMIIEKILASRKKKKEKEEGAEEIEvEEYLVKYKNFSYLHCEWKTEEELEKgDKRIKQKIKRFK 67
BRK smart00592
domain in transcription and CHROMO domain helicases;
2631-2675 7.08e-19

domain in transcription and CHROMO domain helicases;


:

Pssm-ID: 197800  Cd Length: 45  Bit Score: 82.01  E-value: 7.08e-19
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 460838688   2631 TGEERVPVVNKRNGKKMGGAMAPPMKDLPRWLEENPEFAVAPDWT 2675
Cdd:smart00592    1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVAPRSA 45
BRK pfam07533
BRK domain; The function of this domain is unknown. It is often found associated with ...
2552-2593 3.71e-14

BRK domain; The function of this domain is unknown. It is often found associated with helicases and transcription factors.


:

Pssm-ID: 462196  Cd Length: 44  Bit Score: 68.69  E-value: 3.71e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 460838688  2552 LDPDTRIPVINLEDGTRLVGEDAPKNKDLVDWLKLHPTYTVD 2593
Cdd:pfam07533    1 LTGDERVPVVNRKTGKKLTGDKAPKLKDLEEWLEENPGYEVD 42
Glutenin_hmw super family cl26620
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
23-553 2.00e-12

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


The actual alignment was detected with superfamily member pfam03157:

Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 73.44  E-value: 2.00e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    23 GLGECGY-PENPVNPMGQQMP-----IDQGFPSLQPSL----HHPSPNQNQTKLTHFDHYSQYEQKMHLMDQPNRMMGSA 92
Cdd:pfam03157  159 GQGQQGYyPTSPQQSGQRQQPgqgqqLRQGQQGQQSGQgqpgYYPTSSQQPGQLQQTGQGQQGQQPERGQQGQQPGQGQQ 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    93 PGNGlasphSQYHTPPVPQVPhgggGGGQMGVYPGI-QNERHGQSFVDGGSMWGPRAVQVPDQirapyqqqqpQPAPSGP 171
Cdd:pfam03157  239 PGQG-----QQGQQPGQPQQL----GQGQQGYYPISpQQPRQWQQSGQGQQGYYPTSLQQPGQ----------GQSGYYP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   172 PAQGHPQHMQQMGSYLARGDFSMQQHGQPQQRMGQFSQGQEGLSQGSPfiATSGPGHLSHMPQQspsmapslrhPVQQQF 251
Cdd:pfam03157  300 TSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPGQGQQGQQPAQGQQP--GQGQPGYYPTSPQQ----------PGQGQP 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   252 HHHPAALHGESVAHSPRFSPNPPQQG----AVRPQTLNFSSRNQT---VPSPTVNNSGQYSRYPYSNLNQGLVNSTGMNQ 324
Cdd:pfam03157  368 GYYPTSQQQPQQGQQPEQGQQGQQQGqgqqGQQPGQGQQPGQGQPgyyPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQ 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   325 NLGltNSTPMNQSVPrypnavgfpsnsGQGLVHQQPihssgslnQMNTQTMHPSQPQGTYASPPPMSPMKamsnpaGTPP 404
Cdd:pfam03157  448 QPG--QEQPGQGQQP------------GQGQQGQQP--------GQPEQGQQPGQGQPGYYPTSPQQSGQ------GQQL 499
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   405 PQVRPGSAGMPmevGSYPNMPHPQpshqppgamGIGQRNMGPRNMQQPRSFMGMSSAPRELTGHMRPNGCPGVGLADPQA 484
Cdd:pfam03157  500 GQWQQQGQGQP---GYYPTSPLQP---------GQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQ 567
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   485 IQERLIPGQQHPGQQPSFQQLPTcpplQPHPGLHQSSPPHP-HHQPWAQLHPSPQNTPQKVPVHQHSPSE 553
Cdd:pfam03157  568 GQQGQQPGQGQQGQQPGQGQQPG----QGQPGYYPTSPQQSgQGQQPGQWQQPGQGQPGYYPTSSLQLGQ 633
 
Name Accession Description Interval E-value
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
957-1556 8.17e-155

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 510.88  E-value: 8.17e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVV 1034
Cdd:PLN03142  169 KMRDYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLgYLHEYRGITGPHMVVAPKSTLGNWMNEIRRFCPvLRAV 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLYEMyfkdpqgrvIKGsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMM 1114
Cdd:PLN03142  249 KFHGNPEERAHQREELL---------VAG--KFDVCVTSFEMAIKEKTALKRFSWRYIIIDEAHRIKNENSLLSKTMRLF 317
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1115 DLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF---GDLKTEEQVQKLQAILKPMMLRRLKEDVEKNLAPK 1191
Cdd:PLN03142  318 STNYRLLITGTPLQNNLHELWALLNFLLPEIFSSAETFDEWFqisGENDQQEVVQQLHKVLRPFLLRRLKSDVEKGLPPK 397
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1192 EETIIEVELTNIQKKYYRAILEKNFTFLSKGGGQANvpnLLNTMMELRKCCNHPYLINGAEEKileefkethnaeSPDFQ 1271
Cdd:PLN03142  398 KETILKVGMSQMQKQYYKALLQKDLDVVNAGGERKR---LLNIAMQLRKCCNHPYLFQGAEPG------------PPYTT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1272 LQAMIQAAGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKPDSDRFV 1351
Cdd:PLN03142  463 GEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFV 542
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1352 FLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLITRNSYEREMFDKASLKLGLDKAVLQS 1431
Cdd:PLN03142  543 FLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYKKLALDALVIQQ 622
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1432 MSGRENATngvqqLSKKEIEDLLRkgaYGALMDEEDEGSKFCEEDIDQILLR-RTHTITIESEGKGSTFAKASFVASGNR 1510
Cdd:PLN03142  623 GRLAEQKT-----VNKDELLQMVR---YGAEMVFSSKDSTITDEDIDRIIAKgEEATAELDAKMKKFTEDAIKFKMDDTA 694
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*.
gi 460838688 1511 TDISLDDpnfwQKWAKKAELDIDALNGRNnlVIDTPRvRKQTRLYS 1556
Cdd:PLN03142  695 ELYDFDD----EDDKDENKLDFKKIVSDN--WIDPPK-RERKRNYS 733
DEXHc_CHD7 cd18059
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; ...
958-1179 4.60e-153

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; Chromodomain-helicase-DNA-binding protein 7 (CHD7) is a probable transcription regulator. It may be involved in the 45S precursor rRNA production. CHD7 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350817 [Multi-domain]  Cd Length: 222  Bit Score: 473.36  E-value: 4.60e-153
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 1037
Cdd:cd18059     1 LREYQLEGVNWLLFNWYNTRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 1117
Cdd:cd18059    81 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1118 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18059   161 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 222
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
957-1416 3.05e-128

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 421.17  E-value: 3.05e-128
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:COG0553   241 TLRPYQLEGAAWLLFLRRLGLGGLLADDMGLGKTIQALALLLELKERGLARPVLIVAPTSLVGNWQRELAKFApGLRVLV 320
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEmyfkdpqgrvikgsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 1115
Cdd:COG0553   321 LDGTRERAKGANPFE---------------DADLVITSYGLLRRDIELLAAVDWDLVILDEAQHIKNPATKRAKAVRALK 385
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1116 LEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKT---EEQVQKLQAILKPMMLRRLKEDVEKNLAPKE 1192
Cdd:COG0553   386 ARHRLALTGTPVENRLEELWSLLDFLNPGLLGSLKAFRERFARPIEkgdEEALERLRRLLRPFLLRRTKEDVLKDLPEKT 465
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1193 ETIIEVELTNIQKKYYRAILEKNFTFLSKGGGQANVPNLLNTMMELRKCCNHPYLingaeekILEEFKEtHNAESpdfql 1272
Cdd:COG0553   466 EETLYVELTPEQRALYEAVLEYLRRELEGAEGIRRRGLILAALTRLRQICSHPAL-------LLEEGAE-LSGRS----- 532
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1273 qamiqaaGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKpDSDRFVF 1352
Cdd:COG0553   533 -------AKLEALLELLEELLAEGEKVLVFSQFTDTLDLLEERLEERGIEYAYLHGGTSAEERDELVDRFQE-GPEAPVF 604
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 460838688 1353 LLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLITRNSYEREMFD 1416
Cdd:COG0553   605 LISLKAGGEGLNLTAADHVIHYDLWWNPAVEEQAIDRAHRIGQTRDVQVYKLVAEGTIEEKILE 668
SNF2-rel_dom pfam00176
SNF2-related domain; This domain is found in proteins involved in a variety of processes ...
961-1248 5.16e-87

SNF2-related domain; This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1). SNF2 functions as the ATPase component of the SNF2/SWI multisubunit complex, which utilizes energy derived from ATP hydrolysis to disrupt histone-DNA interactions, resulting in the increased accessibility of DNA to transcription factors.


Pssm-ID: 425504 [Multi-domain]  Cd Length: 289  Bit Score: 286.89  E-value: 5.16e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   961 YQLEGVNWLLFNWYNM-RNCILADEMGLGKTIQSITFLYeiYLKGIH----GPFLVIAPLSTIPNWEREFRTWT---ELN 1032
Cdd:pfam00176    1 YQIEGVNWMLSLENNLgRGGILADEMGLGKTLQTISLLL--YLKHVDknwgGPTLIVVPLSLLHNWMNEFERWVsppALR 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1033 VVVYHGSQASRRTIQLYEMYFKDpqgrvikgsykFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:pfam00176   79 VVVLHGNKRPQERWKNDPNFLAD-----------FDVVITTYETLRKHKELLKKVHWHRIVLDEGHRLKNSKSKLSKALK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1113 MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG----DLKTEEQVQKLQAILKPMMLRRLKEDVEKNL 1188
Cdd:pfam00176  148 SLKTRNRWILTGTPLQNNLEELWALLNFLRPGPFGSLSTFRNWFDrpieRGGGKKGVSRLHKLLKPFLLRRTKKDVEKSL 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688  1189 APKEETIIEVELTNIQKKYY-RAILEKNFTFLSKG-GGQANVPNLLNTMMELRKCCNHPYLI 1248
Cdd:pfam00176  228 PPKVEYILFCRLSKLQRKLYqTFLLKKDLNAIKTGeGGREIKASLLNILMRLRKICNHPGLI 289
CD2_tandem_CHD5-9_like cd18663
repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
869-926 4.85e-35

repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349310 [Multi-domain]  Cd Length: 59  Bit Score: 128.56  E-value: 4.85e-35
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688  869 PDYVEVDRIMDFARSTD-DRGEPVIHYLVKWCSLPYEDSTWELKQDIDQTKIEEFEKLM 926
Cdd:cd18663     1 PDYVEVDRILDVSVSTDpNTGEPVTHYLVKWCSLPYEDSTWELEEDVDPAKIEEFEKLR 59
DEXDc smart00487
DEAD-like helicases superfamily;
951-1149 2.62e-31

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 123.37  E-value: 2.62e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    951 EYKNNNKLREYQLEGVNWLLFNWynmRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPL-STIPNWEREFRTWT 1029
Cdd:smart00487    2 EKFGFEPLRPYQKEAIEALLSGL---RDVILAAPTGSGKTLAALLPALEALKRGKGGRVLVLVPTrELAEQWAEELKKLG 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   1030 E---LNVVVYHGSQASRRTIQlyemyfkdpqgRVIKGsyKFHAIITTFEMILTDCPE--LRNIPWRCVVIDEAHRLKN-- 1102
Cdd:smart00487   79 PslgLKVVGLYGGDSKREQLR-----------KLESG--KTDILVTTPGRLLDLLENdkLSLSNVDLVILDEAHRLLDgg 145
                           170       180       190       200       210
                    ....*....|....*....|....*....|....*....|....*....|.
gi 460838688   1103 RNCKLLEGLKMMDLE-HKVLLTGTP---LQNTVEELFSLLHFLEPSRFPSE 1149
Cdd:smart00487  146 FGDQLEKLLKLLPKNvQLLLLSATPpeeIENLLELFLNDPVFIDVGFTPLE 196
CD1_tandem_CHD5-9_like cd18668
repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
786-850 1.06e-24

repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 1 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349315 [Multi-domain]  Cd Length: 68  Bit Score: 99.33  E-value: 1.06e-24
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688  786 DAEGPVVEKIMSSRLVKKQKESGEEVEV-EEFYVKYKNFSYLHCQWASVEDLEK-DKRIQQKIKRFK 850
Cdd:cd18668     1 EEDTMIIEKILASRKKKKEKEEGAEEIEvEEYLVKYKNFSYLHCEWKTEEELEKgDKRIKQKIKRFK 67
DISARM_DrmD_b NF038318
DISARM system SNF2-like helicase DrmD, short form; DrmD, a SNF2-like helicase, is a component ...
980-1428 7.03e-22

DISARM system SNF2-like helicase DrmD, short form; DrmD, a SNF2-like helicase, is a component of class 1 DISARM (Defence Island System Associated with Restriction Modification), which contains a DNA adenine N6 methyltransferase. This HMM describes a distinct form that is somewhat shorter than the majority of DrmD proteins.


Pssm-ID: 468472 [Multi-domain]  Cd Length: 868  Bit Score: 104.38  E-value: 7.03e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  980 ILADEMGLGKTIQS---ITFLYEIYLKGIhgpfLVIAPLSTIPNWEREFRTWTELNVVVyhgsqASRRTiqlYEMYFKDP 1056
Cdd:NF038318   51 ILADEVGLGKTIEAglvLKYVLESGAKKI----LIILPANLRKQWEIELEEKFDLESLI-----LDSLT---VEKDAKKW 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1057 QGRvIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKN--RNCKLLEGLkmMDLEH---KVLLTGTPLQNTV 1131
Cdd:NF038318  119 NKR-LTDNKKVRIVITSYDYASKLMKRFPKVKWDFIIIDEAHNLRNvhKGGKRAKNL--YELTKgipKILLTATPLQNSL 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1132 EELFSLLHFLEPSRFPSETTFM------QEFGDLKTEeqvqklqaiLKPMMLRRLKEDVEKNLAPKEETII--------- 1196
Cdd:NF038318  196 LDLYGLVSFIDPRIFGSEKVFSkryikdEDYSDLKRE---------LSPVLYRTLRKDVADYMQFKKRKCItvdfelspd 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1197 EVELTN-----IQKKYYRAI-----------------------------LEKNFTFLSKGGGQANVPNLLNtmmelrkcc 1242
Cdd:NF038318  267 EIELYVrvnnfLKRDILYSIptsnrtliilvirkllasssfalaetfevLKKRLEKLKEGTRSANAQEGFD--------- 337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1243 nhpYLINGAEEKILEEFKETHNAESPDFQLQaMIQAAGKLV--LIDK------------LLPKLKAG---------GHRV 1299
Cdd:NF038318  338 ---LFWSFVEDEIDESGFEEKQDELYTRQKE-FIQHEIDEVdaIIDVakriktnakvtaLKTALEIAfeyqreegiAQKV 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1300 LIFSQMVRCLDILEDYLIQRRYPYERI---DG--------------RVR--GN--------LRQAAIDRFSkpdsDRFVF 1352
Cdd:NF038318  414 VVFTESKRTQKYIAEELRKSGYEGEDIllfNGdfddamtkeiyrawQVKnyGKanygrsveYKHAIVDYFK----NNAKI 489
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688 1353 LLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLI-TRNSYEREMFDKASLKLGLDKAV 1428
Cdd:NF038318  490 LIVTDAGSEGLNLQFCNTVINYDLPWNPQKIEQRIGRCHRYGQKNDVVAINLLnTQNVADKRVYEILSEKFELFEGV 566
BRK smart00592
domain in transcription and CHROMO domain helicases;
2631-2675 7.08e-19

domain in transcription and CHROMO domain helicases;


Pssm-ID: 197800  Cd Length: 45  Bit Score: 82.01  E-value: 7.08e-19
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 460838688   2631 TGEERVPVVNKRNGKKMGGAMAPPMKDLPRWLEENPEFAVAPDWT 2675
Cdd:smart00592    1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVAPRSA 45
BRK pfam07533
BRK domain; The function of this domain is unknown. It is often found associated with ...
2630-2673 1.61e-18

BRK domain; The function of this domain is unknown. It is often found associated with helicases and transcription factors.


Pssm-ID: 462196  Cd Length: 44  Bit Score: 81.02  E-value: 1.61e-18
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 460838688  2630 LTGEERVPVVNKRNGKKMGGAMAPPMKDLPRWLEENPEFAVAPD 2673
Cdd:pfam07533    1 LTGDERVPVVNRKTGKKLTGDKAPKLKDLEEWLEENPGYEVDPR 44
BRK pfam07533
BRK domain; The function of this domain is unknown. It is often found associated with ...
2552-2593 3.71e-14

BRK domain; The function of this domain is unknown. It is often found associated with helicases and transcription factors.


Pssm-ID: 462196  Cd Length: 44  Bit Score: 68.69  E-value: 3.71e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 460838688  2552 LDPDTRIPVINLEDGTRLVGEDAPKNKDLVDWLKLHPTYTVD 2593
Cdd:pfam07533    1 LTGDERVPVVNRKTGKKLTGDKAPKLKDLEEWLEENPGYEVD 42
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
23-553 2.00e-12

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 73.44  E-value: 2.00e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    23 GLGECGY-PENPVNPMGQQMP-----IDQGFPSLQPSL----HHPSPNQNQTKLTHFDHYSQYEQKMHLMDQPNRMMGSA 92
Cdd:pfam03157  159 GQGQQGYyPTSPQQSGQRQQPgqgqqLRQGQQGQQSGQgqpgYYPTSSQQPGQLQQTGQGQQGQQPERGQQGQQPGQGQQ 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    93 PGNGlasphSQYHTPPVPQVPhgggGGGQMGVYPGI-QNERHGQSFVDGGSMWGPRAVQVPDQirapyqqqqpQPAPSGP 171
Cdd:pfam03157  239 PGQG-----QQGQQPGQPQQL----GQGQQGYYPISpQQPRQWQQSGQGQQGYYPTSLQQPGQ----------GQSGYYP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   172 PAQGHPQHMQQMGSYLARGDFSMQQHGQPQQRMGQFSQGQEGLSQGSPfiATSGPGHLSHMPQQspsmapslrhPVQQQF 251
Cdd:pfam03157  300 TSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPGQGQQGQQPAQGQQP--GQGQPGYYPTSPQQ----------PGQGQP 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   252 HHHPAALHGESVAHSPRFSPNPPQQG----AVRPQTLNFSSRNQT---VPSPTVNNSGQYSRYPYSNLNQGLVNSTGMNQ 324
Cdd:pfam03157  368 GYYPTSQQQPQQGQQPEQGQQGQQQGqgqqGQQPGQGQQPGQGQPgyyPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQ 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   325 NLGltNSTPMNQSVPrypnavgfpsnsGQGLVHQQPihssgslnQMNTQTMHPSQPQGTYASPPPMSPMKamsnpaGTPP 404
Cdd:pfam03157  448 QPG--QEQPGQGQQP------------GQGQQGQQP--------GQPEQGQQPGQGQPGYYPTSPQQSGQ------GQQL 499
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   405 PQVRPGSAGMPmevGSYPNMPHPQpshqppgamGIGQRNMGPRNMQQPRSFMGMSSAPRELTGHMRPNGCPGVGLADPQA 484
Cdd:pfam03157  500 GQWQQQGQGQP---GYYPTSPLQP---------GQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQ 567
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   485 IQERLIPGQQHPGQQPSFQQLPTcpplQPHPGLHQSSPPHP-HHQPWAQLHPSPQNTPQKVPVHQHSPSE 553
Cdd:pfam03157  568 GQQGQQPGQGQQGQQPGQGQQPG----QGQPGYYPTSPQQSgQGQQPGQWQQPGQGQPGYYPTSSLQLGQ 633
BRK smart00592
domain in transcription and CHROMO domain helicases;
2553-2593 7.35e-12

domain in transcription and CHROMO domain helicases;


Pssm-ID: 197800  Cd Length: 45  Bit Score: 61.98  E-value: 7.35e-12
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 460838688   2553 DPDTRIPVINLEDGTRLVGEDAPKNKDLVDWLKLHPTYTVD 2593
Cdd:smart00592    1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVA 41
Chromo pfam00385
Chromo (CHRromatin organization MOdifier) domain;
873-924 1.53e-10

Chromo (CHRromatin organization MOdifier) domain;


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 58.74  E-value: 1.53e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 460838688   873 EVDRIMDfaRSTDDRGEPviHYLVKWCSLPYEDSTWELKQDID--QTKIEEFEK 924
Cdd:pfam00385    2 EVERILD--HRKDKGGKE--EYLVKWKGYPYDENTWEPEENLSkcPELIEEFKD 51
Chromo pfam00385
Chromo (CHRromatin organization MOdifier) domain;
791-852 1.37e-08

Chromo (CHRromatin organization MOdifier) domain;


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 52.96  E-value: 1.37e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688   791 VVEKIMSSRLVKKQKEsgeeveveEFYVKYKNFSYLHCQWASVEDLEKDKRIqqkIKRFKSK 852
Cdd:pfam00385    2 EVERILDHRKDKGGKE--------EYLVKWKGYPYDENTWEPEENLSKCPEL---IEEFKDR 52
PHA03378 PHA03378
EBNA-3B; Provisional
222-545 1.07e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 57.77  E-value: 1.07e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  222 ATSGPGHLSHMPQQSPSmapslrhPVQQQFHHHPAALHGESVAHSPRFSPNPPQQGAVRPQTLNFSSRNQTVPSPtvnns 301
Cdd:PHA03378  554 ASTEPVHDQLLPAPGLG-------PLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAP----- 621
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  302 gqysRYPYSNLNQGLVNSTGMNqnlgltnSTPMNQSVPRYPNAVGFPSNSGQGLVHQQPIHSSGSLNQMNTQTMHPSQPQ 381
Cdd:PHA03378  622 ----RQWPMPLRPIPMRPLRMQ-------PITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWA 690
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  382 GTYASPPPMSPMKAmsnpagtPPPQVRPGSagmpmevgsypnmphpqpshqppgamgigqrnmgprnMQQPRSFMGMSSA 461
Cdd:PHA03378  691 PGTMQPPPRAPTPM-------RPPAAPPGR-------------------------------------AQRPAAATGRARP 726
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  462 PRELTGHMRPNGCPGVGLADPQAIQERLIPGQQHPGQQPSFQQLPTCPPLQPHPglhqSSPPHPHHQPwaQLHPSPQNTP 541
Cdd:PHA03378  727 PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP----QAPPAPQQRP--RGAPTPQPPP 800

                  ....
gi 460838688  542 QKVP 545
Cdd:PHA03378  801 QAGP 804
CHROMO smart00298
Chromatin organization modifier domain;
791-853 1.12e-06

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 47.59  E-value: 1.12e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 460838688    791 VVEKIMSSRLVKK-QKEsgeeveveeFYVKYKNFSYLHCQWASVEDLEKDKRiqqKIKRFKSKQ 853
Cdd:smart00298    3 EVEKILDHRWKKKgELE---------YLVKWKGYSYSEDTWEPEENLLNCSK---KLDNYKKKE 54
CHROMO smart00298
Chromatin organization modifier domain;
873-924 1.83e-05

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 44.51  E-value: 1.83e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 460838688    873 EVDRIMDfARSTDDRGEpviHYLVKWCSLPYEDSTWELKQDI--DQTKIEEFEK 924
Cdd:smart00298    3 EVEKILD-HRWKKKGEL---EYLVKWKGYSYSEDTWEPEENLlnCSKKLDNYKK 52
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
491-552 3.52e-04

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 45.42  E-value: 3.52e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688  491 PGQQHPGQQPSFQQLPTCPPLQPHPGLHQSSPPHPHHQPWAQLHPSPQNTPQkvpvHQHSPS 552
Cdd:cd22056   220 AFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQYQYMNAPYPPHY----AHQGAP 277
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
440-556 4.34e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.95  E-value: 4.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   440 GQRNMGPRNMQQPRSFMGMSSAP--RELTGHMRPNGCPGVGLADPQaiqerlIPGQQHPGQQPSFQQLPT-----CPPLQ 512
Cdd:TIGR01628  376 MQLQPRMRQLPMGSPMGGAMGQPpyYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPmnavrAPSRN 449
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 460838688   513 PHPGLHQSSPPHPHHQPWAQLHPSPQNTPQKVPVHQHSPSEPFL 556
Cdd:TIGR01628  450 AQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKL 493
 
Name Accession Description Interval E-value
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
957-1556 8.17e-155

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 510.88  E-value: 8.17e-155
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVV 1034
Cdd:PLN03142  169 KMRDYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLgYLHEYRGITGPHMVVAPKSTLGNWMNEIRRFCPvLRAV 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLYEMyfkdpqgrvIKGsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMM 1114
Cdd:PLN03142  249 KFHGNPEERAHQREELL---------VAG--KFDVCVTSFEMAIKEKTALKRFSWRYIIIDEAHRIKNENSLLSKTMRLF 317
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1115 DLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF---GDLKTEEQVQKLQAILKPMMLRRLKEDVEKNLAPK 1191
Cdd:PLN03142  318 STNYRLLITGTPLQNNLHELWALLNFLLPEIFSSAETFDEWFqisGENDQQEVVQQLHKVLRPFLLRRLKSDVEKGLPPK 397
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1192 EETIIEVELTNIQKKYYRAILEKNFTFLSKGGGQANvpnLLNTMMELRKCCNHPYLINGAEEKileefkethnaeSPDFQ 1271
Cdd:PLN03142  398 KETILKVGMSQMQKQYYKALLQKDLDVVNAGGERKR---LLNIAMQLRKCCNHPYLFQGAEPG------------PPYTT 462
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1272 LQAMIQAAGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKPDSDRFV 1351
Cdd:PLN03142  463 GEHLVENSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPGSEKFV 542
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1352 FLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLITRNSYEREMFDKASLKLGLDKAVLQS 1431
Cdd:PLN03142  543 FLLSTRAGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYKKLALDALVIQQ 622
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1432 MSGRENATngvqqLSKKEIEDLLRkgaYGALMDEEDEGSKFCEEDIDQILLR-RTHTITIESEGKGSTFAKASFVASGNR 1510
Cdd:PLN03142  623 GRLAEQKT-----VNKDELLQMVR---YGAEMVFSSKDSTITDEDIDRIIAKgEEATAELDAKMKKFTEDAIKFKMDDTA 694
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....*.
gi 460838688 1511 TDISLDDpnfwQKWAKKAELDIDALNGRNnlVIDTPRvRKQTRLYS 1556
Cdd:PLN03142  695 ELYDFDD----EDDKDENKLDFKKIVSDN--WIDPPK-RERKRNYS 733
DEXHc_CHD7 cd18059
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; ...
958-1179 4.60e-153

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; Chromodomain-helicase-DNA-binding protein 7 (CHD7) is a probable transcription regulator. It may be involved in the 45S precursor rRNA production. CHD7 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350817 [Multi-domain]  Cd Length: 222  Bit Score: 473.36  E-value: 4.60e-153
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 1037
Cdd:cd18059     1 LREYQLEGVNWLLFNWYNTRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 1117
Cdd:cd18059    81 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1118 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18059   161 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 222
DEXHc_CHD6_7_8_9 cd17995
DEXH-box helicase domain of the chromodomain helicase DNA binding protein 6, 7, 8 and 9; ...
958-1179 4.59e-150

DEXH-box helicase domain of the chromodomain helicase DNA binding protein 6, 7, 8 and 9; Chromodomain-helicase-DNA-binding protein 6-9 (CHD6, CHD7, CHD8, and CHD9) are members of the DEAD-like helicases superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350753 [Multi-domain]  Cd Length: 223  Bit Score: 464.41  E-value: 4.59e-150
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLK-GIHGPFLVIAPLSTIPNWEREFRTWTELNVVVY 1036
Cdd:cd17995     1 LRDYQLEGVNWLLFNWYNRRNCILADEMGLGKTIQSIAFLEHLYQVeGIRGPFLVIAPLSTIPNWQREFETWTDMNVVVY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1037 HGSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDL 1116
Cdd:cd17995    81 HGSGESRQIIQQYEMYFKDAQGRKKKGVYKFDVLITTYEMVIADAEELRKIPWRVVVVDEAHRLKNRNSKLLQGLKKLTL 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 460838688 1117 EHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd17995   161 EHKLLLTGTPLQNNTEELWSLLNFLEPEKFPSSEEFLEEFGDLKTAEQVEKLQALLKPYMLRR 223
DEXHc_CHD6 cd18058
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 6; ...
958-1179 1.37e-137

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 6; Chromodomain-helicase-DNA-binding protein 6 (CHD6) is a DNA-dependent ATPase that plays a role in chromatin remodeling. It regulates transcription by disrupting nucleosomes in a largely non-sliding manner which strongly increases the accessibility of chromatin. It activates transcription of specific genes in response to oxidative stress through interaction with NFE2L2.2 and acts as a transcriptional repressor of different viruses including influenza virus or papillomavirus. During influenza virus infection, the viral polymerase complex localizes CHD6 to inactive chromatin where it gets degraded in a proteasome independent-manner. CHD6 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350816 [Multi-domain]  Cd Length: 222  Bit Score: 428.69  E-value: 1.37e-137
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 1037
Cdd:cd18058     1 LREYQLEGMNWLLFNWYNRKNCILADEMGLGKTIQSITFLSEIFLMGIRGPFLIIAPLSTITNWEREFRTWTEMNAIVYH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 1117
Cdd:cd18058    81 GSQISRQMIQQYEMYYRDEQGNPLSGIFKFQVVITTFEMILADCPELKKINWSCVIIDEAHRLKNRNCKLLEGLKLMALE 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1118 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18058   161 HKVLLTGTPLQNSVEELFSLLNFLEPSQFPSETTFLEEFGDLKTEEQVKKLQSILKPMMLRR 222
DEXHc_CHD9 cd18061
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 9; ...
958-1179 2.28e-133

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 9; Chromodomain-helicase-DNA-binding protein 9 (CHD9) acts as a transcriptional coactivator for PPARA and possibly other nuclear receptors. It is proposed to be a ATP-dependent chromatin remodeling protein. CHD9 has DNA-dependent ATPase activity and binds to A/T-rich DNA. It also associates with A/T-rich regulatory regions in promoters of genes that participate in the differentiation of progenitors during osteogenesis. CHD9 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350819 [Multi-domain]  Cd Length: 222  Bit Score: 416.71  E-value: 2.28e-133
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 1037
Cdd:cd18061     1 LREYQLEGLNWLLFNWYNRRNCILADEMGLGKTIQSITFLYEILLTGIRGPFLIIAPLSTIANWEREFRTWTDLNVVVYH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 1117
Cdd:cd18061    81 GSLISRQMIQQYEMYFRDSQGRIIRGAYRFQAIITTFEMILGGCPELNAIDWRCVIIDEAHRLKNKNCKLLEGLKLMNLE 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1118 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18061   161 HKVLLTGTPLQNTVEELFSLLHFLEPLRFPSESTFMQEFGDLKTEEQVQKLQAILKPMMLRR 222
DEXHc_CHD8 cd18060
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 8; ...
958-1179 1.40e-132

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 8; Chromodomain-helicase-DNA-binding protein 8 (CHD8) is a DNA helicase that acts as a chromatin remodeling factor and regulates transcription. It also acts as a transcription repressor by remodeling chromatin structure and recruiting histone H1 to target genes. It suppresses p53/TP53-mediated apoptosis by recruiting histone H1 and preventing p53/TP53 transactivation activity and of STAT3 activity by suppressing the LIF-induced STAT3 transcriptional activity. It also acts as a negative regulator of Wnt signaling pathway and CTNNB1-targeted gene expression. CHD8 is also involved in both enhancer blocking and epigenetic remodeling at chromatin boundary via its interaction with CTCF. It also acts as a transcription activator via its interaction with ZNF143 by participating in efficient U6 RNA polymerase III transcription. CHD8 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350818 [Multi-domain]  Cd Length: 222  Bit Score: 414.45  E-value: 1.40e-132
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVYH 1037
Cdd:cd18060     1 LREYQLEGVNWLLFNWYNRQNCILADEMGLGKTIQSIAFLQEVYNVGIHGPFLVIAPLSTITNWEREFNTWTEMNTIVYH 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 GSQASRRTIQLYEMYFKDPQGRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLE 1117
Cdd:cd18060    81 GSLASRQMIQQYEMYCKDSRGRLIPGAYKFDALITTFEMILSDCPELREIEWRCVIIDEAHRLKNRNCKLLDSLKHMDLE 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1118 HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18060   161 HKVLLTGTPLQNTVEELFSLLHFLEPSQFPSESEFLKDFGDLKTEEQVQKLQAILKPMMLRR 222
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
957-1416 3.05e-128

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 421.17  E-value: 3.05e-128
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:COG0553   241 TLRPYQLEGAAWLLFLRRLGLGGLLADDMGLGKTIQALALLLELKERGLARPVLIVAPTSLVGNWQRELAKFApGLRVLV 320
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEmyfkdpqgrvikgsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 1115
Cdd:COG0553   321 LDGTRERAKGANPFE---------------DADLVITSYGLLRRDIELLAAVDWDLVILDEAQHIKNPATKRAKAVRALK 385
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1116 LEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKT---EEQVQKLQAILKPMMLRRLKEDVEKNLAPKE 1192
Cdd:COG0553   386 ARHRLALTGTPVENRLEELWSLLDFLNPGLLGSLKAFRERFARPIEkgdEEALERLRRLLRPFLLRRTKEDVLKDLPEKT 465
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1193 ETIIEVELTNIQKKYYRAILEKNFTFLSKGGGQANVPNLLNTMMELRKCCNHPYLingaeekILEEFKEtHNAESpdfql 1272
Cdd:COG0553   466 EETLYVELTPEQRALYEAVLEYLRRELEGAEGIRRRGLILAALTRLRQICSHPAL-------LLEEGAE-LSGRS----- 532
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1273 qamiqaaGKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKpDSDRFVF 1352
Cdd:COG0553   533 -------AKLEALLELLEELLAEGEKVLVFSQFTDTLDLLEERLEERGIEYAYLHGGTSAEERDELVDRFQE-GPEAPVF 604
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 460838688 1353 LLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLITRNSYEREMFD 1416
Cdd:COG0553   605 LISLKAGGEGLNLTAADHVIHYDLWWNPAVEEQAIDRAHRIGQTRDVQVYKLVAEGTIEEKILE 668
SNF2-rel_dom pfam00176
SNF2-related domain; This domain is found in proteins involved in a variety of processes ...
961-1248 5.16e-87

SNF2-related domain; This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1). SNF2 functions as the ATPase component of the SNF2/SWI multisubunit complex, which utilizes energy derived from ATP hydrolysis to disrupt histone-DNA interactions, resulting in the increased accessibility of DNA to transcription factors.


Pssm-ID: 425504 [Multi-domain]  Cd Length: 289  Bit Score: 286.89  E-value: 5.16e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   961 YQLEGVNWLLFNWYNM-RNCILADEMGLGKTIQSITFLYeiYLKGIH----GPFLVIAPLSTIPNWEREFRTWT---ELN 1032
Cdd:pfam00176    1 YQIEGVNWMLSLENNLgRGGILADEMGLGKTLQTISLLL--YLKHVDknwgGPTLIVVPLSLLHNWMNEFERWVsppALR 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1033 VVVYHGSQASRRTIQLYEMYFKDpqgrvikgsykFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:pfam00176   79 VVVLHGNKRPQERWKNDPNFLAD-----------FDVVITTYETLRKHKELLKKVHWHRIVLDEGHRLKNSKSKLSKALK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1113 MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG----DLKTEEQVQKLQAILKPMMLRRLKEDVEKNL 1188
Cdd:pfam00176  148 SLKTRNRWILTGTPLQNNLEELWALLNFLRPGPFGSLSTFRNWFDrpieRGGGKKGVSRLHKLLKPFLLRRTKKDVEKSL 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688  1189 APKEETIIEVELTNIQKKYY-RAILEKNFTFLSKG-GGQANVPNLLNTMMELRKCCNHPYLI 1248
Cdd:pfam00176  228 PPKVEYILFCRLSKLQRKLYqTFLLKKDLNAIKTGeGGREIKASLLNILMRLRKICNHPGLI 289
DEXHc_CHD1_2 cd17993
DEXH-box helicase domain of the chromodomain helicase DNA binding proteins 1 and 2, and ...
957-1179 5.34e-84

DEXH-box helicase domain of the chromodomain helicase DNA binding proteins 1 and 2, and similar proteins; Chromodomain-helicase-DNA-binding protein 1 (CHD1) is an ATP-dependent chromatin-remodeling factor which functions as the substrate recognition component of the transcription regulatory histone acetylation (HAT) complex SAGA. It regulates polymerase II transcription and is also required for efficient transcription by RNA polymerase I, and more specifically the polymerase I transcription termination step. It is not only involved in transcription-related chromatin-remodeling, but is also required to maintain a specific chromatin configuration across the genome. CHD1 is also associated with histone deacetylase (HDAC) activity. Chromodomain-helicase-DNA-binding protein 2 (CHD2) is a DNA-binding helicase that specifically binds to the promoter of target genes, leading to chromatin remodeling, possibly by promoting deposition of histone H3.3. It is involved in myogenesis via interaction with MYOD1; it binds to myogenic gene regulatory sequences and mediates incorporation of histone H3.3 prior to the onset of myogenic gene expression, promoting their expression. Both are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350751 [Multi-domain]  Cd Length: 218  Bit Score: 275.00  E-value: 5.34e-84
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYeiYL---KGIHGPFLVIAPLSTIPNWEREFRTWT-ELN 1032
Cdd:cd17993     1 ELRDYQLTGLNWLAHSWCKGNNGILADEMGLGKTVQTISFLS--YLfhsQQQYGPFLVVVPLSTMPAWQREFAKWApDMN 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1033 VVVYHGSQASRRTIQLYEMYFkdPQGRVIKgsykFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:cd17993    79 VIVYLGDIKSRDTIREYEFYF--SQTKKLK----FNVLLTTYEIILKDKAFLGSIKWQYLAVDEAHRLKNDESLLYEALK 152
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688 1113 MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFmQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd17993   153 EFKTNNRLLITGTPLQNSLKELWALLHFLMPGKFDIWEEF-EEEHDEEQEKGIADLHKELEPFILRR 218
DEXHc_Snf cd17919
DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins; Sucrose Non-Fermenting ...
958-1143 6.08e-79

DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins; Sucrose Non-Fermenting (SNF) proteins DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350677 [Multi-domain]  Cd Length: 182  Bit Score: 259.04  E-value: 6.08e-79
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:cd17919     1 LRPYQLEGLNFLLELYENGPGGILADEMGLGKTLQAIAFLaYLLKEGKERGPVLVVCPLSVLENWEREFEKWTpDLRVVV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEmyfkdpqgrvikGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 1115
Cdd:cd17919    81 YHGSQRERAQIRAKE------------KLDKFDVVLTTYETLRRDKASLRKFRWDLVVVDEAHRLKNPKSQLSKALKALR 148
                         170       180
                  ....*....|....*....|....*...
gi 460838688 1116 LEHKVLLTGTPLQNTVEELFSLLHFLEP 1143
Cdd:cd17919   149 AKRRLLLTGTPLQNNLEELWALLDFLDP 176
DEXHc_HELLS_SMARCA6 cd18009
DEXH-box helicase domain of HELLS; HELLS (helicase, lymphoid specific, also known as Lsh or ...
955-1181 1.20e-73

DEXH-box helicase domain of HELLS; HELLS (helicase, lymphoid specific, also known as Lsh or SMARCA6) is a major epigenetic regulator crucial for normal heterochromatin structure and function. HELLS is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350767 [Multi-domain]  Cd Length: 236  Bit Score: 246.14  E-value: 1.20e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNV 1033
Cdd:cd18009     1 GGVMRPYQLEGMEWLRMLWENGINGILADEMGLGKTIQTIALLAHLRERGVWGPFLVIAPLSTLPNWVNEFARFTpSVPV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQASRRTIQlYEMYFKDPQGRvikgsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKM 1113
Cdd:cd18009    81 LLYHGTKEERERLR-KKIMKREGTLQ------DFPVVVTSYEIAMRDRKALQHYAWKYLIVDEGHRLKNLNCRLIQELKT 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1114 MDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG-----------DLKTEEQ----VQKLQAILKPMMLR 1178
Cdd:cd18009   154 FNSDNRLLLTGTPLQNNLSELWSLLNFLLPDVFDDLSSFESWFDfsslsdnaadiSNLSEEReqniVHMLHAILKPFLLR 233

                  ...
gi 460838688 1179 RLK 1181
Cdd:cd18009   234 RLK 236
DEXHc_CHD3_4_5 cd17994
DEAH-box helicase domain of the chromodomain helicase DNA binding proteins 3, 4 and 5; ...
958-1179 9.60e-73

DEAH-box helicase domain of the chromodomain helicase DNA binding proteins 3, 4 and 5; Chromodomain-helicase-DNA-binding protein 3 (CHD3) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. It is required for anchoring centrosomal pericentrin in both interphase and mitosis, for spindle organization and centrosome integrity. Chromodomain-helicase-DNA-binding protein 4 (CHD4) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. Chromodomain-helicase-DNA-binding protein 5 (CHD5) is a chromatin-remodeling protein that binds DNA through histones and regulates gene transcription. It is thought to specifically recognize and bind trimethylated 'Lys-27' (H3K27me3) and non-methylated 'Lys-4' of histone H3 and plays a role in the development of the nervous system by activating the expression of genes promoting neuron terminal differentiation. In parallel, it may also positively regulate the trimethylation of histone H3 at 'Lys-27' thereby specifically repressing genes that promote the differentiation into non-neuronal cell lineages. As a tumor suppressor, it regulates the expression of genes involved in cell proliferation and differentiation. In spermatogenesis, it probably regulates histone hyperacetylation and the replacement of histones by transition proteins in chromatin, a crucial step in the condensation of spermatid chromatin and the production of functional spermatozoa. CHD3, CHD4, and CHD5 are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350752 [Multi-domain]  Cd Length: 196  Bit Score: 241.96  E-value: 9.60e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKG-IHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:cd17994     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGhSKGPFLVSAPLSTIINWEREFEMWApDFYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSqasrrtiqlyemyfkdpqgrvikgsykfHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 1115
Cdd:cd17994    81 YVGD----------------------------HVLLTSYELISIDQAILGSIDWAVLVVDEAHRLKNNQSKFFRILNSYK 132
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 460838688 1116 LEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd17994   133 IGYKLLLTGTPLQNNLEELFHLLNFLTPERFNNLQGFLEEFADISKEDQIKKLHDLLGPHMLRR 196
DEXHc_CHD3 cd18055
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 3; ...
958-1179 6.87e-69

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 3; Chromodomain-helicase-DNA-binding protein 3 (CHD3) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. It is required for anchoring centrosomal pericentrin in both interphase and mitosis, for spindle organization and centrosome integrity. CHD3 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350813 [Multi-domain]  Cd Length: 232  Bit Score: 232.21  E-value: 6.87e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKG-IHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:cd18055     1 LHMYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGhTKGPFLVSAPLSTIINWEREFQMWApDFYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEMYFKDpqgRVIKGS-----------YKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRN 1104
Cdd:cd18055    81 YTGDKDSRAIIRENEFSFDD---NAVKGGkkafkmkreaqVKFHVLLTSYELVTIDQAALGSIRWACLVVDEAHRLKNNQ 157
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 460838688 1105 CKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18055   158 SKFFRVLNGYKIDHKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRR 232
DEXHc_SMARCA1_SMARCA5 cd17997
DEAH-box helicase domain of SMARCA1 and SMARCA5; SWI/SNF related, matrix associated, actin ...
955-1181 7.94e-69

DEAH-box helicase domain of SMARCA1 and SMARCA5; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 1 and 5 (SMARCA1 and SMARCA5) are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350755 [Multi-domain]  Cd Length: 222  Bit Score: 231.83  E-value: 7.94e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELN 1032
Cdd:cd17997     1 GGTMRDYQIRGLNWLISLFENGINGILADEMGLGKTLQTISLLgYLKHYKNINGPHLIIVPKSTLDNWMREFKRWCpSLR 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1033 VVVYHGSQASRRTIQlyemyfkdpQGRVIKGsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:cd17997    81 VVVLIGDKEERADII---------RDVLLPG--KFDVCITSYEMVIKEKTVLKKFNWRYIIIDEAHRIKNEKSKLSQIVR 149
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 460838688 1113 MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF----GDLKTEEQVQKLQAILKPMMLRRLK 1181
Cdd:cd17997   150 LFNSRNRLLLTGTPLQNNLHELWALLNFLLPDVFTSSEDFDEWFnvnnCDDDNQEVVQRLHKVLRPFLLRRIK 222
DEXHc_SMARCA2_SMARCA4 cd17996
DEXH-box helicase domain of SMARCA2 and SMARCA4; SWI/SNF related, matrix associated, actin ...
955-1181 4.66e-67

DEXH-box helicase domain of SMARCA2 and SMARCA4; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, members 2 and 4 (SMARCA2 and SMARCA4) are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350754 [Multi-domain]  Cd Length: 233  Bit Score: 227.25  E-value: 4.66e-67
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSI---TFLYEIylKGIHGPFLVIAPLSTIPNWEREFRTWT-E 1030
Cdd:cd17996     1 GGTLKEYQLKGLQWMVSLYNNNLNGILADEMGLGKTIQTIsliTYLMEK--KKNNGPYLVIVPLSTLSNWVSEFEKWApS 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1031 LNVVVYHGSQASRRTIQlyemyfkdPQGRvikgSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEG 1110
Cdd:cd17996    79 VSKIVYKGTPDVRKKLQ--------SQIR----AGKFNVLLTTYEYIIKDKPLLSKIKWKYMIIDEGHRMKNAQSKLTQT 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1111 LK-MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG------------DLKTEEQV---QKLQAILKP 1174
Cdd:cd17996   147 LNtYYHARYRLLLTGTPLQNNLPELWALLNFLLPKIFKSCKTFEQWFNtpfantgeqvkiELNEEETLliiRRLHKVLRP 226

                  ....*..
gi 460838688 1175 MMLRRLK 1181
Cdd:cd17996   227 FLLRRLK 233
DEXHc_CHD5 cd18057
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 5; ...
958-1179 1.26e-66

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 5; Chromodomain-helicase-DNA-binding protein 5 (CHD5) is a chromatin-remodeling protein that binds DNA through histones and regulates gene transcription. It is thought to specifically recognize and bind trimethylated 'Lys-27' (H3K27me3) and non-methylated 'Lys-4' of histone H3 and plays a role in the development of the nervous system by activating the expression of genes promoting neuron terminal differentiation. In parallel, it may also positively regulate the trimethylation of histone H3 at 'Lys-27' thereby specifically repressing genes that promote the differentiation into non-neuronal cell lineages. As a tumor suppressor, it regulates the expression of genes involved in cell proliferation and differentiation. In spermatogenesis, it probably regulates histone hyperacetylation and the replacement of histones by transition proteins in chromatin, a crucial step in the condensation of spermatid chromatin and the production of functional spermatozoa. CHD5 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350815 [Multi-domain]  Cd Length: 232  Bit Score: 226.10  E-value: 1.26e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKG-IHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:cd18057     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTVQTIVFLYSLYKEGhSKGPYLVSAPLSTIINWEREFEMWApDFYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEMYFKDPQGRVIKGSY--------KFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKL 1107
Cdd:cd18057    81 YTGDKESRSVIRENEFSFEDNAIRSGKKVFrmkkeaqiKFHVLLTSYELITIDQAILGSIEWACLVVDEAHRLKNNQSKF 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1108 LEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18057   161 FRVLNSYKIDYKLLLTGTPLQNNLEELFHLLNFLTPERFNNLEGFLEEFADISKEDQIKKLHDLLGPHMLRR 232
DEXHc_CHD4 cd18056
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 4; ...
958-1179 2.64e-66

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 4; Chromodomain-helicase-DNA-binding protein 4 (CHD4) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. CHD4 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350814 [Multi-domain]  Cd Length: 232  Bit Score: 224.94  E-value: 2.64e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKG-IHGPFLVIAPLSTIPNWEREFRTWT-ELNVVV 1035
Cdd:cd18056     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGhSKGPFLVSAPLSTIINWEREFEMWApDMYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRTIQLYEMYFKDPQGRVIK--------GSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKL 1107
Cdd:cd18056    81 YVGDKDSRAIIRENEFSFEDNAIRGGKkasrmkkeASVKFHVLLTSYELITIDMAILGSIDWACLIVDEAHRLKNNQSKF 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1108 LEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18056   161 FRVLNGYSLQHKLLLTGTPLQNNLEELFHLLNFLTPERFHNLEGFLEEFADIAKEDQIKKLHDMLGPHMLRR 232
DEXHc_CHD2 cd18054
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 2; ...
957-1179 2.33e-65

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 2; Chromodomain-helicase-DNA-binding protein 2 (CHD2) is a DNA-binding helicase that specifically binds to the promoter of target genes, leading to chromatin remodeling, possibly by promoting deposition of histone H3.3. It is involved in myogenesis via interaction with MYOD1; it binds to myogenic gene regulatory sequences and mediates incorporation of histone H3.3 prior to the onset of myogenic gene expression, promoting their expression. CHD2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350812 [Multi-domain]  Cd Length: 237  Bit Score: 222.57  E-value: 2.33e-65
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVV 1034
Cdd:cd18054    20 ELRDYQLEGLNWLAHSWCKNNSVILADEMGLGKTIQTISFLsYLFHQHQLYGPFLLVVPLSTLTSWQREFEIWApEINVV 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLYEmyFKDPQGRVIKgsykFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMM 1114
Cdd:cd18054   100 VYIGDLMSRNTIREYE--WIHSQTKRLK----FNALITTYEILLKDKTVLGSINWAFLGVDEAHRLKNDDSLLYKTLIDF 173
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 460838688 1115 DLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKtEEQVQKLQAILKPMMLRR 1179
Cdd:cd18054   174 KSNHRLLITGTPLQNSLKELWSLLHFIMPEKFEFWEDFEEDHGKGR-ENGYQSLHKVLEPFLLRR 237
DEXHc_CHD1L cd18006
DEAH/Q-box helicase domain of CHD1L; Chromodomain helicase DNA binding protein 1 like (CHD1L, ...
958-1179 4.24e-60

DEAH/Q-box helicase domain of CHD1L; Chromodomain helicase DNA binding protein 1 like (CHD1L, also known as ALC1) is involved in DNA repair by regulating chromatin relaxation following DNA damage. CHD1L is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350764 [Multi-domain]  Cd Length: 216  Bit Score: 206.52  E-value: 4.24e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLyeIYLKG---IHGPFLVIAPLSTIPNWEREFRTWT-ELNV 1033
Cdd:cd18006     1 LRPYQLEGVNWLLQCRAEQHGCILGDEMGLGKTCQTISLL--WYLAGrlkLLGPFLVLCPLSVLDNWKEELNRFApDLSV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQASRRTIQlyemyfkdpqgRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKM 1113
Cdd:cd18006    79 ITYMGDKEKRLDLQ-----------QDIKSTNRFHVLLTTYEICLKDASFLKSFPWASLVVDEAHRLKNQNSLLHKTLSE 147
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688 1114 MDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETT--FMQEFGDLKTE-EQVQKLQAILKPMMLRR 1179
Cdd:cd18006   148 FSVDFRLLLTGTPIQNSLQELYALLSFIEPNVFPKDKLddFIKAYSETDDEsETVEELHLLLQPFLLRR 216
DEXQc_arch_SWI2_SNF2 cd18012
DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins; Proteins belonging ...
955-1181 3.84e-59

DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins; Proteins belonging to SNF2 family of DNA dependent ATPases are important members of the chromatin remodeling complexes that are implicated in epigenetic control of gene expression. The Snf2 family comprises a large group of ATP-hydrolyzing proteins that are ubiquitous in eukaryotes, but also present in eubacteria and archaea. Archaeal SWI2 and SNF2 are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350770 [Multi-domain]  Cd Length: 218  Bit Score: 203.95  E-value: 3.84e-59
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNV 1033
Cdd:cd18012     2 KATLRPYQKEGFNWLSFLRHYGLGGILADDMGLGKTLQTLALLLSRKEEGRKGPSLVVAPTSLIYNWEEEAAKFApELKV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQASRRTIQLYEMYFkdpqgrvikgsykfhAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKM 1113
Cdd:cd18012    82 LVIHGTKRKREKLRALEDYD---------------LVITSYGLLRRDIELLKEVKFHYLVLDEAQNIKNPQTKTAKAVKA 146
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688 1114 MDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG----DLKTEEQVQKLQAILKPMMLRRLK 1181
Cdd:cd18012   147 LKADHRLALTGTPIENHLGELWSIFDFLNPGLLGSYKRFKKRFAkpieKDGDEEALEELKKLISPFILRRLK 218
DEXQc_SRCAP cd18003
DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or ...
958-1179 1.12e-57

DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or DOMO1) is the core catalytic component of the multiprotein chromatin-remodeling SRCAP complex, that is necessary for the incorporation of the histone variant H2A.Z into nucleosomes. SRCAP is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350761 [Multi-domain]  Cd Length: 223  Bit Score: 199.89  E-value: 1.12e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVVV 1035
Cdd:cd18003     1 LREYQHIGLDWLATLYEKNLNGILADEMGLGKTIQTIALLaHLACEKGNWGPHLIVVPTSVMLNWEMEFKRWCPgFKILT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1036 YHGSQASRRtiqlyemyfkdpqgRVIKGSYK---FHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:cd18003    81 YYGSAKERK--------------LKRQGWMKpnsFHVCITSYQLVVQDHQVFKRKKWKYLILDEAHNIKNFKSQRWQTLL 146
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688 1113 MMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTF----------MQEFGDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18003   147 NFNTQRRLLLTGTPLQNSLMELWSLMHFLMPHIFQSHQEFkewfsnpltaMSEGSQEENEELVRRLHKVLRPFLLRR 223
SF2_C_SNF cd18793
C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) ...
1280-1405 2.96e-57

C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) family includes chromatin-remodeling factors, such as CHD proteins and SMARCA proteins, recombination proteins Rad54, and many others. They are DEAD-like helicases belonging to superfamily (SF)2, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF1 helicases, SF2 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350180 [Multi-domain]  Cd Length: 135  Bit Score: 195.00  E-value: 2.96e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1280 GKLVLIDKLLPKLKAGGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKPDSDrFVFLLCTRAG 1359
Cdd:cd18793    11 GKLEALLELLEELREPGEKVLIFSQFTDTLDILEEALRERGIKYLRLDGSTSSKERQKLVDRFNEDPDI-RVFLLSTKAG 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 460838688 1360 GLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLI 1405
Cdd:cd18793    90 GVGLNLTAANRVILYDPWWNPAVEEQAIDRAHRIGQKKPVVVYRLI 135
DEXHc_CHD1 cd18053
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 1; ...
957-1179 5.86e-53

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 1; Chromodomain-helicase-DNA-binding protein 1 (CHD1) is an ATP-dependent chromatin-remodeling factor which functions as substrate recognition component of the transcription regulatory histone acetylation (HAT) complex SAGA. It regulates polymerase II transcription and is also required for efficient transcription by RNA polymerase I, and more specifically the polymerase I transcription termination step. It is not only involved in transcription-related chromatin-remodeling, but also required to maintain a specific chromatin configuration across the genome. CHD1 is also associated with histone deacetylase (HDAC) activity. It is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350811 [Multi-domain]  Cd Length: 237  Bit Score: 186.79  E-value: 5.86e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  957 KLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVV 1034
Cdd:cd18053    20 ELRDYQLNGLNWLAHSWCKGNSCILADEMGLGKTIQTISFLnYLFHEHQLYGPFLLVVPLSTLTSWQREIQTWApQMNAV 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLYEmyFKDPQGRVIKgsykFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMM 1114
Cdd:cd18053   100 VYLGDINSRNMIRTHE--WMHPQTKRLK----FNILLTTYEILLKDKSFLGGLNWAFIGVDEAHRLKNDDSLLYKTLIDF 173
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 460838688 1115 DLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLKtEEQVQKLQAILKPMMLRR 1179
Cdd:cd18053   174 KSNHRLLITGTPLQNSLKELWSLLHFIMPEKFSSWEDFEEEHGKGR-EYGYASLHKELEPFLLRR 237
DEXHc_SMARCAD1 cd17998
DEXH-box helicase domain of SMARCAD1; SWI/SNF-related matrix-associated actin-dependent ...
958-1146 1.67e-52

DEXH-box helicase domain of SMARCAD1; SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1 (SMARCAD1, also known as ATP-dependent helicase 1 or Hel1) possesses intrinsic ATP-dependent nucleosome-remodeling activity and is required for both DNA repair and heterochromatin organization. SMARCAD1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350756 [Multi-domain]  Cd Length: 187  Bit Score: 183.36  E-value: 1.67e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWT-ELNVVVY 1036
Cdd:cd17998     1 LKDYQLIGLNWLNLLYQKKLSGILADEMGLGKTIQVIAFLAYLKEIGIPGPHLVVVPSSTLDNWLREFKRWCpSLKVEPY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1037 HGSQASRRTIqlyemyfkdpQGRVIKGSYKFHAIITTFEMILTDCPE---LRNIPWRCVVIDEAHRLKNRNCKLLEGLKM 1113
Cdd:cd17998    81 YGSQEERKHL----------RYDILKGLEDFDVIVTTYNLATSNPDDrsfFKRLKLNYVVYDEGHMLKNMTSERYRHLMT 150
                         170       180       190
                  ....*....|....*....|....*....|...
gi 460838688 1114 MDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRF 1146
Cdd:cd17998   151 INANFRLLLTGTPLQNNLLELMSLLNFIMPKPF 183
DEXHc_ERCC6L2 cd18005
DEXH-box helicase domain of ERCC6L2; ERCC excision repair 6 like 2 (ERCC6L2, also known as ...
958-1179 1.96e-52

DEXH-box helicase domain of ERCC6L2; ERCC excision repair 6 like 2 (ERCC6L2, also known as RAD26L) may play a role in DNA repair and mitochondrial function. In humans, mutations in the ERCC6L2 gene are associated with bone marrow failure syndrome 2. ERCC6L2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350763 [Multi-domain]  Cd Length: 245  Bit Score: 185.66  E-value: 1.96e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLlFNWY-NMRNCILADEMGLGKTIQSITFLYEIYLK---------------------GIHGPFLVIAPL 1015
Cdd:cd18005     1 LRDYQREGVEFM-YDLYkNGRGGILGDDMGLGKTVQVIAFLAAVLGKtgtrrdrennrprfkkkppasSAKKPVLIVAPL 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1016 STIPNWEREFRTWTELNVVVYHGSqasRRTiqlyemyfKDPQGRVIKGSYKFhaIITTFEMILTDCPELRNIPWRCVVID 1095
Cdd:cd18005    80 SVLYNWKDELDTWGHFEVGVYHGS---RKD--------DELEGRLKAGRLEV--VVTTYDTLRRCIDSLNSINWSAVIAD 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1096 EAHRLKNRNCKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGD---------------LK 1160
Cdd:cd18005   147 EAHRIKNPKSKLTQAMKELKCKVRIGLTGTLLQNNMKELWCLLDWAVPGALGSRSQFKKHFSEpikrgqrhtatarelRL 226
                         250
                  ....*....|....*....
gi 460838688 1161 TEEQVQKLQAILKPMMLRR 1179
Cdd:cd18005   227 GRKRKQELAVKLSKFFLRR 245
DEXQc_INO80 cd18002
DEAQ-box helicase domain of INO80; INO80 is the catalytic ATPase subunit of the INO80 ...
958-1179 1.55e-51

DEAQ-box helicase domain of INO80; INO80 is the catalytic ATPase subunit of the INO80 chromatin remodeling complex. INO80 removes histone H3-containing nucleosomes from associated chromatin, promotes CENP-ACnp1 chromatin assembly at the centromere in a redundant manner with another chromatin-remodeling factor Chd1Hrp1. INO80 mutants have severe defects in oxygen consumption and promiscuous cell division that is no longer coupled with metabolic status. INO80 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350760 [Multi-domain]  Cd Length: 229  Bit Score: 182.32  E-value: 1.55e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLfNWYNMR-NCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREF-RTWTELNVV 1034
Cdd:cd18002     1 LKEYQLKGLNWLA-NLYEQGiNGILADEMGLGKTVQSIAVLaHLAEEHNIWGPFLVIAPASTLHNWQQEIsRFVPQFKVL 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1035 VYHGSQASRRTIQLY----EMYFKDPqgrvikgsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEG 1110
Cdd:cd18002    80 PYWGNPKDRKVLRKFwdrkNLYTRDA---------PFHVVITSYQLVVQDEKYFQRVKWQYMVLDEAQAIKSSSSSRWKT 150
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688 1111 LKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG-DLKT---------EEQVQKLQAILKPMMLRR 1179
Cdd:cd18002   151 LLSFHCRNRLLLTGTPIQNSMAELWALLHFIMPTLFDSHDEFNEWFSkDIEShaenktglnEHQLKRLHMILKPFMLRR 229
DEXHc_SMARCA4 cd18062
DEXH-box helicase domain of SMARCA4; SWI/SNF related, matrix associated, actin dependent ...
955-1181 8.68e-50

DEXH-box helicase domain of SMARCA4; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 4 (SMARCA4, also known as transcription activator BRG1) is a component of the CREST-BRG1 complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. Mutation of SMARCA4 (BRG1), the ATPase of BAF (mSWI/SNF) and PBAF complexes, contributes to a range of malignancies and neurologic disorders. SMARCA4 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350820 [Multi-domain]  Cd Length: 251  Bit Score: 178.31  E-value: 8.68e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNV 1033
Cdd:cd18062    21 NGVLKQYQIKGLEWLVSLYNNNLNGILADEMGLGKTIQTIALItYLMEHKRINGPFLIIVPLSTLSNWVYEFDKWAPSVV 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 -VVYHGSQASRRTIQlyemyfkdPQGRvikgSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:cd18062   101 kVSYKGSPAARRAFV--------PQLR----SGKFNVLLTTYEYIIKDKQILAKIRWKYMIVDEGHRMKNHHCKLTQVLN 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1113 MMDLE-HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG----------DLKTEEQ---VQKLQAILKPMMLR 1178
Cdd:cd18062   169 THYVApRRLLLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNapfamtgekvDLNEEETiliIRRLHKVLRPFLLR 248

                  ...
gi 460838688 1179 RLK 1181
Cdd:cd18062   249 RLK 251
DEXHc_SMARCA2 cd18063
DEXH-box helicase domain of SMARCA2; SWI/SNF related, matrix associated, actin dependent ...
955-1181 2.14e-48

DEXH-box helicase domain of SMARCA2; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 2 (SMARCA2, also known as brahma homolog) is a component of the BAF complex. SMARCA2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350821 [Multi-domain]  Cd Length: 251  Bit Score: 174.48  E-value: 2.14e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  955 NNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNV 1033
Cdd:cd18063    21 NGTLKHYQLQGLEWMVSLYNNNLNGILADEMGLGKTIQTIALItYLMEHKRLNGPYLIIVPLSTLSNWTYEFDKWAPSVV 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 -VVYHGSQASRRTIQlyemyfkdPQGRvikgSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLK 1112
Cdd:cd18063   101 kISYKGTPAMRRSLV--------PQLR----SGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMKNHHCKLTQVLN 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1113 MMDLE-HKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG----------DLKTEEQ---VQKLQAILKPMMLR 1178
Cdd:cd18063   169 THYVApRRILLTGTPLQNKLPELWALLNFLLPTIFKSCSTFEQWFNapfamtgervDLNEEETiliIRRLHKVLRPFLLR 248

                  ...
gi 460838688 1179 RLK 1181
Cdd:cd18063   249 RLK 251
DEXHc_SMARCA5 cd18064
DEAH-box helicase domain of SMARCA5; SWI/SNF related, matrix associated, actin dependent ...
947-1192 7.00e-48

DEAH-box helicase domain of SMARCA5; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 (SMARCA5, also called SNF2H) is the catalytic subunit of the four known chromatin-remodeling complexes: CHRAC, RSF, ACF/WCRF, and WICH. SMARCA5 plays a major role organising arrays of nucleosomes adjacent to the binding sites for the architectural transcription factor CTCF sites and acts to promote CTCF binding SMARCA5 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350822 [Multi-domain]  Cd Length: 244  Bit Score: 172.54  E-value: 7.00e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  947 ESSREYKNNNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREF 1025
Cdd:cd18064     5 EDSPSYVKWGKLRDYQVRGLNWLISLYENGINGILADEMGLGKTLQTISLLgYMKHYRNIPGPHMVLVPKSTLHNWMAEF 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1026 RTWT-ELNVVVYHGSQASRRTiqlyemYFKDPqgrVIKGSYKfhAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRN 1104
Cdd:cd18064    85 KRWVpTLRAVCLIGDKDQRAA------FVRDV---LLPGEWD--VCVTSYEMLIKEKSVFKKFNWRYLVIDEAHRIKNEK 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1105 CKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGD---LKTEEQVQKLQAILKPMMLRRLK 1181
Cdd:cd18064   154 SKLSEIVREFKTTNRLLLTGTPLQNNLHELWALLNFLLPDVFNSAEDFDSWFDTnncLGDQKLVERLHMVLRPFLLRRIK 233
                         250
                  ....*....|.
gi 460838688 1182 EDVEKNLAPKE 1192
Cdd:cd18064   234 ADVEKSLPPKK 244
DEXDc_SHPRH-like cd18008
DEXH-box helicase domain of SHPRH-like proteins; The SHPRH-like subgroup belongs to the ...
958-1179 1.86e-45

DEXH-box helicase domain of SHPRH-like proteins; The SHPRH-like subgroup belongs to the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350766 [Multi-domain]  Cd Length: 241  Bit Score: 165.54  E-value: 1.86e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFnwynmRNCILADEMGLGKTIQSI-------------TFLYEIYLKGIHGPF-----LVIAPLSTIP 1019
Cdd:cd18008     1 LLPYQKQGLAWMLP-----RGGILADEMGLGKTIQALalilatrpqdpkiPEELEENSSDPKKLYlskttLIVVPLSLLS 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1020 NWEREFRTWTE---LNVVVYHGSQASRRTIQLYEmyfkdpqgrvikgsykFHAIITTFEMILTDCPE------------- 1083
Cdd:cd18008    76 QWKDEIEKHTKpgsLKVYVYHGSKRIKSIEELSD----------------YDIVITTYGTLASEFPKnkkgggrdskeke 139
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1084 ---LRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGDLK 1160
Cdd:cd18008   140 aspLHRIRWYRVILDEAHNIKNRSTKTSRAVCALKAERRWCLTGTPIQNSLDDLYSLLRFLRVEPFGDYPWFNSDISKPF 219
                         250       260
                  ....*....|....*....|..
gi 460838688 1161 TE---EQVQKLQAILKPMMLRR 1179
Cdd:cd18008   220 SKndrKALERLQALLKPILLRR 241
DEXHc_SMARCA1 cd18065
DEAH-box helicase domain of SMARCA1; SWI/SNF related, matrix associated, actin dependent ...
947-1181 5.96e-44

DEAH-box helicase domain of SMARCA1; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 1 (SMARCA1, also called SNF2L) is a component of NURF (nucleosome-remodeling factor) and CERF (CECR2-containing-remodeling factor) complexes which promote the perturbation of chromatin structure in an ATP-dependent manner. SMARCA1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350823 [Multi-domain]  Cd Length: 233  Bit Score: 160.95  E-value: 5.96e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  947 ESSREYKNNNKLREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFL-YEIYLKGIHGPFLVIAPLSTIPNWEREF 1025
Cdd:cd18065     5 EESPSYVKGGTLRDYQVRGLNWMISLYENGVNGILADEMGLGKTLQTIALLgYLKHYRNIPGPHMVLVPKSTLHNWMNEF 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1026 RTWT-ELNVVVYHGSQASRRTIQLYEMyfkdpqgrvIKGSYKfhAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRN 1104
Cdd:cd18065    85 KRWVpSLRAVCLIGDKDARAAFIRDVM---------MPGEWD--VCVTSYEMVIKEKSVFKKFNWRYLVIDEAHRIKNEK 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1105 CKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGD---LKTEEQVQKLQAILKPMMLRRLK 1181
Cdd:cd18065   154 SKLSEIVREFKTTNRLLLTGTPLQNNLHELWALLNFLLPDVFNSADDFDSWFDTkncLGDQKLVERLHAVLKPFLLRRIK 233
DEXHc_ERCC6L cd18001
DEXH-box helicase domain of ERCC6L; ERCC excision repair 6 like, spindle assembly checkpoint ...
958-1179 7.47e-41

DEXH-box helicase domain of ERCC6L; ERCC excision repair 6 like, spindle assembly checkpoint helicase (ERCC6L, also known as RAD26L) is an essential component of the mitotic spindle assembly checkpoint, by acting as a tension sensor that associates with catenated DNA which is stretched under tension until it is resolved during anaphase. ERCC6L is proposed to stimulate cancer cell proliferation by promoting cell cycle through a way of RAB31-MAPK-CDK2. ERCC6L is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350759 [Multi-domain]  Cd Length: 232  Bit Score: 151.76  E-value: 7.47e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVVVY 1036
Cdd:cd18001     1 LYPHQREGVAWLWSLHDGGKGGILADDMGLGKTVQICAFLSGMFDSGLIKSVLVVMPTSLIPHWVKEFAKWTPgLRVKVF 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1037 HGSQASRRTIQLYemyfkdpqgRVIKGsykFHAIITTFEMILTDCPEL-----RNIPWRCVVIDEAHRLKNRNCKLLEGL 1111
Cdd:cd18001    81 HGTSKKERERNLE---------RIQRG---GGVLLTTYGMVLSNTEQLsaddhDEFKWDYVILDEGHKIKNSKTKSAKSL 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1112 KMMDLEHKVLLTGTPLQNTVEELFSLLHFLEP-SRFPSETTFMQEF------GDLKTEEQVQK---------LQAILKPM 1175
Cdd:cd18001   149 REIPAKNRIILTGTPIQNNLKELWALFDFACNgSLLGTRKTFKMEFenpitrGRDKDATQGEKalgsevaenLRQIIKPY 228

                  ....
gi 460838688 1176 MLRR 1179
Cdd:cd18001   229 FLRR 232
DEXHc_RAD54 cd18004
DEXH-box helicase domain of RAD54; RAD54 proteins play a role in recombination. They are ...
958-1179 1.12e-39

DEXH-box helicase domain of RAD54; RAD54 proteins play a role in recombination. They are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350762 [Multi-domain]  Cd Length: 240  Bit Score: 148.59  E-value: 1.12e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLL-----FNWYNMRNCILADEMGLGKTIQSITFLYeIYLKgiHGPF--------LVIAPLSTIPNWERE 1024
Cdd:cd18004     1 LRPHQREGVQFLYdcltgRRGYGGGGAILADEMGLGKTLQAIALVW-TLLK--QGPYgkptakkaLIVCPSSLVGNWKAE 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1025 FRTWTELNVVVYHGSQASRRTIQLyEMYFKDPQGRvikgsykFHAIITTFEMILTDCPELrNIPWRC--VVIDEAHRLKN 1102
Cdd:cd18004    78 FDKWLGLRRIKVVTADGNAKDVKA-SLDFFSSAST-------YPVLIISYETLRRHAEKL-SKKISIdlLICDEGHRLKN 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1103 RNCKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFGD--LK------TEEQV-------QK 1167
Cdd:cd18004   149 SESKTTKALNSLPCRRRLLLTGTPIQNDLDEFFALVDFVNPGILGSLASFRKVFEEpiLRsrdpdaSEEDKelgaersQE 228
                         250
                  ....*....|..
gi 460838688 1168 LQAILKPMMLRR 1179
Cdd:cd18004   229 LSELTSRFILRR 240
DEXHc_Mot1 cd17999
DEXH-box helicase domain of Mot1; Modifier of transcription 1 (Mot1, also known as TAF172 in ...
958-1179 1.48e-38

DEXH-box helicase domain of Mot1; Modifier of transcription 1 (Mot1, also known as TAF172 in eukaryotes) regulates transcription in association with TATA binding protein (TBP). Mot1, Ino80C, and NC2 function coordinately to regulate pervasive transcription in yeast and mammals. Mot1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350757 [Multi-domain]  Cd Length: 232  Bit Score: 145.19  E-value: 1.48e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLF-NWYNMRNcILADEMGLGKTIQSITFLY------EIYLKGIHGPFLVIAPLSTIPNWEREFRTWTE 1030
Cdd:cd17999     1 LRPYQQEGINWLAFlNKYNLHG-ILCDDMGLGKTLQTLCILAsdhhkrANSFNSENLPSLVVCPPTLVGHWVAEIKKYFP 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1031 ---LNVVVYHGSQASRRTIQlyemyfkdpqgrviKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKL 1107
Cdd:cd17999    80 nafLKPLAYVGPPQERRRLR--------------EQGEKHNVIVASYDVLRNDIEVLTKIEWNYCVLDEGHIIKNSKTKL 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1108 LEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEFG-------DLK-----TEEQVQKLQAILK-- 1173
Cdd:cd17999   146 SKAVKQLKANHRLILSGTPIQNNVLELWSLFDFLMPGYLGTEKQFQRRFLkpilasrDSKasakeQEAGALALEALHKqv 225

                  ....*..
gi 460838688 1174 -PMMLRR 1179
Cdd:cd17999   226 lPFLLRR 232
DEXHc_ERCC6 cd18000
DEXH-box helicase domain of ERCC6; ERCC excision repair 6, chromatin remodeling factor (ERCC6, ...
958-1143 2.93e-37

DEXH-box helicase domain of ERCC6; ERCC excision repair 6, chromatin remodeling factor (ERCC6, also known Cockayne syndrome group B (CSB), Rad26 in Saccharomyces cerevisiae, and Rhp26 in Schizosaccharomyces pombe) is a DNA-binding protein that is important in transcription-coupled excision repair. ERCC6 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350758 [Multi-domain]  Cd Length: 193  Bit Score: 140.15  E-value: 2.93e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLfnWYNMRNC--ILADEMGLGKTIQSITFLYEI-YLKGIHGPFLVIAPLSTIPNWEREFRTW-TELNV 1033
Cdd:cd18000     1 LFKYQQTGVQWLW--ELHCQRVggILGDEMGLGKTIQIIAFLAALhHSKLGLGPSLIVCPATVLKQWVKEFHRWwPPFRV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQASRRTIQLYEMYFKDPQgRVIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKM 1113
Cdd:cd18000    79 VVLHSSGSGTGSEEKLGSIERKSQ-LIRKVVGDGGILITTYEGFRKHKDLLLNHNWQYVILDEGHKIRNPDAEITLACKQ 157
                         170       180       190
                  ....*....|....*....|....*....|
gi 460838688 1114 MDLEHKVLLTGTPLQNTVEELFSLLHFLEP 1143
Cdd:cd18000   158 LRTPHRLILSGTPIQNNLKELWSLFDFVFP 187
CD2_tandem_CHD5-9_like cd18663
repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
869-926 4.85e-35

repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349310 [Multi-domain]  Cd Length: 59  Bit Score: 128.56  E-value: 4.85e-35
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688  869 PDYVEVDRIMDFARSTD-DRGEPVIHYLVKWCSLPYEDSTWELKQDIDQTKIEEFEKLM 926
Cdd:cd18663     1 PDYVEVDRILDVSVSTDpNTGEPVTHYLVKWCSLPYEDSTWELEEDVDPAKIEEFEKLR 59
DEXHc_ATRX-like cd18007
DEXH-box helicase domain of ATRX-like proteins; This family includes ATRX-like members such as ...
958-1156 4.33e-34

DEXH-box helicase domain of ATRX-like proteins; This family includes ATRX-like members such as transcriptional regulator ATRX (also called alpha thalassemia/mental retardation syndrome X-linked and X-linked nuclear protein or XNP) which is involved in transcriptional regulation and chromatin remodeling, and ARIP4 (also called androgen receptor-interacting protein 4, RAD54 like 2 or RAD54L2) which modulates androgen receptor (AR)-dependent transactivation in a promoter-dependent manner. They are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350765 [Multi-domain]  Cd Length: 239  Bit Score: 132.42  E-value: 4.33e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNwllFNWYNM----------RNCILADEMGLGKTIQSITFLYeIYLKgiHGP----FLVIAPLSTIPNWER 1023
Cdd:cd18007     1 LKPHQVEGVR---FLWSNLvgtdvgsdegGGCILAHTMGLGKTLQVITFLH-TYLA--AAPrrsrPLVLCPASTLYNWED 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1024 EFRTWT---ELNVVVYHGSQASRRTIQL-----------------YEMYFKDPQGRvIKGSYKFHAIITTfemILTDCPE 1083
Cdd:cd18007    75 EFKKWLppdLRPLLVLVSLSASKRADARlrkinkwhkeggvlligYELFRNLASNA-TTDPRLKQEFIAA---LLDPGPD 150
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 460838688 1084 LrnipwrcVVIDEAHRLKNRNCKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF 1156
Cdd:cd18007   151 L-------LVLDEGHRLKNEKSQLSKALSKVKTKRRILLTGTPLQNNLKEYWTMVDFARPKYLGTLKEFKKKF 216
DEXDc smart00487
DEAD-like helicases superfamily;
951-1149 2.62e-31

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 123.37  E-value: 2.62e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    951 EYKNNNKLREYQLEGVNWLLFNWynmRNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPL-STIPNWEREFRTWT 1029
Cdd:smart00487    2 EKFGFEPLRPYQKEAIEALLSGL---RDVILAAPTGSGKTLAALLPALEALKRGKGGRVLVLVPTrELAEQWAEELKKLG 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   1030 E---LNVVVYHGSQASRRTIQlyemyfkdpqgRVIKGsyKFHAIITTFEMILTDCPE--LRNIPWRCVVIDEAHRLKN-- 1102
Cdd:smart00487   79 PslgLKVVGLYGGDSKREQLR-----------KLESG--KTDILVTTPGRLLDLLENdkLSLSNVDLVILDEAHRLLDgg 145
                           170       180       190       200       210
                    ....*....|....*....|....*....|....*....|....*....|.
gi 460838688   1103 RNCKLLEGLKMMDLE-HKVLLTGTP---LQNTVEELFSLLHFLEPSRFPSE 1149
Cdd:smart00487  146 FGDQLEKLLKLLPKNvQLLLLSATPpeeIENLLELFLNDPVFIDVGFTPLE 196
DEXHc_RAD54B cd18066
DEXH-box helicase domain of RAD54B; DNA repair and recombination protein RAD54B, also known as ...
958-1156 2.03e-29

DEXH-box helicase domain of RAD54B; DNA repair and recombination protein RAD54B, also known as RDH54, binds to double-stranded DNA, displays ATPase activity in the presence of DNA, and may have a role in meiotic and mitotic recombination. RAD54B is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350824 [Multi-domain]  Cd Length: 235  Bit Score: 119.18  E-value: 2.03e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMR-----NCILADEMGLGKTIQSITFLYEIYLKGIHGP------FLVIAPLSTIPNWEREFR 1026
Cdd:cd18066     1 LRPHQREGIEFLYECVMGMRvnerfGAILADEMGLGKTLQCISLIWTLLRQGPYGGkpvikrALIVTPGSLVKNWKKEFQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1027 TWTelnvvvyhgsqaSRRTIQLYEMYFKDPQGRVIKGSYkFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCK 1106
Cdd:cd18066    81 KWL------------GSERIKVFTVDQDHKVEEFIASPL-YSVLIISYEMLLRSLDQISKLNFDLVICDEGHRLKNTSIK 147
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 460838688 1107 LLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF 1156
Cdd:cd18066   148 TTTALTSLSCERRIILTGTPIQNDLQEFFALIDFVNPGILGSLSTYRKVY 197
DEXHc_HARP_SMARCAL1 cd18010
DEXH-box helicase domain of SMARCAL1; SMARCAL1 (SWI/SNF related, matrix associated, actin ...
958-1179 4.39e-27

DEXH-box helicase domain of SMARCAL1; SMARCAL1 (SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a like 1, also known as HARP) is recruited to stalled replication forks to promote repair and helps restart replication. It plays a role in DNA repair, telomere maintenance and replication fork stability in response to DNA replication stress. Mutations cause Schimke Immunoosseous Dysplasia. SMARCAL1 is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350768 [Multi-domain]  Cd Length: 213  Bit Score: 111.53  E-value: 4.39e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNwyNMRnCILADEMGLGKTIQSITFLYeiYLKGiHGPFLVIAPLSTIPNWEREFRTW----TELNV 1033
Cdd:cd18010     1 LLPFQREGVCFALRR--GGR-VLIADEMGLGKTVQAIAIAA--YYRE-EWPLLIVCPSSLRLTWADEIERWlpslPPDDI 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQasrrtiqlyeMYFKDPQGRVikgsykfhaIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRN---CKLLEG 1110
Cdd:cd18010    75 QVIVKSK----------DGLRDGDAKV---------VIVSYDLLRRLEKQLLARKFKVVICDESHYLKNSKakrTKAALP 135
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688 1111 LkMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTF--------MQEFG-DLKTEEQVQKLQAIL-KPMMLRR 1179
Cdd:cd18010   136 L-LKRAKRVILLSGTPALSRPIELFTQLDALDPKLFGRFHDFgrrycaakQGGFGwDYSGSSNLEELHLLLlATIMIRR 213
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
1280-1394 2.12e-26

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 105.76  E-value: 2.12e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1280 GKLVLIDKLLPKLKagGHRVLIFSQMVRCLDilEDYLIQRR-YPYERIDGRVRGNLRQAAIDRFSKPDSDrfvFLLCTRA 1358
Cdd:pfam00271    1 EKLEALLELLKKER--GGKVLIFSQTKKTLE--AELLLEKEgIKVARLHGDLSQEEREEILEDFRKGKID---VLVATDV 73
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 460838688  1359 GGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIG 1394
Cdd:pfam00271   74 AERGLDLPDVDLVINYDLPWNPASYIQRIGRAGRAG 109
DEXHc_RAD54A cd18067
DEXH-box helicase domain of RAD54A; DNA repair and recombination protein RAD54A, also known as ...
958-1179 3.65e-25

DEXH-box helicase domain of RAD54A; DNA repair and recombination protein RAD54A, also known as RAD54L or RAD54, plays a role in homologous recombination related repair of DNA double-strand breaks. RAD54A is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350825 [Multi-domain]  Cd Length: 243  Bit Score: 106.79  E-value: 3.65e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNMRN-----CILADEMGLGKTIQSITFLYEIYLKGIHGP-----FLVIAPLSTIPNWEREFRT 1027
Cdd:cd18067     1 LRPHQREGVKFLYRCVTGRRIrgshgCIMADEMGLGKTLQCITLMWTLLRQSPQCKpeidkAIVVSPSSLVKNWANELGK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1028 WTELNVVVYHGSQASRRTIQLYEMYFKDPQGRVIKGSykfhAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKL 1107
Cdd:cd18067    81 WLGGRLQPLAIDGGSKKEIDRKLVQWASQQGRRVSTP----VLIISYETFRLHVEVLQKGEVGLVICDEGHRLKNSDNQT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1108 LEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF---------GD------LKTEEQVQKLQAIL 1172
Cdd:cd18067   157 YQALDSLNTQRRVLLSGTPIQNDLSEYFSLVNFVNPGILGTAAEFKKNFelpilkgrdADasekerQLGEEKLQELISIV 236

                  ....*..
gi 460838688 1173 KPMMLRR 1179
Cdd:cd18067   237 NRCIIRR 243
CD1_tandem_CHD5-9_like cd18668
repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, ...
786-850 1.06e-24

repeat 1 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 5-9, and similar proteins; Repeat 1 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD5, CHD6, CHD7, CHD8, and CHD9. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. CHD6, CHD7, and CHD8 enzymes have been demonstrated to have different substrate specificities and remodeling activities. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349315 [Multi-domain]  Cd Length: 68  Bit Score: 99.33  E-value: 1.06e-24
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688  786 DAEGPVVEKIMSSRLVKKQKESGEEVEV-EEFYVKYKNFSYLHCQWASVEDLEK-DKRIQQKIKRFK 850
Cdd:cd18668     1 EEDTMIIEKILASRKKKKEKEEGAEEIEvEEYLVKYKNFSYLHCEWKTEEELEKgDKRIKQKIKRFK 67
DEXHc_HLTF1_SMARC3 cd18071
DEXH-box helicase domain of HLTF1; Helicase like transcription factor (HLTF1, also known as ...
975-1179 1.87e-23

DEXH-box helicase domain of HLTF1; Helicase like transcription factor (HLTF1, also known as HIP116 or SMARCA3) has both helicase and E3 ubiquitin ligase activities and ATP-dependent nucleosome-remodeling activity. HLTF1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350829 [Multi-domain]  Cd Length: 239  Bit Score: 101.78  E-value: 1.87e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  975 NMRNCILADEMGLGKTIQSITFLyeiylkgIHGPFLVIAPLSTIPNWEREFRTWTE---LNVVVYHGSQASRRTIQLYem 1051
Cdd:cd18071    47 LVRGGILADDMGLGKTLTTISLI-------LANFTLIVCPLSVLSNWETQFEEHVKpgqLKVYTYHGGERNRDPKLLS-- 117
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1052 yfkdpqgrvikgsyKFHAIITTFEMILTD-----CPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLEHKVLLTGTP 1126
Cdd:cd18071   118 --------------KYDIVLTTYNTLASDfgakgDSPLHTINWLRVVLDEGHQIRNPNAQQTKAVLNLSSERRWVLTGTP 183
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 460838688 1127 LQNTVEELFSLLHFLEPSRFPSETTFMQEFG---DLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18071   184 IQNSPKDLGSLLSFLHLKPFSNPEYWRRLIQrplTMGDPTGLKRLQVLMKQITLRR 239
DEXHc_ATRX cd18068
DEXH-box helicase domain of ATRX; Transcriptional regulator ATRX (also called alpha ...
979-1156 4.17e-23

DEXH-box helicase domain of ATRX; Transcriptional regulator ATRX (also called alpha thalassemia/mental retardation syndrome X-linked and X-linked nuclear protein or XNP) is involved in transcriptional regulation and chromatin remodeling. Mutations in humans cause mental retardation, X-linked, syndromic, with hypotonic facies 1 (MRXSHF1) and alpha-thalassemia myelodysplasia syndrome (ATMDS). ATRX is part of the a DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350826 [Multi-domain]  Cd Length: 246  Bit Score: 101.12  E-value: 4.17e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  979 CILADEMGLGKTIQSITFLYEIYLKGIHGPF---LVIAPLSTIPNWEREFRTWTELNvvvyhgsqASRRTIQLYEM-YFK 1054
Cdd:cd18068    31 CILAHCMGLGKTLQVVTFLHTVLLCEKLENFsrvLVVCPLNTVLNWLNEFEKWQEGL--------KDEEKIEVNELaTYK 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1055 DPQGRVIKGSYKFH---AIITTFEM--ILT--DCPELR------------NIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 1115
Cdd:cd18068   103 RPQERSYKLQRWQEeggVMIIGYDMyrILAqeRNVKSReklkeifnkalvDPGPDFVVCDEGHILKNEASAVSKAMNSIR 182
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 460838688 1116 LEHKVLLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQEF 1156
Cdd:cd18068   183 TKRRIVLTGTPLQNNLIEYHCMVNFVKPNLLGTIKEFRNRF 223
DEXDc_RapA cd18011
DEXH-box helicase domain of RapA; In bacteria, RapA is an RNA polymerase (RNAP)-associated ...
958-1179 9.54e-23

DEXH-box helicase domain of RapA; In bacteria, RapA is an RNA polymerase (RNAP)-associated SWI2/SNF2 (switch/sucrose non-fermentable) protein that mediates RNAP recycling during transcription. The ATPase activity of RapA is stimulated by its interaction with RNAP and inhibited by its N-terminal domain. The conformational changes of RapA and its interaction with RNAP are essential for RNAP recycling. RapA is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350769 [Multi-domain]  Cd Length: 207  Bit Score: 98.90  E-value: 9.54e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNWYNmrNCILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVyh 1037
Cdd:cd18011     1 PLPHQIDAVLRALRKPPV--RLLLADEVGLGKTIEAGLIIKELLLRGDAKRVLILCPASLVEQWQDELQDKFGLPFLI-- 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1038 gsqASRRTIQLYEMYFKDPQGRvikgsykFHAIITTFEMILTDcPELR----NIPWRCVVIDEAHRLKNRNC-------K 1106
Cdd:cd18011    77 ---LDRETAAQLRRLIGNPFEE-------FPIVIVSLDLLKRS-EERRglllSEEWDLVVVDEAHKLRNSGGgketkryK 145
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 460838688 1107 LLEGLKMMDlEHKVLLTGTPLQNTVEELFSLLHFLEPSRFpsetTFMQEFGDLKTEEQVqklqaiLKPMMLRR 1179
Cdd:cd18011   146 LGRLLAKRA-RHVLLLTATPHNGKEEDFRALLSLLDPGRF----AVLGRFLRLDGLREV------LAKVLLRR 207
DISARM_DrmD_b NF038318
DISARM system SNF2-like helicase DrmD, short form; DrmD, a SNF2-like helicase, is a component ...
980-1428 7.03e-22

DISARM system SNF2-like helicase DrmD, short form; DrmD, a SNF2-like helicase, is a component of class 1 DISARM (Defence Island System Associated with Restriction Modification), which contains a DNA adenine N6 methyltransferase. This HMM describes a distinct form that is somewhat shorter than the majority of DrmD proteins.


Pssm-ID: 468472 [Multi-domain]  Cd Length: 868  Bit Score: 104.38  E-value: 7.03e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  980 ILADEMGLGKTIQS---ITFLYEIYLKGIhgpfLVIAPLSTIPNWEREFRTWTELNVVVyhgsqASRRTiqlYEMYFKDP 1056
Cdd:NF038318   51 ILADEVGLGKTIEAglvLKYVLESGAKKI----LIILPANLRKQWEIELEEKFDLESLI-----LDSLT---VEKDAKKW 118
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1057 QGRvIKGSYKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKN--RNCKLLEGLkmMDLEH---KVLLTGTPLQNTV 1131
Cdd:NF038318  119 NKR-LTDNKKVRIVITSYDYASKLMKRFPKVKWDFIIIDEAHNLRNvhKGGKRAKNL--YELTKgipKILLTATPLQNSL 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1132 EELFSLLHFLEPSRFPSETTFM------QEFGDLKTEeqvqklqaiLKPMMLRRLKEDVEKNLAPKEETII--------- 1196
Cdd:NF038318  196 LDLYGLVSFIDPRIFGSEKVFSkryikdEDYSDLKRE---------LSPVLYRTLRKDVADYMQFKKRKCItvdfelspd 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1197 EVELTN-----IQKKYYRAI-----------------------------LEKNFTFLSKGGGQANVPNLLNtmmelrkcc 1242
Cdd:NF038318  267 EIELYVrvnnfLKRDILYSIptsnrtliilvirkllasssfalaetfevLKKRLEKLKEGTRSANAQEGFD--------- 337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1243 nhpYLINGAEEKILEEFKETHNAESPDFQLQaMIQAAGKLV--LIDK------------LLPKLKAG---------GHRV 1299
Cdd:NF038318  338 ---LFWSFVEDEIDESGFEEKQDELYTRQKE-FIQHEIDEVdaIIDVakriktnakvtaLKTALEIAfeyqreegiAQKV 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1300 LIFSQMVRCLDILEDYLIQRRYPYERI---DG--------------RVR--GN--------LRQAAIDRFSkpdsDRFVF 1352
Cdd:NF038318  414 VVFTESKRTQKYIAEELRKSGYEGEDIllfNGdfddamtkeiyrawQVKnyGKanygrsveYKHAIVDYFK----NNAKI 489
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688 1353 LLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIYRLI-TRNSYEREMFDKASLKLGLDKAV 1428
Cdd:NF038318  490 LIVTDAGSEGLNLQFCNTVINYDLPWNPQKIEQRIGRCHRYGQKNDVVAINLLnTQNVADKRVYEILSEKFELFEGV 566
HELICc smart00490
helicase superfamily c-terminal domain;
1310-1394 3.63e-21

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 89.96  E-value: 3.63e-21
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   1310 DILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSKPDSDrfvFLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQAR 1389
Cdd:smart00490    1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIK---VLVATDVAERGLDLPGVDLVIIYDLPWSPASYIQRIGR 77

                    ....*
gi 460838688   1390 CHRIG 1394
Cdd:smart00490   78 AGRAG 82
BRK smart00592
domain in transcription and CHROMO domain helicases;
2631-2675 7.08e-19

domain in transcription and CHROMO domain helicases;


Pssm-ID: 197800  Cd Length: 45  Bit Score: 82.01  E-value: 7.08e-19
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 460838688   2631 TGEERVPVVNKRNGKKMGGAMAPPMKDLPRWLEENPEFAVAPDWT 2675
Cdd:smart00592    1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVAPRSA 45
BRK pfam07533
BRK domain; The function of this domain is unknown. It is often found associated with ...
2630-2673 1.61e-18

BRK domain; The function of this domain is unknown. It is often found associated with helicases and transcription factors.


Pssm-ID: 462196  Cd Length: 44  Bit Score: 81.02  E-value: 1.61e-18
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 460838688  2630 LTGEERVPVVNKRNGKKMGGAMAPPMKDLPRWLEENPEFAVAPD 2673
Cdd:pfam07533    1 LTGDERVPVVNRKTGKKLTGDKAPKLKDLEEWLEENPGYEVDPR 44
DEXHc_ARIP4 cd18069
DEXH-box helicase domain of ARIP4; Androgen receptor-interacting protein 4 (ARIP4, also called ...
979-1156 1.20e-17

DEXH-box helicase domain of ARIP4; Androgen receptor-interacting protein 4 (ARIP4, also called RAD54 like 2 or RAD54L2 ) modulates androgen receptor (AR)-dependent transactivation in a promoter-dependent manner. ARIP4 is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350827 [Multi-domain]  Cd Length: 227  Bit Score: 84.48  E-value: 1.20e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  979 CILADEMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPNWEREFRTWTELNVVVyhgSQASRRTIQLY----EMYFK 1054
Cdd:cd18069    31 CILAHSMGLGKTLQVISFLDVLLRHTGAKTVLAIVPVNTLQNWLSEFNKWLPPPEAL---PNVRPRPFKVFilndEHKTT 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1055 DPQGRVI-KGSYKFHAIITTFEMI-LTDCPELrnipwrcVVIDEAHRLKNRNCKLLEGLKMMDLEHKVLLTGTPLQNTVE 1132
Cdd:cd18069   108 AARAKVIeDWVKDGGVLLMGYEMFrLRPGPDV-------VICDEGHRIKNCHASTSQALKNIRSRRRIVLTGYPLQNNLI 180
                         170       180
                  ....*....|....*....|....
gi 460838688 1133 ELFSLLHFLEPSRFPSETTFMQEF 1156
Cdd:cd18069   181 EYWCMVDFVRPDFLGTRQEFSNMF 204
DEXHc_TTF2 cd18072
DEAH-box helicase domain of TTF2; Transcription termination factor 2 (TTF2 also called ...
958-1179 7.05e-17

DEAH-box helicase domain of TTF2; Transcription termination factor 2 (TTF2 also called Forkhead-box E1/FOXE1 ) is a transcription termination factor that couples ATP hydrolysis with the removal of RNA polymerase II from the DNA template. Single nucleotide polymorphism (SNP) within the 5'-UTR of TTF2 is associated with thyroid cancer risk.TTF2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350830 [Multi-domain]  Cd Length: 241  Bit Score: 82.91  E-value: 7.05e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLFNW-YNMRNCILADEMGLGKTIQSITF------------------LYEIYLKGIHGPF-----LVIA 1013
Cdd:cd18072     1 LLLHQKQALAWLLWRErQKPRGGILADDMGLGKTLTMIALilaqkntqnrkeeekekaLTEWESKKDSTLVpsagtLVVC 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1014 PLSTIPNWEREFRTWT---ELNVVVYHGsqASRRTIqlyemyfkdpqGRVIKgsyKFHAIITTFEMILTDCPELRN---- 1086
Cdd:cd18072    81 PASLVHQWKNEVESRVasnKLRVCLYHG--PNRERI-----------GEVLR---DYDIVITTYSLVAKEIPTYKEesrs 144
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1087 -----IPWRCVVIDEAHRLKNRN-------CKLLEGLKMMdlehkvlLTGTPLQNTVEELFSLLHFLEPSRFPSETTFMQ 1154
Cdd:cd18072   145 splfrIAWARIILDEAHNIKNPKvqasiavCKLRAHARWA-------LTGTPIQNNLLDMYSLLKFLRCSPFDDLKVWKK 217
                         250       260
                  ....*....|....*....|....*
gi 460838688 1155 EFgDLKTEEQVQKLQAILKPMMLRR 1179
Cdd:cd18072   218 QV-DNKSRKGGERLNILTKSLLLRR 241
CD2_tandem cd18659
repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier ...
870-924 1.93e-16

repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD1 to CHD9, and yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349306 [Multi-domain]  Cd Length: 54  Bit Score: 75.30  E-value: 1.93e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 460838688  870 DYVEVDRIMdfARSTDDRGepVIHYLVKWCSLPYEDSTWELKQDI---DQTKIEEFEK 924
Cdd:cd18659     1 EYTIVERII--AHREDDEG--VTEYLVKWKGLPYDECTWESEEDIsdiFQEAIDEYKK 54
BRK pfam07533
BRK domain; The function of this domain is unknown. It is often found associated with ...
2552-2593 3.71e-14

BRK domain; The function of this domain is unknown. It is often found associated with helicases and transcription factors.


Pssm-ID: 462196  Cd Length: 44  Bit Score: 68.69  E-value: 3.71e-14
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 460838688  2552 LDPDTRIPVINLEDGTRLVGEDAPKNKDLVDWLKLHPTYTVD 2593
Cdd:pfam07533    1 LTGDERVPVVNRKTGKKLTGDKAPKLKDLEEWLEENPGYEVD 42
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
23-553 2.00e-12

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 73.44  E-value: 2.00e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    23 GLGECGY-PENPVNPMGQQMP-----IDQGFPSLQPSL----HHPSPNQNQTKLTHFDHYSQYEQKMHLMDQPNRMMGSA 92
Cdd:pfam03157  159 GQGQQGYyPTSPQQSGQRQQPgqgqqLRQGQQGQQSGQgqpgYYPTSSQQPGQLQQTGQGQQGQQPERGQQGQQPGQGQQ 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    93 PGNGlasphSQYHTPPVPQVPhgggGGGQMGVYPGI-QNERHGQSFVDGGSMWGPRAVQVPDQirapyqqqqpQPAPSGP 171
Cdd:pfam03157  239 PGQG-----QQGQQPGQPQQL----GQGQQGYYPISpQQPRQWQQSGQGQQGYYPTSLQQPGQ----------GQSGYYP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   172 PAQGHPQHMQQMGSYLARGDFSMQQHGQPQQRMGQFSQGQEGLSQGSPfiATSGPGHLSHMPQQspsmapslrhPVQQQF 251
Cdd:pfam03157  300 TSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPGQGQQGQQPAQGQQP--GQGQPGYYPTSPQQ----------PGQGQP 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   252 HHHPAALHGESVAHSPRFSPNPPQQG----AVRPQTLNFSSRNQT---VPSPTVNNSGQYSRYPYSNLNQGLVNSTGMNQ 324
Cdd:pfam03157  368 GYYPTSQQQPQQGQQPEQGQQGQQQGqgqqGQQPGQGQQPGQGQPgyyPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQ 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   325 NLGltNSTPMNQSVPrypnavgfpsnsGQGLVHQQPihssgslnQMNTQTMHPSQPQGTYASPPPMSPMKamsnpaGTPP 404
Cdd:pfam03157  448 QPG--QEQPGQGQQP------------GQGQQGQQP--------GQPEQGQQPGQGQPGYYPTSPQQSGQ------GQQL 499
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   405 PQVRPGSAGMPmevGSYPNMPHPQpshqppgamGIGQRNMGPRNMQQPRSFMGMSSAPRELTGHMRPNGCPGVGLADPQA 484
Cdd:pfam03157  500 GQWQQQGQGQP---GYYPTSPLQP---------GQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQ 567
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   485 IQERLIPGQQHPGQQPSFQQLPTcpplQPHPGLHQSSPPHP-HHQPWAQLHPSPQNTPQKVPVHQHSPSE 553
Cdd:pfam03157  568 GQQGQQPGQGQQGQQPGQGQQPG----QGQPGYYPTSPQQSgQGQQPGQWQQPGQGQPGYYPTSSLQLGQ 633
BRK smart00592
domain in transcription and CHROMO domain helicases;
2553-2593 7.35e-12

domain in transcription and CHROMO domain helicases;


Pssm-ID: 197800  Cd Length: 45  Bit Score: 61.98  E-value: 7.35e-12
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 460838688   2553 DPDTRIPVINLEDGTRLVGEDAPKNKDLVDWLKLHPTYTVD 2593
Cdd:smart00592    1 DGEERVPVINRETGKKLTGDDAPKAKDLERWLEENPEYEVA 41
DEXQc_bact_SNF2 cd18013
DEXQ-box helicase domain of bacterial SNF2 family proteins; Proteins belonging to the SNF2 ...
958-1142 1.27e-11

DEXQ-box helicase domain of bacterial SNF2 family proteins; Proteins belonging to the SNF2 family of DNA dependent ATPases are important members of the chromatin remodeling complexes that are implicated in epigenetic control of gene expression. The Snf2 family comprise a large group of ATP-hydrolyzing proteins that are ubiquitous in eukaryotes, but also present in eubacteria and archaea. The bacterial SNF2 present in this family are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350771 [Multi-domain]  Cd Length: 218  Bit Score: 66.60  E-value: 1.27e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLfnwYNMRNCILADeMGLGKTIQSITFLYEIYLKGIHGPFLVIAPLSTIPN-WEREFRTWTELN---V 1033
Cdd:cd18013     1 PHPYQKVAINFII---EHPYCGLFLD-MGLGKTVTTLTALSDLQLDDFTRRVLVIAPLRVARStWPDEVEKWNHLRnltV 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1034 VVYHGSQASRRTI-----QLYemyfkdpqgrvikgsykfhaiITTFEMiLTDCPELRNIPW--RCVVIDEAHRLKNRNCK 1106
Cdd:cd18013    77 SVAVGTERQRSKAantpaDLY---------------------VINREN-LKWLVNKSGDPWpfDMVVIDELSSFKSPRSK 134
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 460838688 1107 LLEGLKMMD--LEHKVLLTGTPLQNTVEELFSLLHFLE 1142
Cdd:cd18013   135 RFKALRKVRpvIKRLIGLTGTPSPNGLMDLWAQIALLD 172
CD1_tandem cd18660
repeat 1 of paired tandem chromodomains; Repeat 1 of tandem CHRomatin Organization Modifier ...
788-850 1.42e-11

repeat 1 of paired tandem chromodomains; Repeat 1 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD1 to CHD9, and yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349307 [Multi-domain]  Cd Length: 70  Bit Score: 61.99  E-value: 1.42e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  788 EGPVVEKIMSSRLVKKQKESGEEVEVEE-------FYVKYKNFSYLHCQWASVEDLEKDkRIQQKIKRFK 850
Cdd:cd18660     1 DEDKIEKILDHRPKGPVEEASLDLTDPDepwdereFLVKWKGKSYLHCTWVTEETLEQL-RGKKKLKNYI 69
DEXQc_SHPRH cd18070
DEXQ-box helicase domain of SHPRH; E3 ubiquitin-protein ligase SHPRH is a ubiquitously ...
958-1146 2.78e-11

DEXQ-box helicase domain of SHPRH; E3 ubiquitin-protein ligase SHPRH is a ubiquitously expressed protein that contains motifs characteristic of several DNA repair proteins, transcription factors, and helicases. SHPRH is a functional homolog of S. cerevisiae RAD5 and is involved in DNA repair. SHPRH is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350828 [Multi-domain]  Cd Length: 257  Bit Score: 66.60  E-value: 2.78e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  958 LREYQLEGVNWLLfnwynMRNCILADEMGLGKTIQSI----------------------TFLYEIYLKGIH----GPFLV 1011
Cdd:cd18070     1 LLPYQRRAVNWML-----VPGGILADEMGLGKTVEVLalillhprpdndldaadddsdeMVCCPDCLVAETpvssKATLI 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1012 IAPLSTIPNWEREFRTWTELNVVVYHgsqasrrtiqlYEMYFKDPQgrVIKGSYKFHA----IITTFEMILTD------- 1080
Cdd:cd18070    76 VCPSAILAQWLDEINRHVPSSLKVLT-----------YQGVKKDGA--LASPAPEILAeydiVVTTYDVLRTElhyaean 142
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1081 ----------------CPELRnIPWRCVVIDEAHRLKNRNCKLLEGLKMMDLEHKVLLTGTPLQNTVEELFSLLHFLEPS 1144
Cdd:cd18070   143 rsnrrrrrqkryeappSPLVL-VEWWRVCLDEAQMVESSTSKAAEMARRLPRVNRWCVSGTPIQRGLDDLFGLLSFLGVE 221

                  ..
gi 460838688 1145 RF 1146
Cdd:cd18070   222 PF 223
Chromo pfam00385
Chromo (CHRromatin organization MOdifier) domain;
873-924 1.53e-10

Chromo (CHRromatin organization MOdifier) domain;


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 58.74  E-value: 1.53e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 460838688   873 EVDRIMDfaRSTDDRGEPviHYLVKWCSLPYEDSTWELKQDID--QTKIEEFEK 924
Cdd:pfam00385    2 EVERILD--HRKDKGGKE--EYLVKWKGYPYDENTWEPEENLSkcPELIEEFKD 51
SSL2 COG1061
Superfamily II DNA or RNA helicase [Transcription, Replication, recombination, and repair];
904-1478 4.11e-10

Superfamily II DNA or RNA helicase [Transcription, Replication, recombination, and repair];


Pssm-ID: 440681 [Multi-domain]  Cd Length: 566  Bit Score: 65.43  E-value: 4.11e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  904 EDSTWELKQDIDQTKIEEFEKLMSREPETERVERPPADDWKK---SESSREYKNNNKLREYQLEGVN-WLLFNWYNMRNC 979
Cdd:COG1061    24 ERLELSLLRNLVEARRLAIKEGTREDGRRLPEEDTERELAEAealEAGDEASGTSFELRPYQQEALEaLLAALERGGGRG 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  980 ILADEMGLGKTIQSITFLYEIYLKGihgPFLVIAPLSTIPN-WEREFRTWteLNVVVYHGSQasrrtiqlyemyfKDPQG 1058
Cdd:COG1061   104 LVVAPTGTGKTVLALALAAELLRGK---RVLVLVPRRELLEqWAEELRRF--LGDPLAGGGK-------------KDSDA 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1059 RVIkgsykfhaiITTFEmILTDCPELRNIP--WRCVVIDEAHRLKNRncKLLEGLKMMDLEHKVLLTGTPLqntveelfs 1136
Cdd:COG1061   166 PIT---------VATYQ-SLARRAHLDELGdrFGLVIIDEAHHAGAP--SYRRILEAFPAAYRLGLTATPF--------- 224
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1137 llhflepsRFPSETTFMQEFGDLKTEEQVQKLQAilkpmmlrrlkedvEKNLAPKEETIIEVELTNIQKKYyrAILEKNF 1216
Cdd:COG1061   225 --------RSDGREILLFLFDGIVYEYSLKEAIE--------------DGYLAPPEYYGIRVDLTDERAEY--DALSERL 280
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1217 TFLskgggqanvpnllntmmelrkccnhpyLINGAEEK--ILEEFKETHnaespdfqlqamiqaagklvlidkllpklkA 1294
Cdd:COG1061   281 REA---------------------------LAADAERKdkILRELLREH------------------------------P 303
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1295 GGHRVLIFSQMVRCLDILEDYLIQRRYPYERIDGRVRGNLRQAAIDRFSkpdSDRFVFLLCTRAGGLGINLTAADTCIIF 1374
Cdd:COG1061   304 DDRKTLVFCSSVDHAEALAELLNEAGIRAAVVTGDTPKKEREEILEAFR---DGELRILVTVDVLNEGVDVPRLDVAILL 380
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1375 DSDWNPQNDLQAQARCHRIGQSKS-VKIYRLITRNSYEREMFDKASLKLGLDKAVLQSMSGRENATNGVQQLSKKEIEDL 1453
Cdd:COG1061   381 RPTGSPREFIQRLGRGLRPAPGKEdALVYDFVGNDVPVLEELAKDLRDLAGYRVEFLDEEESEELALLIAVKPALEVKGE 460
                         570       580
                  ....*....|....*....|....*
gi 460838688 1454 LRKGAYGALMDEEDEGSKFCEEDID 1478
Cdd:COG1061   461 LEEELLEELELLEDALLLVLAELLL 485
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
193-569 7.97e-10

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 64.64  E-value: 7.97e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   193 SMQQHGQPQQRMGQFSQG--------QEGLSQGS--PFIATSGPGHLSHMPQQ--SPSMAPSLrhpvQQQFHHHPAALHG 260
Cdd:pfam09606   61 QQPQGGQGNGGMGGGQQGmpdpinalQNLAGQGTrpQMMGPMGPGPGGPMGQQmgGPGTASNL----LASLGRPQMPMGG 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   261 ESVAHsprfspnpPQQGAVRPQTLNFSSRNQTVPSPTVNNSGQYSRYPYSNLNQGLVNSTGMNQNLGLTNSTPMNQSVPR 340
Cdd:pfam09606  137 AGFPS--------QMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPG 208
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   341 YPNavgfPSNSGQGLVHQQPIHSsgslnQMNTQTMHPSQPQGTYASPPPMspMKAMSNPAGTPPPQVRPGSAGMPMEVGS 420
Cdd:pfam09606  209 MPG----PADAGAQMGQQAQANG-----GMNPQQMGGAPNQVAMQQQQPQ--QQGQQSQLGMGINQMQQMPQGVGGGAGQ 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   421 YPNMPHPQPSHQPPGA----MGIGQRN----MGPRNMQQPRSFMGMSSAPRELTGHMRPNG--CPGVGLADPQAIQERLI 490
Cdd:pfam09606  278 GGPGQPMGPPGQQPGAmpnvMSIGDQNnyqqQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGqvVALGGLNHLETWNPGNF 357
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   491 PGQQHPGQQPSFQQLPTCPPLQPHPGLHQSSPPHPHHQPWAQLHPSPQ----NTPQKVPvHQHSPSEPFLEKPVPDMTQV 566
Cdd:pfam09606  358 GGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSVPSPQgpgsQPPQSHP-GGMIPSPALIPSPSPQMSQQ 436

                   ...
gi 460838688   567 SAQ 569
Cdd:pfam09606  437 PAQ 439
CD2_tandem_CHD3-4_like cd18662
repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 3 and ...
869-924 9.78e-09

repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 3 and 4, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD3 and CHD4, and yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. Human CHD3 (also named Mi-2 alpha) and CHD4 (also named Mi-2 beta) are coexpressed in many cell lines and tissues and may act as the motor subunit of the NuRD complex (nucleosome remodeling and deacetylase activities). The proteins form distinct CHD3- and CHD4-NuRD complexes that repress, as well as activate gene transcription of overlapping and specific target genes. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349309 [Multi-domain]  Cd Length: 55  Bit Score: 53.42  E-value: 9.78e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 460838688  869 PDYVEVDRIMDfaRSTDDRGEpvIHYLVKWCSLPYEDSTWELkqdiDQTKIEEFEK 924
Cdd:cd18662     1 PEWLQIHRIIN--HRVDKDGN--TWYLVKWRDLPYDQSTWES----EDDDIPDYEK 48
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
79-580 1.06e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 61.18  E-value: 1.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    79 MHLMDQPNRMMGSAPGNGLASPHSQYHTPPVpqvphgggGGGQMGVYPGIQNERHGQSFVDGGsmwGPRAVQVPDQIRAP 158
Cdd:pfam09606   53 MSKKAAQQQQPQGGQGNGGMGGGQQGMPDPI--------NALQNLAGQGTRPQMMGPMGPGPG---GPMGQQMGGPGTAS 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   159 YQQQQPQPAPSGPPAQGHPQHMQQMGSylargdfsMQQHGQPQQRMGQFSQGQEGLSQGSPFIATSGPGHLSHMPQQSPS 238
Cdd:pfam09606  122 NLLASLGRPQMPMGGAGFPSQMSRVGR--------MQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQ 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   239 MAPSLRHPVQ--QQFHHHPAALHGESVAHSPRFSPNPPQQGAVRPqtlNFSSRNQTVPSPTVNNSgqysrypysnlnqgl 316
Cdd:pfam09606  194 GPMGGQMPPQmgVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAP---NQVAMQQQQPQQQGQQS--------------- 255
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   317 VNSTGMNQNLGLTNSTPMNQSVPRYPNAVGFPSNSGQGLVHQQPIHSSGSLNQMntQTMHPSQPQGtyASPPPMSPMKAM 396
Cdd:pfam09606  256 QLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYQQQ--QTRQQQQQQG--GNHPAAHQQQMN 331
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   397 SnpagtpppQVRPGSAGMPMEVGsypnMPHPQPSHQPPGAMGIGQRNMGPRNMqqprsfMGMSS-----APRELTGH--M 469
Cdd:pfam09606  332 Q--------SVGQGGQVVALGGL----NHLETWNPGNFGGLGANPMQRGQPGM------MSSPSpvpgqQVRQVTPNqfM 393
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   470 RPNGCPGVgladpqaiqerliPGQQHPGQQPSfqqlptcpplQPHPGlhQSSPPhPHHQPwaqlHPSPQntPQKVPVHQH 549
Cdd:pfam09606  394 RQSPQPSV-------------PSPQGPGSQPP----------QSHPG--GMIPS-PALIP----SPSPQ--MSQQPAQQR 441
                          490       500       510
                   ....*....|....*....|....*....|.
gi 460838688   550 SPSEPFLEKPVPDMTQVSAqNAQLVKSDDYL 580
Cdd:pfam09606  442 TIGQDSPGGSLNTPGQSAV-NSPLNPQEEQL 471
Chromo pfam00385
Chromo (CHRromatin organization MOdifier) domain;
791-852 1.37e-08

Chromo (CHRromatin organization MOdifier) domain;


Pssm-ID: 459793 [Multi-domain]  Cd Length: 52  Bit Score: 52.96  E-value: 1.37e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688   791 VVEKIMSSRLVKKQKEsgeeveveEFYVKYKNFSYLHCQWASVEDLEKDKRIqqkIKRFKSK 852
Cdd:pfam00385    2 EVERILDHRKDKGGKE--------EYLVKWKGYPYDENTWEPEENLSKCPEL---IEEFKDR 52
PHA03378 PHA03378
EBNA-3B; Provisional
222-545 1.07e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 57.77  E-value: 1.07e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  222 ATSGPGHLSHMPQQSPSmapslrhPVQQQFHHHPAALHGESVAHSPRFSPNPPQQGAVRPQTLNFSSRNQTVPSPtvnns 301
Cdd:PHA03378  554 ASTEPVHDQLLPAPGLG-------PLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAP----- 621
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  302 gqysRYPYSNLNQGLVNSTGMNqnlgltnSTPMNQSVPRYPNAVGFPSNSGQGLVHQQPIHSSGSLNQMNTQTMHPSQPQ 381
Cdd:PHA03378  622 ----RQWPMPLRPIPMRPLRMQ-------PITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWA 690
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  382 GTYASPPPMSPMKAmsnpagtPPPQVRPGSagmpmevgsypnmphpqpshqppgamgigqrnmgprnMQQPRSFMGMSSA 461
Cdd:PHA03378  691 PGTMQPPPRAPTPM-------RPPAAPPGR-------------------------------------AQRPAAATGRARP 726
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  462 PRELTGHMRPNGCPGVGLADPQAIQERLIPGQQHPGQQPSFQQLPTCPPLQPHPglhqSSPPHPHHQPwaQLHPSPQNTP 541
Cdd:PHA03378  727 PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPP----QAPPAPQQRP--RGAPTPQPPP 800

                  ....
gi 460838688  542 QKVP 545
Cdd:PHA03378  801 QAGP 804
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
16-468 1.59e-07

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 57.33  E-value: 1.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    16 LFSEGLEGLGECGYPENPVNPMGQQMPIDQGFPSLQPSLHHPSpnqnqtklthfdhysqyeQKMHLMDQPNRMMG---SA 92
Cdd:pfam09606   89 LAGQGTRPQMMGPMGPGPGGPMGQQMGGPGTASNLLASLGRPQ------------------MPMGGAGFPSQMSRvgrMQ 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    93 PGNGLASPHSQYHTPPVPQVPHggggggQMGvypGIQNERHGQSFVDGGSMWGPRAVQVPDQIRAPYQQQQPQPAPSGPP 172
Cdd:pfam09606  151 PGGQAGGMMQPSSGQPGSGTPN------QMG---PNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQMGQ 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   173 AQG-----HPQHMQQMGSYLARGDFSMQQHGQPQQ------RMGQFSQGQEGLS-QGspfiatsGPGHLSHMPQQSPSMA 240
Cdd:pfam09606  222 QAQanggmNPQQMGGAPNQVAMQQQQPQQQGQQSQlgmginQMQQMPQGVGGGAgQG-------GPGQPMGPPGQQPGAM 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   241 PSLRHPVQQQfhhhpaalhgesVAHSPRFSPNPPQQGAVRPQTLnfssrnQTVPSPTVNNSGQYSRYPYSNL----NQGl 316
Cdd:pfam09606  295 PNVMSIGDQN------------NYQQQQTRQQQQQQGGNHPAAH------QQQMNQSVGQGGQVVALGGLNHletwNPG- 355
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   317 vNSTGMNqnlgltnstpMNQSVPRYPNAVGFPSNSGQGLVHQQpihssgslnQMNTQTMHPSQPQGTYASPPPmspmkam 396
Cdd:pfam09606  356 -NFGGLG----------ANPMQRGQPGMMSSPSPVPGQQVRQV---------TPNQFMRQSPQPSVPSPQGPG------- 408
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 460838688   397 SNPAGTPPPQVRPGSAGMPmevgsypnmphpqpSHQ-PPGAMGIGQRNMGPRNMQQPRSFMGMSSAPRELTGH 468
Cdd:pfam09606  409 SQPPQSHPGGMIPSPALIP--------------SPSpQMSQQPAQQRTIGQDSPGGSLNTPGQSAVNSPLNPQ 467
CD2_tandem_ScCHD1_like cd18664
repeat 2 of the paired tandem chromodomains of yeast chromodomain helicase DNA-binding protein ...
870-928 3.26e-07

repeat 2 of the paired tandem chromodomains of yeast chromodomain helicase DNA-binding protein 1, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349311  Cd Length: 59  Bit Score: 49.58  E-value: 3.26e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 460838688  870 DYVEVDRIMDFARSTDDRGEPVIHYLVKWCSLPYEDSTWELKQDIDQTKIEEFEKLMSR 928
Cdd:cd18664     1 EFHVVERIIASQRASLEDGTSQLQYLVKWRRLNYDECTWEDATLIAKLAPEQVDHFQNR 59
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
2-305 3.62e-07

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 56.17  E-value: 3.62e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688     2 ADPGMMSLFGEDGSLFSEGLEGLGECGYPENPVNPMGQQMPIDQGFPS--LQPSLHHPSPNQNQtklthfdhysqyeqkM 79
Cdd:pfam09606  155 AGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPqmGVPGMPGPADAGAQ---------------M 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    80 HLMDQPNRMMGSAPGNGLASPHSQYHTPPVPQVphgggGGGQMGVYPGiQNERHGQSFVDGGSMWGP-RAVQVPDQIRAP 158
Cdd:pfam09606  220 GQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQG-----QQSQLGMGIN-QMQQMPQGVGGGAGQGGPgQPMGPPGQQPGA 293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   159 YQQQQPQPAPSGPPAQGHPQHMQQMGsylarGDFSMQQHGQPQQRMGQFSQGQ--EGLSQGSPFIATS---------GPG 227
Cdd:pfam09606  294 MPNVMSIGDQNNYQQQQTRQQQQQQG-----GNHPAAHQQQMNQSVGQGGQVValGGLNHLETWNPGNfgglganpmQRG 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   228 HLSHMPQQSP-------SMAP--SLRHPVQQQ--FHHHPAALHGESVAH----SPRFSPNPPQQGAVRPQtlnfSSRN-- 290
Cdd:pfam09606  369 QPGMMSSPSPvpgqqvrQVTPnqFMRQSPQPSvpSPQGPGSQPPQSHPGgmipSPALIPSPSPQMSQQPA----QQRTig 444
                          330
                   ....*....|....*
gi 460838688   291 QTVPSPTVNNSGQYS 305
Cdd:pfam09606  445 QDSPGGSLNTPGQSA 459
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
275-564 5.78e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 55.54  E-value: 5.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   275 QQGAVRPQTLNFSSRNQT--VPSPTVNNSGQYSRYPYSNLN-QGLV----NSTGMNQNLGLTNSTPMNQSVPRyPNAVGF 347
Cdd:pfam03154  128 DEGSSDPKDIDQDNRSTSpsIPSPQDNESDSDSSAQQQILQtQPPVlqaqSGAASPPSPPPPGTTQAATAGPT-PSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   348 PSN----SGQGLVHQQPIHSSGSLNQmNTQTMHPSQpqgtyaSPPPMSPMKAMSNPAgtPPPQVRPGSAGMPmevgsypn 423
Cdd:pfam03154  207 PPQgspaTSQPPNQTQSTAAPHTLIQ-QTPTLHPQR------LPSPHPPLQPMTQPP--PPSQVSPQPLPQP-------- 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   424 mphpqpSHQPPGAMGIGQRNMGPRNMQQPRSFMGMSSAPRELTGHMRPNGCPGVGLADPQAIQERLIPGQQHPGQQPSFQ 503
Cdd:pfam03154  270 ------SLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQ 343
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 460838688   504 QLPTCPPLQPHPGLHQSSPPHPHHQPWAQLHPSPQNTPQKVPVHQHSPSEPFLeKPVPDMT 564
Cdd:pfam03154  344 PLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAL-KPLSSLS 403
CHROMO smart00298
Chromatin organization modifier domain;
791-853 1.12e-06

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 47.59  E-value: 1.12e-06
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 460838688    791 VVEKIMSSRLVKK-QKEsgeeveveeFYVKYKNFSYLHCQWASVEDLEKDKRiqqKIKRFKSKQ 853
Cdd:smart00298    3 EVEKILDHRWKKKgELE---------YLVKWKGYSYSEDTWEPEENLLNCSK---KLDNYKKKE 54
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
22-414 1.49e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.00  E-value: 1.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    22 EGLGECGyPENPVNPMGQQMP--IDQGFPSLQPSLhhPSPNQNQTklthfDHYSQYEQKMhLMDQPNRMMG-----SAPG 94
Cdd:pfam03154  115 EGEGESS-DGRSVNDEGSSDPkdIDQDNRSTSPSI--PSPQDNES-----DSDSSAQQQI-LQTQPPVLQAqsgaaSPPS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688    95 NGLASPHSQYHTPPVPQVPHGGGGGGQMGVYPGIQNERHG--QSFVDGGSMWGPRAVQVPdqiRAPYQQQQPQPAPSGPP 172
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTPTLHPQRLPSP---HPPLQPMTQPPPPSQVS 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   173 AQGHPQ-----HMQQMGSYLARGDFSMQQHGQPQqrmGQFSQGQEGLSQGSPFIATSGPGHLSHMPQQSPSMApslrHPV 247
Cdd:pfam03154  263 PQPLPQpslhgQMPPMPHSLQTGPSHMQHPVPPQ---PFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQS----QLQ 335
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   248 QQQfhhhpaalhgesvahSPRFSPNPPQqgavrPQTLNFSSRNQTVPSPTVNNSGQYSRYPYSN------LNQGLVNSTG 321
Cdd:pfam03154  336 SQQ---------------PPREQPLPPA-----PLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSgpspfqMNSNLPPPPA 395
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   322 MNQNLGLTNSTPMNQSVPR---YPNAVGFPSNSGQGLV-HQQPIHSSGSLNQMNTQTMHPSQPQGTYASPPPMSPMKAMS 397
Cdd:pfam03154  396 LKPLSSLSTHHPPSAHPPPlqlMPQSQQLPPPPAQPPVlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPI 475
                          410
                   ....*....|....*..
gi 460838688   398 NPAGTPPPQVRPGSAGM 414
Cdd:pfam03154  476 TPPSGPPTSTSSAMPGI 492
SF2-N cd00046
N-terminal DEAD/H-box helicase domain of superfamily 2 helicases; The DEAD/H-like superfamily ...
978-1125 4.07e-06

N-terminal DEAD/H-box helicase domain of superfamily 2 helicases; The DEAD/H-like superfamily 2 helicases comprise a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This N-terminal domain contains the ATP-binding region.


Pssm-ID: 350668 [Multi-domain]  Cd Length: 146  Bit Score: 48.94  E-value: 4.07e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  978 NCILADEMGLGKTIQSITFLYEiYLKGIHGPFLVIAPLSTI-PNWEREFRTWTELNVVV---YHGSQASRRTiqlyemyf 1053
Cdd:cd00046     3 NVLITAPTGSGKTLAALLAALL-LLLKKGKKVLVLVPTKALaLQTAERLRELFGPGIRVavlVGGSSAEERE-------- 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688 1054 kdpqgRVIKGSYKFhaIITTFEMILTDCPELRNI---PWRCVVIDEAHRLKN-----RNCKLLEGLKMMDLEHKVLLTGT 1125
Cdd:cd00046    74 -----KNKLGDADI--IIATPDMLLNLLLREDRLflkDLKLIIVDEAHALLIdsrgaLILDLAVRKAGLKNAQVILLSAT 146
PHA03378 PHA03378
EBNA-3B; Provisional
387-554 1.14e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.22  E-value: 1.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  387 PPPMSPMKAMS---NPAGTPPPQVRPGSAGMPMEvGSYPNMPHPQPSHQPPGAMGIGQRNMGPRNMQQPRSFMGMSSAPR 463
Cdd:PHA03378  630 PIPMRPLRMQPitfNVLVFPTPHQPPQVEITPYK-PTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPA 708
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  464 ELTGHMRPNGCPGVGLADPQAIQERLIPGQQHPGQQPSFQQLPT-CPPLQPHPGLH---QSSPPHPHHQPWAQLHPSPQN 539
Cdd:PHA03378  709 APPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGrARPPAAAPGRArppAAAPGAPTPQPPPQAPPAPQQ 788
                         170
                  ....*....|....*
gi 460838688  540 TPQKVPVHQHSPSEP 554
Cdd:PHA03378  789 RPRGAPTPQPPPQAG 803
CHROMO smart00298
Chromatin organization modifier domain;
873-924 1.83e-05

Chromatin organization modifier domain;


Pssm-ID: 214605 [Multi-domain]  Cd Length: 55  Bit Score: 44.51  E-value: 1.83e-05
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....
gi 460838688    873 EVDRIMDfARSTDDRGEpviHYLVKWCSLPYEDSTWELKQDI--DQTKIEEFEK 924
Cdd:smart00298    3 EVEKILD-HRWKKKGEL---EYLVKWKGYSYSEDTWEPEENLlnCSKKLDNYKK 52
CD_CSD cd00024
CHROMO (CHRromatin Organization Modifier) domains and chromo shadow domains; Members of this ...
873-924 2.92e-05

CHROMO (CHRromatin Organization Modifier) domains and chromo shadow domains; Members of this group are chromodomains or chromo shadow domains; these are SH3-fold-beta-barrel domains of the chromo-like superfamily. Chromodomains lack the first strand of the SH3-fold-beta-barrel, this first strand is altered by insertion in the chromo shadow domains. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. Chromodomain-containing proteins include: i) those having an N-terminal chromodomain followed by a related chromo shadow domain, such as Drosophila and human heterochromatin protein Su(var)205 (HP1), and mammalian modifier 1 and 2; ii) those having a single chromodomain, such as Drosophila protein Polycomb (Pc), mammalian modifier 3, human Mi-2 autoantigen, and several yeast and Caenorhabditis elegans proteins of unknown function; iii) those having paired tandem chromodomains, such as mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1; (iv) and elongation factor eEF3, a member of the ATP-binding cassette (ABC) family of proteins, that serves an essential function in the translation cycle of fungi. eEF3 is a soluble factor lacking a transmembrane domain and having two ABC domains arranged in tandem, with a unique chromodomain inserted within the ABC2 domain.


Pssm-ID: 349274 [Multi-domain]  Cd Length: 50  Bit Score: 43.62  E-value: 2.92e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 460838688  873 EVDRIMDfarSTDDRGEpvIHYLVKWCSLPYEDSTWELKQDIDQTK--IEEFEK 924
Cdd:cd00024     2 EVEKILD---HRVRKGK--LEYLVKWKGYPPEENTWEPEENLTNAPelIKEYEK 50
PHA03247 PHA03247
large tegument protein UL36; Provisional
265-566 4.98e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.55  E-value: 4.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  265 HSPRFSPNPPQQGAVR--PQTLNFSSRNQTVPSPTVNNSGQYSryPYSNLNQGLVNSTGMNQnlglTNSTPMNQSVPRYP 342
Cdd:PHA03247 2702 PPPPPTPEPAPHALVSatPLPPGPAAARQASPALPAAPAPPAV--PAGPATPGGPARPARPP----TTAGPPAPAPPAAP 2775
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  343 NAVGFPSnsgqglvhqQPIHSSGSLNQMNTQTMHPSQPQG-TYASPPPMSPMKAMSNPAGTPPPQVRPGSAGMPMEVGSY 421
Cdd:PHA03247 2776 AAGPPRR---------LTRPAVASLSESRESLPSPWDPADpPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPP 2846
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  422 PnmphpqPSHQPPGAMGIGqrnmGP-RNMQQPRSFMGMSSAPReltghmRPngcPGVGLADPQAIQER----LIPGQQHP 496
Cdd:PHA03247 2847 P------PSLPLGGSVAPG----GDvRRRPPSRSPAAKPAAPA------RP---PVRRLARPAVSRSTesfaLPPDQPER 2907
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  497 GQQPSFQQLPTCPPLQPHPGLHQSSPPHPHhQPWAQLHPSPQntPQKVPVHQHSPSEPFLEKPVPDMTQV 566
Cdd:PHA03247 2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-RPQPPLAPTTD--PAGAGEPSGAVPQPWLGALVPGRVAV 2974
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
348-572 1.82e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   348 PSNSGQGLVHQQPIHSSGSLNQMNTQTMHPSQPQGT----YASPPPMSPMKAMSNPAGTPPPQvRPGSAGMPmEVGSYPN 423
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekYKEPEPIPDLQVDASLWGVAPKK-AAAPAPAP-QPAAQPA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   424 mphpqpshqppGAMGIGQRNMgprNMQQPRSFMGMSSAPRELTGHMRPNGCPGVGLADPQAIQER---LIPGQQHPGQQP 500
Cdd:pfam09770  185 -----------SLPAPSRKMM---SLEEVEAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQfppQIQQQQQPQQQP 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   501 SFQQLPTCPPLQPH----PGLHQSSPPHPHHQPWAQLHPSPQNTPQKVPVHQ----HSPSEPFLEKPVPDMTQVSAQNAQ 572
Cdd:pfam09770  251 QQPQQHPGQGHPVTilqrPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQIlqnpNRLSAARVGYPQNPQPGVQPAPAH 330
DEAD pfam00270
DEAD/DEAH box helicase; Members of this family include the DEAD and DEAH box helicases. ...
959-1134 2.22e-04

DEAD/DEAH box helicase; Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.


Pssm-ID: 425570 [Multi-domain]  Cd Length: 165  Bit Score: 44.16  E-value: 2.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   959 REYQLEGVNWLLFNwynmRNCILADEMGLGKT-IQSITFLYEIYLKGIHGPFLVIAPLST-IPNWEREFRTW---TELNV 1033
Cdd:pfam00270    1 TPIQAEAIPAILEG----RDVLVQAPTGSGKTlAFLLPALEALDKLDNGPQALVLAPTRElAEQIYEELKKLgkgLGLKV 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  1034 VVYHGSQAsrrtiqlyemyfKDPQGRVIKGSykfHAIITTFEMILTDCPE---LRNIpwRCVVIDEAHRL--KNRNCKLL 1108
Cdd:pfam00270   77 ASLLGGDS------------RKEQLEKLKGP---DILVGTPGRLLDLLQErklLKNL--KLLVLDEAHRLldMGFGPDLE 139
                          170       180
                   ....*....|....*....|....*..
gi 460838688  1109 EGLKMMDLEHK-VLLTGTPLQNtVEEL 1134
Cdd:pfam00270  140 EILRRLPKKRQiLLLSATLPRN-LEDL 165
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
491-552 3.52e-04

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 45.42  E-value: 3.52e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 460838688  491 PGQQHPGQQPSFQQLPTCPPLQPHPGLHQSSPPHPHHQPWAQLHPSPQNTPQkvpvHQHSPS 552
Cdd:cd22056   220 AFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQYQYMNAPYPPHY----AHQGAP 277
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
440-556 4.34e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.95  E-value: 4.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   440 GQRNMGPRNMQQPRSFMGMSSAP--RELTGHMRPNGCPGVGLADPQaiqerlIPGQQHPGQQPSFQQLPT-----CPPLQ 512
Cdd:TIGR01628  376 MQLQPRMRQLPMGSPMGGAMGQPpyYGQGPQQQFNGQPLGWPRMSM------MPTPMGPGGPLRPNGLAPmnavrAPSRN 449
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 460838688   513 PHPGLHQSSPPHPHHQPWAQLHPSPQNTPQKVPVHQHSPSEPFL 556
Cdd:TIGR01628  450 AQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKL 493
SF2_C cd18785
C-terminal helicase domain of superfamily 2 DEAD/H-box helicases; Superfamily (SF)2 helicases ...
1347-1402 8.53e-04

C-terminal helicase domain of superfamily 2 DEAD/H-box helicases; Superfamily (SF)2 helicases include DEAD-box helicases, UvrB, RecG, Ski2, Sucrose Non-Fermenting (SNF) family helicases, and dicer proteins, among others. Similar to SF1 helicases, they do not form toroidal structures like SF3-6 helicases. SF2 helicases are a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Their helicase core is surrounded by C- and N-terminal domains with specific functions such as nucleases, RNA or DNA binding domains, or domains engaged in protein-protein interactions. The core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350172 [Multi-domain]  Cd Length: 77  Bit Score: 40.38  E-value: 8.53e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 460838688 1347 SDRFVFLLCTRAGGLGINLTAADTCIIFDSDWNPQNDLQAQARCHRIGQSKSVKIY 1402
Cdd:cd18785    20 ASSLEILVATNVLGEGIDVPSLDTVIFFDPPSSAASYIQRVGRAGRGGKDEGEVIL 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
225-561 1.13e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 1.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  225 GPGHLSHMPQQSPSMAPSlrhPVQQQFHHHPAALHGESVAHSPRFSPNPPqqgAVRPQTLNFSSRNQTVPSPTVNNSGQY 304
Cdd:PHA03247 2640 PHPPPTVPPPERPRDDPA---PGRVSRPRRARRLGRAAQASSPPQRPRRR---AARPTVGSLTSLADPPPPPPTPEPAPH 2713
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  305 SRYPYSNLNQGLVNSTGMNQNLGLTNSTPMNQSVPRYPNAVGFPSNsgqglvhqqpihssgslnqmntqtmhPSQPQGTY 384
Cdd:PHA03247 2714 ALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR--------------------------PPTTAGPP 2767
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  385 ASPPPMSPmkAMSNPAGTPPPQVRPGSAGMPMEVGSYPNMPHPQPSHQPPGAMGIGQRNMGPrnmqQPRSFMGMSSAPRE 464
Cdd:PHA03247 2768 APAPPAAP--AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP----LPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  465 LTGHMRPNGCPGVGLAdPQAIQERLIPgqqhPGQQPSFQQLPTCPPLQ--PHPGLHQSSPPHPhhQPWAQLHPSPQNTPQ 542
Cdd:PHA03247 2842 PPGPPPPSLPLGGSVA-PGGDVRRRPP----SRSPAAKPAAPARPPVRrlARPAVSRSTESFA--LPPDQPERPPQPQAP 2914
                         330
                  ....*....|....*....
gi 460838688  543 KVPVHQHSPSEPFLEKPVP 561
Cdd:PHA03247 2915 PPPQPQPQPPPPPQPQPPP 2933
chromodomain cd18964
CHROMO (CHRromatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain ...
874-924 1.48e-03

CHROMO (CHRromatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349320  Cd Length: 54  Bit Score: 38.85  E-value: 1.48e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 460838688  874 VDRIMDfARSTDDRGEPVIHYLVKWCSLPYEDSTWELKQDIDQ--TKIEEFEK 924
Cdd:cd18964     3 VERIIG-RRPSARDGPGKFLWLVKWDGYPIEDATWEPPENLGEhaKLIEDFEK 54
CD2_tandem_CHD1-2_like cd18661
repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 1 and ...
871-923 3.07e-03

repeat 2 of the paired tandem chromodomains of chromodomain helicase DNA-binding protein 1 and 2, and similar proteins; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD1 and CHD2. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349308  Cd Length: 58  Bit Score: 38.05  E-value: 3.07e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688  871 YVEVDRIMDFARSTD-DRGEPviHYLVKWCSLPYEDSTWELKQDID---QTKIEEFE 923
Cdd:cd18661     2 YQIVERIIAHSPQKSaASGYP--DYLCKWQGLPYSECTWEDGALISkkfQACIDEYH 56
CD_POL_like cd18974
chromodomain of Penicillium solitum protein PENSOL_c198G03123; This subgroup includes the ...
873-924 3.57e-03

chromodomain of Penicillium solitum protein PENSOL_c198G03123; This subgroup includes the CHROMO (CHRromatin Organization Modifier) domain found in Penicillium solitum protein PENSOL_c198G03123 a putative polyprotein from a Ty3/Gypsy long terminal repeat (LTR) retroelement. The pol gene in TY3/gypsy elements generally encodes domains in the following order: an aspartyl protease, a reverse transcriptase, RNase H, and an integrase, here the chromodomain is found at the C-terminus of the integrase domain. The chromodomain, is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349330  Cd Length: 50  Bit Score: 37.84  E-value: 3.57e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 460838688  873 EVDRIMDFARSTDDRgepviHYLVKWCSLPYEDSTWELKQDIDQTK--IEEFEK 924
Cdd:cd18974     2 EVEEIVDEKMIDDEL-----HYLVKWKGWPAEYNQWEPEDDMENAPkaIQSYEK 50
CD2_tandem cd18659
repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier ...
791-851 3.63e-03

repeat 2 of paired tandem chromodomains; Repeat 2 of tandem CHRomatin Organization Modifier (chromo) domains, found in CHD (chromodomain helicase DNA-binding) proteins such as mammalian helicase DNA-binding proteins CHD1 to CHD9, and yeast protein CHD1. The CHD proteins belong to the SNF2 superfamily of ATP-dependent chromatin remodelers and contain two signature motifs: a pair of chromodomains located in the N-terminal region, and the SNF2-like ATPase domain located in the central region of the protein. CHD chromatin remodelers are important regulators of transcription and play critical roles during developmental processes. The N-terminal chromodomains of CHD1 have been shown to guard against sliding hexasomes. Mutations in the chromodomains of mouse CHD1 result in nuclear redistribution, suggesting that the chromodomain is essential for proper association with chromatin; also, deletion of the chromodomains in the Drosophila melanogaster CHD3-4 homolog impaired nucleosome binding, mobilization, and ATPase functions. A chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349306 [Multi-domain]  Cd Length: 54  Bit Score: 37.94  E-value: 3.63e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 460838688  791 VVEKIMSSRLVKKQKEsgeeveveEFYVKYKNFSYLHCQWASVEDLEkdKRIQQKIKRFKS 851
Cdd:cd18659     4 IVERIIAHREDDEGVT--------EYLVKWKGLPYDECTWESEEDIS--DIFQEAIDEYKK 54
CD_MarY1_POL_like cd18975
chromodomain of Tricholoma matsutake polyprotein, and similar proteins; This subgroup includes ...
873-924 3.94e-03

chromodomain of Tricholoma matsutake polyprotein, and similar proteins; This subgroup includes the CHROMO (CHRromatin Organization Modifier) domain found in the polyprotein from the MarY1 Ty3/Gypsy long terminal repeat (LTR) retroelement from the from the Ectomycorrhizal Basidiomycete Tricholoma matsutake. The pol gene in TY3/gypsy elements generally encodes domains in the following order: prt-reverse transcriptase-RNase H-integrase, in marY1 POL the chromodomain is found at the C-terminus of the integrase domain. The chromodomain, is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain.


Pssm-ID: 349331  Cd Length: 49  Bit Score: 37.52  E-value: 3.94e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 460838688  873 EVDRIMDfarSTDDRGEpvIHYLVKWCSLPYEDSTWELKQDIDQTK-IEEFEK 924
Cdd:cd18975     2 EVESILN---SRLHRGK--LQYLIQWKGYPLEEASWELEDNIKNPRlIEEFHK 49
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
216-422 5.39e-03

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 41.13  E-value: 5.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   216 QGSPfiaTSGPGHLSHMPQQSPSMAPSLRHPVQQQFHHHPAALHGESVAHSPRFSPNPPqqgaVRPQTLNFSSRNQTVPS 295
Cdd:pfam15822   28 QGWP---GSNPWNNPSAPPAVPSGLPPSTAPSTVPFGPAPTGMYPSIPLTGPSPGPPAP----FPPSGPSCPPPGGPYPA 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688   296 PTVNNSGQYSRYPysnlnqglvnstgmnqnlgltnstPMNQSVPRYPNAVGFPSNSGQGlvhqQPIHSSGSLNQ------ 369
Cdd:pfam15822  101 PTVPGPGPIGPYP------------------------TPNMPFPELPRPYGAPTDPAAA----APSGPWGSMSSgpwapg 152
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 460838688   370 MNTQTMHPSQP---QGTY-ASPPPMSPMKAMSNPAGTPPPQVRPGSAGMPMEVGSYP 422
Cdd:pfam15822  153 MGGQYPAPNMPypsPGPYpAVPPPQSPGAAPPVPWGTVPPGPWGPPAPYPDPTGSYP 209
CD3_cpSRP43_like cd18628
chromodomain 3 of chloroplast signal recognition particle 43 kDa protein, and similar proteins; ...
870-923 6.23e-03

chromodomain 3 of chloroplast signal recognition particle 43 kDa protein, and similar proteins; This subgroup includes the chromodomain 3 of chloroplast SRP43 (cpSRP43), and similar proteins. CpSRP43 is a component of the chloroplast signal recognition particle (SRP) pathway. It forms a stable complex with cpSRP54 (cpSRP complex) which is required for the efficient posttranslational transport of members of the nuclearly encoded light harvesting chlorophyll-a/b-binding proteins (LHCPs) to the thylakoid membrane. Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromodomain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followed by a related chromo shadow domain.


Pssm-ID: 349278  Cd Length: 51  Bit Score: 36.98  E-value: 6.23e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 460838688  870 DYVEVDRIMDfARSTDdrGEPVIHYLVKWcsLPYEDSTWELKQDIDQTKIEEFE 923
Cdd:cd18628     2 EYAVAESVIG-KRVGD--DGKTIEYLVKW--TDMSDATWEPQDNVDSTLVLLYQ 50
KLF1_2_4_N-like cd22056
N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of ...
190-277 7.23e-03

N-terminal domain of Kruppel-like factors with similarity to the N-terminal domains of Kruppel-like factor (KLF)1, KLF2, and KLF4; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domains of an unknown subfamily of KLFs, predominantly found in fish, related to the N-terminal domains of KLF1, KLF2, and KLF4.


Pssm-ID: 409231 [Multi-domain]  Cd Length: 339  Bit Score: 41.57  E-value: 7.23e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  190 GDFSMQQHGQPQQRMGQFSQGQEGLSQGSPFIATSGPGHLSHMPQQSPSMAPSLRHPVQQQFHHHPAALHGesvAHsprf 269
Cdd:cd22056   201 GGFMGQQKPKHQMHSVHPQAFTHHQAAGPGALQGRGGRGGPDCHLLHSSHHHHHHHHLQYQYMNAPYPPHY---AH---- 273

                  ....*....
gi 460838688  270 sPNPPQ-QG 277
Cdd:cd22056   274 -QGAPQfHG 281
chromodomain cd18965
CHROMO (CHRromatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain ...
894-914 7.78e-03

CHROMO (CHRromatin Organization Modifier) domain; uncharacterized subgroup; The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. A chromodomain may occur as a single instance, in a tandem arrangement, or followed by a related chromo shadow domain. Chromodomains belong to the chromo-like superfamily of SH3-fold-beta-barrel domains which includes chromo shadow domains and chromo barrel domains. Chromodomains differ from these in that they lack the first strand of the SH3-fold-beta-barrel. This first strand is altered by insertion in the chromo shadow domains, and chromo barrel domains are typical SH3-fold-beta-barrel domains with sequence similarity to the canonical chromo domain.


Pssm-ID: 349321  Cd Length: 53  Bit Score: 37.07  E-value: 7.78e-03
                          10        20
                  ....*....|....*....|.
gi 460838688  894 YLVKWCSLPYEDSTWELKQDI 914
Cdd:cd18965    18 YLVKWHGLPESENTWEREKDI 38
CD_CSD cd00024
CHROMO (CHRromatin Organization Modifier) domains and chromo shadow domains; Members of this ...
791-851 9.32e-03

CHROMO (CHRromatin Organization Modifier) domains and chromo shadow domains; Members of this group are chromodomains or chromo shadow domains; these are SH3-fold-beta-barrel domains of the chromo-like superfamily. Chromodomains lack the first strand of the SH3-fold-beta-barrel, this first strand is altered by insertion in the chromo shadow domains. The chromodomain is a conserved region of about 50 amino acids, found in a variety of chromosomal proteins, and which appears to play a role in the functional organization of the eukaryotic nucleus. The chromodomain is implicated in the binding, of the proteins in which it is found, to methylated histone tails and maybe RNA. Chromodomain-containing proteins include: i) those having an N-terminal chromodomain followed by a related chromo shadow domain, such as Drosophila and human heterochromatin protein Su(var)205 (HP1), and mammalian modifier 1 and 2; ii) those having a single chromodomain, such as Drosophila protein Polycomb (Pc), mammalian modifier 3, human Mi-2 autoantigen, and several yeast and Caenorhabditis elegans proteins of unknown function; iii) those having paired tandem chromodomains, such as mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1; (iv) and elongation factor eEF3, a member of the ATP-binding cassette (ABC) family of proteins, that serves an essential function in the translation cycle of fungi. eEF3 is a soluble factor lacking a transmembrane domain and having two ABC domains arranged in tandem, with a unique chromodomain inserted within the ABC2 domain.


Pssm-ID: 349274 [Multi-domain]  Cd Length: 50  Bit Score: 36.69  E-value: 9.32e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 460838688  791 VVEKIMSSRLVKKQKEsgeeveveeFYVKYKNFSYLHCQWASVEDLEKDKRIqqkIKRFKS 851
Cdd:cd00024     2 EVEKILDHRVRKGKLE---------YLVKWKGYPPEENTWEPEENLTNAPEL---IKEYEK 50
PRK10263 PRK10263
DNA translocase FtsK; Provisional
386-554 9.83e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 9.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  386 SPPPMSPMKAMSNP--AGTPPPQVRPGSAGMPMEVGSYPNMPHPQPSHQPPGAmgigqrnmgprNMQQPRSFMGMSSAPR 463
Cdd:PRK10263  343 TPPVASVDVPPAQPtvAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNE-----------PLQQPVQPQQPYYAPA 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 460838688  464 ELTGHMRPNGCPGVGLAD------PQAIQERLIPGQQHPGQQPSFQQLPTCPPLQPHPglhQSSPPHPHHQPWAQLHPSP 537
Cdd:PRK10263  412 AEQPAQQPYYAPAPEQPAqqpyyaPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQ---QPAAQEPLYQQPQPVEQQP 488
                         170
                  ....*....|....*..
gi 460838688  538 QNTPQKVpVHQHSPSEP 554
Cdd:PRK10263  489 VVEPEPV-VEETKPARP 504
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH