|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
364-744 |
2.45e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 143.13 E-value: 2.45e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 364 ALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 442
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 443 DSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319 150 AT-------------------------------------GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 523 FMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDIqMISCGAD 602
Cdd:COG2319 193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSPDGRL-LASGSAD 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 603 KSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVD 682
Cdd:COG2319 268 GTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFS 339
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530416321 683 PSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319 340 PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
492-744 |
1.87e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 1.87e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 492 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 571
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 572 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 651
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 652 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 731
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 530416321 732 TVSGDSCVFIWHL 744
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
42-567 |
2.91e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 110.39 E-value: 2.91e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319 9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319 89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319 165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGlEPSFLFHRKAEAVY 359
Cdd:COG2319 217 --------------------------KLLASGSADGTVRLW-------------------DLATG-KLLRTLTGHSGSVR 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 360 pdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTI 437
Cdd:COG2319 251 ----SVAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTV 312
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 438 RFWNLDSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLR 517
Cdd:COG2319 313 RLWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVR 355
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|
gi 530416321 518 IHELHFMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 567
Cdd:COG2319 356 LWDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
111-441 |
8.33e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 8.33e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNnifcgvacgrgrmaG 267
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HT--------------D 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 268 STFCVSYSGllcqfnekrvlekwinlkvslssclcvSQELIFCGCTDGIVRIFQAHSLHYLANLpKPHYLGV-------- 339
Cdd:cd00200 137 WVNSVAFSP---------------------------DGTFVASSSQDGTIKLWDLRTGKCVATL-TGHTGEVnsvafspd 188
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 340 ----------------DVAQGlEPSFLFHRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfH 403
Cdd:cd00200 189 gekllssssdgtiklwDLSTG-KCLGTLRGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---H 260
|
330 340 350
....*....|....*....|....*....|....*....
gi 530416321 404 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWN 441
Cdd:cd00200 261 TNSVTSL----------AWSPDGKRLaSGSADGTIRIWD 289
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
979-1380 |
3.23e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 62.26 E-value: 3.23e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 979 SSDSPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETlTESPCRELFPAALGDVEAS--------- 1049
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-PQRPRRRAARPTVGSLTSLadppppppt 2707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1050 -EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetl 1128
Cdd:PHA03247 2708 pEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--- 2781
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1129 eawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGP 1203
Cdd:PHA03247 2782 ---RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1204 ivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVT 1283
Cdd:PHA03247 2859 -----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQ 2910
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1284 VPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPA 1358
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
|
410 420
....*....|....*....|....*
gi 530416321 1359 RwsePWVPVEALPPSPL---ELSRV 1380
Cdd:PHA03247 2983 A---PSREAPASSTPPLtghSLSRV 3004
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
704-743 |
1.41e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.46 E-value: 1.41e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 530416321 704 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 743
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1133-1372 |
4.11e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 4.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1210
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1277
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1278 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1346
Cdd:pfam03154 343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
|
250 260
....*....|....*....|....*.
gi 530416321 1347 SGALGLLQGSPArwSEPWVPVEALPP 1372
Cdd:pfam03154 409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
634-726 |
7.13e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 7.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 634 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 712
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 530416321 713 GHSEIITSMKFTYD 726
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
145-185 |
3.11e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.91 E-value: 3.11e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 530416321 145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
| SPS1 |
COG0515 |
Serine/threonine protein kinase [Signal transduction mechanisms]; |
1211-1445 |
8.04e-03 |
|
Serine/threonine protein kinase [Signal transduction mechanisms];
Pssm-ID: 440281 [Multi-domain] Cd Length: 482 Bit Score: 40.38 E-value: 8.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1290
Cdd:COG0515 251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1291 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1370
Cdd:COG0515 326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530416321 1371 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1445
Cdd:COG0515 406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
364-744 |
2.45e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 143.13 E-value: 2.45e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 364 ALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 442
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 443 DSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319 150 AT-------------------------------------GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 523 FMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDIqMISCGAD 602
Cdd:COG2319 193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSPDGRL-LASGSAD 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 603 KSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVD 682
Cdd:COG2319 268 GTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFS 339
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530416321 683 PSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319 340 PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
492-744 |
1.87e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 1.87e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 492 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 571
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 572 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 651
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 652 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 731
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 530416321 732 TVSGDSCVFIWHL 744
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
403-742 |
2.42e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.58 E-value: 2.42e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 403 HSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdr 481
Cdd:cd00200 8 HTGGVTCV----------AFSPDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH---- 50
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 482 gsengtpmdvKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLI 561
Cdd:cd00200 51 ----------TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTI 117
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 562 HVLNVEkNYNLEQTLDDHSSSITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQK 641
Cdd:cd00200 118 KVWDVE-TGKCLTTLRGHTDWVNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGE 190
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 642 YVAVACQDRNVRVYNTVNGKQKKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSM 721
Cdd:cd00200 191 KLLSSSSDGTIKLWDLSTGKCLGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSL 267
|
330 340
....*....|....*....|.
gi 530416321 722 KFTYDCHHLITVSGDSCVFIW 742
Cdd:cd00200 268 AWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
42-567 |
2.91e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 110.39 E-value: 2.91e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319 9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319 89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319 165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGlEPSFLFHRKAEAVY 359
Cdd:COG2319 217 --------------------------KLLASGSADGTVRLW-------------------DLATG-KLLRTLTGHSGSVR 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 360 pdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTI 437
Cdd:COG2319 251 ----SVAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTV 312
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 438 RFWNLDSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLR 517
Cdd:COG2319 313 RLWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVR 355
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|
gi 530416321 518 IHELHFMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 567
Cdd:COG2319 356 LWDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-745 |
8.57e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 105.76 E-value: 8.57e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 480 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 559
Cdd:COG2319 66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 560 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 639
Cdd:COG2319 143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 640 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 719
Cdd:COG2319 216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
|
250 260
....*....|....*....|....*.
gi 530416321 720 SMKFTYDCHHLITVSGDSCVFIWHLG 745
Cdd:COG2319 293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
93-444 |
1.91e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.61 E-value: 1.91e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319 144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319 222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319 287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 331 LpKPHYLGVDvaqglepsflfhrkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWN 409
Cdd:COG2319 326 L-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTS 377
|
330 340 350
....*....|....*....|....*....|....*.
gi 530416321 410 VevypefedqrACLPSGSFL-TCSSDNTIRFWNLDS 444
Cdd:COG2319 378 V----------AFSPDGRTLaSGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-518 |
7.23e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.07 E-value: 7.23e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 2 AAVGSGGYARNDAGEKLPSVMAGVPARRGQSSPPPAPPICLRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPG 81
Cdd:COG2319 11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 82 TGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKNQVAEMLGHKYGVACV 161
Cdd:COG2319 91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 162 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTstvP 239
Cdd:COG2319 169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW--DLATGKLLR---T 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 240 LVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRI 319
Cdd:COG2319 242 LTG--------HSGSVRSVA------------FSPDG-----------------------------RLLASGSADGTVRL 272
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 320 FqahslhylanlpkphylgvDVAQGLEPSFLFHRKAeAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWS 399
Cdd:COG2319 273 W-------------------DLATGELLRTLTGHSG-GVN----SVAFSPDGKLLASGSDDGTVRLWDLAT----GKLLR 324
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 400 EL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmsh 477
Cdd:COG2319 325 TLtGHTGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDLAT--------------------------------- 361
|
490 500 510 520
....*....|....*....|....*....|....*....|.
gi 530416321 478 fpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRI 518
Cdd:COG2319 362 ----GELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
111-441 |
8.33e-22 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 8.33e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNnifcgvacgrgrmaG 267
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HT--------------D 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 268 STFCVSYSGllcqfnekrvlekwinlkvslssclcvSQELIFCGCTDGIVRIFQAHSLHYLANLpKPHYLGV-------- 339
Cdd:cd00200 137 WVNSVAFSP---------------------------DGTFVASSSQDGTIKLWDLRTGKCVATL-TGHTGEVnsvafspd 188
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 340 ----------------DVAQGlEPSFLFHRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfH 403
Cdd:cd00200 189 gekllssssdgtiklwDLSTG-KCLGTLRGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---H 260
|
330 340 350
....*....|....*....|....*....|....*....
gi 530416321 404 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWN 441
Cdd:cd00200 261 TNSVTSL----------AWSPDGKRLaSGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
153-563 |
4.44e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 80.46 E-value: 4.44e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 153 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVST 230
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLW--DLET 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 231 ETKVTStvpLVGrsgilgelHNnifcgvacgrgrmaGSTFCVSYSgllcqfnekrvlekwinlkvslssclcVSQELIFC 310
Cdd:cd00200 83 GECVRT---LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 311 GCTDGIVRIFqahslhylanlpkphylgvDVAQG-LEPSFLFHRKaeavypDTVALTFDPIHQWLSCVYKDHSIYIWDVK 389
Cdd:cd00200 111 SSRDKTIKVW-------------------DVETGkCLTTLRGHTD------WVNSVAFSPDGTFVASSSQDGTIKLWDLR 165
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 390 DINRVGKVwseLFHSSYVWNVEVYPEfedqraclpSGSFLTCSSDNTIRFWNLDSSpdshwqknifsntllkvvyvendi 469
Cdd:cd00200 166 TGKCVATL---TGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------ 209
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 470 QHLQDMshfpdRGSENgtpmdvkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpETGL 549
Cdd:cd00200 210 KCLGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS--PDGK 274
|
410
....*....|....
gi 530416321 550 TlLASASRDRLIHV 563
Cdd:cd00200 275 R-LASGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-744 |
7.24e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 75.33 E-value: 7.24e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 480 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDR 559
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 560 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAgnrdiqmiscgadksiyfrsaqqgSDGlhfvrthhvaekttlydmdidit 639
Cdd:COG2319 101 TVRLWDLATG-LLLRTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 640 qKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 719
Cdd:COG2319 133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
250 260
....*....|....*....|....*
gi 530416321 720 SMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
95-321 |
8.84e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 73.52 E-value: 8.84e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 95 ILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMG 174
Cdd:cd00200 77 LWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSS 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 175 YqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSED-SSYFVTVGNRHVRFWflEVSTETKVTStvpLVGrsgilgelHN 252
Cdd:cd00200 155 Q--DGTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDgEKLLSSSSDGTIKLW--DLSTGKCLGT---LRG--------HE 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 253 NIFCGVACGRGRMagstFCVSYS--GLLCQFN-----EKRVL---EKWINlkvslssCLCVSQE--LIFCGCTDGIVRIF 320
Cdd:cd00200 220 NGVNSVAFSPDGY----LLASGSedGTIRVWDlrtgeCVQTLsghTNSVT-------SLAWSPDgkRLASGSADGTIRIW 288
|
.
gi 530416321 321 Q 321
Cdd:cd00200 289 D 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
87-224 |
3.00e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 3.00e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 87 YLAGC----VVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKNQVAEMLGHKYGVACVA 162
Cdd:cd00200 107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530416321 163 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 224
Cdd:cd00200 185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
979-1380 |
3.23e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 62.26 E-value: 3.23e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 979 SSDSPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETlTESPCRELFPAALGDVEAS--------- 1049
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-PQRPRRRAARPTVGSLTSLadppppppt 2707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1050 -EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetl 1128
Cdd:PHA03247 2708 pEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--- 2781
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1129 eawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGP 1203
Cdd:PHA03247 2782 ---RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1204 ivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVT 1283
Cdd:PHA03247 2859 -----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQ 2910
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1284 VPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPA 1358
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
|
410 420
....*....|....*....|....*
gi 530416321 1359 RwsePWVPVEALPPSPL---ELSRV 1380
Cdd:PHA03247 2983 A---PSREAPASSTPPLtghSLSRV 3004
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1133-1376 |
7.50e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 7.50e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1212
Cdd:PHA03247 2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1213 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1289
Cdd:PHA03247 2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1290 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1369
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811
|
....*..
gi 530416321 1370 LPPSPLE 1376
Cdd:PHA03247 2812 LAPAAAL 2818
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1104-1373 |
5.79e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 5.79e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1104 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 1179
Cdd:PHA03247 2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1180 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 1254
Cdd:PHA03247 2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1255 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1322
Cdd:PHA03247 2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 530416321 1323 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1373
Cdd:PHA03247 2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
981-1379 |
5.89e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 5.89e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 981 DSPKDQSPPEGCAGPTEDelslPEGPSVPSSSLPQTPEQEKFLRHHFETLTESPCRELFPAAlgdvEASEAEDHFFNPRL 1060
Cdd:PHA03247 2590 DAPPQSARPRAPVDDRGD----PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP----PPERPRDDPAPGRV 2661
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1061 SIS--TQFLSSLQKASRFTHTFPPRATQCLV-------------KSPEvklmdrggSQPRAGTGyASPDRTHVLAAGKAE 1125
Cdd:PHA03247 2662 SRPrrARRLGRAAQASSPPQRPRRRAARPTVgsltsladpppppPTPE--------PAPHALVS-ATPLPPGPAAARQAS 2732
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1126 ETLEAWRPPPPclTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIV 1205
Cdd:PHA03247 2733 PALPAAPAPPA--VPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1206 ATLAQPLRRPSSVgelASLGQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLtlssacdgllQPPvdTQPGVTVP 1285
Cdd:PHA03247 2811 VLAPAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR----------RPP--SRSPAAKP 2875
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1286 A---------VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQG 1355
Cdd:PHA03247 2876 AaparppvrrLARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
|
410 420
....*....|....*....|....
gi 530416321 1356 SPARwSEPWVPveALPPSPLELSR 1379
Cdd:PHA03247 2956 SGAV-PQPWLG--ALVPGRVAVPR 2976
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
704-743 |
1.41e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.46 E-value: 1.41e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 530416321 704 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 743
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1133-1372 |
4.11e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 4.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1210
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1277
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1278 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1346
Cdd:pfam03154 343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
|
250 260
....*....|....*....|....*.
gi 530416321 1347 SGALGLLQGSPArwSEPWVPVEALPP 1372
Cdd:pfam03154 409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
634-726 |
7.13e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 7.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 634 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 712
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 530416321 713 GHSEIITSMKFTYD 726
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
705-742 |
1.12e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.79 E-value: 1.12e-04
10 20 30
....*....|....*....|....*....|....*...
gi 530416321 705 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 742
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
963-1331 |
3.57e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 3.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 963 VEAGPGDQQGDSYLRVSSDSPKDQSPPEGCAGP-TEDELSLPEGPSVPSSS----LPQTPEQEKFLRHHFETLTESPCRE 1037
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPgGPARPARPPTTAGPPAPappaAPAAGPPRRLTRPAVASLSESRESL 2798
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1038 LFPAALGDVEASEAEDHFFNPRLSISTQFL----SSLQKASRFTHT-FPPRATQCLVKSPEVKLMDRGGSQPRAGTGYAS 1112
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLppptSAQPTAPPPPPGpPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1113 PD-RTHVLAAGKAEETLEAWRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAP---TPGLAQGVHAPSTCSYMEATASSR 1188
Cdd:PHA03247 2879 ARpPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPpppPPPRPQPPLAPTTDPAGAGEPSGA 2958
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1189 ARISRSISLGDSEGPIVATLAqPLRRPSsvgelaslgqelQAITTATTPSLDSEgQEPALRSWGN----HEARANLRLTL 1264
Cdd:PHA03247 2959 VPQPWLGALVPGRVAVPRFRV-PQPAPS------------REAPASSTPPLTGH-SLSRVSSWASslalHEETDPPPVSL 3024
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321 1265 SSAcdglLQPPVDTQPGvtvpavsfPAPSPVEESALRLHGSAFRPSLPAPESPglPAHPSNPQLPEA 1331
Cdd:PHA03247 3025 KQT----LWPPDDTEDS--------DADSLFDSDSERSDLEALDPLPPEPHDP--FAHEPDPATPEA 3077
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1269-1375 |
1.13e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 43.51 E-value: 1.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1269 DGLLQPPVDTQPGVT-VPAVSFPAPSPVEESALRLHGSAFRPSLPAPespgLPAHP-SNPQLPEARPGIPGGTASLLEPT 1346
Cdd:PHA03379 482 DQLPGVVQDGRPACApVPAPAGPIVRPWEASLSQVPGVAFAPVMPQP----MPVEPvPVPTVALERPVCPAPPLIAMQGP 557
|
90 100
....*....|....*....|....*....
gi 530416321 1347 SGALGLLQGSPARWSEPWVPVEALPPSPL 1375
Cdd:PHA03379 558 GETSGIVRVRERWRPAPWTPNPPRSPSQM 586
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
145-185 |
3.11e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.91 E-value: 3.11e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 530416321 145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
982-1333 |
4.44e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.51 E-value: 4.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 982 SPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETLTESPCRELFPAALGDVEASEAEDHFFNPRLS 1061
Cdd:PRK07764 436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRER 515
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1062 IStQFLSSLQKASRFTHTfppratqclVKSPEVK-LMDRGG------SQPRAGTGYASPDRTHVLAAGKAEETLEAWR-- 1132
Cdd:PRK07764 516 WP-EILAAVPKRSRKTWA---------ILLPEATvLGVRGDtlvlgfSTGGLARRFASPGNAEVLVTALAEELGGDWQve 585
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 ----------------PPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRAriSRSIS 1196
Cdd:PRK07764 586 avvgpapgaaggegppAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA--VPDAS 663
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1197 LGDSEGPIVATLAQPLRRPSSVGELASLGqelqaittattPSLDSEGQEPALRSWGNHEARANLRLTlssacdgllQPPV 1276
Cdd:PRK07764 664 DGGDGWPAKAGGAAPAAPPPAPAPAAPAA-----------PAGAAPAQPAPAPAATPPAGQADDPAA---------QPPQ 723
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321 1277 DTQPGVTVPAVSFPAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARP 1333
Cdd:PRK07764 724 AAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
|
| SPS1 |
COG0515 |
Serine/threonine protein kinase [Signal transduction mechanisms]; |
1211-1445 |
8.04e-03 |
|
Serine/threonine protein kinase [Signal transduction mechanisms];
Pssm-ID: 440281 [Multi-domain] Cd Length: 482 Bit Score: 40.38 E-value: 8.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1290
Cdd:COG0515 251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1291 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1370
Cdd:COG0515 326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530416321 1371 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1445
Cdd:COG0515 406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
117-170 |
8.44e-03 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 40.79 E-value: 8.44e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321 117 LAFSPDGKYIV---TGENGHRpAVRIWDVEEKnQVAEMLGHKYGVACVAFSPNMKHI 170
Cdd:COG4946 437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETG-KTVQLTDGRYDDGSPAFSPDGKYL 491
|
|
|