|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
339-739 |
1.13e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 144.28 E-value: 1.13e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 339 VDVAQGLEPRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACL 418
Cdd:COG2319 63 LDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFS 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 419 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 497
Cdd:COG2319 130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 498 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 577
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 578 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 657
Cdd:COG2319 249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 658 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 737
Cdd:COG2319 323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
..
gi 2462564671 738 HL 739
Cdd:COG2319 400 DL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
487-739 |
2.20e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 2.20e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 487 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 566
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 567 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 646
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 647 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 726
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 2462564671 727 TVSGDSCVFIWHL 739
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
93-439 |
5.43e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 5.43e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319 144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319 222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319 287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 331 LpKPHYLGVDvaqgleprkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWNVevyp 409
Cdd:COG2319 326 L-TGHTGAVR------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTSV---- 378
|
330 340 350
....*....|....*....|....*....|.
gi 2462564671 410 efedqrACLPSGSFL-TCSSDNTIRFWNLDS 439
Cdd:COG2319 379 ------AFSPDGRTLaSGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
111-436 |
1.88e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 1.88e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNN-IFCGVACGRGRMA 266
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HTDwVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 267 gstFCVSYSGL-----LCQFNEKRVL---EKWINlkvslssCLCVS--QELIFCGCTDGIVRIFQAHSLHYLANLpkphy 336
Cdd:cd00200 151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSpdGEKLLSSSSDGTIKLWDLSTGKCLGTL----- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 337 lgvdvaqglePRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrA 416
Cdd:cd00200 216 ----------RGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL----------A 268
|
330 340
....*....|....*....|.
gi 2462564671 417 CLPSGSFL-TCSSDNTIRFWN 436
Cdd:cd00200 269 WSPDGKRLaSGSADGTIRIWD 289
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1160-1403 |
8.20e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 8.20e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1239
Cdd:PHA03247 2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1240 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1316
Cdd:PHA03247 2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1317 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1396
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811
|
....*..
gi 2462564671 1397 LPPSPLE 1403
Cdd:PHA03247 2812 LAPAAAL 2818
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
699-738 |
1.55e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.46 E-value: 1.55e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462564671 699 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 738
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1160-1399 |
5.27e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 5.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1237
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1304
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1305 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1373
Cdd:pfam03154 343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
|
250 260
....*....|....*....|....*.
gi 2462564671 1374 SGALGLLQGSPArwSEPWVPVEALPP 1399
Cdd:pfam03154 409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
629-721 |
7.26e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 7.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 629 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 707
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 2462564671 708 GHSEIITSMKFTYD 721
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
145-185 |
3.57e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 3.57e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462564671 145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
| SPS1 |
COG0515 |
Serine/threonine protein kinase [Signal transduction mechanisms]; |
1238-1472 |
8.34e-03 |
|
Serine/threonine protein kinase [Signal transduction mechanisms];
Pssm-ID: 440281 [Multi-domain] Cd Length: 482 Bit Score: 40.38 E-value: 8.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:COG0515 251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1397
Cdd:COG0515 326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462564671 1398 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1472
Cdd:COG0515 406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
339-739 |
1.13e-36 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 144.28 E-value: 1.13e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 339 VDVAQGLEPRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACL 418
Cdd:COG2319 63 LDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFS 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 419 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 497
Cdd:COG2319 130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 498 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 577
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 578 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 657
Cdd:COG2319 249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 658 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 737
Cdd:COG2319 323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
..
gi 2462564671 738 HL 739
Cdd:COG2319 400 DL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
487-739 |
2.20e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.97 E-value: 2.20e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 487 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 566
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 567 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 646
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 647 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 726
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 2462564671 727 TVSGDSCVFIWHL 739
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
398-737 |
2.99e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 113.58 E-value: 2.99e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 398 HSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdr 476
Cdd:cd00200 8 HTGGVTCV----------AFSPDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH---- 50
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 477 gsengtpmdvKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLI 556
Cdd:cd00200 51 ----------TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTI 117
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 557 HVLNVEkNYNLEQTLDDHSSSITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQK 636
Cdd:cd00200 118 KVWDVE-TGKCLTTLRGHTDWVNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGE 190
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 637 YVAVACQDRNVRVYNTVNGKQKKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSM 716
Cdd:cd00200 191 KLLSSSSDGTIKLWDLSTGKCLGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSL 267
|
330 340
....*....|....*....|.
gi 2462564671 717 KFTYDCHHLITVSGDSCVFIW 737
Cdd:cd00200 268 AWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
42-562 |
5.89e-25 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 109.23 E-value: 5.89e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319 9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319 89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319 165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGLEPRKAEAVYPDTVA 359
Cdd:COG2319 217 --------------------------KLLASGSADGTVRLW-------------------DLATGKLLRTLTGHSGSVRS 251
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 360 LTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 437
Cdd:COG2319 252 VAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTVRLWDL 317
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 438 DSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLRIHELH 517
Cdd:COG2319 318 ATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVRLWDLA 360
|
490 500 510 520
....*....|....*....|....*....|....*....|....*
gi 2462564671 518 FMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 562
Cdd:COG2319 361 TGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
93-439 |
5.43e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 106.53 E-value: 5.43e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319 144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319 222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319 287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 331 LpKPHYLGVDvaqgleprkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWNVevyp 409
Cdd:COG2319 326 L-TGHTGAVR------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTSV---- 378
|
330 340 350
....*....|....*....|....*....|.
gi 2462564671 410 efedqrACLPSGSFL-TCSSDNTIRFWNLDS 439
Cdd:COG2319 379 ------AFSPDGRTLaSGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
475-740 |
9.70e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 105.76 E-value: 9.70e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 475 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 554
Cdd:COG2319 66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 555 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 634
Cdd:COG2319 143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 635 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 714
Cdd:COG2319 216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
|
250 260
....*....|....*....|....*.
gi 2462564671 715 SMKFTYDCHHLITVSGDSCVFIWHLG 740
Cdd:COG2319 293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
2-513 |
3.93e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.84 E-value: 3.93e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 2 AAVGSGGYARNDAGEKLPSVMAGVPARRGQSSPPPAPPICLRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPG 81
Cdd:COG2319 11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 82 TGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKNQVAEMLGHKYGVACV 161
Cdd:COG2319 91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 162 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTstvP 239
Cdd:COG2319 169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW--DLATGKLLR---T 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 240 LVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRI 319
Cdd:COG2319 242 LTG--------HSGSVRSVA------------FSPDG-----------------------------RLLASGSADGTVRL 272
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 320 FqahslhylanlpkphylgvDVAQGLEPRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FH 398
Cdd:COG2319 273 W-------------------DLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLAT----GKLLRTLtGH 329
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 399 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrG 477
Cdd:COG2319 330 TGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDLAT-------------------------------------G 362
|
490 500 510
....*....|....*....|....*....|....*.
gi 2462564671 478 SENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRI 513
Cdd:COG2319 363 ELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
111-436 |
1.88e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 96.25 E-value: 1.88e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNN-IFCGVACGRGRMA 266
Cdd:cd00200 84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HTDwVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 267 gstFCVSYSGL-----LCQFNEKRVL---EKWINlkvslssCLCVS--QELIFCGCTDGIVRIFQAHSLHYLANLpkphy 336
Cdd:cd00200 151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSpdGEKLLSSSSDGTIKLWDLSTGKCLGTL----- 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 337 lgvdvaqglePRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrA 416
Cdd:cd00200 216 ----------RGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL----------A 268
|
330 340
....*....|....*....|.
gi 2462564671 417 CLPSGSFL-TCSSDNTIRFWN 436
Cdd:cd00200 269 WSPDGKRLaSGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
153-558 |
1.32e-15 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 79.30 E-value: 1.32e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 153 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVST 230
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLW--DLET 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 231 ETKVTStvpLVGrsgilgelHNnifcgvacgrgrmaGSTFCVSYSgllcqfnekrvlekwinlkvslssclcVSQELIFC 310
Cdd:cd00200 83 GECVRT---LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 311 GCTDGIVRIFqahslhylanlpkphylgvDVAQGleprKAEAVYP---DTV-ALTFDPIHQWLSCVYKDHSIYIWDVKDI 386
Cdd:cd00200 111 SSRDKTIKVW-------------------DVETG----KCLTTLRghtDWVnSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 387 NRVGKVwseLFHSSYVWNVEVYPEfedqraclpSGSFLTCSSDNTIRFWNLDSSpdshwqknifsntllkvvyvendiQH 466
Cdd:cd00200 168 KCVATL---TGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------KC 211
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 467 LQDMshfpdRGSENgtpmdvkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpETGLTl 546
Cdd:cd00200 212 LGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS--PDGKR- 275
|
410
....*....|..
gi 2462564671 547 LASASRDRLIHV 558
Cdd:cd00200 276 LASGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
475-739 |
8.16e-14 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 75.33 E-value: 8.16e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 475 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDR 554
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 555 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAgnrdiqmiscgadksiyfrsaqqgSDGlhfvrthhvaekttlydmdidit 634
Cdd:COG2319 101 TVRLWDLATG-LLLRTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 635 qKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 714
Cdd:COG2319 133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
|
250 260
....*....|....*....|....*
gi 2462564671 715 SMKFTYDCHHLITVSGDSCVFIWHL 739
Cdd:COG2319 209 SVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
95-321 |
1.09e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 73.52 E-value: 1.09e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 95 ILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMG 174
Cdd:cd00200 77 LWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSS 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 175 YqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSED-SSYFVTVGNRHVRFWflEVSTETKVTStvpLVGrsgilgelHN 252
Cdd:cd00200 155 Q--DGTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDgEKLLSSSSDGTIKLW--DLSTGKCLGT---LRG--------HE 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 253 NIFCGVACGRGRMagstFCVSYS--GLLCQFN-----EKRVL---EKWINlkvslssCLCVSQE--LIFCGCTDGIVRIF 320
Cdd:cd00200 220 NGVNSVAFSPDGY----LLASGSedGTIRVWDlrtgeCVQTLsghTNSVT-------SLAWSPDgkRLASGSADGTIRIW 288
|
.
gi 2462564671 321 Q 321
Cdd:cd00200 289 D 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
87-224 |
3.51e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 71.98 E-value: 3.51e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 87 YLAGC----VVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKNQVAEMLGHKYGVACVA 162
Cdd:cd00200 107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462564671 163 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 224
Cdd:cd00200 185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1160-1403 |
8.20e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 8.20e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1239
Cdd:PHA03247 2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1240 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1316
Cdd:PHA03247 2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1317 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1396
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811
|
....*..
gi 2462564671 1397 LPPSPLE 1403
Cdd:PHA03247 2812 LAPAAAL 2818
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1131-1400 |
6.60e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 6.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1131 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 1206
Cdd:PHA03247 2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1207 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 1281
Cdd:PHA03247 2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1282 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1349
Cdd:PHA03247 2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 2462564671 1350 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1400
Cdd:PHA03247 2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1118-1403 |
7.68e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 7.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1118 SPEVKLMDRGGSQPRAgtgyASPDRTHVLA-AGKAEETLEAWRPP--PPCLTSLAScvpasSVLPTDRNlPTPTSAPTPG 1194
Cdd:PHA03247 2646 VPPPERPRDDPAPGRV----SRPRRARRLGrAAQASSPPQRPRRRaaRPTVGSLTS-----LADPPPPP-PTPEPAPHAL 2715
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1195 LAQgvhAPSTCSYMEATASSRARISRSISLGDSEGPIV-ATLAQPLRRPSSVGELASlgqelqaittaTTPSLDSEGQEP 1273
Cdd:PHA03247 2716 VSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAP-----------APPAAPAAGPPR 2781
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1274 ALRSWGNHEARANlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPA-PSPVeesalrlhgsafrPSLPAPESPGLPAHPSN 1352
Cdd:PHA03247 2782 RLTRPAVASLSES-RESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPP-------------PTSAQPTAPPPPPGPPP 2847
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 2462564671 1353 PQLPEARPGIPGGTASLLEPTSGALGLLQGSP----ARWSEPWVPVE----ALPPSPLE 1403
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvRRLARPAVSRStesfALPPDQPE 2906
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1085-1407 |
1.17e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.79 E-value: 1.17e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1085 PRLSISTQFLSSLQKASRFTHTFPPR--ATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetleawRPPP 1162
Cdd:PHA03247 2712 PHALVSATPLPPGPAAARQASPALPAapAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR------RLTR 2785
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1163 PCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGPivatlAQ 1237
Cdd:PHA03247 2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP-----GG 2860
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:PHA03247 2861 DVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQPQAPPPP 2917
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPARwsePWV 1392
Cdd:PHA03247 2918 QPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---PSR 2986
|
330
....*....|....*...
gi 2462564671 1393 PVEALPPSPL---ELSRV 1407
Cdd:PHA03247 2987 EAPASSTPPLtghSLSRV 3004
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1106-1406 |
1.44e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.40 E-value: 1.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1106 TFPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAW-RPPPPCLTSLASCVPASSVLPTDRNL 1184
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1185 -----------------PTPTSAPTPGLAQGVHAPSTCSYMEATASSRARIS---RSISLGDSEGPIVATLAQPLRRPSS 1244
Cdd:PHA03247 2726 aaarqaspalpaapappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgppRRLTRPAVASLSESRESLPSPWDPA 2805
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1245 VGELASLGQELQAITTAT--TPSLDSEGQEPALRSWGNHEARANLRLTLSSACDGLLQPPVDTQPGVTVPA--------- 1313
Cdd:PHA03247 2806 DPPAAVLAPAAALPPAASpaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAaparppvrr 2885
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1314 VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARwSEPWV 1392
Cdd:PHA03247 2886 LARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV-PQPWL 2964
|
330
....*....|....
gi 2462564671 1393 PveALPPSPLELSR 1406
Cdd:PHA03247 2965 G--ALVPGRVAVPR 2976
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
699-738 |
1.55e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.46 E-value: 1.55e-05
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2462564671 699 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 738
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1160-1399 |
5.27e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 5.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1237
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1304
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1305 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1373
Cdd:pfam03154 343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
|
250 260
....*....|....*....|....*.
gi 2462564671 1374 SGALGLLQGSPArwSEPWVPVEALPP 1399
Cdd:pfam03154 409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
629-721 |
7.26e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 7.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 629 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 707
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 2462564671 708 GHSEIITSMKFTYD 721
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
700-737 |
1.19e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 40.79 E-value: 1.19e-04
10 20 30
....*....|....*....|....*....|....*...
gi 2462564671 700 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 737
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1108-1381 |
2.74e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 2.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1108 PPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEawrPPPPcltslaSCVPASSVLPTDrnlPTP 1187
Cdd:PHA03247 2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPL---PPPT------SAQPTAPPPPPG---PPP 2847
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1188 TSAPT-----PGLAQGVHAPSTCSYMEATASSRARISRsislgdsegpivatLAQP-LRRPSSVGELASLGQELQAITTA 1261
Cdd:PHA03247 2848 PSLPLggsvaPGGDVRRRPPSRSPAAKPAAPARPPVRR--------------LARPaVSRSTESFALPPDQPERPPQPQA 2913
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1262 TTPSLDSEGQEPALRSWGNHEARANlrltlssaCDGLLQPPVDT----QPGVTVPAVSFPAPSPVEESALRLHGSAFRPS 1337
Cdd:PHA03247 2914 PPPPQPQPQPPPPPQPQPPPPPPPR--------PQPPLAPTTDPagagEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 2462564671 1338 LPAPESpglpahPSNPQLPEARPGIPGGTASL---LEPTSGALGLLQ 1381
Cdd:PHA03247 2986 REAPAS------STPPLTGHSLSRVSSWASSLalhEETDPPPVSLKQ 3026
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
145-185 |
3.57e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 3.57e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462564671 145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1107-1358 |
5.75e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 5.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1107 FPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPD-RTHVLAAGKAEETLEAWRPPPPCLTSLASCVPASSVLPTDRNLP 1185
Cdd:PHA03247 2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARpPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1186 TPTSAP---TPGLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAqPLRRPSsvgelaslgqelQAITTAT 1262
Cdd:PHA03247 2926 PPQPQPpppPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV-PQPAPS------------REAPASS 2992
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1263 TPSLDSEgQEPALRSWGN----HEARANLRLTLSSAcdglLQPPVDTQPGvtvpavsfPAPSPVEESALRLHGSAFRPSL 1338
Cdd:PHA03247 2993 TPPLTGH-SLSRVSSWASslalHEETDPPPVSLKQT----LWPPDDTEDS--------DADSLFDSDSERSDLEALDPLP 3059
|
250 260
....*....|....*....|
gi 2462564671 1339 PAPESPglPAHPSNPQLPEA 1358
Cdd:PHA03247 3060 PEPHDP--FAHEPDPATPEA 3077
|
|
| SPS1 |
COG0515 |
Serine/threonine protein kinase [Signal transduction mechanisms]; |
1238-1472 |
8.34e-03 |
|
Serine/threonine protein kinase [Signal transduction mechanisms];
Pssm-ID: 440281 [Multi-domain] Cd Length: 482 Bit Score: 40.38 E-value: 8.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:COG0515 251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1397
Cdd:COG0515 326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462564671 1398 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1472
Cdd:COG0515 406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
117-170 |
8.61e-03 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 40.79 E-value: 8.61e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*..
gi 2462564671 117 LAFSPDGKYIV---TGENGHRpAVRIWDVEEKnQVAEMLGHKYGVACVAFSPNMKHI 170
Cdd:COG4946 437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETG-KTVQLTDGRYDDGSPAFSPDGKYL 491
|
|
|