Conserved domains on  [gi|2462564671|ref|XP_054176634|]

WD repeat-containing protein 62 isoform X2 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein (domain architecture ID 11455410)

WD40 repeat domain-containing protein similar to proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744


List of domain hits

Name Accession Description Interval E-value
WD40  COG2319  WD40 repeat [General function prediction only];  339-739  1.13e-36

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 144.28  E-value: 1.13e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  339 VDVAQGLEPRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACL 418
Cdd:COG2319     63 LDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFS 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  419 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 497
Cdd:COG2319    130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  498 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 577
Cdd:COG2319    173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  578 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 657
Cdd:COG2319    249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  658 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 737
Cdd:COG2319    323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399

                   ..
gi 2462564671  738 HL 739
Cdd:COG2319    400 DL 401
WD40  COG2319  WD40 repeat [General function prediction only];  93-439  5.43e-24

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 106.53  E-value: 5.43e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319    144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319    222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319    287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  331 LpKPHYLGVDvaqgleprkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWNVevyp 409
Cdd:COG2319    326 L-TGHTGAVR------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTSV---- 378
                          330       340       350
                   ....*....|....*....|....*....|.
gi 2462564671  410 efedqrACLPSGSFL-TCSSDNTIRFWNLDS 439
Cdd:COG2319    379 ------AFSPDGRTLaSGSADGTVRLWDLAT 403
PHA03247 super family  cl33720  large tegument protein UL36; Provisional  1160-1403  8.20e-08

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 8.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1239
Cdd:PHA03247  2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1240 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1316
Cdd:PHA03247  2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1317 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1396
Cdd:PHA03247  2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811

                   ....*..
gi 2462564671 1397 LPPSPLE 1403
Cdd:PHA03247  2812 LAPAAAL 2818
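
Each hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be turned into typed fields for downstream filtering. The function name `parse_summary` and the dictionary keys are illustrative assumptions, not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line as it appears in this report.
# The bracketed flag (e.g. "[Multi-domain]") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<flag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
            "Bit Score: 144.28  E-value: 1.13e-36")
    print(parse_summary(line))
```

Parsed this way, hits can be sorted or thresholded on `e_value` without re-reading the text report.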
 
Name Accession Description Interval E-value
WD40  cd00200  WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...  487-739  2.20e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions, including adaptor/regulatory modules in signal transduction, pre-mRNA processing, and cytoskeleton assembly. It typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core. It serves as a stable propeller-like platform to which proteins can bind either stably or reversibly, and forms a propeller-like structure with several blades, where each blade is composed of a four-stranded anti-parallel β-sheet. Instances with few detectable copies are hypothesized to form larger structures by dimerization. Each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure. Residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands. Seven copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.97  E-value: 2.20e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  487 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 566
Cdd:cd00200      9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  567 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 646
Cdd:cd00200     85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  647 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 726
Cdd:cd00200    159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                          250
                   ....*....|...
gi 2462564671  727 TVSGDSCVFIWHL 739
Cdd:cd00200    236 SGSEDGTIRVWDL 248
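
The cd00200 description characterizes a WD40 repeat by a GH dipeptide near its N-terminus, a WD dipeptide at its C-terminus, and a conserved core in between. The sketch below turns that description into a crude sequence scan. It is a heuristic illustration only, not how CDD detects domains (CDD scores position-specific matrices); the function name `find_wd40_candidates` and the 20-35 residue core range are assumptions chosen for this example. As the alignments in this report show, many real repeats lack an exact GH or WD and would be missed by such a scan.

```python
import re


def find_wd40_candidates(seq, min_core=20, max_core=35):
    """Return (start, end, fragment) tuples for GH...WD stretches.

    Coordinates are 1-based and inclusive. The core length range between
    GH and WD is an assumed heuristic, not a value taken from CDD.
    """
    # Lazy quantifier: stop at the first WD that falls inside the allowed range.
    pattern = re.compile(r"GH[A-Z]{%d,%d}?WD" % (min_core, max_core))
    hits = []
    for m in pattern.finditer(seq.upper()):
        hits.append((m.start() + 1, m.end(), m.group()))
    return hits


if __name__ == "__main__":
    # smart00320 consensus row and the matching query segment (699-738),
    # both copied from the alignments in this report.
    consensus = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
    query = "SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH"
    print(find_wd40_candidates(consensus))
    print(find_wd40_candidates(query))
```

The query segment ends in WH rather than WD, which is one reason profile-based searches are used instead of literal pattern matching.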
WD40  cd00200  WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...  111-436  1.88e-21


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 96.25  E-value: 1.88e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200      9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNN-IFCGVACGRGRMA 266
Cdd:cd00200     84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HTDwVNSVAFSPDGTFV 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  267 gstFCVSYSGL-----LCQFNEKRVL---EKWINlkvslssCLCVS--QELIFCGCTDGIVRIFQAHSLHYLANLpkphy 336
Cdd:cd00200    151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSpdGEKLLSSSSDGTIKLWDLSTGKCLGTL----- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  337 lgvdvaqglePRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrA 416
Cdd:cd00200    216 ----------RGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL----------A 268
                          330       340
                   ....*....|....*....|.
gi 2462564671  417 CLPSGSFL-TCSSDNTIRFWN 436
Cdd:cd00200    269 WSPDGKRLaSGSADGTIRIWD 289
PHA03247  PHA03247  large tegument protein UL36; Provisional  1160-1403  8.20e-08

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 8.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1239
Cdd:PHA03247  2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1240 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1316
Cdd:PHA03247  2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1317 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1396
Cdd:PHA03247  2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811

                   ....*..
gi 2462564671 1397 LPPSPLE 1403
Cdd:PHA03247  2812 LAPAAAL 2818
WD40  smart00320  WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...  699-738  1.55e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.55e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462564671   699 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 738
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Atrophin-1  pfam03154  Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...  1160-1399  5.27e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1237
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1304
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1305 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1373
Cdd:pfam03154  343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
                          250       260
                   ....*....|....*....|....*.
gi 2462564671 1374 SGALGLLQGSPArwSEPWVPVEALPP 1399
Cdd:pfam03154  409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
ANAPC4_WD40  pfam12894  Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...  629-721  7.26e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC.


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 7.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  629 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 707
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 2462564671  708 GHSEIITSMKFTYD 721
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40  smart00320  WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...  145-185  3.57e-03


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 3.57e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2462564671   145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
SPS1  COG0515  Serine/threonine protein kinase [Signal transduction mechanisms];  1238-1472  8.34e-03


Pssm-ID: 440281 [Multi-domain]  Cd Length: 482  Bit Score: 40.38  E-value: 8.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:COG0515    251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1397
Cdd:COG0515    326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462564671 1398 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1472
Cdd:COG0515    406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
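
The alignment blocks in this report pair a query row (`gi ...`) with a domain-model row (`Cdd:...`). The sketch below counts identical residues over the aligned columns of one such pair, which is a quick way to see how literal a low-scoring match is (for example, the COG0515 hit above aligns the query against long runs of alanine). It assumes that dashes mark gaps and that lowercase letters mark unaligned residues; the function name `alignment_identity` is hypothetical.

```python
def alignment_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Residue strings only (coordinates stripped), taken from the
    # smart00320 hit at 699-738 in this report.
    query = "SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH"
    model = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
    identical, aligned = alignment_identity(query, model)
    print(f"{identical}/{aligned} aligned columns identical")
```

Identity alone is not a significance measure; the bit score and E-value on each summary line remain the values to rank hits by.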
 
Name Accession Description Interval E-value
WD40  cd00200  WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...  398-737  2.99e-27


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.58  E-value: 2.99e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  398 HSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdr 476
Cdd:cd00200      8 HTGGVTCV----------AFSPDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH---- 50
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  477 gsengtpmdvKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLI 556
Cdd:cd00200     51 ----------TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTI 117
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  557 HVLNVEkNYNLEQTLDDHSSSITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQK 636
Cdd:cd00200    118 KVWDVE-TGKCLTTLRGHTDWVNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGE 190
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  637 YVAVACQDRNVRVYNTVNGKQKKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSM 716
Cdd:cd00200    191 KLLSSSSDGTIKLWDLSTGKCLGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSL 267
                          330       340
                   ....*....|....*....|.
gi 2462564671  717 KFTYDCHHLITVSGDSCVFIW 737
Cdd:cd00200    268 AWSPDGKRLASGSADGTIRIW 288
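Each hit carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". A small parser can make those lines easier to compare across hits. The sketch below assumes the field labels appear exactly as printed in this report; `parse_hit_summary` and the dictionary keys are hypothetical names, not part of any NCBI tool.

```python
import re

# Field labels are taken from the summary lines in this report; the
# bracketed hit type (e.g. "Multi-domain") is treated as optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Parse one hit summary line into a dict, or return None."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "hit_type": match.group("hit_type"),
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


line = "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.58  E-value: 2.99e-27"
print(parse_hit_summary(line))
```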
WD40 COG2319
WD40 repeat [General function prediction only];
42-562 5.89e-25



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 109.23  E-value: 5.89e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319      9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319     89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319    165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGLEPRKAEAVYPDTVA 359
Cdd:COG2319    217 --------------------------KLLASGSADGTVRLW-------------------DLATGKLLRTLTGHSGSVRS 251
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  360 LTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 437
Cdd:COG2319    252 VAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTVRLWDL 317
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  438 DSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLRIHELH 517
Cdd:COG2319    318 ATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVRLWDLA 360
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*
gi 2462564671  518 FMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 562
Cdd:COG2319    361 TGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
93-439 5.43e-24



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 106.53  E-value: 5.43e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319    144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319    222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319    287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  331 LpKPHYLGVDvaqgleprkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWNVevyp 409
Cdd:COG2319    326 L-TGHTGAVR------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTSV---- 378
                          330       340       350
                   ....*....|....*....|....*....|.
gi 2462564671  410 efedqrACLPSGSFL-TCSSDNTIRFWNLDS 439
Cdd:COG2319    379 ------AFSPDGRTLaSGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
475-740 9.70e-24



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.76  E-value: 9.70e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  475 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 554
Cdd:COG2319     66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  555 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 634
Cdd:COG2319    143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  635 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 714
Cdd:COG2319    216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
                          250       260
                   ....*....|....*....|....*.
gi 2462564671  715 SMKFTYDCHHLITVSGDSCVFIWHLG 740
Cdd:COG2319    293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
WD40 COG2319
WD40 repeat [General function prediction only];
2-513 3.93e-23



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.84  E-value: 3.93e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671    2 AAVGSGGYARNDAGEKLPSVMAGVPARRGQSSPPPAPPICLRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPG 81
Cdd:COG2319     11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   82 TGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKNQVAEMLGHKYGVACV 161
Cdd:COG2319     91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  162 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTstvP 239
Cdd:COG2319    169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW--DLATGKLLR---T 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  240 LVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRI 319
Cdd:COG2319    242 LTG--------HSGSVRSVA------------FSPDG-----------------------------RLLASGSADGTVRL 272
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  320 FqahslhylanlpkphylgvDVAQGLEPRKAEAVYPDTVALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FH 398
Cdd:COG2319    273 W-------------------DLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLAT----GKLLRTLtGH 329
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  399 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrG 477
Cdd:COG2319    330 TGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDLAT-------------------------------------G 362
                          490       500       510
                   ....*....|....*....|....*....|....*.
gi 2462564671  478 SENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRI 513
Cdd:COG2319    363 ELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
111-436 1.88e-21



Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 96.25  E-value: 1.88e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200      9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNN-IFCGVACGRGRMA 266
Cdd:cd00200     84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HTDwVNSVAFSPDGTFV 150
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  267 gstFCVSYSGL-----LCQFNEKRVL---EKWINlkvslssCLCVS--QELIFCGCTDGIVRIFQAHSLHYLANLpkphy 336
Cdd:cd00200    151 ---ASSSQDGTiklwdLRTGKCVATLtghTGEVN-------SVAFSpdGEKLLSSSSDGTIKLWDLSTGKCLGTL----- 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  337 lgvdvaqglePRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrA 416
Cdd:cd00200    216 ----------RGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---HTNSVTSL----------A 268
                          330       340
                   ....*....|....*....|.
gi 2462564671  417 CLPSGSFL-TCSSDNTIRFWN 436
Cdd:cd00200    269 WSPDGKRLaSGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
153-558 1.32e-15



Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 79.30  E-value: 1.32e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  153 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVST 230
Cdd:cd00200      7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLW--DLET 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  231 ETKVTStvpLVGrsgilgelHNnifcgvacgrgrmaGSTFCVSYSgllcqfnekrvlekwinlkvslssclcVSQELIFC 310
Cdd:cd00200     83 GECVRT---LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  311 GCTDGIVRIFqahslhylanlpkphylgvDVAQGleprKAEAVYP---DTV-ALTFDPIHQWLSCVYKDHSIYIWDVKDI 386
Cdd:cd00200    111 SSRDKTIKVW-------------------DVETG----KCLTTLRghtDWVnSVAFSPDGTFVASSSQDGTIKLWDLRTG 167
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  387 NRVGKVwseLFHSSYVWNVEVYPEfedqraclpSGSFLTCSSDNTIRFWNLDSSpdshwqknifsntllkvvyvendiQH 466
Cdd:cd00200    168 KCVATL---TGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------KC 211
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  467 LQDMshfpdRGSENgtpmdvkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpETGLTl 546
Cdd:cd00200    212 LGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS--PDGKR- 275
                          410
                   ....*....|..
gi 2462564671  547 LASASRDRLIHV 558
Cdd:cd00200    276 LASGSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
475-739 8.16e-14



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 75.33  E-value: 8.16e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  475 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDR 554
Cdd:COG2319     24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  555 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAgnrdiqmiscgadksiyfrsaqqgSDGlhfvrthhvaekttlydmdidit 634
Cdd:COG2319    101 TVRLWDLATG-LLLRTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  635 qKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 714
Cdd:COG2319    133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
                          250       260
                   ....*....|....*....|....*
gi 2462564671  715 SMKFTYDCHHLITVSGDSCVFIWHL 739
Cdd:COG2319    209 SVAFSPDGKLLASGSADGTVRLWDL 233
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
95-321 1.09e-13



Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 73.52  E-value: 1.09e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   95 ILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMG 174
Cdd:cd00200     77 LWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSS 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  175 YqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSED-SSYFVTVGNRHVRFWflEVSTETKVTStvpLVGrsgilgelHN 252
Cdd:cd00200    155 Q--DGTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDgEKLLSSSSDGTIKLW--DLSTGKCLGT---LRG--------HE 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  253 NIFCGVACGRGRMagstFCVSYS--GLLCQFN-----EKRVL---EKWINlkvslssCLCVSQE--LIFCGCTDGIVRIF 320
Cdd:cd00200    220 NGVNSVAFSPDGY----LLASGSedGTIRVWDlrtgeCVQTLsghTNSVT-------SLAWSPDgkRLASGSADGTIRIW 288

                   .
gi 2462564671  321 Q 321
Cdd:cd00200    289 D 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
87-224 3.51e-13



Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 71.98  E-value: 3.51e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671   87 YLAGC----VVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKNQVAEMLGHKYGVACVA 162
Cdd:cd00200    107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462564671  163 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 224
Cdd:cd00200    185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
PHA03247 PHA03247
large tegument protein UL36; Provisional
1160-1403 8.20e-08



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 8.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1239
Cdd:PHA03247  2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1240 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1316
Cdd:PHA03247  2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1317 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1396
Cdd:PHA03247  2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811

                   ....*..
gi 2462564671 1397 LPPSPLE 1403
Cdd:PHA03247  2812 LAPAAAL 2818
PHA03247 PHA03247
large tegument protein UL36; Provisional
1131-1400 6.60e-07



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 6.60e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1131 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 1206
Cdd:PHA03247  2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1207 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 1281
Cdd:PHA03247  2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1282 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1349
Cdd:PHA03247  2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462564671 1350 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1400
Cdd:PHA03247  2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
PHA03247 PHA03247
large tegument protein UL36; Provisional
1118-1403 7.68e-07



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 7.68e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1118 SPEVKLMDRGGSQPRAgtgyASPDRTHVLA-AGKAEETLEAWRPP--PPCLTSLAScvpasSVLPTDRNlPTPTSAPTPG 1194
Cdd:PHA03247  2646 VPPPERPRDDPAPGRV----SRPRRARRLGrAAQASSPPQRPRRRaaRPTVGSLTS-----LADPPPPP-PTPEPAPHAL 2715
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1195 LAQgvhAPSTCSYMEATASSRARISRSISLGDSEGPIV-ATLAQPLRRPSSVGELASlgqelqaittaTTPSLDSEGQEP 1273
Cdd:PHA03247  2716 VSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAP-----------APPAAPAAGPPR 2781
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1274 ALRSWGNHEARANlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPA-PSPVeesalrlhgsafrPSLPAPESPGLPAHPSN 1352
Cdd:PHA03247  2782 RLTRPAVASLSES-RESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPP-------------PTSAQPTAPPPPPGPPP 2847
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462564671 1353 PQLPEARPGIPGGTASLLEPTSGALGLLQGSP----ARWSEPWVPVE----ALPPSPLE 1403
Cdd:PHA03247  2848 PSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvRRLARPAVSRStesfALPPDQPE 2906
PHA03247 PHA03247
large tegument protein UL36; Provisional
1085-1407 1.17e-06



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.79  E-value: 1.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1085 PRLSISTQFLSSLQKASRFTHTFPPR--ATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetleawRPPP 1162
Cdd:PHA03247  2712 PHALVSATPLPPGPAAARQASPALPAapAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR------RLTR 2785
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1163 PCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGPivatlAQ 1237
Cdd:PHA03247  2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP-----GG 2860
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:PHA03247  2861 DVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQPQAPPPP 2917
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPARwsePWV 1392
Cdd:PHA03247  2918 QPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---PSR 2986
                          330
                   ....*....|....*...
gi 2462564671 1393 PVEALPPSPL---ELSRV 1407
Cdd:PHA03247  2987 EAPASSTPPLtghSLSRV 3004
PHA03247 PHA03247
large tegument protein UL36; Provisional
1106-1406 1.44e-06



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1106 TFPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAW-RPPPPCLTSLASCVPASSVLPTDRNL 1184
Cdd:PHA03247  2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLaDPPPPPPTPEPAPHALVSATPLPPGP 2725
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1185 -----------------PTPTSAPTPGLAQGVHAPSTCSYMEATASSRARIS---RSISLGDSEGPIVATLAQPLRRPSS 1244
Cdd:PHA03247  2726 aaarqaspalpaapappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgppRRLTRPAVASLSESRESLPSPWDPA 2805
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1245 VGELASLGQELQAITTAT--TPSLDSEGQEPALRSWGNHEARANLRLTLSSACDGLLQPPVDTQPGVTVPA--------- 1313
Cdd:PHA03247  2806 DPPAAVLAPAAALPPAASpaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAaparppvrr 2885
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1314 VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARwSEPWV 1392
Cdd:PHA03247  2886 LARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV-PQPWL 2964
                          330
                   ....*....|....
gi 2462564671 1393 PveALPPSPLELSR 1406
Cdd:PHA03247  2965 G--ALVPGRVAVPR 2976
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
699-738 1.55e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.55e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2462564671   699 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 738
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
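
The two rows of a hit like the one above can be compared column by column to get a rough identity count. The following is a minimal sketch, not part of the CD-Search output. The function name `aligned_identity` is hypothetical, and the code assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned region, so both are skipped.

```python
def aligned_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: '-' is a gap and lowercase letters are unaligned residues;
    columns containing either are ignored.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # Skip gap columns and columns with unaligned (lowercase) residues.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# The smart00320 hit at residues 699-738 of the query.
query = "SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH"
subject = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"
identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} identical columns")
```

Only the residue strings are passed in; the `gi`/`Cdd:` labels and coordinates must be stripped from each row first.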
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1160-1399 5.27e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1160 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1237
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1304
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1305 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1373
Cdd:pfam03154  343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
                          250       260
                   ....*....|....*....|....*.
gi 2462564671 1374 SGALGLLQGSPArwSEPWVPVEALPP 1399
Cdd:pfam03154  409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
629-721 7.26e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 7.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671  629 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 707
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 2462564671  708 GHSEIITSMKFTYD 721
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 pfam00400
WD domain, G-beta repeat;
700-737 1.19e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.79  E-value: 1.19e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2462564671  700 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 737
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
PHA03247 PHA03247
large tegument protein UL36; Provisional
1108-1381 2.74e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 2.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1108 PPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEawrPPPPcltslaSCVPASSVLPTDrnlPTP 1187
Cdd:PHA03247  2780 PRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPL---PPPT------SAQPTAPPPPPG---PPP 2847
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1188 TSAPT-----PGLAQGVHAPSTCSYMEATASSRARISRsislgdsegpivatLAQP-LRRPSSVGELASLGQELQAITTA 1261
Cdd:PHA03247  2848 PSLPLggsvaPGGDVRRRPPSRSPAAKPAAPARPPVRR--------------LARPaVSRSTESFALPPDQPERPPQPQA 2913
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1262 TTPSLDSEGQEPALRSWGNHEARANlrltlssaCDGLLQPPVDT----QPGVTVPAVSFPAPSPVEESALRLHGSAFRPS 1337
Cdd:PHA03247  2914 PPPPQPQPQPPPPPQPQPPPPPPPR--------PQPPLAPTTDPagagEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPS 2985
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 2462564671 1338 LPAPESpglpahPSNPQLPEARPGIPGGTASL---LEPTSGALGLLQ 1381
Cdd:PHA03247  2986 REAPAS------STPPLTGHSLSRVSSWASSLalhEETDPPPVSLKQ 3026
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
145-185 3.57e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 3.57e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 2462564671   145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
PHA03247 PHA03247
large tegument protein UL36; Provisional
1107-1358 5.75e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 5.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1107 FPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPD-RTHVLAAGKAEETLEAWRPPPPCLTSLASCVPASSVLPTDRNLP 1185
Cdd:PHA03247  2846 PPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARpPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1186 TPTSAP---TPGLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAqPLRRPSsvgelaslgqelQAITTAT 1262
Cdd:PHA03247  2926 PPQPQPpppPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV-PQPAPS------------REAPASS 2992
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1263 TPSLDSEgQEPALRSWGN----HEARANLRLTLSSAcdglLQPPVDTQPGvtvpavsfPAPSPVEESALRLHGSAFRPSL 1338
Cdd:PHA03247  2993 TPPLTGH-SLSRVSSWASslalHEETDPPPVSLKQT----LWPPDDTEDS--------DADSLFDSDSERSDLEALDPLP 3059
                          250       260
                   ....*....|....*....|
gi 2462564671 1339 PAPESPglPAHPSNPQLPEA 1358
Cdd:PHA03247  3060 PEPHDP--FAHEPDPATPEA 3077
SPS1 COG0515
Serine/threonine protein kinase [Signal transduction mechanisms];
1238-1472 8.34e-03

Serine/threonine protein kinase [Signal transduction mechanisms];


Pssm-ID: 440281 [Multi-domain]  Cd Length: 482  Bit Score: 40.38  E-value: 8.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1238 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1317
Cdd:COG0515    251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462564671 1318 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1397
Cdd:COG0515    326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462564671 1398 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1472
Cdd:COG0515    406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
117-170 8.61e-03

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 40.79  E-value: 8.61e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462564671  117 LAFSPDGKYIV---TGENGHRpAVRIWDVEEKnQVAEMLGHKYGVACVAFSPNMKHI 170
Cdd:COG4946    437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETG-KTVQLTDGRYDDGSPAFSPDGKYL 491
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
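
Each hit in the list carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the E-value threshold above decides which hits are reported. The following is a minimal sketch of reading one such line and checking it against the threshold. The names `parse_hit_summary` and `passes_threshold` are hypothetical, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Matches the per-hit summary line as it appears in this listing.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> dict:
    """Extract the numeric fields from one 'Pssm-ID: ...' summary line."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit: dict, threshold: float = 0.01) -> bool:
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


line = "Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.55e-05"
hit = parse_hit_summary(line)
print(hit, passes_threshold(hit))
```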
