Conserved domains on  [gi|530416321|ref|XP_005258866|]

WD repeat-containing protein 62 isoform X3 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein (domain architecture ID 11455410)

WD40 repeat domain-containing protein similar to proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319 WD40 repeat [General function prediction only] 364-744 2.45e-36

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.13  E-value: 2.45e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  364 ALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 442
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  443 DSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319   150 AT-------------------------------------GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  523 FMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDIqMISCGAD 602
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSPDGRL-LASGSAD 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  603 KSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVD 682
Cdd:COG2319   268 GTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFS 339
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530416321  683 PSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319   340 PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 COG2319 WD40 repeat [General function prediction only] 42-567 2.91e-25

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 110.39  E-value: 2.91e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319     9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319    89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319   165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGlEPSFLFHRKAEAVY 359
Cdd:COG2319   217 --------------------------KLLASGSADGTVRLW-------------------DLATG-KLLRTLTGHSGSVR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  360 pdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTI 437
Cdd:COG2319   251 ----SVAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTV 312
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  438 RFWNLDSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLR 517
Cdd:COG2319   313 RLWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVR 355
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|
gi 530416321  518 IHELHFMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 567
Cdd:COG2319   356 LWDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
PHA03247 super family cl33720 large tegument protein UL36; Provisional 979-1380 3.23e-09

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 3.23e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  979 SSDSPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETlTESPCRELFPAALGDVEAS--------- 1049
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-PQRPRRRAARPTVGSLTSLadppppppt 2707
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1050 -EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetl 1128
Cdd:PHA03247 2708 pEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--- 2781
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1129 eawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGP 1203
Cdd:PHA03247 2782 ---RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1204 ivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVT 1283
Cdd:PHA03247 2859 -----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1284 VPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPA 1358
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
                         410       420
                  ....*....|....*....|....*
gi 530416321 1359 RwsePWVPVEALPPSPL---ELSRV 1380
Cdd:PHA03247 2983 A---PSREAPASSTPPLtghSLSRV 3004
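
Each hit above carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) followed by alignment rows that give the matched positions in the domain model. The following is a minimal sketch, not an official NCBI tool, of how those lines could be read programmatically and how the fraction of the model spanned by the alignment could be estimated. The function names (`parse_summary`, `model_span`, `model_coverage`) and the regular expressions are assumptions made for illustration; they only target the plain-text layout shown on this page.

```python
import re

# Assumed layout of the per-hit summary line, e.g.
# "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.13  E-value: 2.45e-36"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

# Assumed layout of a model row in the alignment, e.g.
# "Cdd:COG2319    83 SVAFSPDG...  149"
MODEL_ROW_RE = re.compile(
    r"^Cdd:(?P<acc>\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def parse_summary(line):
    """Return the fields of a summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def model_span(lines):
    """Return (lowest start, highest end) over all 'Cdd:' rows, or None if there are none."""
    starts, ends = [], []
    for line in lines:
        m = MODEL_ROW_RE.match(line.strip())
        if m:
            starts.append(int(m.group("start")))
            ends.append(int(m.group("end")))
    if not starts:
        return None
    return min(starts), max(ends)


def model_coverage(span, cd_length):
    """Fraction of the model length spanned by the aligned range (gaps are not subtracted)."""
    start, end = span
    return (end - start + 1) / cd_length
```

For the first COG2319 hit, the model rows run from position 83 to 401 of a 403-residue model, so `model_coverage((83, 401), 403)` gives 319/403, roughly 0.79. This counts the span between the first and last aligned model positions and does not account for gaps inside the alignment.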
 
Name Accession Description Interval E-value
WD40 cd00200 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... 492-744 1.87e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.97  E-value: 1.87e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  492 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 571
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  572 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 651
Cdd:cd00200    85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  652 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 731
Cdd:cd00200   159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|...
gi 530416321  732 TVSGDSCVFIWHL 744
Cdd:cd00200   236 SGSEDGTIRVWDL 248
WD40 cd00200 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... 111-441 8.33e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 97.41  E-value: 8.33e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNnifcgvacgrgrmaG 267
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HT--------------D 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  268 STFCVSYSGllcqfnekrvlekwinlkvslssclcvSQELIFCGCTDGIVRIFQAHSLHYLANLpKPHYLGV-------- 339
Cdd:cd00200   137 WVNSVAFSP---------------------------DGTFVASSSQDGTIKLWDLRTGKCVATL-TGHTGEVnsvafspd 188
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  340 ----------------DVAQGlEPSFLFHRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfH 403
Cdd:cd00200   189 gekllssssdgtiklwDLSTG-KCLGTLRGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---H 260
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 530416321  404 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWN 441
Cdd:cd00200   261 TNSVTSL----------AWSPDGKRLaSGSADGTIRIWD 289
WD40 smart00320 WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... 704-743 1.41e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.41e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 530416321    704 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 743
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Atrophin-1 pfam03154 Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... 1133-1372 4.11e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 4.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1133 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1210
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1211 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1277
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1278 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1346
Cdd:pfam03154  343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
                          250       260
                   ....*....|....*....|....*.
gi 530416321  1347 SGALGLLQGSPArwSEPWVPVEALPP 1372
Cdd:pfam03154  409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
ANAPC4_WD40 pfam12894 Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... 634-726 7.13e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 7.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   634 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 712
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 530416321   713 GHSEIITSMKFTYD 726
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 smart00320 WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... 145-185 3.11e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.91  E-value: 3.11e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 530416321    145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
SPS1 COG0515 Serine/threonine protein kinase [Signal transduction mechanisms] 1211-1445 8.04e-03


Pssm-ID: 440281 [Multi-domain]  Cd Length: 482  Bit Score: 40.38  E-value: 8.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1290
Cdd:COG0515   251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1291 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1370
Cdd:COG0515   326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530416321 1371 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1445
Cdd:COG0515   406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
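
Several of the hit intervals listed above overlap on the query sequence (for example 364-744 and 492-744). As a rough illustration of how the distinct regions of the query covered by any hit could be summarised, the sketch below merges the intervals reported in this list. The helper names `merge_intervals` and `covered_residues` are hypothetical, and treating directly adjacent intervals as one region is an assumption of this sketch, not something the results page itself does.

```python
def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous region: extend it if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(intervals):
    """Number of query residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# Query intervals copied from the hit list on this page.
HIT_INTERVALS = [
    (364, 744), (492, 744), (42, 567), (111, 441), (979, 1380),
    (704, 743), (1133, 1372), (634, 726), (145, 185), (1211, 1445),
]

if __name__ == "__main__":
    print(merge_intervals(HIT_INTERVALS))   # [(42, 744), (979, 1445)]
    print(covered_residues(HIT_INTERVALS))  # 1170
```

With these intervals the hits collapse into two regions of the query, residues 42-744 (the WD40-related hits) and 979-1445 (the lower-scoring C-terminal hits).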
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
364-744 2.45e-36

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 143.13  E-value: 2.45e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  364 ALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNL 442
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTG---HTGAVRSV----------AFSPDGKTLaSGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  443 DSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELH 522
Cdd:COG2319   150 AT-------------------------------------GKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA 192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  523 FMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDIqMISCGAD 602
Cdd:COG2319   193 TGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGSVRSVAFSPDGRL-LASGSAD 267
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  603 KSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVD 682
Cdd:COG2319   268 GTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRS---VAFS 339
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530416321  683 PSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319   340 PDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
492-744 1.87e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.97  E-value: 1.87e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  492 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 571
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  572 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 651
Cdd:cd00200    85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  652 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 731
Cdd:cd00200   159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|...
gi 530416321  732 TVSGDSCVFIWHL 744
Cdd:cd00200   236 SGSEDGTIRVWDL 248
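
As a rough aid for reading alignment blocks like the one above, the following Python sketch counts identical residues between a query row and a Cdd row. It assumes that "-" marks a gap and that lowercase letters mark unaligned residues, which matches how these blocks appear on this page; the function name is hypothetical, and the count is a simple illustration rather than the score CD-Search reports.

```python
def aligned_identity(query_row, subject_row):
    """Count identical residues over aligned columns of two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped. Returns (identical, aligned).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # not an aligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Final rows of the alignment above (query 732-744 vs. cd00200 236-248).
    print(aligned_identity("TVSGDSCVFIWHL", "SGSEDGTIRVWDL"))
```

Only the residue strings are passed in; the row labels and coordinates ("gi 530416321  732", "Cdd:cd00200   236") would need to be stripped first.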
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
403-742 2.42e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.58  E-value: 2.42e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  403 HSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdr 481
Cdd:cd00200     8 HTGGVTCV----------AFSPDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH---- 50
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  482 gsengtpmdvKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLI 561
Cdd:cd00200    51 ----------TGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTI 117
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  562 HVLNVEkNYNLEQTLDDHSSSITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQK 641
Cdd:cd00200   118 KVWDVE-TGKCLTTLRGHTDWVNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGE 190
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  642 YVAVACQDRNVRVYNTVNGKQKKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSM 721
Cdd:cd00200   191 KLLSSSSDGTIKLWDLSTGKCLGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSL 267
                         330       340
                  ....*....|....*....|.
gi 530416321  722 KFTYDCHHLITVSGDSCVFIW 742
Cdd:cd00200   268 AWSPDGKRLASGSADGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
42-567 2.91e-25

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 110.39  E-value: 2.91e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   42 LRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPGTGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSP 121
Cdd:COG2319     9 LAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  122 DGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYQHdmVLNVWDWKKDIVVAS-NKVSCR 200
Cdd:COG2319    89 DGRLLASA--SADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADG--TVRLWDLATGKLLRTlTGHSGA 164
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  201 VIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvpLVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllc 279
Cdd:COG2319   165 VTSVAFSPDGKLLASGSdDGTVRLW--DLATGKLLRT---LTG--------HTGAVRSVA------------FSPDG--- 216
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  280 qfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFqahslhylanlpkphylgvDVAQGlEPSFLFHRKAEAVY 359
Cdd:COG2319   217 --------------------------KLLASGSADGTVRLW-------------------DLATG-KLLRTLTGHSGSVR 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  360 pdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSEL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTI 437
Cdd:COG2319   251 ----SVAFSPDGRLLASGSADGTVRLWDLAT----GELLRTLtGHSGGVNSV----------AFSPDGKLLaSGSDDGTV 312
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  438 RFWNLDSSpdshwqknifsntllkvvyvendiQHLQDMSHFPDRgsengtpmdvkagVRVMQVSPDGQHLASGDRSGNLR 517
Cdd:COG2319   313 RLWDLATG------------------------KLLRTLTGHTGA-------------VRSVAFSPDGKTLASGSDDGTVR 355
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|
gi 530416321  518 IHELHFMDELVKVEAHDAEVLCLEYSKPEtglTLLASASRDRLIHVLNVE 567
Cdd:COG2319   356 LWDLATGELLRTLTGHTGAVTSVAFSPDG---RTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
480-745 8.57e-24

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.76  E-value: 8.57e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  480 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 559
Cdd:COG2319    66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  560 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 639
Cdd:COG2319   143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  640 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 719
Cdd:COG2319   216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
                         250       260
                  ....*....|....*....|....*.
gi 530416321  720 SMKFTYDCHHLITVSGDSCVFIWHLG 745
Cdd:COG2319   293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
WD40 COG2319
WD40 repeat [General function prediction only];
93-444 1.91e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.91e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   93 VVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVS 172
Cdd:COG2319   144 VRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLAS 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  173 MGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTetkvtstvplvGRSGILGEL 250
Cdd:COG2319   222 GSA--DGTVRLWDLATGKLLRTLTGhSGSVRSVAFSPDGRLLASGSaDGTVRLW--DLAT-----------GELLRTLTG 286
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  251 HNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRIFQAHSLHYLAN 330
Cdd:COG2319   287 HSGGVNSVA------------FSPDG-----------------------------KLLASGSDDGTVRLWDLATGKLLRT 325
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  331 LpKPHYLGVDvaqglepsflfhrkaeavypdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWSELF-HSSYVWN 409
Cdd:COG2319   326 L-TGHTGAVR-----------------------SVAFSPDGKTLASGSDDGTVRLWDLAT----GELLRTLTgHTGAVTS 377
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 530416321  410 VevypefedqrACLPSGSFL-TCSSDNTIRFWNLDS 444
Cdd:COG2319   378 V----------AFSPDGRTLaSGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
2-518 7.23e-23

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 103.07  E-value: 7.23e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321    2 AAVGSGGYARNDAGEKLPSVMAGVPARRGQSSPPPAPPICLRRRTRLSTASEETVQNRVSLEKVLGITAQNSSGLTCDPG 81
Cdd:COG2319    11 AASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDG 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   82 TGHVAYLAGCVVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHRpaVRIWDVEEKNQVAEMLGHKYGVACV 161
Cdd:COG2319    91 RLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTGHSGAVTSV 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  162 AFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTstvP 239
Cdd:COG2319   169 AFSPDGKLLASGSD--DGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSaDGTVRLW--DLATGKLLR---T 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  240 LVGrsgilgelHNNIFCGVAcgrgrmagstfcVSYSGllcqfnekrvlekwinlkvslssclcvsqELIFCGCTDGIVRI 319
Cdd:COG2319   242 LTG--------HSGSVRSVA------------FSPDG-----------------------------RLLASGSADGTVRL 272
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  320 FqahslhylanlpkphylgvDVAQGLEPSFLFHRKAeAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDinrvGKVWS 399
Cdd:COG2319   273 W-------------------DLATGELLRTLTGHSG-GVN----SVAFSPDGKLLASGSDDGTVRLWDLAT----GKLLR 324
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  400 EL-FHSSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmsh 477
Cdd:COG2319   325 TLtGHTGAVRSV----------AFSPDGKTLaSGSDDGTVRLWDLAT--------------------------------- 361
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|.
gi 530416321  478 fpdrGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRI 518
Cdd:COG2319   362 ----GELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRL 398
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
111-441 8.33e-22

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 97.41  E-value: 8.33e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  111 RKSLSALAFSPDGKYIVTG-ENGHrpaVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKD 189
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGsGDGT---IKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSS--DKTIRLWDLETG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  190 IVVAS-NKVSCRVIALSFSEDSSYFVTVG-NRHVRFWflEVSTETKVTStvplvgrsgILGelHNnifcgvacgrgrmaG 267
Cdd:cd00200    84 ECVRTlTGHTSYVSSVAFSPDGRILSSSSrDKTIKVW--DVETGKCLTT---------LRG--HT--------------D 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  268 STFCVSYSGllcqfnekrvlekwinlkvslssclcvSQELIFCGCTDGIVRIFQAHSLHYLANLpKPHYLGV-------- 339
Cdd:cd00200   137 WVNSVAFSP---------------------------DGTFVASSSQDGTIKLWDLRTGKCVATL-TGHTGEVnsvafspd 188
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  340 ----------------DVAQGlEPSFLFHRKAEAVYpdtvALTFDPIHQWLSCVYKDHSIYIWDVKDINRVGKVWSelfH 403
Cdd:cd00200   189 gekllssssdgtiklwDLSTG-KCLGTLRGHENGVN----SVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSG---H 260
                         330       340       350
                  ....*....|....*....|....*....|....*....
gi 530416321  404 SSYVWNVevypefedqrACLPSGSFL-TCSSDNTIRFWN 441
Cdd:cd00200   261 TNSVTSL----------AWSPDGKRLaSGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
153-563 4.44e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 80.46  E-value: 4.44e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  153 GHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSEDSSYFVTVG-NRHVRFWflEVST 230
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSG--DGTIKVWDLETGELLRTLKGhTGPVRDVAASADGTYLASGSsDKTIRLW--DLET 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  231 ETKVTStvpLVGrsgilgelHNnifcgvacgrgrmaGSTFCVSYSgllcqfnekrvlekwinlkvslssclcVSQELIFC 310
Cdd:cd00200    83 GECVRT---LTG--------HT--------------SYVSSVAFS---------------------------PDGRILSS 110
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  311 GCTDGIVRIFqahslhylanlpkphylgvDVAQG-LEPSFLFHRKaeavypDTVALTFDPIHQWLSCVYKDHSIYIWDVK 389
Cdd:cd00200   111 SSRDKTIKVW-------------------DVETGkCLTTLRGHTD------WVNSVAFSPDGTFVASSSQDGTIKLWDLR 165
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  390 DINRVGKVwseLFHSSYVWNVEVYPEfedqraclpSGSFLTCSSDNTIRFWNLDSSpdshwqknifsntllkvvyvendi 469
Cdd:cd00200   166 TGKCVATL---TGHTGEVNSVAFSPD---------GEKLLSSSSDGTIKLWDLSTG------------------------ 209
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  470 QHLQDMshfpdRGSENgtpmdvkaGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpETGL 549
Cdd:cd00200   210 KCLGTL-----RGHEN--------GVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWS--PDGK 274
                         410
                  ....*....|....
gi 530416321  550 TlLASASRDRLIHV 563
Cdd:cd00200   275 R-LASGSADGTIRI 287
WD40 COG2319
WD40 repeat [General function prediction only];
480-744 7.24e-14

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 75.33  E-value: 7.24e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  480 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDR 559
Cdd:COG2319    24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFS---PDGRLLASASADG 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  560 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAgnrdiqmiscgadksiyfrsaqqgSDGlhfvrthhvaekttlydmdidit 639
Cdd:COG2319   101 TVRLWDLATG-LLLRTLTGHTGAVRSVAFS------------------------PDG----------------------- 132
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  640 qKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 719
Cdd:COG2319   133 -KTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVR 208
                         250       260
                  ....*....|....*....|....*
gi 530416321  720 SMKFTYDCHHLITVSGDSCVFIWHL 744
Cdd:COG2319   209 SVAFSPDGKLLASGSADGTVRLWDL 233
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
95-321 8.84e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 73.52  E-value: 8.84e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   95 ILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGenGHRPAVRIWDVEEKNQVAEMLGHKYGVACVAFSPNMKHIVSMG 174
Cdd:cd00200    77 LWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSS 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  175 YqhDMVLNVWDWKKDIVVASNKV-SCRVIALSFSED-SSYFVTVGNRHVRFWflEVSTETKVTStvpLVGrsgilgelHN 252
Cdd:cd00200   155 Q--DGTIKLWDLRTGKCVATLTGhTGEVNSVAFSPDgEKLLSSSSDGTIKLW--DLSTGKCLGT---LRG--------HE 219
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  253 NIFCGVACGRGRMagstFCVSYS--GLLCQFN-----EKRVL---EKWINlkvslssCLCVSQE--LIFCGCTDGIVRIF 320
Cdd:cd00200   220 NGVNSVAFSPDGY----LLASGSedGTIRVWDlrtgeCVQTLsghTNSVT-------SLAWSPDgkRLASGSADGTIRIW 288

                  .
gi 530416321  321 Q 321
Cdd:cd00200   289 D 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
87-224 3.00e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel beta-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 71.98  E-value: 3.00e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   87 YLAGC----VVVILDPKENKQQHIFNTARKSLSALAFSPDGKYIVTGENGHrpAVRIWDVEEKNQVAEMLGHKYGVACVA 162
Cdd:cd00200   107 ILSSSsrdkTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDG--TIKLWDLRTGKCVATLTGHTGEVNSVA 184
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530416321  163 FSPNMKHIVSMGyqHDMVLNVWDWKKDIVVASNKVSC-RVIALSFSEDSSYFVTVG-NRHVRFW 224
Cdd:cd00200   185 FSPDGEKLLSSS--SDGTIKLWDLSTGKCLGTLRGHEnGVNSVAFSPDGYLLASGSeDGTIRVW 246
PHA03247 PHA03247
large tegument protein UL36; Provisional
979-1380 3.23e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.26  E-value: 3.23e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  979 SSDSPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETlTESPCRELFPAALGDVEAS--------- 1049
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP-PQRPRRRAARPTVGSLTSLadppppppt 2707
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1050 -EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEetl 1128
Cdd:PHA03247 2708 pEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR--- 2781
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1129 eawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAPSTCSYMEATASSRARISRSISLGDSEGP 1203
Cdd:PHA03247 2782 ---RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1204 ivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhearanlRLTLSSACDGLLQPPVDTQPGVT 1283
Cdd:PHA03247 2859 -----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--------RPAVSRSTESFALPPDQPERPPQ 2910
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1284 VPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPEARPGIPGGTASLLEPTSGALGLLQGSPA 1358
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP 2982
                         410       420
                  ....*....|....*....|....*
gi 530416321 1359 RwsePWVPVEALPPSPL---ELSRV 1380
Cdd:PHA03247 2983 A---PSREAPASSTPPLtghSLSRV 3004
PHA03247 PHA03247
large tegument protein UL36; Provisional
1133-1376 7.50e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 7.50e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 PPPPCLTSLAScvPASSVLPTDRNLPTPTSAPTPGLAQGVHAPstcsymeatASSRARISRSISLGDSEGPIVATLAQPL 1212
Cdd:PHA03247 2616 PLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDP---------APGRVSRPRRARRLGRAAQASSPPQRPR 2684
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1213 RR--PSSVGELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF- 1289
Cdd:PHA03247 2685 RRaaRPTVGSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAa 2738
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1290 PAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEA 1369
Cdd:PHA03247 2739 PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAV 2811

                  ....*..
gi 530416321 1370 LPPSPLE 1376
Cdd:PHA03247 2812 LAPAAAL 2818
PHA03247 PHA03247
large tegument protein UL36; Provisional
1104-1373 5.79e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 5.79e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1104 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 1179
Cdd:PHA03247 2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1180 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 1254
Cdd:PHA03247 2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1255 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1322
Cdd:PHA03247 2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 530416321 1323 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1373
Cdd:PHA03247 2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
PHA03247 PHA03247
large tegument protein UL36; Provisional
981-1379 5.89e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 5.89e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  981 DSPKDQSPPEGCAGPTEDelslPEGPSVPSSSLPQTPEQEKFLRHHFETLTESPCRELFPAAlgdvEASEAEDHFFNPRL 1060
Cdd:PHA03247 2590 DAPPQSARPRAPVDDRGD----PRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP----PPERPRDDPAPGRV 2661
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1061 SIS--TQFLSSLQKASRFTHTFPPRATQCLV-------------KSPEvklmdrggSQPRAGTGyASPDRTHVLAAGKAE 1125
Cdd:PHA03247 2662 SRPrrARRLGRAAQASSPPQRPRRRAARPTVgsltsladpppppPTPE--------PAPHALVS-ATPLPPGPAAARQAS 2732
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1126 ETLEAWRPPPPclTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIV 1205
Cdd:PHA03247 2733 PALPAAPAPPA--VPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1206 ATLAQPLRRPSSVgelASLGQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLtlssacdgllQPPvdTQPGVTVP 1285
Cdd:PHA03247 2811 VLAPAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR----------RPP--SRSPAAKP 2875
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1286 A---------VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQG 1355
Cdd:PHA03247 2876 AaparppvrrLARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
                         410       420
                  ....*....|....*....|....
gi 530416321 1356 SPARwSEPWVPveALPPSPLELSR 1379
Cdd:PHA03247 2956 SGAV-PQPWLG--ALVPGRVAVPR 2976
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
704-743 1.41e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.41e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 530416321    704 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 743
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1133-1372 4.11e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 4.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1133 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 1210
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1211 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSW---GNHEARANLRLTLSSACDGllQPPVD 1277
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPaapGQSQQRIHTPPSQSQLQSQ--QPPRE 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  1278 tQP----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPT 1346
Cdd:pfam03154  343 -QPlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPP 408
                          250       260
                   ....*....|....*....|....*.
gi 530416321  1347 SGALGLLQGSPArwSEPWVPVEALPP 1372
Cdd:pfam03154  409 SAHPPPLQLMPQ--SQQLPPPPAQPP 432
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
634-726 7.13e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 7.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321   634 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 712
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 530416321   713 GHSEIITSMKFTYD 726
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 pfam00400
WD domain, G-beta repeat;
705-742 1.12e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.79  E-value: 1.12e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530416321   705 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 742
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
PHA03247 PHA03247
large tegument protein UL36; Provisional
963-1331 3.57e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 3.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  963 VEAGPGDQQGDSYLRVSSDSPKDQSPPEGCAGP-TEDELSLPEGPSVPSSS----LPQTPEQEKFLRHHFETLTESPCRE 1037
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAPPAVPAGPATPgGPARPARPPTTAGPPAPappaAPAAGPPRRLTRPAVASLSESRESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1038 LFPAALGDVEASEAEDHFFNPRLSISTQFL----SSLQKASRFTHT-FPPRATQCLVKSPEVKLMDRGGSQPRAGTGYAS 1112
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLppptSAQPTAPPPPPGpPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAP 2878
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1113 PD-RTHVLAAGKAEETLEAWRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAP---TPGLAQGVHAPSTCSYMEATASSR 1188
Cdd:PHA03247 2879 ARpPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPpppPPPRPQPPLAPTTDPAGAGEPSGA 2958
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1189 ARISRSISLGDSEGPIVATLAqPLRRPSsvgelaslgqelQAITTATTPSLDSEgQEPALRSWGN----HEARANLRLTL 1264
Cdd:PHA03247 2959 VPQPWLGALVPGRVAVPRFRV-PQPAPS------------REAPASSTPPLTGH-SLSRVSSWASslalHEETDPPPVSL 3024
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321 1265 SSAcdglLQPPVDTQPGvtvpavsfPAPSPVEESALRLHGSAFRPSLPAPESPglPAHPSNPQLPEA 1331
Cdd:PHA03247 3025 KQT----LWPPDDTEDS--------DADSLFDSDSERSDLEALDPLPPEPHDP--FAHEPDPATPEA 3077
PHA03379 PHA03379
EBNA-3A; Provisional
1269-1375 1.13e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.51  E-value: 1.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1269 DGLLQPPVDTQPGVT-VPAVSFPAPSPVEESALRLHGSAFRPSLPAPespgLPAHP-SNPQLPEARPGIPGGTASLLEPT 1346
Cdd:PHA03379  482 DQLPGVVQDGRPACApVPAPAGPIVRPWEASLSQVPGVAFAPVMPQP----MPVEPvPVPTVALERPVCPAPPLIAMQGP 557
                          90       100
                  ....*....|....*....|....*....
gi 530416321 1347 SGALGLLQGSPARWSEPWVPVEALPPSPL 1375
Cdd:PHA03379  558 GETSGIVRVRERWRPAPWTPNPPRSPSQM 586
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
145-185 3.11e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.91  E-value: 3.11e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 530416321    145 KNQVAEMLGHKYGVACVAFSPNMKHIVSMGYqhDMVLNVWD 185
Cdd:smart00320    2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSD--DGTIKLWD 40
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
982-1333 4.44e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 4.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321  982 SPKDQSPPEGCAGPTEDELSLPEGPSVPSSSLPQTPEQEKFLRHHFETLTESPCRELFPAALGDVEASEAEDHFFNPRLS 1061
Cdd:PRK07764  436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRER 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1062 IStQFLSSLQKASRFTHTfppratqclVKSPEVK-LMDRGG------SQPRAGTGYASPDRTHVLAAGKAEETLEAWR-- 1132
Cdd:PRK07764  516 WP-EILAAVPKRSRKTWA---------ILLPEATvLGVRGDtlvlgfSTGGLARRFASPGNAEVLVTALAEELGGDWQve 585
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1133 ----------------PPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRAriSRSIS 1196
Cdd:PRK07764  586 avvgpapgaaggegppAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVA--VPDAS 663
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1197 LGDSEGPIVATLAQPLRRPSSVGELASLGqelqaittattPSLDSEGQEPALRSWGNHEARANLRLTlssacdgllQPPV 1276
Cdd:PRK07764  664 DGGDGWPAKAGGAAPAAPPPAPAPAAPAA-----------PAGAAPAQPAPAPAATPPAGQADDPAA---------QPPQ 723
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321 1277 DTQPGVTVPAVSFPAPSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARP 1333
Cdd:PRK07764  724 AAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
SPS1 COG0515
Serine/threonine protein kinase [Signal transduction mechanisms];
1211-1445 8.04e-03

Serine/threonine protein kinase [Signal transduction mechanisms];


Pssm-ID: 440281 [Multi-domain]  Cd Length: 482  Bit Score: 40.38  E-value: 8.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1211 PLRRPSSVGELAslgQELQAITTATTPSLDSEGQEPALRSWGNHEARANLRLTLSSAcdGLLQPPVDTQPGVTVPAVSFP 1290
Cdd:COG0515   251 PEERYQSAAELA---AALRAVLRSLAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA--AAAAAAAAAAAAAAAAAAPAA 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530416321 1291 APSPVEESALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARWSEPWVPVEAL 1370
Cdd:COG0515   326 AAAAAAAAAALAAAAAAAAAAAAAALLAAAAALAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAALAAAAA 405
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530416321 1371 PPSPLELSRVGNILHRLQTTFQEALDLYRVLVSSGQVDTGQQQARTELVSTFLWIHSQLEAECLVGTSVAPAQAL 1445
Cdd:COG0515   406 AAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAAAARLLAAAAAAAAAAAAAPLLAALLAAAALAAAAAAAALA 480
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
117-170 8.44e-03

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 40.79  E-value: 8.44e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 530416321  117 LAFSPDGKYIV---TGENGHRpAVRIWDVEEKnQVAEMLGHKYGVACVAFSPNMKHI 170
Cdd:COG4946   437 LAWSPDSKWLAyskPGPNQLS-QIFLYDVETG-KTVQLTDGRYDDGSPAFSPDGKYL 491
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
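
As a rough illustration of how the E-value threshold above relates to the hit list, the following Python sketch re-checks a few of the hits reported on this page against the 0.01 cutoff and tests whether two hit intervals overlap on the query. The `DomainHit` type and both helper functions are hypothetical names introduced here, not part of any NCBI tool, and treating the threshold as inclusive is an assumption.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    """One row of the domain hit list (hypothetical container type)."""
    name: str
    accession: str
    start: int      # first query residue of the hit interval
    end: int        # last query residue of the hit interval (inclusive)
    evalue: float


# Threshold taken from the "Preset Options" line of this page.
E_VALUE_THRESHOLD = 0.01

# A few hits copied from the list above (name, accession, interval, E-value).
HITS = [
    DomainHit("WD40", "smart00320", 704, 743, 1.41e-05),
    DomainHit("Atrophin-1", "pfam03154", 1133, 1372, 4.11e-05),
    DomainHit("WD40", "pfam00400", 705, 742, 1.12e-04),
    DomainHit("SPS1", "COG0515", 1211, 1445, 8.04e-03),
    DomainHit("COG4946", "COG4946", 117, 170, 8.44e-03),
]


def passes_threshold(hit: DomainHit, threshold: float = E_VALUE_THRESHOLD) -> bool:
    """Return True if the hit's E-value is at or below the threshold.

    Whether the real cutoff is inclusive is an assumption made here.
    """
    return hit.evalue <= threshold


def overlaps(a: DomainHit, b: DomainHit) -> bool:
    """Return True if two hit intervals share at least one query residue."""
    return a.start <= b.end and b.start <= a.end


if __name__ == "__main__":
    for hit in HITS:
        status = "kept" if passes_threshold(hit) else "dropped"
        print(f"{hit.accession:<12} {hit.start}-{hit.end:<6} {hit.evalue:.2e} {status}")
```

Every hit shown on the page already satisfies the cutoff, so the filter is mainly useful when the same list is re-screened with a stricter threshold. The overlap check shows that the two single-repeat WD40 models (smart00320 and pfam00400) cover nearly the same residues.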

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.