Conserved domains on  [gi|768008511|ref|XP_011525142|]

WD repeat-containing protein 62 isoform X12 [Homo sapiens]

Protein Classification

WD40 repeat domain-containing protein (domain architecture ID 11455410)

WD40 repeat domain-containing protein similar to proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744


List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
88-408 3.42e-32


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 130.42  E-value: 3.42e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   88 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 166
Cdd:COG2319   130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  167 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 246
Cdd:COG2319   173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  247 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 326
Cdd:COG2319   249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  327 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:COG2319   323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399

                  ..
gi 768008511  407 HL 408
Cdd:COG2319   400 DL 401
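
Each hit in this report carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be read into typed fields. The field layout is inferred from the lines in this report only, and the names `SUMMARY_RE` and `parse_summary` are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Layout assumed from the summary lines shown in this report;
# this is not an official NCBI format specification.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),            # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }


# Summary line of the COG2319 hit above.
hit = parse_summary(
    "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 130.42  E-value: 3.42e-32"
)
print(hit)
```

Parsing the E-value as a float keeps hits sortable by significance without string handling.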
PHA03247 super family cl33720
large tegument protein UL36; Provisional
847-1077 5.19e-07



The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 5.19e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  847 PASSVLPTDRNLPTPTSAPTP-----GLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAQPLRR--PSSV 919
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSPaanepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRaaRPTV 2692
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  920 GELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF-PAPSPVEE 998
Cdd:PHA03247 2693 GSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAaPAPPAVPA 2746
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768008511  999 SALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEALPPSPLE 1077
Cdd:PHA03247 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAVLAPAAAL 2818
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
156-408 5.98e-28

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 115.12  E-value: 5.98e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  156 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 235
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  236 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 315
Cdd:cd00200    85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  316 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 395
Cdd:cd00200   159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
                         250
                  ....*....|...
gi 768008511  396 TVSGDSCVFIWHL 408
Cdd:cd00200   236 SGSEDGTIRVWDL 248
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
368-407 8.06e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.84  E-value: 8.06e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 768008511    368 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 407
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
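
As a rough illustration of how the query and consensus rows of a hit can be compared, the sketch below counts identical residues in the smart00320 alignment shown above. The function name `identity` is hypothetical, and the convention that gap characters and lowercase letters mark unaligned columns is an assumption about how these alignment rows are written; percent identity is not a statistic reported on this page.

```python
def identity(query, subject):
    """Count identical residues over aligned columns of two alignment rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    Returns (matches, aligned_columns, percent_identity).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    matches = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            matches += 1
    percent = 100.0 * matches / aligned if aligned else 0.0
    return matches, aligned, percent


# Rows copied from the smart00320 hit (query residues 368-407, consensus 1-40).
query_row = "SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH"
cdd_row = "SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD"

matches, aligned, percent = identity(query_row, cdd_row)
print(f"{matches}/{aligned} identical ({percent:.1f}%)")
```

The same function can be applied row by row to the longer, multi-block alignments, summing the counts across blocks before computing the percentage.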
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
298-390 5.44e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 5.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   298 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 376
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 768008511   377 GHSEIITSMKFTYD 390
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
834-1073 1.74e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 1.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   834 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 911
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   912 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSWG-NHEARANLRLTLSSACDGLLQPPVDtQ 980
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPAaPGQSQQRIHTPPSQSQLQSQQPPRE-Q 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   981 P----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPTSG 1049
Cdd:pfam03154  344 PlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPPSA 410
                          250       260
                   ....*....|....*....|....
gi 768008511  1050 ALGLLQGSPArwSEPWVPVEALPP 1073
Cdd:pfam03154  411 HPPPLQLMPQ--SQQLPPPPAQPP 432
 
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
88-406 1.14e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 114.35  E-value: 1.14e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   88 PSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdrgsengtpmdvKAGVRVMQVSP 166
Cdd:cd00200    19 PDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH--------------TGPVRDVAASA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  167 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEkNYNLEQTLDDHSSS 246
Cdd:cd00200    62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTIKVWDVE-TGKCLTTLRGHTDW 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  247 ITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 326
Cdd:cd00200   138 VNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC 211
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  327 KKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:cd00200   212 LGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
197-406 3.63e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 112.81  E-value: 3.63e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  197 HDAEVLCLEYSkpeTGLTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQ 276
Cdd:cd00200     8 HTGGVTCVAFS---PDGKLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGT-YLASGSSDKTIRLWDLET 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  277 GsdglHFVRTHHVAEKTtLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDegsLLKVHVDPSGTFLATSC 356
Cdd:cd00200    83 G----ECVRTLTGHTSY-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW---VNSVAFSPDGTFVASSS 154
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 768008511  357 SDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:cd00200   155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
WD40 COG2319
WD40 repeat [General function prediction only];
144-409 1.47e-23



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 104.61  E-value: 1.47e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  144 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 223
Cdd:COG2319    66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  224 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 303
Cdd:COG2319   143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  304 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 383
Cdd:COG2319   216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
                         250       260
                  ....*....|....*....|....*.
gi 768008511  384 SMKFTYDCHHLITVSGDSCVFIWHLG 409
Cdd:COG2319   293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
WD40 COG2319
WD40 repeat [General function prediction only];
163-408 1.01e-13



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 74.56  E-value: 1.01e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  163 QVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDRLIHVLNVEKNyNLEQTLDD 242
Cdd:COG2319     1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAAS---PDGARLAAGAGDLTLLLLDAAAG-ALLATLLG 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  243 HSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDGLHFVRTHHvaektTLYDMDIDITQKYVAVACQDRNVRVYNTV 322
Cdd:COG2319    77 HTAAVLSVAFSPDGR-LLASASADGTVRLWDLATGLLLRTLTGHTG-----AVRSVAFSPDGKTLASGSADGTVRLWDLA 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  323 NGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSC 402
Cdd:COG2319   151 TGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227

                  ....*.
gi 768008511  403 VFIWHL 408
Cdd:COG2319   228 VRLWDL 233
WD40 COG2319
WD40 repeat [General function prediction only];
88-231 9.85e-09



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 58.77  E-value: 9.85e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   88 PSGSFL-TCSSDNTIRFWNLDS-------------------SPDSHWqknIFSNTLLKVVYVENdiqhlqdmshfPDRGS 147
Cdd:COG2319   256 PDGRLLaSGSADGTVRLWDLATgellrtltghsggvnsvafSPDGKL---LASGSDDGTVRLWD-----------LATGK 321
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  148 ENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKPETgltLLASASRDRLIHV 227
Cdd:COG2319   322 LLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGR---TLASGSADGTVRL 398

                  ....
gi 768008511  228 LNVE 231
Cdd:COG2319   399 WDLA 402
PHA03247 PHA03247
large tegument protein UL36; Provisional
792-1075 7.76e-06



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.76e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  792 SPEVKLMDRGGSQPRAgtgyASPDRTHVLA-AGKAEETLEAWRPP--PPCLTSLAScvpasSVLPTDRNlPTPTSAPTPG 868
Cdd:PHA03247 2646 VPPPERPRDDPAPGRV----SRPRRARRLGrAAQASSPPQRPRRRaaRPTVGSLTS-----LADPPPPP-PTPEPAPHAL 2715
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  869 LAQgvhAPSTCSYMEATASSRARISRSISLGDSEGPIV-ATLAQPLRRPSSVGELASlgqelqaittaTTPSLDSEGQEP 947
Cdd:PHA03247 2716 VSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAP-----------APPAAPAAGPPR 2781
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  948 ALRSWGNHEARANlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPA-PSPVeesalrlhgsafrPSLPAPESPGLPAHPSN 1026
Cdd:PHA03247 2782 RLTRPAVASLSES-RESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPP-------------PTSAQPTAPPPPPGPPP 2847
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 768008511 1027 PQLPEARPGIPGGTASlLEPTSGALGLLQGSPARWSEPWVPVEALPPSP 1075
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVR-RRPPSRSPAAKPAAPARPPVRRLARPAVSRST 2895
PHA03247 PHA03247
large tegument protein UL36; Provisional
732-1076 7.95e-06



Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.95e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  732 TESPCRELFPAALGDVEAS----------EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRG 801
Cdd:PHA03247 2680 PQRPRRRAARPTVGSLTSLadpppppptpEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPAR 2756
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  802 GSQPRAGTGYASPDRTHVLAAGKAEetleawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAP 876
Cdd:PHA03247 2757 PARPPTTAGPPAPAPPAAPAAGPPR------RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPP 2830
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  877 STCSYMEATASSRARISRSISLGDSEGPivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhe 956
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAP-----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--- 2887
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  957 aranlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPE 1031
Cdd:PHA03247 2888 -----RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGE 2954
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 768008511 1032 ARPGIPGGTASLLEPTSGALGLLQGSPARwsePWVPVEALPPSPL 1076
Cdd:PHA03247 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPA---PSREAPASSTPPL 2996
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
368-407 8.06e-06

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.84  E-value: 8.06e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 768008511    368 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 407
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
PHA03247 PHA03247
large tegument protein UL36; Provisional
805-1074 8.58e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 8.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  805 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 880
Cdd:PHA03247 2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  881 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 955
Cdd:PHA03247 2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  956 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1023
Cdd:PHA03247 2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768008511 1024 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1074
Cdd:PHA03247 2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
PHA03247 PHA03247
large tegument protein UL36; Provisional
780-1080 9.41e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 9.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  780 TFPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAW-RPPPPCLTSLASCVPASSVLPTDRNL 858
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLaDPPPPPPTPEPAPHALVSATPLPPGP 2725
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  859 -----------------PTPTSAPTPGLAQGVHAPSTCSYMEATASSRARIS---RSISLGDSEGPIVATLAQPLRRPSS 918
Cdd:PHA03247 2726 aaarqaspalpaapappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgppRRLTRPAVASLSESRESLPSPWDPA 2805
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  919 VGELASLGQELQAITTAT--TPSLDSEGQEPALRSWGNHEARANLRLTLSSACDGLLQPPVDTQPGVTVPA--------- 987
Cdd:PHA03247 2806 DPPAAVLAPAAALPPAASpaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAaparppvrr 2885
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  988 VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARwSEPWV 1066
Cdd:PHA03247 2886 LARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV-PQPWL 2964
                         330
                  ....*....|....
gi 768008511 1067 PveALPPSPLELSR 1080
Cdd:PHA03247 2965 G--ALVPGRVAVPR 2976
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
298-390 5.44e-05

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 43.04  E-value: 5.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   298 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 376
Cdd:pfam12894    7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
                           90
                   ....*....|....
gi 768008511   377 GHSEIITSMKFTYD 390
Cdd:pfam12894   78 AGSDLITCLGWGEN 91
WD40 pfam00400
WD domain, G-beta repeat;
369-406 6.43e-05

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 41.18  E-value: 6.43e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 768008511   369 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
PHA03247 PHA03247
large tegument protein UL36; Provisional
631-1055 1.51e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  631 PGDQQGDSYLRV-SSDSPKDQSPPEDSGESEADLECSFAAIHSPAPPPDPAPRFA-----TSLPHFPGCAGPTEDELSLP 704
Cdd:PHA03247 2601 PVDDRGDPRGPApPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDpapgrVSRPRRARRLGRAAQASSPP 2680
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  705 EGP----------SVPSSSLPQTPEQEKFLRHHfetlTESPCRELFPAALGDVEASEAEDHFFNPRLSISTQFL--SSLQ 772
Cdd:PHA03247 2681 QRPrrraarptvgSLTSLADPPPPPPTPEPAPH----ALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpgGPAR 2756
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  773 KASRFTHTFPPRATqclvkSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAWRPPPPCLTSLAS----CVPA 848
Cdd:PHA03247 2757 PARPPTTAGPPAPA-----PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpagpLPPP 2831
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  849 SSVLPTDRNLPTPTSAPTPGLAQGV--------HAPSTCSYMEATASSRARISRsislgdsegpivatLAQP-LRRPSSV 919
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVapggdvrrRPPSRSPAAKPAAPARPPVRR--------------LARPaVSRSTES 2897
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  920 GELASLGQELQAITTATTPSLDSEGQEPALRSWGNHEARANlrltlssaCDGLLQPPVDT----QPGVTVPAVSFPAPSP 995
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR--------PQPPLAPTTDPagagEPSGAVPQPWLGALVP 2969
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 768008511  996 VEESALRLHGSAFRPSLPAPESpglpahPSNPQLPEARPGIPGGTASL---LEPTSGALGLLQ 1055
Cdd:PHA03247 2970 GRVAVPRFRVPQPAPSREAPAS------STPPLTGHSLSRVSSWASSLalhEETDPPPVSLKQ 3026
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
834-1073 1.74e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.83  E-value: 1.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   834 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 911
Cdd:pfam03154  186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   912 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSWG-NHEARANLRLTLSSACDGLLQPPVDtQ 980
Cdd:pfam03154  265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPAaPGQSQQRIHTPPSQSQLQSQQPPRE-Q 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511   981 P----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPTSG 1049
Cdd:pfam03154  344 PlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPPSA 410
                          250       260
                   ....*....|....*....|....
gi 768008511  1050 ALGLLQGSPArwSEPWVPVEALPP 1073
Cdd:pfam03154  411 HPPPLQLMPQ--SQQLPPPPAQPP 432
PHA03379 PHA03379
EBNA-3A; Provisional
970-1076 3.11e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 41.97  E-value: 3.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511  970 DGLLQPPVDTQPGVT-VPAVSFPAPSPVEESALRLHGSAFRPSLPAPespgLPAHP-SNPQLPEARPGIPGGTASLLEPT 1047
Cdd:PHA03379  482 DQLPGVVQDGRPACApVPAPAGPIVRPWEASLSQVPGVAFAPVMPQP----MPVEPvPVPTVALERPVCPAPPLIAMQGP 557
                          90       100
                  ....*....|....*....|....*....
gi 768008511 1048 SGALGLLQGSPARWSEPWVPVEALPPSPL 1076
Cdd:PHA03379  558 GETSGIVRVRERWRPAPWTPNPPRSPSQM 586
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
377-423 6.74e-03

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 40.01  E-value: 6.74e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 768008511  377 GHSEIITSMKFTYDCHHLITVSGDSCVFIWHL-GPEITNCMKQHLLEI 423
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLeTGELLRTLKGHTGPV 54
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
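
Every hit listed above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search was run with an E-value threshold of 0.01. The following is a minimal Python sketch of how such lines could be pulled out of a plain-text copy of this page and filtered by E-value. The names `STATS_RE`, `parse_stats`, and `hits_below` are hypothetical helpers, not part of any NCBI tool, and the inclusive comparison (`<=`) is an assumption rather than documented CD-Search behaviour.

```python
import re

# Matches the per-hit statistics line as it appears in this text dump.
# The field layout is inferred from the lines above (an assumption).
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]*)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def hits_below(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold (assumed inclusive)."""
    parsed = (parse_stats(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]


if __name__ == "__main__":
    sample = [
        "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.76e-06",
        "Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 40.01  E-value: 6.74e-03",
    ]
    for hit in hits_below(sample, threshold=1e-3):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

With the page's own threshold of 0.01 both sample lines pass. The stricter cutoff in the usage example keeps only the PHA03247 hit.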
