|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
88-408 |
3.42e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.42 E-value: 3.42e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 88 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 166
Cdd:COG2319 130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 167 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 246
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 247 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 326
Cdd:COG2319 249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 327 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:COG2319 323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
..
gi 768008511 407 HL 408
Cdd:COG2319 400 DL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
156-408 |
5.98e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.12 E-value: 5.98e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 156 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 235
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 236 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 315
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 316 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 395
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 768008511 396 TVSGDSCVFIWHL 408
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
847-1077 |
5.19e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 5.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 847 PASSVLPTDRNLPTPTSAPTP-----GLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAQPLRR--PSSV 919
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSPaanepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRaaRPTV 2692
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 920 GELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF-PAPSPVEE 998
Cdd:PHA03247 2693 GSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAaPAPPAVPA 2746
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768008511 999 SALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEALPPSPLE 1077
Cdd:PHA03247 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAVLAPAAAL 2818
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
368-407 |
8.06e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.84 E-value: 8.06e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 768008511 368 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 407
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
298-390 |
5.44e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 5.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 298 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 376
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 768008511 377 GHSEIITSMKFTYD 390
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
834-1073 |
1.74e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.83 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 834 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 911
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 912 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSWG-NHEARANLRLTLSSACDGLLQPPVDtQ 980
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPAaPGQSQQRIHTPPSQSQLQSQQPPRE-Q 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 981 P----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPTSG 1049
Cdd:pfam03154 344 PlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPPSA 410
|
250 260
....*....|....*....|....
gi 768008511 1050 ALGLLQGSPArwSEPWVPVEALPP 1073
Cdd:pfam03154 411 HPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
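Each hit in these tables carries a one-line summary of the form `Pssm-ID: ... [Multi-domain] Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be read into typed fields, for example to sort or filter hits by E-value. The names `HitSummary` and `parse_summary` are hypothetical and chosen only for illustration; the pattern assumes the exact field order and labels seen in this listing and is not part of any NCBI tool.

```python
import re
from typing import NamedTuple


class HitSummary(NamedTuple):
    """Numeric fields of one hit summary line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order used in this listing:
# Pssm-ID, a bracketed tag, Cd Length, Bit Score, E-value.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*\[[^\]]*\]\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Parse one summary line; raise ValueError if it does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match[1]),
        cd_length=int(match[2]),
        bit_score=float(match[3]),
        e_value=float(match[4]),
    )


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 441893 [Multi-domain] Cd Length: 403 "
        "Bit Score: 130.42 E-value: 3.42e-32"
    )
    print(hit.pssm_id, hit.cd_length, hit.bit_score, hit.e_value)
```

Because the fields come back as numbers, a list of parsed hits can be ordered with `sorted(hits, key=lambda h: h.e_value)`, which matches the ascending E-value order in which the hits are listed here.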
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
88-408 |
3.42e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.42 E-value: 3.42e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 88 PSGSFL-TCSSDNTIRFWNLDSspdshwqknifsntllkvvyvendiqhlqdmshfpdrGSENGTPMDVKAGVRVMQVSP 166
Cdd:COG2319 130 PDGKTLaSGSADGTVRLWDLAT-------------------------------------GKLLRTLTGHSGAVTSVAFSP 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 167 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPEtGlTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSS 246
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFS-PD-G-KLLASGSADGTVRLWDLATG-KLLRTLTGHSGS 248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 247 ITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHVAektTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 326
Cdd:COG2319 249 VRSVAFSPDGRL-LASGSADGTVRLWDLATGE--LLRTLTGHSG---GVNSVAFSPDGKLLASGSDDGTVRLWDLATGKL 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 327 KKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:COG2319 323 LRTLTGHTGAVRS---VAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLW 399
|
..
gi 768008511 407 HL 408
Cdd:COG2319 400 DL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
156-408 |
5.98e-28 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 115.12 E-value: 5.98e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 156 KAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEKNyN 235
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA---DGTYLASGSSDKTIRLWDLETG-E 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 236 LEQTLDDHSSSITAIKFAGNRDIqMISCGADKSIYFRSAQQGSdgLHFVRTHHvaeKTTLYDMDIDITQKYVAVACQDRN 315
Cdd:cd00200 85 CVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDVETGK--CLTTLRGH---TDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 316 VRVYNTVNGKqkkCYKGSQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLI 395
Cdd:cd00200 159 IKLWDLRTGK---CVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLA 235
|
250
....*....|...
gi 768008511 396 TVSGDSCVFIWHL 408
Cdd:cd00200 236 SGSEDGTIRVWDL 248
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
88-406 |
1.14e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 114.35 E-value: 1.14e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 88 PSGSFL-TCSSDNTIRFWNLdsspdshwqknifsntllkvvyvENDIQHLQDMSHfpdrgsengtpmdvKAGVRVMQVSP 166
Cdd:cd00200 19 PDGKLLaTGSGDGTIKVWDL-----------------------ETGELLRTLKGH--------------TGPVRDVAASA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 167 DGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKpetGLTLLASASRDRLIHVLNVEkNYNLEQTLDDHSSS 246
Cdd:cd00200 62 DGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTIKVWDVE-TGKCLTTLRGHTDW 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 247 ITAIKFAGNRDIQmISCGADKSIYFRSAQQGSdglhFVRTHHvAEKTTLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQ 326
Cdd:cd00200 138 VNSVAFSPDGTFV-ASSSQDGTIKLWDLRTGK----CVATLT-GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 327 KKCYkgsQGDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:cd00200 212 LGTL---RGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
197-406 |
3.63e-27 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 112.81 E-value: 3.63e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 197 HDAEVLCLEYSkpeTGLTLLASASRDRLIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQ 276
Cdd:cd00200 8 HTGGVTCVAFS---PDGKLLATGSGDGTIKVWDLETG-ELLRTLKGHTGPVRDVAASADGT-YLASGSSDKTIRLWDLET 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 277 GsdglHFVRTHHVAEKTtLYDMDIDITQKYVAVACQDRNVRVYNTVNGKQKKCYKGSQGDegsLLKVHVDPSGTFLATSC 356
Cdd:cd00200 83 G----ECVRTLTGHTSY-VSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW---VNSVAFSPDGTFVASSS 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 768008511 357 SDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:cd00200 155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
144-409 |
1.47e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 104.61 E-value: 1.47e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 144 DRGSENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkPETglTLLASASRDR 223
Cdd:COG2319 66 AAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFS-PDG--KTLASGSADG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 224 LIHVLNVEKNyNLEQTLDDHSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDgLHFVRTHhvaeKTTLYDMDIDIT 303
Cdd:COG2319 143 TVRLWDLATG-KLLRTLTGHSGAVTSVAFSPDGK-LLASGSDDGTVRLWDLATGKL-LRTLTGH----TGAVRSVAFSPD 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 304 QKYVAVACQDRNVRVYNTVNGKQKKCYKGsqgDEGSLLKVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIIT 383
Cdd:COG2319 216 GKLLASGSADGTVRLWDLATGKLLRTLTG---HSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVN 292
|
250 260
....*....|....*....|....*.
gi 768008511 384 SMKFTYDCHHLITVSGDSCVFIWHLG 409
Cdd:COG2319 293 SVAFSPDGKLLASGSDDGTVRLWDLA 318
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
163-408 |
1.01e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 74.56 E-value: 1.01e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 163 QVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSkpeTGLTLLASASRDRLIHVLNVEKNyNLEQTLDD 242
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAAS---PDGARLAAGAGDLTLLLLDAAAG-ALLATLLG 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 243 HSSSITAIKFAGNRDiQMISCGADKSIYFRSAQQGSDGLHFVRTHHvaektTLYDMDIDITQKYVAVACQDRNVRVYNTV 322
Cdd:COG2319 77 HTAAVLSVAFSPDGR-LLASASADGTVRLWDLATGLLLRTLTGHTG-----AVRSVAFSPDGKTLASGSADGTVRLWDLA 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 323 NGKQKKCYKGSQGDEGSllkVHVDPSGTFLATSCSDKSISVIDFYSGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSC 402
Cdd:COG2319 151 TGKLLRTLTGHSGAVTS---VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT 227
|
....*.
gi 768008511 403 VFIWHL 408
Cdd:COG2319 228 VRLWDL 233
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
88-231 |
9.85e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 58.77 E-value: 9.85e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 88 PSGSFL-TCSSDNTIRFWNLDS-------------------SPDSHWqknIFSNTLLKVVYVENdiqhlqdmshfPDRGS 147
Cdd:COG2319 256 PDGRLLaSGSADGTVRLWDLATgellrtltghsggvnsvafSPDGKL---LASGSDDGTVRLWD-----------LATGK 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 148 ENGTPMDVKAGVRVMQVSPDGQHLASGDRSGNLRIHELHFMDELVKVEAHDAEVLCLEYSKPETgltLLASASRDRLIHV 227
Cdd:COG2319 322 LLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGR---TLASGSADGTVRL 398
|
....
gi 768008511 228 LNVE 231
Cdd:COG2319 399 WDLA 402
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
847-1077 |
5.19e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 5.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 847 PASSVLPTDRNLPTPTSAPTP-----GLAQGVHAPSTCSYMEATASSRARISRSISLGDSEGPIVATLAQPLRR--PSSV 919
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSPaanepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRaaRPTV 2692
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 920 GELASLGQelqaittattPSLDSEGQEPALRSWgnhearanlrltlssaCDGLLQPPVDTQPGVTVPAVSF-PAPSPVEE 998
Cdd:PHA03247 2693 GSLTSLAD----------PPPPPPTPEPAPHAL----------------VSATPLPPGPAAARQASPALPAaPAPPAVPA 2746
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768008511 999 SALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGAlgllqgsPARWSEPWVPVEALPPSPLE 1077
Cdd:PHA03247 2747 GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------PSPWDPADPPAAVLAPAAAL 2818
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
792-1075 |
7.76e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 7.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 792 SPEVKLMDRGGSQPRAgtgyASPDRTHVLA-AGKAEETLEAWRPP--PPCLTSLAScvpasSVLPTDRNlPTPTSAPTPG 868
Cdd:PHA03247 2646 VPPPERPRDDPAPGRV----SRPRRARRLGrAAQASSPPQRPRRRaaRPTVGSLTS-----LADPPPPP-PTPEPAPHAL 2715
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 869 LAQgvhAPSTCSYMEATASSRARISRSISLGDSEGPIV-ATLAQPLRRPSSVGELASlgqelqaittaTTPSLDSEGQEP 947
Cdd:PHA03247 2716 VSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATpGGPARPARPPTTAGPPAP-----------APPAAPAAGPPR 2781
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 948 ALRSWGNHEARANlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPA-PSPVeesalrlhgsafrPSLPAPESPGLPAHPSN 1026
Cdd:PHA03247 2782 RLTRPAVASLSES-RESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPP-------------PTSAQPTAPPPPPGPPP 2847
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 768008511 1027 PQLPEARPGIPGGTASlLEPTSGALGLLQGSPARWSEPWVPVEALPPSP 1075
Cdd:PHA03247 2848 PSLPLGGSVAPGGDVR-RRPPSRSPAAKPAAPARPPVRRLARPAVSRST 2895
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
732-1076 |
7.95e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.71 E-value: 7.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 732 TESPCRELFPAALGDVEAS----------EAEDHFFNPRLSISTQFLSSLQKASRFTHTFPPRATQclvKSPEVKLMDRG 801
Cdd:PHA03247 2680 PQRPRRRAARPTVGSLTSLadpppppptpEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP---AGPATPGGPAR 2756
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 802 GSQPRAGTGYASPDRTHVLAAGKAEetleawRPPPPCLTSLASCVPASSVLPTDRNLPTPTSAPTPGLAQ-----GVHAP 876
Cdd:PHA03247 2757 PARPPTTAGPPAPAPPAAPAAGPPR------RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPaaspaGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 877 STCSYMEATASSRARISRSISLGDSEGPivatlAQPLRRPSSVGelaslgqelQAITTATTPSldsegqEPALRSWGnhe 956
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAP-----GGDVRRRPPSR---------SPAAKPAAPA------RPPVRRLA--- 2887
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 957 aranlRLTLSSACDGLLQPPVDTQPGVTVPAVSFPAPSPVEESALRlhgsafrpSLPAPESPGLPAHPSNPQ-----LPE 1031
Cdd:PHA03247 2888 -----RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ--------PQPPPPPPPRPQPPLAPTtdpagAGE 2954
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 768008511 1032 ARPGIPGGTASLLEPTSGALGLLQGSPARwsePWVPVEALPPSPL 1076
Cdd:PHA03247 2955 PSGAVPQPWLGALVPGRVAVPRFRVPQPA---PSREAPASSTPPL 2996
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
368-407 |
8.06e-06 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 43.84 E-value: 8.06e-06
10 20 30 40
....*....|....*....|....*....|....*....|
gi 768008511 368 SGECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIWH 407
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
805-1074 |
8.58e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 8.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 805 PRAGTGYASPDRTHVLAAGKAE-ETLEAWRPPPPCltslascVPASSVLPTDRNLPTPTSAPT---PGLAQGVHAPSTcs 880
Cdd:PHA03247 2521 PDEPVGEPVHPRMLTWIRGLEElASDDAGDPPPPL-------PPAAPPAAPDRSVPPPRPAPRpsePAVTSRARRPDA-- 2591
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 881 ymeATASSRARISRSISlGDSEGPIVATLAQP----LRRPSSVGelASLGQELQAITTATTPSLDSEGQEPALRSWGNH- 955
Cdd:PHA03247 2592 ---PPQSARPRAPVDDR-GDPRGPAPPSPLPPdthaPDPPPPSP--SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPr 2665
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 956 EARANLRLTLSSAcdgllqPPVDTQPGVTVPAVSF-------PAPSPVEESALRLHGSAFrPSLPAPES-----PGLPAH 1023
Cdd:PHA03247 2666 RARRLGRAAQASS------PPQRPRRRAARPTVGSltsladpPPPPPTPEPAPHALVSAT-PLPPGPAAarqasPALPAA 2738
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 768008511 1024 PSNPQLPEArPGIPGGTASLLEPTSGAlgllqgSPARWSEPWVPVEALPPS 1074
Cdd:PHA03247 2739 PAPPAVPAG-PATPGGPARPARPPTTA------GPPAPAPPAAPAAGPPRR 2782
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
780-1080 |
9.41e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 9.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 780 TFPPRATQCLVKSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAW-RPPPPCLTSLASCVPASSVLPTDRNL 858
Cdd:PHA03247 2646 VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLaDPPPPPPTPEPAPHALVSATPLPPGP 2725
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 859 -----------------PTPTSAPTPGLAQGVHAPSTCSYMEATASSRARIS---RSISLGDSEGPIVATLAQPLRRPSS 918
Cdd:PHA03247 2726 aaarqaspalpaapappAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAgppRRLTRPAVASLSESRESLPSPWDPA 2805
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 919 VGELASLGQELQAITTAT--TPSLDSEGQEPALRSWGNHEARANLRLTLSSACDGLLQPPVDTQPGVTVPA--------- 987
Cdd:PHA03247 2806 DPPAAVLAPAAALPPAASpaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAaparppvrr 2885
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 988 VSFPAPSPVEES-ALRLHGSAFRPSLPAPESPGLPAHPSNPQLPEARPGIPGGTASLLEPTSGALGLLQGSPARwSEPWV 1066
Cdd:PHA03247 2886 LARPAVSRSTESfALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAV-PQPWL 2964
|
330
....*....|....
gi 768008511 1067 PveALPPSPLELSR 1080
Cdd:PHA03247 2965 G--ALVPGRVAVPR 2976
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
298-390 |
5.44e-05 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 43.04 E-value: 5.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 298 MDIditqkyVAVACQDRNVRVYNTvNGKqkKCYKGSQGDEGSLLK-VHVDPSGTFLATSCSDKSISVIDFYSGECIAKMF 376
Cdd:pfam12894 7 MDL------IALATEDGELLLHRL-NWQ--RVWTLSPDKEDLEVTsLAWRPDGKLLAVGYSDGTVRLLDAENGKIVHHFS 77
|
90
....*....|....
gi 768008511 377 GHSEIITSMKFTYD 390
Cdd:pfam12894 78 AGSDLITCLGWGEN 91
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
369-406 |
6.43e-05 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 41.18 E-value: 6.43e-05
10 20 30
....*....|....*....|....*....|....*...
gi 768008511 369 GECIAKMFGHSEIITSMKFTYDCHHLITVSGDSCVFIW 406
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVW 38
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
631-1055 |
1.51e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.00 E-value: 1.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 631 PGDQQGDSYLRV-SSDSPKDQSPPEDSGESEADLECSFAAIHSPAPPPDPAPRFA-----TSLPHFPGCAGPTEDELSLP 704
Cdd:PHA03247 2601 PVDDRGDPRGPApPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDpapgrVSRPRRARRLGRAAQASSPP 2680
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 705 EGP----------SVPSSSLPQTPEQEKFLRHHfetlTESPCRELFPAALGDVEASEAEDHFFNPRLSISTQFL--SSLQ 772
Cdd:PHA03247 2681 QRPrrraarptvgSLTSLADPPPPPPTPEPAPH----ALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpgGPAR 2756
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 773 KASRFTHTFPPRATqclvkSPEVKLMDRGGSQPRAGTGYASPDRTHVLAAGKAEETLEAWRPPPPCLTSLAS----CVPA 848
Cdd:PHA03247 2757 PARPPTTAGPPAPA-----PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASpagpLPPP 2831
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 849 SSVLPTDRNLPTPTSAPTPGLAQGV--------HAPSTCSYMEATASSRARISRsislgdsegpivatLAQP-LRRPSSV 919
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVapggdvrrRPPSRSPAAKPAAPARPPVRR--------------LARPaVSRSTES 2897
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 920 GELASLGQELQAITTATTPSLDSEGQEPALRSWGNHEARANlrltlssaCDGLLQPPVDT----QPGVTVPAVSFPAPSP 995
Cdd:PHA03247 2898 FALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPR--------PQPPLAPTTDPagagEPSGAVPQPWLGALVP 2969
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 768008511 996 VEESALRLHGSAFRPSLPAPESpglpahPSNPQLPEARPGIPGGTASL---LEPTSGALGLLQ 1055
Cdd:PHA03247 2970 GRVAVPRFRVPQPAPSREAPAS------STPPLTGHSLSRVSSWASSLalhEETDPPPVSLKQ 3026
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 834-1073 | 1.74e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.83 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 834 PPPPCLTSLASCVPASSVlPTDRNLPTPTSAPTPGLAQGVHAPSTCSYMEATASSRARISRSISLGD--SEGPIVATLAQ 911
Cdd:pfam03154 186 PPPPGTTQAATAGPTPSA-PSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPmtQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 912 PLRRPSSVGELASLGQELQAITTA----------TTPSLDSEGQEPALRSWG-NHEARANLRLTLSSACDGLLQPPVDtQ 980
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHmqhpvppqpfPLTPQSSQSQVPPGPSPAaPGQSQQRIHTPPSQSQLQSQQPPRE-Q 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 981 P----GVTVPAVSFPAPSPVeesalrlhgsafrPSLPAPESPGLPAHPSNP---QLPEARPGIPG----GTASLLEPTSG 1049
Cdd:pfam03154 344 PlppaPLSMPHIKPPPTTPI-------------PQLPNPQSHKHPPHLSGPspfQMNSNLPPPPAlkplSSLSTHHPPSA 410
|
250 260
....*....|....*....|....
gi 768008511 1050 ALGLLQGSPArwSEPWVPVEALPP 1073
Cdd:pfam03154 411 HPPPLQLMPQ--SQQLPPPPAQPP 432
|
|
| PHA03379 | PHA03379 | EBNA-3A; Provisional | 970-1076 | 3.11e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 41.97 E-value: 3.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768008511 970 DGLLQPPVDTQPGVT-VPAVSFPAPSPVEESALRLHGSAFRPSLPAPespgLPAHP-SNPQLPEARPGIPGGTASLLEPT 1047
Cdd:PHA03379 482 DQLPGVVQDGRPACApVPAPAGPIVRPWEASLSQVPGVAFAPVMPQP----MPVEPvPVPTVALERPVCPAPPLIAMQGP 557
|
90 100
....*....|....*....|....*....
gi 768008511 1048 SGALGLLQGSPARWSEPWVPVEALPPSPL 1076
Cdd:PHA03379 558 GETSGIVRVRERWRPAPWTPNPPRSPSQM 586
|
|
| WD40 | cd00200 | WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... | 377-423 | 6.74e-03 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 40.01 E-value: 6.74e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 768008511 377 GHSEIITSMKFTYDCHHLITVSGDSCVFIWHL-GPEITNCMKQHLLEI 423
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLeTGELLRTLKGHTGPV 54
|
|
|