|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
7.09e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization. :
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 7.09e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 568961191 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
487-772 |
2.91e-43 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 2.91e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 487 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 565
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 566 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 645
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 646 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 723
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961191 724 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 772
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| Herpes_BLLF1 super family |
cl37540 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-474 |
1.54e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 261 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 330
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 331 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 397
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 398 GLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDALAG 472
Cdd:pfam05109 632 GQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQASG 711
|
..
gi 568961191 473 PG 474
Cdd:pfam05109 712 PG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
7.09e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 7.09e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 568961191 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
487-772 |
2.91e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 2.91e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 487 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 565
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 566 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 645
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 646 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 723
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961191 724 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 772
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
486-771 |
3.53e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.79 E-value: 3.53e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 486 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 564
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 565 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 644
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 645 RSWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVST 720
Cdd:cd00200 160 KLWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 568961191 721 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 771
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
609-648 |
2.37e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.37e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961191 609 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 648
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
508-770 |
2.08e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 508 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 587
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 588 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 657
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 658 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 734
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 568961191 735 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 770
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
611-648 |
3.57e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.57e-06
10 20 30
....*....|....*....|....*....|....*...
gi 568961191 611 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 648
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-474 |
1.54e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 261 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 330
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 331 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 397
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 398 GLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDALAG 472
Cdd:pfam05109 632 GQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQASG 711
|
..
gi 568961191 473 PG 474
Cdd:pfam05109 712 PG 713
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-143 |
7.09e-90 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 277.38 E-value: 7.09e-90
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 18 FKFTVAESCDRIKDEFQFLQAQYHSLKVEYDKLANEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTILAQIMPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 568961191 98 QEHQQQVAQAVERAKQVTMTELNAIIGVrglpnlpltQQQLQAQHL 143
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ---------QQQLQAQHL 117
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
487-772 |
2.91e-43 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 161.62 E-value: 2.91e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 487 HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISQPGSKSPISqldclNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLAS 565
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLATGKLLRTLT-----GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLAT 193
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 566 PTPRikAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVR 645
Cdd:COG2319 194 GKLL--RTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 646 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKD 723
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDlATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 568961191 724 NLLNAWRTPYGASIFQSKE-SSSVLSCDISADDKYIVTGSGDKKATVYEV 772
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
486-771 |
3.53e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 149.79 E-value: 3.53e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 486 SHGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLdCLNRDNyIRSCKLLPDGRTLIVGGEASTLTIWDLa 564
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 565 sPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 644
Cdd:cd00200 81 -ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTI 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 645 RSWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHHTKPD-KYQLHLHESCVLSLKFAYCGKWFVST 720
Cdd:cd00200 160 KLWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASG 237
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 568961191 721 GKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISADDKYIVTGSGDKKATVYE 771
Cdd:cd00200 238 SEDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
454-772 |
7.27e-40 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.99 E-value: 7.27e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 454 VSADGQMQPVPFPHDALAGPGIPRHARQINTLSHGEVVCAVTISNPTRHVYTGGKGCVKIWDISQPGSKSPISQLdclnR 533
Cdd:COG2319 2 LSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLG----H 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 534 DNYIRSCKLLPDGRTLIVGGEASTLTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLV 613
Cdd:COG2319 78 TAAVLSVAFSPDGRLLASASADGTVRLWDLA--TGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 614 RQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ---HdfTSQIFSLGYCPTGEWLAVGMESSNVEVLH- 689
Cdd:COG2319 156 RTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTltgH--TGAVRSVAFSPDGKLLASGSADGTVRLWDl 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 690 HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI-FQSKESSSVLSCDISADDKYIVTGSGDKKAT 768
Cdd:COG2319 234 ATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLrTLTGHSGGVNSVAFSPDGKLLASGSDDGTVR 313
|
....
gi 568961191 769 VYEV 772
Cdd:COG2319 314 LWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-732 |
3.82e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 150.06 E-value: 3.82e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 480 RQINTLS-HGEVVCAVTISNPTRHVYTGGK-GCVKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 557
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 558 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWT 637
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 638 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAYCGK 715
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDlATGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 568961191 716 WFVSTGKDNLLNAWRTP 732
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
581-772 |
2.59e-25 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 106.65 E-value: 2.59e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 581 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREGRQLQQ-HD 659
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTlTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 660 FTSQIFSLGYCPTGEWLAVGMESSNVEVLH-HTKPDKYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 737
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDvETGKCLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 568961191 738 --FQSkESSSVLSCDISADDKYIVTGSGDKKATVYEV 772
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
612-772 |
1.03e-17 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 84.31 E-value: 1.03e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 612 LVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 688
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 689 H-HTKPDKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISADDKYIVTGSGDK 765
Cdd:cd00200 79 DlETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 568961191 766 KATVYEV 772
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
480-609 |
3.84e-13 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 71.87 E-value: 3.84e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 480 RQINTLS-HGEVVCAVTISNPTRHVYTGGKGC-VKIWDISqpgSKSPISQLDclNRDNYIRSCKLLPDGRTLIVGGEAST 557
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 568961191 558 LTIWDLAspTPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHN 609
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
609-648 |
2.37e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.37e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961191 609 NQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 648
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
508-770 |
2.08e-06 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 51.24 E-value: 2.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 508 KGCVKIWDISQPGSKSPISQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLTIWDLASPtprIKAELTSSAPACYALAIS 587
Cdd:PLN00181 457 EGLCKYLSFSKLRVKADLKQGDLLNSSNLVCAIGFDRDGEFFATAGVNKKIKIFECESI---IKDGRDIHYPVVELASRS 533
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 588 PDAKVCF---------SCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISH-DGTKLWTGGLDNTVRSWDLREGRQLQQ 657
Cdd:PLN00181 534 KLSGICWnsyiksqvaSSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGT 613
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 658 HDFTSQIFSLGY-CPTGEWLAVGMESSNVEV--LHHTKPDKYQLHLHESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYG 734
Cdd:PLN00181 614 IKTKANICCVQFpSESGRSLAFGSADHKVYYydLRNPKLPLCTMIGHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMS 692
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 568961191 735 ASIFQSKESSSVLS-------CDISADDKYIVTGSGDKKATVY 770
Cdd:PLN00181 693 ISGINETPLHSFMGhtnvknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
611-648 |
3.57e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 3.57e-06
10 20 30
....*....|....*....|....*....|....*...
gi 568961191 611 TLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTVRSWD 648
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
261-474 |
1.54e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 261 DVSNEDPA-----TPRVSPAHSPPENGLDKARGLKKDAPTSPASVASSSSTPSSKTKDLGHNDKSSTPGLKS-----NTP 330
Cdd:pfam05109 472 DVTSPTPAgttsgASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSptsavTTP 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 331 TPRNDAPTPGTSTTPGLRSMP--GKPPGMDPIASALRTPITLTSSYPAPFAMMSHHEMNGSLTSP-----------SAYA 397
Cdd:pfam05109 552 TPNATSPTPAVTTPTPNATIPtlGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPvvtsppknatsAVTT 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 398 GLHNI----PSQMSAAAAAAAAAYGRSPMVGFDPHPPMRATGLPSSLASIPGGKPA-YSFHVSADGQMQPVPFPHDALAG 472
Cdd:pfam05109 632 GQHNItsssTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPAsTSTHHVSTSSPAPRPGTTSQASG 711
|
..
gi 568961191 473 PG 474
Cdd:pfam05109 712 PG 713
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
567-606 |
2.18e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 2.18e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 568961191 567 TPRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWD 606
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
572-653 |
3.20e-03 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 40.44 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568961191 572 AELTSSAPACYALAISPDAKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISHDGTKLWTGGLDNTV 644
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 568961191 645 RSWDLREGR 653
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
|