NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|109948269|ref|NP_035873|]
View 

zinc finger protein 106 isoform 1 [Mus musculus]

Protein Classification

WD40 repeat domain-containing protein( domain architecture ID 10077586)

WD40 repeat domain-containing protein similar to a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly

CATH:  2.130.10.10
PubMed:  10322433|8090199
SCOP:  4002744

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1530-1807 7.51e-44

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


:

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 161.73  E-value: 7.51e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1530 GSFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLvthTSGKSSVLYTGSSDHTIRCYN 1607
Cdd:cd00200     3 RTLKGHTGGVTCVAFSpdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVA---ASADGTYLASGSSDKTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1608 IKTRECMEQLQL-EDRVLCL--HNRWRTLYAGLANGTVVTFDIKNNKRQEIFECHgPRAVSCLATAQEGarKLLVVGSYD 1684
Cdd:cd00200    80 LETGECVRTLTGhTSYVSSVafSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSPDG--TFVASSSQD 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1685 CTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVT--VVNILGKVMVT 1760
Cdd:cd00200   157 GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDgeKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNsvAFSPDGYLLAS 236
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 109948269 1761 ACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHKS--VIYTGCYDGSI 1807
Cdd:cd00200   237 GSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTI 285
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
530-681 1.78e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   530 PRVLKENKTVSGTQKEPDEK---LNSTSQKAQDTVLQCPKTLQNPLPTTPKRTENDA---KESSVEESAKDSLSIESQPH 603
Cdd:pfam05109  572 PTLGKTSPTSAVTTPTPNATsptVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAvttGQHNITSSSTSSMSLRPSSI 651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   604 SAGNSAMTSDAEN---------HGIKSEGVASLTTevVSCSTHTVDKEQGSQIPGTPENLSAsPCNSTVLQKEAEVQVSA 674
Cdd:pfam05109  652 SETLSPSTSDNSTshmplltsaHPTGGENITQVTP--ASTSTHHVSTSSPAPRPGTTSQASG-PGNSSTSTKPGEVNVTK 728

                   ....*..
gi 109948269   675 ATSPHSG 681
Cdd:pfam05109  729 GTPPKNA 735
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1530-1807 7.51e-44

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 161.73  E-value: 7.51e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1530 GSFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLvthTSGKSSVLYTGSSDHTIRCYN 1607
Cdd:cd00200     3 RTLKGHTGGVTCVAFSpdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVA---ASADGTYLASGSSDKTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1608 IKTRECMEQLQL-EDRVLCL--HNRWRTLYAGLANGTVVTFDIKNNKRQEIFECHgPRAVSCLATAQEGarKLLVVGSYD 1684
Cdd:cd00200    80 LETGECVRTLTGhTSYVSSVafSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSPDG--TFVASSSQD 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1685 CTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVT--VVNILGKVMVT 1760
Cdd:cd00200   157 GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDgeKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNsvAFSPDGYLLAS 236
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 109948269 1761 ACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHKS--VIYTGCYDGSI 1807
Cdd:cd00200   237 GSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTI 285
WD40 COG2319
WD40 repeat [General function prediction only];
1532-1808 7.92e-42

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 159.69  E-value: 7.92e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1532 FEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYNIK 1609
Cdd:COG2319   116 LTGHTGAVRSVAFSpdGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF---SPDGKLLASGSDDGTVRLWDLA 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1610 TRECMEQLQL-EDRVLCLhnRW----RTLYAGLANGTVVTFDIKNNKRQEIFECHGPRaVSCLATAQEGarKLLVVGSYD 1684
Cdd:COG2319   193 TGKLLRTLTGhTGAVRSV--AFspdgKLLASGSADGTVRLWDLATGKLLRTLTGHSGS-VRSVAFSPDG--RLLASGSAD 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1685 CTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVTVVNI--LGKVMVT 1760
Cdd:COG2319   268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDgkLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFspDGKTLAS 347
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 109948269 1761 ACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHK--SVIYTGCYDGSIQ 1808
Cdd:COG2319   348 GSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPdgRTLASGSADGTVR 397
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1565-1607 1.43e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.43e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 109948269   1565 SRKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYN 1607
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAF---SPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1566-1607 1.32e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.79  E-value: 1.32e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 109948269  1566 RKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYN 1607
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAF---SPDGKLLASGSDDGTVKVWD 39
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
530-681 1.78e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   530 PRVLKENKTVSGTQKEPDEK---LNSTSQKAQDTVLQCPKTLQNPLPTTPKRTENDA---KESSVEESAKDSLSIESQPH 603
Cdd:pfam05109  572 PTLGKTSPTSAVTTPTPNATsptVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAvttGQHNITSSSTSSMSLRPSSI 651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   604 SAGNSAMTSDAEN---------HGIKSEGVASLTTevVSCSTHTVDKEQGSQIPGTPENLSAsPCNSTVLQKEAEVQVSA 674
Cdd:pfam05109  652 SETLSPSTSDNSTshmplltsaHPTGGENITQVTP--ASTSTHHVSTSSPAPRPGTTSQASG-PGNSSTSTKPGEVNVTK 728

                   ....*..
gi 109948269   675 ATSPHSG 681
Cdd:pfam05109  729 GTPPKNA 735
 
Name Accession Description Interval E-value
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1530-1807 7.51e-44

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 161.73  E-value: 7.51e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1530 GSFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLvthTSGKSSVLYTGSSDHTIRCYN 1607
Cdd:cd00200     3 RTLKGHTGGVTCVAFSpdGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVA---ASADGTYLASGSSDKTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1608 IKTRECMEQLQL-EDRVLCL--HNRWRTLYAGLANGTVVTFDIKNNKRQEIFECHgPRAVSCLATAQEGarKLLVVGSYD 1684
Cdd:cd00200    80 LETGECVRTLTGhTSYVSSVafSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGH-TDWVNSVAFSPDG--TFVASSSQD 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1685 CTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVT--VVNILGKVMVT 1760
Cdd:cd00200   157 GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDgeKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNsvAFSPDGYLLAS 236
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 109948269 1761 ACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHKS--VIYTGCYDGSI 1807
Cdd:cd00200   237 GSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTI 285
WD40 COG2319
WD40 repeat [General function prediction only];
1532-1808 7.92e-42

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 159.69  E-value: 7.92e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1532 FEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYNIK 1609
Cdd:COG2319   116 LTGHTGAVRSVAFSpdGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAF---SPDGKLLASGSDDGTVRLWDLA 192
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1610 TRECMEQLQL-EDRVLCLhnRW----RTLYAGLANGTVVTFDIKNNKRQEIFECHGPRaVSCLATAQEGarKLLVVGSYD 1684
Cdd:COG2319   193 TGKLLRTLTGhTGAVRSV--AFspdgKLLASGSADGTVRLWDLATGKLLRTLTGHSGS-VRSVAFSPDG--RLLASGSAD 267
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1685 CTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVTVVNI--LGKVMVT 1760
Cdd:COG2319   268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDgkLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFspDGKTLAS 347
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 109948269 1761 ACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHK--SVIYTGCYDGSIQ 1808
Cdd:COG2319   348 GSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPdgRTLASGSADGTVR 397
WD40 COG2319
WD40 repeat [General function prediction only];
1531-1808 1.77e-38

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 149.68  E-value: 1.77e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1531 SFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLVTHtSGKssVLYTGSSDHTIRCYNI 1608
Cdd:COG2319    73 TLLGHTAAVLSVAFSpdGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSP-DGK--TLASGSADGTVRLWDL 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1609 KTRECMEQLQL-EDRVLCLhnRW----RTLYAGLANGTVVTFDIKNNKRQEIFECHgPRAVSCLATAQEGarKLLVVGSY 1683
Cdd:COG2319   150 ATGKLLRTLTGhSGAVTSV--AFspdgKLLASGSDDGTVRLWDLATGKLLRTLTGH-TGAVRSVAFSPDG--KLLASGSA 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1684 DCTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVTVVNIL--GKVMV 1759
Cdd:COG2319   225 DGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDgrLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSpdGKLLA 304
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 109948269 1760 TACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIH--KSVIYTGCYDGSIQ 1808
Cdd:COG2319   305 SGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSpdGKTLASGSDDGTVR 355
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1531-1771 8.67e-38

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 144.40  E-value: 8.67e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1531 SFEGHQAAVNAIQI--FGNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLVTHtsgKSSVLYTGSSDHTIRCYNI 1608
Cdd:cd00200    46 TLKGHTGPVRDVAAsaDGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP---DGRILSSSSRDKTIKVWDV 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1609 KTRECMEQLQL-EDRVLCLhnRW----RTLYAGLANGTVVTFDIKNNKRQEIFECHgPRAVSCLATAQEGARklLVVGSY 1683
Cdd:cd00200   123 ETGKCLTTLRGhTDWVNSV--AFspdgTFVASSSQDGTIKLWDLRTGKCVATLTGH-TGEVNSVAFSPDGEK--LLSSSS 197
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1684 DCTISVRDARNGLLLRTLEGHSKTVLCMKV--VNDLVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVTVVNI--LGKVMV 1759
Cdd:cd00200   198 DGTIKLWDLSTGKCLGTLRGHENGVNSVAFspDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWspDGKRLA 277
                         250
                  ....*....|..
gi 109948269 1760 TACLDKFVRVYE 1771
Cdd:cd00200   278 SGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1531-1774 4.57e-37

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.82  E-value: 4.57e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1531 SFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYNLVSRKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYNI 1608
Cdd:COG2319   157 TLTGHSGAVTSVAFSpdGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAF---SPDGKLLASGSADGTVRLWDL 233
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1609 KTRECMEQLQL-EDRVLCLhnRW----RTLYAGLANGTVVTFDIKNNKRQEIFEcHGPRAVSCLATAQEGarKLLVVGSY 1683
Cdd:COG2319   234 ATGKLLRTLTGhSGSVRSV--AFspdgRLLASGSADGTVRLWDLATGELLRTLT-GHSGGVNSVAFSPDG--KLLASGSD 308
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1684 DCTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGELVRIYKGHNHAVTVVNIL--GKVMV 1759
Cdd:COG2319   309 DGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDgkTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSpdGRTLA 388
                         250
                  ....*....|....*
gi 109948269 1760 TACLDKFVRVYELQS 1774
Cdd:COG2319   389 SGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1568-1807 1.42e-33

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 132.07  E-value: 1.42e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1568 CVGVFEGHTSKVNCLlvtHTSGKSSVLYTGSSDHTIRCYNIKTRECMEQLQLE----DRVLCLHNRwRTLYAGLANGTVV 1643
Cdd:cd00200     1 LRRTLKGHTGGVTCV---AFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHtgpvRDVAASADG-TYLASGSSDKTIR 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1644 TFDIKNNKRQEIFECHgPRAVSCLATAQEGarKLLVVGSYDCTISVRDARNGLLLRTLEGHSKTVLCMKV--VNDLVFSG 1721
Cdd:cd00200    77 LWDLETGECVRTLTGH-TSYVSSVAFSPDG--RILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFspDGTFVASS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1722 SSDQSVHAHNIHTGELVRIYKGHNHAVTVVNIL--GKVMVTACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHKS--V 1797
Cdd:cd00200   154 SQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSpdGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDgyL 233
                         250
                  ....*....|
gi 109948269 1798 IYTGCYDGSI 1807
Cdd:cd00200   234 LASGSEDGTI 243
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1502-1727 4.35e-27

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 113.20  E-value: 4.35e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1502 TQTVISIKASKHSSEISSEPGD--------DEEPTEGSFEGHQAAVNAIQI--FGNFLYTCSADTTVRVYNLVSRKCVGV 1571
Cdd:cd00200    93 TSYVSSVAFSPDGRILSSSSRDktikvwdvETGKCLTTLRGHTDWVNSVAFspDGTFVASSSQDGTIKLWDLRTGKCVAT 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1572 FEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYNIKTRECMEQLQledrvlcLHNRWrtlyaglangtvvtfdiknnk 1651
Cdd:cd00200   173 LTGHTGEVNSVAF---SPDGEKLLSSSSDGTIKLWDLSTGKCLGTLR-------GHENG--------------------- 221
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109948269 1652 rqeifechgpraVSCLATAQEgaRKLLVVGSYDCTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSV 1727
Cdd:cd00200   222 ------------VNSVAFSPD--GYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDgkRLASGSADGTI 285
WD40 COG2319
WD40 repeat [General function prediction only];
1659-1808 1.23e-09

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 62.62  E-value: 1.23e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269 1659 HGPRAVSCLATAQEGARklLVVGSYDCTISVRDARNGLLLRTLEGHSKTVLCMKVVND--LVFSGSSDQSVHAHNIHTGE 1736
Cdd:COG2319    34 GLAAAVASLAASPDGAR--LAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDgrLLASASADGTVRLWDLATGL 111
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109948269 1737 LVRIYKGHNHAVTVVNIL--GKVMVTACLDKFVRVYELQSHDRLQVYGGHKDMIMCMTIHK--SVIYTGCYDGSIQ 1808
Cdd:COG2319   112 LLRTLTGHTGAVRSVAFSpdGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPdgKLLASGSDDGTVR 187
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1565-1607 1.43e-05

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 43.46  E-value: 1.43e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 109948269   1565 SRKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYN 1607
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAF---SPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1566-1607 1.32e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 40.79  E-value: 1.32e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 109948269  1566 RKCVGVFEGHTSKVNCLLVthtSGKSSVLYTGSSDHTIRCYN 1607
Cdd:pfam00400    1 GKLLKTLEGHTGSVTSLAF---SPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1531-1562 8.69e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.45  E-value: 8.69e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 109948269   1531 SFEGHQAAVNAIQIF--GNFLYTCSADTTVRVYN 1562
Cdd:smart00320    7 TLKGHTGPVTSVAFSpdGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1530-1562 1.64e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 37.71  E-value: 1.64e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 109948269  1530 GSFEGHQAAVNAIQI--FGNFLYTCSADTTVRVYN 1562
Cdd:pfam00400    5 KTLEGHTGSVTSLAFspDGKLLASGSDDGTVKVWD 39
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
530-681 1.78e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 43.37  E-value: 1.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   530 PRVLKENKTVSGTQKEPDEK---LNSTSQKAQDTVLQCPKTLQNPLPTTPKRTENDA---KESSVEESAKDSLSIESQPH 603
Cdd:pfam05109  572 PTLGKTSPTSAVTTPTPNATsptVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAvttGQHNITSSSTSSMSLRPSSI 651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109948269   604 SAGNSAMTSDAEN---------HGIKSEGVASLTTevVSCSTHTVDKEQGSQIPGTPENLSAsPCNSTVLQKEAEVQVSA 674
Cdd:pfam05109  652 SETLSPSTSDNSTshmplltsaHPTGGENITQVTP--ASTSTHHVSTSSPAPRPGTTSQASG-PGNSSTSTKPGEVNVTK 728

                   ....*..
gi 109948269   675 ATSPHSG 681
Cdd:pfam05109  729 GTPPKNA 735
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH