NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1385123368|ref|NP_001349812|]
View 

zinc finger protein 469 [Mus musculus]

Protein Classification

C2H2-type zinc finger protein( domain architecture ID 10442881)

Cys2His2 (C2H2)-type zinc finger protein may be involved in transcriptional regulation

Gene Ontology:  GO:0003677|GO:0046872

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
3281-3701 1.06e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 1.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TLRSVKRPGVPRRKTRVSQDVLPSKQNRLMAPFSPPelstDRIPSTTSPTPSEVSLPALPLAPSlildqpssqenpvdqa 3360
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPP---------------- 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DHSPRGNNLPLSGQDLPPPSLSPFSAASAEGTGGCCKLNRTLEKPEHEASLGSLEPCKWQALVGEKRALHLFPGKHKSPG 3440
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCSPGHPSQLQERLV---TTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAP--PRKPGG 3515
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQAspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRrlTRPAVA 2789
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAAELVLSPEDRVKPNTSkgklrgTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPP 3595
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAA------VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR 2863
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 RahtkgstRGPGDAVHQGVQVHSSPREKReshgrqrkgqalgLGRHGSVGNTGKAPLAPDKSSRAPRKQA----TPSRVP 3671
Cdd:PHA03247  2864 R-------RPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAppppQPQPQP 2923
                          410       420       430
                   ....*....|....*....|....*....|.
gi 1385123368 3672 PVKSRPSGQ-SSRARPQPSAQRKGDPGHTSE 3701
Cdd:PHA03247  2924 PPPPQPQPPpPPPPRPQPPLAPTTDPAGAGE 2954
PHA03247 super family cl33720
large tegument protein UL36; Provisional
142-618 2.51e-05

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 2.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  142 PGIPRAKALPSPEENSS---QRCFQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRHASGTNLQAIGTNPWPPAAE 218
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSvppPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  219 NSFPGANFGVSSAEPKPFPDGSRPSspqgvsapyPFPVETVQHERAAetmlftfhQPLVAWSEEALGTNPAYPSLPCNPG 298
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDD---------PAPGRVSRPRRAR--------RLGRAAQASSPPQRPRRRAARPTVG 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  299 PSGGASAPSDLGGALSPPGAARLLPSPfhdslhksltkgIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSP 378
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATP------------LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  379 ASLDTELPTP-----GPPPTHLPQLWDTTAAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPWSPVLTTPGPNSHqm 453
Cdd:PHA03247  2762 TTAGPPAPAPpaapaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP-- 2839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  454 gvlSQLTFPRGSSEWQGDSPGTLGALNTIPRPGESALRSSPGQPSSSPRLLAYGglKDPGTQPLFFGGAQPQMSPQGALS 533
Cdd:PHA03247  2840 ---PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP 2914
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  534 LPPPRVVGASPSESPLPSPATNTASSSTCSSLSPPSSSPANPSSEDSQQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYT 613
Cdd:PHA03247  2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*
gi 1385123368  614 LPTRY 618
Cdd:PHA03247  2995 PLTGH 2999
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1933-2445 1.83e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAERSPEKA-ASPQPLFSQEN-----PAPSNrdLAACVFSTRPQATPTPS-------DLEPMPQEDPETRVKPSKPL 1999
Cdd:PHA03247  2481 RRPAEARFPFAAgAAPDPGGGGPPdpdapPAPSR--LAPAILPDEPVGEPVHPrmltwirGLEELASDDAGDPPPPLPPA 2558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 APSSYRDLPSPDDQPTcpvlvPLGASYGLTTKEAEP-----PASPTLLVTSCCGPEEPLSQHSLLGTSSPKDPPVGSLGS 2074
Cdd:PHA03247  2559 APPAAPDRSVPPPRPA-----PRPSEPAVTSRARRPdappqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNSPKGIAVRTLEDSGKEELR------LSPAHSSAPPLGdPSSPKMTIEAAPLTSIApkDGLDSGETLEv 2148
Cdd:PHA03247  2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlGRAAQASSPPQR-PRRRAARPTVGSLTSLA--DPPPPPPTPE- 2709
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPHCMgAPSLSNPERTYSKGPSLGPVSSTPCPGhgegrgiiAVPTDLATLETtgpdsqicqedgadvsikeqdnPETPG 2228
Cdd:PHA03247  2710 PAPHAL-VSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGG----------------------PARPA 2758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKVARANARGMPTGLHLTLETPLSGTSSDSRSDSPQYHISISHRPPQKNFSDPQDHKRRPRGLNKKPEHAEQT- 2307
Cdd:PHA03247  2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTa 2838
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ----PAELPETCQLCSA-----SFRSKAGLSRHKARKHRPQREPRSLLS---------PMPVPACQPSDPMTKACQTPGK 2369
Cdd:PHA03247  2839 ppppPGPPPPSLPLGGSvapggDVRRRPPSRSPAAKPAAPARPPVRRLArpavsrsteSFALPPDQPERPPQPQAPPPPQ 2918
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGRPSRPALGAGRSSGPPPLQDTMGPEILKRTSEKSEGAGTLdTPLSQHPPTLGLSEQGESAEVPASKP 2445
Cdd:PHA03247  2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPSREAPASST 2993
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191 7.02e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


:

Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
                           10        20
                   ....*....|....*....|.
gi 1385123368 3171 CHHCGKQFPKPFKLQRHLAVH 3191
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
3281-3701 1.06e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 1.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TLRSVKRPGVPRRKTRVSQDVLPSKQNRLMAPFSPPelstDRIPSTTSPTPSEVSLPALPLAPSlildqpssqenpvdqa 3360
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPP---------------- 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DHSPRGNNLPLSGQDLPPPSLSPFSAASAEGTGGCCKLNRTLEKPEHEASLGSLEPCKWQALVGEKRALHLFPGKHKSPG 3440
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCSPGHPSQLQERLV---TTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAP--PRKPGG 3515
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQAspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRrlTRPAVA 2789
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAAELVLSPEDRVKPNTSkgklrgTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPP 3595
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAA------VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR 2863
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 RahtkgstRGPGDAVHQGVQVHSSPREKReshgrqrkgqalgLGRHGSVGNTGKAPLAPDKSSRAPRKQA----TPSRVP 3671
Cdd:PHA03247  2864 R-------RPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAppppQPQPQP 2923
                          410       420       430
                   ....*....|....*....|....*....|.
gi 1385123368 3672 PVKSRPSGQ-SSRARPQPSAQRKGDPGHTSE 3701
Cdd:PHA03247  2924 PPPPQPQPPpPPPPRPQPPLAPTTDPAGAGE 2954
PHA03247 PHA03247
large tegument protein UL36; Provisional
142-618 2.51e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 2.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  142 PGIPRAKALPSPEENSS---QRCFQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRHASGTNLQAIGTNPWPPAAE 218
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSvppPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  219 NSFPGANFGVSSAEPKPFPDGSRPSspqgvsapyPFPVETVQHERAAetmlftfhQPLVAWSEEALGTNPAYPSLPCNPG 298
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDD---------PAPGRVSRPRRAR--------RLGRAAQASSPPQRPRRRAARPTVG 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  299 PSGGASAPSDLGGALSPPGAARLLPSPfhdslhksltkgIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSP 378
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATP------------LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  379 ASLDTELPTP-----GPPPTHLPQLWDTTAAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPWSPVLTTPGPNSHqm 453
Cdd:PHA03247  2762 TTAGPPAPAPpaapaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP-- 2839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  454 gvlSQLTFPRGSSEWQGDSPGTLGALNTIPRPGESALRSSPGQPSSSPRLLAYGglKDPGTQPLFFGGAQPQMSPQGALS 533
Cdd:PHA03247  2840 ---PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP 2914
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  534 LPPPRVVGASPSESPLPSPATNTASSSTCSSLSPPSSSPANPSSEDSQQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYT 613
Cdd:PHA03247  2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*
gi 1385123368  614 LPTRY 618
Cdd:PHA03247  2995 PLTGH 2999
PHA03247 PHA03247
large tegument protein UL36; Provisional
1933-2445 1.83e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAERSPEKA-ASPQPLFSQEN-----PAPSNrdLAACVFSTRPQATPTPS-------DLEPMPQEDPETRVKPSKPL 1999
Cdd:PHA03247  2481 RRPAEARFPFAAgAAPDPGGGGPPdpdapPAPSR--LAPAILPDEPVGEPVHPrmltwirGLEELASDDAGDPPPPLPPA 2558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 APSSYRDLPSPDDQPTcpvlvPLGASYGLTTKEAEP-----PASPTLLVTSCCGPEEPLSQHSLLGTSSPKDPPVGSLGS 2074
Cdd:PHA03247  2559 APPAAPDRSVPPPRPA-----PRPSEPAVTSRARRPdappqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNSPKGIAVRTLEDSGKEELR------LSPAHSSAPPLGdPSSPKMTIEAAPLTSIApkDGLDSGETLEv 2148
Cdd:PHA03247  2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlGRAAQASSPPQR-PRRRAARPTVGSLTSLA--DPPPPPPTPE- 2709
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPHCMgAPSLSNPERTYSKGPSLGPVSSTPCPGhgegrgiiAVPTDLATLETtgpdsqicqedgadvsikeqdnPETPG 2228
Cdd:PHA03247  2710 PAPHAL-VSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGG----------------------PARPA 2758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKVARANARGMPTGLHLTLETPLSGTSSDSRSDSPQYHISISHRPPQKNFSDPQDHKRRPRGLNKKPEHAEQT- 2307
Cdd:PHA03247  2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTa 2838
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ----PAELPETCQLCSA-----SFRSKAGLSRHKARKHRPQREPRSLLS---------PMPVPACQPSDPMTKACQTPGK 2369
Cdd:PHA03247  2839 ppppPGPPPPSLPLGGSvapggDVRRRPPSRSPAAKPAAPARPPVRRLArpavsrsteSFALPPDQPERPPQPQAPPPPQ 2918
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGRPSRPALGAGRSSGPPPLQDTMGPEILKRTSEKSEGAGTLdTPLSQHPPTLGLSEQGESAEVPASKP 2445
Cdd:PHA03247  2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPSREAPASST 2993
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191 7.02e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
                           10        20
                   ....*....|....*....|.
gi 1385123368 3171 CHHCGKQFPKPFKLQRHLAVH 3191
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
3281-3701 1.06e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.33  E-value: 1.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3281 TLRSVKRPGVPRRKTRVSQDVLPSKQNRLMAPFSPPelstDRIPSTTSPTPSEVSLPALPLAPSlildqpssqenpvdqa 3360
Cdd:PHA03247  2570 PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDR----GDPRGPAPPSPLPPDTHAPDPPPP---------------- 2629
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3361 DHSPRGNNLPLSGQDLPPPSLSPFSAASAEGTGGCCKLNRTLEKPEHEASLGSLEPCKWQALVGEKRALHLFPGKHKSPG 3440
Cdd:PHA03247  2630 SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3441 NGDKCAPGCSPGHPSQLQERLV---TTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAP--PRKPGG 3515
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQAspaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRrlTRPAVA 2789
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3516 MGIPAAELVLSPEDRVKPNTSkgklrgTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPP 3595
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAA------VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR 2863
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3596 RahtkgstRGPGDAVHQGVQVHSSPREKReshgrqrkgqalgLGRHGSVGNTGKAPLAPDKSSRAPRKQA----TPSRVP 3671
Cdd:PHA03247  2864 R-------RPPSRSPAAKPAAPARPPVRR-------------LARPAVSRSTESFALPPDQPERPPQPQAppppQPQPQP 2923
                          410       420       430
                   ....*....|....*....|....*....|.
gi 1385123368 3672 PVKSRPSGQ-SSRARPQPSAQRKGDPGHTSE 3701
Cdd:PHA03247  2924 PPPPQPQPPpPPPPRPQPPLAPTTDPAGAGE 2954
PHA03247 PHA03247
large tegument protein UL36; Provisional
142-618 2.51e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 2.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  142 PGIPRAKALPSPEENSS---QRCFQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRHASGTNLQAIGTNPWPPAAE 218
Cdd:PHA03247  2551 PPPPLPPAAPPAAPDRSvppPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  219 NSFPGANFGVSSAEPKPFPDGSRPSspqgvsapyPFPVETVQHERAAetmlftfhQPLVAWSEEALGTNPAYPSLPCNPG 298
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDD---------PAPGRVSRPRRAR--------RLGRAAQASSPPQRPRRRAARPTVG 2693
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  299 PSGGASAPSDLGGALSPPGAARLLPSPfhdslhksltkgIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSP 378
Cdd:PHA03247  2694 SLTSLADPPPPPPTPEPAPHALVSATP------------LPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  379 ASLDTELPTP-----GPPPTHLPQLWDTTAAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPWSPVLTTPGPNSHqm 453
Cdd:PHA03247  2762 TTAGPPAPAPpaapaAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP-- 2839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  454 gvlSQLTFPRGSSEWQGDSPGTLGALNTIPRPGESALRSSPGQPSSSPRLLAYGglKDPGTQPLFFGGAQPQMSPQGALS 533
Cdd:PHA03247  2840 ---PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP 2914
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  534 LPPPRVVGASPSESPLPSPATNTASSSTCSSLSPPSSSPANPSSEDSQQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYT 613
Cdd:PHA03247  2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                   ....*
gi 1385123368  614 LPTRY 618
Cdd:PHA03247  2995 PLTGH 2999
PHA03247 PHA03247
large tegument protein UL36; Provisional
1933-2445 1.83e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 1.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 1933 QKEPAERSPEKA-ASPQPLFSQEN-----PAPSNrdLAACVFSTRPQATPTPS-------DLEPMPQEDPETRVKPSKPL 1999
Cdd:PHA03247  2481 RRPAEARFPFAAgAAPDPGGGGPPdpdapPAPSR--LAPAILPDEPVGEPVHPrmltwirGLEELASDDAGDPPPPLPPA 2558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2000 APSSYRDLPSPDDQPTcpvlvPLGASYGLTTKEAEP-----PASPTLLVTSCCGPEEPLSQHSLLGTSSPKDPPVGSLGS 2074
Cdd:PHA03247  2559 APPAAPDRSVPPPRPA-----PRPSEPAVTSRARRPdappqSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSP 2633
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2075 ISFSAPVLLERNSPKGIAVRTLEDSGKEELR------LSPAHSSAPPLGdPSSPKMTIEAAPLTSIApkDGLDSGETLEv 2148
Cdd:PHA03247  2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrarrlGRAAQASSPPQR-PRRRAARPTVGSLTSLA--DPPPPPPTPE- 2709
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2149 PAPHCMgAPSLSNPERTYSKGPSLGPVSSTPCPGhgegrgiiAVPTDLATLETtgpdsqicqedgadvsikeqdnPETPG 2228
Cdd:PHA03247  2710 PAPHAL-VSATPLPPGPAAARQASPALPAAPAPP--------AVPAGPATPGG----------------------PARPA 2758
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2229 TRHCNVTKVARANARGMPTGLHLTLETPLSGTSSDSRSDSPQYHISISHRPPQKNFSDPQDHKRRPRGLNKKPEHAEQT- 2307
Cdd:PHA03247  2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTa 2838
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 2308 ----PAELPETCQLCSA-----SFRSKAGLSRHKARKHRPQREPRSLLS---------PMPVPACQPSDPMTKACQTPGK 2369
Cdd:PHA03247  2839 ppppPGPPPPSLPLGGSvapggDVRRRPPSRSPAAKPAAPARPPVRRLArpavsrsteSFALPPDQPERPPQPQAPPPPQ 2918
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1385123368 2370 KSHKVSEKGRPSRPALGAGRSSGPPPLQDTMGPEILKRTSEKSEGAGTLdTPLSQHPPTLGLSEQGESAEVPASKP 2445
Cdd:PHA03247  2919 PQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL-VPGRVAVPRFRVPQPAPSREAPASST 2993
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-451 3.52e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 3.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368    5 RPPTLPRDLQPCQIARSLGCPSQHPLKDHGSASRTTQGMRDDGSKAQGSPEAQLSQAKDVEQEDLILRVQAPAaRSYAHV 84
Cdd:PHA03247  2599 RAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG-RAAQAS 2677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368   85 YPWPASRMESGHPQLHSLSPsrirciLGEPLKDLRHEAPQ----VSDTKVPQGQKTRARHRPGIPRAKALPSPEENSSQR 160
Cdd:PHA03247  2678 SPPQRPRRRAARPTVGSLTS------LADPPPPPPTPEPAphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  161 CfQEASSSFTSTNCTSPSATPGSLPRRAPQSDGTSPHRhASGTNLQAIGTNPWPPAAensfpganfgVSSAEPKPFPDGS 240
Cdd:PHA03247  2752 G-GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV-ASLSESRESLPSPWDPAD----------PPAAVLAPAAALP 2819
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  241 RPSSPQGVSAPYPFPVetvqheraaetmlftfhqplvawseealgtnPAYPSLPCNPGPSggasaPSDLGGALSPPGAAR 320
Cdd:PHA03247  2820 PAASPAGPLPPPTSAQ-------------------------------PTAPPPPPGPPPP-----SLPLGGSVAPGGDVR 2863
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  321 LLPSPFHDSLHKSLTKGIPEGPLPARDGLGSPRGLPNPPPQRHFPGQGYEANGVGTSPASLDTELPTPGPPPTHLPQlwd 400
Cdd:PHA03247  2864 RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ--- 2940
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1385123368  401 ttaAPPYPTSTLDPAAAARTAFFESQQQLCLPHSPPLPW------SPVLTTPGPNSH 451
Cdd:PHA03247  2941 ---PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRfrvpqpAPSREAPASSTP 2994
PHA03247 PHA03247
large tegument protein UL36; Provisional
280-653 5.92e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 5.92e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  280 SEEALGTNPAYPSLPCNPGPSGGASAPSDLGGA----LSPPGAARLLPSPFHD-----SLH-------------KSLTKG 337
Cdd:PHA03247  2470 LGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGppdpDAPPAPSRLAPAILPDepvgePVHprmltwirgleelASDDAG 2549
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  338 IPEGPLPARDGLGSP-RGLPNPPPQRHFPGQGYEAN----GVGTSPASLDTELPTPGPPPTHLPqlwdTTAAPPYPTSTL 412
Cdd:PHA03247  2550 DPPPPLPPAAPPAAPdRSVPPPRPAPRPSEPAVTSRarrpDAPPQSARPRAPVDDRGDPRGPAP----PSPLPPDTHAPD 2625
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  413 DPAAAARTAFFESQQQLCLPhSPPLPWSPVLTTPGPNSHQMGVLSQLTFPRGSSEWQG----DSPGTLGALNTIPRPGES 488
Cdd:PHA03247  2626 PPPPSPSPAANEPDPHPPPT-VPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRprrrAARPTVGSLTSLADPPPP 2704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  489 ALRSSPGQPSSSPRL-LAYGGLKDPGTQPLFFGGAQPQMSPQGALSLPPPRVVGASPSESPLPSPATNTASSSTCSSLSP 567
Cdd:PHA03247  2705 PPTPEPAPHALVSATpLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT 2784
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  568 PSSSPANPSSEDS--QQPGPLRSPAFFLPPTHSQETSSPFPSPEPTYTLPTRYQSETAKAFPLPTEGPGAEDAfksqEGA 645
Cdd:PHA03247  2785 RPAVASLSESRESlpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA----PGG 2860

                   ....*...
gi 1385123368  646 PFSHKSPS 653
Cdd:PHA03247  2861 DVRRRPPS 2868
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3375-3733 8.33e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 8.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3375 DLPPPSLSPFSAASAEGTGGCCKLNRTleKPEHEASLGSLEPCKWQALVGEKRALHlfpgkhkSPGNGDKCAPGCSPGHP 3454
Cdd:PHA03307    69 TGPPPGPGTEAPANESRSTPTWSLSTL--APASPAREGSPTPPGPSSPDPPPPTPP-------PASPPPSPAPDLSEMLR 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3455 SQLQERLVTTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKALKPPAPPRKPGG--------MGIPAAELVLS 3526
Cdd:PHA03307   140 PVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAaasprpprRSSPISASASS 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3527 PEDRVKPNTSKGKLRGTPQSSGGLQPGTQTGG---------GSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQ---PP 3594
Cdd:PHA03307   220 PAPAPGRSAADDAGASSSDSSSSESSGCGWGPenecplprpAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERspsPS 299
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3595 PRAHTKGSTRGPGDAVHQGVQVHSSPREKRESHGRQRKGQALGLGRHGSVGNTGKAPlAPDKSSRAPRKQATPSRVPPVK 3674
Cdd:PHA03307   300 PSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPRKRPRPSRAPSSP 378
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1385123368 3675 SRPSGQSSRARPQPSAQRKGDPGHTSekGSLPQARALSRPYKrvrALHVSGVAPMEPRD 3733
Cdd:PHA03307   379 AASAGRPTRRRARAAVAGRARRRDAT--GRFPAGRPRPSPLD---AGAASGAFYARYPL 432
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
3324-3667 1.72e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 1.72e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3324 PSTTSPTPSEVSLPALPLAPSLILDQPSSQENPVDQADHSPRGNNLPlSGQDLPPPSLSPfSAASAEGTGGCCKLNRTLE 3403
Cdd:PRK07764   436 APAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAP-APPAAPAPAAAP-AAPAAPAAPAGADDAATLR 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3404 K--PEHEASLGSLEPCKWQALVGEKRALHLFPG----KHKSPGNGDKCApgcSPGHPSQLQERL--VTTHHMAPEGRIEG 3475
Cdd:PRK07764   514 ErwPEILAAVPKRSRKTWAILLPEATVLGVRGDtlvlGFSTGGLARRFA---SPGNAEVLVTALaeELGGDWQVEAVVGP 590
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3476 PSQKGNATKPGA-YSSTSHHRAAEPTKKALKPPAPPRKPGGMGIPAAELVLSPEDRVKPNTSKGKLRGTPQSSGGLQPGT 3554
Cdd:PRK07764   591 APGAAGGEGPPApASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWP 670
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3555 QTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQPPPRAHTKGSTRGPGDAVHQGVQVHSSPREKR---ESHGRQR 3631
Cdd:PRK07764   671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPlppEPDDPPD 750
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1385123368 3632 KGQALGLGRHGSVGNTGKAPLAPDKSSRAPRKQATP 3667
Cdd:PRK07764   751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
zf-C2H2 pfam00096
Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two ...
3171-3191 7.02e-03

Zinc finger, C2H2 type; The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.


Pssm-ID: 395048 [Multi-domain]  Cd Length: 23  Bit Score: 36.51  E-value: 7.02e-03
                           10        20
                   ....*....|....*....|.
gi 1385123368 3171 CHHCGKQFPKPFKLQRHLAVH 3191
Cdd:pfam00096    3 CPDCGKSFSRKSNLKRHLRTH 23
dnaA PRK14086
chromosomal replication initiator protein DnaA;
233-414 7.10e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.12  E-value: 7.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  233 PKPFPDGSRPSSPQGvSAPYPFPVETVQHERAaetmlfTFHQPLVAWSEEALGTNPAYPSLPCNPGPSGGASAPSDLGGA 312
Cdd:PRK14086    96 APPPPHARRTSEPEL-PRPGRRPYEGYGGPRA------DDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQ 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368  313 LSPPG--AARLLPSPFHDSLHKSLTK-----GIPEGPLPARDGLGS------PRGLPNPPPQRHfPGQGYEANGVGTSPA 379
Cdd:PRK14086   169 QQRLGfpPRAPYASPASYAPEQERDRepydaGRPEYDQRRRDYDHPrpdwdrPRRDRTDRPEPP-PGAGHVHRGGPGPPE 247
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1385123368  380 SLDTELPTPGPP-PTHLPQLWDTTAAPPYPTSTLDP 414
Cdd:PRK14086   248 RDDAPVVPIRPSaPGPLAAQPAPAPGPGEPTARLNP 283
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
3434-3758 9.71e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 9.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3434 GKHKSPGNGDKCAPGCSPGHPSQLQERLVTTHHMAPEGRIEGPSQKGNATKPGAYSSTSHHRAAEPTKKAlkPPAPPRKP 3513
Cdd:PHA03307    58 GAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP--PSPAPDLS 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3514 GGMGIPAAELVLSPEDRVKPNTSKGKLRGTPQSSGGLQPGTQTGGGSQPQPTSGQLQSEMASTPTEPSCPSWASSTPDQP 3593
Cdd:PHA03307   136 EMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISA 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3594 PPRAHTKGSTRGPGDAVHQGVQVHSSPREK-----------RESHGRQRKGQALGLGRHGSVGNTGKAPLAPDKSSRAPR 3662
Cdd:PHA03307   216 SASSPAPAPGRSAADDAGASSSDSSSSESSgcgwgpenecpLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERS 295
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1385123368 3663 KQATPSR--VPPVKSRPSGQSSRARPQPS---------------AQRKGDPGHTSEKGSLPQARALSRPYKRVRALHVSG 3725
Cdd:PHA03307   296 PSPSPSSpgSGPAPSSPRASSSSSSSRESsssstssssessrgaAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAP 375
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1385123368 3726 VAPMEPRDRRTAEAQSDLLSQLFGQKLTSFRIP 3758
Cdd:PHA03307   376 SSPAASAGRPTRRRARAAVAGRARRRDATGRFP 408
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH