Conserved domains on  [gi|1907143096|ref|XP_036018416|]

cip1-interacting zinc finger protein isoform X19 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
16-317 2.45e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 2.45e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSL-------LGPPPVGVPINPSQLNHSGRnTQKQARTPS 88
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpalpaapAPPAVPAGPATPGGPARPAR-PPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   89 STTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTE 167
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  168 KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQS 247
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQP 2921
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  248 QEQTSEKTQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 317
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
480-504 1.94e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.94e-05
                          10        20
                  ....*....|....*....|....*
gi 1907143096 480 FCTICNRYFKTPRKFVEHVKSQGHK 504
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
596-629 1.94e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.08  E-value: 1.94e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1907143096  596 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 629
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
GIY-YIG_SF super family cl15257
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
449-516 2.31e-03

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


The actual alignment was detected with superfamily member cd10442:

Pssm-ID: 472790  Cd Length: 92  Bit Score: 37.73  E-value: 2.31e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 449 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 516
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
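
The summary line printed above each alignment (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be read into a record with a small parser. The sketch below is illustrative only: the function name parse_hit_stats and the returned field names are assumptions, not part of any NCBI tool, and the pattern is inferred solely from the lines shown on this page.

```python
import re

# Pattern inferred from the summary lines on this page. The bracketed tag
# (e.g. "[Multi-domain]") is optional because some hits, such as cd10442,
# are printed without it.
_STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)


def parse_hit_stats(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = _STATS.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "tag": m.group("tag"),  # None when the line carries no bracketed tag
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  "
            "Bit Score: 61.11  E-value: 2.45e-09")
    print(parse_hit_stats(line))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance, for example to keep only those below a chosen threshold.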
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-317 2.45e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 2.45e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSL-------LGPPPVGVPINPSQLNHSGRnTQKQARTPS 88
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpalpaapAPPAVPAGPATPGGPARPAR-PPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   89 STTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTE 167
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  168 KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQS 247
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQP 2921
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  248 QEQTSEKTQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 317
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
480-504 1.94e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.94e-05
                          10        20
                  ....*....|....*....|....*
gi 1907143096 480 FCTICNRYFKTPRKFVEHVKSQGHK 504
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
Agg_substance NF033875
LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, ...
26-254 1.56e-04

LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, are LPXTG-anchored large surface proteins that contribute to virulence. Several closely related paralogs may be found in a single strain.


Pssm-ID: 411439 [Multi-domain]  Cd Length: 1306  Bit Score: 45.09  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   26 VTAPSLAAPSLTPPQMVTPNLQQffpqatrqSLLGPPPVGVPINPSQLN-HSGRNTQKQARTPSSTTpnRKDSSSQTVPL 104
Cdd:NF033875    21 VVAPILFLGVLGVVGLATDNVQA--------AELDTQPGTTTVQPDNPDpQSGSETPKTAVSEEATV--QKDTTSQPTKV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  105 EDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEpfETLEPPAKrcrsseestekgPTGQPQARVQPQTQM 184
Cdd:NF033875    91 EEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPE--TTNEPLGQ------------PTEVAPAENEANKST 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  185 TAPKQTQTPDRLPEPPEVQMLPRI-----------------------QPQALQIQTQPKLLRQ-AQTQTSPEHLAPQQDQ 240
Cdd:NF033875   157 SIPKEFETPDVDKAVDEAKKDPNItvvekpaedlgnvsskdlaakekEVDQLQKEQAKKIAQQaAELKAKNEKIAKENAE 236
                          250
                   ....*....|....
gi 1907143096  241 VPTQAQSQEQTSEK 254
Cdd:NF033875   237 IAAKNKAEKERYEK 250
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
10-388 1.51e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  10 SLTMPTATLGNLRAFNVTAPSLAAPSLTPPQMVTPnlqQFFPQATRQSLLGPPPVGVPINPSQLNHSGrntqkqartPSS 89
Cdd:pfam03154 229 TLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSP---QPLPQPSLHGQMPPMPHSLQTGPSHMQHPV---------PPQ 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  90 TTPNRKDSSSQTVPLEdredPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPPAkrcrsseeSTEKG 169
Cdd:pfam03154 297 PFPLTPQSSQSQVPPG----PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPP--------TTPIP 364
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 170 PTGQPQARVQPqTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEhlAPQQDQVPTQAQSQE 249
Cdd:pfam03154 365 QLPNPQSHKHP-PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPP--PPAQPPVLTQSQSLP 441
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 250 qtsektqdqpqtwPQGSVPPPEQASGPAcATEPQLSSHAAEAGSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecek 327
Cdd:pfam03154 442 -------------PPAASHPPTSGLHQV-PSQSPFPQHPFVPGGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS---- 503
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907143096 328 ragemlGMWGAGSSLKVTILQSSNSRAFNTTPLTSGPRPGDSTSATPAIASTPS--KQSLQFF 388
Cdd:pfam03154 504 ------GPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPShaSQSARFY 560
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
596-629 1.94e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.08  E-value: 1.94e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1907143096  596 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 629
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
GIY-YIG_PLEs cd10442
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ...
449-516 2.31e-03

Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs, which contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLE EN domain is distinct from other GIY-YIG endonucleases in having a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, where X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by a PLE consists of two principal domains, an RT domain and an endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.


Pssm-ID: 198389  Cd Length: 92  Bit Score: 37.73  E-value: 2.31e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 449 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 516
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
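
The CCHH motif quoted in the cd10442 description, CX(2-7)CX(33-39)HX(3-5)H, translates directly into a regular expression. The sketch below is a minimal illustration under stated assumptions: the function name find_cchh is hypothetical, and "X can be any residue" is read as any single uppercase letter. A match only shows that the spacing pattern is present; it does not by itself indicate a functional endonuclease domain.

```python
import re

# CX(2-7)CX(33-39)HX(3-5)H, with X taken to be any single uppercase letter.
CCHH = re.compile(r"C[A-Z]{2,7}C[A-Z]{33,39}H[A-Z]{3,5}H")


def find_cchh(sequence):
    """Return (start, end, segment) for each non-overlapping motif match.

    Positions are 1-based and inclusive, matching the residue numbering
    used in the alignments. The input is upper-cased first because the
    alignment blocks print unaligned residues in lowercase.
    """
    seq = sequence.upper()
    return [(m.start() + 1, m.end(), m.group()) for m in CCHH.finditer(seq)]


if __name__ == "__main__":
    # Synthetic sequence built to satisfy the minimum spacing of the motif.
    demo = "MK" + "C" + "AA" + "C" + "A" * 33 + "H" + "AAA" + "H" + "GG"
    for start, end, segment in find_cchh(demo):
        print(start, end, segment)
```

Because re.finditer reports non-overlapping matches only, a sequence with several candidate cysteine pairs close together may need a lookahead-based variant to enumerate every possible placement.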
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-317 2.45e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 2.45e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSL-------LGPPPVGVPINPSQLNHSGRnTQKQARTPS 88
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASpalpaapAPPAVPAGPATPGGPARPAR-PPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   89 STTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-PEPFETLEPPAKRCRSSEESTE 167
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAgPLPPPTSAQPTAPPPPPGPPPP 2848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  168 KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPkllrQAQTQTSPEHLAPQQDQVPTQAQS 247
Cdd:PHA03247  2849 SLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSR---STES----FALPPDQPERPPQPQAPPPPQPQP 2921
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  248 QEQTSEKTQDQPQTWPQGSVPPPEQAsGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 317
Cdd:PHA03247  2922 QPPPPPQPQPPPPPPPRPQPPLAPTT-DPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-323 2.88e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 2.88e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096    2 PPATYDGASL-------TMPTATLGNLRAFNVTAPSLAAPSLTPPQMVTPnLQQFFPQATRQSLLGPPPVGVPINPSQLN 74
Cdd:PHA03247  2741 PPAVPAGPATpggparpARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS-LSESRESLPSPWDPADPPAAVLAPAAALP 2819
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   75 HSGRntqKQARTPSSTTPNRKDSSSQTVPLEDREdPTEGSEEATelqmdtcedqdslvGPDSMLSEPQVPEPEPFETLEP 154
Cdd:PHA03247  2820 PAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSL-PLGGSVAPG--------------GDVRRRPPSRSPAAKPAAPARP 2881
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  155 PAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlQIQTQPKLLRQAQTQTSPE-- 232
Cdd:PHA03247  2882 PVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP-QPPLAPTTDPAGAGEPSGAvp 2960
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  233 -----HLAPQQDQVPTQAQSQEQTSEKTqdqpqtwPQGSVPPPEQASGPAcatepqLSSHAAEAGSDPDKAlPEPVS--- 304
Cdd:PHA03247  2961 qpwlgALVPGRVAVPRFRVPQPAPSREA-------PASSTPPLTGHSLSR------VSSWASSLALHEETD-PPPVSlkq 3026
                          330       340
                   ....*....|....*....|....*..
gi 1907143096  305 --------AQSSEDRSREASAGGLDLG 323
Cdd:PHA03247  3027 tlwppddtEDSDADSLFDSDSERSDLE 3053
PRK10927 PRK10927
cell division protein FtsN;
144-311 1.87e-05

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 47.37  E-value: 1.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 144 PEPEP----FETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQ 216
Cdd:PRK10927   77 PKPEErwryIKELESRQPGVRAPTEPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQ 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 217 TQpkllRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQ--ASGPACATEPQLSSHAAEAGSD 294
Cdd:PRK10927  157 RQ----AQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQdlLQTPAHTTAQSKPQQAAPVTRA 232
                         170
                  ....*....|....*..
gi 1907143096 295 PDKalPEPVSAQSSEDR 311
Cdd:PRK10927  233 ADA--PKPTAEKKDERR 247
PHA03247 PHA03247
large tegument protein UL36; Provisional
38-305 1.91e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.40  E-value: 1.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   38 PPQMVTPNLQqffPQATRQSLLGPPPVGVPINPSqlnhsgrnTQKQARTPSSTTpnrkDSSSQTVPLEDREDPtEGSEEA 117
Cdd:PHA03247  2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPA--------VTSRARRPDAPP----QSARPRAPVDDRGDP-RGPAPP 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  118 TELQMDTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPPAK-----RCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQT 192
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDdpapgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  193 PDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPE-HLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPE 271
Cdd:PHA03247  2695 LTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAlPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA 2774
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907143096  272 QASGPA-CATEPQLSSHAAEAGSDPDKALPEPVSA 305
Cdd:PHA03247  2775 PAAGPPrRLTRPAVASLSESRESLPSPWDPADPPA 2809
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
480-504 1.94e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.94e-05
                          10        20
                  ....*....|....*....|....*
gi 1907143096 480 FCTICNRYFKTPRKFVEHVKSQGHK 504
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
Agg_substance NF033875
LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, ...
26-254 1.56e-04

LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, are LPXTG-anchored large surface proteins that contribute to virulence. Several closely related paralogs may be found in a single strain.


Pssm-ID: 411439 [Multi-domain]  Cd Length: 1306  Bit Score: 45.09  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   26 VTAPSLAAPSLTPPQMVTPNLQQffpqatrqSLLGPPPVGVPINPSQLN-HSGRNTQKQARTPSSTTpnRKDSSSQTVPL 104
Cdd:NF033875    21 VVAPILFLGVLGVVGLATDNVQA--------AELDTQPGTTTVQPDNPDpQSGSETPKTAVSEEATV--QKDTTSQPTKV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  105 EDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEpfETLEPPAKrcrsseestekgPTGQPQARVQPQTQM 184
Cdd:NF033875    91 EEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPE--TTNEPLGQ------------PTEVAPAENEANKST 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  185 TAPKQTQTPDRLPEPPEVQMLPRI-----------------------QPQALQIQTQPKLLRQ-AQTQTSPEHLAPQQDQ 240
Cdd:NF033875   157 SIPKEFETPDVDKAVDEAKKDPNItvvekpaedlgnvsskdlaakekEVDQLQKEQAKKIAQQaAELKAKNEKIAKENAE 236
                          250
                   ....*....|....
gi 1907143096  241 VPTQAQSQEQTSEK 254
Cdd:NF033875   237 IAAKNKAEKERYEK 250
PRK10263 PRK10263
DNA translocase FtsK; Provisional
130-271 4.09e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 4.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  130 SLVGPDSMLSEPQV---PEPEPfETLEP-----PAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPE 201
Cdd:PRK10263   345 PVASVDVPPAQPTVawqPVPGP-QTGEPviapaPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAP 423
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  202 VQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPE 271
Cdd:PRK10263   424 APEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
PRK10263 PRK10263
DNA translocase FtsK; Provisional
133-282 4.16e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 4.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  133 GPDSMLSEPQV-PEPEPFETLEPPAKrcrsseestekgpTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQ 211
Cdd:PRK10263   739 GPHEPLFTPIVePVQQPQQPVAPQQQ-------------YQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 805
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907143096  212 ALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQ-----TSEKTQDQPQTWPQGSVPPPEQASGPACATEP 282
Cdd:PRK10263   806 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTllhplLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEP 881
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
10-388 1.51e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 42.06  E-value: 1.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  10 SLTMPTATLGNLRAFNVTAPSLAAPSLTPPQMVTPnlqQFFPQATRQSLLGPPPVGVPINPSQLNHSGrntqkqartPSS 89
Cdd:pfam03154 229 TLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSP---QPLPQPSLHGQMPPMPHSLQTGPSHMQHPV---------PPQ 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  90 TTPNRKDSSSQTVPLEdredPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPPAkrcrsseeSTEKG 169
Cdd:pfam03154 297 PFPLTPQSSQSQVPPG----PSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPP--------TTPIP 364
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 170 PTGQPQARVQPqTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEhlAPQQDQVPTQAQSQE 249
Cdd:pfam03154 365 QLPNPQSHKHP-PHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPP--PPAQPPVLTQSQSLP 441
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 250 qtsektqdqpqtwPQGSVPPPEQASGPAcATEPQLSSHAAEAGSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecek 327
Cdd:pfam03154 442 -------------PPAASHPPTSGLHQV-PSQSPFPQHPFVPGGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS---- 503
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907143096 328 ragemlGMWGAGSSLKVTILQSSNSRAFNTTPLTSGPRPGDSTSATPAIASTPS--KQSLQFF 388
Cdd:pfam03154 504 ------GPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPShaSQSARFY 560
PRK10263 PRK10263
DNA translocase FtsK; Provisional
170-309 1.59e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  170 PTGQPQARVQPqtqmtAP-KQTQTPDRLPEPPEVQMlpriQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPtQAQSQ 248
Cdd:PRK10263   352 PPAQPTVAWQP-----VPgPQTGEPVIAPAPEGYPQ----QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPA-QQPYY 421
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907143096  249 EQTSEKTQDQPQTWPQGSVPPPEQASGPacatEPQLSSHAAEAGSDPDKALPEPVSAQSSE 309
Cdd:PRK10263   422 APAPEQPAQQPYYAPAPEQPVAGNAWQA----EEQQSTFAPQSTYQTEQTYQQPAAQEPLY 478
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
55-311 1.77e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.56  E-value: 1.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  55 RQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPS-STTPNR----KDSSSQTVPledredptegseeatELQMDTcedqd 129
Cdd:pfam09770 101 RFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQqPSKPVRtgyeKYKEPEPIP---------------DLQVDA----- 160
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 130 SL--VGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQPQT--------QMTAPKQTQTPDRLPEP 199
Cdd:pfam09770 161 SLwgVAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEAAMRAQAKKPAQQPApapaqppaAPPAQQAQQQQQFPPQI 240
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 200 PEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEhLAPQQDQVPTQAQSQEQTSEKTQDQP---------QTWPQGSVPPP 270
Cdd:pfam09770 241 QQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQ-PDPAQPSIQPQAQQFHQQPPPVPVQPtqilqnpnrLSAARVGYPQN 319
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|.
gi 1907143096 271 EQASGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDR 311
Cdd:pfam09770 320 PQPGVQPAPAHQAHRQQGSFGRQAPIITHPQQLAQLSEEEK 360
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
596-629 1.94e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.08  E-value: 1.94e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1907143096  596 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 629
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
PRK10263 PRK10263
DNA translocase FtsK; Provisional
200-311 2.26e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  200 PEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQPQTwPQGSVPPPEQASGPaca 279
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA-PQPQYQQPQQPVAP--- 826
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907143096  280 tEPQLSSHAAEAGSDPDKALPEPVSAQSSEDR 311
Cdd:PRK10263   827 -QPQYQQPQQPVAPQPQDTLLHPLLMRNGDSR 857
GIY-YIG_PLEs cd10442
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ...
449-516 2.31e-03

Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLE EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by a PLE consists of two principal domains, an RT domain and an endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.


Pssm-ID: 198389  Cd Length: 92  Bit Score: 37.73  E-value: 2.31e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 449 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 516
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
133-321 2.62e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 2.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 133 GPDSMLSEPQVPEPEPFETLEPPAKRCR------SSEESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQ 203
Cdd:PRK07764  602 APASSGPPEEAARPAAPAAPAAPAAPAPagaaaaPAEASAAPAPGVAAPEHHPKHVAVPDASDGgdgWPAKAGGAAPAAP 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 204 MLPRIQPQALQIQTQPKllRQAQTQTSPEHLAPQQDQVPTQAQSQEQTsektQDQPQTWPQGSVPPPEQASGPACATEPQ 283
Cdd:PRK07764  682 PPAPAPAAPAAPAGAAP--AQPAPAPAATPPAGQADDPAAQPPQAAQG----ASAPSPAADDPVPLPPEPDDPPDPAGAP 755
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 1907143096 284 LSSHAAEAGSDPDKALPEP-VSAQSSEDRSREASAGGLD 321
Cdd:PRK07764  756 AQPPPPPAPAPAAAPAAAPpPSPPSEEEEMAEDDAPSMD 794
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
67-323 4.26e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 40.46  E-value: 4.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  67 PINPSQLNHSGRNTQKQARTPSSTTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTcedQDSLVGPDSMLSEPQVP-- 144
Cdd:PRK08691  360 PLAAASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPS---EGKTAGPVSNQENNDVPpw 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 145 --EPEPFETLEPPAKrcrsseestekgptgQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKll 222
Cdd:PRK08691  437 edAPDEAQTAAGTAQ---------------TSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPN-- 499
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 223 RQAQTQTSPEHLAPQQdqvPTQAQSQEQTSEKTQDQPQtwpqgsVPPPEQASGPACATEPQLSSHAAEAGSDPDKALP-E 301
Cdd:PRK08691  500 DEAVETETFAHEAPAE---PFYGYGFPDNDCPPEDGAE------IPPPDWEHAAPADTAGGGADEEAEAGGIGGNNTPsA 570
                         250       260
                  ....*....|....*....|..
gi 1907143096 302 PVSAQSSEDRSREASAGGLDLG 323
Cdd:PRK08691  571 PPPEFSTENWAAIVRHFARKLG 592
PRK12757 PRK12757
cell division protein FtsN; Provisional
170-277 4.82e-03

cell division protein FtsN; Provisional


Pssm-ID: 237191 [Multi-domain]  Cd Length: 256  Bit Score: 39.26  E-value: 4.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 170 PTGQPQARVQPQ---------TQMTAPKQtQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPehlAPQQDQ 240
Cdd:PRK12757   68 PSAGGEVNSPTQltdeqrqllEQMQADMR-QQPTQLSEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQP---QPVTPP 143
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1907143096 241 VPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPA 277
Cdd:PRK12757  144 RQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEKE 180
PHA03247 PHA03247
large tegument protein UL36; Provisional
29-295 6.31e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 6.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   29 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSG-------RNTQKQARTPSSTTPNRKdsssqt 101
Cdd:PHA03247  2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrpRRARRLGRAAQASSPPQR------ 2682
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  102 vpledredPTEGSEEATelqmdtcedqdslVGPDSMLSEPQVPEPEPfETLEPPAKRCRSSEESTEKGPTGQPQARVQPQ 181
Cdd:PHA03247  2683 --------PRRRAARPT-------------VGSLTSLADPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPA 2740
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  182 TQM--TAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQT-QPKLLRQAQTQTSPEHLAPQQDQVPTQAQSQEQTSEKTQDQ 258
Cdd:PHA03247  2741 PPAvpAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGpPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907143096  259 PQTwPQGSVPPPEQA--SGPACATEPQLSSHAAEAGSDP 295
Cdd:PHA03247  2821 AAS-PAGPLPPPTSAqpTAPPPPPGPPPPSLPLGGSVAP 2858
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
164-299 6.59e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.97  E-value: 6.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 164 ESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVPT 243
Cdd:PRK07764  379 ERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA---APQPAPAPAPAPAPPSPAGNAPAGGAPS 455
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907143096 244 QAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEPQLSSH-AAEAGSDPDKAL 299
Cdd:PRK07764  456 PPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAApAAPAGADDAATL 512
PRK10927 PRK10927
cell division protein FtsN;
153-273 6.60e-03

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 39.28  E-value: 6.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096 153 EPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTpdRLPEPPEVQMLPRIQPQALQIQTQPKLLrQAQTQTSPE 232
Cdd:PRK10927  144 QTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQQQT--RTSQAAPVQAQPRQSKPASTQQPYQDLL-QTPAHTTAQ 220
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 1907143096 233 HLAPQQDQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQA 273
Cdd:PRK10927  221 SKPQQAAPVTRAADAPKPTAEKKDERRWMVQCGSFRGAEQA 261
PRK10263 PRK10263
DNA translocase FtsK; Provisional
7-293 9.03e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 39.30  E-value: 9.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096    7 DGASLTMPTATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSqlnhsgrntqkqart 86
Cdd:PRK10263   312 NGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPA--------------- 376
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096   87 PSSTTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEPfetLEPPAKRCRSSEEst 166
Cdd:PRK10263   377 PEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAP---EQPVAGNAWQAEE-- 451
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143096  167 eKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQmlPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVPTQAQ 246
Cdd:PRK10263   452 -QQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQ--PVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQ 528
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907143096  247 S-QEQTSEKTQDQPQTWPQG--SVPPPEQAS--GPACATEPQLSSHAAEAGS 293
Cdd:PRK10263   529 PiPEPVKEPEPIKSSLKAPSvaAVPPVEAAAavSPLASGVKKATLATGAAAT 580
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
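
For readers who want to process the hit list programmatically, the following is a minimal sketch (not an NCBI tool) that parses the per-hit summary lines shown above (`Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`) and keeps hits under the E-value threshold of 0.01 listed in the preset options. The function names `parse_summary` and `hits_below_threshold` are hypothetical, and the strict `<` comparison is an assumption; the report does not say whether the threshold is inclusive.

```python
import re

# Matches the per-hit summary line as it is printed in this report.
# The "[Multi-domain]" flag is optional (it is absent for cd10442).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line is not a summary."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Parse every summary line and keep hits with E-value < threshold (assumed strict)."""
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] < threshold]


if __name__ == "__main__":
    report_lines = [
        "Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 1.59e-03",
        "Pssm-ID: 198389  Cd Length: 92  Bit Score: 37.73  E-value: 2.31e-03",
        "DNA translocase FtsK; Provisional",
    ]
    for hit in hits_below_threshold(report_lines):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Lines that are not summary lines (descriptions, rulers, alignment rows) are skipped, so the whole report text can be passed in line by line.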

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.