NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720400485|ref|XP_030107884|]
View 

cip1-interacting zinc finger protein isoform X15 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRK10263 super family cl35903
DNA translocase FtsK; Provisional
171-360 2.59e-07

DNA translocase FtsK; Provisional


The actual alignment was detected with superfamily member PRK10263:

Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.32  E-value: 2.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  171 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 250
Cdd:PRK10263   327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  251 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 330
Cdd:PRK10263   397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720400485  331 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 360
Cdd:PRK10263   476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
PHA03247 super family cl33720
large tegument protein UL36; Provisional
16-319 5.28e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 5.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAKrcrSSEESTEKGPTGQPQ 175
Cdd:PHA03247  2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPA 2825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  176 ARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 249
Cdd:PHA03247  2826 GPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  250 PwqlqpRETDPPNQAQAqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 319
Cdd:PHA03247  2906 E-----RPPQPQAPPPP--------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
529-553 1.81e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


:

Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.81e-05
                          10        20
                  ....*....|....*....|....*
gi 1720400485 529 FCTICNRYFKTPRKFVEHVKSQGHK 553
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
645-678 1.74e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


:

Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.46  E-value: 1.74e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720400485  645 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 678
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
GIY-YIG_SF super family cl15257
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
498-565 2.48e-03

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


The actual alignment was detected with superfamily member cd10442:

Pssm-ID: 472790  Cd Length: 92  Bit Score: 37.73  E-value: 2.48e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 498 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 565
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
 
Name Accession Description Interval E-value
PRK10263 PRK10263
DNA translocase FtsK; Provisional
171-360 2.59e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.32  E-value: 2.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  171 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 250
Cdd:PRK10263   327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  251 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 330
Cdd:PRK10263   397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720400485  331 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 360
Cdd:PRK10263   476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-319 5.28e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 5.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAKrcrSSEESTEKGPTGQPQ 175
Cdd:PHA03247  2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPA 2825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  176 ARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 249
Cdd:PHA03247  2826 GPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  250 PwqlqpRETDPPNQAQAqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 319
Cdd:PHA03247  2906 E-----RPPQPQAPPPP--------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
529-553 1.81e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.81e-05
                          10        20
                  ....*....|....*....|....*
gi 1720400485 529 FCTICNRYFKTPRKFVEHVKSQGHK 553
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
132-299 4.92e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.95  E-value: 4.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 132 VGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTE-------KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQM 204
Cdd:pfam09770 165 VAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 205 LPRIQPQ----------ALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQ-------PRETDPPNQAQAQ 267
Cdd:pfam09770 245 QPQQQPQqpqqhpgqghPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQnpnrlsaARVGYPQNPQPGV 324
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 1720400485 268 TQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 299
Cdd:pfam09770 325 QPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
Agg_substance NF033875
LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, ...
26-208 2.13e-04

LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, are LPXTG-anchored large surface proteins that contribute to virulence. Several closely related paralogs may be found in a single strain.


Pssm-ID: 411439 [Multi-domain]  Cd Length: 1306  Bit Score: 45.09  E-value: 2.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   26 VTAPSLAAPSLTPPQMVTPNLQQffpqatrqSLLGPPPVGVPINPSQLN-HSGRNTQKQARTPSSTTpnRKDSSSQTVPL 104
Cdd:NF033875    21 VVAPILFLGVLGVVGLATDNVQA--------AELDTQPGTTTVQPDNPDpQSGSETPKTAVSEEATV--QKDTTSQPTKV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  105 EDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEpfETLEPPAKrcrsseestekgPTGQPQARVQPQTQM 184
Cdd:NF033875    91 EEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPE--TTNEPLGQ------------PTEVAPAENEANKST 156
                          170       180
                   ....*....|....*....|....
gi 1720400485  185 TAPKQTQTPDRLPEPPEVQMLPRI 208
Cdd:NF033875   157 SIPKEFETPDVDKAVDEAKKDPNI 180
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
645-678 1.74e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.46  E-value: 1.74e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720400485  645 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 678
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
29-437 1.93e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 1.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  29 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQarTPSSTTPNRKDSSSQTVPLEDRE 108
Cdd:pfam03154 179 GAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQ--TPTLHPQRLPSPHPPLQPMTQPP 256
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 109 DPTEGSEEATElqmdtcedQDSLVGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTEKGP-------TGQPQARVQPQ 181
Cdd:pfam03154 257 PPSQVSPQPLP--------QPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPpgpspaaPGQSQQRIHTP 328
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 182 TQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHL-APQQDQVEPQVPsqPPWQLQPRETDP 260
Cdd:pfam03154 329 PSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQ---LPNPQSHKHPPHLsGPSPFQMNSNLP--PPPALKPLSSLS 403
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 261 PNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEqtsektqdqpqtwPQGSVPPPEQASGPAcATEPQLSSHAAEA 340
Cdd:pfam03154 404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLP-------------PPAASHPPTSGLHQV-PSQSPFPQHPFVP 469
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 341 GSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecekragemlGMWGAGSSLKVTILQSSNSRAFNTTPLTSGPRPGDS 418
Cdd:pfam03154 470 GGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS----------GPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRS 539
                         410       420
                  ....*....|....*....|.
gi 1720400485 419 TSATPAIASTPS--KQSLQFF 437
Cdd:pfam03154 540 PSPEPTVVNTPShaSQSARFY 560
GIY-YIG_PLEs cd10442
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ...
498-565 2.48e-03

Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, jointed by a linker region of variable length. Both of these two domains are functionally active.


Pssm-ID: 198389  Cd Length: 92  Bit Score: 37.73  E-value: 2.48e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 498 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 565
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
 
Name Accession Description Interval E-value
PRK10263 PRK10263
DNA translocase FtsK; Provisional
171-360 2.59e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 54.32  E-value: 2.59e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  171 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 250
Cdd:PRK10263   327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  251 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 330
Cdd:PRK10263   397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720400485  331 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 360
Cdd:PRK10263   476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
PHA03247 PHA03247
large tegument protein UL36; Provisional
26-366 3.69e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.17  E-value: 3.69e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   26 VTAPSLAAPSLTPPQMVTPNLQQFfPQATRQSLLGPPPVGVPINPsqlnhsgRNTQKQARTPSSTTPNRKDSSSQTVPLe 105
Cdd:PHA03247  2667 ARRLGRAAQASSPPQRPRRRAARP-TVGSLTSLADPPPPPPTPEP-------APHALVSATPLPPGPAAARQASPALPA- 2737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  106 DREDPTEGSEEATELQMDTCEDQDSLVGPDSMlSEPQVPEPEPFETLEPPAkrCRSSEESTEKGPTgqPQARVQPQTQMT 185
Cdd:PHA03247  2738 APAPPAVPAGPATPGGPARPARPPTTAGPPAP-APPAAPAAGPPRRLTRPA--VASLSESRESLPS--PWDPADPPAAVL 2812
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  186 APKQTQTPDRLP---EPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSP--EHLAPQQDQVEPQVPSQPPWQLQPRETDP 260
Cdd:PHA03247  2813 APAAALPPAASPagpLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLARPAVS 2892
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  261 PNQAQAQTQPQPLwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQtwpqgsvPPPEQASGPACATEPQLSSHAAEA 340
Cdd:PHA03247  2893 RSTESFALPPDQP-ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-------PPLAPTTDPAGAGEPSGAVPQPWL 2964
                          330       340
                   ....*....|....*....|....*.
gi 1720400485  341 GSDPDKALPEPVSAQSSEDRSREASA 366
Cdd:PHA03247  2965 GALVPGRVAVPRFRVPQPAPSREAPA 2990
PHA03247 PHA03247
large tegument protein UL36; Provisional
16-319 5.28e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 5.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAKrcrSSEESTEKGPTGQPQ 175
Cdd:PHA03247  2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPA 2825
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  176 ARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 249
Cdd:PHA03247  2826 GPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  250 PwqlqpRETDPPNQAQAqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 319
Cdd:PHA03247  2906 E-----RPPQPQAPPPP--------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQP 2962
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
153-371 6.79e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 6.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 153 EPPAKRCRSSEESTEKGPtGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPE 232
Cdd:PRK07764  591 APGAAGGEGPPAPASSGP-PEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 233 HLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQpqplWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQ----DQP 308
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA----ATPPAGQADDPAAQPPQAAQGASAPSPAADDPvplpPEP 745
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720400485 309 QTWPQGSVPPPEQASGPACATEPQLSSHAAEA-GSDPDKALPEPVSAQSSEDRsREASAGGLDL 371
Cdd:PRK07764  746 DDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSpPSEEEEMAEDDAPSMDDEDR-RDAEEVAMEL 808
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
83-308 7.29e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 49.72  E-value: 7.29e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  83 QARTPSSTTPNRKDSSSQTVPLEDREDPTEGSEEATELQMDTCEDQD----SLVGPDSMLSEpqVPEPEPFETLEPpakr 158
Cdd:PRK14949  564 YNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASLADDDildaVLAARDSLLSD--LDALSPKEGDGK---- 637
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 159 cRSSEESTEKGPTGQPQARVQPQTQmTAPKQTQTPDRLPEPPEVQMLPR--IQPQALQIQTQPKLLRQAQTQTSPEHLAP 236
Cdd:PRK14949  638 -KSSADRKPKTPPSRAPPASLSKPA-SSPDASQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRPPWEEA 715
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720400485 237 QQDQVEPQVPSQPPwqlqpRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 308
Cdd:PRK14949  716 PEVASANDGPNNAA-----EGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTE 782
PRK10927 PRK10927
cell division protein FtsN;
144-322 9.46e-06

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 48.14  E-value: 9.46e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 144 PEPEP----FETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQ 216
Cdd:PRK10927   77 PKPEErwryIKELESRQPGVRAPTEPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQ 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 217 TQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRetdPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPT--QA 294
Cdd:PRK10927  157 RQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPR---QSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVtrAA 233
                         170       180
                  ....*....|....*....|....*...
gi 1720400485 295 QSQEQTSEKTQDQPQTWPQGSVPPPEQA 322
Cdd:PRK10927  234 DAPKPTAEKKDERRWMVQCGSFRGAEQA 261
zf-C2H2_jaz pfam12171
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ...
529-553 1.81e-05

Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.


Pssm-ID: 432381 [Multi-domain]  Cd Length: 27  Bit Score: 41.77  E-value: 1.81e-05
                          10        20
                  ....*....|....*....|....*
gi 1720400485 529 FCTICNRYFKTPRKFVEHVKSQGHK 553
Cdd:pfam12171   3 YCVLCDKYFKSENALQNHLKSKKHK 27
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
132-299 4.92e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.95  E-value: 4.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 132 VGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTE-------KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQM 204
Cdd:pfam09770 165 VAPKKAAAPAPAPQPAAQPASLPAPSRKMMSLEEVEaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 205 LPRIQPQ----------ALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQ-------PRETDPPNQAQAQ 267
Cdd:pfam09770 245 QPQQQPQqpqqhpgqghPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQnpnrlsaARVGYPQNPQPGV 324
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 1720400485 268 TQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 299
Cdd:pfam09770 325 QPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
PRK10263 PRK10263
DNA translocase FtsK; Provisional
199-319 1.71e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.08  E-value: 1.71e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  199 PPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQaqtqpqplwQAQS 278
Cdd:PRK10263   740 PHEPLFTPIVEPVQ---QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ---------YQQP 807
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1720400485  279 QKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 319
Cdd:PRK10263   808 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
141-302 1.71e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.86  E-value: 1.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 141 PQVPEPEPFETLEPPAKRCRSSeestekgPTGQPQARVQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQpk 220
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQ-------ATAAPTAAVAPPQAPAVPPPPASA---PQQAPAVPLPETTSQLLAARQQ-- 428
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 221 lLRQAQTQTSPEHLAPQQDQVEPQVPSQPP--WQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQE 298
Cdd:PRK07994  429 -LQRAQGATKAKKSEPAAASRARPVNSALErlASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHE 507

                  ....
gi 1720400485 299 QTSE 302
Cdd:PRK07994  508 KTPE 511
Agg_substance NF033875
LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, ...
26-208 2.13e-04

LPXTG-anchored aggregation substance; Aggregation substances, as described in Enterococcus, are LPXTG-anchored large surface proteins that contribute to virulence. Several closely related paralogs may be found in a single strain.


Pssm-ID: 411439 [Multi-domain]  Cd Length: 1306  Bit Score: 45.09  E-value: 2.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   26 VTAPSLAAPSLTPPQMVTPNLQQffpqatrqSLLGPPPVGVPINPSQLN-HSGRNTQKQARTPSSTTpnRKDSSSQTVPL 104
Cdd:NF033875    21 VVAPILFLGVLGVVGLATDNVQA--------AELDTQPGTTTVQPDNPDpQSGSETPKTAVSEEATV--QKDTTSQPTKV 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  105 EDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPEPEpfETLEPPAKrcrsseestekgPTGQPQARVQPQTQM 184
Cdd:NF033875    91 EEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPE--TTNEPLGQ------------PTEVAPAENEANKST 156
                          170       180
                   ....*....|....*....|....
gi 1720400485  185 TAPKQTQTPDRLPEPPEVQMLPRI 208
Cdd:NF033875   157 SIPKEFETPDVDKAVDEAKKDPNI 180
PRK10263 PRK10263
DNA translocase FtsK; Provisional
133-243 2.73e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 2.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  133 GPDSMLSEPQV-PEPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQ 211
Cdd:PRK10263   739 GPHEPLFTPIVePVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 818
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1720400485  212 ALQIQTQPKllRQAQTQTSPEHLAPQQDQVEP 243
Cdd:PRK10263   819 QPQQPVAPQ--PQYQQPQQPVAPQPQDTLLHP 848
PHA03247 PHA03247
large tegument protein UL36; Provisional
38-344 2.95e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 2.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485   38 PPQMVTPNLQqffPQATRQSLLGPPPVGVPINPSqlnhsgrnTQKQARTPSSTTpnrkDSSSQTVPLEDREDPtEGSEEA 117
Cdd:PHA03247  2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPA--------VTSRARRPDAPP----QSARPRAPVDDRGDP-RGPAPP 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  118 TELQMDTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPPAK-----RCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQT 192
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDdpapgRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  193 PDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPehlAPQQDQVEPQVPS------------QPPWQLQPRETDP 260
Cdd:PHA03247  2695 LTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASP---ALPAAPAPPAVPAgpatpggparpaRPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  261 PNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTW-PQGSVPPPEQA--SGPACATEPQLSSHA 337
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAsPAGPLPPPTSAqpTAPPPPPGPPPPSLP 2851

                   ....*..
gi 1720400485  338 AEAGSDP 344
Cdd:PHA03247  2852 LGGSVAP 2858
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
133-352 1.22e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.37  E-value: 1.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 133 GPDSMLSEPQVP-EPEPFETLEPPAKRCRSSEESTEKGPTGQPQARVQP--------------QTQMTAPKQTQTPDRlP 197
Cdd:PTZ00449  513 GPEASGLPPKAPgDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPgpakehkpskiptlSKKPEFPKDPKHPKD-P 591
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 198 EPPEVQMLPRI--------QPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQ 269
Cdd:PTZ00449  592 EEPKKPKRPRSaqrptrpkSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKF 671
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 270 PQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVP--PPEQASGPACATEPQlsshaaeagSDPDKA 347
Cdd:PTZ00449  672 KEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRplPPKLPRDEEFPFEPI---------GDPDAE 742

                  ....*
gi 1720400485 348 LPEPV 352
Cdd:PTZ00449  743 QPDDI 747
PRK10263 PRK10263
DNA translocase FtsK; Provisional
170-261 1.57e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 1.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  170 PTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVP--S 247
Cdd:PRK10263   751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPvaP 826
                           90
                   ....*....|....
gi 1720400485  248 QPPWQlQPRETDPP 261
Cdd:PRK10263   827 QPQYQ-QPQQPVAP 839
ZnF_U1 smart00451
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ...
645-678 1.74e-03

U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.


Pssm-ID: 197732 [Multi-domain]  Cd Length: 35  Bit Score: 36.46  E-value: 1.74e-03
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1720400485  645 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 678
Cdd:smart00451   3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
29-437 1.93e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 1.93e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  29 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQarTPSSTTPNRKDSSSQTVPLEDRE 108
Cdd:pfam03154 179 GAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQ--TPTLHPQRLPSPHPPLQPMTQPP 256
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 109 DPTEGSEEATElqmdtcedQDSLVGPDSMLSEPQVPEPEPFETLEPPAKRCRSSEESTEKGP-------TGQPQARVQPQ 181
Cdd:pfam03154 257 PPSQVSPQPLP--------QPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPpgpspaaPGQSQQRIHTP 328
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 182 TQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHL-APQQDQVEPQVPsqPPWQLQPRETDP 260
Cdd:pfam03154 329 PSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQ---LPNPQSHKHPPHLsGPSPFQMNSNLP--PPPALKPLSSLS 403
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 261 PNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEqtsektqdqpqtwPQGSVPPPEQASGPAcATEPQLSSHAAEA 340
Cdd:pfam03154 404 THHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLP-------------PPAASHPPTSGLHQV-PSQSPFPQHPFVP 469
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 341 GSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecekragemlGMWGAGSSLKVTILQSSNSRAFNTTPLTSGPRPGDS 418
Cdd:pfam03154 470 GGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS----------GPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRS 539
                         410       420
                  ....*....|....*....|.
gi 1720400485 419 TSATPAIASTPS--KQSLQFF 437
Cdd:pfam03154 540 PSPEPTVVNTPShaSQSARFY 560
GIY-YIG_PLEs cd10442
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ...
498-565 2.48e-03

Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, jointed by a linker region of variable length. Both of these two domains are functionally active.


Pssm-ID: 198389  Cd Length: 92  Bit Score: 37.73  E-value: 2.48e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 498 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 565
Cdd:cd10442     6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
PRK10263 PRK10263
DNA translocase FtsK; Provisional
132-258 7.50e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 39.68  E-value: 7.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  132 VGPDSMLSEPQVPEPepfetlePPAKRCRSSEESTEKGPTGQPQARVQPQTQMTAPKQTQTpdrlPEPPEVQMLPRIQPQ 211
Cdd:PRK10263   772 VAPQPQYQQPQQPVA-------PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVA----PQPQYQQPQQPVAPQ 840
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1720400485  212 ALQIQTQPKLLRQAQTQ--TSPEHLAPQQDQVEPqvpsqPPWQLQPRET 258
Cdd:PRK10263   841 PQDTLLHPLLMRNGDSRplHKPTTPLPSLDLLTP-----PPSEVEPVDT 884
PRK10263 PRK10263
DNA translocase FtsK; Provisional
135-325 7.56e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 39.68  E-value: 7.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  135 DSMLSEPQVPEPEPfetlePPAKRcrSSEESTEKGP-TGQPQARVQPQTQMTAPKQTQTPDRLPEP---PEVQMLPRIQP 210
Cdd:PRK10263   338 EPVTQTPPVASVDV-----PPAQP--TVAWQPVPGPqTGEPVIAPAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQPYYAP 410
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485  211 QALQIQTQPKLLRQAQTQTSPEHLAPQqdqvEPQVPSQPPWQLQPREtdppnqaqaqtqpqPLWQAQSQKQAQtQAHPQv 290
Cdd:PRK10263   411 AAEQPAQQPYYAPAPEQPAQQPYYAPA----PEQPVAGNAWQAEEQQ--------------STFAPQSTYQTE-QTYQQ- 470
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1720400485  291 PTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGP 325
Cdd:PRK10263   471 PAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPP 505
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
149-321 8.71e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 39.63  E-value: 8.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 149 FETLEPPAKrcrsseeSTEKGPTGQPQARVQPQTQMTAPKQTQTPDR-------LPEP-PEVQML----------PRIQP 210
Cdd:pfam09770 102 FNRQQPAAR-------AAQSSAQPPASSLPQYQYASQQSQQPSKPVRtgyekykEPEPiPDLQVDaslwgvapkkAAAPA 174
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400485 211 QALQIQTQPKLLRQ------------AQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPlwQAQS 278
Cdd:pfam09770 175 PAPQPAAQPASLPApsrkmmsleeveAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ--QPQQ 252
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 1720400485 279 QKQAQTQAHP-QVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQ 321
Cdd:pfam09770 253 PQQHPGQGHPvTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH