|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 super family |
cl35903 |
DNA translocase FtsK; Provisional |
192-381 |
1.77e-07 |
|
DNA translocase FtsK; Provisional The actual alignment was detected with superfamily member PRK10263:
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 1.77e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 192 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 271
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 272 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 351
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1907143085 352 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 381
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
16-340 |
3.38e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 3.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfa 175
Cdd:PHA03247 2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------- 2809
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 176 ylwifASSEESTEKGPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQ 249
Cdd:PHA03247 2810 -----AVLAPAAALPPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVR 2884
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 250 TSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPnqaqaqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 329
Cdd:PHA03247 2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPP-------------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
330
....*....|.
gi 1907143085 330 QTWPQGSVPPP 340
Cdd:PHA03247 2952 AGEPSGAVPQP 2962
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
550-574 |
2.11e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 41.77 E-value: 2.11e-05
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
666-699 |
1.94e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.46 E-value: 1.94e-03
10 20 30
....*....|....*....|....*....|....
gi 1907143085 666 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 699
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_SF super family |
cl15257 |
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ... |
519-586 |
2.56e-03 |
|
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions. The actual alignment was detected with superfamily member cd10442:
Pssm-ID: 472790 Cd Length: 92 Bit Score: 37.73 E-value: 2.56e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 519 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 586
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
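Each hit above carries a statistics line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`. The following is a minimal sketch of how such a line could be read into typed fields. The names `STATS_RE` and `parse_stats` are hypothetical helpers introduced here for illustration, and the pattern assumes the fields appear in exactly the order and spelling shown in this listing.

```python
import re

# Hypothetical pattern for the per-hit statistics line as it appears in this listing.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the statistics fields as a dict, or None if the line does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    print(parse_stats(
        "Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 1.77e-07"
    ))
```

Parsing the E-value as a float makes it straightforward to sort or filter hits by significance, for example keeping only hits below a chosen threshold.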
|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
192-381 |
1.77e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 1.77e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 192 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 271
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 272 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 351
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1907143085 352 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 381
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
16-340 |
3.38e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 3.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfa 175
Cdd:PHA03247 2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------- 2809
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 176 ylwifASSEESTEKGPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQ 249
Cdd:PHA03247 2810 -----AVLAPAAALPPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVR 2884
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 250 TSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPnqaqaqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 329
Cdd:PHA03247 2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPP-------------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
330
....*....|.
gi 1907143085 330 QTWPQGSVPPP 340
Cdd:PHA03247 2952 AGEPSGAVPQP 2962
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
550-574 |
2.11e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 41.77 E-value: 2.11e-05
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
199-357 |
1.20e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.80 E-value: 1.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 199 VQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQPKllrQAQTQTSPEHLAPQQDQvePQVPSQPPWQLQpre 278
Cdd:pfam09770 199 VEAAMRAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPP---QIQQQQQPQQQPQQPQQ--HPGQGHPVTILQ--- 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 279 tDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQvPTQA----QSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEPQL 354
Cdd:pfam09770 268 -RPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQ-PTQIlqnpNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI 345
|
...
gi 1907143085 355 SSH 357
Cdd:pfam09770 346 ITH 348
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
666-699 |
1.94e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.46 E-value: 1.94e-03
10 20 30
....*....|....*....|....*....|....
gi 1907143085 666 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 699
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
519-586 |
2.56e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, an RT domain and an endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.56e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 519 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 586
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
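The GIY-YIG_PLEs description above gives the CCHH motif as CX(2-7)CX(33-39)HX(3-5)H, where X can be any residue. As a hedged illustration, that spacing pattern can be written as a regular expression and scanned over a protein sequence. The names `CCHH_RE` and `find_cchh` are hypothetical, X is assumed to be any uppercase letter, and the demonstration sequence is synthetic rather than taken from the query protein.

```python
import re

# CX(2-7)CX(33-39)HX(3-5)H, with X assumed to be any single residue letter.
CCHH_RE = re.compile(r"C[A-Z]{2,7}C[A-Z]{33,39}H[A-Z]{3,5}H")


def find_cchh(seq):
    """Return 1-based inclusive (start, end) spans of non-overlapping motif matches."""
    return [(m.start() + 1, m.end()) for m in CCHH_RE.finditer(seq.upper())]


if __name__ == "__main__":
    # Synthetic sequence built to satisfy the minimum spacing of each gap.
    synthetic = "MK" + "C" + "A" * 2 + "C" + "A" * 33 + "H" + "A" * 3 + "H" + "GG"
    print(find_cchh(synthetic))
```

A plain regular expression only tests the spacing of the four anchoring residues; it says nothing about whether the surrounding region resembles a GIY-YIG domain, so a match is a candidate site rather than evidence of a domain.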
|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
192-381 |
1.77e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 1.77e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 192 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 271
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 272 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 351
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1907143085 352 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 381
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
29-387 |
8.95e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 53.02 E-value: 8.95e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 29 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSG-------RNTQKQARTPSSTTPNRK------ 95
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrpRRARRLGRAAQASSPPQRprrraa 2688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 96 --------DSSSQTVPLEDREDPTEGSEEATELQMDTCEDQDSLVGPDSMLSEPQVPE-----------PEPFETLEPPA 156
Cdd:PHA03247 2689 rptvgsltSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpggparpARPPTTAGPPA 2768
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 157 KRCRRVRIKGIDHHnwlfAYLWIFASSEESTEKGPTgqPQARVQPQTQMTAPKQTQTPDRLP---EPPEVQMLPRIQPQA 233
Cdd:PHA03247 2769 PAPPAAPAAGPPRR----LTRPAVASLSESRESLPS--PWDPADPPAAVLAPAAALPPAASPagpLPPPTSAQPTAPPPP 2842
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 234 LQIQTQPKLLRQAQTQTSP--EHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPLwQAQSQKQAQTQAHPQV 311
Cdd:PHA03247 2843 PGPPPPSLPLGGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP-ERPPQPQAPPPPQPQP 2921
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907143085 312 PTQAQSQEQTSEKTQDQPQtwpqgsvPPPEQASGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 387
Cdd:PHA03247 2922 QPPPPPQPQPPPPPPPRPQ-------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
16-340 |
3.38e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 3.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 16 ATLGNLRAFNVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPnrk 95
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--- 2766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 96 dsSSQTVPledrEDPTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfa 175
Cdd:PHA03247 2767 --PAPAPP----AAPAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------- 2809
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 176 ylwifASSEESTEKGPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQ 249
Cdd:PHA03247 2810 -----AVLAPAAALPPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVR 2884
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 250 TSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPnqaqaqtqpqplwQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 329
Cdd:PHA03247 2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPP-------------QPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
330
....*....|.
gi 1907143085 330 QTWPQGSVPPP 340
Cdd:PHA03247 2952 AGEPSGAVPQP 2962
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
181-392 |
3.81e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.37 E-value: 3.81e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 181 ASSEESTEKGPTGQPQARVQPQ--TQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQ 258
Cdd:PRK07764 595 AGGEGPPAPASSGPPEEAARPAapAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 259 QDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQpqplWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQ----DQPQTWPQ 334
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA----ATPPAGQADDPAAQPPQAAQGASAPSPAADDPvplpPEPDDPPD 750
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907143085 335 GSVPPPEQASGPACATEPQLSSHAAEA-GSDPDKALPEPVSAQSSEDRsREASAGGLDL 392
Cdd:PRK07764 751 PAGAPAQPPPPPAPAPAAAPAAAPPPSpPSEEEEMAEDDAPSMDDEDR-RDAEEVAMEL 808
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
550-574 |
2.11e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 41.77 E-value: 2.11e-05
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
185-343 |
3.69e-05 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 46.60 E-value: 3.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 185 ESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQ 261
Cdd:PRK10927 101 EPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQ 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 262 VEPQVPSQPPWQLQPRetdPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPT--QAQSQEQTSEKTQDQPQTWPQGSVPP 339
Cdd:PRK10927 181 QQTRTSQAAPVQAQPR---QSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVtrAADAPKPTAEKKDERRWMVQCGSFRG 257
|
....
gi 1907143085 340 PEQA 343
Cdd:PRK10927 258 AEQA 261
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
182-329 |
4.02e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 47.41 E-value: 4.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 182 SSEESTEKGPTGQPQARVQPQTQmTAPKQTQTPDRLPEPPEVQMLPR--IQPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 259
Cdd:PRK14949 639 SSADRKPKTPPSRAPPASLSKPA-SSPDASQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRPPWEEAPE 717
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 260 DQVEPQVPSQPPwqlqpRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 329
Cdd:PRK14949 718 VASANDGPNNAA-----EGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTE 782
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
199-357 |
1.20e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.80 E-value: 1.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 199 VQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQPKllrQAQTQTSPEHLAPQQDQvePQVPSQPPWQLQpre 278
Cdd:pfam09770 199 VEAAMRAQAKKPAQQP---APAPAQPPAAPPAQQAQQQQQFPP---QIQQQQQPQQQPQQPQQ--HPGQGHPVTILQ--- 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 279 tDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQvPTQA----QSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATEPQL 354
Cdd:pfam09770 268 -RPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQ-PTQIlqnpNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI 345
|
...
gi 1907143085 355 SSH 357
Cdd:pfam09770 346 ITH 348
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
220-340 |
1.22e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.85 E-value: 1.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 220 PPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAqtqpqplwqAQS 299
Cdd:PRK10263 740 PHEPLFTPIVEPVQ---QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY---------QQP 807
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1907143085 300 QKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 340
Cdd:PRK10263 808 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
189-334 |
1.79e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 45.03 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 189 KGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQ----PKLLRQAQTQTSPEHLAPQQDQVEP 264
Cdd:pfam09770 208 KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGqghpVTILQRPQSPQPDPAQPSIQPQAQQ 287
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907143085 265 QVPSQPPWQLQPRETDP-PNQAQAQTQPQPlwqaqsqkQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQ 334
Cdd:pfam09770 288 FHQQPPPVPVQPTQILQnPNRLSAARVGYP--------QNPQPGVQPAPAHQAHRQQGSFGRQAPIITHPQ 350
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
191-323 |
2.14e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.86 E-value: 2.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 191 PTGQPQARVQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHLAPQQDQVEPQVPSQP 270
Cdd:PRK07994 383 ATAAPTAAVAPPQAPAVPPPPASA---PQQAPAVPLPETTSQLLAARQQ---LQRAQGATKAKKSEPAAASRARPVNSAL 456
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1907143085 271 PW--QLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSE 323
Cdd:PRK07994 457 ERlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPE 511
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
194-279 |
2.33e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 2.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 194 QPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVPSQPpwq 273
Cdd:PRK10263 767 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPVAP--- 839
|
....*.
gi 1907143085 274 lQPRET 279
Cdd:PRK10263 840 -QPQDT 844
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
191-320 |
4.19e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 4.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 191 PTGQPQARVQPQTQmtAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 270
Cdd:pfam09770 222 PAAPPAQQAQQQQQ--FPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907143085 271 PWQLQ-------PRETDPPNQAQAQTQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 320
Cdd:pfam09770 300 TQILQnpnrlsaARVGYPQNPQPGVQPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
191-282 |
1.14e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 191 PTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVP--S 268
Cdd:PRK10263 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPvaP 826
|
90
....*....|....
gi 1907143085 269 QPPWQlQPRETDPP 282
Cdd:PRK10263 827 QPQYQ-QPQQPVAP 839
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
666-699 |
1.94e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.46 E-value: 1.94e-03
10 20 30
....*....|....*....|....*....|....
gi 1907143085 666 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 699
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
519-586 |
2.56e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains the catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, an RT domain and an endonuclease (EN) domain, joined by a linker region of variable length. Both domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.56e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 519 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 586
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
188-373 |
6.75e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 40.06 E-value: 6.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 188 EKGPTGQPQARVQPQtqmtAPKQTQTPDRlPEPPEVQMLPRI--------QPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 259
Cdd:PTZ00449 566 EHKPSKIPTLSKKPE----FPKDPKHPKD-PEEPKKPKRPRSaqrptrpkSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 260 DQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVP- 338
Cdd:PTZ00449 641 RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRp 720
|
170 180 190
....*....|....*....|....*....|....*.
gi 1907143085 339 -PPEQASGPACATEPQlsshaaeagSDPDKALPEPV 373
Cdd:PTZ00449 721 lPPKLPRDEEFPFEPI---------GDPDAEQPDDI 747
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
193-458 |
6.85e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.14 E-value: 6.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 193 GQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHL-APQQDQVEPQVPsqPP 271
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQ---LPNPQSHKHPPHLsGPSPFQMNSNLP--PP 393
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 272 WQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEqtsektqdqpqtwPQGSVPPPEQASGPAcATE 351
Cdd:pfam03154 394 PALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLP-------------PPAASHPPTSGLHQV-PSQ 459
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 352 PQLSSHAAEAGSDPDKALPE--PVSAQSSEDRSREASAGGLDLGecekragemlGMWGAGSSLKVTILQSSNSRAFNTTP 429
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSgpPTSTSSAMPGIQPPSSASVSSS----------GPVPAAVSCPLPPVQIKEEALDEAEE 529
|
250 260 270
....*....|....*....|....*....|.
gi 1907143085 430 LTSGPRPGDSTSATPAIASTPS--KQSLQFF 458
Cdd:pfam03154 530 PESPPPPPRSPSPEPTVVNTPShaSQSARFY 560
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
196-351 |
7.89e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 39.28 E-value: 7.89e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 196 QARVQPQTQMTAPKQTQTPDRLpEPPEVQMLPRIQPQALQIQTQpkLLRQAQTQTSPEHLAP--QQDQVEPQVPSQPPWQ 273
Cdd:PRK10927 93 QPGVRAPTEPSAGGEVKTPEQL-TPEQRQLLEQMQADMRQQPTQ--LVEVPWNEQTPEQRQQtlQRQRQAQQLAEQQRLA 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 274 LQPRETDPPNQAQAQTQPqplwqaQSQKQAQTQAHPQVPTQAQSQE--QTSEKTQDQPQtwPQGSVPPPEQASGPACATE 351
Cdd:PRK10927 170 QQSRTTEQSWQQQTRTSQ------AAPVQAQPRQSKPASTQQPYQDllQTPAHTTAQSK--PQQAAPVTRAADAPKPTAE 241
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
186-342 |
9.73e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 39.25 E-value: 9.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 186 STEKGPTGQPQARVQPQTQMTAPKQTQTPDR-------LPEP-PEVQMLPRI----------QPQALQIQTQPKLLRQ-- 245
Cdd:pfam09770 111 AAQSSAQPPASSLPQYQYASQQSQQPSKPVRtgyekykEPEPiPDLQVDASLwgvapkkaaaPAPAPQPAAQPASLPAps 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907143085 246 ----------AQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPlwQAQSQKQAQTQAHP-QVPTQ 314
Cdd:pfam09770 191 rkmmsleeveAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQ--QPQQPQQHPGQGHPvTILQR 268
|
170 180
....*....|....*....|....*...
gi 1907143085 315 AQSQEQTSEKTQDQPQTWPQGSVPPPEQ 342
Cdd:pfam09770 269 PQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
|
|
|
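In the alignment blocks above, each row gives a label, a start coordinate, the aligned residues, and an end coordinate. Dashes mark gaps and lowercase letters mark query residues that are not aligned to the model, so the end coordinate should equal the start plus the number of letters minus one. The sketch below checks that relationship for a single row. `ROW_RE`, `parse_row`, and `row_is_consistent` are hypothetical names, and the pattern assumes the whitespace-separated layout seen in this listing.

```python
import re

# Hypothetical pattern for one alignment row: label, start, residues, end.
ROW_RE = re.compile(
    r"^(?P<label>gi\s+\d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Split an alignment row into (label, start, residues, end), or return None."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return m.group("label"), int(m.group("start")), m.group("seq"), int(m.group("end"))


def row_is_consistent(line):
    """True when end == start + (number of residue letters) - 1; gaps are not counted."""
    parsed = parse_row(line)
    if parsed is None:
        return False
    _, start, seq, end = parsed
    residues = sum(ch.isalpha() for ch in seq)
    return start + residues - 1 == end


if __name__ == "__main__":
    print(row_is_consistent("gi 1907143085 352 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 381"))
    print(row_is_consistent("Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503"))
```

A check like this is a cheap way to detect rows that were truncated or merged when the page was converted to text.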