|
Name |
Accession |
Description |
Interval |
E-value |
| PACT_coil_coil |
pfam10495 |
Pericentrin-AKAP-450 domain of centrosomal targeting protein; This domain is a coiled-coil ... |
2713-2791 |
9.38e-27 |
|
Pericentrin-AKAP-450 domain of centrosomal targeting protein; This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
Pssm-ID: 463115 Cd Length: 77 Bit Score: 105.37 E-value: 9.38e-27
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732403 2713 YLRAESFRKALIYQKKYLLLLIGGFQDSEQETLSMIAHLGVFPSKADKKitmSRPFTKFRTAVRVVIAVLRLRFLVKKW 2791
Cdd:pfam10495 2 LLREESFRKDLAYQKKYLLLLLGGYQACNKADLRLLAQMGVTPPDDRPS---RKPRRKLKSAALAVIAIARMKRLAREW 77
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
868-1486 |
2.85e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.05 E-value: 2.85e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 868 QELQASQDQVAQVRDKVFLLNRELEECRAELEQLQQRRERENQEGTTlicmLRADLELAQGEGKALRDALRRLLDLFGDT 947
Cdd:COG1196 239 AELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE----AQAEEYELLAELARLEQDIARLEERRREL 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 948 LKAAVTLKSRISERAGLLLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEgaetssvceisshvcesffispent 1027
Cdd:COG1196 315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE------------------------- 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1028 ldceqpIRRVYQSLSTAVEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALE 1107
Cdd:COG1196 370 ------AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEA 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1108 LHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEksvlwnqketltnEAKEREAGSPV 1187
Cdd:COG1196 444 LEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------------EADYEGFLEGV 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1188 LPAHKDTALQEEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHR--------GCAQLAEEVATLKQQLAAL 1259
Cdd:COG1196 511 KAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIeylkaakaGRATFLPLDKIRARAALAA 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1260 DKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPRDSQCVQLDEEVELLQEKLREKLDGFNELVIKK 1339
Cdd:COG1196 591 ALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRREL 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1340 DFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQENMGDLLLTTVSRSGLDEAgcpMLPQG 1419
Cdd:COG1196 671 LAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELL---EEEEL 747
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039732403 1420 SSSRGPEAQPDVTERALLQHENEVVHRRNSE--------IDELKSLIENLQENQRQLQkDKAEEIEQLHEVIEKL 1486
Cdd:COG1196 748 LEEEALEELPEPPDLEELERELERLEREIEAlgpvnllaIEEYEELEERYDFLSEQRE-DLEEARETLEEAIEEI 821
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
118-584 |
1.06e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 71.12 E-value: 1.06e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 118 LELEALRLSLNNMHTAQLELTQA-NLQKEKETALTELREMLNGRRAQELALLQSRQQcELELLREQHAREKEEMAlrsgq 196
Cdd:COG1196 281 LELEEAQAEEYELLAELARLEQDiARLEERRRELEERLEELEEELAELEEELEELEE-ELEELEEELEEAEEELE----- 354
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 197 EAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQsemeglQSQFQKELSEQKVELEKIFQAKHEAEV 276
Cdd:COG1196 355 EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQ------LEELEEAEEALLERLERLEEELEELEE 428
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 277 SLKNLEAQHQAAIKKLQEDLQSEhcQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLEsMKTNREE 356
Cdd:COG1196 429 ALAELEEEEEEEEEALEEAAEEE--AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLE-AEADYEG 505
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 357 LNGSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLEST 436
Cdd:COG1196 506 FLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRAR 585
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 437 EISSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSREELKREAEER 516
Cdd:COG1196 586 AALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGG 665
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 517 LASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDLL 584
Cdd:COG1196 666 SRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAE 733
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
133-647 |
2.58e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 66.70 E-value: 2.58e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 133 AQLELTQANLQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEM-------------ALRSGQEAA 199
Cdd:PTZ00121 1263 AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAkkkaeeakkkadaAKKKAEEAK 1342
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 200 ELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQSEMEGLQSQFQKELSEQKVELE----------KIFQ 269
Cdd:PTZ00121 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADelkkaaaakkKADE 1422
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 270 AKHEAEVSLKNLEAQHQAAIKKLQEDLQ--SEHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQL 347
Cdd:PTZ00121 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKkkAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 348 ---ESMKTNREELNGSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQR 424
Cdd:PTZ00121 1503 kkaAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 425 LQEAREdsLESTEISSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQV---------------KLSKAEEKIQ 489
Cdd:PTZ00121 1583 AEEAKK--AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVeqlkkkeaeekkkaeELKKAEEENK 1660
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 490 QMKEEFQKKEAEWELSREELKREAEERLASMFLELREKAESEKLSIISRFEHRESsmrhlqdQQAAQILDLERSLMEQQG 569
Cdd:PTZ00121 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK-------KKAEELKKAEEENKIKAE 1733
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 570 HLRQLEQELTRDdllpcsqcGQEPAMAQEEKNGALLREKEDCAlqllmAQNRFLEERKEIMEKFAKEQDAFLRDAQEK 647
Cdd:PTZ00121 1734 EAKKEAEEDKKK--------AEEAKKDEEEKKKIAHLKKEEEK-----KAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
2214-2503 |
7.36e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 64.96 E-value: 7.36e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2214 RRQVELLAYKVEqekcianELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAV 2293
Cdd:COG1196 224 ELEAELLLLKLR-------ELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2294 LDGKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQK 2373
Cdd:COG1196 297 LARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE 376
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2374 ELQIEASRCEALLAQEKGQLSELQKSLEAERSrslELSEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTR 2453
Cdd:COG1196 377 AEEELEELAEELLEALRAAAELAAQLEELEEA---EEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE 453
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2454 ALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEH 2503
Cdd:COG1196 454 LEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1057-1708 |
2.17e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.54 E-value: 2.17e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1057 KQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESE 1136
Cdd:TIGR02168 288 KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL 367
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1137 QQLILELEDLRKQLQQAARELLTLKEEKSVLWNQ-----------KETLTNEAKEREAGSPVLPAHKDTALQEEVESLTR 1205
Cdd:TIGR02168 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEierlearlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELEE 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1206 VQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEfq 1285
Cdd:TIGR02168 448 ELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVL-- 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1286 QEIQRLEGQLRQAARPRPPGPRDSQCVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQR 1365
Cdd:TIGR02168 526 SELISVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAK 605
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1366 QMVQLQEELEKQKKSM-------EELKEK-EILKQENMGDLLLT----TVSRSG-------------------LDEAGCP 1414
Cdd:TIGR02168 606 DLVKFDPKLRKALSYLlggvlvvDDLDNAlELAKKLRPGYRIVTldgdLVRPGGvitggsaktnssilerrreIEELEEK 685
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1415 MLPQGSSSRGPEAQPDVTERALLQHENEV-------------VHRRNSEIDELKSLIENLQENQRQLQK---DKAEEIEQ 1478
Cdd:TIGR02168 686 IEELEEKIAELEKALAELRKELEELEEELeqlrkeleelsrqISALRKDLARLEAEVEQLEERIAQLSKeltELEAEIEE 765
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1479 LHEVIEKLQSELSLMGPKVHEVSDPQAG-------------SLHSELACLRGEGLGGQALRSELQAAQAAKEVFGQLLAD 1545
Cdd:TIGR02168 766 LEERLEEAEEELAEAEAEIEELEAQIEQlkeelkalrealdELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEE 845
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1546 QAHGHSQALEALQQRLQDAEEVAARHLAELEHCVALREAEVEAMA---SQIQEFAATLKAKEAIIEQRDLEIDAVNKWKV 1622
Cdd:TIGR02168 846 QIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALAllrSELEELSEELRELESKRSELRRELEELREKLA 925
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1623 SHSLELEAILLALAHFRHALEQQTCATPDEPPELRQLRVQCARlshqlqvlyrpflKCRMQLDQHQPHVASIGCANPCAD 1702
Cdd:TIGR02168 926 QLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEE-------------EARRRLKRLENKIKELGPVNLAAI 992
|
....*.
gi 1039732403 1703 DELEQE 1708
Cdd:TIGR02168 993 EEYEEL 998
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
345-1164 |
9.67e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 61.23 E-value: 9.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 345 SQLESMKTN-------REELNGSWDPVLAQASHLE-------ELEHLRSGFAQQQQQERAQHESELEHLrvyfEKKLKDA 410
Cdd:TIGR02168 179 RKLERTRENldrlediLNELERQLKSLERQAEKAErykelkaELRELELALLVLRLEELREELEELQEE----LKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 411 EKTYQEDLTVFQQRLQEARedsLESTEISSSCVLPEETSGREGKEPPDpLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQ 490
Cdd:TIGR02168 255 LEELTAELQELEEKLEELR---LEVSELEEEIEELQKELYALANEISR-LEQQKQILRERLANLERQLEELEAQLEELES 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 491 MKEEFQKKEAEWELSREELKREAEErLASMFLELREKAESEKLSIISRFEHREssmrhlqdQQAAQILDLERSLMEQQGH 570
Cdd:TIGR02168 331 KLDELAEELAELEEKLEELKEELES-LEAELEELEAELEELESRLEELEEQLE--------TLRSKVAQLELQIASLNNE 401
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 571 LRQLEQELTRddllpcsqcgqepamaQEEKNGALLREKEDCALQLLMAQNRFLEERKEIMEKFAKEQDAFLRDAQEKhnh 650
Cdd:TIGR02168 402 IERLEARLER----------------LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEA--- 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 651 elQLLQQGHQQQLLALRMELETKHRsELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELESCYVADV 730
Cdd:TIGR02168 463 --LEELREELEEAEQALDAAERELA-QLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAI 539
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 731 QTIRDEHQQA---LELLRAELEEQLQKKESCHREMLTQelenLKRQHAEELQSVRDSLRMEmsaqhIENGKGPAADLQGA 807
Cdd:TIGR02168 540 EAALGGRLQAvvvENLNAAKKAIAFLKQNELGRVTFLP----LDSIKGTEIQGNDREILKN-----IEGFLGVAKDLVKF 610
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 808 HQQQDPAMALHNEGHLLVEDGDavlrsvDAEGLLHQAGPQELG---DAHTV---------EMQKSQAELAKPQELQASQD 875
Cdd:TIGR02168 611 DPKLRKALSYLLGGVLVVDDLD------NALELAKKLRPGYRIvtlDGDLVrpggvitggSAKTNSSILERRREIEELEE 684
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 876 QVAQVRDKVFLLNRELEECRAELEQLQQRRE---RENQEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAAV 952
Cdd:TIGR02168 685 KIEELEEKIAELEKALAELRKELEELEEELEqlrKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIE 764
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 953 TLKSRISERAGLLLDHEDAADTSDARLAAAAlgdmwsdEGLLEIDRTLPEGAETSSVCEISSHVCESFFISPENTL-DCE 1031
Cdd:TIGR02168 765 ELEERLEEAEEELAEAEAEIEELEAQIEQLK-------EELKALREALDELRAELTLLNEEAANLRERLESLERRIaATE 837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1032 QPIRRVYQSLSTAVEGLLEMAL----------DSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAK 1101
Cdd:TIGR02168 838 RRLEDLEEQIEELSEDIESLAAeieeleelieELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRREL 917
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1102 DRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEK 1164
Cdd:TIGR02168 918 EELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENK 980
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
142-529 |
2.94e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 59.99 E-value: 2.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 142 LQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLKQ 221
Cdd:pfam02463 634 LTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELK 713
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 222 DWESERELCLENLRQELSLKHQSEMEGLQSQF-QKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAA--IKKLQEDLQS 298
Cdd:pfam02463 714 KLKLEAEELLADRVQEAQDKINEELKLLKQKIdEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREktEKLKVEEEKE 793
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 299 EHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLwSQLESMKTNREELNgswdpvlaqashlEELEHLR 378
Cdd:pfam02463 794 EKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEEL-ALELKEEQKLEKLA-------------EEELERL 859
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 379 SGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLESTEISSSCVLPEETSGREGKEPPD 458
Cdd:pfam02463 860 EEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEE 939
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 459 PLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQ-------MKEEFQKKEAEWELSREELKREAEERLASMFLELREKAE 529
Cdd:pfam02463 940 LLLEEADEKEKEENNKEEEEERNKRLLLAKEElgkvnlmAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQ 1017
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2203-2516 |
8.44e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.14 E-value: 8.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2203 EARGSQREHQLRRQVE-----LLAYKVEQEKCIANELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHackqentSLL 2277
Cdd:TIGR02168 208 QAEKAERYKELKAELRelelaLLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVS-------ELE 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2278 ESLDKVQQEVLRLRAVLDGKEKELKVVLEELESergkgqalQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQL 2357
Cdd:TIGR02168 281 EEIEELQKELYALANEISRLEQQKQILRERLAN--------LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE 352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2358 CVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLSELQKSLEAERSRSLELSEALqherlltEQLSRNSQEACARQET 2437
Cdd:TIGR02168 353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL-------ERLEDRRERLQQEIEE 425
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732403 2438 QVQHALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQErcVELRREKERELEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:TIGR02168 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE--LEEAEQALDAAERELAQLQARLDSLERLQENLE 502
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
156-430 |
2.13e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 56.99 E-value: 2.13e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 156 MLNGRRAQELALLQSRQQcELELLREQhaREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLKQDWE---SERELCLE 232
Cdd:TIGR02168 660 VITGGSAKTNSSILERRR-EIEELEEK--IEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSrqiSALRKDLA 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 233 NLRQ--ELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEA---QHQAAIKKLQEDLQSEHCQY--LQ 305
Cdd:TIGR02168 737 RLEAevEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAqieQLKEELKALREALDELRAELtlLN 816
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 306 DLEQKFREKEKAKELELETLQASYEDLKAQS---QEEIRLLWSQLESMKTNREELNGSWDPVLAQASHLEELEHLRSGFA 382
Cdd:TIGR02168 817 EEAANLRERLESLERRIAATERRLEDLEEQIeelSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSEL 896
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 1039732403 383 QQQQQERAQHESELEHLRVYFEkKLKDAEKTYQEDLTVFQQRLQEARE 430
Cdd:TIGR02168 897 EELSEELRELESKRSELRRELE-ELREKLAQLELRLEGLEVRIDNLQE 943
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2187-2506 |
7.16e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.07 E-value: 7.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2187 QNQEKLQQLCAALTSTEARGSQREHQLRRQVELlayKVEQEKCIANELQKT--LSKEQETASDVRKRLVVEQNAVQDLKS 2264
Cdd:PTZ00121 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADEA---KKAEEKKKADELKKAeeLKKAEEKKKAEEAKKAEEDKNMALRKA 1583
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2265 ELhaCKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESErgkgqalqAQQEEQQLRYLQREGQSSRALEELK 2344
Cdd:PTZ00121 1584 EE--AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA--------EEEKKKVEQLKKKEAEEKKKAEELK 1653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2345 LSLEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRCEALL--AQEKGQLSELQKSLEAERSRSLELSEALQHERLLTE 2422
Cdd:PTZ00121 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKkeAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2423 QLSRNSQEacarqetqvqhallRKLKAEKTRALELEAmlEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDE 2502
Cdd:PTZ00121 1734 EAKKEAEE--------------DKKKAEEAKKDEEEK--KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
|
....
gi 1039732403 2503 HKIE 2506
Cdd:PTZ00121 1798 KKIK 1801
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
2209-2516 |
9.58e-06 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 51.29 E-value: 9.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2209 REHQLRRQVELLAYKVEQEKCIaNELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVL 2288
Cdd:pfam07111 312 REKVFALMVQLKAQDLEHRDSV-KQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEVERMSAKGLQMELSRAQEARR 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2289 RLRAVLDGKEKELKVVLEELESERGKGQALQAQQEEQQLR---------YLQREGQSSRALEELKLSLEKQLAQ------ 2353
Cdd:pfam07111 391 RQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQAVARipslsnrlsYAVRKVHTIKGLMARKVALAQLRQEscpppp 470
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2354 -----NNQLCVALKHERAAKDNLQKELQI-------EASRCEALLAQEKGQLSELQKSLEAERSRS--------LELSEA 2413
Cdd:pfam07111 471 pappvDADLSLELEQLREERNRLDAELQLsahliqqEVGRAREQGEAERQQLSEVAQQLEQELQRAqeslasvgQQLEVA 550
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2414 LQHERLLTEQLSRNSQEACARQETQVQhALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERE 2493
Cdd:pfam07111 551 RQGQQESTEEAASLRQELTQQQEIYGQ-ALQEKVAEVETRLREQLSDTKRRLNEARREQAKAVVSLRQIQHRATQEKERN 629
|
330 340
....*....|....*....|...
gi 1039732403 2494 LEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:pfam07111 630 QELRRLQDEARKEEGQRLARRVQ 652
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
485-1382 |
2.97e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.97 E-value: 2.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 485 EEKIQQMKEEFQKKEAEWELSREELKREAEERLASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERSL 564
Cdd:pfam02463 194 ELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKE 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 565 MEQQghLRQLEQELTRDDLLPCSQCGQEPAMAQEEKNGALLREKedcaLQLLMAQNRFLEERKEimekfaKEQDAFLRDA 644
Cdd:pfam02463 274 NKEE--EKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK----LKESEKEKKKAEKELK------KEKEEIEELE 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 645 QEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELES 724
Cdd:pfam02463 342 KELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLL 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 725 CYVADVQTIRDEHQQALELLRAELEEQLQKKEschremLTQELENLKRQHAEElQSVRDSLRMEMSAQHIENGKGPAADL 804
Cdd:pfam02463 422 KEEKKEELEILEEEEESIELKQGKLTEEKEEL------EKQELKLLKDELELK-KSEDLLKETQLVKLQEQLELLLSRQK 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 805 QGAhqqqdpamALHNEGHLLVEDGDAVLRSVDAEGLLHQAGPQELGDAHTVemqksqAELAKPQELQASQDQVAQVRDKV 884
Cdd:pfam02463 495 LEE--------RSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVA------VENYKVAISTAVIVEVSATADEV 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 885 FllnrELEECRAELEQLQQRRERENQEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAAVTLKSRISERAGL 964
Cdd:pfam02463 561 E----ERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTK 636
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 965 LLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEGAETSSVCEISSHVCESFFISPENTLDCEQPIRRVYQSLSTA 1044
Cdd:pfam02463 637 LKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLK 716
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1045 VEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALEL-HTAKGLLEGFKVEKV 1123
Cdd:pfam02463 717 LEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEReKTEKLKVEEEKEEKL 796
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1124 DLQEALGKKEESEQQLILEledlrkQLQQAARELLTLKEEKSVLWNQKETLTNEAKEREAgspvlpahKDTALQEEVESL 1203
Cdd:pfam02463 797 KAQEEELRALEEELKEEAE------LLEEEQLLIEQEEKIKEEELEELALELKEEQKLEK--------LAEEELERLEEE 862
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1204 TRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRgcaQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREE 1283
Cdd:pfam02463 863 ITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEK---KELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEE 939
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1284 FQQEIQRLEGQLRQAArprppgprdsqcvqLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASI 1363
Cdd:pfam02463 940 LLLEEADEKEKEENNK--------------EEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
|
890
....*....|....*....
gi 1039732403 1364 QRQMVQLQEELEKQKKSME 1382
Cdd:pfam02463 1006 KLIRAIIEETCQRLKEFLE 1024
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
203-532 |
4.64e-05 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 49.24 E-value: 4.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 203 EKLRSEMEKNAQT--------IETLKQDWESERELCLENLRQELSLKHQSEMEGLQSQFQKELSEQKvelekifqaKHEA 274
Cdd:NF033838 72 SEIQKSLDKRKHTqnvalnkkLSDIKTEYLYELNVLKEKSEAELTSKTKKELDAAFEQFKKDTLEPG---------KKVA 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 275 EVSLKNLEAQHQAAIKKlQEDLQSEHCQYLQDLEQKFREKE-KAKELELETLQAsyEDLKAQSQEEIRLLWSQLESMKtn 353
Cdd:NF033838 143 EATKKVEEAEKKAKDQK-EEDRRNYPTNTYKTLELEIAESDvEVKKAELELVKE--EAKEPRDEEKIKQAKAKVESKK-- 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 354 reelngswdpvlAQASHLEELEHLRsgfaqqqqqERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQR--LQE-ARE 430
Cdd:NF033838 218 ------------AEATRLEKIKTDR---------EKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRgvLGEpATP 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 431 DSLESTEISSSCVLPEETsgregkePPDPlDLQLGQpKVQESlvedcQVKLSKAEEKIQQMKEEFQK------------- 497
Cdd:NF033838 277 DKKENDAKSSDSSVGEET-------LPSP-SLKPEK-KVAEA-----EKKVEEAKKKAKDQKEEDRRnyptntyktlele 342
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1039732403 498 --------KEAEWELSREELKR-EAEERLAsmflELREKAESEK 532
Cdd:NF033838 343 iaesdvkvKEAELELVKEEAKEpRNEEKIK----QAKAKVESKK 382
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
887-1387 |
9.15e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.11 E-value: 9.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 887 LNRELEECRAELEQLQQRRERENQEGTTLICMLrADLELAQGEGKALRDALRRLLDLFGDTLKAAVTLKSRISERAGLLL 966
Cdd:PRK02224 211 LESELAELDEEIERYEEQREQARETRDEADEVL-EEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLE 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 967 DHEDAADtsdARLAAAALGDMWSDEGLLEIDRTLPEGAETSSVCEisshvcesffispentlDCEQPIrrvyQSLSTAVE 1046
Cdd:PRK02224 290 ELEEERD---DLLAEAGLDDADAEAVEARREELEDRDEELRDRLE-----------------ECRVAA----QAHNEEAE 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1047 GLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEK---- 1122
Cdd:PRK02224 346 SLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERdelr 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1123 -------VDLQEALGKKEESEQQL-----------------ILELEDLRKQLQQAARELLTLKEEKSVLWNQKETLTnEA 1178
Cdd:PRK02224 426 ereaeleATLRTARERVEEAEALLeagkcpecgqpvegsphVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE-DL 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1179 KEREAGSpvlpahkdTALQEEVESLTRVQWESRKQSEKDRATlLSQMRVLESELEDQLVQHRGCAQLA--------EEVA 1250
Cdd:PRK02224 505 VEAEDRI--------ERLEERREDLEELIAERRETIEEKRER-AEELRERAAELEAEAEEKREAAAEAeeeaeearEEVA 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1251 TLKQQLAALDKHLRSQRQFMDDQAAEREHEreefqQEIQRLEGQLRQAARprppgprdsqcvQLDEEVELLQEK------ 1324
Cdd:PRK02224 576 ELNSKLAELKERIESLERIRTLLAAIADAE-----DEIERLREKREALAE------------LNDERRERLAEKrerkre 638
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039732403 1325 LREKLDG--FNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEkqkkSMEELKEK 1387
Cdd:PRK02224 639 LEAEFDEarIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELE----ELEELRER 699
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
2084-2309 |
9.01e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.06 E-value: 9.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2084 QALSEVTT---DKGEKESLETH-----LTWSEELLRAIQEVFAREQEKAELQPRPYGSNLGDYNSLVQRLEKVIQEQGdp 2155
Cdd:TIGR02169 272 QLLEELNKkikDLGEEEQLRVKekigeLEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER-- 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2156 qkvqdhlclSDRSSLLAEIQALRAQLRMTHLQNQEKlqqlcaaltSTEARGSQREH-QLRRQVELLAYKVEQEKCIANEL 2234
Cdd:TIGR02169 350 ---------KRRDKLTEEYAELKEELEDLRAELEEV---------DKEFAETRDELkDYREKLEKLKREINELKRELDRL 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2235 QKTLSKEQETASDVR---KRLVVEQNAVQ----DLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEE 2307
Cdd:TIGR02169 412 QEELQRLSEELADLNaaiAGIEAKINELEeekeDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRE 491
|
..
gi 1039732403 2308 LE 2309
Cdd:TIGR02169 492 LA 493
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
1479-1615 |
1.74e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.42 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1479 LHEVIEKLQSELSLMGPKVHEVSD------PQAGSLHSELACLRGEGLGGQALRSELQAAQAAKEVFGQLLADQAHGHSQ 1552
Cdd:PRK09039 44 LSREISGKDSALDRLNSQIAELADllslerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQ 123
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1553 ALEALQQrlqdaeeVAARHLAElehcVALREAEVEAMASQIQEFAATLKAKEAIIEQRDLEID 1615
Cdd:PRK09039 124 ELDSEKQ-------VSARALAQ----VELLNQQIAALRRQLAALEAALDASEKRDRESQAKIA 175
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PACT_coil_coil |
pfam10495 |
Pericentrin-AKAP-450 domain of centrosomal targeting protein; This domain is a coiled-coil ... |
2713-2791 |
9.38e-27 |
|
Pericentrin-AKAP-450 domain of centrosomal targeting protein; This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
Pssm-ID: 463115 Cd Length: 77 Bit Score: 105.37 E-value: 9.38e-27
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732403 2713 YLRAESFRKALIYQKKYLLLLIGGFQDSEQETLSMIAHLGVFPSKADKKitmSRPFTKFRTAVRVVIAVLRLRFLVKKW 2791
Cdd:pfam10495 2 LLREESFRKDLAYQKKYLLLLLGGYQACNKADLRLLAQMGVTPPDDRPS---RKPRRKLKSAALAVIAIARMKRLAREW 77
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
868-1486 |
2.85e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.05 E-value: 2.85e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 868 QELQASQDQVAQVRDKVFLLNRELEECRAELEQLQQRRERENQEGTTlicmLRADLELAQGEGKALRDALRRLLDLFGDT 947
Cdd:COG1196 239 AELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE----AQAEEYELLAELARLEQDIARLEERRREL 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 948 LKAAVTLKSRISERAGLLLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEgaetssvceisshvcesffispent 1027
Cdd:COG1196 315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLE------------------------- 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1028 ldceqpIRRVYQSLSTAVEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALE 1107
Cdd:COG1196 370 ------AEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEA 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1108 LHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEksvlwnqketltnEAKEREAGSPV 1187
Cdd:COG1196 444 LEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------------EADYEGFLEGV 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1188 LPAHKDTALQEEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHR--------GCAQLAEEVATLKQQLAAL 1259
Cdd:COG1196 511 KAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIeylkaakaGRATFLPLDKIRARAALAA 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1260 DKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPRDSQCVQLDEEVELLQEKLREKLDGFNELVIKK 1339
Cdd:COG1196 591 ALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRREL 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1340 DFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQENMGDLLLTTVSRSGLDEAgcpMLPQG 1419
Cdd:COG1196 671 LAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELL---EEEEL 747
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039732403 1420 SSSRGPEAQPDVTERALLQHENEVVHRRNSE--------IDELKSLIENLQENQRQLQkDKAEEIEQLHEVIEKL 1486
Cdd:COG1196 748 LEEEALEELPEPPDLEELERELERLEREIEAlgpvnllaIEEYEELEERYDFLSEQRE-DLEEARETLEEAIEEI 821
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1085-1673 |
3.21e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 72.66 E-value: 3.21e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1085 QKQQELLERLREESAAKDRLALELHTAKglLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEK 1164
Cdd:COG1196 206 ERQAEKAERYRELKEELKELEAELLLLK--LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELEL 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1165 SVLWNQKETLTNEAKEREAGSpvlpAHKDTALQEEVESLTRVQWEsRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQ 1244
Cdd:COG1196 284 EEAQAEEYELLAELARLEQDI----ARLEERRRELEERLEELEEE-LAELEEELEELEEELEELEEELEEAEEELEEAEA 358
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1245 LAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARprppgprdsQCVQLDEEVELLQEK 1324
Cdd:COG1196 359 ELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLE---------RLERLEEELEELEEA 429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1325 LREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQENMGD--LLLTT 1402
Cdd:COG1196 430 LAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADyeGFLEG 509
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1403 VSRSGLDEAGCPMLPQGSSSRGPEAQPDVTERALLqhENEVVHRRNSEIDELKSLIENLQEN-------------QRQLQ 1469
Cdd:COG1196 510 VKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAAL--AAALQNIVVEDDEVAAAAIEYLKAAkagratflpldkiRARAA 587
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1470 KDKAEEIEQLHEVIEKLQSELSLMGPKVHEVSDPQAGSLHSELACLRGEGLGGQALRSELQAAQAAKEVF--GQLLADQA 1547
Cdd:COG1196 588 LAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSagGSLTGGSR 667
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1548 HGHSQALEALQQRLQDAEEVAARHLAELEHcVALREAEVEAMASQIQEFAATLKAKEAIIEQRDLEIDAvnkwkvshsle 1627
Cdd:COG1196 668 RELLAALLEAEAELEELAERLAEEELELEE-ALLAEEEEERELAEAEEERLEEELEEEALEEQLEAERE----------- 735
|
570 580 590 600
....*....|....*....|....*....|....*....|....*.
gi 1039732403 1628 leAILLALAHFRHALEQQTCATPDEPPELRQLRVQCARLSHQLQVL 1673
Cdd:COG1196 736 --ELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
118-584 |
1.06e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 71.12 E-value: 1.06e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 118 LELEALRLSLNNMHTAQLELTQA-NLQKEKETALTELREMLNGRRAQELALLQSRQQcELELLREQHAREKEEMAlrsgq 196
Cdd:COG1196 281 LELEEAQAEEYELLAELARLEQDiARLEERRRELEERLEELEEELAELEEELEELEE-ELEELEEELEEAEEELE----- 354
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 197 EAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQsemeglQSQFQKELSEQKVELEKIFQAKHEAEV 276
Cdd:COG1196 355 EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQ------LEELEEAEEALLERLERLEEELEELEE 428
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 277 SLKNLEAQHQAAIKKLQEDLQSEhcQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLEsMKTNREE 356
Cdd:COG1196 429 ALAELEEEEEEEEEALEEAAEEE--AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLE-AEADYEG 505
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 357 LNGSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLEST 436
Cdd:COG1196 506 FLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRAR 585
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 437 EISSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSREELKREAEER 516
Cdd:COG1196 586 AALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGG 665
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 517 LASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDLL 584
Cdd:COG1196 666 SRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAE 733
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1038-1621 |
7.78e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 68.04 E-value: 7.78e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1038 YQSLSTAVEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEG 1117
Cdd:COG1196 234 LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRE 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1118 FKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEksvLWNQKETLTNEAKEREagspvlpahkdtALQ 1197
Cdd:COG1196 314 LEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAE---LAEAEEALLEAEAELA------------EAE 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1198 EEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAER 1277
Cdd:COG1196 379 EELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEE 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1278 EHEREEFQQEIQRLEGQLRQAARPRPPgprdsqcVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQ--------LLIQ 1349
Cdd:COG1196 459 EALLELLAELLEEAALLEAALAELLEE-------LAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLrglagavaVLIG 531
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1350 EEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKeilkqenmgdlLLTTVSRSGLDEAgcpmlpQGSSSRGPEAQP 1429
Cdd:COG1196 532 VEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAA-----------KAGRATFLPLDKI------RARAALAAALAR 594
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1430 DVTERALLQHENEVVHRRNSEIDELKSLIENLQENQRQLQKDKAEEIEQLHEVIEKLQSELSLMGPkvhevsdpqagslh 1509
Cdd:COG1196 595 GAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGG-------------- 660
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1510 selaclRGEGLGGQALRSELQAAQAAKEVFGQLLADQAHGHSQALEALQQRLQDAEEVAARHLAELEHCVALREAEVEAM 1589
Cdd:COG1196 661 ------SLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAER 734
|
570 580 590
....*....|....*....|....*....|..
gi 1039732403 1590 ASQIQEFAATLKAKEAIIEQRDLEIDAVNKWK 1621
Cdd:COG1196 735 EELLEELLEEEELLEEEALEELPEPPDLEELE 766
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1039-1562 |
1.05e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 67.65 E-value: 1.05e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1039 QSLSTAVEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQEL-------LERLREESAAKDRLALELHTA 1111
Cdd:COG1196 249 EELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLeqdiarlEERRRELEERLEELEEELAEL 328
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1112 KGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEKSVLWNQKETLTNEAKEREAgspvlpah 1191
Cdd:COG1196 329 EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAA-------- 400
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1192 KDTALQEEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQFMD 1271
Cdd:COG1196 401 QLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA 480
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1272 DQAAEREHEREEF---QQEIQRLEGQLRQAARPR--PPGPRDSQCVQLDEEVELLQEKLREKLDGFNELVI--------- 1337
Cdd:COG1196 481 ELLEELAEAAARLlllLEAEADYEGFLEGVKAALllAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIvveddevaa 560
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1338 ------KKDFADQQLLIQEEEIKRLEETNASIQRQMVQ---------LQEELEKQKKSMEELKEKEILKQENMGDLLLTT 1402
Cdd:COG1196 561 aaieylKAAKAGRATFLPLDKIRARAALAAALARGAIGaavdlvasdLREADARYYVLGDTLLGRTLVAARLEAALRRAV 640
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1403 VSRSGLDEAGCPM---LPQGSSSRGPEAQPDVTERALLQHENEVVHRRNSEIDELKSLIENLQENQRQLQKDKAEEIEQL 1479
Cdd:COG1196 641 TLAGRLREVTLEGeggSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEE 720
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1480 HEVIEKLQSELSLMGPKVHEVSDPQAGSLHSELAcLRGEGLGGQALRSELQAAQAAKEVFGQ--LLADQAHghsqalEAL 1557
Cdd:COG1196 721 LEEEALEEQLEAEREELLEELLEEEELLEEEALE-ELPEPPDLEELERELERLEREIEALGPvnLLAIEEY------EEL 793
|
....*
gi 1039732403 1558 QQRLQ 1562
Cdd:COG1196 794 EERYD 798
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
142-723 |
1.12e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 67.65 E-value: 1.12e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 142 LQKEKETAL--TELREMLNGRRAQELALlqsrqqcELELLREQHAREKEEMalrsgQEAAELKEKLRSEMEKNAQTIETL 219
Cdd:COG1196 205 LERQAEKAEryRELKEELKELEAELLLL-------KLRELEAELEELEAEL-----EELEAELEELEAELAELEAELEEL 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 220 KQDweserelcLENLRQELSLKHQSEMEgLQSQFQKELSEQKVELEKIfQAKHEAEVSLKNLEAQHQAAIKKLQEDLQSe 299
Cdd:COG1196 273 RLE--------LEELELELEEAQAEEYE-LLAELARLEQDIARLEERR-RELEERLEELEEELAELEEELEELEEELEE- 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 300 hcqyLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLESMKTNREELngswdpvLAQASHLEELEHLRS 379
Cdd:COG1196 342 ----LEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAA-------AELAAQLEELEEAEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 380 GfaqqQQQERAQHESELEHLrvyfEKKLKDAEKTYQEDLTVFQQRLQEAREDSLESTEISSScVLPEETSGREGKEPPDP 459
Cdd:COG1196 411 A----LLERLERLEEELEEL----EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLEL-LAELLEEAALLEAALAE 481
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 460 LDLQLGQPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKE-----AEWELSREELKREAEERLASMFLELREKAESEKLS 534
Cdd:COG1196 482 LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGlagavAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAA 561
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 535 IIsrfehressmRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDLLPCSQCGQEPAMAQEEKNGALLREKEDCALQ 614
Cdd:COG1196 562 AI----------EYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAAR 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 615 LLMAQNRFLEERKEIMEKFAKEQDAFLRDAQEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASSESRRQALLE 694
Cdd:COG1196 632 LEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAE 711
|
570 580
....*....|....*....|....*....
gi 1039732403 695 THVAELQVKHNAEISALEKRHLSNLDELE 723
Cdd:COG1196 712 AEEERLEEELEEEALEEQLEAEREELLEE 740
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
133-647 |
2.58e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 66.70 E-value: 2.58e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 133 AQLELTQANLQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEM-------------ALRSGQEAA 199
Cdd:PTZ00121 1263 AHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAkkkaeeakkkadaAKKKAEEAK 1342
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 200 ELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQSEMEGLQSQFQKELSEQKVELE----------KIFQ 269
Cdd:PTZ00121 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADelkkaaaakkKADE 1422
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 270 AKHEAEVSLKNLEAQHQAAIKKLQEDLQ--SEHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQL 347
Cdd:PTZ00121 1423 AKKKAEEKKKADEAKKKAEEAKKADEAKkkAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 348 ---ESMKTNREELNGSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQR 424
Cdd:PTZ00121 1503 kkaAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK 1582
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 425 LQEAREdsLESTEISSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQV---------------KLSKAEEKIQ 489
Cdd:PTZ00121 1583 AEEAKK--AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVeqlkkkeaeekkkaeELKKAEEENK 1660
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 490 QMKEEFQKKEAEWELSREELKREAEERLASMFLELREKAESEKLSIISRFEHRESsmrhlqdQQAAQILDLERSLMEQQG 569
Cdd:PTZ00121 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEK-------KKAEELKKAEEENKIKAE 1733
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 570 HLRQLEQELTRDdllpcsqcGQEPAMAQEEKNGALLREKEDCAlqllmAQNRFLEERKEIMEKFAKEQDAFLRDAQEK 647
Cdd:PTZ00121 1734 EAKKEAEEDKKK--------AEEAKKDEEEKKKIAHLKKEEEK-----KAEEIRKEKEAVIEEELDEEDEKRRMEVDK 1798
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
152-784 |
2.93e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 66.70 E-value: 2.93e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 152 ELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQEAAELK----EKLRSEMEKNAQTIETLKQDWESER 227
Cdd:PTZ00121 1186 EVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKkdaeEAKKAEEERNNEEIRKFEEARMAHF 1265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 228 ELCLENLRQELSLKHQsEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAIKKLQEDLQSEHCQYLQDL 307
Cdd:PTZ00121 1266 ARRQAAIKAEEARKAD-ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 308 EQKFREKEKAKELELETLQASYEDLKAQSQEEIRllwsQLESMKTNREELNGSWDPVLAQASHLEELEHLRSGFAQQQQQ 387
Cdd:PTZ00121 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK----KADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 388 ERAQHESelEHLRVYFEKKLKDAEKTYQEDLtvfQQRLQEAREDSLESTEISSSCVLPEETSGREGKEPPDPLDLQLGQP 467
Cdd:PTZ00121 1421 DEAKKKA--EEKKKADEAKKKAEEAKKADEA---KKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEA 1495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 468 KVQESLVEDCQVKLSKAEE--KIQQMKEEFQKKEAEWELSREELKREAEERLASmflELREKAESEKLSIISRFEH--RE 543
Cdd:PTZ00121 1496 KKKADEAKKAAEAKKKADEakKAEEAKKADEAKKAEEAKKADEAKKAEEKKKAD---ELKKAEELKKAEEKKKAEEakKA 1572
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 544 SSMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTrddllpcsqcGQEPAMAQEEKNGA-LLREKEDCALQLLMAQNRF 622
Cdd:PTZ00121 1573 EEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK----------AEEAKKAEEAKIKAeELKKAEEEKKKVEQLKKKE 1642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 623 LEERKEIMEKFAKEQDAFLRDAQEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASSESRRQALLETHVAElQV 702
Cdd:PTZ00121 1643 AEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE-EL 1721
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 703 KHNAEISALEKRHLSNLDELESCYVADVQTIRDEH---QQALELLRAELEEQLQKKESCHREMLTQELENLKRQHAEELQ 779
Cdd:PTZ00121 1722 KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKkkiAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801
|
....*
gi 1039732403 780 SVRDS 784
Cdd:PTZ00121 1802 DIFDN 1806
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
2214-2503 |
7.36e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 64.96 E-value: 7.36e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2214 RRQVELLAYKVEqekcianELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAV 2293
Cdd:COG1196 224 ELEAELLLLKLR-------ELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2294 LDGKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQK 2373
Cdd:COG1196 297 LARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE 376
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2374 ELQIEASRCEALLAQEKGQLSELQKSLEAERSrslELSEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTR 2453
Cdd:COG1196 377 AEEELEELAEELLEALRAAAELAAQLEELEEA---EEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE 453
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2454 ALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEH 2503
Cdd:COG1196 454 LEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADY 503
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
118-631 |
9.40e-10 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 64.57 E-value: 9.40e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 118 LELEALRLSLNNMHTAQLELTQANLQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQE 197
Cdd:COG1196 227 AELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 198 AAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQ-SEMEGLQSQFQKELSEQKVELEKIFQAKHEAEV 276
Cdd:COG1196 307 LEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEElEEAEAELAEAEEALLEAEAELAEAEEELEELAE 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 277 SLKNLEAQHQAAIKKLQEDLQSEHcQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLESMKTNREE 356
Cdd:COG1196 387 ELLEALRAAAELAAQLEELEEAEE-ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELL 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 357 LNGSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQE--DLTVFQQRLQEAREDSL- 433
Cdd:COG1196 466 AELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAvaVLIGVEAAYEAALEAALa 545
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 434 --------ESTEISSSCV--LPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWE 503
Cdd:COG1196 546 aalqnivvEDDEVAAAAIeyLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGR 625
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 504 LSREELKREAEERLASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDL 583
Cdd:COG1196 626 TLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEE 705
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 1039732403 584 lpcsqcgQEPAMAQEEKNGALLREKEDCALQLLMAQNRFLEERKEIME 631
Cdd:COG1196 706 -------ERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEE 746
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1057-1708 |
2.17e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.54 E-value: 2.17e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1057 KQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESE 1136
Cdd:TIGR02168 288 KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAEL 367
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1137 QQLILELEDLRKQLQQAARELLTLKEEKSVLWNQ-----------KETLTNEAKEREAGSPVLPAHKDTALQEEVESLTR 1205
Cdd:TIGR02168 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEierlearlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELEE 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1206 VQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEfq 1285
Cdd:TIGR02168 448 ELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVL-- 525
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1286 QEIQRLEGQLRQAARPRPPGPRDSQCVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQR 1365
Cdd:TIGR02168 526 SELISVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAK 605
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1366 QMVQLQEELEKQKKSM-------EELKEK-EILKQENMGDLLLT----TVSRSG-------------------LDEAGCP 1414
Cdd:TIGR02168 606 DLVKFDPKLRKALSYLlggvlvvDDLDNAlELAKKLRPGYRIVTldgdLVRPGGvitggsaktnssilerrreIEELEEK 685
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1415 MLPQGSSSRGPEAQPDVTERALLQHENEV-------------VHRRNSEIDELKSLIENLQENQRQLQK---DKAEEIEQ 1478
Cdd:TIGR02168 686 IEELEEKIAELEKALAELRKELEELEEELeqlrkeleelsrqISALRKDLARLEAEVEQLEERIAQLSKeltELEAEIEE 765
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1479 LHEVIEKLQSELSLMGPKVHEVSDPQAG-------------SLHSELACLRGEGLGGQALRSELQAAQAAKEVFGQLLAD 1545
Cdd:TIGR02168 766 LEERLEEAEEELAEAEAEIEELEAQIEQlkeelkalrealdELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEE 845
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1546 QAHGHSQALEALQQRLQDAEEVAARHLAELEHCVALREAEVEAMA---SQIQEFAATLKAKEAIIEQRDLEIDAVNKWKV 1622
Cdd:TIGR02168 846 QIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALAllrSELEELSEELRELESKRSELRRELEELREKLA 925
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1623 SHSLELEAILLALAHFRHALEQQTCATPDEPPELRQLRVQCARlshqlqvlyrpflKCRMQLDQHQPHVASIGCANPCAD 1702
Cdd:TIGR02168 926 QLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEE-------------EARRRLKRLENKIKELGPVNLAAI 992
|
....*.
gi 1039732403 1703 DELEQE 1708
Cdd:TIGR02168 993 EEYEEL 998
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
667-1489 |
4.38e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 62.38 E-value: 4.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 667 RMELETKHRSELTEQLASSESRRQALlETHVAELQVK---HNAEISALEKRhlsnLDELESCYVADVQTIRD-EHQQALE 742
Cdd:TIGR02168 233 RLEELREELEELQEELKEAEEELEEL-TAELQELEEKleeLRLEVSELEEE----IEELQKELYALANEISRlEQQKQIL 307
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 743 LLRAELEEQLQKKESCHREMLTQELENLKRQHA------EELQSVRDSLRMEMSAQH--IENGKGPAADLQGAHQQQDPA 814
Cdd:TIGR02168 308 RERLANLERQLEELEAQLEELESKLDELAEELAeleeklEELKEELESLEAELEELEaeLEELESRLEELEEQLETLRSK 387
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 815 MALHNEGhllVEDGDAVLRSVDAEGllhqagpQELGDAHTVEMQKSQAELAKPQE--LQASQDQVAQVRDKVFLLNRELE 892
Cdd:TIGR02168 388 VAQLELQ---IASLNNEIERLEARL-------ERLEDRRERLQQEIEELLKKLEEaeLKELQAELEELEEELEELQEELE 457
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 893 ECRAELEQLQQRRERENQEgttlICMLRADLELAQGEGKALRDALRRLLDlFGDTLKAAVTLKSRISERAGLLLDHEDAA 972
Cdd:TIGR02168 458 RLEEALEELREELEEAEQA----LDAAERELAQLQARLDSLERLQENLEG-FSEGVKALLKNQSGLSGILGVLSELISVD 532
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 973 DTSDARLAAAALGDMWS------DEGLLEIDRTLPEGAETSSVCEISSHVCESFFISPENTLDCEQPIRRVYQSL---ST 1043
Cdd:TIGR02168 533 EGYEAAIEAALGGRLQAvvvenlNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLvkfDP 612
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1044 AVEGLLEMALDSSkqleearqlhRCVERefrhrneeMAQAMQKQQELLERLREESAAKDRLalelhTAKGLLEG------ 1117
Cdd:TIGR02168 613 KLRKALSYLLGGV----------LVVDD--------LDNALELAKKLRPGYRIVTLDGDLV-----RPGGVITGgsaktn 669
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1118 -----FKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEKSVLwNQKETLTNEAKEREAGSPVLPAHK 1192
Cdd:TIGR02168 670 ssileRRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL-SRQISALRKDLARLEAEVEQLEER 748
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1193 DTALQEEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQFMDD 1272
Cdd:TIGR02168 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLES 828
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1273 QAAEREHEreefQQEIQRLEGQLRQAArprppgprdSQCVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEE 1352
Cdd:TIGR02168 829 LERRIAAT----ERRLEDLEEQIEELS---------EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1353 IKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEIlKQENMGDLLLTTVSRSGLDEAGcpmlPQGSSSRGPEAQPDVT 1432
Cdd:TIGR02168 896 LEELSEELRELESKRSELRRELEELREKLAQLELRLE-GLEVRIDNLQERLSEEYSLTLE----EAEALENKIEDDEEEA 970
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1433 ERALLQHENEVVHRRN---SEIDELKSLIENLQENQRQlQKDKAEEIEQLHEVIEKLQSE 1489
Cdd:TIGR02168 971 RRRLKRLENKIKELGPvnlAAIEEYEELKERYDFLTAQ-KEDLTEAKETLEEAIEEIDRE 1029
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
345-1164 |
9.67e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 61.23 E-value: 9.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 345 SQLESMKTN-------REELNGSWDPVLAQASHLE-------ELEHLRSGFAQQQQQERAQHESELEHLrvyfEKKLKDA 410
Cdd:TIGR02168 179 RKLERTRENldrlediLNELERQLKSLERQAEKAErykelkaELRELELALLVLRLEELREELEELQEE----LKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 411 EKTYQEDLTVFQQRLQEARedsLESTEISSSCVLPEETSGREGKEPPDpLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQ 490
Cdd:TIGR02168 255 LEELTAELQELEEKLEELR---LEVSELEEEIEELQKELYALANEISR-LEQQKQILRERLANLERQLEELEAQLEELES 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 491 MKEEFQKKEAEWELSREELKREAEErLASMFLELREKAESEKLSIISRFEHREssmrhlqdQQAAQILDLERSLMEQQGH 570
Cdd:TIGR02168 331 KLDELAEELAELEEKLEELKEELES-LEAELEELEAELEELESRLEELEEQLE--------TLRSKVAQLELQIASLNNE 401
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 571 LRQLEQELTRddllpcsqcgqepamaQEEKNGALLREKEDCALQLLMAQNRFLEERKEIMEKFAKEQDAFLRDAQEKhnh 650
Cdd:TIGR02168 402 IERLEARLER----------------LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEA--- 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 651 elQLLQQGHQQQLLALRMELETKHRsELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELESCYVADV 730
Cdd:TIGR02168 463 --LEELREELEEAEQALDAAERELA-QLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAI 539
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 731 QTIRDEHQQA---LELLRAELEEQLQKKESCHREMLTQelenLKRQHAEELQSVRDSLRMEmsaqhIENGKGPAADLQGA 807
Cdd:TIGR02168 540 EAALGGRLQAvvvENLNAAKKAIAFLKQNELGRVTFLP----LDSIKGTEIQGNDREILKN-----IEGFLGVAKDLVKF 610
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 808 HQQQDPAMALHNEGHLLVEDGDavlrsvDAEGLLHQAGPQELG---DAHTV---------EMQKSQAELAKPQELQASQD 875
Cdd:TIGR02168 611 DPKLRKALSYLLGGVLVVDDLD------NALELAKKLRPGYRIvtlDGDLVrpggvitggSAKTNSSILERRREIEELEE 684
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 876 QVAQVRDKVFLLNRELEECRAELEQLQQRRE---RENQEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAAV 952
Cdd:TIGR02168 685 KIEELEEKIAELEKALAELRKELEELEEELEqlrKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIE 764
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 953 TLKSRISERAGLLLDHEDAADTSDARLAAAAlgdmwsdEGLLEIDRTLPEGAETSSVCEISSHVCESFFISPENTL-DCE 1031
Cdd:TIGR02168 765 ELEERLEEAEEELAEAEAEIEELEAQIEQLK-------EELKALREALDELRAELTLLNEEAANLRERLESLERRIaATE 837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1032 QPIRRVYQSLSTAVEGLLEMAL----------DSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAK 1101
Cdd:TIGR02168 838 RRLEDLEEQIEELSEDIESLAAeieeleelieELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRREL 917
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1102 DRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEK 1164
Cdd:TIGR02168 918 EELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENK 980
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1038-1610 |
1.07e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 61.23 E-value: 1.07e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1038 YQSLSTAVEGL-LEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRlalELHTAKGLLE 1116
Cdd:TIGR02168 215 YKELKAELRELeLALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEE---EIEELQKELY 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1117 GFKVEKVDLQEALGKKEESEQQLI-------LELEDLRKQLQQAARELLTLKEEKSVLWNQKETLTNEAKEREAGSPVLP 1189
Cdd:TIGR02168 292 ALANEISRLEQQKQILRERLANLErqleeleAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELE 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1190 AHKDtALQEEVESLTRvqweSRKQSEKDRATLLSQMRVLESELEDQLVQHrgcAQLAEEVATLKQQLAALDKH-LRSQRQ 1268
Cdd:TIGR02168 372 SRLE-ELEEQLETLRS----KVAQLELQIASLNNEIERLEARLERLEDRR---ERLQQEIEELLKKLEEAELKeLQAELE 443
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1269 FMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPRDSQcvQLDEEVELLqEKLREKLDGFNELVI-----KKDFAD 1343
Cdd:TIGR02168 444 ELEEELEELQEELERLEEALEELREELEEAEQALDAAERELA--QLQARLDSL-ERLQENLEGFSEGVKallknQSGLSG 520
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1344 -----QQLLIQEEEIKRLEETNASIQRQMVqLQEELEKQKKSMEELKEKEILKqenMGDLLLTTVSRSGLDEAGCPMLPQ 1418
Cdd:TIGR02168 521 ilgvlSELISVDEGYEAAIEAALGGRLQAV-VVENLNAAKKAIAFLKQNELGR---VTFLPLDSIKGTEIQGNDREILKN 596
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1419 GSSSRG---------PEAQPDVT-----------------ERALLQHEN-------EVVHR-----------------RN 1448
Cdd:TIGR02168 597 IEGFLGvakdlvkfdPKLRKALSyllggvlvvddldnaleLAKKLRPGYrivtldgDLVRPggvitggsaktnssileRR 676
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1449 SEIDELKSLIENLQENQRQLQK---DKAEEIEQLHEVIEKLQSELSLMGPKVHEvsdpqagsLHSELACLRGEGLGGQAL 1525
Cdd:TIGR02168 677 REIEELEEKIEELEEKIAELEKalaELRKELEELEEELEQLRKELEELSRQISA--------LRKDLARLEAEVEQLEER 748
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1526 RSELQAAQAAKEVFGQLLADQAHGHSQALEALQQRLQDAEEVAARHLAELEhcvALREA--EVEAMASQIQEFAATLKAK 1603
Cdd:TIGR02168 749 IAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELK---ALREAldELRAELTLLNEEAANLRER 825
|
....*..
gi 1039732403 1604 EAIIEQR 1610
Cdd:TIGR02168 826 LESLERR 832
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1069-1398 |
1.56e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.85 E-value: 1.56e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1069 VEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQlilELEDLRK 1148
Cdd:TIGR02169 668 FSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKE---RLEELEE 744
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1149 QLQQAARELLTLKEEKSVLWNQKETLTNE-AKEREAgspvLPAHKDTALQEEVESLTRvqwESRKQsEKDRATLLSQMRV 1227
Cdd:TIGR02169 745 DLSSLEQEIENVKSELKELEARIEELEEDlHKLEEA----LNDLEARLSHSRIPEIQA---ELSKL-EEEVSRIEARLRE 816
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1228 LESELEDQlvqHRGCAQLAEEVATLKQQLAALDkhlrSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAarprppgpr 1307
Cdd:TIGR02169 817 IEQKLNRL---TLEKEYLEKEIQELQEQRIDLK----EQIKSIEKEIENLNGKKEELEEELEELEAALRDL--------- 880
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1308 DSQCVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEkQKKSMEELKEK 1387
Cdd:TIGR02169 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPE-EELSLEDVQAE 959
|
330
....*....|.
gi 1039732403 1388 EILKQENMGDL 1398
Cdd:TIGR02169 960 LQRVEEEIRAL 970
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
142-529 |
2.94e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 59.99 E-value: 2.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 142 LQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLKQ 221
Cdd:pfam02463 634 LTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELK 713
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 222 DWESERELCLENLRQELSLKHQSEMEGLQSQF-QKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAA--IKKLQEDLQS 298
Cdd:pfam02463 714 KLKLEAEELLADRVQEAQDKINEELKLLKQKIdEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREktEKLKVEEEKE 793
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 299 EHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLwSQLESMKTNREELNgswdpvlaqashlEELEHLR 378
Cdd:pfam02463 794 EKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEEL-ALELKEEQKLEKLA-------------EEELERL 859
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 379 SGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLESTEISSSCVLPEETSGREGKEPPD 458
Cdd:pfam02463 860 EEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEE 939
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 459 PLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQ-------MKEEFQKKEAEWELSREELKREAEERLASMFLELREKAE 529
Cdd:pfam02463 940 LLLEEADEKEKEENNKEEEEERNKRLLLAKEElgkvnlmAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQ 1017
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1131-1622 |
4.86e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 59.18 E-value: 4.86e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1131 KKEESEQQL------ILELEDLRKQLqqaARELLTLKEEKSVLwNQKETLTNEAKEREAgspvlpahkdtalqeeveSLT 1204
Cdd:COG1196 173 RKEEAERKLeateenLERLEDILGEL---ERQLEPLERQAEKA-ERYRELKEELKELEA------------------ELL 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1205 RVQWESRkqsEKDRATLLSQMRVLESELEDQLVQHRgcaQLAEEVATLKQQLAALDKHLRSQRQfmddqaaerehEREEF 1284
Cdd:COG1196 231 LLKLREL---EAELEELEAELEELEAELEELEAELA---ELEAELEELRLELEELELELEEAQA-----------EEYEL 293
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1285 QQEIQRLEGQLRQAARPRppGPRDSQCVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQ 1364
Cdd:COG1196 294 LAELARLEQDIARLEERR--RELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE 371
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1365 RQMVQLQEELEKQKKSMEELKEKEILKQENMGDLLLTTVSRSGLDEAgcpMLPQGSSSRGPEAQPDVTERALLQHENEVV 1444
Cdd:COG1196 372 AELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLER---LEEELEELEEALAELEEEEEEEEEALEEAA 448
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1445 HRRNSEIDELKSLIENLQENQRQLQKDKAEEIEQLHEVIEKLQSELSLMGPKVHEVSDPQAGSLHSELACLRG-EGLGGQ 1523
Cdd:COG1196 449 EEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGlAGAVAV 528
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1524 ALRSELQAAQAAKEVFGQLLADQAHGHSQALEALQQRLQDA----------EEVAARHLAELEHCVALREAEVEAMASQI 1593
Cdd:COG1196 529 LIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAkagratflplDKIRARAALAAALARGAIGAAVDLVASDL 608
|
490 500
....*....|....*....|....*....
gi 1039732403 1594 QEFAATLKAKEAIIEQRDLEIDAVNKWKV 1622
Cdd:COG1196 609 READARYYVLGDTLLGRTLVAARLEAALR 637
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2203-2516 |
8.44e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.14 E-value: 8.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2203 EARGSQREHQLRRQVE-----LLAYKVEQEKCIANELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHackqentSLL 2277
Cdd:TIGR02168 208 QAEKAERYKELKAELRelelaLLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVS-------ELE 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2278 ESLDKVQQEVLRLRAVLDGKEKELKVVLEELESergkgqalQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQL 2357
Cdd:TIGR02168 281 EEIEELQKELYALANEISRLEQQKQILRERLAN--------LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE 352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2358 CVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLSELQKSLEAERSRSLELSEALqherlltEQLSRNSQEACARQET 2437
Cdd:TIGR02168 353 LESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARL-------ERLEDRRERLQQEIEE 425
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039732403 2438 QVQHALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQErcVELRREKERELEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:TIGR02168 426 LLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE--LEEAEQALDAAERELAQLQARLDSLERLQENLE 502
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
156-430 |
2.13e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 56.99 E-value: 2.13e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 156 MLNGRRAQELALLQSRQQcELELLREQhaREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLKQDWE---SERELCLE 232
Cdd:TIGR02168 660 VITGGSAKTNSSILERRR-EIEELEEK--IEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSrqiSALRKDLA 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 233 NLRQ--ELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEA---QHQAAIKKLQEDLQSEHCQY--LQ 305
Cdd:TIGR02168 737 RLEAevEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAqieQLKEELKALREALDELRAELtlLN 816
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 306 DLEQKFREKEKAKELELETLQASYEDLKAQS---QEEIRLLWSQLESMKTNREELNGSWDPVLAQASHLEELEHLRSGFA 382
Cdd:TIGR02168 817 EEAANLRERLESLERRIAATERRLEDLEEQIeelSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSEL 896
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 1039732403 383 QQQQQERAQHESELEHLRVYFEkKLKDAEKTYQEDLTVFQQRLQEARE 430
Cdd:TIGR02168 897 EELSEELRELESKRSELRRELE-ELREKLAQLELRLEGLEVRIDNLQE 943
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2195-2434 |
1.31e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.61 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2195 LCAALTSTEARGSQREHQLRRQVELLAYKVEQEkcianELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENT 2274
Cdd:COG4942 5 LLLALLLALAAAAQADAAAEAEAELEQLQQEIA-----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELA 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2275 SLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKGQAL---QAQQEEQQLRYLQREGQSSRALEELKLSLEKQL 2351
Cdd:COG4942 80 ALEAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLAlllSPEDFLDAVRRLQYLKYLAPARREQAEELRADL 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2352 AQNNQLCVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLSELQKSLEAERSRSLELSEALQHERLLTEQLSRNSQEA 2431
Cdd:COG4942 160 AELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
...
gi 1039732403 2432 CAR 2434
Cdd:COG4942 240 AER 242
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2172-2431 |
1.37e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.29 E-value: 1.37e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2172 AEIQALRAQLRMTHLQNQEKLQQLCAALTSTEARGSQRE------HQLRRQVELLAYKVEQEKCIANELQKTLSKEQETA 2245
Cdd:TIGR02168 677 REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEqlrkelEELSRQISALRKDLARLEAEVEQLEERIAQLSKEL 756
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2246 SDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKvvleELESERGKgqalqaqQEEQ 2325
Cdd:TIGR02168 757 TELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELT----LLNEEAAN-------LRER 825
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2326 QLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQKELQI---EASRCEALLAQEKGQLSELQKSLEA 2402
Cdd:TIGR02168 826 LESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAllnERASLEEALALLRSELEELSEELRE 905
|
250 260
....*....|....*....|....*....
gi 1039732403 2403 ERSRSLELSEALQHERLLTEQLSRNSQEA 2431
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGL 934
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
242-915 |
2.05e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 53.99 E-value: 2.05e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 242 HQSEMEGLQ---SQFQKELSEQKVELEKIFQAKHEAEVSLKNL------EAQHQAAIKKLQEDLQSEHCQYLQDLeQKFR 312
Cdd:PTZ00121 1065 HVGQDEGLKpsyKDFDFDAKEDNRADEATEEAFGKAEEAKKTEtgkaeeARKAEEAKKKAEDARKAEEARKAEDA-RKAE 1143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 313 EKEKAKELELETLQASYEDlkAQSQEEIRLLWSQLESMKTNREElngswdpVLAQASHLEELEHLRsgfaQQQQQERAQH 392
Cdd:PTZ00121 1144 EARKAEDAKRVEIARKAED--ARKAEEARKAEDAKKAEAARKAE-------EVRKAEELRKAEDAR----KAEAARKAEE 1210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 393 ESELEHLRVYFEKKLKDAEKtyqedltvfqqRLQEAREDSLES---TEISSSCVLPEETSGREGKEPPDPLDLQLGQPKV 469
Cdd:PTZ00121 1211 ERKAEEARKAEDAKKAEAVK-----------KAEEAKKDAEEAkkaEEERNNEEIRKFEEARMAHFARRQAAIKAEEARK 1279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 470 QESLVEDCQVK----LSKAEEKiqqMKEEFQKKEAEWELSREELKREAEERLASMfLELREKAESEKLSIISRFEHRESS 545
Cdd:PTZ00121 1280 ADELKKAEEKKkadeAKKAEEK---KKADEAKKKAEEAKKADEAKKKAEEAKKKA-DAAKKKAEEAKKAAEAAKAEAEAA 1355
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 546 MRHLQ-DQQAAQILDLERSLMEQQGH-LRQLEQELTRDDLLPcSQCGQEPAMAQEEKNGALLREKEDcALQLLMAQNRFL 623
Cdd:PTZ00121 1356 ADEAEaAEEKAEAAEKKKEEAKKKADaAKKKAEEKKKADEAK-KKAEEDKKKADELKKAAAAKKKAD-EAKKKAEEKKKA 1433
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 624 EERKEIMEKfAKEQDAFLRDAQEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASsESRRQALLETHVAELQVK 703
Cdd:PTZ00121 1434 DEAKKKAEE-AKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE-EAKKKADEAKKAAEAKKK 1511
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 704 HNAEISALEKRHLSNLDELESCYVADVQTIRDEHQQALELLRAELEEQLQKKESCHREMLTQELENLKRQHAEELQSVRD 783
Cdd:PTZ00121 1512 ADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEE 1591
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 784 SLRMEMSAQHIENGKGPAADLQGAHQQQDPAMALHNEGHLLVEDGDAVLRSVDAEGLLHQAGPQELGDAHTVEMQKSQAE 863
Cdd:PTZ00121 1592 ARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|..
gi 1039732403 864 LAKPQELQASQDQVAQvRDKVFLLNRELEECRaELEQLQQRRERENQEGTTL 915
Cdd:PTZ00121 1672 EDKKKAEEAKKAEEDE-KKAAEALKKEAEEAK-KAEELKKKEAEEKKKAEEL 1721
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
117-577 |
3.70e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 52.92 E-value: 3.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 117 SLELEALRLSLNNMHTAQLELTQANLQKEKETALTELREMLNGRRAQeLALLQSRQQCELELLREQHAREKEEMALRSGQ 196
Cdd:pfam12128 252 TLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQ-WKEKRDELNGELSAADAAVAKDRSELEALEDQ 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 197 EAAELKEKLrsemEKNAQTIETLKQdWESErelcLENLRQELSL---KHQSEMEGLQSQFQKELSEQKVELEKIFQAKHE 273
Cdd:pfam12128 331 HGAFLDADI----ETAAADQEQLPS-WQSE----LENLEERLKAltgKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAK 401
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 274 AEVSLKNLEAQHQAAIKKLQEDLQSEHCQYLQDL-EQKFREKEKAKELELETLQASY-EDLK---AQSQEEIRLLWSQLE 348
Cdd:pfam12128 402 IREARDRQLAVAEDDLQALESELREQLEAGKLEFnEEEYRLKSRLGELKLRLNQATAtPELLlqlENFDERIERAREEQE 481
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 349 SMKTNREELNGSwdpvLAQASHLEELEHLRSGFAQQQQQERAQHESELEH--------LRVYFEK----------KLKDA 410
Cdd:pfam12128 482 AANAEVERLQSE----LRQARKRRDQASEALRQASRRLEERQSALDELELqlfpqagtLLHFLRKeapdweqsigKVISP 557
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 411 EKTYQEDLTVFQQRLQEAREDS-------LESTEISSSCVLPEETSGREGKeppdpldlqlgqpkvQESLVEDCQVKLSK 483
Cdd:pfam12128 558 ELLHRTDLDPEVWDGSVGGELNlygvkldLKRIDVPEWAASEEELRERLDK---------------AEEALQSAREKQAA 622
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 484 AEEKIQQMKEEFQKKEAEWELSREELKrEAEERLASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERS 563
Cdd:pfam12128 623 AEEQLVQANGELEKASREETFARTALK-NARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAW 701
|
490
....*....|....
gi 1039732403 564 LMEQQGHLRQLEQE 577
Cdd:pfam12128 702 LEEQKEQKREARTE 715
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
145-519 |
3.86e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 3.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 145 EKETALTELREMlnGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLKQDWE 224
Cdd:TIGR02169 171 KKEKALEELEEV--EENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLAS 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 225 SEREL---------------CLENLRQELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAI 289
Cdd:TIGR02169 249 LEEELeklteeiselekrleEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLE 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 290 KKLQEdLQSEHCQYLQDLEQKFREKEKAKElELETLQASYEDLKAQSQE---EIRLLWSQLESMKTNREELNGSWDPVLA 366
Cdd:TIGR02169 329 AEIDK-LLAEIEELEREIEEERKRRDKLTE-EYAELKEELEDLRAELEEvdkEFAETRDELKDYREKLEKLKREINELKR 406
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 367 QASHLEELEHLRSGfaqqqqqERAQHESELEHLRvyfekklkdaektyqEDLTVFQQRLQEAREdslesteissscvlpe 446
Cdd:TIGR02169 407 ELDRLQEELQRLSE-------ELADLNAAIAGIE---------------AKINELEEEKEDKAL---------------- 448
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 447 ETSGREGKeppdpldlqlgqpkvQESLVEDcqvkLSKAEEKIQQMKEEFQKKEAEWELSREELKREAEERLAS 519
Cdd:TIGR02169 449 EIKKQEWK---------------LEQLAAD----LSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARAS 502
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2187-2506 |
7.16e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.07 E-value: 7.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2187 QNQEKLQQLCAALTSTEARGSQREHQLRRQVELlayKVEQEKCIANELQKT--LSKEQETASDVRKRLVVEQNAVQDLKS 2264
Cdd:PTZ00121 1507 EAKKKADEAKKAEEAKKADEAKKAEEAKKADEA---KKAEEKKKADELKKAeeLKKAEEKKKAEEAKKAEEDKNMALRKA 1583
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2265 ELhaCKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESErgkgqalqAQQEEQQLRYLQREGQSSRALEELK 2344
Cdd:PTZ00121 1584 EE--AKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA--------EEEKKKVEQLKKKEAEEKKKAEELK 1653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2345 LSLEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRCEALL--AQEKGQLSELQKSLEAERSRSLELSEALQHERLLTE 2422
Cdd:PTZ00121 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKkeAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2423 QLSRNSQEacarqetqvqhallRKLKAEKTRALELEAmlEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDE 2502
Cdd:PTZ00121 1734 EAKKEAEE--------------DKKKAEEAKKDEEEK--KKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
|
....
gi 1039732403 2503 HKIE 2506
Cdd:PTZ00121 1798 KKIK 1801
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
2209-2516 |
9.58e-06 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 51.29 E-value: 9.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2209 REHQLRRQVELLAYKVEQEKCIaNELQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVL 2288
Cdd:pfam07111 312 REKVFALMVQLKAQDLEHRDSV-KQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEVERMSAKGLQMELSRAQEARR 390
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2289 RLRAVLDGKEKELKVVLEELESERGKGQALQAQQEEQQLR---------YLQREGQSSRALEELKLSLEKQLAQ------ 2353
Cdd:pfam07111 391 RQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQAVARipslsnrlsYAVRKVHTIKGLMARKVALAQLRQEscpppp 470
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2354 -----NNQLCVALKHERAAKDNLQKELQI-------EASRCEALLAQEKGQLSELQKSLEAERSRS--------LELSEA 2413
Cdd:pfam07111 471 pappvDADLSLELEQLREERNRLDAELQLsahliqqEVGRAREQGEAERQQLSEVAQQLEQELQRAqeslasvgQQLEVA 550
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2414 LQHERLLTEQLSRNSQEACARQETQVQhALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERE 2493
Cdd:pfam07111 551 RQGQQESTEEAASLRQELTQQQEIYGQ-ALQEKVAEVETRLREQLSDTKRRLNEARREQAKAVVSLRQIQHRATQEKERN 629
|
330 340
....*....|....*....|...
gi 1039732403 2494 LEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:pfam07111 630 QELRRLQDEARKEEGQRLARRVQ 652
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
119-357 |
1.00e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 51.28 E-value: 1.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 119 ELEALRLSLNNMHTAQLELTQANLQKEKETALTEL---REMLNGRRAQelallqsrqqcELELLREQHAREKEEMalRSG 195
Cdd:pfam17380 349 ELERIRQEERKRELERIRQEEIAMEISRMRELERLqmeRQQKNERVRQ-----------ELEAARKVKILEEERQ--RKI 415
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 196 QEAAELKEKLRSEMEkNAQTIETLKQDWESERElcLENLRQElSLKHQSEMEGLQSQfQKELSEQKVELEKIFQAKHEAE 275
Cdd:pfam17380 416 QQQKVEMEQIRAEQE-EARQREVRRLEEERARE--MERVRLE-EQERQQQVERLRQQ-EEERKRKKLELEKEKRDRKRAE 490
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 276 -----VSLKNLEAQHQAAI------KKLQEDLQSEHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEirllw 344
Cdd:pfam17380 491 eqrrkILEKELEERKQAMIeeerkrKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEER----- 565
|
250
....*....|...
gi 1039732403 345 SQLESMKTNREEL 357
Cdd:pfam17380 566 SRLEAMEREREMM 578
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
309-907 |
1.01e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.48 E-value: 1.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 309 QKFRE-KEKAKELELETLQASYEDLKAQS---QEEIRLLWSQLESMKTNREELNGSWDPVLAQASHLEELEHLRSGFAQQ 384
Cdd:COG1196 213 ERYRElKEELKELEAELLLLKLRELEAELeelEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYE 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 385 QQQERAQHESELEHLrvyfEKKLKDAEKTYQEDLtvfQQRLQEAREDSLESTEISSscvlpeetsgregkeppdpLDLQL 464
Cdd:COG1196 293 LLAELARLEQDIARL----EERRRELEERLEELE---EELAELEEELEELEEELEE-------------------LEEEL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 465 gqpKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSREELKREAEERLAsmflelREKAESEKLSIISRFEHRES 544
Cdd:COG1196 347 ---EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAE------LAAQLEELEEAEEALLERLE 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 545 SMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDLLpcsQCGQEPAMAQEEKNGALLREKEDCALQLLMAQNRFLE 624
Cdd:COG1196 418 RLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEE---EEALLELLAELLEEAALLEAALAELLEELAEAAARLL 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 625 ERKEIMEKFAKEQDAFLRDAQEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASSESRRQALLETHVAELQVKH 704
Cdd:COG1196 495 LLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRA 574
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 705 NAEISALEKRHLSNLDELESCYVADVQTIRDEHQQALELLRAELEEQLQKKESCHREMLTqeLENLKRQHAEELQSVRDS 784
Cdd:COG1196 575 TFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEA--ALRRAVTLAGRLREVTLE 652
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 785 LRMEMSAQHIENGKGPAADLQGAHQQQDPAMALHNEGHLLVEDGDAVLRSVDAEGLLHQAGPQELGDAHTVEMQKSQAEL 864
Cdd:COG1196 653 GEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEA 732
|
570 580 590 600
....*....|....*....|....*....|....*....|....*.
gi 1039732403 865 AKPQ---ELQASQDQVAQVRDKVFLLNRELEECRAELEQLQQRRER 907
Cdd:COG1196 733 EREElleELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2203-2521 |
1.11e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 51.30 E-value: 1.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2203 EARGSQREHQLRRQVELLAYKVEQEKCIANELQKTlSKEQETASDVRKrlVVEQNAVQDLKSELHACKQENTSLLESLDK 2282
Cdd:PTZ00121 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA-AEAKKKADEAKK--AEEAKKADEAKKAEEAKKADEAKKAEEKKK 1547
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2283 VQQevlrLRavldgKEKELKVVLEELESERGKGQALQAQQEEQQLRYLqREGQSSRALEELKLSLEKQLAQNNQLCVA-- 2360
Cdd:PTZ00121 1548 ADE----LK-----KAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEA-KKAEEARIEEVMKLYEEEKKMKAEEAKKAee 1617
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2361 ----------LKHERAAKDNLQKELQIEASRCEALLAQE---KGQLSELQKSLEAERSRSLELSEALQHERLLTEQLSRN 2427
Cdd:PTZ00121 1618 akikaeelkkAEEEKKKVEQLKKKEAEEKKKAEELKKAEeenKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKE 1697
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2428 SQEACARQEtqvqhalLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQR-QRDEHKIE 2506
Cdd:PTZ00121 1698 AEEAKKAEE-------LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHlKKEEEKKA 1770
|
330
....*....|....*
gi 1039732403 2507 QLQRLVRELRWKEEV 2521
Cdd:PTZ00121 1771 EEIRKEKEAVIEEEL 1785
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1085-1383 |
1.12e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 51.22 E-value: 1.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1085 QKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLIL-ELEDLRKQLQQAARELLTLKEE 1163
Cdd:TIGR02169 173 EKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLkEKEALERQKEAIERQLASLEEE 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1164 KSVLWNQKETLTNEAKEREA------------GSPVLPAHKDT--ALQEEVESLTRVQWES----------RKQSEKDRA 1219
Cdd:TIGR02169 253 LEKLTEEISELEKRLEEIEQlleelnkkikdlGEEEQLRVKEKigELEAEIASLERSIAEKereledaeerLAKLEAEID 332
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1220 TLLSQMRVLESELEDQLVQHRGC----AQLAEEVATLKQQLAALDKHLRSQRQFMDD----------QAAEREHEREEFQ 1285
Cdd:TIGR02169 333 KLLAEIEELEREIEEERKRRDKLteeyAELKEELEDLRAELEEVDKEFAETRDELKDyrekleklkrEINELKRELDRLQ 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1286 QEIQRLEGQLRQAaRPRPPGPRDSQcVQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQR 1365
Cdd:TIGR02169 413 EELQRLSEELADL-NAAIAGIEAKI-NELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQR 490
|
330
....*....|....*...
gi 1039732403 1366 QMvqlqEELEKQKKSMEE 1383
Cdd:TIGR02169 491 EL----AEAEAQARASEE 504
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1131-1565 |
1.48e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 50.83 E-value: 1.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1131 KKEESEQQLILELEDLrKQLQQAARELltlkeEKsvlwnQKETLTNEAKEreagspvlpAHKDTALQEEVESLTRVQWES 1210
Cdd:TIGR02168 173 RRKETERKLERTRENL-DRLEDILNEL-----ER-----QLKSLERQAEK---------AERYKELKAELRELELALLVL 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1211 RKQSEKDRatlLSQMRVLESELEDQlvqhrgCAQLAEEVATLKQQLAALdkhlRSQRQFMDDQAAEREHEREEFQQEIQR 1290
Cdd:TIGR02168 233 RLEELREE---LEELQEELKEAEEE------LEELTAELQELEEKLEEL----RLEVSELEEEIEELQKELYALANEISR 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1291 LEGQLRqaarprppgprdsqcvqldeeveLLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQL 1370
Cdd:TIGR02168 300 LEQQKQ-----------------------ILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL 356
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1371 QEELEKQKKSMEELKEKEILKQENMGDLllttvsrsgldeagcpmlpqgsssrgpeaqpdvtERALLQHENEvVHRRNSE 1450
Cdd:TIGR02168 357 EAELEELEAELEELESRLEELEEQLETL----------------------------------RSKVAQLELQ-IASLNNE 401
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1451 IDELKSLIENLQENQRQLQKDKAEEIEQLHEVIEKLQSElslmgpkvhevsdpQAGSLHSELACLRGEGLGGQALRSELQ 1530
Cdd:TIGR02168 402 IERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQA--------------ELEELEEELEELQEELERLEEALEELR 467
|
410 420 430
....*....|....*....|....*....|....*
gi 1039732403 1531 AAQAAKEVFGQLLADQAHGHSQALEALQQRLQDAE 1565
Cdd:TIGR02168 468 EELEEAEQALDAAERELAQLQARLDSLERLQENLE 502
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
134-501 |
1.74e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 50.83 E-value: 1.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 134 QLELTQANLQKeKETALTELREMLNGRRAQ-ELAllqsrqqcelELLREQHAREKE---EMALRSGQEAAELKEKLRSEM 209
Cdd:TIGR02168 180 KLERTRENLDR-LEDILNELERQLKSLERQaEKA----------ERYKELKAELRElelALLVLRLEELREELEELQEEL 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 210 EKNAQTIETLkQDWESERELCLENLRQElslkhQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAI 289
Cdd:TIGR02168 249 KEAEEELEEL-TAELQELEEKLEELRLE-----VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELE 322
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 290 KKLQEDLQSEHCQylqdleqkfREKEKAKELELETLQASYEDLKAQSQEEIRllwsQLESMKTNREELNgswdpvlaqas 369
Cdd:TIGR02168 323 AQLEELESKLDEL---------AEELAELEEKLEELKEELESLEAELEELEA----ELEELESRLEELE----------- 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 370 hlEELEHLRSGFAqQQQQERAQHESELEHLrvyfEKKLKDAEKTYQEdltvfQQRLQEAREDSLESTEIssscvlpEETS 449
Cdd:TIGR02168 379 --EQLETLRSKVA-QLELQIASLNNEIERL----EARLERLEDRRER-----LQQEIEELLKKLEEAEL-------KELQ 439
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 1039732403 450 GREGKEPPDPLDLQLGQPKVQESLvEDCQVKLSKAEEKIQQMKEEFQKKEAE 501
Cdd:TIGR02168 440 AELEELEEELEELQEELERLEEAL-EELREELEEAEQALDAAERELAQLQAR 490
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
2081-2521 |
2.29e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 50.32 E-value: 2.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2081 RDSQALSEVTTDKGEKESLETHL-TWSEELLRAIQEVFAREQEKAELQprpygsnlGDYNSLVQRLEKVIQEQGDPQKVQ 2159
Cdd:COG1196 310 RRRELEERLEELEEELAELEEELeELEEELEELEEELEEAEEELEEAE--------AELAEAEEALLEAEAELAEAEEEL 381
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2160 DHLcLSDRSSLLAEIQALRAQLRMTHLQNQEKLQQLCAALTSTEARGSQREHQLRRQVELLAYKVEQEKCIANELQKTLS 2239
Cdd:COG1196 382 EEL-AEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2240 KEQETASDVRKRLVVEQnAVQDLKSELHACKQENTSLLE---------------SLDKVQQEVLRLRAVLDGKEKELKVV 2304
Cdd:COG1196 461 LLELLAELLEEAALLEA-ALAELLEELAEAAARLLLLLEaeadyegflegvkaaLLLAGLRGLAGAVAVLIGVEAAYEAA 539
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2305 LEELESERGK-GQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRCE 2383
Cdd:COG1196 540 LEAALAAALQnIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLG 619
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2384 ALL-------AQEKGQLSELQKSLEAERSRSLELSEALQHERLLTEQLSRNSQEACARQ-----ETQVQHALLRKLKAEK 2451
Cdd:COG1196 620 DTLlgrtlvaARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEaeleeLAERLAEEELELEEAL 699
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2452 TRALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEHKIEQLQRLVRELRWKEEV 2521
Cdd:COG1196 700 LAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELEREL 769
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
144-542 |
2.69e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 50.06 E-value: 2.69e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 144 KEKETALTELREMLNG--RRAQELALLQSR-QQCELELLREQHAREKEEMALRSGQEAAELKEKLRSEMEKNAQTIETLK 220
Cdd:PRK03918 210 NEISSELPELREELEKleKEVKELEELKEEiEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELK 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 221 QDWESERELC---------LENLRQELSlKHQSEMEGLQSQFqKELSEQKVELEKIFQAKHEAEVSLKNLEAQHqaaikK 291
Cdd:PRK03918 290 EKAEEYIKLSefyeeyldeLREIEKRLS-RLEEEINGIEERI-KELEEKEERLEELKKKLKELEKRLEELEERH-----E 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 292 LQEDLqsehcqyLQDLEQKFREKEKAKELELETLQASYEDL---KAQSQEEIRLLWSQLESMKTNREELNgswdpvlaqa 368
Cdd:PRK03918 363 LYEEA-------KAKKEELERLKKRLTGLTPEKLEKELEELekaKEEIEEEISKITARIGELKKEIKELK---------- 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 369 shlEELEHLRSGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREdsLESTEISSSCVLPEET 448
Cdd:PRK03918 426 ---KAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRE--LEKVLKKESELIKLKE 500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 449 SGREGKEppdpldlqlgqpkVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSREELKREAE--ERLASMFLELRE 526
Cdd:PRK03918 501 LAEQLKE-------------LEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEElkKKLAELEKKLDE 567
|
410
....*....|....*.
gi 1039732403 527 kAESEKLSIISRFEHR 542
Cdd:PRK03918 568 -LEEELAELLKELEEL 582
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
485-1382 |
2.97e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.97 E-value: 2.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 485 EEKIQQMKEEFQKKEAEWELSREELKREAEERLASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLERSL 564
Cdd:pfam02463 194 ELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKE 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 565 MEQQghLRQLEQELTRDDLLPCSQCGQEPAMAQEEKNGALLREKedcaLQLLMAQNRFLEERKEimekfaKEQDAFLRDA 644
Cdd:pfam02463 274 NKEE--EKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEK----LKESEKEKKKAEKELK------KEKEEIEELE 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 645 QEKHNHELQLLQQGHQQQLLALRMELETKHRSELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELES 724
Cdd:pfam02463 342 KELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLL 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 725 CYVADVQTIRDEHQQALELLRAELEEQLQKKEschremLTQELENLKRQHAEElQSVRDSLRMEMSAQHIENGKGPAADL 804
Cdd:pfam02463 422 KEEKKEELEILEEEEESIELKQGKLTEEKEEL------EKQELKLLKDELELK-KSEDLLKETQLVKLQEQLELLLSRQK 494
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 805 QGAhqqqdpamALHNEGHLLVEDGDAVLRSVDAEGLLHQAGPQELGDAHTVemqksqAELAKPQELQASQDQVAQVRDKV 884
Cdd:pfam02463 495 LEE--------RSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVA------VENYKVAISTAVIVEVSATADEV 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 885 FllnrELEECRAELEQLQQRRERENQEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAAVTLKSRISERAGL 964
Cdd:pfam02463 561 E----ERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTK 636
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 965 LLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEGAETSSVCEISSHVCESFFISPENTLDCEQPIRRVYQSLSTA 1044
Cdd:pfam02463 637 LKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLK 716
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1045 VEGLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALEL-HTAKGLLEGFKVEKV 1123
Cdd:pfam02463 717 LEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEReKTEKLKVEEEKEEKL 796
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1124 DLQEALGKKEESEQQLILEledlrkQLQQAARELLTLKEEKSVLWNQKETLTNEAKEREAgspvlpahKDTALQEEVESL 1203
Cdd:pfam02463 797 KAQEEELRALEEELKEEAE------LLEEEQLLIEQEEKIKEEELEELALELKEEQKLEK--------LAEEELERLEEE 862
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1204 TRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRgcaQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREE 1283
Cdd:pfam02463 863 ITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEK---KELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEE 939
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1284 FQQEIQRLEGQLRQAArprppgprdsqcvqLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASI 1363
Cdd:pfam02463 940 LLLEEADEKEKEENNK--------------EEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
|
890
....*....|....*....
gi 1039732403 1364 QRQMVQLQEELEKQKKSME 1382
Cdd:pfam02463 1006 KLIRAIIEETCQRLKEFLE 1024
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
116-798 |
3.35e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.58 E-value: 3.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 116 HSLELEALRLSLNNMHTAQLELTQANLQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSG 195
Cdd:pfam02463 193 EELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLK 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 196 QEAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSLKHQSEMEGLQSQFQKElsEQKVELEKIFQAKHEAE 275
Cdd:pfam02463 273 ENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKE--KEEIEELEKELKELEIK 350
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 276 VSLKNLEAQHQAAIKKLQEDLQSEHCQYLQDLEQKFREKEKAKELELEtlqasYEDLKAQSQEEIRLLWSQLESMKTNRE 355
Cdd:pfam02463 351 REAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELE-----LKSEEEKEAQLLLELARQLEDLLKEEK 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 356 ElngswdpvlAQASHLEELEHLRsGFAQQQQQERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLES 435
Cdd:pfam02463 426 K---------EELEILEEEEESI-ELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKL 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 436 TEISSSCVLPEETSGREGKEPPDPLDLQLG---------QPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSR 506
Cdd:pfam02463 496 EERSQKESKARSGLKVLLALIKDGVGGRIIsahgrlgdlGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPL 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 507 EELKREAEERLASMFLELREKAESEKLSIISRFEHRE-SSMRHLQDQQAAQILDLERSLMEQQGHLRQLEQELTRDDLLP 585
Cdd:pfam02463 576 GARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATlEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLE 655
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 586 CSQCGQEPAMAQ--EEKNGALLREKEDCALQLLMAQNRFLEERKEIMEKFAKEQDAFLRDAQEKHNHELQLLQQGHQQQL 663
Cdd:pfam02463 656 EGLAEKSEVKASlsELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKIN 735
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 664 LALRmELETKHRSELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELESCYVADVQTIRDEHQQALEL 743
Cdd:pfam02463 736 EELK-LLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEA 814
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*..
gi 1039732403 744 LRAELEEQLQKKESCHREMLTQELENLK--RQHAEELQSVRDSLRMEMSAQHIENGK 798
Cdd:pfam02463 815 ELLEEEQLLIEQEEKIKEEELEELALELkeEQKLEKLAEEELERLEEEITKEELLQE 871
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1075-1294 |
3.57e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 3.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1075 HRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKvdLQEALGKKEESEQQLILELEDLRKQLQQAA 1154
Cdd:COG4913 238 ERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALRLWF--AQRRLELLEAELEELRAELARLEAELERLE 315
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1155 RELLTLKEEKSVLWNQKETLTNEAKEReagspvlpahkdtaLQEEVESLTRvqweSRKQSEKDRATLLSQMRVLESEL-- 1232
Cdd:COG4913 316 ARLDALREELDELEAQIRGNGGDRLEQ--------------LEREIERLER----ELEERERRRARLEALLAALGLPLpa 377
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1233 -EDQLVQHRgcAQLAEEVATLKQQLAALDKHLRSQRqfmdDQAAEREHEREEFQQEIQRLEGQ 1294
Cdd:COG4913 378 sAEEFAALR--AEAAALLEALEEELEALEEALAEAE----AALRDLRRELRELEAEIASLERR 434
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
203-532 |
4.64e-05 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 49.24 E-value: 4.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 203 EKLRSEMEKNAQT--------IETLKQDWESERELCLENLRQELSLKHQSEMEGLQSQFQKELSEQKvelekifqaKHEA 274
Cdd:NF033838 72 SEIQKSLDKRKHTqnvalnkkLSDIKTEYLYELNVLKEKSEAELTSKTKKELDAAFEQFKKDTLEPG---------KKVA 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 275 EVSLKNLEAQHQAAIKKlQEDLQSEHCQYLQDLEQKFREKE-KAKELELETLQAsyEDLKAQSQEEIRLLWSQLESMKtn 353
Cdd:NF033838 143 EATKKVEEAEKKAKDQK-EEDRRNYPTNTYKTLELEIAESDvEVKKAELELVKE--EAKEPRDEEKIKQAKAKVESKK-- 217
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 354 reelngswdpvlAQASHLEELEHLRsgfaqqqqqERAQHESELEHLRVYFEKKLKDAEKTYQEDLTVFQQR--LQE-ARE 430
Cdd:NF033838 218 ------------AEATRLEKIKTDR---------EKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRgvLGEpATP 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 431 DSLESTEISSSCVLPEETsgregkePPDPlDLQLGQpKVQESlvedcQVKLSKAEEKIQQMKEEFQK------------- 497
Cdd:NF033838 277 DKKENDAKSSDSSVGEET-------LPSP-SLKPEK-KVAEA-----EKKVEEAKKKAKDQKEEDRRnyptntyktlele 342
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1039732403 498 --------KEAEWELSREELKR-EAEERLAsmflELREKAESEK 532
Cdd:NF033838 343 iaesdvkvKEAELELVKEEAKEpRNEEKIK----QAKAKVESKK 382
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2197-2520 |
6.74e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 48.98 E-value: 6.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2197 AALTSTEARGSQREHQLRRQVELLAYKVEqEKCIANELQKTlSKEQETASDVRKRlvveqnAVQDLKSELHACKQENTSL 2276
Cdd:PTZ00121 1413 AAAAKKKADEAKKKAEEKKKADEAKKKAE-EAKKADEAKKK-AEEAKKAEEAKKK------AEEAKKADEAKKKAEEAKK 1484
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2277 LESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQ 2356
Cdd:PTZ00121 1485 ADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2357 lcvalKHERAAKDNLQKELQIEasRCEALLAQEKGQLSELQKSLEAERSRSLELSEALQHERLLTEQLsRNSQEACARQE 2436
Cdd:PTZ00121 1565 -----KAEEAKKAEEDKNMALR--KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL-KKAEEEKKKVE 1636
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2437 TQVQHALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEiQRQRDEHKIEQLQRLVRELR 2516
Cdd:PTZ00121 1637 QLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALK-KEAEEAKKAEELKKKEAEEK 1715
|
....
gi 1039732403 2517 WKEE 2520
Cdd:PTZ00121 1716 KKAE 1719
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2274-2520 |
6.74e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.90 E-value: 6.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2274 TSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREGQSSRaLEELKLSLEKQLAQ 2353
Cdd:TIGR02168 235 EELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISR-LEQQKQILRERLAN 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2354 NNQLCVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLSELQKSLEAERSrslELSEALQHERLLTEQL-SRNSQEAC 2432
Cdd:TIGR02168 314 LERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA---ELEELESRLEELEEQLeTLRSKVAQ 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2433 ARQETQVQHALLRKLKAEKTRaleLEAMLEKVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEHKIEQLQRLV 2512
Cdd:TIGR02168 391 LELQIASLNNEIERLEARLER---LEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELR 467
|
....*...
gi 1039732403 2513 RELRWKEE 2520
Cdd:TIGR02168 468 EELEEAEQ 475
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
856-1370 |
7.98e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 48.39 E-value: 7.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 856 EMQKSQAELAkpQELQASQDQVAQVRDKVFLLNRELEECRAELEQLQQRRERENQEGTTLIcmlRADLELAQGEGKALRD 935
Cdd:COG1196 320 ELEEELAELE--EELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAE---EELEELAEELLEALRA 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 936 ALRRLLDLfGDTLKAAVTLKSRISERAGLLLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEGAETSSVCEISSH 1015
Cdd:COG1196 395 AAELAAQL-EELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAA 473
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1016 VCESFFISPENTLDCEQPIRRVYQSLSTAVEGLLEMALdSSKQLEEARQLHRCV-------EREFRHRNEEMAQAMqkQQ 1088
Cdd:COG1196 474 LLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVK-AALLLAGLRGLAGAVavligveAAYEAALEAALAAAL--QN 550
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1089 ELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEeksvlw 1168
Cdd:COG1196 551 IVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLG------ 624
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1169 nqkETLTNEAKEREAGSPVLPAHKDTALQEEVESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEE 1248
Cdd:COG1196 625 ---RTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLA 701
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1249 VATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPRDSQcvQLDEEVELLQEKLREK 1328
Cdd:COG1196 702 EEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLE--ELERELERLEREIEAL 779
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 1039732403 1329 -------LDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQL 1370
Cdd:COG1196 780 gpvnllaIEEYEELEERYDFLSEQREDLEEARETLEEAIEEIDRETRER 828
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
887-1387 |
9.15e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.11 E-value: 9.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 887 LNRELEECRAELEQLQQRRERENQEGTTLICMLrADLELAQGEGKALRDALRRLLDLFGDTLKAAVTLKSRISERAGLLL 966
Cdd:PRK02224 211 LESELAELDEEIERYEEQREQARETRDEADEVL-EEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLE 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 967 DHEDAADtsdARLAAAALGDMWSDEGLLEIDRTLPEGAETSSVCEisshvcesffispentlDCEQPIrrvyQSLSTAVE 1046
Cdd:PRK02224 290 ELEEERD---DLLAEAGLDDADAEAVEARREELEDRDEELRDRLE-----------------ECRVAA----QAHNEEAE 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1047 GLLEMALDSSKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEK---- 1122
Cdd:PRK02224 346 SLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERdelr 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1123 -------VDLQEALGKKEESEQQL-----------------ILELEDLRKQLQQAARELLTLKEEKSVLWNQKETLTnEA 1178
Cdd:PRK02224 426 ereaeleATLRTARERVEEAEALLeagkcpecgqpvegsphVETIEEDRERVEELEAELEDLEEEVEEVEERLERAE-DL 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1179 KEREAGSpvlpahkdTALQEEVESLTRVQWESRKQSEKDRATlLSQMRVLESELEDQLVQHRGCAQLA--------EEVA 1250
Cdd:PRK02224 505 VEAEDRI--------ERLEERREDLEELIAERRETIEEKRER-AEELRERAAELEAEAEEKREAAAEAeeeaeearEEVA 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1251 TLKQQLAALDKHLRSQRQFMDDQAAEREHEreefqQEIQRLEGQLRQAARprppgprdsqcvQLDEEVELLQEK------ 1324
Cdd:PRK02224 576 ELNSKLAELKERIESLERIRTLLAAIADAE-----DEIERLREKREALAE------------LNDERRERLAEKrerkre 638
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039732403 1325 LREKLDG--FNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEkqkkSMEELKEK 1387
Cdd:PRK02224 639 LEAEFDEarIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELE----ELEELRER 699
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1056-1596 |
9.47e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.12 E-value: 9.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1056 SKQLEEARQLHRCVEREFRHRNEEMAQAMQKQQELLERLREESAAKDRLALElhTAKGLLEGFKVEKVDLQEALGKKEES 1135
Cdd:TIGR00606 446 KEILEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTE--TLKKEVKSLQNEKADLDRKLRKLDQE 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1136 EQQLILELEDLRKQLQQAARELLTLKEEKSVLWNQKETLTNEAKEreagSPVLPAHKDT----------------ALQEE 1199
Cdd:TIGR00606 524 MEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGY----FPNKKQLEDWlhskskeinqtrdrlaKLNKE 599
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1200 VESLTRVQWESRKQSEKDRATLLSqmrvleseLEDQLVQHRGC-------AQLAEEVATLKQQLAALDKHLRSQRQFMDD 1272
Cdd:TIGR00606 600 LASLEQNKNHINNELESKEEQLSS--------YEDKLFDVCGSqdeesdlERLKEEIEKSSKQRAMLAGATAVYSQFITQ 671
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1273 QAAERE------HEREEFQQEIQRLEGQLRQAARPRPpgprdsqcvqldEEVELLQEKLREKLDGFNELVIKKDFADQQL 1346
Cdd:TIGR00606 672 LTDENQsccpvcQRVFQTEAELQEFISDLQSKLRLAP------------DKLKSTESELKKKEKRRDEMLGLAPGRQSII 739
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1347 LIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQENMGDLLLTTVSRSGLDEAGCPMLPQGSSSRGPE 1426
Cdd:TIGR00606 740 DLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSD 819
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1427 AQPDVTE-RALLQHENEVVHRRNSEIDELKSLIENLQENQRQLQKD----KAEEI-------------EQLHEVIEKLQS 1488
Cdd:TIGR00606 820 LDRTVQQvNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKtnelKSEKLqigtnlqrrqqfeEQLVELSTEVQS 899
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1489 ELSLMGPKVHEVSdPQAGSLHSELAclRGEGLGGQALRSELQAAQAAKEVFGQLlaDQAHGHSQALEALQQ-----RLQD 1563
Cdd:TIGR00606 900 LIREIKDAKEQDS-PLETFLEKDQQ--EKEELISSKETSNKKAQDKVNDIKEKV--KNIHGYMKDIENKIQdgkddYLKQ 974
|
570 580 590
....*....|....*....|....*....|...
gi 1039732403 1564 AEEVAARHLAELEHCVALREAEVEAMASQIQEF 1596
Cdd:TIGR00606 975 KETELNTVNAQLEECEKHQEKINEDMRLMRQDI 1007
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
203-319 |
1.16e-04 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 47.90 E-value: 1.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 203 EKLRSEMEKNAQTIETLKQDweserelcLENLRQELSLKhqseMEGLQSQFQKELSEQKVEL-EKIFQAKHEAEVSLKNL 281
Cdd:PRK00409 526 EELERELEQKAEEAEALLKE--------AEKLKEELEEK----KEKLQEEEDKLLEEAEKEAqQAIKEAKKEADEIIKEL 593
|
90 100 110
....*....|....*....|....*....|....*....
gi 1039732403 282 EAQHQAAIKKLQE-DLQSEHCQYLQDLEQKFREKEKAKE 319
Cdd:PRK00409 594 RQLQKGGYASVKAhELIEARKRLNKANEKKEKKKKKQKE 632
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
199-970 |
1.25e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 48.04 E-value: 1.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 199 AELKEKLRSEMEKNAQTIETLKQDWESERELCLEnlRQELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSL 278
Cdd:pfam02463 165 SRLKRKKKEALKKLIEETENLAELIIDLEELKLQ--ELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLL 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 279 KNLEAQHQAAIKKLQEDLQSEhcQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLESmktNREELN 358
Cdd:pfam02463 243 QELLRDEQEEIESSKQEIEKE--EEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVD---DEEKLK 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 359 GSWDPVLAQASHLEELEHLRSGFAQQQQQERAQHESELEhlRVYFEKKLKDAEKTYQEDLTVFQQRLQEAREDSLESTEI 438
Cdd:pfam02463 318 ESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEE--EEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEE 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 439 SSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEA--EWELSREELKREAEER 516
Cdd:pfam02463 396 ELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQElkLLKDELELKKSEDLLK 475
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 517 LASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQAAQILDLE-RSLMEQQGHLRQLEQELTRDDLLPCSQCGQEPAM 595
Cdd:pfam02463 476 ETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVgGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSA 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 596 AQEEKNGALLREKEDCALQLLMAQNRFLEERKEIMEKFAKEQDAFLRDAQEKhNHELQLLQQGHQQQLLALRMELETKHR 675
Cdd:pfam02463 556 TADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQ-LDKATLEADEDDKRAKVVEGILKDTEL 634
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 676 SELTEQLASSESRRQALLETHVAELQVKHNAEISALEKRHLSNLDELESCYVADVQTIRDEHQQALELLRAELEEQLQKK 755
Cdd:pfam02463 635 TKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKK 714
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 756 ESCHRE----MLTQELENLKRQHAEELQSVRDSLRMEMSAQHIENGKGPAADLQGAHQQQDPA-MALHNEGHLLVEDGDA 830
Cdd:pfam02463 715 LKLEAEellaDRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAeEREKTEKLKVEEEKEE 794
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 831 VLRSVDAEGLLHQAGPQELGDAHTVEMQKSQAELAKPQELQASQDQVAQVRDKVFLLNRE-LEECRAELEQLQQRREREN 909
Cdd:pfam02463 795 KLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEeLERLEEEITKEELLQELLL 874
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039732403 910 QEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAAVTLKSRISERAGLLLDHED 970
Cdd:pfam02463 875 KEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEE 935
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
136-409 |
1.31e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.83 E-value: 1.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 136 ELTQANLQKEKEtaltELREMLNGRRAQELALLQSRQQCELELLREQHAREKEE--MALRSGQEAAELKEKLRSEMEKNA 213
Cdd:PTZ00121 1526 EAKKAEEAKKAD----EAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDknMALRKAEEAKKAEEARIEEVMKLY 1601
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 214 QTIETLKQDWESERElclENLRQELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAIKKLQ 293
Cdd:PTZ00121 1602 EEEKKMKAEEAKKAE---EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAE 1678
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 294 EDLQSEHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLESMKTNREELNGSWDPVLAQASHLEE 373
Cdd:PTZ00121 1679 EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKK 1758
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 1039732403 374 LEHLRSGFAQQQQQERAQHES--------ELEHLRVYFEKKLKD 409
Cdd:PTZ00121 1759 IAHLKKEEEKKAEEIRKEKEAvieeeldeEDEKRRMEVDKKIKD 1802
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
2143-2516 |
1.55e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 47.66 E-value: 1.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2143 QRLEKVIQEQGDPQKVQDHLCLSDRSSLLAEIQALRAqlrMTHLQNQEKLQQLCAALTSTEARGSQREHQLRRQVELLAY 2222
Cdd:pfam02463 173 EALKKLIEETENLAELIIDLEELKLQELKLKEQAKKA---LEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDE 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2223 KVEQEKcianeLQKTLSKEQETASDVRKRLVVEQNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELK 2302
Cdd:pfam02463 250 QEEIES-----SKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKK 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2303 VVLEELESERGKGQalqaqqeeqqlrylqregQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRC 2382
Cdd:pfam02463 325 KAEKELKKEKEEIE------------------ELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERL 386
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2383 EALLAQEKGQLSELQKSLEAERSRSLELSEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTRALELEAMLE 2462
Cdd:pfam02463 387 SSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELE 466
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1039732403 2463 KVQKQAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:pfam02463 467 LKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGV 520
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
119-722 |
1.59e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 47.36 E-value: 1.59e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 119 ELEALRLSLNNMHTAQLELT---------QANLQKEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEE 189
Cdd:TIGR02168 268 KLEELRLEVSELEEEIEELQkelyalaneISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 190 MALRSGQEAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQELSL-KHQSEMEGLQSQFQKELSEQKVELEKIF 268
Cdd:TIGR02168 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLnNEIERLEARLERLEDRRERLQQEIEELL 427
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 269 QAKHEAEVSLKNLE-AQHQAAIKKLQEDLQSEHCQY--LQDLEQKFREKEKAKELELETLQASYEDLKAQsQEEIRLLWS 345
Cdd:TIGR02168 428 KKLEEAELKELQAElEELEEELEELQEELERLEEALeeLREELEEAEQALDAAERELAQLQARLDSLERL-QENLEGFSE 506
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 346 QLESMKTNREELNGSWDPVLAQ---------------------------------ASHLEELEHLRSGFAQQQQQERAQH 392
Cdd:TIGR02168 507 GVKALLKNQSGLSGILGVLSELisvdegyeaaieaalggrlqavvvenlnaakkaIAFLKQNELGRVTFLPLDSIKGTEI 586
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 393 ESELEHLR-------------VYFEKKLKDAeKTYQEDLTVFQQRLQEAREDSLESTEISSSCVLPEETSGREG-----K 454
Cdd:TIGR02168 587 QGNDREILkniegflgvakdlVKFDPKLRKA-LSYLLGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVRPGGvitggS 665
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 455 EPPDPLDLQLGQP--------KVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWE------------LSREELKREAE 514
Cdd:TIGR02168 666 AKTNSSILERRREieeleekiEELEEKIAELEKALAELRKELEELEEELEQLRKELEelsrqisalrkdLARLEAEVEQL 745
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 515 E----RLASMFLELREKAESEKLSIISRFEHRESSMRHLQDQQaAQILDLERSLMEQQGHLRQLEQELTRDdllpcsqcg 590
Cdd:TIGR02168 746 EeriaQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE-AQIEQLKEELKALREALDELRAELTLL--------- 815
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 591 QEPAMAQEEKNGALLREKEDCALQLLMAQNRfLEERKEIMEKFAKEQDAfLRDAQEKhnHELQLLQQGHQQQLLALRMEL 670
Cdd:TIGR02168 816 NEEAANLRERLESLERRIAATERRLEDLEEQ-IEELSEDIESLAAEIEE-LEELIEE--LESELEALLNERASLEEALAL 891
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....
gi 1039732403 671 ETKHRSELTEQLASSESRRQALLE--THVAELQVKHNAEISALEKRHLSNLDEL 722
Cdd:TIGR02168 892 LRSELEELSEELRELESKRSELRRelEELREKLAQLELRLEGLEVRIDNLQERL 945
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1285-1411 |
1.97e-04 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 47.16 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1285 QQEIQRLEGQLRQaarprppgprdsqcvqLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQ---EEEIKRLEETNA 1361
Cdd:COG2433 412 EEEIRRLEEQVER----------------LEAEVEELEAELEEKDERIERLERELSEARSEERREirkDREISRLDREIE 475
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1039732403 1362 SIQRQMVQLQEELEKQKKSMEELkeKEILKQENMGDLL----LTTVSRSGLDEA 1411
Cdd:COG2433 476 RLERELEEERERIEELKRKLERL--KELWKLEHSGELVpvkvVEKFTKEAIRRL 527
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
139-583 |
5.21e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 45.73 E-value: 5.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 139 QANLQKEKETALTELREMLNGRRAQELALLQSRQQCE----------LELLREQHAREKEEMALRSGQEAAELKEKLRSE 208
Cdd:TIGR00618 450 TAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETrkkavvlarlLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRR 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 209 MEKNAQTIETLKQDWESERELCLENLRQELSLKHQseMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAA 288
Cdd:TIGR00618 530 MQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQ--MQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAE 607
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 289 IKKLQED----LQSEHCQYLQDLEQKFREKEKAKELELETLQASYEDLKAQSQEEIRLLWSQLESMKTNREELNGSwdpv 364
Cdd:TIGR00618 608 DMLACEQhallRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQ---- 683
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 365 lAQASHLEELEHLRSGFAQQQQQERAQHESELEHLRVYFE--KKLKDAEKTYQEDLTVFQQRLQEAREDSLEsteisssc 442
Cdd:TIGR00618 684 -KMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEieNASSSLGSDLAAREDALNQSLKELMHQART-------- 754
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 443 VLPEETSGREGKEPPDPLDLQLGQPkvQESLVEDCQVKLSKAEEKIQQMKE---EFQKKEAEWELSREELKREAEERLAS 519
Cdd:TIGR00618 755 VLKARTEAHFNNNEEVTAALQTGAE--LSHLAAEIQFFNRLREEDTHLLKTleaEIGQEIPSDEDILNLQCETLVQEEEQ 832
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039732403 520 MFLELREKAESEkLSIISRFEHRESSMRHLQD--QQAAQILDLERSL--------MEQQGHLRQLEQELTRDDL 583
Cdd:TIGR00618 833 FLSRLEEKSATL-GEITHQLLKYEECSKQLAQltQEQAKIIQLSDKLnginqikiQFDGDALIKFLHEITLYAN 905
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2256-2459 |
6.23e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 44.82 E-value: 6.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2256 QNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKgQALQAQQEEQQLRYLQREGQ 2335
Cdd:COG3883 22 QKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAE-IEERREELGERARALYRSGG 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2336 SSRALEELKLS------------LEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLSELQKSLEAE 2403
Cdd:COG3883 101 SVSYLDVLLGSesfsdfldrlsaLSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQ 180
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1039732403 2404 RSRSLELSEAlqhERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTRALELEA 2459
Cdd:COG3883 181 EALLAQLSAE---EAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAA 233
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
134-358 |
6.96e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 45.40 E-value: 6.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 134 QLELTQANlqkekeTALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEmalrsgqeaaelKEKLRSEMEKNA 213
Cdd:TIGR04523 273 QKELEQNN------KKIKELEKQLNQLKSEISDLNNQKEQDWNKELKSELKNQEKK------------LEEIQNQISQNN 334
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 214 QTIETLKQDweserelcLENLRQELSlkhqsEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAIKKLQ 293
Cdd:TIGR04523 335 KIISQLNEQ--------ISQLKKELT-----NSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQ 401
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 294 EDlqsehcqylQDLEQKFREKEKAKELELETLQASYEDLKAQ---SQEEIRLLWSQLESMKTNREELN 358
Cdd:TIGR04523 402 NQ---------EKLNQQKDEQIKKLQQEKELLEKEIERLKETiikNNSEIKDLTNQDSVKELIIKNLD 460
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2211-2521 |
7.50e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.52 E-value: 7.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2211 HQLRRQVELLAykveQEKCIANELQKTLSK----EQETASDVRKRLVVEQNAVQDLKS-ELHACKQENTSLLESLDKVQQ 2285
Cdd:PTZ00121 1264 HFARRQAAIKA----EEARKADELKKAEEKkkadEAKKAEEKKKADEAKKKAEEAKKAdEAKKKAEEAKKKADAAKKKAE 1339
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2286 EVLRLRAVLDGKEKELKVVLEELEsERGKGQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHER 2365
Cdd:PTZ00121 1340 EAKKAAEAAKAEAEAAADEAEAAE-EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2366 AAkDNLQKElqieasrceallAQEKGQLSELQKslEAERSRSLELSEALQHERLLTEQLSRNSQEACARQEtqvqhalLR 2445
Cdd:PTZ00121 1419 KA-DEAKKK------------AEEKKKADEAKK--KAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADE-------AK 1476
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2446 KLKAEKTRALELEAMLEKVQKQAahTQQQLEAQAQERCVELRREKE-------RELEIQRQRDEHKIEQLQRLVRELRWK 2518
Cdd:PTZ00121 1477 KKAEEAKKADEAKKKAEEAKKKA--DEAKKAAEAKKKADEAKKAEEakkadeaKKAEEAKKADEAKKAEEKKKADELKKA 1554
|
...
gi 1039732403 2519 EEV 2521
Cdd:PTZ00121 1555 EEL 1557
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1092-1306 |
7.92e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.75 E-value: 7.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1092 ERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEKSVLWNQK 1171
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1172 ETLTNEAKER--------EAGSPVLPAHKDTALQEE-----VESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQ 1238
Cdd:COG4942 100 EAQKEELAELlralyrlgRQPPLALLLSPEDFLDAVrrlqyLKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039732403 1239 HRGCAQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGP 1306
Cdd:COG4942 180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAG 247
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1313-1488 |
8.06e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.75 E-value: 8.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1313 QLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELkEKEILKQ 1392
Cdd:COG4942 24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAEL-RAELEAQ 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1393 ENMGDLLLTTVSRSGLDEAGCPMLPQGSSSRGpeaqpdVTERALLQHeneVVHRRNSEIDELKSLIENLQENQRQLQKDK 1472
Cdd:COG4942 103 KEELAELLRALYRLGRQPPLALLLSPEDFLDA------VRRLQYLKY---LAPARREQAEELRADLAELAALRAELEAER 173
|
170
....*....|....*.
gi 1039732403 1473 AEEIEQLHEVIEKLQS 1488
Cdd:COG4942 174 AELEALLAELEEERAA 189
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1200-1604 |
8.68e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 44.76 E-value: 8.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1200 VESLTRVQWESRKQSEKDRATLLSQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALD---KHLRSQRQFMDD--QA 1274
Cdd:COG4717 48 LERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEaelEELREELEKLEKllQL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1275 AEREHEREEFQQEIQRLEGQLRQA-ARPRPPGPRDSQCVQLDEEVELLQEKLREKLDGFNeLVIKKDFADQQLLIQE--E 1351
Cdd:COG4717 128 LPLYQELEALEAELAELPERLEELeERLEELRELEEELEELEAELAELQEELEELLEQLS-LATEEELQDLAEELEElqQ 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1352 EIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQE---------------NMGDLLLTTVSRSGLDEAGCPML 1416
Cdd:COG4717 207 RLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEArlllliaaallallgLGGSLLSLILTIAGVLFLVLGLL 286
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1417 PQGSSSRGPEAQPDVTERALLQHENEVVHRRNSEIDELKS---LIENLQENQRQLQKDKAEEIEQLHEVIEKLQSELSLM 1493
Cdd:COG4717 287 ALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAalgLPPDLSPEELLELLDRIEELQELLREAEELEEELQLE 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1494 GPKVHEVSDPQAGSLHSElaclrgeglggQALRSELQAAQAAKEVFGQLLADQAHGHSQALEALQQRLQDAEEVAARHLA 1573
Cdd:COG4717 367 ELEQEIAALLAEAGVEDE-----------EELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELE 435
|
410 420 430
....*....|....*....|....*....|.
gi 1039732403 1574 ELEHCVALREAEVEAMASQIQEFAATLKAKE 1604
Cdd:COG4717 436 ELEEELEELEEELEELREELAELEAELEQLE 466
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
2084-2309 |
9.01e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.06 E-value: 9.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2084 QALSEVTT---DKGEKESLETH-----LTWSEELLRAIQEVFAREQEKAELQPRPYGSNLGDYNSLVQRLEKVIQEQGdp 2155
Cdd:TIGR02169 272 QLLEELNKkikDLGEEEQLRVKekigeLEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER-- 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2156 qkvqdhlclSDRSSLLAEIQALRAQLRMTHLQNQEKlqqlcaaltSTEARGSQREH-QLRRQVELLAYKVEQEKCIANEL 2234
Cdd:TIGR02169 350 ---------KRRDKLTEEYAELKEELEDLRAELEEV---------DKEFAETRDELkDYREKLEKLKREINELKRELDRL 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2235 QKTLSKEQETASDVR---KRLVVEQNAVQ----DLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEE 2307
Cdd:TIGR02169 412 QEELQRLSEELADLNaaiAGIEAKINELEeekeDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRE 491
|
..
gi 1039732403 2308 LE 2309
Cdd:TIGR02169 492 LA 493
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
2132-2514 |
9.48e-04 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.06 E-value: 9.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2132 GSNLGDYNSLVQR--LEKVIQEQGDPQKVQDHL--CLSDRSSLLAEIQALRAQLRMTHLQNQEKLQQLcAALTSTEARGS 2207
Cdd:TIGR02169 658 GSRAPRGGILFSRsePAELQRLRERLEGLKRELssLQSELRRIENRLDELSQELSDASRKIGEIEKEI-EQLEQEEEKLK 736
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2208 QREHQLRR-----QVELLAYKVEQEKCIAN--ELQKTLSKEQETASDVRKRLVVEQ-NAVQDLKSELHACKQENTSLLES 2279
Cdd:TIGR02169 737 ERLEELEEdlsslEQEIENVKSELKELEARieELEEDLHKLEEALNDLEARLSHSRiPEIQAELSKLEEEVSRIEARLRE 816
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2280 LDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKgqalqaqqeeqqlryLQREGQSSRA-LEELKLSLEKQLAQNNQLC 2358
Cdd:TIGR02169 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKS---------------IEKEIENLNGkKEELEEELEELEAALRDLE 881
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2359 VALKHERAAKDNLQKELQieasrceallaQEKGQLSELQKSLEAERSRSLELSEALQherLLTEQLSRNSQEACARQETQ 2438
Cdd:TIGR02169 882 SRLGDLKKERDELEAQLR-----------ELERKIEELEAQIEKKRKRLSELKAKLE---ALEEELSEIEDPKGEDEEIP 947
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039732403 2439 VQHALLRKLKAEKTRALELEAMLEKVQKQAahtqQQLEAQAQERCVELrREKERELEIQRQRDEHKIEQLQRLVRE 2514
Cdd:TIGR02169 948 EEELSLEDVQAELQRVEEEIRALEPVNMLA----IQEYEEVLKRLDEL-KEKRAKLEEERKAILERIEEYEKKKRE 1018
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1190-1599 |
9.77e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 44.83 E-value: 9.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1190 AHKDTALQEEVESLtrvqwesrKQSEKDRATLLSQMRVLESELEDQLvqhrgcAQLAEEVATLKQQLAALDKHLRSQRQF 1269
Cdd:pfam12128 240 RPEFTKLQQEFNTL--------ESAELRLSHLHFGYKSDETLIASRQ------EERQETSAELNQLLRTLDDQWKEKRDE 305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1270 MDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPR---------DSQCVQLDEEVELLQEKLREKLDGFNELVIK-- 1338
Cdd:pfam12128 306 LNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAadqeqlpswQSELENLEERLKALTGKHQDVTAKYNRRRSKik 385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1339 -------KDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEELEKQKKsmeELKEKEILKQENMGDLllttvsrsgldea 1411
Cdd:pfam12128 386 eqnnrdiAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKL---EFNEEEYRLKSRLGEL------------- 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1412 gcpmlpqgsssRGPEAQPDVTERALLQHEN--EVVHRRNSEIDELKSLIENLQENQRQLQK--DKAEE-IEQLHEVIEKL 1486
Cdd:pfam12128 450 -----------KLRLNQATATPELLLQLENfdERIERAREEQEAANAEVERLQSELRQARKrrDQASEaLRQASRRLEER 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1487 QSELSlmgpKVHEVSDPQAGSLHSELA------------CLRGEGLGGQALRSELQAAQAAKE--VFG------QLLADQ 1546
Cdd:pfam12128 519 QSALD----ELELQLFPQAGTLLHFLRkeapdweqsigkVISPELLHRTDLDPEVWDGSVGGElnLYGvkldlkRIDVPE 594
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1547 AHGHSQALEALQQRLQDAEEVAARHLAELEHCVALREAEVEAmASQIQEFAAT 1599
Cdd:pfam12128 595 WAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEK-ASREETFART 646
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
176-963 |
1.02e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 44.72 E-value: 1.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 176 LELLREQHAREKEEMALRSgQEAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQElslkHQSEmEGLQSQFQK 255
Cdd:pfam15921 76 IERVLEEYSHQVKDLQRRL-NESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRE----SQSQ-EDLRNQLQN 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 256 ---ELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAIKKLQEDLqsehCQYLQDLEQKFREKEKAKELELETLQASYEDL 332
Cdd:pfam15921 150 tvhELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSIL----VDFEEASGKKIYEHDSMSTMHFRSLGSAISKI 225
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 333 KAQSQEEIRLLWSQL----ESMKTNREELNGSWDPVLAQasHLEELEHLRS-------GFAQQQQQERAQHESELEHLRV 401
Cdd:pfam15921 226 LRELDTEISYLKGRIfpveDQLEALKSESQNKIELLLQQ--HQDRIEQLISeheveitGLTEKASSARSQANSIQSQLEI 303
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 402 yFEKKLKDAEKTYQEDLTvfqqrlqearedSLESTEISSSCVLPEETSGREGKEPPDPLDLQLGQPKVQESLVEDCQVKL 481
Cdd:pfam15921 304 -IQEQARNQNSMYMRQLS------------DLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQ 370
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 482 SKA--EEKIQQMKEEFQKKEAEWELSREELKREAEERLASmflelrekaeseklSIISRFEHRESSMRHLQDQQAAQILD 559
Cdd:pfam15921 371 ESGnlDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGN--------------SITIDHLRRELDDRNMEVQRLEALLK 436
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 560 LERSlmEQQGhlrQLEQELTrddllpcsqcgqepamAQEEKNGALLRekedcaLQLLMAQnrfLEERKEIMEKFAKEQDA 639
Cdd:pfam15921 437 AMKS--ECQG---QMERQMA----------------AIQGKNESLEK------VSSLTAQ---LESTKEMLRKVVEELTA 486
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 640 flrdaqekhnhelqllqqghqqqllaLRMELETKHRSeLTEQLASSESRRQALLEThvaelqvkhNAEISALEKRHLSNL 719
Cdd:pfam15921 487 --------------------------KKMTLESSERT-VSDLTASLQEKERAIEAT---------NAEITKLRSRVDLKL 530
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 720 DELESCyvadvqTIRDEHQQALELLRAELEEQLQKKESChREMLTQELENLkrqhaeelqsvrdslrMEMSAQHIEN-GK 798
Cdd:pfam15921 531 QELQHL------KNEGDHLRNVQTECEALKLQMAEKDKV-IEILRQQIENM----------------TQLVGQHGRTaGA 587
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 799 GPAADLQGAHQQQDPAMALHnEGHLLVEDGDAVLRSVDAEgllhqagpqelgdAHTVEMQKSQAELAKPQELQASQDqVA 878
Cdd:pfam15921 588 MQVEKAQLEKEINDRRLELQ-EFKILKDKKDAKIRELEAR-------------VSDLELEKVKLVNAGSERLRAVKD-IK 652
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 879 QVRDKvfLLNrELEECRAELEQLQQRRE-------RENQEGTTLICMLRADLELAQGEGKALRDALRRLLDLFGDTLKAA 951
Cdd:pfam15921 653 QERDQ--LLN-EVKTSRNELNSLSEDYEvlkrnfrNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVA 729
|
810
....*....|..
gi 1039732403 952 VTLKSRISERAG 963
Cdd:pfam15921 730 MGMQKQITAKRG 741
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
176-431 |
1.02e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.91 E-value: 1.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 176 LELLREQHAREKEEMAlrsgqEAAELKEKLRSEMEKNAQTIETLKQ---------DWES-ERELclENLRQELS--LKHQ 243
Cdd:COG4913 612 LAALEAELAELEEELA-----EAEERLEALEAELDALQERREALQRlaeyswdeiDVASaEREI--AELEAELErlDASS 684
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 244 SEMEGLQSQF---QKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAAIKKLQEDLQSEHCQYLQDLEQKFREkEKAKEL 320
Cdd:COG4913 685 DDLAALEEQLeelEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAA-ALGDAV 763
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 321 ElETLQASYEDLKAQSQEEIRLLWSQLESMKT--NRE--ELNGSWDPVLAQASH-LEELEHLRsgfaqqqqqeraqhESE 395
Cdd:COG4913 764 E-RELRENLEERIDALRARLNRAEEELERAMRafNREwpAETADLDADLESLPEyLALLDRLE--------------EDG 828
|
250 260 270
....*....|....*....|....*....|....*..
gi 1039732403 396 LEHlrvyFEKKLKDA-EKTYQEDLTVFQQRLQEARED 431
Cdd:COG4913 829 LPE----YEERFKELlNENSIEFVADLLSKLRRAIRE 861
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
203-349 |
1.33e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 44.00 E-value: 1.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 203 EKLRSEMEKNAQTIetLKqdwESERELclENLRQELSLKHQSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNle 282
Cdd:PRK12704 30 EAKIKEAEEEAKRI--LE---EAKKEA--EAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDR-- 100
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039732403 283 aqHQAAIKKLQEDLQSEhcqyLQDLEQKFREKEKaKELELETLQASY----EDLKAQSQEEIR-LLWSQLES 349
Cdd:PRK12704 101 --KLELLEKREEELEKK----EKELEQKQQELEK-KEEELEELIEEQlqelERISGLTAEEAKeILLEKVEE 165
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1143-1373 |
1.53e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.14 E-value: 1.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1143 LEDLRKQLQQAARELLTLKEEKSVLWNQKETLtneakereagspvlpahkdTALQEEVESLTRVQWEsrkqsEKDRATLL 1222
Cdd:COG4913 612 LAALEAELAELEEELAEAEERLEALEAELDAL-------------------QERREALQRLAEYSWD-----EIDVASAE 667
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1223 SQMRVLESELEDQLVQHRGCAQLAEEVATLKQQLAALDKHLRSQRQfmddqaaerehEREEFQQEIQRLEGQLRQAARpr 1302
Cdd:COG4913 668 REIAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDELKG-----------EIGRLEKELEQAEEELDELQD-- 734
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039732403 1303 ppgprdsqcvQLDEEVELLQEKLREKLDGFNELVIKKDFADQQLLIQEEEIKRLEETNASIQRQMVQLQEE 1373
Cdd:COG4913 735 ----------RLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA 795
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
1479-1615 |
1.74e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.42 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1479 LHEVIEKLQSELSLMGPKVHEVSD------PQAGSLHSELACLRGEGLGGQALRSELQAAQAAKEVFGQLLADQAHGHSQ 1552
Cdd:PRK09039 44 LSREISGKDSALDRLNSQIAELADllslerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQ 123
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 1553 ALEALQQrlqdaeeVAARHLAElehcVALREAEVEAMASQIQEFAATLKAKEAIIEQRDLEID 1615
Cdd:PRK09039 124 ELDSEKQ-------VSARALAQ----VELLNQQIAALRRQLAALEAALDASEKRDRESQAKIA 175
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2235-2520 |
1.84e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 43.96 E-value: 1.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2235 QKTLSKEQETASDVRkrlvVEQNAVQDLKSELhACKQENTSLLESLDKVQQEVLRLRAVLDGkEKELKVVLEELESERGK 2314
Cdd:pfam17380 281 QKAVSERQQQEKFEK----MEQERLRQEKEEK-AREVERRRKLEEAEKARQAEMDRQAAIYA-EQERMAMERERELERIR 354
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2315 GQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVALKHERAAKDNLQKELQIEASRCEALLAQEKGQLS 2394
Cdd:pfam17380 355 QEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQ 434
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2395 ELQKSLEAERSRSLELSEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTRALELE--------AMLEKVQK 2466
Cdd:pfam17380 435 REVRRLEEERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEkeleerkqAMIEEERK 514
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1039732403 2467 QAAHTQQQLEAQAQERCVELRREKERELEIQRQRDEHK-IEQLQRLVRELRWKEE 2520
Cdd:pfam17380 515 RKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRrIQEQMRKATEERSRLE 569
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
2084-2556 |
1.87e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.77 E-value: 1.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2084 QALSEVTTDKGEKESLETHLTWSEELLRAIQEVFAREQEKAELQPRPYGSNLGDYNSLVQRLEKVIQEQGDPQKVQDHLc 2163
Cdd:COG1196 327 ELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEL- 405
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2164 LSDRSSLLAEIQALRAQLRmTHLQNQEKLQQLCAALTSTEARGSQREHQLRRQVELLAYKVEQEKCIANELQKTLSKEQE 2243
Cdd:COG1196 406 EEAEEALLERLERLEEELE-ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2244 TASDVRKRLVVEQNAVQDLKSELHAckqentSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKGQALQAQQE 2323
Cdd:COG1196 485 ELAEAAARLLLLLEAEADYEGFLEG------VKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEV 558
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2324 EQQLRYLQREGQSSRAlEELKLSLEKQLAQNNQLCVALKHERAAkdNLQKELQIEASRCEALLAQEKGQLSELQKSLEAE 2403
Cdd:COG1196 559 AAAAIEYLKAAKAGRA-TFLPLDKIRARAALAAALARGAIGAAV--DLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2404 RSRSLELSEALQHERLLTEQLSRNSQEAcaRQETQVQHALLRKLKAEKTRALELEAMLEKVQKQAAHTQQQLEAQAQERC 2483
Cdd:COG1196 636 LRRAVTLAGRLREVTLEGEGGSAGGSLT--GGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAE 713
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039732403 2484 VELRREKERELEIQRQRDEHKIEQLQRLVRELrwKEEVSGGNGPCRGSPGRGSLERDQFqeqqqeleKIRQQL 2556
Cdd:COG1196 714 EERLEEELEEEALEEQLEAEREELLEELLEEE--ELLEEEALEELPEPPDLEELERELE--------RLEREI 776
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
119-551 |
2.02e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 43.90 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 119 ELEALRLSLNNMHTAQLELTQANLQKEK--------ETALTELREMLNGRRAQELALLQSRQQCE-----------LELL 179
Cdd:PRK03918 222 ELEKLEKEVKELEELKEEIEELEKELESlegskrklEEKIRELEERIEELKKEIEELEEKVKELKelkekaeeyikLSEF 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 180 REQHAREKEEMALRSG---QEAAELKEKLRSEMEKNAQTIETLKQDWESERELCLENLRQEL---SLKHQSEMEGLQSQF 253
Cdd:PRK03918 302 YEEYLDELREIEKRLSrleEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELyeeAKAKKEELERLKKRL 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 254 Q-KELSEQKVELEKIFQAKHEAEV----------SLKNLEAQHQAAIKKLQE----------DLQSEHcqyLQDLEQKFR 312
Cdd:PRK03918 382 TgLTPEKLEKELEELEKAKEEIEEeiskitarigELKKEIKELKKAIEELKKakgkcpvcgrELTEEH---RKELLEEYT 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 313 EKEKAKELELETLQASYEDLKAQ---------SQEEIRLLWSQLESMKTNREELNGSWDPVLAQAShlEELEHLRSGFAQ 383
Cdd:PRK03918 459 AELKRIEKELKEIEEKERKLRKElrelekvlkKESELIKLKELAEQLKELEEKLKKYNLEELEKKA--EEYEKLKEKLIK 536
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 384 -----QQQQERAQHESELEHLRVYFEKKLKDAEktyqEDLTVFQQRLQEAREDSLESTEISSSCVLPEETSGREGKEPPD 458
Cdd:PRK03918 537 lkgeiKSLKKELEKLEELKKKLAELEKKLDELE----EELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEK 612
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 459 PLdlqlgqpKVQESLVEDCQVKLSKAEEKIQQMKEEFQKKEAEWELSREELKREAEERLASMFLELREKAESEKLSIISR 538
Cdd:PRK03918 613 EL-------EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEEL 685
|
490
....*....|...
gi 1039732403 539 FEHRESSMRHLQD 551
Cdd:PRK03918 686 EKRREEIKKTLEK 698
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2150-2459 |
2.52e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 43.57 E-value: 2.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2150 QEQGDPQKVQDHLCLSDRSSLLAEIQAlRAQLRMTHLQNQEKLQQLCAALTSTEARGSQREHQLRRqVELLAYKVEQEKC 2229
Cdd:pfam17380 288 QQQEKFEKMEQERLRQEKEEKAREVER-RRKLEEAEKARQAEMDRQAAIYAEQERMAMERERELER-IRQEERKRELERI 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2230 IANELQKTLSKEQETasdvrKRLVVE-QNAVQDLKSELHACKQENTSLLESLDKVQQEVLRLRAVLDGKEKELKVVLEEL 2308
Cdd:pfam17380 366 RQEEIAMEISRMREL-----ERLQMErQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRL 440
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2309 ESERGKGQALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQlAQNNQLCVALKH---ERAAKDNLQKELQIEASR--CE 2383
Cdd:pfam17380 441 EEERAREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKE-KRDRKRAEEQRRkilEKELEERKQAMIEEERKRklLE 519
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039732403 2384 ALLAQEKGQLSELQKSLEAERSRSLElsEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTRALELEA 2459
Cdd:pfam17380 520 KEMEERQKAIYEEERRREAEEERRKQ--QEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARAEYEA 593
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
852-1463 |
2.54e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.37 E-value: 2.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 852 AHTVEMQKSQAELAKPQELQASQDQVAQVRdKVFLLNRELEECRAELEQLQQRRERenqegttlicmLRADLELAQGEgk 931
Cdd:COG4913 259 ELAERYAAARERLAELEYLRAALRLWFAQR-RLELLEAELEELRAELARLEAELER-----------LEARLDALREE-- 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 932 aLRDALRRLLDLFGDTLKAavtLKSRISERAGLLLDHEDAADTSDARLAAAALGDMWSDEGLLEIDRTLPEGAEtssvce 1011
Cdd:COG4913 325 -LDELEAQIRGNGGDRLEQ---LEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLE------ 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1012 isshvcesffispentldceqpirrvyqslstavegllemaldsskqlEEARQLHRCVEREFRHRNEEmAQAMQKQQELL 1091
Cdd:COG4913 395 ------------------------------------------------ALEEELEALEEALAEAEAAL-RDLRRELRELE 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1092 ERLREESAAKDRLALELHTAKGLLEgfkvekvdlqEALGKKEeseqqlilelEDLRkqlqqAARELLTLKEEKS------ 1165
Cdd:COG4913 426 AEIASLERRKSNIPARLLALRDALA----------EALGLDE----------AELP-----FVGELIEVRPEEErwrgai 480
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1166 --VLWNQKETLTneakereagspVLPAHKDTALqEEVESL---TRVQWE-------SRKQSEKDRATLLSQMRVLESELE 1233
Cdd:COG4913 481 erVLGGFALTLL-----------VPPEHYAAAL-RWVNRLhlrGRLVYErvrtglpDPERPRLDPDSLAGKLDFKPHPFR 548
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1234 D---QLVQHRGCAQLAEEVATLKQQ-----LAALDKHLRSQRQfMDDQAAER---------EHEREEFQQEIQRLEGQLR 1296
Cdd:COG4913 549 AwleAELGRRFDYVCVDSPEELRRHpraitRAGQVKGNGTRHE-KDDRRRIRsryvlgfdnRAKLAALEAELAELEEELA 627
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1297 QAarprppgprDSQCVQLDEEVELLQEKLR--EKLDGFNELVIKKDFADQQLLIQEEEIKRLEETN---ASIQRQMVQLQ 1371
Cdd:COG4913 628 EA---------EERLEALEAELDALQERREalQRLAEYSWDEIDVASAEREIAELEAELERLDASSddlAALEEQLEELE 698
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1372 EELEKQKKSMEELKEKEILKQENMGDLL-LTTVSRSGLDEAGcpmlPQGSSSRGPEAQPDVTERALLQHENEVVHRRNSE 1450
Cdd:COG4913 699 AELEELEEELDELKGEIGRLEKELEQAEeELDELQDRLEAAE----DLARLELRALLEERFAAALGDAVERELRENLEER 774
|
650
....*....|...
gi 1039732403 1451 IDELKSLIENLQE 1463
Cdd:COG4913 775 IDALRARLNRAEE 787
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1085-1489 |
2.81e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 43.47 E-value: 2.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1085 QKQQELLERLREESAAKDRLALELHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQLQQAARELLTLKEEK 1164
Cdd:TIGR04523 225 KQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQK 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1165 SVLWNQ--KETLTNEAKE-REAGSPVLPAHKD-TALQEEVESLTRvqweSRKQSEKDRATLLSQMRVLESELEDQLVQHr 1240
Cdd:TIGR04523 305 EQDWNKelKSELKNQEKKlEEIQNQISQNNKIiSQLNEQISQLKK----ELTNSESENSEKQRELEEKQNEIEKLKKEN- 379
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1241 gcAQLAEEVATLKQQLAALDKHLRSQR---QFMDDQAAEREHEREEFQQEIQRLEgqlrqaarprppgprdSQCVQLDEE 1317
Cdd:TIGR04523 380 --QSYKQEIKNLESQINDLESKIQNQEklnQQKDEQIKKLQQEKELLEKEIERLK----------------ETIIKNNSE 441
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1318 VELLQEKLREKLDGFNELVIKKDfadqqllIQEEEIKRLEETNASIQRQMVQLQEELEKQKKSMEELKEKEILKQENMGD 1397
Cdd:TIGR04523 442 IKDLTNQDSVKELIIKNLDNTRE-------SLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKD 514
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1398 LLlttvsrsgldeagcpmlPQGSSSRgpeaqpdVTERALLQHENEVVHRRNSEIDELKSLIENLQENQRQLQKD-KAEEI 1476
Cdd:TIGR04523 515 LT-----------------KKISSLK-------EKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDeKNKEI 570
|
410
....*....|...
gi 1039732403 1477 EQLHEVIEKLQSE 1489
Cdd:TIGR04523 571 EELKQTQKSLKKK 583
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
2139-2465 |
5.12e-03 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 42.41 E-value: 5.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2139 NSLVQRLEkVIQEQGDPQKVQDHLCLSDRSSLLAEIQA-LRAQLRMTHLQNQEKLQQLCAALTS-TEARG-----SQREH 2211
Cdd:pfam15921 295 NSIQSQLE-IIQEQARNQNSMYMRQLSDLESTVSQLRSeLREAKRMYEDKIEELEKQLVLANSElTEARTerdqfSQESG 373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2212 QLRRQVE-LLA--YKVEQEKCIANELQKTL----SKEQETASDVRKRLVVEQNAVQDLKSELHACKQE-------NTSLL 2277
Cdd:pfam15921 374 NLDDQLQkLLAdlHKREKELSLEKEQNKRLwdrdTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSEcqgqmerQMAAI 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2278 ESLDKVQQEVLRLRAVLDGKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREgqssRALEELKLSLEKQLAQNNQL 2357
Cdd:pfam15921 454 QGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKE----RAIEATNAEITKLRSRVDLK 529
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2358 CVALKHERAAKDNLQkELQIEasrCEALlaqeKGQLSELQKSLEAERSRSLELSEAL-QHERLLTEQLSRNSQ---EACA 2433
Cdd:pfam15921 530 LQELQHLKNEGDHLR-NVQTE---CEAL----KLQMAEKDKVIEILRQQIENMTQLVgQHGRTAGAMQVEKAQlekEIND 601
|
330 340 350
....*....|....*....|....*....|....*
gi 1039732403 2434 RQETQVQHALLRKLKAEKTRALELEAM---LEKVQ 2465
Cdd:pfam15921 602 RRLELQEFKILKDKKDAKIRELEARVSdleLEKVK 636
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
144-294 |
6.32e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 6.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 144 KEKETALTELREMLNGRRAQELALLQSRQQCELELLREQHAREKEEMALRSGQ--EAAELKEKLRSEMEKN-AQTIETLK 220
Cdd:COG4913 265 AAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARldALREELDELEAQIRGNgGDRLEQLE 344
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 221 QD---WESERELCLENLRQ--------ELSLKH-QSEMEGLQSQFQKELSEQKVELEKIFQAKHEAEVSLKNLEAQHQAA 288
Cdd:COG4913 345 REierLERELEERERRRARleallaalGLPLPAsAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELREL 424
|
....*.
gi 1039732403 289 IKKLQE 294
Cdd:COG4913 425 EAEIAS 430
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
73-399 |
8.40e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 41.95 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 73 PSCSNTLEGTRGASAAQEQ----EDCELDVTDLQGQQQTQPPPPQTAHSL---ELEALRLSLNNMHTAQLELTQANLQKE 145
Cdd:PRK02224 455 PECGQPVEGSPHVETIEEDrervEELEAELEDLEEEVEEVEERLERAEDLveaEDRIERLEERREDLEELIAERRETIEE 534
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 146 KETALTELREmlngrRAQELallqsrqQCELELLREQhAREKEEMALRSGQEAAELKEKLrsemEKNAQTIETLKQDWES 225
Cdd:PRK02224 535 KRERAEELRE-----RAAEL-------EAEAEEKREA-AAEAEEEAEEAREEVAELNSKL----AELKERIESLERIRTL 597
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 226 ERELclENLRQELS-----LKHQSEMEGLQSQFQKELSEQKVELEKIFQAkheaevslknleaqhqAAIKKLQEDLQSEH 300
Cdd:PRK02224 598 LAAI--ADAEDEIErlrekREALAELNDERRERLAEKRERKRELEAEFDE----------------ARIEEAREDKERAE 659
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 301 cQYLQDLEQKFREKEKAKEleletlqasyeDLkaqsQEEIRLLWSQLESMKTNREELNGSWDPVLAQASHLEELEHLRSG 380
Cdd:PRK02224 660 -EYLEQVEEKLDELREERD-----------DL----QAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESM 723
|
330
....*....|....*....
gi 1039732403 381 FAQQQQQERAQHESELEHL 399
Cdd:PRK02224 724 YGDLRAELRQRNVETLERM 742
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1070-1386 |
8.51e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 41.75 E-value: 8.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1070 EREFRHRNEEMAQAMQKQQELLERLREEsaakdrlaleLHTAKGLLEGFKVEKVDLQEALGKKEESEQQLILELEDLRKQ 1149
Cdd:pfam12128 599 EEELRERLDKAEEALQSAREKQAAAEEQ----------LVQANGELEKASREETFARTALKNARLDLRRLFDEKQSEKDK 668
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1150 LQQAarelltLKEEKSVLWNQKETLTNEAKEREAGspvlpaHKDtALQEEvesltrvqwesRKQSEKDRATLLSQMRVLE 1229
Cdd:pfam12128 669 KNKA------LAERKDSANERLNSLEAQLKQLDKK------HQA-WLEEQ-----------KEQKREARTEKQAYWQVVE 724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1230 SELEDQLvqhrgcAQLAEEVATLKQQLAALDKHLRSQRQFMDDQAAEREHEREEFQQEIQRLEGQLRQAARPRPPGPR-- 1307
Cdd:pfam12128 725 GALDAQL------ALLKAAIAARRSGAKAELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQEVLRyf 798
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 1308 DSQCVQLDEEVELLQEKLREKLDGFNELvikkdfaDQQLLIQEEEIKR----LEETNASIQRQMVQLQEELEKQKKSMEE 1383
Cdd:pfam12128 799 DWYQETWLQRRPRLATQLSNIERAISEL-------QQQLARLIADTKLrrakLEMERKASEKQQVRLSENLRGLRCEMSK 871
|
...
gi 1039732403 1384 LKE 1386
Cdd:pfam12128 872 LAT 874
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
2297-2516 |
8.92e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 41.88 E-value: 8.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2297 KEKELKVVLEELESERgkgqALQAQQEEQQLRYLQREGQSSRALEELKLSLEKQLAQNNQLCVA-LKHERAAKDNLQKEL 2375
Cdd:pfam02463 171 KKEALKKLIEETENLA----ELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDyLKLNEERIDLLQELL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2376 QIEASRCEALLAQEKGQLSELQKSLEAERSRslELSEALQHERLLTEQLSRNSQEACARQETQVQHALLRKLKAEKTRAL 2455
Cdd:pfam02463 247 RDEQEEIESSKQEIEKEEEKLAQVLKENKEE--EKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKK 324
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039732403 2456 ELEAMLEKVQKQAAHTQQQLEAQAQERcvELRREKERELEIQRQRDEHKIEQLQRLVRELR 2516
Cdd:pfam02463 325 KAEKELKKEKEEIEELEKELKELEIKR--EAEEEEEEELEKLQEKLEQLEEELLAKKKLES 383
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
2190-2443 |
9.15e-03 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 41.58 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2190 EKLQQLCAALTS-TEARGSQREHQLRRQVellaykVEQEKCIANELQKTLSKEQETASDVRKRLvveqnAVQDLKSELha 2268
Cdd:PRK10929 45 EIVEALQSALNWlEERKGSLERAKQYQQV------IDNFPKLSAELRQQLNNERDEPRSVPPNM-----STDALEQEI-- 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2269 cKQENTSLLESLDKVQQEVLRLRAVLD------GKEKELKVVLEELESERGKGQALQAQQEEQQLRYLQREGQSSRA-LE 2341
Cdd:PRK10929 112 -LQVSSQLLEKSRQAQQEQDRAREISDslsqlpQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTALQAESAALKAlVD 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039732403 2342 ELKLSlekQLAQNN-QLCVALKHERAAKDNLQKELQIEASRC--------EA--------LLAQEKGQLSE-LQKSLEAE 2403
Cdd:PRK10929 191 ELELA---QLSANNrQELARLRSELAKKRSQQLDAYLQALRNqlnsqrqrEAeralesteLLAEQSGDLPKsIVAQFKIN 267
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1039732403 2404 RSRSLELSEALQHERLLTEQlsrnsQEACARQETQVQHAL 2443
Cdd:PRK10929 268 RELSQALNQQAQRMDLIASQ-----QRQAASQTLQVRQAL 302
|
|
|