NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|601984298|ref|NP_001278162|]
View 

homeobox protein cut-like 1 isoform c [Mus musculus]

Protein Classification

CUT and homeodomain domain-containing protein( domain architecture ID 13530023)

CUT and homeodomain domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1215-1292 6.37e-31

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


:

Pssm-ID: 460543  Cd Length: 78  Bit Score: 116.54  E-value: 6.37e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  1215 MSPELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREpFVRMQLWLNDPNNVEKLM 1292
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERMEI 77
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1032-1100 4.59e-28

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


:

Pssm-ID: 460543  Cd Length: 78  Bit Score: 108.45  E-value: 4.59e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  1032 MYQEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREpFIRMQLWLN 1100
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLS 68
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
643-719 1.99e-27

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


:

Pssm-ID: 460543  Cd Length: 78  Bit Score: 106.91  E-value: 1.99e-27
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298   643 EGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEpFHKMKQFLSDEQNILAL 719
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERME 76
Homeodomain pfam00046
Homeodomain;
1338-1394 3.27e-16

Homeodomain;


:

Pssm-ID: 459649 [Multi-domain]  Cd Length: 57  Bit Score: 74.07  E-value: 3.27e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRR 1394
Cdd:pfam00046    1 RRKRTTFTPEQLEELEKEFQENPYPSAEEREELAAQLGLTERQVKVWFQNRRAKWKR 57
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
118-454 1.60e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.33  E-value: 1.60e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   118 NRQDESEQSRKRLIEQSREfKKNTPEDLRKQVAPL-LKSFQGEIDALSKRSKEAEAAFLTVYKRLidvpdpvpaldvgQQ 196
Cdd:TIGR02168  199 ERQLKSLERQAEKAERYKE-LKAELRELELALLVLrLEELREELEELQEELKEAEEELEELTAEL-------------QE 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLhdiETENQKLRETLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQAETIALEKEQK--LQNDFAE 270
Cdd:TIGR02168  265 LEEKLEEL---RLEVSELEEEIEELQKELYALANEisrlEQQKQILRERLANLERQLEELEAQLEELESKLdeLAEELAE 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   271 KERKLQETQMSTTS---KLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD---EIEMIMTDLERANQRAEVAQR 344
Cdd:TIGR02168  342 LEEKLEELKEELESleaELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASlnnEIERLEARLERLEDRRERLQQ 421
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   345 EAETLREQLSSANhslqlasqiqkapdveqaievltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSA---SQIS 421
Cdd:TIGR02168  422 EIEELLKKLEEAE-----------------------LKELQAELEELEEELEELQEELERLEEALEELREELEeaeQALD 478
                          330       340       350
                   ....*....|....*....|....*....|...
gi 601984298   422 QLEQQLNAKNSTLKQLEEKLKGQADYEEVKKEL 454
Cdd:TIGR02168  479 AAERELAQLQARLDSLERLQENLEGFSEGVKAL 511
PHA03247 super family cl33720
large tegument protein UL36; Provisional
843-1026 2.49e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  843 SASPMSTVSTYPPLAISLKKTPAAPETST--AALPSAPALKKEAQDVPTLDPPGSADAAQGVLRPMksELVRGSTWKDPW 920
Cdd:PHA03247  269 PETARGATGPPPPPEAAAPNGAAAPPDGVwgAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAM--EVVSPLPRPRQH 346
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  921 WSPIQPERRNLTSSEETKADETTASGKERAGSSQPRAERsqlqgpsasaeywKEWPSAESPYSQSSELSLTGASRSETPQ 1000
Cdd:PHA03247  347 YPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKR-------------RSARHAATPFARGPGGDDQTRPAAPVPA 413
                         170       180
                  ....*....|....*....|....*.
gi 601984298 1001 NSPLPSSPIVPMAKPAKPSVPPLTPE 1026
Cdd:PHA03247  414 SVPTPAPTPVPASAPPPPATPLPSAE 439
 
Name Accession Description Interval E-value
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1215-1292 6.37e-31

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 116.54  E-value: 6.37e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  1215 MSPELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREpFVRMQLWLNDPNNVEKLM 1292
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERMEI 77
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1032-1100 4.59e-28

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 108.45  E-value: 4.59e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  1032 MYQEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREpFIRMQLWLN 1100
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLS 68
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
643-719 1.99e-27

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 106.91  E-value: 1.99e-27
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298   643 EGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEpFHKMKQFLSDEQNILAL 719
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERME 76
Homeodomain pfam00046
Homeodomain;
1338-1394 3.27e-16

Homeodomain;


Pssm-ID: 459649 [Multi-domain]  Cd Length: 57  Bit Score: 74.07  E-value: 3.27e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRR 1394
Cdd:pfam00046    1 RRKRTTFTPEQLEELEKEFQENPYPSAEEREELAAQLGLTERQVKVWFQNRRAKWKR 57
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
118-454 1.60e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.33  E-value: 1.60e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   118 NRQDESEQSRKRLIEQSREfKKNTPEDLRKQVAPL-LKSFQGEIDALSKRSKEAEAAFLTVYKRLidvpdpvpaldvgQQ 196
Cdd:TIGR02168  199 ERQLKSLERQAEKAERYKE-LKAELRELELALLVLrLEELREELEELQEELKEAEEELEELTAEL-------------QE 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLhdiETENQKLRETLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQAETIALEKEQK--LQNDFAE 270
Cdd:TIGR02168  265 LEEKLEEL---RLEVSELEEEIEELQKELYALANEisrlEQQKQILRERLANLERQLEELEAQLEELESKLdeLAEELAE 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   271 KERKLQETQMSTTS---KLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD---EIEMIMTDLERANQRAEVAQR 344
Cdd:TIGR02168  342 LEEKLEELKEELESleaELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASlnnEIERLEARLERLEDRRERLQQ 421
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   345 EAETLREQLSSANhslqlasqiqkapdveqaievltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSA---SQIS 421
Cdd:TIGR02168  422 EIEELLKKLEEAE-----------------------LKELQAELEELEEELEELQEELERLEEALEELREELEeaeQALD 478
                          330       340       350
                   ....*....|....*....|....*....|...
gi 601984298   422 QLEQQLNAKNSTLKQLEEKLKGQADYEEVKKEL 454
Cdd:TIGR02168  479 AAERELAQLQARLDSLERLQENLEGFSEGVKAL 511
homeodomain cd00086
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic ...
1338-1395 1.20e-13

Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.


Pssm-ID: 238039 [Multi-domain]  Cd Length: 59  Bit Score: 66.88  E-value: 1.20e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298 1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRRE 1395
Cdd:cd00086     1 RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQVKIWFQNRRAKLKRS 58
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
196-449 2.04e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 75.74  E-value: 2.04e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  196 QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREyEQTLKSQAETIALEKEQKLQNDFAEKERKL 275
Cdd:COG1196   226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRL-ELEELELELEEAQAEEYELLAELARLEQDI 304
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  276 ---QETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMImtdLERANQRAEVAQREAETLREQ 352
Cdd:COG1196   305 arlEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAEL---AEAEEALLEAEAELAEAEEEL 381
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  353 LSSANHSLQLASQIQKAPDVEQAIEvLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNS 432
Cdd:COG1196   382 EELAEELLEALRAAAELAAQLEELE-EAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
                         250
                  ....*....|....*..
gi 601984298  433 TLKQLEEKLKGQADYEE 449
Cdd:COG1196   461 LLELLAELLEEAALLEA 477
HOX smart00389
Homeodomain; DNA-binding factors that are involved in the transcriptional regulation of key ...
1338-1393 7.10e-13

Homeodomain; DNA-binding factors that are involved in the transcriptional regulation of key developmental processes


Pssm-ID: 197696 [Multi-domain]  Cd Length: 57  Bit Score: 64.58  E-value: 7.10e-13
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIR 1393
Cdd:smart00389    2 RRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQVKVWFQNRRAKWK 57
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
104-460 4.22e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 71.25  E-value: 4.22e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  104 KLKRELDAtatvLANRQDESEQSRKRLIEQSREFKKNTpEDLRKQV----------------------APLLKSFQGEID 161
Cdd:PRK03918  388 KLEKELEE----LEKAKEEIEEEISKITARIGELKKEI-KELKKAIeelkkakgkcpvcgrelteehrKELLEEYTAELK 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  162 ALSKRSKEAEAAFLTVYKRLIDVpdpvpalDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEvtIKALKEK 241
Cdd:PRK03918  463 RIEKELKEIEEKERKLRKELREL-------EKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEE--YEKLKEK 533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  242 IREYEQTLKSQAETiaLEKEQKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQ-TALEKTRTELFDLKTKYDEETTA 320
Cdd:PRK03918  534 LIKLKGEIKSLKKE--LEKLEELKKKLAELEKKLDELE----EELAELLKELEELGfESVEELEERLKELEPFYNEYLEL 607
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  321 KADEIEmimtdLERANQRAEVAQREAETLREQLSSANHSLQlasqiqkapDVEQAIEVLTRSSLEVELAAKEREIAQLVE 400
Cdd:PRK03918  608 KDAEKE-----LEREEKELKKLEEELDKAFEELAETEKRLE---------ELRKELEELEKKYSEEEYEELREEYLELSR 673
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  401 DVQRLQAsltklrensasQISQLEQQLNAKNSTLKQLEEKLKgqaDYEEVKKELNTLKSM 460
Cdd:PRK03918  674 ELAGLRA-----------ELEELEKRREEIKKTLEKLKEELE---EREKAKKELEKLEKA 719
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
103-513 7.10e-09

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 61.01  E-value: 7.10e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   103 AKLKRELDATATVLANRQDESEqsrkRLIEQSREFKKNTPEDlRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTV---YK 179
Cdd:pfam12128  304 DELNGELSAADAAVAKDRSELE----ALEDQHGAFLDADIET-AAADQEQLPSWQSELENLEERLKALTGKHQDVtakYN 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   180 RLIdvpdpvpaLDVGQQLEIKVQRLHDiETENQKlretlEEYNKEFAEVKNQ-EVTIKALKEKIREYEQTLKSQAETIAL 258
Cdd:pfam12128  379 RRR--------SKIKEQNNRDIAGIKD-KLAKIR-----EARDRQLAVAEDDlQALESELREQLEAGKLEFNEEEYRLKS 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   259 E-KEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAkadeiemimtdLERANQ 337
Cdd:pfam12128  445 RlGELKLRLNQATATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEA-----------LRQASR 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   338 RAEVAQREAETLREQLSSANHSLqLASQIQKAPDVEQAI------EVLTRSSLEVEL----AAKEREIAQLVEDVQRLQA 407
Cdd:pfam12128  514 RLEERQSALDELELQLFPQAGTL-LHFLRKEAPDWEQSIgkvispELLHRTDLDPEVwdgsVGGELNLYGVKLDLKRIDV 592
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   408 -SLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLkGQAdyeevKKELNTLK-SMEFAPSEGAGTQDSTKPL----EVLL 481
Cdd:pfam12128  593 pEWAASEEELRERLDKAEEALQSAREKQAAAEEQL-VQA-----NGELEKASrEETFARTALKNARLDLRRLfdekQSEK 666
                          410       420       430
                   ....*....|....*....|....*....|..
gi 601984298   482 LEKNRSLQSENATLRISNSDLSGSARRKGRDQ 513
Cdd:pfam12128  667 DKKNKALAERKDSANERLNSLEAQLKQLDKKH 698
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
143-335 7.42e-08

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 55.84  E-value: 7.42e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  143 EDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfLTVYKRLIDvpdpvpalDVGQQLEIKVQRLHDI-ETENQKL-RETLEE 220
Cdd:cd22656   113 EEAKKTIKALLDDLLKEAKKYQDKAAKVVDK-LTDFENQTE--------KDQTALETLEKALKDLlTDEGGAIaRKEIKD 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  221 YNKEFAevKNQEVTIKALKEKIREYEQTLKSQAETIAleKEQKLQNDFaekerKLQETQM-STTSKLEEAEHKLQTLQTA 299
Cdd:cd22656   184 LQKELE--KLNEEYAAKLKAKIDELKALIADDEAKLA--AALRLIADL-----TAADTDLdNLLALIGPAIPALEKLQGA 254
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 601984298  300 LEKTRTELFDLKTKYDEETTaKADEIEMIMTDLERA 335
Cdd:cd22656   255 WQAIATDLDSLKDLLEDDIS-KIPAAILAKLELEKA 289
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
281-628 2.48e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 51.75  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  281 STTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEettaKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL 360
Cdd:COG3883    20 AKQKELSELQAELEAAQAELDALQAELEELNEEYNE----LQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARAL 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  361 Q-------LASQIQKAPDVE------QAIEVLTRSSLEV--ELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQ 425
Cdd:COG3883    96 YrsggsvsYLDVLLGSESFSdfldrlSALSKIADADADLleELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEA 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  426 QLNAKNSTLKQLEEKlkgQADYEEVKKEL-NTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSG 504
Cdd:COG3883   176 QQAEQEALLAQLSAE---EAAAEAQLAELeAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAA 252
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  505 SARRKGRDQPESRRPGPLPASPPPQLPRNTGEQVSNTNGTHHFSPAGLSQDFFSSNLASPSLPLASTGKFALNSLLQRQL 584
Cdd:COG3883   253 GAAGAAAGSAGAAGAAAGAAGAGAAAASAAGGGAGGAGGGGGGGGAASGGSGGGSGGAGGVGSGGGAGAVVGGASAGGGG 332
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 601984298  585 MQSFYSKAMQEAGSTSTIFSTGPYSTNSISSPSPLQQSPDVNGM 628
Cdd:COG3883   333 GSGGGGGSSGGGSGGGGGGGGGGGGSSSGGGGGGVGLSVGGGYV 376
PHA03247 PHA03247
large tegument protein UL36; Provisional
843-1026 2.49e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  843 SASPMSTVSTYPPLAISLKKTPAAPETST--AALPSAPALKKEAQDVPTLDPPGSADAAQGVLRPMksELVRGSTWKDPW 920
Cdd:PHA03247  269 PETARGATGPPPPPEAAAPNGAAAPPDGVwgAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAM--EVVSPLPRPRQH 346
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  921 WSPIQPERRNLTSSEETKADETTASGKERAGSSQPRAERsqlqgpsasaeywKEWPSAESPYSQSSELSLTGASRSETPQ 1000
Cdd:PHA03247  347 YPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKR-------------RSARHAATPFARGPGGDDQTRPAAPVPA 413
                         170       180
                  ....*....|....*....|....*.
gi 601984298 1001 NSPLPSSPIVPMAKPAKPSVPPLTPE 1026
Cdd:PHA03247  414 SVPTPAPTPVPASAPPPPATPLPSAE 439
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
203-312 2.09e-05

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 48.48  E-value: 2.09e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298    203 RLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAET---IALEKEQKLQNDFA---EKERKLQ 276
Cdd:smart00787  138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQlkqLEDELEDCDPTELDrakEKLKKLL 217
                            90       100       110
                    ....*....|....*....|....*....|....*.
gi 601984298    277 ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKT 312
Cdd:smart00787  218 QEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253
COG5576 COG5576
Homeodomain-containing transcription factor [Transcription];
1338-1423 4.53e-05

Homeodomain-containing transcription factor [Transcription];


Pssm-ID: 227863 [Multi-domain]  Cd Length: 156  Bit Score: 45.51  E-value: 4.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298 1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRRelfieeiqagsQGQAGASDSPSA 1417
Cdd:COG5576    52 KSKRRRTTDEQLMVLEREFEINPYPSSITRIKLSLLLNMPPKSVQIWFQNKRAKEKK-----------KRSGKVEQRPGE 120

                  ....*.
gi 601984298 1418 RSSRAA 1423
Cdd:COG5576   121 EEADLA 126
 
Name Accession Description Interval E-value
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1215-1292 6.37e-31

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 116.54  E-value: 6.37e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  1215 MSPELDTYGITKRVKEVLTDNNLGQRLFGETILGLTQGSVSDLLARPKPWHKLSLKGREpFVRMQLWLNDPNNVEKLM 1292
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERMEI 77
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
1032-1100 4.59e-28

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 108.45  E-value: 4.59e-28
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  1032 MYQEVDTIELTRQVKEKLAKNGICQRIFGEKVLGLSQGSVSDMLSRPKPWSKLTQKGREpFIRMQLWLN 1100
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLS 68
CUT pfam02376
CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in ...
643-719 1.99e-27

CUT domain; The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein.


Pssm-ID: 460543  Cd Length: 78  Bit Score: 106.91  E-value: 1.99e-27
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298   643 EGEEIDTAEIARQVKEQLIKHNIGQRIFGHYVLGLSQGSVSEILARPKPWNKLTVRGKEpFHKMKQFLSDEQNILAL 719
Cdd:pfam02376    1 DSEELDTKDIADRIKEELKRHKISQAVFAEKVLNRSQGTLSDLLRKPKPWEKLKSGGRT-FIRMYNWLSLPEDERME 76
Homeodomain pfam00046
Homeodomain;
1338-1394 3.27e-16

Homeodomain;


Pssm-ID: 459649 [Multi-domain]  Cd Length: 57  Bit Score: 74.07  E-value: 3.27e-16
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRR 1394
Cdd:pfam00046    1 RRKRTTFTPEQLEELEKEFQENPYPSAEEREELAAQLGLTERQVKVWFQNRRAKWKR 57
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
118-454 1.60e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 79.33  E-value: 1.60e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   118 NRQDESEQSRKRLIEQSREfKKNTPEDLRKQVAPL-LKSFQGEIDALSKRSKEAEAAFLTVYKRLidvpdpvpaldvgQQ 196
Cdd:TIGR02168  199 ERQLKSLERQAEKAERYKE-LKAELRELELALLVLrLEELREELEELQEELKEAEEELEELTAEL-------------QE 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLhdiETENQKLRETLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQAETIALEKEQK--LQNDFAE 270
Cdd:TIGR02168  265 LEEKLEEL---RLEVSELEEEIEELQKELYALANEisrlEQQKQILRERLANLERQLEELEAQLEELESKLdeLAEELAE 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   271 KERKLQETQMSTTS---KLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD---EIEMIMTDLERANQRAEVAQR 344
Cdd:TIGR02168  342 LEEKLEELKEELESleaELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASlnnEIERLEARLERLEDRRERLQQ 421
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   345 EAETLREQLSSANhslqlasqiqkapdveqaievltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSA---SQIS 421
Cdd:TIGR02168  422 EIEELLKKLEEAE-----------------------LKELQAELEELEEELEELQEELERLEEALEELREELEeaeQALD 478
                          330       340       350
                   ....*....|....*....|....*....|...
gi 601984298   422 QLEQQLNAKNSTLKQLEEKLKGQADYEEVKKEL 454
Cdd:TIGR02168  479 AAERELAQLQARLDSLERLQENLEGFSEGVKAL 511
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
192-462 4.86e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 77.79  E-value: 4.86e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   192 DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVT----IKALKEKIREYEQtlKSQAETIALEKEQKLQND 267
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEElsrqISALRKDLARLEA--EVEQLEERIAQLSKELTE 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   268 FAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDE----------ETTAKADEIEMIMTDLERANQ 337
Cdd:TIGR02168  759 LEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDElraeltllneEAANLRERLESLERRIAATER 838
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   338 RAEVAQREAETLREQLSSANHSLqlasqiqkapdveqaievltrSSLEVELAAKEREIAQLVEDVQRLQASLTKLRE--- 414
Cdd:TIGR02168  839 RLEDLEEQIEELSEDIESLAAEI---------------------EELEELIEELESELEALLNERASLEEALALLRSele 897
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   415 -------NSASQISQLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNTLKSMEF 462
Cdd:TIGR02168  898 elseelrELESKRSELRRELEELREKLAQLELRLEGlEVRIDNLQERLSEEYSLTL 953
homeodomain cd00086
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic ...
1338-1395 1.20e-13

Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.


Pssm-ID: 238039 [Multi-domain]  Cd Length: 59  Bit Score: 66.88  E-value: 1.20e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298 1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRRE 1395
Cdd:cd00086     1 RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQVKIWFQNRRAKLKRS 58
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
196-449 2.04e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 75.74  E-value: 2.04e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  196 QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREyEQTLKSQAETIALEKEQKLQNDFAEKERKL 275
Cdd:COG1196   226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRL-ELEELELELEEAQAEEYELLAELARLEQDI 304
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  276 ---QETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMImtdLERANQRAEVAQREAETLREQ 352
Cdd:COG1196   305 arlEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAEL---AEAEEALLEAEAELAEAEEEL 381
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  353 LSSANHSLQLASQIQKAPDVEQAIEvLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNS 432
Cdd:COG1196   382 EELAEELLEALRAAAELAAQLEELE-EAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEA 460
                         250
                  ....*....|....*..
gi 601984298  433 TLKQLEEKLKGQADYEE 449
Cdd:COG1196   461 LLELLAELLEEAALLEA 477
HOX smart00389
Homeodomain; DNA-binding factors that are involved in the transcriptional regulation of key ...
1338-1393 7.10e-13

Homeodomain; DNA-binding factors that are involved in the transcriptional regulation of key developmental processes


Pssm-ID: 197696 [Multi-domain]  Cd Length: 57  Bit Score: 64.58  E-value: 7.10e-13
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIR 1393
Cdd:smart00389    2 RRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQVKVWFQNRRAKWK 57
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
104-460 4.22e-12

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 71.25  E-value: 4.22e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  104 KLKRELDAtatvLANRQDESEQSRKRLIEQSREFKKNTpEDLRKQV----------------------APLLKSFQGEID 161
Cdd:PRK03918  388 KLEKELEE----LEKAKEEIEEEISKITARIGELKKEI-KELKKAIeelkkakgkcpvcgrelteehrKELLEEYTAELK 462
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  162 ALSKRSKEAEAAFLTVYKRLIDVpdpvpalDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEvtIKALKEK 241
Cdd:PRK03918  463 RIEKELKEIEEKERKLRKELREL-------EKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEE--YEKLKEK 533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  242 IREYEQTLKSQAETiaLEKEQKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQ-TALEKTRTELFDLKTKYDEETTA 320
Cdd:PRK03918  534 LIKLKGEIKSLKKE--LEKLEELKKKLAELEKKLDELE----EELAELLKELEELGfESVEELEERLKELEPFYNEYLEL 607
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  321 KADEIEmimtdLERANQRAEVAQREAETLREQLSSANHSLQlasqiqkapDVEQAIEVLTRSSLEVELAAKEREIAQLVE 400
Cdd:PRK03918  608 KDAEKE-----LEREEKELKKLEEELDKAFEELAETEKRLE---------ELRKELEELEKKYSEEEYEELREEYLELSR 673
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  401 DVQRLQAsltklrensasQISQLEQQLNAKNSTLKQLEEKLKgqaDYEEVKKELNTLKSM 460
Cdd:PRK03918  674 ELAGLRA-----------ELEELEKRREEIKKTLEKLKEELE---EREKAKKELEKLEKA 719
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
208-496 6.61e-12

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 70.86  E-value: 6.61e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   208 ETENqKLRET---LEEYNKEFAEVKNQEVTIKALKEKIREYeQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTS 284
Cdd:TIGR02168  176 ETER-KLERTrenLDRLEDILNELERQLKSLERQAEKAERY-KELKAELRELELALLVLRLEELREELEELQEELKEAEE 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   285 KLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETT---AKADEIEMIMTDLERANQRAEVAQREAETLREQLssanhsLQ 361
Cdd:TIGR02168  254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQKelyALANEISRLEQQKQILRERLANLERQLEELEAQL------EE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   362 LASQIQKAPDVEQAIEVLTrSSLEVELAAKEREIAQLVEDVQ---RLQASLTKLRENSASQISQLEQQLNAKNSTLKQLE 438
Cdd:TIGR02168  328 LESKLDELAEELAELEEKL-EELKEELESLEAELEELEAELEeleSRLEELEEQLETLRSKVAQLELQIASLNNEIERLE 406
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 601984298   439 EKLKGQAD-----YEEVKKELNTLKSMEFApsEGAGTQDSTKPLEVLLLEKNRSLQSENATLR 496
Cdd:TIGR02168  407 ARLERLEDrrerlQQEIEELLKKLEEAELK--ELQAELEELEEELEELQEELERLEEALEELR 467
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
167-455 1.01e-11

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 69.60  E-value: 1.01e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  167 SKEAEAAFLTVYKRLIDVPDPVPALdvgQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVT-----IKALKEK 241
Cdd:COG5185   221 LLEKAKEIINIEEALKGFQDPESEL---EDLAQTSDKLEKLVEQNTDLRLEKLGENAESSKRLNENANnlikqFENTKEK 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  242 IREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYD-EETTA 320
Cdd:COG5185   298 IAEYTKSIDIKKATESLEEQLAAAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVElSKSSE 377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  321 KAD-----------EIEMIMTDLERANQraEVAQREAETLREQLS-SANHSLQLASQIQKAPDVEQAIEVLTRSSLEVEL 388
Cdd:COG5185   378 ELDsfkdtiestkeSLDEIPQNQRGYAQ--EILATLEDTLKAADRqIEELQRQIEQATSSNEEVSKLLNELISELNKVMR 455
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  389 AAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQadYEEVKKELN 455
Cdd:COG5185   456 EADEESQSRLEEAYDEINRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQ--LEGVRSKLD 520
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
143-459 1.30e-11

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 69.71  E-value: 1.30e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   143 EDLRKQVAPLLK---SFQGEIDALSKRSKEAEAAFLTVYKRLidvpdpvpaLDVGQQLEIKVQRLHDIETENQKLRETLE 219
Cdd:TIGR02169  684 EGLKRELSSLQSelrRIENRLDELSQELSDASRKIGEIEKEI---------EQLEQEEEKLKERLEELEEDLSSLEQEIE 754
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   220 EYNKEFAEVknqEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQN--DFAEKERKLQETQmstTSKLEEAEHKLQTLQ 297
Cdd:TIGR02169  755 NVKSELKEL---EARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAelSKLEEEVSRIEAR---LREIEQKLNRLTLEK 828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   298 TALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSS-----ANHSLQLASQIQKAPDV 372
Cdd:TIGR02169  829 EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDlkkerDELEAQLRELERKIEEL 908
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   373 EQAIEVLTR--SSLEVELAAKEREIAQLVEDVQRLQASltklRENSASqISQLEQQLNAKNSTLKQLEE-KLKGQADYEE 449
Cdd:TIGR02169  909 EAQIEKKRKrlSELKAKLEALEEELSEIEDPKGEDEEI----PEEELS-LEDVQAELQRVEEEIRALEPvNMLAIQEYEE 983
                          330
                   ....*....|
gi 601984298   450 VKKELNTLKS 459
Cdd:TIGR02169  984 VLKRLDELKE 993
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
195-459 8.89e-11

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 66.97  E-value: 8.89e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   195 QQLEIKVQRLHDIETENQKLRETLEEYNKEfaeVKNQEVTIKALKEKIreyeQTLKSQAETIALEKEQKLQNDFAE---- 270
Cdd:TIGR04523  246 TEISNTQTQLNQLKDEQNKIKKQLSEKQKE---LEQNNKKIKELEKQL----NQLKSEISDLNNQKEQDWNKELKSelkn 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   271 KERKLQETQmSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmimtDLERANQRaevAQREAETLR 350
Cdd:TIGR04523  319 QEKKLEEIQ-NQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIE----KLKKENQS---YKQEIKNLE 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   351 EQLSSanhslqLASQIQKAPDVEQAIEVLTRsSLEVELAAKEREIAQLVEDVQRLQASLTKLREnsasQISQLEQQLNAK 430
Cdd:TIGR04523  391 SQIND------LESKIQNQEKLNQQKDEQIK-KLQQEKELLEKEIERLKETIIKNNSEIKDLTN----QDSVKELIIKNL 459
                          250       260
                   ....*....|....*....|....*....
gi 601984298   431 NSTLKQLEEKLKgqadyeEVKKELNTLKS 459
Cdd:TIGR04523  460 DNTRESLETQLK------VLSRSINKIKQ 482
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
197-459 1.00e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 67.02  E-value: 1.00e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLHDIETEnqklRETLEEYnkefaevknqevtiKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQ 276
Cdd:TIGR02169  193 IDEKRQQLERLRRE----REKAERY--------------QALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   277 ETQMSTTSKLEEAEHKLQTLQTALEKtrtelfdLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLssA 356
Cdd:TIGR02169  255 KLTEEISELEKRLEEIEQLLEELNKK-------IKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERL--A 325
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   357 NHSLQLASQIQKAPDVEQAIEVLT--RSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSAS---QISQLEQQLNAKN 431
Cdd:TIGR02169  326 KLEAEIDKLLAEIEELEREIEEERkrRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDyreKLEKLKREINELK 405
                          250       260
                   ....*....|....*....|....*....
gi 601984298   432 STLKQL-EEKLKGQADYEEVKKELNTLKS 459
Cdd:TIGR02169  406 RELDRLqEELQRLSEELADLNAAIAGIEA 434
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
212-438 3.45e-10

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 65.32  E-value: 3.45e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  212 QKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALekEQKLQNDFAEKERKLQETQM-STTSKLEEAE 290
Cdd:COG4913   231 VEHFDDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYL--RAALRLWFAQRRLELLEAELeELRAELARLE 308
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  291 HKLQTLQTALEKTRTELFDLKTKYDEettAKADEIEmimtDLERANQRAEVAQREAETLREQLSSANHSLQLAsqiqkAP 370
Cdd:COG4913   309 AELERLEARLDALREELDELEAQIRG---NGGDRLE----QLEREIERLERELEERERRRARLEALLAALGLP-----LP 376
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  371 DVEQaievltrsslevELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLE 438
Cdd:COG4913   377 ASAE------------EFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
103-457 3.51e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 65.09  E-value: 3.51e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNtPEDLR-KQVAPLLKSFQGE-----IDALSKRSKEAEaaflT 176
Cdd:PRK03918  455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKE-SELIKlKELAEQLKELEEKlkkynLEELEKKAEEYE----K 529
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  177 VYKRLIDVPDPVpaldvgQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEV-----TIKALKEKIREYEQtlks 251
Cdd:PRK03918  530 LKEKLIKLKGEI------KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEelgfeSVEELEERLKELEP---- 599
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  252 qaetiALEKEQKLQNDFAEKERKLQEtQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEiemIMTD 331
Cdd:PRK03918  600 -----FYNEYLELKDAEKELEREEKE-LKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELRE---EYLE 670
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  332 LERANQRAEvaqREAETLREQLSSANHSLQlasqiqkapDVEQAIEVLTRSSLEVELAAKERE-IAQLVEDVQRLQAslt 410
Cdd:PRK03918  671 LSRELAGLR---AELEELEKRREEIKKTLE---------KLKEELEEREKAKKELEKLEKALErVEELREKVKKYKA--- 735
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 601984298  411 KLRENSASQISQLEQQL-----NAKNS--TLKQLEEKLKGQADYEEVKKELNTL 457
Cdd:PRK03918  736 LLKERALSKVGEIASEIfeeltEGKYSgvRVKAEENKVKLFVVYQGKERPLTFL 789
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
190-419 4.41e-10

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 63.63  E-value: 4.41e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  190 ALDVGQQLEIKVQRLH-DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIAL--EKEQKLQN 266
Cdd:COG4942    18 QADAAAEAEAELEQLQqEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAEleKEIAELRA 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  267 DFAEKERKLQET-----QMSTTSKLEEAEHKLQTLQTAlekTRTELFDLKTKYDEEttaKADEIEMIMTDLERANQRAEV 341
Cdd:COG4942    98 ELEAQKEELAELlralyRLGRQPPLALLLSPEDFLDAV---RRLQYLKYLAPARRE---QAEELRADLAELAALRAELEA 171
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  342 AQREAETLREQLSSANHSLQLASQIQKApdveqaievlTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ 419
Cdd:COG4942   172 ERAELEALLAELEEERAALEALKAERQK----------LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
195-368 9.26e-10

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 63.25  E-value: 9.26e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERK 274
Cdd:COG4717    74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAE---LEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  275 LQETQMSttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAK----ADEIEMIMTDLERANQRAEVAQREAETLR 350
Cdd:COG4717   151 LEERLEE----LRELEEELEELEAELAELQEELEELLEQLSLATEEElqdlAEELEELQQRLAELEEELEEAQEELEELE 226
                         170
                  ....*....|....*...
gi 601984298  351 EQLSSANHSLQLASQIQK 368
Cdd:COG4717   227 EELEQLENELEAAALEER 244
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
95-460 9.34e-10

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 63.25  E-value: 9.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   95 RWMLCvagAKLKRELDA-------TATVLANRQDESEQSRKRLIEQSREFKKNtpedlrkqvapllksfQGEIDALSKRS 167
Cdd:COG4717    44 RAMLL---ERLEKEADElfkpqgrKPELNLKELKELEEELKEAEEKEEEYAEL----------------QEELEELEEEL 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  168 KEAEAAFLTVYKRLIDVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYnkefaevKNQEVTIKALKEKIREYEQ 247
Cdd:COG4717   105 EELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEEL-------RELEEELEELEAELAELQE 177
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  248 TLKSQAETIALEKEQKLQnDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKtkyDEETTAKADEIEM 327
Cdd:COG4717   178 ELEELLEQLSLATEEELQ-DLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAA---LEERLKEARLLLL 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  328 IMT--------DLERANQRAEVAqrEAETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSLE------------VE 387
Cdd:COG4717   254 IAAallallglGGSLLSLILTIA--GVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEeeeleellaalgLP 331
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  388 LAAKEREIAQLVEDVQRLQASLTKLRE-NSASQISQLEQQLN-----AKNSTLKQLEEKLKGQADYEEVKKELNTLKSM 460
Cdd:COG4717   332 PDLSPEELLELLDRIEELQELLREAEElEEELQLEELEQEIAallaeAGVEDEEELRAALEQAEEYQELKEELEELEEQ 410
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
103-407 9.36e-10

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.80  E-value: 9.36e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTpEDLRKQVAPLLKSfQGEIDALSKRSKEAEAAFLTVYKRLi 182
Cdd:COG1196   249 EELEAELEELEAELAELEAELEELRLELEELELELEEAQ-AEEYELLAELARL-EQDIARLEERRRELEERLEELEEEL- 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 dvpdpvpaLDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEvtiKALKEKIREYEQTLKSQAEtiALEKEQ 262
Cdd:COG1196   326 --------AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEAL---LEAEAELAEAEEELEELAE--ELLEAL 392
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEI-EMIMTDLERANQRAEV 341
Cdd:COG1196   393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEeEEEALLELLAELLEEA 472
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298  342 AQREAETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQA 407
Cdd:COG1196   473 ALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEA 538
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
195-459 2.14e-09

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 62.35  E-value: 2.14e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQAETIALEKeQKLQNDFAE 270
Cdd:TIGR04523  412 EQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQdsvkELIIKNLDNTRESLETQLKVLSRSINKIK-QNLEQKQKE 490
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   271 KERKLQETQMST--TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAET 348
Cdd:TIGR04523  491 LKSKEKELKKLNeeKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEI 570
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   349 lrEQLSSANHSLqLASQIQKAPDVEQAIEvlTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLrensASQISQLEQQLN 428
Cdd:TIGR04523  571 --EELKQTQKSL-KKKQEEKQELIDQKEK--EKKDLIKEIEEKEKKISSLEKELEKAKKENEKL----SSIIKNIKSKKN 641
                          250       260       270
                   ....*....|....*....|....*....|..
gi 601984298   429 AKNSTLKQLEEKLKG-QADYEEVKKELNTLKS 459
Cdd:TIGR04523  642 KLKQEVKQIKETIKEiRNKWPEIIKKIKESKT 673
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
104-406 2.32e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.26  E-value: 2.32e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  104 KLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPL---LKSFQGEIDALSKRSKEAEAAFLTVYKR 180
Cdd:COG1196   217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELeaeLEELRLELEELELELEEAQAEEYELLAE 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  181 LIDVpdpvpaldvGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKA----LKEKIREYEQTLKSQAETI 256
Cdd:COG1196   297 LARL---------EQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEeleeAEEELEEAEAELAEAEEAL 367
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  257 -----ALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmimtd 331
Cdd:COG1196   368 leaeaELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE----- 442
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 601984298  332 lERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVEQAIEVLtRSSLEVELAAKEREiAQLVEDVQRLQ 406
Cdd:COG1196   443 -ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEA-AARLLLLLEAEADY-EGFLEGVKAAL 514
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
121-442 2.41e-09

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 60.31  E-value: 2.41e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  121 DESEQSRKRLIEQSREFKKNtpedlRKQVAPLLKSFQGEIDALSKRSKE--AEAafltvyKRLIDVPDpvpalDVGQQL- 197
Cdd:COG1340    11 EELEEKIEELREEIEELKEK-----RDELNEELKELAEKRDELNAQVKElrEEA------QELREKRD-----ELNEKVk 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  198 EIKVQRLhDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLksQAETIALEKEQKLQNDFAEKERKLQE 277
Cdd:COG1340    75 ELKEERD-ELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQ--QTEVLSPEEEKELVEKIKELEKELEK 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  278 TQmsttsKLEEAEHKLQTLQTALEKTRTELFDLKTKYdEETTAKADEIEMIMTDLeranqraevaQREAETLREQLSSAN 357
Cdd:COG1340   152 AK-----KALEKNEKLKELRAELKELRKEAEEIHKKI-KELAEEAQELHEEMIEL----------YKEADELRKEADELH 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  358 HSLQLASQiqkapdveqaievltrsslevELAAKEREIAQLVEDVQRLQASLTKLRENSASqiSQLEQQLNAKNSTLKQL 437
Cdd:COG1340   216 KEIVEAQE---------------------KADELHEEIIELQKELRELRKELKKLRKKQRA--LKREKEKEELEEKAEEI 272

                  ....*
gi 601984298  438 EEKLK 442
Cdd:COG1340   273 FEKLK 277
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
103-384 2.81e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 62.38  E-value: 2.81e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREF--KKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfltvykr 180
Cdd:TIGR02168  294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELesKLDELAEELAELEEKLEELKEELESLEAELEELEAE------- 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   181 lidvpdpvpaldvgqqleikvqrLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEK 260
Cdd:TIGR02168  367 -----------------------LEELESRLEELEEQLETLRSKVAQLELQ---IASLNNEIERLEARLERLEDRRERLQ 420
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   261 EQKLQNDFAEKERKLQETQMSTTSK---LEEAEHKLQTLQTALEKTRTELFDLKTKYDeETTAKADEIEMIMTDLERANQ 337
Cdd:TIGR02168  421 QEIEELLKKLEEAELKELQAELEELeeeLEELQEELERLEEALEELREELEEAEQALD-AAERELAQLQARLDSLERLQE 499
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 601984298   338 RAEVAQRE-AETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSL 384
Cdd:TIGR02168  500 NLEGFSEGvKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRL 547
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
251-460 4.23e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 60.55  E-value: 4.23e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  251 SQAETIA-LEKE-QKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQTALEKTRTELFDLKtkydeettakaDEIEMI 328
Cdd:COG4942    17 AQADAAAeAEAElEQLQQEIAELEKELAALK----KEEKALLKQLAALERRIAALARRIRALE-----------QELAAL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  329 MTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQK------APDVEQAIEVLT--------RSSLEVELAAKERE 394
Cdd:COG4942    82 EAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPlalllsPEDFLDAVRRLQylkylapaRREQAEELRADLAE 161
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  395 IAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNTLKSM 460
Cdd:COG4942   162 LAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAElAAELAELQQEAEELEAL 228
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
103-513 7.10e-09

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 61.01  E-value: 7.10e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   103 AKLKRELDATATVLANRQDESEqsrkRLIEQSREFKKNTPEDlRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTV---YK 179
Cdd:pfam12128  304 DELNGELSAADAAVAKDRSELE----ALEDQHGAFLDADIET-AAADQEQLPSWQSELENLEERLKALTGKHQDVtakYN 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   180 RLIdvpdpvpaLDVGQQLEIKVQRLHDiETENQKlretlEEYNKEFAEVKNQ-EVTIKALKEKIREYEQTLKSQAETIAL 258
Cdd:pfam12128  379 RRR--------SKIKEQNNRDIAGIKD-KLAKIR-----EARDRQLAVAEDDlQALESELREQLEAGKLEFNEEEYRLKS 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   259 E-KEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAkadeiemimtdLERANQ 337
Cdd:pfam12128  445 RlGELKLRLNQATATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEA-----------LRQASR 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   338 RAEVAQREAETLREQLSSANHSLqLASQIQKAPDVEQAI------EVLTRSSLEVEL----AAKEREIAQLVEDVQRLQA 407
Cdd:pfam12128  514 RLEERQSALDELELQLFPQAGTL-LHFLRKEAPDWEQSIgkvispELLHRTDLDPEVwdgsVGGELNLYGVKLDLKRIDV 592
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   408 -SLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLkGQAdyeevKKELNTLK-SMEFAPSEGAGTQDSTKPL----EVLL 481
Cdd:pfam12128  593 pEWAASEEELRERLDKAEEALQSAREKQAAAEEQL-VQA-----NGELEKASrEETFARTALKNARLDLRRLfdekQSEK 666
                          410       420       430
                   ....*....|....*....|....*....|..
gi 601984298   482 LEKNRSLQSENATLRISNSDLSGSARRKGRDQ 513
Cdd:pfam12128  667 DKKNKALAERKDSANERLNSLEAQLKQLDKKH 698
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
206-445 1.10e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 59.01  E-value: 1.10e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  206 DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIalekeQKLQNDFAEKERKLQETQMsttsK 285
Cdd:COG4942    21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRI-----RALEQELAALEAELAELEK----E 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  286 LEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSsanhslQLASQ 365
Cdd:COG4942    92 IAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLA------ELAAL 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  366 IQkapdvEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQA 445
Cdd:COG4942   166 RA-----ELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
227-459 2.14e-08

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 59.16  E-value: 2.14e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  227 EVKNQEVTIKALKEKIREYEQ------TLKSQAETiaLEKEQKLQNDFAEKERKLQETQ-MSTTSKLEEAEHKLQTLQTA 299
Cdd:COG4913   219 EEPDTFEAADALVEHFDDLERahealeDAREQIEL--LEPIRELAERYAAARERLAELEyLRAALRLWFAQRRLELLEAE 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  300 LEKTRTELFDLKTKydeettakadeiemimtdLERANQRAEVAQREAETLREQLSSANHslqlasqiqkapdveQAIEvl 379
Cdd:COG4913   297 LEELRAELARLEAE------------------LERLEARLDALREELDELEAQIRGNGG---------------DRLE-- 341
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  380 trsSLEVELAAKEREIAQLVEDVQRLQASLTKL-------RENSASQISQLEQQLNAKNSTLKQLEEKL--------KGQ 444
Cdd:COG4913   342 ---QLEREIERLERELEERERRRARLEALLAALglplpasAEEFAALRAEAAALLEALEEELEALEEALaeaeaalrDLR 418
                         250
                  ....*....|....*
gi 601984298  445 ADYEEVKKELNTLKS 459
Cdd:COG4913   419 RELRELEAEIASLER 433
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
216-437 2.16e-08

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 56.07  E-value: 2.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   216 ETLEEYNKEFAEVKN--QEVT------IKALKEKIREyeqtLKSQAEtialEKEQKLQNDFAEKERkLQEtqmsttsKLE 287
Cdd:pfam13851    1 ELMKNHEKAFNEIKNyyNDITrnnlelIKSLKEEIAE----LKKKEE----RNEKLMSEIQQENKR-LTE-------PLQ 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   288 EAEHKLQTLQTAL---EKTRTELfdlktkydEETTAKADEIEMIMTDLERAN----QRAEVAQREAETLREQLSSANHSL 360
Cdd:pfam13851   65 KAQEEVEELRKQLenyEKDKQSL--------KNLKARLKVLEKELKDLKWEHevleQRFEKVERERDELYDKFEAAIQDV 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   361 QlasqiQKAPD----VEQAIEVLTRsslevELAAKEREIAQLvedvqrLQASltKLRENSASQISQ-LEQQLNAKNSTLK 435
Cdd:pfam13851  137 Q-----QKTGLknllLEKKLQALGE-----TLEKKEAQLNEV------LAAA--NLDPDALQAVTEkLEDVLESKNQLIK 198

                   ..
gi 601984298   436 QL 437
Cdd:pfam13851  199 DL 200
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
193-459 2.36e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.92  E-value: 2.36e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  193 VGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTI--------------KALKEKIREYEQTLKSQAETIAL 258
Cdd:PRK03918  198 KEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIeelekeleslegskRKLEEKIRELEERIEELKKEIEE 277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  259 EKEQ------------------KLQNDFAEKERKLQETQMSTT----------SKLEEAEHKLQTLQTALEKTRTELFDL 310
Cdd:PRK03918  278 LEEKvkelkelkekaeeyiklsEFYEEYLDELREIEKRLSRLEeeingieeriKELEEKEERLEELKKKLKELEKRLEEL 357
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  311 KTKYDEETTAKA--------------DEIEMIMTDLERANQRAEVAQREAETLREQLSSANH---SLQLA-SQIQKAP-- 370
Cdd:PRK03918  358 EERHELYEEAKAkkeelerlkkrltgLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKeikELKKAiEELKKAKgk 437
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  371 --------DVEQAIEVLTRSSLEV--------ELAAKEREI---AQLVEDVQRLQASLTKLREnSASQISQLEQQLNAKN 431
Cdd:PRK03918  438 cpvcgrelTEEHRKELLEEYTAELkriekelkEIEEKERKLrkeLRELEKVLKKESELIKLKE-LAEQLKELEEKLKKYN 516
                         330       340
                  ....*....|....*....|....*...
gi 601984298  432 stLKQLEEKLKgqaDYEEVKKELNTLKS 459
Cdd:PRK03918  517 --LEELEKKAE---EYEKLKEKLIKLKG 539
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
274-455 3.06e-08

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 56.09  E-value: 3.06e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  274 KLQETQmsttSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADeiemimTDLERANQRAEVAQREA--ETLRE 351
Cdd:COG1579    11 DLQELD----SELDRLEHRLKELPAELAELEDELAALEARLEAAKTELED------LEKEIKRLELEIEEVEAriKKYEE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  352 QLSSA--NHSLQ-LASQIQKApdvEQAIEVLTRSSLEV--ELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQ 426
Cdd:COG1579    81 QLGNVrnNKEYEaLQKEIESL---KRRISDLEDEILELmeRIEELEEELAELEAELAELEAELEEKKAELDEELAELEAE 157
                         170       180       190
                  ....*....|....*....|....*....|.
gi 601984298  427 LNAKNSTLKQLEEKLKGQ--ADYEEVKKELN 455
Cdd:COG1579   158 LEELEAEREELAAKIPPEllALYERIRKRKN 188
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
116-461 4.30e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 58.15  E-value: 4.30e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  116 LANRQDESEQSRKRLIEQSREF-----KKNTPEDLRKQVAPLlksfqgEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPA 190
Cdd:PRK03918  343 LKKKLKELEKRLEELEERHELYeeakaKKEELERLKKRLTGL------TPEKLEKELEELEKAKEEIEEEISKITARIGE 416
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  191 LD-VGQQLEIKVQRLHDIE----------TENQKLrETLEEYNKEFAEVKNQEVTIKA----LKEKIREYEQTLKSQAET 255
Cdd:PRK03918  417 LKkEIKELKKAIEELKKAKgkcpvcgrelTEEHRK-ELLEEYTAELKRIEKELKEIEEkerkLRKELRELEKVLKKESEL 495
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  256 IALEK--------EQKLQNDFAEK-ERKLQE----------------TQMSTTSKLEEAEHKLQTLQTALEKTRTELFDL 310
Cdd:PRK03918  496 IKLKElaeqlkelEEKLKKYNLEElEKKAEEyeklkekliklkgeikSLKKELEKLEELKKKLAELEKKLDELEEELAEL 575
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  311 KTKYDEETTAKADEIEMIMTDLER----------ANQRAEVAQREAETLREQLSSANHSLQLASqiQKAPDVEQAIEVLT 380
Cdd:PRK03918  576 LKELEELGFESVEELEERLKELEPfyneylelkdAEKELEREEKELKKLEEELDKAFEELAETE--KRLEELRKELEELE 653
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  381 RSSLEVELAAKEREIAQLVEDVQRLQA---SLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTL 457
Cdd:PRK03918  654 KKYSEEEYEELREEYLELSRELAGLRAeleELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKY 733

                  ....
gi 601984298  458 KSME 461
Cdd:PRK03918  734 KALL 737
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
143-335 7.42e-08

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 55.84  E-value: 7.42e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  143 EDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfLTVYKRLIDvpdpvpalDVGQQLEIKVQRLHDI-ETENQKL-RETLEE 220
Cdd:cd22656   113 EEAKKTIKALLDDLLKEAKKYQDKAAKVVDK-LTDFENQTE--------KDQTALETLEKALKDLlTDEGGAIaRKEIKD 183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  221 YNKEFAevKNQEVTIKALKEKIREYEQTLKSQAETIAleKEQKLQNDFaekerKLQETQM-STTSKLEEAEHKLQTLQTA 299
Cdd:cd22656   184 LQKELE--KLNEEYAAKLKAKIDELKALIADDEAKLA--AALRLIADL-----TAADTDLdNLLALIGPAIPALEKLQGA 254
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 601984298  300 LEKTRTELFDLKTKYDEETTaKADEIEMIMTDLERA 335
Cdd:cd22656   255 WQAIATDLDSLKDLLEDDIS-KIPAAILAKLELEKA 289
46 PHA02562
endonuclease subunit; Provisional
236-440 9.03e-08

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 56.95  E-value: 9.03e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  236 KALKEKIREYEQTLKSQAETIALEKEQ-KLQNDFAEKERKL-----QETQMSTTSKLEEAE-HKLQTLQT---------- 298
Cdd:PHA02562  170 KLNKDKIRELNQQIQTLDMKIDHIQQQiKTYNKNIEEQRKKngeniARKQNKYDELVEEAKtIKAEIEELtdellnlvmd 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  299 ------ALEKTRTELFDLKTKYdeETTAKadEIEMI---------MTDLERANQRaevaqreAETLREQLSSANHSL-QL 362
Cdd:PHA02562  250 iedpsaALNKLNTAAAKIKSKI--EQFQK--VIKMYekggvcptcTQQISEGPDR-------ITKIKDKLKELQHSLeKL 318
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  363 ASQIQKAPDVEQAIEVLTRSSLEV--ELAAKEREIAQLVEDVQRLQASLTKL---RENSASQISQLEQQLNAKNSTLKQL 437
Cdd:PHA02562  319 DTAIDELEEIMDEFNEQSKKLLELknKISTNKQSLITLVDKAKKVKAAIEELqaeFVDNAEELAKLQDELDKIVKTKSEL 398

                  ...
gi 601984298  438 EEK 440
Cdd:PHA02562  399 VKE 401
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
110-445 1.43e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 56.59  E-value: 1.43e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  110 DATATVLANRQDE----SEQSRKRLIEQ--SREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRlid 183
Cdd:PRK02224  306 DADAEAVEARREEledrDEELRDRLEECrvAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDR--- 382
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  184 vPDPVPALDvgQQLEIKVQRLHDIETEnqklRETLEEYNKEFAEVKNqevtikALKEKIREYEQTLKSQAETIAlEKEQK 263
Cdd:PRK02224  383 -REEIEELE--EEIEELRERFGDAPVD----LGNAEDFLEELREERD------ELREREAELEATLRTARERVE-EAEAL 448
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  264 LQ-NDFAEKERKLQETQMSTTskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAK--ADEIEMIMTDLERANQRAE 340
Cdd:PRK02224  449 LEaGKCPECGQPVEGSPHVET--IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVeaEDRIERLEERREDLEELIA 526
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  341 VAQREAETLREQLSSANHSLQ-LASQIQKAPDVEQAIEVLTRSSLEvELAAKEREIAQLVEDVQRLQ--ASLTKLRENSA 417
Cdd:PRK02224  527 ERRETIEEKRERAEELRERAAeLEAEAEEKREAAAEAEEEAEEARE-EVAELNSKLAELKERIESLEriRTLLAAIADAE 605
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 601984298  418 SQISQLEQQ--------------LNAKNSTLKQLEEKLKGQA 445
Cdd:PRK02224  606 DEIERLREKrealaelnderrerLAEKRERKRELEAEFDEAR 647
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
204-458 1.57e-07

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 56.37  E-value: 1.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   204 LHDIETENQKLRETLEEYNKEFAEVKN----------QEVTIKALKEKIREYEQTLKSQAETIALEKEQKlqndfaEKER 273
Cdd:pfam10174  201 LDQKEKENIHLREELHRRNQLQPDPAKtkalqtviemKDTKISSLERNIRDLEDEVQMLKTNGLLHTEDR------EEEI 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   274 KLQETQMSTTSKLeeaEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD---EIEMIMTDLERANQRAEVAQREAETLR 350
Cdd:pfam10174  275 KQMEVYKSHSKFM---KNKIDQLKQELSKKESELLALQTKLETLTNQNSDckqHIEVLKESLTAKEQRAAILQTEVDALR 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   351 EQLSSANHSLQ---------------LASQIQKAPDV----EQAIEVLTR--SSLEVELAAKEREIAQLVEDVQRLQASL 409
Cdd:pfam10174  352 LRLEEKESFLNkktkqlqdlteekstLAGEIRDLKDMldvkERKINVLQKkiENLQEQLRDKDKQLAGLKERVKSLQTDS 431
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 601984298   410 TklreNSASQISQLEQQLNAKNSTLKQLEEK--LKGQADYEEV---KKELNTLK 458
Cdd:pfam10174  432 S----NTDTALTTLEEALSEKERIIERLKEQreREDRERLEELeslKKENKDLK 481
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
141-506 2.64e-07

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 55.61  E-value: 2.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   141 TPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFltvykrlidvpdpvpaldvgQQLEIKVQRLHDietENQKLRETLEE 220
Cdd:pfam12128  615 SAREKQAAAEEQLVQANGELEKASREETFARTAL--------------------KNARLDLRRLFD---EKQSEKDKKNK 671
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   221 YNKEFAEVKNQEVT-----IKALKEKIREYEQTLKSQAETIALEKEQKLQNdfAEKERKLQETQMSTTSKLEEAEHKLQt 295
Cdd:pfam12128  672 ALAERKDSANERLNsleaqLKQLDKKHQAWLEEQKEQKREARTEKQAYWQV--VEGALDAQLALLKAAIAARRSGAKAE- 748
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   296 lQTALEKTRTElfDLKTK-YDEETTAK-ADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQlasqIQKApDVE 373
Cdd:pfam12128  749 -LKALETWYKR--DLASLgVDPDVIAKlKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLA----TQLS-NIE 820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   374 QAIEvltrsslevelaakereiaqlvedvqRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEvkkE 453
Cdd:pfam12128  821 RAIS--------------------------ELQQQLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMS---K 871
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298   454 LNTLKsmEFAPSEGAGTQDSTKP--LEVLLLEKNR---SLQSENATLRISNSDLSGSA 506
Cdd:pfam12128  872 LATLK--EDANSEQAQGSIGERLaqLEDLKLKRDYlseSVKKYVEHFKNVIADHSGSG 927
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
195-355 2.70e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 55.69  E-value: 2.70e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERK 274
Cdd:COG4913   274 LEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQLEREIERLERE 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  275 LQETQmsttSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLS 354
Cdd:COG4913   354 LEERE----RRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIA 429

                  .
gi 601984298  355 S 355
Cdd:COG4913   430 S 430
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
202-444 2.89e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 55.69  E-value: 2.89e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  202 QRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALekeQKLQNDFAEKERKLQETQmS 281
Cdd:COG4913   610 AKLAALEAELAELEEELAEAEERLEALEAE---LDALQERREALQRLAEYSWDEIDV---ASAEREIAELEAELERLD-A 682
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  282 TTSKLEEAEHKLQTLQTALEKTRTELFDLKTKY---DEETTAKADEIEMIMTDLERANQRAEVAQRE-AETLREQLSSAN 357
Cdd:COG4913   683 SSDDLAALEEQLEELEAELEELEEELDELKGEIgrlEKELEQAEEELDELQDRLEAAEDLARLELRAlLEERFAAALGDA 762
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  358 HSLQLASQIQKApdveqaievltRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNA-KNSTLKQ 436
Cdd:COG4913   763 VERELRENLEER-----------IDALRARLNRAEEELERAMRAFNREWPAETADLDADLESLPEYLALLDRlEEDGLPE 831

                  ....*...
gi 601984298  437 LEEKLKGQ 444
Cdd:COG4913   832 YEERFKEL 839
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
181-460 3.36e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 55.02  E-value: 3.36e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  181 LIDVPDPVPALDVGQQLEIKVQrlhDIETENQKLR---------ETLEEYNKEFAEVKNQEVTIKALKEKIreyeqTLKS 251
Cdd:COG3206    60 LVEPQSSDVLLSGLSSLSASDS---PLETQIEILKsrpvlervvDKLNLDEDPLGEEASREAAIERLRKNL-----TVEP 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  252 QAETIAL---------EKEQKLQNDFAE------KERKLQETQMSTT---SKLEEAEHKLQTLQTALE--KTRTELFDLk 311
Cdd:COG3206   132 VKGSNVIeisytspdpELAAAVANALAEayleqnLELRREEARKALEfleEQLPELRKELEEAEAALEefRQKNGLVDL- 210
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  312 tkyDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSanhSLQLASQIQKAPDVEQAIEVLtrSSLEVELA-- 389
Cdd:COG3206   211 ---SEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGS---GPDALPELLQSPVIQQLRAQL--AELEAELAel 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  390 ------------AKEREIA----QLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKlkgQADYEEVKKE 453
Cdd:COG3206   283 sarytpnhpdviALRAQIAalraQLQQEAQRILASLEAELEALQAREASLQAQLAQLEARLAELPEL---EAELRRLERE 359

                  ....*..
gi 601984298  454 LNTLKSM 460
Cdd:COG3206   360 VEVAREL 366
COG5022 COG5022
Myosin heavy chain [General function prediction only];
99-495 3.92e-07

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 55.08  E-value: 3.92e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   99 CVAGAKLKRELDATATVLANRQDESEQSRKRlieQSREFKKNTPEDLRKQVAPLLKSFQ------GEIDALSKRSKEAEA 172
Cdd:COG5022   830 KKLRETEEVEFSLKAEVLIQKFGRSLKAKKR---FSLLKKETIYLQSAQRVELAERQLQelkidvKSISSLKLVNLELES 906
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  173 ---------------------AFLTVYKRLIDVPDPVPALDVGQQLEIKVQRLHdieTENQKLRETLEEYNKEFAEVKNQ 231
Cdd:COG5022   907 eiielkkslssdlienlefktELIARLKKLLNNIDLEEGPSIEYVKLPELNKLH---EVESKLKETSEEYEDLLKKSTIL 983
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  232 EVTIKALKEKIREYEQTLKS--------QAETIALEKEQK----LQND----FAEKERKLQETQMS---TTSKLEEAEHK 292
Cdd:COG5022   984 VREGNKANSELKNFKKELAElskqygalQESTKQLKELPVevaeLQSAskiiSSESTELSILKPLQklkGLLLLENNQLQ 1063
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  293 LQTLQTALEKTRTELFDLKTKYDEETTAKADEIEmiMTDLERANQRAEVAQREAETLREQLSSANHSLQ----LASQIQK 368
Cdd:COG5022  1064 ARYKALKLRRENSLLDDKQLYQLESTENLLKTIN--VKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEiskfLSQLVNT 1141
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  369 APDVEQAIEV-------LTRSSLEVELAAKEREIAqLVEDVQRLQASLTKLRENSASQISQLEQQLNAK------NSTLK 435
Cdd:COG5022  1142 LEPVFQKLSVlqleldgLFWEANLEALPSPPPFAA-LSEKRLYQSALYDEKSKLSSSEVNDLKNELIALfskifsGWPRG 1220
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984298  436 QLEEKLKGQA-DYEEVKKELNTLKSMEFAPSEGAGtQDSTKPLEVL-LLEKNRSLQSENATL 495
Cdd:COG5022  1221 DKLKKLISEGwVPTEYSTSLKGFNNLNKKFDTPAS-MSNEKLLSLLnSIDNLLSSYKLEEEV 1281
ClyA_NheA-like cd22654
Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA), and similar proteins; This ...
190-457 5.56e-07

Bacillus cereus non-hemolytic enterotoxin (Nhe) component A (NheA), and similar proteins; This model contains Bacillus cereus tripartite non-hemolytic enterotoxin (Nhe) component A (NheA), a member of the cytolysin A (ClyA) family of alpha pore-forming toxins (alpha-PFTs). Non-hemolytic enterotoxin (Nhe), despite its name, is hemolytic and able to lyse erythrocytes from various mammalian organisms. It consists of three proteins, NheA, NheB and NheC, encoded by one operon containing three genes nheA, nheB and nheC, respectively. Separately, these three proteins show no toxicity; maximal activity is seen only when all three components are presented. The NheB and NheC components are able to bind to cell membranes while NheA is not; NheC primes the host cell for the formation of ion permeable NheB/C pores. Binding of NheA to NheB/NheC is thought to be the final stage of pore formation. Structure of NheA shows an elongated, almost entirely alpha-helical protein with an enlarged "head" domain compared with other cytolysins, displaying on its surface an enlarged beta-tongue which is of amphipathic rather than hydrophobic nature. It has been proposed that NheA could even form beta-barrel pores.


Pssm-ID: 439152 [Multi-domain]  Cd Length: 333  Bit Score: 53.42  E-value: 5.56e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  190 ALDVGQQLEIKVQRLHDIeTENQKL-----RETLEEYNKEFAEVkNQEvtIKALKEKIREYEQTLKSQAETIALEKEQKl 264
Cdd:cd22654    18 ALVILKQPNVKIEAMPSL-TNHQQTakenvREWLDEYNPKLIDL-NQD--MINFSQRFNNYYDKLYDLAGKINEDEQAK- 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  265 qNDFAEKERKLQEtqmsttskleeaehKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANqrAEVAQr 344
Cdd:cd22654    93 -EDFLNGINKLQS--------------QLQTIQNSMEQTSSNLNRFKTLLDADSKNFSTDAKKAIDSLSGSN--GEIAQ- 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  345 eaetLREQLSSANHSLQ----------------------LASQIQKAPDVEQAIEVLTRSSL----------EVELAAKe 392
Cdd:cd22654   155 ----LRTQIKTINDEIQeeltkilnrpievgdgsinigkQVFTITITTATTKTVDVTSIGGLingignasddEVKEAAN- 229
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 601984298  393 rEIAQLVEDVQRLQASLTKLrENSASQISQLEQQLNAKNSTLKqleeklKGQADYEEVKKELNTL 457
Cdd:cd22654   230 -KIQQKQKELVDLIKKLSDA-EIQATQLTLVEDQVNGFTELIK------RQIATLENLVEDWEML 286
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
210-456 6.22e-07

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 54.59  E-value: 6.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   210 ENQKLRETLE---EYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKL 286
Cdd:pfam02463  174 ALKKLIEETEnlaELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   287 EEAEhKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREA--ETLREQLSSANHSLQLAS 364
Cdd:pfam02463  254 ESSK-QEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDeeKLKESEKEKKKAEKELKK 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   365 QIQKAPDVEQAIEVLTRSSLEVELA--AKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLK 442
Cdd:pfam02463  333 EKEEIEELEKELKELEIKREAEEEEeeELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLE 412
                          250
                   ....*....|....
gi 601984298   443 GQADYEEVKKELNT 456
Cdd:pfam02463  413 LARQLEDLLKEEKK 426
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
192-442 7.67e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 54.28  E-value: 7.67e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  192 DVGQQLEIKVQ-----RLHDIETENQKLRETLEEY--NKEFAevknqEVTIKALKEKIREYEQTlksQAETIALEKE-QK 263
Cdd:PRK02224  191 QLKAQIEEKEEkdlheRLNGLESELAELDEEIERYeeQREQA-----RETRDEADEVLEEHEER---REELETLEAEiED 262
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  264 LQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQTALEKTRTE----------LFDLKTKYDEETTAKADEIEMIMTDLE 333
Cdd:PRK02224  263 LRETIAETEREREELA----EEVRDLRERLEELEEERDDLLAEaglddadaeaVEARREELEDRDEELRDRLEECRVAAQ 338
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  334 RANQRAEVAQREAETLREQLSSANhslqlasqiQKAPDVEQAIEvltrsSLEVELAAKEREIAQLVEDVQRLQASLTKL- 412
Cdd:PRK02224  339 AHNEEAESLREDADDLEERAEELR---------EEAAELESELE-----EAREAVEDRREEIEELEEEIEELRERFGDAp 404
                         250       260       270
                  ....*....|....*....|....*....|..
gi 601984298  413 --RENSASQISQLEQQLNAKNSTLKQLEEKLK 442
Cdd:PRK02224  405 vdLGNAEDFLEELREERDELREREAELEATLR 436
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
111-449 7.87e-07

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 53.89  E-value: 7.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  111 ATATVLANRQDESEQSRKRLIEQSREFkkntpEDLRKQVAPLLKsfqgEIDALSKRSKEAEAAFLTVYKRLIDVPDPVpA 190
Cdd:PRK02224  234 ETRDEADEVLEEHEERREELETLEAEI-----EDLRETIAETER----EREELAEEVRDLRERLEELEEERDDLLAEA-G 303
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  191 LDVGQQlEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEqtlksqaetialEKEQKLQNDFAE 270
Cdd:PRK02224  304 LDDADA-EAVEARREELEDRDEELRDRLEECRVAAQAHNEE---AESLREDADDLE------------ERAEELREEAAE 367
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  271 KERKLQETQMSTT---SKLEEAEHKLQTLQTALEKTRTELFDLKTkYDEETTAKADEIEMIMTDLERANQRAEVAQREAE 347
Cdd:PRK02224  368 LESELEEAREAVEdrrEEIEELEEEIEELRERFGDAPVDLGNAED-FLEELREERDELREREAELEATLRTARERVEEAE 446
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  348 TLRE--QLSSANHSLQLASQIQKAPDVEQAIEVLT--RSSLEVELAAKEREIAQLvEDVQRLQASLTKLRENSASQISQL 423
Cdd:PRK02224  447 ALLEagKCPECGQPVEGSPHVETIEEDRERVEELEaeLEDLEEEVEEVEERLERA-EDLVEAEDRIERLEERREDLEELI 525
                         330       340
                  ....*....|....*....|....*.
gi 601984298  424 EQQLNAKNSTLKQLEEKLKGQADYEE 449
Cdd:PRK02224  526 AERRETIEEKRERAEELRERAAELEA 551
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
193-454 9.68e-07

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 53.57  E-value: 9.68e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   193 VGQQLEIKVQRLHDIETENQKLR--------------ETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIAL 258
Cdd:pfam05483  132 VSLKLEEEIQENKDLIKENNATRhlcnllketcarsaEKTKKYEYEREETRQVYMDLNNNIEKMILAFEELRVQAENARL 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   259 EKEQKLQNDFAEKERKLQETQMSTTSK----------LEEAEHKLQTLQTALEKTRTELFDL--KTKYDEE----TTAKA 322
Cdd:pfam05483  212 EMHFKLKEDHEKIQHLEEEYKKEINDKekqvsllliqITEKENKMKDLTFLLEESRDKANQLeeKTKLQDEnlkeLIEKK 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   323 DEIEMIMTDLERANQRAEVAQReaeTLREQLSSANHSL-QLASqiQKAPDVEQAIEVLTRSSLEV-ELAAKEREIAQLVE 400
Cdd:pfam05483  292 DHLTKELEDIKMSLQRSMSTQK---ALEEDLQIATKTIcQLTE--EKEAQMEELNKAKAAHSFVVtEFEATTCSLEELLR 366
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   401 -DVQRLqasltklrENSASQISQLEQQLNAKNSTLKQLEE-KLKGQADYEEVKKEL 454
Cdd:pfam05483  367 tEQQRL--------EKNEDQLKIITMELQKKSSELEEMTKfKNNKEVELEELKKIL 414
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
104-461 9.76e-07

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 53.57  E-value: 9.76e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   104 KLKRELDATATVLANRQDESEQSRKRLIEqsrefkKNTPEDLRKQVAPLLKSFQGeidalskrsKEAEAAFLtvykrlid 183
Cdd:pfam05483  388 KKSSELEEMTKFKNNKEVELEELKKILAE------DEKLLDEKKQFEKIAEELKG---------KEQELIFL-------- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   184 vpdpvpaldvgqqLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtikALKEKIREYEqtLKSQAETIALEKeQK 263
Cdd:pfam05483  445 -------------LQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTE-----LEKEKLKNIE--LTAHCDKLLLEN-KE 503
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   264 LQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTaLEKTRTELFDLKTKYDEETTAKADEIE-----------MIMTDL 332
Cdd:pfam05483  504 LTQEASDMTLELKKHQEDIINCKKQEERMLKQIEN-LEEKEMNLRDELESVREEFIQKGDEVKckldkseenarSIEYEV 582
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   333 ERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVEQAIEV-------LTRSSLEVELAAKEREIAQLVEDVQRl 405
Cdd:pfam05483  583 LKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENkqlnayeIKVNKLELELASAKQKFEEIIDNYQK- 661
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 601984298   406 QASLTKLRENS-----------ASQISQLEQQLNA----KNSTLKQLEEKLKGQAD--YEEVKKELNTLKSME 461
Cdd:pfam05483  662 EIEDKKISEEKlleevekakaiADEAVKLQKEIDKrcqhKIAEMVALMEKHKHQYDkiIEERDSELGLYKNKE 734
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
263-484 1.03e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.98  E-value: 1.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDE---ETTAKADEIEMIMTDLERANQRA 339
Cdd:COG4372    17 GLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQarsELEQLEEELEELNEQLQAAQAEL 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  340 EVAQREAETLREQLSSANHSLQ-LASQIQKAPDVEQAIEVlTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSA- 417
Cdd:COG4372    97 AQAQEELESLQEEAEELQEELEeLQKERQDLEQQRKQLEA-QIAELQSEIAEREEELKELEEQLESLQEELAALEQELQa 175
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984298  418 -------SQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEK 484
Cdd:COG4372   176 lseaeaeQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKE 249
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
103-446 1.10e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.77  E-value: 1.10e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLiEQSREFKKNTPEDLRkqvaplLKSFQGEIDALSKRSKEAEAAFLTVykrli 182
Cdd:COG4913   620 AELEEELAEAEERLEALEAELDALQERR-EALQRLAEYSWDEID------VASAEREIAELEAELERLDASSDDL----- 687
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 dvpdpvpaldvgQQLEikvQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:COG4913   688 ------------AALE---EQLEELEAELEELEEELDELKGEIGRLEKE---LEQAEEELDELQDRLEAAEDLARLELRA 749
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQETQMSttsklEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEiemIMTDLERANQ-RAEV 341
Cdd:COG4913   750 LLEERFAAALGDAVERELR-----ENLEERIDALRARLNRAEEELERAMRAFNREWPAETAD---LDADLESLPEyLALL 821
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  342 AQREAETL-------REQLSSANHS--LQLASQIQKAP-DVEQAIEVLTRSSLEVE--------LAAKEREiaqlVEDVQ 403
Cdd:COG4913   822 DRLEEDGLpeyeerfKELLNENSIEfvADLLSKLRRAIrEIKERIDPLNDSLKRIPfgpgrylrLEARPRP----DPEVR 897
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 601984298  404 RLQASLTKLRENSASQIsqlEQQLNAKNSTLKQLEEKLKGQAD 446
Cdd:COG4913   898 EFRQELRAVTSGASLFD---EELSEARFAALKRLIERLRSEEE 937
PRK11281 PRK11281
mechanosensitive channel MscK;
186-442 1.45e-06

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 53.38  E-value: 1.45e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  186 DPVPALDVGQQLEiKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLK-SQAETIALEKeqkl 264
Cdd:PRK11281   34 DLPTEADVQAQLD-ALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRqAQAELEALKD---- 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  265 QNDFAEKER--KLQETQMSttSKLEEAEHKLQTLQTALEKTRTELFDLKTKydeettakadeiemimtdLERA-NQRAEV 341
Cdd:PRK11281  109 DNDEETRETlsTLSLRQLE--SRLAQTLDQLQNAQNDLAEYNSQLVSLQTQ------------------PERAqAALYAN 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  342 AQREAEtLREQLSSANHSlqlasqiQKAPDVEQaievltRSSLEVELAAKEREIA---QLVEDVQRLQASLTKLRENSAS 418
Cdd:PRK11281  169 SQRLQQ-IRNLLKGGKVG-------GKALRPSQ------RVLLQAEQALLNAQNDlqrKSLEGNTQLQDLLQKQRDYLTA 234
                         250       260       270
                  ....*....|....*....|....*....|.
gi 601984298  419 QISQLEQQL-------NAKNstLKQLEEKLK 442
Cdd:PRK11281  235 RIQRLEHQLqllqeaiNSKR--LTLSEKTVQ 263
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
103-353 1.60e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.14  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTP-----EDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTV 177
Cdd:TIGR02168  736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEelaeaEAEIEELEAQIEQLKEELKALREALDELRAELTLL 815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   178 YKRlidvpdpvpALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFA----EVKNQEVTIKALKEKIREYEQTLKSQA 253
Cdd:TIGR02168  816 NEE---------AANLRERLESLERRIAATERRLEDLEEQIEELSEDIEslaaEIEELEELIEELESELEALLNERASLE 886
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   254 ETIALEKE--QKLQNDFAEKERKLQETQmsttSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakADEIEMIMTD 331
Cdd:TIGR02168  887 EALALLRSelEELSEELRELESKRSELR----RELEELREKLAQLELRLEGLEVRIDNLQERLSEEY---SLTLEEAEAL 959
                          250       260
                   ....*....|....*....|..
gi 601984298   332 LERANQRAEVAQREAETLREQL 353
Cdd:TIGR02168  960 ENKIEDDEEEARRRLKRLENKI 981
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
296-513 1.66e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.21  E-value: 1.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  296 LQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLA-SQIQKAPD-VE 373
Cdd:COG4372     4 LGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQArSELEQLEEeLE 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  374 QAIEVLTRSSLEV-----ELAAKEREIAQLVEDVQRLQASLTKL---RENSASQISQLEQQLNAKNSTLKQLEEKLKG-Q 444
Cdd:COG4372    84 ELNEQLQAAQAELaqaqeELESLQEEAEELQEELEELQKERQDLeqqRKQLEAQIAELQSEIAEREEELKELEEQLESlQ 163
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  445 ADYEEVKKELNTLKsmefapsegagTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGSARRKGRDQ 513
Cdd:COG4372   164 EELAALEQELQALS-----------EAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELL 221
PTZ00121 PTZ00121
MAEBL; Provisional
103-453 2.44e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.84  E-value: 2.44e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKqvAPLLKSFQGEIDALSKRSKEAEAAfltvyKRLI 182
Cdd:PTZ00121 1362 AEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK--ADELKKAAAAKKKADEAKKKAEEK-----KKAD 1434
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRET--LEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETI---- 256
Cdd:PTZ00121 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAkkADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKkade 1514
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  257 ---ALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTElfdlKTKYDEETTAKADEIEMIMTDLE 333
Cdd:PTZ00121 1515 akkAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAE----EAKKAEEDKNMALRKAEEAKKAE 1590
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  334 --RANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSslEVELAAKEREIAQLVEDVQRLQASLTK 411
Cdd:PTZ00121 1591 eaRIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK--EAEEKKKAEELKKAEEENKIKAAEEAK 1668
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|..
gi 601984298  412 LRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKE 453
Cdd:PTZ00121 1669 KAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKK 1710
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
213-504 2.46e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 52.67  E-value: 2.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   213 KLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTL--KSQAETIALEKEQKLQN---DFAEKERKLQETQMSTT---- 283
Cdd:pfam02463  230 DYLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKlaQVLKENKEEEKEKKLQEeelKLLAKEEEELKSELLKLerrk 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   284 ----SKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMImTDLERANQRAEVAQREAETLREQLSSANHS 359
Cdd:pfam02463  310 vddeEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEE-EELEKLQEKLEQLEEELLAKKKLESERLSS 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   360 LQLASQIQKAPDVEQAIEVltrsSLEVELAAKEREIAQLvedvqrlqASLTKLRENSASQISQLEQQLNAKNSTLKQLEE 439
Cdd:pfam02463  389 AAKLKEEELELKSEEEKEA----QLLLELARQLEDLLKE--------EKKEELEILEEEEESIELKQGKLTEEKEELEKQ 456
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 601984298   440 KLKGQADYEEVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSG 504
Cdd:pfam02463  457 ELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVG 521
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
281-628 2.48e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 51.75  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  281 STTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEettaKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL 360
Cdd:COG3883    20 AKQKELSELQAELEAAQAELDALQAELEELNEEYNE----LQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARAL 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  361 Q-------LASQIQKAPDVE------QAIEVLTRSSLEV--ELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQ 425
Cdd:COG3883    96 YrsggsvsYLDVLLGSESFSdfldrlSALSKIADADADLleELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEA 175
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  426 QLNAKNSTLKQLEEKlkgQADYEEVKKEL-NTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSG 504
Cdd:COG3883   176 QQAEQEALLAQLSAE---EAAAEAQLAELeAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAA 252
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  505 SARRKGRDQPESRRPGPLPASPPPQLPRNTGEQVSNTNGTHHFSPAGLSQDFFSSNLASPSLPLASTGKFALNSLLQRQL 584
Cdd:COG3883   253 GAAGAAAGSAGAAGAAAGAAGAGAAAASAAGGGAGGAGGGGGGGGAASGGSGGGSGGAGGVGSGGGAGAVVGGASAGGGG 332
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 601984298  585 MQSFYSKAMQEAGSTSTIFSTGPYSTNSISSPSPLQQSPDVNGM 628
Cdd:COG3883   333 GSGGGGGSSGGGSGGGGGGGGGGGGSSSGGGGGGVGLSVGGGYV 376
PHA03247 PHA03247
large tegument protein UL36; Provisional
843-1026 2.49e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  843 SASPMSTVSTYPPLAISLKKTPAAPETST--AALPSAPALKKEAQDVPTLDPPGSADAAQGVLRPMksELVRGSTWKDPW 920
Cdd:PHA03247  269 PETARGATGPPPPPEAAAPNGAAAPPDGVwgAALAGAPLALPAPPDPPPPAPAGDAEEEDDEDGAM--EVVSPLPRPRQH 346
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  921 WSPIQPERRNLTSSEETKADETTASGKERAGSSQPRAERsqlqgpsasaeywKEWPSAESPYSQSSELSLTGASRSETPQ 1000
Cdd:PHA03247  347 YPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKR-------------RSARHAATPFARGPGGDDQTRPAAPVPA 413
                         170       180
                  ....*....|....*....|....*.
gi 601984298 1001 NSPLPSSPIVPMAKPAKPSVPPLTPE 1026
Cdd:PHA03247  414 SVPTPAPTPVPASAPPPPATPLPSAE 439
46 PHA02562
endonuclease subunit; Provisional
199-412 2.52e-06

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 52.32  E-value: 2.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  199 IKVQRLHdIETENQKLRETLEEYNKEFAEVKNQEVTIKA----LKEKIREYEQTLKSQAETIalekeQKLQNDFAEKERK 274
Cdd:PHA02562  197 IKTYNKN-IEEQRKKNGENIARKQNKYDELVEEAKTIKAeieeLTDELLNLVMDIEDPSAAL-----NKLNTAAAKIKSK 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  275 LQ----ETQMST--------TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakadeiemimtdlERANQRAEvA 342
Cdd:PHA02562  271 IEqfqkVIKMYEkggvcptcTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELE--------------EIMDEFNE-Q 335
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  343 QREAETLREQLSSANHSLQLAsqIQKAPDVEQAIEvltrsSLEVELAAKEREIAQLVEDVQRLQASLTKL 412
Cdd:PHA02562  336 SKKLLELKNKISTNKQSLITL--VDKAKKVKAAIE-----ELQAEFVDNAEELAKLQDELDKIVKTKSEL 398
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
259-515 2.60e-06

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 52.05  E-value: 2.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   259 EKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRtELFDLKTKYDEETTAKADEIEMIMTDLERANQR 338
Cdd:pfam05557   48 DRNQELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLN-EKESQLADAREVISCLKNELSELRRQIQRAELE 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   339 AEVAQREAETLREQL----------SSANHSLQ-----LASQIQKAPDVEQAIEVLTRSSLEVElAAKER--EIAQLVED 401
Cdd:pfam05557  127 LQSTNSELEELQERLdllkakaseaEQLRQNLEkqqssLAEAEQRIKELEFEIQSQEQDSEIVK-NSKSElaRIPELEKE 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   402 VQRLQASLTKLRENSASqISQLEQQLNAKNSTLKQLE----EKLKGQADYEEVKKELNTLKSMefapsegAGTQDSTKPL 477
Cdd:pfam05557  206 LERLREHNKHLNENIEN-KLLLKEEVEDLKRKLEREEkyreEAATLELEKEKLEQELQSWVKL-------AQDTGLNLRS 277
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 601984298   478 EVLLLEKNRSLQSENATLRISNSDLSGSARRKGRDQPE 515
Cdd:pfam05557  278 PEDLSRRIEQLQQREIVLKEENSSLTSSARQLEKARRE 315
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
103-459 2.86e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 52.08  E-value: 2.86e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRkrlIEQSREFKKNTPEDLRKQVAPLlKSFQGEIDALSKRSKEAEAAfLTVYKRLI 182
Cdd:COG4717   112 EELREELEKLEKLLQLLPLYQELEA---LEAELAELPERLEELEERLEEL-RELEEELEELEAELAELQEE-LEELLEQL 186
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNqEVTIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:COG4717   187 SLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLEN-ELEAAALEERLKEARLLLLIAAALLALLGLG 265
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQ-----------ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTD 331
Cdd:COG4717   266 GSLLSLILTIAGVLflvlgllallfLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDR 345
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  332 LERANQ-RAEVAQREAETLREQLSSANHSL----------QLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVE 400
Cdd:COG4717   346 IEELQElLREAEELEEELQLEELEQEIAALlaeagvedeeELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEAL 425
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984298  401 DVQRLQASLTKLRENSAS---QISQLEQQLNAKNSTLKQLEEklkgQADYEEVKKELNTLKS 459
Cdd:COG4717   426 DEEELEEELEELEEELEEleeELEELREELAELEAELEQLEE----DGELAELLQELEELKA 483
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
203-495 3.75e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 52.04  E-value: 3.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   203 RLHDIETENQKLRETLEEYNKEFAE--VKNQEVTIKALKEKI---REYEQTLKSQAETIALEKEQK----------LQND 267
Cdd:pfam15921  246 QLEALKSESQNKIELLLQQHQDRIEqlISEHEVEITGLTEKAssaRSQANSIQSQLEIIQEQARNQnsmymrqlsdLEST 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   268 FAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTElfdlKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAE 347
Cdd:pfam15921  326 VSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQKLLADLHKREKELSLEKEQNK 401
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   348 TLREQlssanhslqlasqiqkapDVEQAIevlTRSSLEVELAAKEREiaqlvedVQRLQASLTKLRENSASQISQLEQQL 427
Cdd:pfam15921  402 RLWDR------------------DTGNSI---TIDHLRRELDDRNME-------VQRLEALLKAMKSECQGQMERQMAAI 453
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984298   428 NAKNSTLKQ---LEEKLKGQADY-EEVKKELnTLKSMEFAPSEgagtqDSTKPLEVLLLEKNRSLQSENATL 495
Cdd:pfam15921  454 QGKNESLEKvssLTAQLESTKEMlRKVVEEL-TAKKMTLESSE-----RTVSDLTASLQEKERAIEATNAEI 519
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
195-461 3.97e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 51.06  E-value: 3.97e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  195 QQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIalekeQKLQNDFAEKERK 274
Cdd:COG4372    31 EQLRKALFELDKLQEELEQLREELEQAREELEQLEEE---LEQARSELEQLEEELEELNEQL-----QAAQAELAQAQEE 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  275 LQETQmSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLS 354
Cdd:COG4372   103 LESLQ-EEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEA 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  355 SANHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTL 434
Cdd:COG4372   182 EQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIE 261
                         250       260
                  ....*....|....*....|....*..
gi 601984298  435 KQLEEKLKGQADYEEVKKELNTLKSME 461
Cdd:COG4372   262 ELELAILVEKDTEEEELEIAALELEAL 288
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
195-459 4.82e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 51.56  E-value: 4.82e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   195 QQLEIKVQRL-HDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIReyeqTLKSQaETIALEKEQKLQND--FAEK 271
Cdd:TIGR04523   36 KQLEKKLKTIkNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIK----DLNDK-LKKNKDKINKLNSDlsKINS 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   272 ERKLQETQMSTT----SKLE----EAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD-EIEMIMTDLERANQraeva 342
Cdd:TIGR04523  111 EIKNDKEQKNKLevelNKLEkqkkENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEElENELNLLEKEKLNI----- 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   343 QREAETLREQLSSANHSL-QLASQIQKAPDVEQAIEVL--TRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQ 419
Cdd:TIGR04523  186 QKNIDKIKNKLLKLELLLsNLKKKIQKNKSLESQISELkkQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKI 265
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 601984298   420 ISQLE---QQLNAKNSTLKQLEEKLKgqadyeEVKKELNTLKS 459
Cdd:TIGR04523  266 KKQLSekqKELEQNNKKIKELEKQLN------QLKSEISDLNN 302
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
197-445 5.03e-06

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 51.38  E-value: 5.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLH--------DIETENQKLRETLEEYNKEFAEVKNQevtikaLKEKIREYEQTLKSQAETIALEKEQ------ 262
Cdd:pfam12128  256 AELRLSHLHfgyksdetLIASRQEERQETSAELNQLLRTLDDQ------WKEKRDELNGELSAADAAVAKDRSElealed 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   263 ---KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakADEIEMIMTDLE-----R 334
Cdd:pfam12128  330 qhgAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKEQN---NRDIAGIKDKLAkireaR 406
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   335 ANQRAE---VAQREAETLREQLSSANHSL-----QLAS-------QIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQL- 398
Cdd:pfam12128  407 DRQLAVaedDLQALESELREQLEAGKLEFneeeyRLKSrlgelklRLNQATATPELLLQLENFDERIERAREEQEAANAe 486
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 601984298   399 VEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQA 445
Cdd:pfam12128  487 VERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQLFPQA 533
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
211-438 6.56e-06

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 50.91  E-value: 6.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   211 NQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQA-ETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEA 289
Cdd:pfam07111   58 SQALSQQAELISRQLQELRRLEEEVRLLRETSLQQKMRLEAQAmELDALAVAEKAGQAEAEGLRAALAGAEMVRKNLEEG 137
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   290 EHK-LQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLE--RANQRAE--VAQREAETLREQLSSANHSLQ--- 361
Cdd:pfam07111  138 SQReLEEIQRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLEtkRAGEAKQlaEAQKEAELLRKQLSKTQEELEaqv 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   362 -LASQIQKAPDvEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQAS--LTKLRENSASQISQL-EQQLNAKNSTLKQL 437
Cdd:pfam07111  218 tLVESLRKYVG-EQVPPEVHSQTWELERQELLDTMQHLQEDRADLQATveLLQVRVQSLTHMLALqEEELTRKIQPSDSL 296

                   .
gi 601984298   438 E 438
Cdd:pfam07111  297 E 297
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
226-472 7.76e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 50.29  E-value: 7.76e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  226 AEVKNQEVTIKALKEKIREYEQTLKSQAETiALEKEQKLQNDFAEKERKLQETQM---STTSKLEEAEHKLQTLQTALEK 302
Cdd:COG4372     6 EKVGKARLSLFGLRPKTGILIAALSEQLRK-ALFELDKLQEELEQLREELEQAREeleQLEEELEQARSELEQLEEELEE 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  303 TRTELfdlkTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVEqaievltRS 382
Cdd:COG4372    85 LNEQL----QAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEE-------LK 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  383 SLEVELAAKEREIAQLVEDVQRLqaSLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLKSMEF 462
Cdd:COG4372   154 ELEEQLESLQEELAALEQELQAL--SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKL 231
                         250
                  ....*....|
gi 601984298  463 APSEGAGTQD 472
Cdd:COG4372   232 GLALSALLDA 241
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
204-501 8.31e-06

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 50.82  E-value: 8.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   204 LHDIETENQKLRETLEEYNKEFAevKNQEVTI--------KALKEKIREYEQTLKSqAETIALEKEQKLQNDFAEKERKL 275
Cdd:TIGR01612  988 LNDYEAKNNELIKYFNDLKANLG--KNKENMLyhqfdekeKATNDIEQKIEDANKN-IPNIEIAIHTSIYNIIDEIEKEI 1064
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   276 -QETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKydEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQls 354
Cdd:TIGR01612 1065 gKNIELLNKEILEEAEINITNFNEIKEKLKHYNFDDFGK--EENIKYADEINKIKDDIKNLDQKIDHHIKALEEIKKK-- 1140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   355 SANHSLQLASQIQKAPDVeqAIEVLTRSSLEvELAAKEREIA-------QLVEDVQRLQASLTKLRENSASqisqLEQQL 427
Cdd:TIGR01612 1141 SENYIDEIKAQINDLEDV--ADKAISNDDPE-EIEKKIENIVtkidkkkNIYDEIKKLLNEIAEIEKDKTS----LEEVK 1213
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984298   428 NAKNSTLKQLEEKLKGQADyEEVKKELNTLKSMEfAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSD 501
Cdd:TIGR01612 1214 GINLSYGKNLGKLFLEKID-EEKKKSEHMIKAME-AYIEDLDEIKEKSPEIENEMGIEMDIKAEMETFNISHDD 1285
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
201-512 8.63e-06

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 50.74  E-value: 8.63e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   201 VQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQM 280
Cdd:TIGR00618  593 TVRLQDLTEKLSEAEDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPK 672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   281 STTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAkadeiemIMTDLERANQRAEVAQREAETLREQLSSANHSL 360
Cdd:TIGR00618  673 ELLASRQLALQKMQSEKEQLTYWKEMLAQCQTLLRELETH-------IEEYDREFNEIENASSSLGSDLAAREDALNQSL 745
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   361 QLASQIQK------APDVEQAIEVLT----------------------RSSLEVELAAKEREIAQLVED----------- 401
Cdd:TIGR00618  746 KELMHQARtvlkarTEAHFNNNEEVTaalqtgaelshlaaeiqffnrlREEDTHLLKTLEAEIGQEIPSdedilnlqcet 825
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   402 -VQRLQASLTKLRENSASQIsQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLKSMEFAPSEGAgtqdSTKPLEVL 480
Cdd:TIGR00618  826 lVQEEEQFLSRLEEKSATLG-EITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQIKIQFDGDA----LIKFLHEI 900
                          330       340       350
                   ....*....|....*....|....*....|..
gi 601984298   481 LLEKNRSLQSENATLRISNSDLSGSARRKGRD 512
Cdd:TIGR00618  901 TLYANVRLANQSEGRFHGRYADSHVNARKYQG 932
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
204-461 1.07e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 50.45  E-value: 1.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  204 LHDIETENQKLRETLEEYNKEFAEVKNQEVtIKALKEKIREYEQTLKSQAETIALEKEQKlqndfAEKERKLQEtqmstt 283
Cdd:PRK03918  141 LESDESREKVVRQILGLDDYENAYKNLGEV-IKEIKRRIERLEKFIKRTENIEELIKEKE-----KELEEVLRE------ 208
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  284 skLEEAEHKLQTLQTALEKTRTELFDLKtKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSsanhslQLA 363
Cdd:PRK03918  209 --INEISSELPELREELEKLEKEVKELE-ELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIE------ELE 279
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  364 SQIQKAPDVEQ-AIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKlrensasQISQLEQqlnaKNSTLKQLEEKLK 442
Cdd:PRK03918  280 EKVKELKELKEkAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEE-------RIKELEE----KEERLEELKKKLK 348
                         250       260
                  ....*....|....*....|...
gi 601984298  443 G-QADYEEVK---KELNTLKSME 461
Cdd:PRK03918  349 ElEKRLEELEerhELYEEAKAKK 371
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
215-495 1.57e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 50.05  E-value: 1.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   215 RETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETialekEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQ 294
Cdd:TIGR00606  237 REIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSR-----KKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQ 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   295 TLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQ---------REAETLREQLSSANHSLQLASQ 365
Cdd:TIGR00606  312 RTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQehirardslIQSLATRLELDGFERGPFSERQ 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   366 IQKAPDVE---QAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLK 442
Cdd:TIGR00606  392 IKNFHTLVierQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSD 471
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   443 G--QADYEEVKKELNTLKSMEFAPSEGAGTQD-STKPLEVLLLEKNRSLQSENATL 495
Cdd:TIGR00606  472 RilELDQELRKAERELSKAEKNSLTETLKKEVkSLQNEKADLDRKLRKLDQEMEQL 527
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
207-439 1.64e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 49.63  E-value: 1.64e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  207 IETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYeqtlKSQAETIALEKEQKLqndFAEKERKLQETQMSTTSKL 286
Cdd:COG3206   166 LELRREEARKALEFLEEQLPELRKE---LEEAEAALEEF----RQKNGLVDLSEEAKL---LLQQLSELESQLAEARAEL 235
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  287 EEAEHKLQTLQTALEKT----------------RTELFDLKTKYDEE-TTAKADEIEMIMTDLERANQRAEVAQREAETL 349
Cdd:COG3206   236 AEAEARLAALRAQLGSGpdalpellqspviqqlRAQLAELEAELAELsARYTPNHPDVIALRAQIAALRAQLQQEAQRIL 315
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  350 REQLSsanhslQLASQIQKAPDVEQAIEVLTRSSLEveLAAKEREIAQLVEDVQRLQASLTKLREnsasQISQLEQQLNA 429
Cdd:COG3206   316 ASLEA------ELEALQAREASLQAQLAQLEARLAE--LPELEAELRRLEREVEVARELYESLLQ----RLEEARLAEAL 383
                         250
                  ....*....|
gi 601984298  430 KNSTLKQLEE 439
Cdd:COG3206   384 TVGNVRVIDP 393
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
154-459 1.74e-05

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 50.05  E-value: 1.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   154 KSFQGEIDAlSKRSKEAEAAFLTVYKRLIDVpdpvpALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEV 233
Cdd:TIGR01612 1496 KGCKDEADK-NAKAIEKNKELFEQYKKDVTE-----LLNKYSALAIKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSEQ 1569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   234 TIKALK-EKIR---EYEQTLKSQAETIALEKE-QKLQNDFAE------------KERKLQETQMSTTS------KLEEAE 290
Cdd:TIGR01612 1570 KIKEIKkEKFRiedDAAKNDKSNKAAIDIQLSlENFENKFLKisdikkkindclKETESIEKKISSFSidsqdtELKENG 1649
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   291 HKLQTLQTALEKTRTE---LFDLKTKYDEETTakadEIEMIMTDLERANQRAEVAQRE-----AETLREQLSSANHSLQl 362
Cdd:TIGR01612 1650 DNLNSLQEFLESLKDQkknIEDKKKELDELDS----EIEKIEIDVDQHKKNYEIGIIEkikeiAIANKEEIESIKELIE- 1724
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   363 asqiqkaPDVEQAIEVLTRSSLE-----VELAAKEREIAQLVEDVQRLQASLTK-LRENSASQISQLE---QQLNAKNST 433
Cdd:TIGR01612 1725 -------PTIENLISSFNTNDLEgidpnEKLEEYNTEIGDIYEEFIELYNIIAGcLETVSKEPITYDEiknTRINAQNEF 1797
                          330       340       350
                   ....*....|....*....|....*....|.
gi 601984298   434 LKQLEEKLKGQA-----DYEEVKKELNTLKS 459
Cdd:TIGR01612 1798 LKIIEIEKKSKSylddiEAKEFDRIINHFKK 1828
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
204-463 1.94e-05

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 49.44  E-value: 1.94e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   204 LHDIETENQKLRETL----EEYNKEFAEVKNQEVTIKALKEKIREYEQTL-KSQAETIALE-KEQKLQNDFAEKERKLQE 277
Cdd:pfam10174  249 IRDLEDEVQMLKTNGllhtEDREEEIKQMEVYKSHSKFMKNKIDQLKQELsKKESELLALQtKLETLTNQNSDCKQHIEV 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   278 TQMSTTSKleeaEHKLQTLQTALEKTRTEL------FDLKTKY----DEETTAKADEIEMIMTDLERANQRAEVAQREAE 347
Cdd:pfam10174  329 LKESLTAK----EQRAAILQTEVDALRLRLeekesfLNKKTKQlqdlTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   348 TLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVE-----DVQRLQ--ASLTKLRENSASQI 420
Cdd:pfam10174  405 NLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEqrereDRERLEelESLKKENKDLKEKV 484
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 601984298   421 SQLEQQLNAKNSTLKQLEEKLKGQADyEEVKKElNTLKSMEFA 463
Cdd:pfam10174  485 SALQPELTEKESSLIDLKEHASSLAS-SGLKKD-SKLKSLEIA 525
mukB PRK04863
chromosome partition protein MukB;
202-455 2.00e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 49.57  E-value: 2.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  202 QRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQtlKSQAETIALEKEQKLQNDFAEKERKLQETQMS 281
Cdd:PRK04863  293 RELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLN--LVQTALRQQEKIERYQADLEELEERLEEQNEV 370
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  282 T---TSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTaKADEIEMIMTDLERANQ--------------RAEVAQR 344
Cdd:PRK04863  371 VeeaDEQQEENEARAEAAEEEVDELKSQLADYQQALDVQQT-RAIQYQQAVQALERAKQlcglpdltadnaedWLEEFQA 449
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  345 EAETLREQLSSANHSLQLASQIQKApdVEQAIEVLTRSSLEVElaakeREIAQlvedvQRLQASLTKLRE--NSASQISQ 422
Cdd:PRK04863  450 KEQEATEELLSLEQKLSVAQAAHSQ--FEQAYQLVRKIAGEVS-----RSEAW-----DVARELLRRLREqrHLAEQLQQ 517
                         250       260       270
                  ....*....|....*....|....*....|...
gi 601984298  423 LEQQLNAknstlkqLEEKLKGQADYEEVKKELN 455
Cdd:PRK04863  518 LRMRLSE-------LEQRLRQQQRAERLLAEFC 543
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
127-440 2.00e-05

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 49.57  E-value: 2.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  127 RKRLIEQSREFKKntpeDLRKQVAPL------LKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVpaldvgQQLEiK 200
Cdd:COG3096   280 RRELSERALELRR----ELFGARRQLaeeqyrLVEMARELEELSARESDLEQDYQAASDHLNLVQTAL------RQQE-K 348
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  201 VQRLH-DIETENQKLRE---TLEEYNKEFAEVKNQ----EVTIKALKEKIREYEQTLKSQaETIALEKEQKLQNdFAEKE 272
Cdd:COG3096   349 IERYQeDLEELTERLEEqeeVVEEAAEQLAEAEARleaaEEEVDSLKSQLADYQQALDVQ-QTRAIQYQQAVQA-LEKAR 426
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  273 RKLQETQMSttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKA--DE----IEMIMTDLER--ANQRAE---- 340
Cdd:COG3096   427 ALCGLPDLT----PENAEDYLAAFRAKEQQATEEVLELEQKLSVADAARRqfEKayelVCKIAGEVERsqAWQTARellr 502
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  341 -------VAQReAETLREQLSSANhslQLASQIQKApdVEQAIEVLTRSSLEVELAAK-EREIAQLVEDVQRLQASLTKL 412
Cdd:COG3096   503 ryrsqqaLAQR-LQQLRAQLAELE---QRLRQQQNA--ERLLEEFCQRIGQQLDAAEElEELLAELEAQLEELEEQAAEA 576
                         330       340
                  ....*....|....*....|....*...
gi 601984298  413 REnsasQISQLEQQLNAKNSTLKQLEEK 440
Cdd:COG3096   577 VE----QRSELRQQLEQLRARIKELAAR 600
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
203-312 2.09e-05

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 48.48  E-value: 2.09e-05
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298    203 RLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAET---IALEKEQKLQNDFA---EKERKLQ 276
Cdd:smart00787  138 RMKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQlkqLEDELEDCDPTELDrakEKLKKLL 217
                            90       100       110
                    ....*....|....*....|....*....|....*.
gi 601984298    277 ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKT 312
Cdd:smart00787  218 QEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT 253
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
202-344 2.67e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 2.67e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  202 QRLHDIETENQKLRETLEEYNKEF----AEVKNQEVTIKALKEKIREYEQTL---KSQAETIALEKE---QKLQNDFAEK 271
Cdd:COG1579    31 AELAELEDELAALEARLEAAKTELedleKEIKRLELEIEEVEARIKKYEEQLgnvRNNKEYEALQKEiesLKRRISDLED 110
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984298  272 E-RKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELfdlktkyDEETTAKADEIEMIMTdlERANQRAEVAQR 344
Cdd:COG1579   111 EiLELMERIEELEEELAELEAELAELEAELEEKKAEL-------DEELAELEAELEELEA--EREELAAKIPPE 175
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
116-490 3.01e-05

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 48.53  E-value: 3.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   116 LANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLlksfQGEIDALSKRSKEAEAafltvYKRLIDV----PDPVPAL 191
Cdd:pfam05622   63 LLQKQLEQLQEENFRLETARDDYRIKCEELEKEVLEL----QHRNEELTSLAEEAQA-----LKDEMDIlresSDKVKKL 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   192 DvgQQLEIKVQRLHDIETENQKLReTLEEYNKEFAEVKNQevtikaLKEKIREYeQTLKSQAETIalekEQKLQndfaEK 271
Cdd:pfam05622  134 E--ATVETYKKKLEDLGDLRRQVK-LLEERNAEYMQRTLQ------LEEELKKA-NALRGQLETY----KRQVQ----EL 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   272 ERKLQEtQMSTTSKLE----EAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTD---------------- 331
Cdd:pfam05622  196 HGKLSE-ESKKADKLEfeykKLEEKLEALQKEKERLIIERDTLRETNEELRCAQLQQAELSQADallspssdpgdnlaae 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   332 ------------LERANQRAEVAQREAE-----TLREQLSSANHSL-----QLASQIQKAPDVEQAIEVLTRSSLEV--- 386
Cdd:pfam05622  275 impaeireklirLQHENKMLRLGQEGSYrerltELQQLLEDANRRKneletQNRLANQRILELQQQVEELQKALQEQgsk 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   387 --ELAAKEREIAQLVEDVQRLQASLTKLRE-----------NSASQISQLEQQLNAKNSTLKQLEEKLKgqadyEEVKKE 453
Cdd:pfam05622  355 aeDSSLLKQKLEEHLEKLHEAQSELQKKKEqieelepkqdsNLAQKIDELQEALRKKDEDMKAMEERYK-----KYVEKA 429
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 601984298   454 LNTLKSMEfaPSEGAGTQDSTKPLEVLLLEKNRSLQS 490
Cdd:pfam05622  430 KSVIKTLD--PKQNPASPPEIQALKNQLLEKDKKIEH 464
PTZ00121 PTZ00121
MAEBL; Provisional
103-331 3.05e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.98  E-value: 3.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEaafltvykrli 182
Cdd:PTZ00121 1589 AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE----------- 1657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 dvPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKsqAETIALEKEQ 262
Cdd:PTZ00121 1658 --ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKK--AEEENKIKAE 1733
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  263 KLQNDFAEKERKLQETQmsttsKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTD 331
Cdd:PTZ00121 1734 EAKKEAEEDKKKAEEAK-----KDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
mukB PRK04863
chromosome partition protein MukB;
157-443 3.76e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 48.80  E-value: 3.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  157 QGEIDALSKRSKEAEAAFLTVYKRLIDVpdpVPALDVGQ----QLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQE 232
Cdd:PRK04863  375 DEQQEENEARAEAAEEEVDELKSQLADY---QQALDVQQtraiQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKE 451
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  233 vtiKALKEKIREYEQTLkSQAETIALEKEQKLQ---------------NDFAEKERKLQETQMSTTSkLEEAEHKLQTLQ 297
Cdd:PRK04863  452 ---QEATEELLSLEQKL-SVAQAAHSQFEQAYQlvrkiagevsrseawDVARELLRRLREQRHLAEQ-LQQLRMRLSELE 526
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  298 TALEKTRTeLFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL-QLASQIQK----APDV 372
Cdd:PRK04863  527 QRLRQQQR-AERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLeQLQARIQRlaarAPAW 605
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  373 EQAIEVLTRSSLEVELAAKERE-IAQLVEDVQRLQASLTKLRENSASQISQLEQQLN-------AKNSTLKQLEEKLKG 443
Cdd:PRK04863  606 LAAQDALARLREQSGEEFEDSQdVTEYMQQLLERERELTVERDELAARKQALDEEIErlsqpggSEDPRLNALAERFGG 684
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
306-458 4.08e-05

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 48.31  E-value: 4.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   306 ELFD-LKTKYDEETTAKADEIEMIMTDLERANQRAEV--AQREAETLREQLSSANHSLQlasQIQKapDVEQAIEVLTRS 382
Cdd:pfam06160   45 EKFEeWRKKWDDIVTKSLPDIEELLFEAEELNDKYRFkkAKKALDEIEELLDDIEEDIK---QILE--ELDELLESEEKN 119
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   383 SLEVElaakereiaQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEkLKGQADYEEVKKELNTLK 458
Cdd:pfam06160  120 REEVE---------ELKDKYRELRKTLLANRFSYGPAIDELEKQLAEIEEEFSQFEE-LTESGDYLEAREVLEKLE 185
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
116-322 4.15e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.47  E-value: 4.15e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  116 LANRQDESEQSRKRLieqsREFKKNT----PEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPAL 191
Cdd:COG3206   184 LPELRKELEEAEAAL----EEFRQKNglvdLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPEL 259
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  192 DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAET---IALEKEQKLQNDF 268
Cdd:COG3206   260 LQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQ---IAALRAQLQQEAQRILASLEAeleALQAREASLQAQL 336
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 601984298  269 AEKERKLQEtqmsttskLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKA 322
Cdd:COG3206   337 AQLEARLAE--------LPELEAELRRLEREVEVARELYESLLQRLEEARLAEA 382
COG5576 COG5576
Homeodomain-containing transcription factor [Transcription];
1338-1423 4.53e-05

Homeodomain-containing transcription factor [Transcription];


Pssm-ID: 227863 [Multi-domain]  Cd Length: 156  Bit Score: 45.51  E-value: 4.53e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298 1338 KKPRVVLAPEEKEALKRAYQQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSRIRRelfieeiqagsQGQAGASDSPSA 1417
Cdd:COG5576    52 KSKRRRTTDEQLMVLEREFEINPYPSSITRIKLSLLLNMPPKSVQIWFQNKRAKEKK-----------KRSGKVEQRPGE 120

                  ....*.
gi 601984298 1418 RSSRAA 1423
Cdd:COG5576   121 EEADLA 126
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
103-458 5.71e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 48.01  E-value: 5.71e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNtpEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAfLTVYKRLI 182
Cdd:COG1196   421 EELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEE--EALLELLAELLEEAALLEAALAELLEELAEA-AARLLLLL 497
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYnkefaEVKNQEVTIKALKEKIREYEQTLksqAETIALEKEQ 262
Cdd:COG1196   498 EAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAY-----EAALEAALAAALQNIVVEDDEVA---AAAIEYLKAA 569
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVA 342
Cdd:COG1196   570 KAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREV 649
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  343 QREAETLREQLSSANHS-------LQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLREN 415
Cdd:COG1196   650 TLEGEGGSAGGSLTGGSrrellaaLLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQ 729
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 601984298  416 SASQISQ-LEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLK 458
Cdd:COG1196   730 LEAEREElLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
212-461 5.75e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 48.19  E-value: 5.75e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   212 QKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIAlekeqKLQNDFAEKERKLQETQMSTTSKLEEAEH 291
Cdd:pfam15921  454 QGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVS-----DLTASLQEKERAIEATNAEITKLRSRVDL 528
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   292 KLQTLQtALEKTRTELFDLKTKYDE---ETTAKADEIEMIMTDLERANQ------------RAEVAQREAETLREQLSSA 356
Cdd:pfam15921  529 KLQELQ-HLKNEGDHLRNVQTECEAlklQMAEKDKVIEILRQQIENMTQlvgqhgrtagamQVEKAQLEKEINDRRLELQ 607
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   357 NHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKER---------EIAQLVEDVQRLQASLTKLRENsasqISQLEQQL 427
Cdd:pfam15921  608 EFKILKDKKDAKIRELEARVSDLELEKVKLVNAGSERlravkdikqERDQLLNEVKTSRNELNSLSED----YEVLKRNF 683
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 601984298   428 NAKNSTLKQLEEKLKGQ--ADYEEVKKELNTLKSME 461
Cdd:pfam15921  684 RNKSEEMETTTNKLKMQlkSAQSELEQTRNTLKSME 719
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
208-449 5.91e-05

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 47.86  E-value: 5.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   208 ETENQKLRETLEEYNKEFAEV--KNQEVTikalKEKIREYEQTlksQAETIALEKEQKLQNDFAEKERKLQETQMSTTSK 285
Cdd:pfam01576   11 EEELQKVKERQQKAESELKELekKHQQLC----EEKNALQEQL---QAETELCAEAEEMRARLAARKQELEEILHELESR 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   286 LEEAEHKLQTLQTALEKTRTELFDLKTKYDEE-------------TTAKADEIEMIMTDLERANQRaevAQREAETLREQ 352
Cdd:pfam01576   84 LEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEeaarqklqlekvtTEAKIKKLEEDILLLEDQNSK---LSKERKLLEER 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   353 LSsaNHSLQLASQIQKA--------------PDVEQAI--EVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLR--- 413
Cdd:pfam01576  161 IS--EFTSNLAEEEEKAkslsklknkheamiSDLEERLkkEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIAELRaql 238
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 601984298   414 ----ENSASQISQLEQQLNAKNSTLKQLEEKLKGQADYEE 449
Cdd:pfam01576  239 akkeEELQAALARLEEETAQKNNALKKIRELEAQISELQE 278
PRK12704 PRK12704
phosphodiesterase; Provisional
198-349 6.39e-05

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 47.47  E-value: 6.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  198 EIKVQRLHDIETENQKLRETLE-EYNKEFAEVKNQEvtikalkEKIREYEQTLKSQAETIALEKEQ---------KLQND 267
Cdd:PRK12704   53 AIKKEALLEAKEEIHKLRNEFEkELRERRNELQKLE-------KRLLQKEENLDRKLELLEKREEElekkekeleQKQQE 125
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  268 FAEKERKLQETQMSTTSKLEEA------EHKLQTLQTALEKTRTELFDLKTKYDEETTAKAD-EIEMIMTDlerANQR-- 338
Cdd:PRK12704  126 LEKKEEELEELIEEQLQELERIsgltaeEAKEILLEKVEEEARHEAAVLIKEIEEEAKEEADkKAKEILAQ---AIQRca 202
                         170
                  ....*....|.
gi 601984298  339 AEVAqreAETL 349
Cdd:PRK12704  203 ADHV---AETT 210
mukB PRK04863
chromosome partition protein MukB;
239-439 8.90e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 47.64  E-value: 8.90e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  239 KEKIREYEQTLKSQAETIA----LEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALektrtelfdlktKY 314
Cdd:PRK04863  279 NERRVHLEEALELRRELYTsrrqLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTAL------------RQ 346
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  315 DEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHS-LQLASQIQkapDVEQAIEVLTRSSLE----VELA 389
Cdd:PRK04863  347 QEKIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEvDELKSQLA---DYQQALDVQQTRAIQyqqaVQAL 423
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 601984298  390 AKEREIAQL----VEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEE 439
Cdd:PRK04863  424 ERAKQLCGLpdltADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQFEQ 477
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
134-453 9.68e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 47.27  E-value: 9.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   134 SREFKKNtpEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDpvpALDVGQQLEIKVQRLHDIETENQK 213
Cdd:TIGR00618  183 LMEFAKK--KSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLRE---ALQQTQQSHAYLTQKREAQEEQLK 257
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   214 LRETLEEYNKEFAEVKNQEVTIKALKEKIreyeqTLKSQAETIALEKEQKLQNDFaEKERKLQETQmSTTSKLEEAEHKL 293
Cdd:TIGR00618  258 KQQLLKQLRARIEELRAQEAVLEETQERI-----NRARKAAPLAAHIKAVTQIEQ-QAQRIHTELQ-SKMRSRAKLLMKR 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   294 QTLQtalektrTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAE-----TLREQLSSANHSLQLASQIQK 368
Cdd:TIGR00618  331 AAHV-------KQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLtqhihTLQQQKTTLTQKLQSLCKELD 403
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   369 APDVEQA---IEVLTRSSLEVELAAKEREI-AQLVEDVQRLQASLTKLRENSASQISQLE--QQLNAKNSTLKQLEEKLK 442
Cdd:TIGR00618  404 ILQREQAtidTRTSAFRDLQGQLAHAKKQQeLQQRYAELCAAAITCTAQCEKLEKIHLQEsaQSLKEREQQLQTKEQIHL 483
                          330
                   ....*....|.
gi 601984298   443 gqaDYEEVKKE 453
Cdd:TIGR00618  484 ---QETRKKAV 491
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
106-502 1.02e-04

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 47.04  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   106 KRELDATATVLANRQDESEQSRKRLIEQSREFKK---------NTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAF-- 174
Cdd:pfam05557   50 NQELQKRIRLLEKREAEAEEALREQAELNRLKKKylealnkklNEKESQLADAREVISCLKNELSELRRQIQRAELELqs 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   175 ----LTVYKRLIDVPDP--VPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKN--QEV----TIKALKEKI 242
Cdd:pfam05557  130 tnseLEELQERLDLLKAkaSEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNskSELaripELEKELERL 209
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   243 REYEQTLKSQAETIALEKEQKlqndfAEKERKLqetqmsttSKLEEAEHKLQTLQTALEKTRTELfdlktkYDEETTAKA 322
Cdd:pfam05557  210 REHNKHLNENIENKLLLKEEV-----EDLKRKL--------EREEKYREEAATLELEKEKLEQEL------QSWVKLAQD 270
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   323 DEIEMIMTDLERAnqRAEVAQREAETLREQLSSANHSlqlASQIQKA-PDVEQAIEVLTRSSLEVELAAKEREiaQLVED 401
Cdd:pfam05557  271 TGLNLRSPEDLSR--RIEQLQQREIVLKEENSSLTSS---ARQLEKArRELEQELAQYLKKIEDLNKKLKRHK--ALVRR 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   402 VQRLQASLTKLRENSASQISQLEQQLNAKNSTlKQLEEKLKGQADY-EEVKKELNTLK-SMEFAPSEGAGTQDSTKPLEV 479
Cdd:pfam05557  344 LQRRVLLLTKERDGYRAILESYDKELTMSNYS-PQLLERIEEAEDMtQKMQAHNEEMEaQLSVAEEELGGYKQQAQTLER 422
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 601984298   480 LLLEKNR---------------SLQSENATLRISNSDL 502
Cdd:pfam05557  423 ELQALRQqesladpsyskeevdSLRRKLETLELERQRL 460
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
196-446 1.13e-04

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 46.82  E-value: 1.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   196 QLEIKVQRLH-DIETENQKLRETLEEYNKEFA-EVKNQEVTIKALKEKIREyeqtLKSQAETIALEKeQKLQNDFAEKER 273
Cdd:pfam15964  364 ELERQKERLEkELASQQEKRAQEKEALRKEMKkEREELGATMLALSQNVAQ----LEAQVEKVTREK-NSLVSQLEEAQK 438
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   274 KLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQR-------EA 346
Cdd:pfam15964  439 QLASQEMDVTKVCGEMRYQLNQTKMKKDEAEKEHREYRTKTGRQLEIKDQEIEKLGLELSESKQRLEQAQQdaarareEC 518
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   347 ETLREQLSSANHSLQLASQ----IQKAPDVEQAIEVLTRSSLEVELAAK-----------ERE-----------IAQLVE 400
Cdd:pfam15964  519 LKLTELLGESEHQLHLTRLekesIQQSFSNEAKAQALQAQQREQELTQKmqqmeaqhdktVNEqyslltsqntfIAKLKE 598
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 601984298   401 DVQRLQASLTKLRENSASQISQLEQQlnakNSTLKQLEEKLKGQAD 446
Cdd:pfam15964  599 ECCTLAKKLEEITQKSRSEVEQLSQE----KEYLQDRLEKLQKRNE 640
PRK10263 PRK10263
DNA translocase FtsK; Provisional
805-1050 1.30e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 1.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  805 ILQQARREMEAQQAALDPALKPAPLSQPdltiltpkhLSASPMSTVSTYPPLAISLKKTPAAPETSTAALPSAPALKKEA 884
Cdd:PRK10263  292 VLFSGNRATQPEYDEYDPLLNGAPITEP---------VAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQP 362
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  885 QDVPTLDPPGSADAAQGvLRPMKSELVRGSTWKDPWWSPIQPERRNLTSSEETKADETTASGKERAGSSQPRAERSQLQG 964
Cdd:PRK10263  363 VPGPQTGEPVIAPAPEG-YPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQP 441
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  965 PSASA-------EYWKEWPSAEsPYSQSSELSLTGASRSETPQNSPLPSSPIVPMAKPAKPSVPPLtpeqyevYMYQEVD 1037
Cdd:PRK10263  442 VAGNAwqaeeqqSTFAPQSTYQ-TEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPL-------YYFEEVE 513
                         250
                  ....*....|...
gi 601984298 1038 tiELTRQVKEKLA 1050
Cdd:PRK10263  514 --EKRAREREQLA 524
PRK12704 PRK12704
phosphodiesterase; Provisional
249-453 1.33e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 46.31  E-value: 1.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  249 LKSQAETIALEKEQKlqndfAEKerklqetqmsttsKLEEAEHKLQTL-QTALEKTRTELFDLKTKYDEETTAKADEIEm 327
Cdd:PRK12704   25 RKKIAEAKIKEAEEE-----AKR-------------ILEEAKKEAEAIkKEALLEAKEEIHKLRNEFEKELRERRNELQ- 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  328 imtDLERANQraevaQREaETLREQLSSANHSlqlasqiqkapdvEQAIEvltrsSLEVELAAKEREIAQLVEDVQRLQA 407
Cdd:PRK12704   86 ---KLEKRLL-----QKE-ENLDRKLELLEKR-------------EEELE-----KKEKELEQKQQELEKKEEELEELIE 138
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 601984298  408 SLTKLRENsasqISQLEQQlNAKNSTLKQLEEKLKGQA-----DYEEVKKE 453
Cdd:PRK12704  139 EQLQELER----ISGLTAE-EAKEILLEKVEEEARHEAavlikEIEEEAKE 184
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
145-366 1.51e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.30  E-value: 1.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  145 LRKQVAPLLKSFQgEIDALSKRSKEAEAAFLTVYKR--LIDVPDPVPALDVGQQLEIKVQRLHDIETENQKLR------- 215
Cdd:COG4717   293 LAREKASLGKEAE-ELQALPALEELEEEELEELLAAlgLPPDLSPEELLELLDRIEELQELLREAEELEEELQleeleqe 371
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  216 --ETLEEYN----KEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIaleKEQKLQNDFAEKERKLQETQMsttsKLEEA 289
Cdd:COG4717   372 iaALLAEAGvedeEELRAALEQAEEYQELKEELEELEEQLEELLGEL---EELLEALDEEELEEELEELEE----ELEEL 444
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  290 EHKLQTLQTALEKTRTELFDLKTkyDEETTAKADEIEMI---MTDLERANQRAEVAQREAETLREQLSSAN--HSLQLAS 364
Cdd:COG4717   445 EEELEELREELAELEAELEQLEE--DGELAELLQELEELkaeLRELAEEWAALKLALELLEEAREEYREERlpPVLERAS 522

                  ..
gi 601984298  365 QI 366
Cdd:COG4717   523 EY 524
PRK01156 PRK01156
chromosome segregation protein; Provisional
139-458 1.67e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 46.43  E-value: 1.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  139 KNTPEDLRKQVAPL------LKSFQGEIDALSK--------------RSKEAEAAFLTVYKRLIDVPDPVPAL----DVG 194
Cdd:PRK01156  172 KDVIDMLRAEISNIdyleekLKSSNLELENIKKqiaddekshsitlkEIERLSIEYNNAMDDYNNLKSALNELssleDMK 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  195 QQLEIKVQ----RLHDIETENQKLRETLEEYNK--------------EFAEVKNQEVT----IKALKEKIREYEQTLKSq 252
Cdd:PRK01156  252 NRYESEIKtaesDLSMELEKNNYYKELEERHMKiindpvyknrnyinDYFKYKNDIENkkqiLSNIDAEINKYHAIIKK- 330
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  253 aetiaLEKEQKLQNDFAEKERKLQETQmSTTSKLEEAEHKLQTLQTALEKtrtelfdLKTKYDEETTAKADEIEMIMTDL 332
Cdd:PRK01156  331 -----LSVLQKDYNDYIKKKSRYDDLN-NQILELEGYEMDYNSYLKSIES-------LKKKIEEYSKNIERMSAFISEIL 397
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  333 ERANQRAEVAQREAETLR---EQLSSANHSLQ--LASQIQKAPDVEQAIEVLT--------------------------- 380
Cdd:PRK01156  398 KIQEIDPDAIKKELNEINvklQDISSKVSSLNqrIRALRENLDELSRNMEMLNgqsvcpvcgttlgeeksnhiinhynek 477
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  381 RSSLEVELAAKEREIAQLVEDVQRLQASLTKLR-------ENSASQISQLEQQLNAKNSTLKQLEEKlkgQADYEEVKKE 453
Cdd:PRK01156  478 KSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLEseeinksINEYNKIESARADLEDIKIKINELKDK---HDKYEEIKNR 554

                  ....*
gi 601984298  454 LNTLK 458
Cdd:PRK01156  555 YKSLK 559
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
226-444 1.80e-04

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 44.98  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   226 AEVKNQEVTIKALKEKIREYEQTLKSQAETialekeqklqNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRT 305
Cdd:pfam12795    3 DELEKAKLDEAAKKKLLQDLQQALSLLDKI----------DASKQRAAAYQKALDDAPAELRELRQELAALQAKAEAAPK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   306 ELFD------LKTKYDEETTAKADEIEmimtDLERANQRAEVAQREAETLREQLSSANHSLQ-LASQIQKAPDVEQAIEV 378
Cdd:pfam12795   73 EILAslsleeLEQRLLQTSAQLQELQN----QLAQLNSQLIELQTRPERAQQQLSEARQRLQqIRNRLNGPAPPGEPLSE 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298   379 LTRSSLEVELAAKEREIAQLvedvQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQ 444
Cdd:pfam12795  149 AQRWALQAELAALKAQIDML----EQELLSNNNRQDLLKARRDLLTLRIQRLEQQLQALQELLNEK 210
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
125-461 2.26e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 46.19  E-value: 2.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   125 QSRKRLIEQSREFKKNTPEDLRKQVAPL------LKSFQGEIDALSKRSKEAEAAFLTVYKrlidvpdpvpaldvgqqLE 198
Cdd:TIGR00606  206 QMELKYLKQYKEKACEIRDQITSKEAQLessreiVKSYENELDPLKNRLKEIEHNLSKIMK-----------------LD 268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   199 IKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEvtIKALKE----KIREYEQTL-KSQAETIALEKEQKLQNdfaeKER 273
Cdd:TIGR00606  269 NEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQ--LNDLYHnhqrTVREKERELvDCQRELEKLNKERRLLN----QEK 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   274 KLQETQMSTTSkLEEAEHKLQTLQTALEK----TRTELFDLKTKYDEETTAK----------ADEIEMIMTDLERANQRA 339
Cdd:TIGR00606  343 TELLVEQGRLQ-LQADRHQEHIRARDSLIqslaTRLELDGFERGPFSERQIKnfhtlvierqEDEAKTAAQLCADLQSKE 421
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   340 EVAQREAETLREQLSSANHSLQLASQIqkapdVEQAIEVLTRSSLEVE-LAAKEREIAQLVEDVQRLQASLTKLRENSAS 418
Cdd:TIGR00606  422 RLKQEQADEIRDEKKGLGRTIELKKEI-----LEKKQEELKFVIKELQqLEGSSDRILELDQELRKAERELSKAEKNSLT 496
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 601984298   419 QiSQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLKSME 461
Cdd:TIGR00606  497 E-TLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQME 538
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
203-518 2.40e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 46.19  E-value: 2.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  203 RLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALkEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQmST 282
Cdd:PTZ00108 1047 RFKDIIKKKSEKITAEEEEGAEEDDEADDEDDEEEL-GAAVSYDYLLSMPIWSLTKEKVEKLNAELEKKEKELEKLK-NT 1124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  283 TSK---LEEaehkLQTLQTALEKTRtelfdlktKYDEETTAKADEIEMIMTDLERANQR-----AEVAQREAETLREQLS 354
Cdd:PTZ00108 1125 TPKdmwLED----LDKFEEALEEQE--------EVEEKEIAKEQRLKSKTKGKASKLRKpklkkKEKKKKKSSADKSKKA 1192
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  355 SANHSLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTL 434
Cdd:PTZ00108 1193 SVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPK 1272
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  435 KQLEEKLKGQADYEEVKKELNTLKSmeFAPSEGAGTQDST-KPLEVLLLEKNRSLQSENATLRISNSD---LSGSARRKG 510
Cdd:PTZ00108 1273 NAPKRVSAVQYSPPPPSKRPDGESN--GGSKPSSPTKKKVkKRLEGSLAALKKKKKSEKKTARKKKSKtrvKQASASQSS 1350

                  ....*...
gi 601984298  511 RDQPESRR 518
Cdd:PTZ00108 1351 RLLRRPRK 1358
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
113-439 2.52e-04

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 45.66  E-value: 2.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   113 ATVLANRQDESEQSRKRLIeQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTvykrlidvpdpvpALD 192
Cdd:pfam07888   29 AELLQNRLEECLQERAELL-QAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQ-------------SRE 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   193 VGQQLEIKVQRLHDIETENQKLRETLEEYNKE-FAEVKNQEVTIKALKEKIREYEQTLKSQAETIalEKEQKLQNDFAEK 271
Cdd:pfam07888   95 KHEELEEKYKELSASSEELSEEKDALLAQRAAhEARIRELEEDIKTLTQRVLERETELERMKERA--KKAGAQRKEEEAE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   272 ERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLK----TKYDEETTA--KADEIEMIMTDLERANQRAEVAQRE 345
Cdd:pfam07888  173 RKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQdtitTLTQKLTTAhrKEAENEALLEELRSLQERLNASERK 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   346 AETLREQLSS-------------------ANHSLQLAsQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQ 406
Cdd:pfam07888  253 VEGLGEELSSmaaqrdrtqaelhqarlqaAQLTLQLA-DASLALREGRARWAQERETLQQSAEADKDRIEKLSAELQRLE 331
                          330       340       350
                   ....*....|....*....|....*....|....
gi 601984298   407 ASLTKLRensaSQISQLEQQL-NAKNSTLKQLEE 439
Cdd:pfam07888  332 ERLQEER----MEREKLEVELgREKDCNRVQLSE 361
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
202-307 2.82e-04

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 43.74  E-value: 2.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   202 QRLHDIETENQKLRETLEE--------------YNKEFAEVKNQEVTIKALKEKIR--EYEQTLKSQaetiALEKEQKLQ 265
Cdd:pfam13851   47 KLMSEIQQENKRLTEPLQKaqeeveelrkqlenYEKDKQSLKNLKARLKVLEKELKdlKWEHEVLEQ----RFEKVERER 122
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 601984298   266 NDFAEK-ERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTEL 307
Cdd:pfam13851  123 DELYDKfEAAIQDVQQKTGLKNLLLEKKLQALGETLEKKEAQL 165
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
197-529 2.93e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.58  E-value: 2.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 LEIKVQRLHDIETENQKLRETLEEynKEfaevknqevtikALKEKIREYEQTLKSQAETIALEKEQkLQNDFAEKERK-- 274
Cdd:pfam10174  333 LTAKEQRAAILQTEVDALRLRLEE--KE------------SFLNKKTKQLQDLTEEKSTLAGEIRD-LKDMLDVKERKin 397
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   275 --------LQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLktkydEETTAKADE-IEMIMTDLERANQ--RAEVAQ 343
Cdd:pfam10174  398 vlqkkienLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALTTL-----EEALSEKERiIERLKEQREREDRerLEELES 472
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   344 --REAETLREQLSSANhsLQLASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIA--QLVEDVQRLQASLTKLRENS--- 416
Cdd:pfam10174  473 lkKENKDLKEKVSALQ--PELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAveQKKEECSKLENQLKKAHNAEeav 550
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   417 ------ASQISQLEQQLNAKNstlkqlEEKLKGQAdyeEVKKELNTLKSMEfapsegagTQDSTKPLEVLLLEKNRSLQS 490
Cdd:pfam10174  551 rtnpeiNDRIRLLEQEVARYK------EESGKAQA---EVERLLGILREVE--------NEKNDKDKKIAELESLTLRQM 613
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 601984298   491 ENATLRISN-SDLSGSARRKGRDQPESRRPGPLPASPPPQ 529
Cdd:pfam10174  614 KEQNKKVANiKHGQQEMKKKGAQLLEEARRREDNLADNSQ 653
PTZ00121 PTZ00121
MAEBL; Provisional
206-468 2.96e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.90  E-value: 2.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  206 DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQT-LKSQAETIALEKEQKLqndfAEKERKLQETQMSTTS 284
Cdd:PTZ00121 1040 DVLKEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDaKEDNRADEATEEAFGK----AEEAKKTETGKAEEAR 1115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  285 KLEEAEHKLQTLQTALEKTRTElfDLKtKYDEETTAKADEIEMIMTDLERAnQRAEVAQReaetlreqlssanhslqlAS 364
Cdd:PTZ00121 1116 KAEEAKKKAEDARKAEEARKAE--DAR-KAEEARKAEDAKRVEIARKAEDA-RKAEEARK------------------AE 1173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  365 QIQKAPDVEQAIEVltRSSLEVELAakereiaqlvEDVQRLQASLTKLRENSASQISQLEQQLNAKnsTLKQLEEKLKGQ 444
Cdd:PTZ00121 1174 DAKKAEAARKAEEV--RKAEELRKA----------EDARKAEAARKAEEERKAEEARKAEDAKKAE--AVKKAEEAKKDA 1239
                         250       260
                  ....*....|....*....|....
gi 601984298  445 ADYEEVKKELNTLKSMEFAPSEGA 468
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEARMA 1263
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
192-459 3.17e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 45.33  E-value: 3.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  192 DVGQQLEIKVQRLHDIETENQKLRETLEEYNKEfaevKNQEVTIKALKEKIREYEQTLKSQAETIALEK-------EQKL 264
Cdd:COG5185    97 KILQEYVNSLIKLPNYEWSADILISLLYLYKSE----IVALKDELIKVEKLDEIADIEASYGEVETGIIkdifgklTQEL 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  265 -QNDFAEKERKLQETQMSTTSKLEEAEH-----------------KLQTLQTALEKTRTE----LFDLKTKYDEETTAKA 322
Cdd:COG5185   173 nQNLKKLEIFGLTLGLLKGISELKKAEPsgtvnsikesetgnlgsESTLLEKAKEIINIEealkGFQDPESELEDLAQTS 252
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  323 DEIEMIMTD-----LERANQRAEVAQR---EAETLREQLSSANHSLQ-LASQIQKAPDVEQAIEVLTRSSLEVELAAKER 393
Cdd:COG5185   253 DKLEKLVEQntdlrLEKLGENAESSKRlneNANNLIKQFENTKEKIAeYTKSIDIKKATESLEEQLAAAEAEQELEESKR 332
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298  394 EIAqlvEDVQRLQASLTKLREnsasqisQLEQQLNAKNSTLKQLEeklkGQADYEEVKKELNTLKS 459
Cdd:COG5185   333 ETE---TGIQNLTAEIEQGQE-------SLTENLEAIKEEIENIV----GEVELSKSSEELDSFKD 384
PTZ00121 PTZ00121
MAEBL; Provisional
104-537 3.48e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.52  E-value: 3.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  104 KLKRELDATATVLANRQDESEQSRKRLIEQSREFK-KNTPEDLRKQVApllksfQGEIDALSKRSKEAEAAfLTVYKRLI 182
Cdd:PTZ00121 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEeKKKADEAKKAEE------KKKADEAKKKAEEAKKA-DEAKKKAE 1325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:PTZ00121 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKK 1405
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KlqndfAEKERKLQETQmsttSKLEEAEHKLQTLQTALE-KTRTE----LFDLKTKYDE-----------ETTAKADEIE 326
Cdd:PTZ00121 1406 K-----ADELKKAAAAK----KKADEAKKKAEEKKKADEaKKKAEeakkADEAKKKAEEakkaeeakkkaEEAKKADEAK 1476
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  327 MIMTDLERANQ---RAEVAQREAETLREQLSSANHSLQL--------------------ASQIQKAPDVEQAIEVltRSS 383
Cdd:PTZ00121 1477 KKAEEAKKADEakkKAEEAKKKADEAKKAAEAKKKADEAkkaeeakkadeakkaeeakkADEAKKAEEKKKADEL--KKA 1554
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  384 LEVELAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQadyEEVKKELNTLKSMEfa 463
Cdd:PTZ00121 1555 EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA---EEAKIKAEELKKAE-- 1629
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 601984298  464 psEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGSARRKGRDQPESRRPGPLPASPPPQLPRNTGEQ 537
Cdd:PTZ00121 1630 --EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
153-355 3.48e-04

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 43.86  E-value: 3.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   153 LKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDpvpALDVGQQ-LEIKVQRLHDIET---ENQKLRETLEeyNKEFAEv 228
Cdd:pfam00261   17 LKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEE---ELERTEErLAEALEKLEEAEKaadESERGRKVLE--NRALKD- 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   229 knqevtikalKEKIREYEQTLKsQAETIALEKEQKLqndfAEKERKLQETQMS---TTSKLEEAEHKLQTLQTALEKTRT 305
Cdd:pfam00261   91 ----------EEKMEILEAQLK-EAKEIAEEADRKY----EEVARKLVVVEGDlerAEERAELAESKIVELEEELKVVGN 155
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 601984298   306 ELFDLKT---KYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSS 355
Cdd:pfam00261  156 NLKSLEAseeKASEREDKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDR 208
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
117-317 3.84e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.44  E-value: 3.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   117 ANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPL---LKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVPDPVPALDv 193
Cdd:TIGR02169  817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIekeIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE- 895
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   194 gQQLEIKVQRLHDIETENQKLRETLEEynkefaevknQEVTIKALKEKIREYEQTLKS----QAETIALEKEQKLQNDFA 269
Cdd:TIGR02169  896 -AQLRELERKIEELEAQIEKKRKRLSE----------LKAKLEALEEELSEIEDPKGEdeeiPEEELSLEDVQAELQRVE 964
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 601984298   270 EKERKLQETQMSTTSKLEEAEHKLQTLQ---TALEKTRTELFDLKTKYDEE 317
Cdd:TIGR02169  965 EEIRALEPVNMLAIQEYEEVLKRLDELKekrAKLEEERKAILERIEEYEKK 1015
iSH2_PIK3R2 cd12926
Inter-Src homology 2 (iSH2) helical domain of Class IA Phosphoinositide 3-kinase Regulatory ...
193-333 3.86e-04

Inter-Src homology 2 (iSH2) helical domain of Class IA Phosphoinositide 3-kinase Regulatory subunit 2, PIK3R2, also called p85beta; PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. They play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation, and apoptosis. They are classified according to their substrate specificity, regulation, and domain structure. Class IA PI3Ks are heterodimers of a p110 catalytic (C) subunit and a p85-related regulatory (R) subunit. The R subunit down-regulates PI3K basal activity, stabilizes the C subunit, and plays a role in the activation downstream of tyrosine kinases. All R subunits contain two SH2 domains that flank an intervening helical domain (iSH2), which binds to the N-terminal adaptor-binding domain (ABD) of the catalytic subunit. p85beta, also called PIK3R2, contains N-terminal SH3 and GAP domains. It is expressed ubiquitously but at lower levels than p85alpha. Its expression is increased in breast and colon cancer, correlates with tumor progression, and enhanced invasion. During viral infection, the viral nonstructural (NS1) protein binds p85beta specifically, which leads to PI3K activation and the promotion of viral replication. Mice deficient with PIK3R2 develop normally and exhibit moderate metabolic and immunological defects.


Pssm-ID: 214019 [Multi-domain]  Cd Length: 161  Bit Score: 42.76  E-value: 3.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  193 VGQQLEIKVQRLHDIETENQKLretLEEYNKEFAEVKNQEVTIKALKEKIREYEQtlksQAETialekEQKLQNDFAEKE 272
Cdd:cd12926     6 VGAQLKVYHQQYQDKSREYDQL---YEEYTRTSQELQMKRTAIEAFNETIKIFEE----QGQT-----QEKCSKEYLERF 73
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  273 RKlqetqmsttsklEEAEHKLQTLQTALEKTR---TELFDLKTKYDEETTAKAD---EIEMIMTDLE 333
Cdd:cd12926    74 RR------------EGNEKEMQRILLNSERLKsriAEIHESRTKLEQDLRAQASdnrEIDKRMNSLK 128
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
108-446 4.02e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 45.17  E-value: 4.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   108 ELDATATVLANRQDESEQSRKRLIEQSREF--KKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVP 185
Cdd:pfam01576  388 ELQAELRTLQQAKQDSEHKRKKLEGQLQELqaRLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLE 467
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   186 DPVPalDVGQQLEIKVQ-------RLHDIETENQKLRETLEEynkEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIAL 258
Cdd:pfam01576  468 SQLQ--DTQELLQEETRqklnlstRLRQLEDERNSLQEQLEE---EEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEA 542
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   259 EKEQKlqndfaekeRKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYD---------EETTAKAD----EI 325
Cdd:pfam01576  543 LEEGK---------KRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDhqrqlvsnlEKKQKKFDqmlaEE 613
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   326 EMIMTDLERANQRAEVAQREAETlreqlssanHSLQLASQIQKAPDVEQAIEVLTRSsleveLAAKEREIAQLVEDVQRL 405
Cdd:pfam01576  614 KAISARYAEERDRAEAEAREKET---------RALSLARALEEALEAKEELERTNKQ-----LRAEMEDLVSSKDDVGKN 679
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 601984298   406 QASLTKLREnsasqisQLEQQLNAKNSTLKQLEEKLKGQAD 446
Cdd:pfam01576  680 VHELERSKR-------ALEQQVEEMKTQLEELEDELQATED 713
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
108-458 4.68e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 45.04  E-value: 4.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   108 ELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLR----------KQVAPLLKSFQGEIDALSKRSKE---AEAAF 174
Cdd:TIGR00606  266 KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNdlyhnhqrtvREKERELVDCQRELEKLNKERRLlnqEKTEL 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   175 LTVYKRLIDVPDPVPALDVGQQLEIKVQRLH------------DIETEN--QKLRETLEEYNKEFAEVKNQEVTIKALKE 240
Cdd:TIGR00606  346 LVEQGRLQLQADRHQEHIRARDSLIQSLATRleldgfergpfsERQIKNfhTLVIERQEDEAKTAAQLCADLQSKERLKQ 425
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   241 K-IREYEQTLKSQAETIALEKE--QKLQNDFAEKERKLQ--ETQMSTTSKLEEAEHK-LQTLQTALEKTRTElfdlkTKY 314
Cdd:TIGR00606  426 EqADEIRDEKKGLGRTIELKKEilEKKQEELKFVIKELQqlEGSSDRILELDQELRKaERELSKAEKNSLTE-----TLK 500
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   315 DEETTAKADEIEMIMTdLERANQRAEVAQREAETLREQLSSANHSLQLASQIQKAPD------VEQAIEVLTRSSLEVEL 388
Cdd:TIGR00606  501 KEVKSLQNEKADLDRK-LRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSrhsdelTSLLGYFPNKKQLEDWL 579
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   389 AAKEREIAQLVEDVQRLQASLTKLRENSasqiSQLEQQLNAKNSTLKQLEEKLKGQADYEEVKKELNTLK 458
Cdd:TIGR00606  580 HSKSKEINQTRDRLAKLNKELASLEQNK----NHINNELESKEEQLSSYEDKLFDVCGSQDEESDLERLK 645
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
214-442 4.73e-04

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 44.30  E-value: 4.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   214 LRETLEEYNKEFAEVknqevtIKALKEKIREYEQTLKSQAETIAL------EKEQKLQNDFAEkERKLQETQMSTTSKLE 287
Cdd:pfam04108   54 LEKVLNELKKDFKQL------LKDLDAALERLEETLDKLRNTPVEpalppgEEKQKTLLDFID-EDSVEILRDALKELID 126
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   288 EAEHKLQTLQTALEKTRTELFDLKtKYDEETTAKADEIEMIMTDLERANQRAEvaqrEAETLREQLssANHsLQLASQIQ 367
Cdd:pfam04108  127 ELQAAQESLDSDLKRFDDDLRDLQ-KELESLSSPSESISLIPTLLKELESLEE----EMASLLESL--TNH-YDQCVTAV 198
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   368 KAPDVEQA--IEVLTRSSLEV-----ELAAKEREIAQLVEDVQRLQASLTKLRENSAS---QISQLEQQLNAKNSTLKQL 437
Cdd:pfam04108  199 KLTEGGRAemLEVLENDARELddvvpELQDRLDEMENNYERLQKLLEQKNSLIDELLSalqLIAEIQSRLPEYLAALKEF 278

                   ....*
gi 601984298   438 EEKLK 442
Cdd:pfam04108  279 EERWE 283
PRK10929 PRK10929
putative mechanosensitive channel protein; Provisional
114-504 4.78e-04

putative mechanosensitive channel protein; Provisional


Pssm-ID: 236798 [Multi-domain]  Cd Length: 1109  Bit Score: 45.04  E-value: 4.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  114 TVLANRQDESEQSRKRLiEQSREFKknTPEDlrkqvAPLLKSFQGEIDALSKRSKEAEAAflTVYKRLIDvpdpvpaldv 193
Cdd:PRK10929   16 GAYAATAPDEKQITQEL-EQAKAAK--TPAQ-----AEIVEALQSALNWLEERKGSLERA--KQYQQVID---------- 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  194 gqqleikvqrlhDIETENQKLRETLEEYNKEFAEVkNQEVTIKALKEKIREYEQTLKSQAETiaLEKEQKLQNDFAEKER 273
Cdd:PRK10929   76 ------------NFPKLSAELRQQLNNERDEPRSV-PPNMSTDALEQEILQVSSQLLEKSRQ--AQQEQDRAREISDSLS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  274 KLQETQMSTTSKLEEAEHKLQTL--------QTALEKTRTELFDLKtkydeettAKADEIEMIMTDL----ERANQRAEV 341
Cdd:PRK10929  141 QLPQQQTEARRQLNEIERRLQTLgtpntplaQAQLTALQAESAALK--------ALVDELELAQLSAnnrqELARLRSEL 212
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  342 AQREAETLREQLSSANHslQLASQIQKapDVEQAIEvltrsslEVELAAKE---------------REIAQ-LVEDVQRL 405
Cdd:PRK10929  213 AKKRSQQLDAYLQALRN--QLNSQRQR--EAERALE-------STELLAEQsgdlpksivaqfkinRELSQaLNQQAQRM 281
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  406 QASLTKLREnSASQISQLEQQLN-----------------AKNSTLKQLEEKLKGQA-DYEEVKKELNTLK---SMEFAP 464
Cdd:PRK10929  282 DLIASQQRQ-AASQTLQVRQALNtlreqsqwlgvsnalgeALRAQVARLPEMPKPQQlDTEMAQLRVQRLRyedLLNKQP 360
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|.
gi 601984298  465 SEGAGTQDSTKPLEVlllEKNRSLQSENATLR-ISNSDLSG 504
Cdd:PRK10929  361 QLRQIRQADGQPLTA---EQNRILDAQLRTQReLLNSLLSG 398
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
237-438 4.90e-04

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 43.98  E-value: 4.90e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   237 ALKEKIREYEQ----TLKSQAETIALEKEQKLQNDFaekerklqETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKT 312
Cdd:pfam09787    4 SAKQELADYKQkaarILQSKEKLIASLKEGSGVEGL--------DSSTALTLELEELRQERDLLREEIQKLRGQIQQLRT 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   313 KydeettakadeiemiMTDLEranqraEVAQREAETLREQLSSANHslQLASQIQKAPDVEQAIEVLTR--SSLEVELaa 390
Cdd:pfam09787   76 E---------------LQELE------AQQQEEAESSREQLQELEE--QLATERSARREAEAELERLQEelRYLEEEL-- 130
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984298   391 kEREIAQLVEDVQRLQASLTKLR------ENSASQISQLEQQLNA------------------KNSTLKQLE 438
Cdd:pfam09787  131 -RRSKATLQSRIKDREAEIEKLRnqltskSQSSSSQSELENRLHQltetliqkqtmlealsteKNSLVLQLE 201
CCDC144C pfam14915
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ...
196-496 5.11e-04

CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ankyrin repeat domain-containing protein 26-like 1 found in eukaryotes. Its function remains unknown, however, it is known to contain a coiled-coil domain which corresponds to this region. The ankyrin repeat which features in this protein is a common amino acid motif.


Pssm-ID: 464371 [Multi-domain]  Cd Length: 304  Bit Score: 43.82  E-value: 5.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   196 QLEIKVQRLhDIET-ENQKLRETlEEYNKEfaevknqevtIKALKEKIREYEQTLKSQAETIA---LEKEQKLQNDFAEK 271
Cdd:pfam14915    5 QDEIAMLRL-EIDTiKNQNQEKE-KKYLED----------IEILKEKNDDLQKTLKLNEETLTktvFQYNGQLNVLKAEN 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   272 ERklqetqmsTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKadeiemimTDLERANQRaevAQREAETLRE 351
Cdd:pfam14915   73 TM--------LNSKLENEKQNKERLETEVESYRSRLAAAIQDHEQSQTSK--------RDLELAFQR---ERDEWLRLQD 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   352 QLSSANHSLQLASQI--QKAPDVEQAIevltrSSLEVEL---AAKEREIAQLVEDVQRlqasltklrensasQISQLEQQ 426
Cdd:pfam14915  134 KMNFDVSNLRDENEIlsQQLSKAESKA-----NSLENELhrtRDALREKTLLLESVQR--------------DLSQAQCQ 194
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   427 lnaknstLKQLEEKLkgQADYEEVKKELNTLKSMEfapsegagtqdstkplevlllEKNRSLQSENATLR 496
Cdd:pfam14915  195 -------KKELEHMY--QNEQDKVNKYIGKQESLE---------------------ERLAQLQSENMLLR 234
PHA03247 PHA03247
large tegument protein UL36; Provisional
826-1026 5.58e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 5.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  826 PAPLSQPDLTILTPKhlSASPMSTVSTYPPLAISLKKTPAAPETSTAALPSAPALKKEAQDVPTLDPPGSADAAQGVLRP 905
Cdd:PHA03247 2754 PARPARPPTTAGPPA--PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  906 MKSELVRGSTWKDPWWSPIQPERRNLTSSEETKADETTASGKERAGSSQPRAERSQLQGPSASAEywkewPSAESPYSQS 985
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-----SFALPPDQPE 2906
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 601984298  986 SELSLTGASRSETPQNSPLPSSPIVPMAKPAKPSvPPLTPE 1026
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-PPLAPT 2946
PTZ00121 PTZ00121
MAEBL; Provisional
118-461 6.01e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 44.75  E-value: 6.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  118 NRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEA----------EAAFLTVYKRLID---- 183
Cdd:PTZ00121 1087 NRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDArkaeearkaeDAKRVEIARKAEDarka 1166
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  184 -----------VPDPVPALDVGQQLEI-KVQRLHDIET----ENQKLRETLEEYN--KEFAEVKNQEVTIKALK-----E 240
Cdd:PTZ00121 1167 eearkaedakkAEAARKAEEVRKAEELrKAEDARKAEAarkaEEERKAEEARKAEdaKKAEAVKKAEEAKKDAEeakkaE 1246
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  241 KIREYEQTLK-----------SQAETIALEKEQKLQNDFAEKERKLQETQMS-TTSKLEEAEHKlqtlqtALEKTRTElf 308
Cdd:PTZ00121 1247 EERNNEEIRKfeearmahfarRQAAIKAEEARKADELKKAEEKKKADEAKKAeEKKKADEAKKK------AEEAKKAD-- 1318
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  309 DLKTKyDEETTAKADEIEmimTDLERANQRAEVAQREAETLREQLSSANHSLQLASqiQKAPDVEQAIEVLTRSSLEV-- 386
Cdd:PTZ00121 1319 EAKKK-AEEAKKKADAAK---KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAE--KKKEEAKKKADAAKKKAEEKkk 1392
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  387 --ELAAKEREIAQLVEDVQRLQASLTKLRE--NSASQISQLEQQlnAKNSTLKQLEEKLKGQAdyEEVKKELNTLKSME 461
Cdd:PTZ00121 1393 adEAKKKAEEDKKKADELKKAAAAKKKADEakKKAEEKKKADEA--KKKAEEAKKADEAKKKA--EEAKKAEEAKKKAE 1467
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
188-461 6.32e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.57  E-value: 6.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   188 VPALDVGQQLEIKVQRLHDIET-ENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQn 266
Cdd:TIGR00618  434 ELQQRYAELCAAAITCTAQCEKlEKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIH- 512
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   267 dfAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDL--KTKYDEETTAKADEIEMIMTDL-ERANQRAEVAQ 343
Cdd:TIGR00618  513 --PNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSErkQRASLKEQMQEIQQSFSILTQCdNRSKEDIPNLQ 590
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   344 REAETLR---EQLSSANHSLQLASQIQKapdvEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQASLTKLREN-SASQ 419
Cdd:TIGR00618  591 NITVRLQdltEKLSEAEDMLACEQHALL----RKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVReHALS 666
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 601984298   420 ISQLEQQLNAKNSTLKQLEEKLKGQADY--EEVKKELNTLKSME 461
Cdd:TIGR00618  667 IRVLPKELLASRQLALQKMQSEKEQLTYwkEMLAQCQTLLRELE 710
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
119-459 6.45e-04

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 44.25  E-value: 6.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   119 RQDESEQSRKRLIEQSREFKKNTPE---DLRKQVAPLLKSFQ------GEIDALSKRSKEAEAAFLTVYKRLIDVPdPVP 189
Cdd:pfam05667  241 RKRKRTKLLKRIAEQLRSAALAGTEatsGASRSAQDLAELLSsfsgssTTDTGLTKGSRFTHTEKLQFTNEAPAAT-SSP 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   190 ALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQ-KLQNDF 268
Cdd:pfam05667  320 PTKVETEEELQQQREEELEELQEQLEDLESSIQELEKEIKKLESSIKQVEEELEELKEQNEELEKQYKVKKKTlDLLPDA 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   269 AEKERKLQetqmsttSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERanqraevaqreaet 348
Cdd:pfam05667  400 EENIAKLQ-------ALVDASAQRLVELAGQWEKHRVPLIEEYRALKEAKSNKEDESQRKLEEIKE-------------- 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   349 LREQLssanhslqlasqiqkapdvEQAIEvltrsslevELAAKEREIAQLVEDVQRLQasltklRENSASQISQ--LEQQ 426
Cdd:pfam05667  459 LREKI-------------------KEVAE---------EAKQKEELYKQLVAEYERLP------KDVSRSAYTRriLEIV 504
                          330       340       350
                   ....*....|....*....|....*....|...
gi 601984298   427 LNAKnstlKQLEEKLKGQADYEEVKKELNTLKS 459
Cdd:pfam05667  505 KNIK----KQKEEITKILSDTKSLQKEINSLTG 533
PRK11281 PRK11281
mechanosensitive channel MscK;
362-515 6.98e-04

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 44.52  E-value: 6.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  362 LASQIQKAPDVEQAIEVLTRSSLEVEL-AAKEREIAQlVED---VQRLQASLTKLrensaSQISQLEQQLNAKNSTLKQL 437
Cdd:PRK11281   19 LLCLSSAFARAASNGDLPTEADVQAQLdALNKQKLLE-AEDklvQQDLEQTLALL-----DKIDRQKEETEQLKQQLAQA 92
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298  438 EEKLKgqadyeEVKKELNTLKSMEFAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSGSarrkgRDQPE 515
Cdd:PRK11281   93 PAKLR------QAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSL-----QTQPE 159
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
212-443 7.27e-04

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 44.56  E-value: 7.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  212 QKLRETLEEYnkefAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKE-QKLQNDFAEKERKLQETQMSTTSKLEEAE 290
Cdd:COG3096   495 QTARELLRRY----RSQQALAQRLQQLRAQLAELEQRLRQQQNAERLLEEfCQRIGQQLDAAEELEELLAELEAQLEELE 570
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  291 HKLQTLQTALEKTRTELFDLKTKYDEETtakadeiemimtdleranQRAEV---AQREAETLREQLSsanhsLQLASqiq 367
Cdd:COG3096   571 EQAAEAVEQRSELRQQLEQLRARIKELA------------------ARAPAwlaAQDALERLREQSG-----EALAD--- 624
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298  368 kAPDVEQAIEvltrsslevELAAKEREIAQLVEDVQRLQASLtklrensASQISQLEQQLNAKNSTLKQLEEKLKG 443
Cdd:COG3096   625 -SQEVTAAMQ---------QLLEREREATVERDELAARKQAL-------ESQIERLSQPGGAEDPRLLALAERLGG 683
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
200-458 7.39e-04

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 44.56  E-value: 7.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  200 KVQRLH-----------------DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:COG3096   810 KLQRLHqafsqfvgghlavafapDPEAELAALRQRRSELERELAQHRAQEQQLRQQLDQLKEQLQLLNKLLPQANLLADE 889
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 KLQNDFAEKERKLQETQMSTTS------KLEEAEHKLQTLQT------ALEKTRTELFDLKTKYDEETTA---------- 320
Cdd:COG3096   890 TLADRLEELREELDAAQEAQAFiqqhgkALAQLEPLVAVLQSdpeqfeQLQADYLQAKEQQRRLKQQIFAlsevvqrrph 969
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  321 --KADEIEMIM--TDL-ERANQRAEVAQREAETLREQLSsanhslQLASQiqkapdVEQAIEVLT--RSSLEV---ELAA 390
Cdd:COG3096   970 fsYEDAVGLLGenSDLnEKLRARLEQAEEARREAREQLR------QAQAQ------YSQYNQVLAslKSSRDAkqqTLQE 1037
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  391 KEREIAQL------------VEDVQRLQASLTKLRensaSQISQLEQQLNAKNSTLKQLEEKL-KGQADYEEVKKELNTL 457
Cdd:COG3096  1038 LEQELEELgvqadaeaeeraRIRRDELHEELSQNR----SRRSQLEKQLTRCEAEMDSLQKRLrKAERDYKQEREQVVQA 1113

                  .
gi 601984298  458 K 458
Cdd:COG3096  1114 K 1114
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
240-492 7.88e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 44.33  E-value: 7.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   240 EKIREYEQTLKSQAEtialEKEQKLQND--FAEKERK-LQETQM---STTSKLEEAEHKLQTLQTALEKTR--TELFDLK 311
Cdd:pfam05483   88 EKIKKWKVSIEAELK----QKENKLQENrkIIEAQRKaIQELQFeneKVSLKLEEEIQENKDLIKENNATRhlCNLLKET 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   312 TKYDEETTAK----ADEIEMIMTDLeraNQRAEVAQREAETLREQlsSANHSLQLASQIQKAPDVEQAIEvltrSSLEVE 387
Cdd:pfam05483  164 CARSAEKTKKyeyeREETRQVYMDL---NNNIEKMILAFEELRVQ--AENARLEMHFKLKEDHEKIQHLE----EEYKKE 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   388 LAAKEREIA----QLVEDVQRLQaSLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNtlKSMEF 462
Cdd:pfam05483  235 INDKEKQVSllliQITEKENKMK-DLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHlTKELEDIKMSLQ--RSMST 311
                          250       260       270
                   ....*....|....*....|....*....|
gi 601984298   463 APSEGAGTQDSTKPLEVLLLEKNRSLQSEN 492
Cdd:pfam05483  312 QKALEEDLQIATKTICQLTEEKEAQMEELN 341
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
103-276 8.07e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 44.29  E-value: 8.07e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFK--KNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVY-- 178
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAelKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINel 404
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   179 KRLIDvpdpvPALDVGQQLEIKVQRLH-DIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIA 257
Cdd:TIGR02169  405 KRELD-----RLQEELQRLSEELADLNaAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYD 479
                          170       180
                   ....*....|....*....|.
gi 601984298   258 -LEKEQ-KLQNDFAEKERKLQ 276
Cdd:TIGR02169  480 rVEKELsKLQRELAEAEAQAR 500
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
113-503 8.51e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 44.27  E-value: 8.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   113 ATVLANRQDESEQSRKRLIEQSREFK--KNTPEDLRKQ---VAPLLKSFQ------GEIDALSKRSKEAEAAFLTVYKRL 181
Cdd:TIGR00606  736 QSIIDLKEKEIPELRNKLQKVNRDIQrlKNDIEEQETLlgtIMPEEESAKvcltdvTIMERFQMELKDVERKIAQQAAKL 815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   182 IDVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEfaevknqevtIKALKEKIREyeqtLKSQAETIALEKE 261
Cdd:TIGR00606  816 QGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQ----------IQHLKSKTNE----LKSEKLQIGTNLQ 881
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   262 QKLQndFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMI----------MTD 331
Cdd:TIGR00606  882 RRQQ--FEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIkekvknihgyMKD 959
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   332 LERANQRA---EVAQREAE--TLREQLSSA-------NHSLQLASQIQKAPDVEQAI--EVLTRSSLEVELAAKEREIAQ 397
Cdd:TIGR00606  960 IENKIQDGkddYLKQKETElnTVNAQLEECekhqekiNEDMRLMRQDIDTQKIQERWlqDNLTLRKRENELKEVEEELKQ 1039
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   398 LvedvqrlqasLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNT-LKSMEFAPSEgagtqdstk 475
Cdd:TIGR00606 1040 H----------LKEMGQMQVLQMKQEHQKLEENIDLIKRNHVLALGrQKGYEKEIKHFKKeLREPQFRDAE--------- 1100
                          410       420
                   ....*....|....*....|....*...
gi 601984298   476 plevlllEKNRSLQSENATLRISNSDLS 503
Cdd:TIGR00606 1101 -------EKYREMMIVMRTTELVNKDLD 1121
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
148-313 8.98e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.60  E-value: 8.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  148 QVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVpdpvpaldvgqQLEIKvqrlhDIETENQKLRETLEEYNKEFAE 227
Cdd:COG1579    21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDL-----------EKEIK-----RLELEIEEVEARIKKYEEQLGN 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  228 VKNQEVtIKALKEKIrEYEQTLKSQAETIALEKEQK---LQNDFAEKERKLQETQmsttsklEEAEHKLQTLQTALEKTR 304
Cdd:COG1579    85 VRNNKE-YEALQKEI-ESLKRRISDLEDEILELMERieeLEEELAELEAELAELE-------AELEEKKAELDEELAELE 155

                  ....*....
gi 601984298  305 TELFDLKTK 313
Cdd:COG1579   156 AELEELEAE 164
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
122-441 9.55e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 43.67  E-value: 9.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  122 ESEQSRKRLIEQSREfkknTPEDLRKQVapLLKSFQ-GE-IDALSKRSKEAEAAFlTVYKRLIDVPDPVPALDVGQQLEI 199
Cdd:PRK04778  133 ESEEKNREEVEQLKD----LYRELRKSL--LANRFSfGPaLDELEKQLENLEEEF-SQFVELTESGDYVEAREILDQLEE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  200 KV------------------------------------------------QRLHDIETENQKLRETLEEYNKEFAEVKNQ 231
Cdd:PRK04778  206 ELaaleqimeeipellkelqtelpdqlqelkagyrelveegyhldhldieKEIQDLKEQIDENLALLEELDLDEAEEKNE 285
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  232 EvtikaLKEKIRE-YEQtlksqaetiaLEKEQKLQNdfaekerklqetqmsttskleEAEHKLQTLQTALEKTRTELFDL 310
Cdd:PRK04778  286 E-----IQERIDQlYDI----------LEREVKARK---------------------YVEKNSDTLPDFLEHAKEQNKEL 329
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  311 KTKydeettakadeiemimtdLERANQRAEVAQREAETLR---EQLSSANHSLQlasQIQKAPDvEQAIevlTRSSLEVE 387
Cdd:PRK04778  330 KEE------------------IDRVKQSYTLNESELESVRqleKQLESLEKQYD---EITERIA-EQEI---AYSELQEE 384
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  388 LAAKEREIAQLVEDVQRLQASLTKLR--ENSASQ-ISQLEQQLnaknSTLKQLEEKL 441
Cdd:PRK04778  385 LEEILKQLEEIEKEQEKLSEMLQGLRkdELEAREkLERYRNKL----HEIKRYLEKS 437
BBC smart00502
B-Box C-terminal domain; Coiled coil region C-terminal to (some) B-Box domains
212-302 9.71e-04

B-Box C-terminal domain; Coiled coil region C-terminal to (some) B-Box domains


Pssm-ID: 128778  Cd Length: 127  Bit Score: 40.71  E-value: 9.71e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298    212 QKLRETLEEYNKEFAEVKNQEVTIKA----LKEKIREYEQTLKSQAETI--ALEK-EQKLQND----FAEKERKLQETQM 280
Cdd:smart00502    3 EALEELLTKLRKKAAELEDALKQLISiiqeVEENAADVEAQIKAAFDELrnALNKrKKQLLEDleeqKENKLKVLEQQLE 82
                            90       100
                    ....*....|....*....|..
gi 601984298    281 STTSKLEEAEHKLQTLQTALEK 302
Cdd:smart00502   83 SLTQKQEKLSHAINFTEEALNS 104
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
384-458 1.02e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 43.75  E-value: 1.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  384 LEVELAAKEREIAQLVEDVQRLQASLTKLRENS-------------------ASQISQLEQQ---LNAKNSTLKQLEEKL 441
Cdd:COG4913   615 LEAELAELEEELAEAEERLEALEAELDALQERRealqrlaeyswdeidvasaEREIAELEAElerLDASSDDLAALEEQL 694
                          90
                  ....*....|....*...
gi 601984298  442 KG-QADYEEVKKELNTLK 458
Cdd:COG4913   695 EElEAELEELEEELDELK 712
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
192-454 1.03e-03

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 43.67  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  192 DVGQQLEIKVQRLHDIETENQKLREtleeynkefAEVKNQEVtIKALKEKIREYEQTLKSQA----ETIA-LEKE-QKLQ 265
Cdd:PRK04778  109 EIESLLDLIEEDIEQILEELQELLE---------SEEKNREE-VEQLKDLYRELRKSLLANRfsfgPALDeLEKQlENLE 178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  266 NDFAEKERKLQETQMSTTSK-LEEAEHKLQTLQTALEKTRTELFDLKTKYDEETtakaDEIEMIMTDLERANQRAEVAQ- 343
Cdd:PRK04778  179 EEFSQFVELTESGDYVEAREiLDQLEEELAALEQIMEEIPELLKELQTELPDQL----QELKAGYRELVEEGYHLDHLDi 254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  344 -REAETLREQLSSAN---HSLQLASQIQKAPDVEQAIEVLTrSSLEVELAAK---EREIAQLVEDVQRLQASLTKLREns 416
Cdd:PRK04778  255 eKEIQDLKEQIDENLallEELDLDEAEEKNEEIQERIDQLY-DILEREVKARkyvEKNSDTLPDFLEHAKEQNKELKE-- 331
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 601984298  417 asQISQLEQ--QLNAKN-STLKQLEEKLKG-QADYEEVKKEL 454
Cdd:PRK04778  332 --EIDRVKQsyTLNESElESVRQLEKQLESlEKQYDEITERI 371
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
212-355 1.04e-03

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 42.95  E-value: 1.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  212 QKLRETLEEYNKE-FAEVKNQEVTIKALKEKIREYE---QTLKSQAETIALEKEQKLQNDFAEKERK-LQETQMSTTSKL 286
Cdd:cd16269   149 EDREKLVEKYRQVpRKGVKAEEVLQEFLQSKEAEAEailQADQALTEKEKEIEAERAKAEAAEQERKlLEEQQRELEQKL 228
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 601984298  287 EEAEHKLQTLQTALEKtrtelfdlktKYDEETTAKADEIEMIMTDLERANQR--AEVAQREAETLREQLSS 355
Cdd:cd16269   229 EDQERSYEEHLRQLKE----------KMEEERENLLKEQERALESKLKEQEAllEEGFKEQAELLQEEIRS 289
Homeobox_KN pfam05920
Homeobox KN domain; This is a homeobox transcription factor KN domain conserved from fungi to ...
1357-1391 1.22e-03

Homeobox KN domain; This is a homeobox transcription factor KN domain conserved from fungi to human and plants. They were first identified as TALE homeobox genes in eukaryotes, (including KNOX and MEIS genes). They have been recently classified.


Pssm-ID: 428673  Cd Length: 39  Bit Score: 37.88  E-value: 1.22e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 601984298  1357 QQKPYPSPKTIEELATQLNLKTSTVINWFHNYRSR 1391
Cdd:pfam05920    5 LHNPYPSEEEKAELAKETGLSRKQISNWFINARRR 39
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
306-458 2.06e-03

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 42.52  E-value: 2.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  306 ELFD-LKTKYDEETTAKADEIEMIMTDLERANQRAEV--AQREAETLREQLSSANhslqlaSQIQkapDVEQAIEVLtrs 382
Cdd:PRK04778   64 EKFEeWRQKWDEIVTNSLPDIEEQLFEAEELNDKFRFrkAKHEINEIESLLDLIE------EDIE---QILEELQEL--- 131
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  383 sleVELAAKER-EIAQLVEDVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEkLKGQADYEEVKKELNTLK 458
Cdd:PRK04778  132 ---LESEEKNReEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQFVE-LTESGDYVEAREILDQLE 204
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
117-362 2.07e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 42.82  E-value: 2.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   117 ANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVpdpvpaldVGQQ 196
Cdd:pfam09731  231 VEEKVEKAQSLAKLVDQYKELVASERIVFQQELVSIFPDIIPVLKEDNLLSNDDLNSLIAHAHREIDQ--------LSKK 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   197 L-EIKVQRLHDIETENQKLRETL----EEYNKEFAEVKNQE-----VTIKALKEKIRE-YEQTLKSQAETIALEKEQKLQ 265
Cdd:pfam09731  303 LaELKKREEKHIERALEKQKEELdklaEELSARLEEVRAADeaqlrLEFEREREEIREsYEEKLRTELERQAEAHEEHLK 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   266 NDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTAlektrtelfdlktkydeETTAKADEIEMIMT---DLERANQRAEVA 342
Cdd:pfam09731  383 DVLVEQEIELQREFLQDIKEKVEEERAGRLLKLN-----------------ELLANLKGLEKATSshsEVEDENRKAQQL 445
                          250       260
                   ....*....|....*....|
gi 601984298   343 QREAETLREQLSSANHSLQL 362
Cdd:pfam09731  446 WLAVEALRSTLEDGSADSRP 465
mukB PRK04863
chromosome partition protein MukB;
116-373 2.48e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 42.64  E-value: 2.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  116 LANRQDESEQSRKRLIEQSREFKKNTPE-DLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLIDVP--------- 185
Cdd:PRK04863  892 LADRVEEIREQLDEAEEAKRFVQQHGNAlAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTevvqrrahf 971
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  186 ---DPVPALDVGQQLEIKV-QRLHDIETENQKLRETLEEYNKEFAEvKNQEVTikALKEKIREYEQTLKsqaetialEKE 261
Cdd:PRK04863  972 syeDAAEMLAKNSDLNEKLrQRLEQAEQERTRAREQLRQAQAQLAQ-YNQVLA--SLKSSYDAKRQMLQ--------ELK 1040
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  262 QKLQN-----DFAEKERKlqetqMSTTSKLEEAEHKLQTLQTALEKTRTELfdlktkydeettakadEIEMimtdlERAN 336
Cdd:PRK04863 1041 QELQDlgvpaDSGAEERA-----RARRDELHARLSANRSRRNQLEKQLTFC----------------EAEM-----DNLT 1094
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 601984298  337 QRAEVAQREAETLREQLSSANHSLQLASQIQKAPDVE 373
Cdd:PRK04863 1095 KKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVE 1131
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
337-466 2.73e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 42.63  E-value: 2.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  337 QRAEVAQREAETLREQLSSANHSLQ------------LASQIQKA--PDVEQAIEVLTRSSLEVElaakeREIAQLVEDV 402
Cdd:COG3096   785 KRLEELRAERDELAEQYAKASFDVQklqrlhqafsqfVGGHLAVAfaPDPEAELAALRQRRSELE-----RELAQHRAQE 859
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  403 QRLQASLTKLRENS------------------ASQISQLEQQLNAKNS----------TLKQLEEKLKG----------- 443
Cdd:COG3096   860 QQLRQQLDQLKEQLqllnkllpqanlladetlADRLEELREELDAAQEaqafiqqhgkALAQLEPLVAVlqsdpeqfeql 939
                         170       180
                  ....*....|....*....|...
gi 601984298  444 QADYEEVKKELNTLKSMEFAPSE 466
Cdd:COG3096   940 QADYLQAKEQQRRLKQQIFALSE 962
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
191-457 2.78e-03

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 42.15  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   191 LDVGQQLEiKVQRLHDI-ETEnqklrETLEEYNKEFAEVKNQEvtIKALKEKIREYEQtlksQAETIALEKEQKLQNDfa 269
Cdd:pfam06160   25 LPVQEELS-KVKKLNLTgETQ-----EKFEEWRKKWDDIVTKS--LPDIEELLFEAEE----LNDKYRFKKAKKALDE-- 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   270 ekerklqetqmsTTSKLEEAEHKLQTLQTAL-------EKTRTELFDLKTKYDEettakadeiemIMTDLEranqraevA 342
Cdd:pfam06160   91 ------------IEELLDDIEEDIKQILEELdelleseEKNREEVEELKDKYRE-----------LRKTLL--------A 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   343 QREA-----ETLREQLSSANHSLQLASQIQKAPDVEQAIEVLtrSSLEVELAAKER---EIAQLVEDVQ-RLQASLTKLR 413
Cdd:pfam06160  140 NRFSygpaiDELEKQLAEIEEEFSQFEELTESGDYLEAREVL--EKLEEETDALEElmeDIPPLYEELKtELPDQLEELK 217
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   414 E---------------NSASQISQLEQQLNAKNSTLKQLEekLKG-QADYEEVKKELNTL 457
Cdd:pfam06160  218 EgyremeeegyalehlNVDKEIQQLEEQLEENLALLENLE--LDEaEEALEEIEERIDQL 275
PRK12704 PRK12704
phosphodiesterase; Provisional
226-395 2.87e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 42.07  E-value: 2.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  226 AEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFaEKERKLQEtqmsttSKLEEAEHKLQTLQTALEKtRT 305
Cdd:PRK12704   31 AKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEF-EKELRERR------NELQKLEKRLLQKEENLDR-KL 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  306 ELFDLKtkyDEETTAKADEIEMIMTDLEraNQRAEVAQREAETLREqlssanhsLQLASQIQKapdvEQAIEVL---TRS 382
Cdd:PRK12704  103 ELLEKR---EEELEKKEKELEQKQQELE--KKEEELEELIEEQLQE--------LERISGLTA----EEAKEILlekVEE 165
                         170
                  ....*....|...
gi 601984298  383 SLEVELAAKEREI 395
Cdd:PRK12704  166 EARHEAAVLIKEI 178
Wtap pfam17098
WTAP/Mum2p family; The Wtap family includes female-lethal(2)D from Drosophila and ...
382-500 3.67e-03

WTAP/Mum2p family; The Wtap family includes female-lethal(2)D from Drosophila and pre-mRNA-splicing regulator WTAP from mammals. The former is required for female-specific splicing of Sex-lethal RNA, and the latter is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. The family also includes the yeast Mum2p protein which is part of the Mis complex.


Pssm-ID: 465345 [Multi-domain]  Cd Length: 155  Bit Score: 39.59  E-value: 3.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   382 SSLEVELAAKEREIAQLVEDVQRLQASLTKlrENSASQISQLEQQLNAKNSTLKQLEEKLKgqADYEEVKKELNTLKsme 461
Cdd:pfam17098    7 NLLLARLAEKEQEIQELKAQLQDLKQSLQP--PSSQLRSLLLDPAVNLEFLRLKKELEEKK--KKLKEAQLELAAWK--- 79
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 601984298   462 FAPSEGAGTQdstkplevlLLEKNRSLQSENATLRISNS 500
Cdd:pfam17098   80 FTPDSTTGKR---------LMAKCRLLQQENEELGRQLS 109
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
191-376 4.18e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.49  E-value: 4.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   191 LDVGQQLEIK-VQRLHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAEtialekeqKLQNDFA 269
Cdd:pfam17078   33 LEIAQQKESKfLENLASLKHENDNLSSMLNRKERRLKDLEDQLSELKNSYEELTESNKQLKKRLE--------NSSASET 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   270 EKERKLQETQMS----TTSKLEEAEH---KLQTLQTALEKTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVA 342
Cdd:pfam17078  105 TLEAELERLQIQydalVDSQNEYKDHyqqEINTLQESLEDLKLENEKQLENYQQRISSNDKDIDTKLDSYNNKFKNLDNI 184
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 601984298   343 QREAET-LREQLSSANHSLQLASQIQKAPDVEQAI 376
Cdd:pfam17078  185 YVNKNNkLLTKLDSLAQLLDLPSWLNLYPESRNKI 219
KpsE COG3524
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
320-446 4.39e-03

Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442746 [Multi-domain]  Cd Length: 370  Bit Score: 41.37  E-value: 4.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  320 AKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSLQLASQIQkapdveqaievlTRSSLEVELAAKEREIAQLv 399
Cdd:COG3524   174 AREDAVRFAEEEVERAEERLRDAREALLAFRNRNGILDPEATAEALLQ------------LIATLEGQLAELEAELAAL- 240
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 601984298  400 edvqrlqasLTKLRENSAsQISQLEQQLNAKNSTLKQLEEKLKGQAD 446
Cdd:COG3524   241 ---------RSYLSPNSP-QVRQLRRRIAALEKQIAAERARLTGASG 277
DUF4201 pfam13870
Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. ...
207-338 4.49e-03

Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. The function is not known.


Pssm-ID: 464008 [Multi-domain]  Cd Length: 177  Bit Score: 39.90  E-value: 4.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   207 IETENQKLRETLEEYNKEFAEVKNQEV----TIKALKEKIREYEQTLKSQAETIAlekeqKLQNDFAEKERKLQETQMsT 282
Cdd:pfam13870   47 LQIENQALNEKIEERNKELKRLKLKVTntvhALTHLKEKLHFLSAELSRLKKELR-----ERQELLAKLRKELYRVKL-E 120
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 601984298   283 TSKLEEAEHKLqtlqtaleKTRTELF---DLKTKYD---EETTAKADEIEMIMTDLERANQR 338
Cdd:pfam13870  121 RDKLRKQNKKL--------RQQGGLLhvpALLHDYDktkAEVEEKRKSVKKLRRKVKILEMR 174
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
219-306 4.70e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 39.10  E-value: 4.70e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298    219 EEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTTSKLEEAEHK-----L 293
Cdd:smart00935   21 KQLEKEFKKRQAE---LEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEelqkiL 97
                            90
                    ....*....|...
gi 601984298    294 QTLQTALEKTRTE 306
Cdd:smart00935   98 DKINKAIKEVAKK 110
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
103-277 5.04e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 40.29  E-value: 5.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFkkntpEDLRKQvaplLKSFQGEIDALSKRSKEAEaafltvyKRLI 182
Cdd:COG1579    20 DRLEHRLKELPAELAELEDELAALEARLEAAKTEL-----EDLEKE----IKRLELEIEEVEARIKKYE-------EQLG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYNKEFAEVKNQevtIKALKEKIREYEQTLKSQAETIAlEKEQ 262
Cdd:COG1579    84 NVRNNKEYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAE---LAELEAELEEKKAELDEELAELE-AELE 159
                         170
                  ....*....|....*
gi 601984298  263 KLQNDFAEKERKLQE 277
Cdd:COG1579   160 ELEAEREELAAKIPP 174
PRK01156 PRK01156
chromosome segregation protein; Provisional
196-504 5.74e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 41.43  E-value: 5.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  196 QLEIKVQRLHD---------------------IETENQKLRE---------------------TLEEYNKEFAEVK---- 229
Cdd:PRK01156  340 DYIKKKSRYDDlnnqilelegyemdynsylksIESLKKKIEEyskniermsafiseilkiqeiDPDAIKKELNEINvklq 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  230 -------NQEVTIKALKEKIREYEQT---LKSQ------AETIALEKEQKLQNDFAEKERKLQETQMST---TSKLEEAE 290
Cdd:PRK01156  420 disskvsSLNQRIRALRENLDELSRNmemLNGQsvcpvcGTTLGEEKSNHIINHYNEKKSRLEEKIREIeieVKDIDEKI 499
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  291 HKLQTLQTALEKTRTElfDLKTKYDEETTAKADeIEMIMTDLERAnqraevaqREAETLREQLSSANHSLQLASQIQKAP 370
Cdd:PRK01156  500 VDLKKRKEYLESEEIN--KSINEYNKIESARAD-LEDIKIKINEL--------KDKHDKYEEIKNRYKSLKLEDLDSKRT 568
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  371 DVEQAIEVltRSSLEVE-LAAKEREIAQLVEDV----QRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKlkgQA 445
Cdd:PRK01156  569 SWLNALAV--ISLIDIEtNRSRSNEIKKQLNDLesrlQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQEN---KI 643
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  446 DYEEVKKELNTLKSmefAPSEGAGTQDSTKPLEVLLLEKNRSLQSENATLRISNSDLSG 504
Cdd:PRK01156  644 LIEKLRGKIDNYKK---QIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDDAKANRAR 699
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
207-442 6.01e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 41.31  E-value: 6.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   207 IETENQKLRETLEEYNKEFAEVKNQEVTIKA-LKEKIREYEQTLKSQAETIAL--EKEQKLQNDFAEKERkLQETQMSTT 283
Cdd:pfam01576  775 LELDLKELEAQIDAANKGREEAVKQLKKLQAqMKDLQRELEEARASRDEILAQskESEKKLKNLEAELLQ-LQEDLAASE 853
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   284 SKLEEAEHKLQTLQTALE---KTRTELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREAETLREQLSSANHSL 360
Cdd:pfam01576  854 RARRQAQQERDELADEIAsgaSGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERSTS 933
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   361 QlasqiqkapDVEQAIEVLTRSSleVELAAKEREIAQLVEdvQRLQASLTKLRensaSQISQLEQQLN-------AKNST 433
Cdd:pfam01576  934 Q---------KSESARQQLERQN--KELKAKLQEMEGTVK--SKFKSSIAALE----AKIAQLEEQLEqesrerqAANKL 996

                   ....*....
gi 601984298   434 LKQLEEKLK 442
Cdd:pfam01576  997 VRRTEKKLK 1005
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
103-459 6.04e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 41.46  E-value: 6.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  103 AKLKRELDATATVLANRQDESEQSRKRLIEQSREFKKNTPEDLRKQVAPLLKSFQGEIDALSKRSKEAEAAFLTVYKRLI 182
Cdd:COG1196   425 ELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  183 DVPDPVPALDVGQQLEIKVQRLHDIETENQKLRETLEEYnKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQ 262
Cdd:COG1196   505 GFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA-LAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIR 583
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  263 K----------------------LQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTKYDEETTA 320
Cdd:COG1196   584 AraalaaalargaigaavdlvasDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLT 663
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  321 KADEIEMIMTDLERANQRAEVAQREAETLREQLSsanhsLQLASQIQKApdvEQAIEVLTRSSLEVELAAKEREIAQLVE 400
Cdd:COG1196   664 GGSRRELLAALLEAEAELEELAERLAEEELELEE-----ALLAEEEEER---ELAEAEEERLEEELEEEALEEQLEAERE 735
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 601984298  401 DVQRLQASLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKG------QA--DYEEVKKELNTLKS 459
Cdd:COG1196   736 ELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEAlgpvnlLAieEYEELEERYDFLSE 802
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
202-363 6.29e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 41.48  E-value: 6.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  202 QRLHDIETENQKLRETLE-------EYNKEFA------EVKNQevTIKALKEKIREYEQTLKSQAETIALEKEQKLQN-- 266
Cdd:COG3096   991 ARLEQAEEARREAREQLRqaqaqysQYNQVLAslkssrDAKQQ--TLQELEQELEELGVQADAEAEERARIRRDELHEel 1068
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  267 -------DFAEKERKLQETQM-STTSKLEEAEHKLQTLQTALEktrtelfdlktkydeetTAKA---DEIEMIM-TDLER 334
Cdd:COG3096  1069 sqnrsrrSQLEKQLTRCEAEMdSLQKRLRKAERDYKQEREQVV-----------------QAKAgwcAVLRLARdNDVER 1131
                         170       180
                  ....*....|....*....|....*....
gi 601984298  335 ANQRAEVAQREAETLREQLSSANHSLQLA 363
Cdd:COG3096  1132 RLHRRELAYLSADELRSMSDKALGALRLA 1160
PRK11637 PRK11637
AmiB activator; Provisional
262-451 6.67e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 40.83  E-value: 6.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  262 QKLQNDFAEKERKLQETQMSTTSKLEEAEHKLQTLQTA---LEKTRTELFDLKTKYDEETTAKAdeiemimtdleranqR 338
Cdd:PRK11637   50 KSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQAsrkLRETQNTLNQLNKQIDELNASIA---------------K 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  339 AEVAQREAET-LREQLSSA----NHS-LQL-----ASQ-----------IQKAPdvEQAIEVLTRSSleVELAAKEREIa 396
Cdd:PRK11637  115 LEQQQAAQERlLAAQLDAAfrqgEHTgLQLilsgeESQrgerilayfgyLNQAR--QETIAELKQTR--EELAAQKAEL- 189
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 601984298  397 qlvEDVQRLQASLtkLRENSASQiSQLEQQLNAKNSTLKQLEEKL-KGQADYEEVK 451
Cdd:PRK11637  190 ---EEKQSQQKTL--LYEQQAQQ-QKLEQARNERKKTLTGLESSLqKDQQQLSELR 239
PRK12704 PRK12704
phosphodiesterase; Provisional
319-456 6.75e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 40.92  E-value: 6.75e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  319 TAKADEIEmimtdlERANQRAEVAQREAETLREQLSsanhsLQLASQIQKApdveqaievltRSSLEVELAAKEREIAQL 398
Cdd:PRK12704   30 EAKIKEAE------EEAKRILEEAKKEAEAIKKEAL-----LEAKEEIHKL-----------RNEFEKELRERRNELQKL 87
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 601984298  399 VEDVQRLQASLtklrENSASQISQLEQQLNAKNSTLKQLEEKLKG-QADYEEVKKELNT 456
Cdd:PRK12704   88 EKRLLQKEENL----DRKLELLEKREEELEKKEKELEQKQQELEKkEEELEELIEEQLQ 142
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
212-390 7.32e-03

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 39.81  E-value: 7.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  212 QKLRETLEEYNKEFAEVKNQEVTIKALKEKIREYEQTLKSQAETiALEK-EQKLQNDFAEKERKLQETQMSTTSKLEEAE 290
Cdd:COG1842    33 RDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARL-ALEKgREDLAREALERKAELEAQAEALEAQLAQLE 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  291 HKLQTLQTALEKTRTELFDLKTKYDE----ETTAKADE----------IEMIMTDLERANQRaeVAQREAEtlreqlssa 356
Cdd:COG1842   112 EQVEKLKEALRQLESKLEELKAKKDTlkarAKAAKAQEkvnealsgidSDDATSALERMEEK--IEEMEAR--------- 180
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 601984298  357 nhsLQLASQIQKAPDVEQAIEVL-TRSSLEVELAA 390
Cdd:COG1842   181 ---AEAAAELAAGDSLDDELAELeADSEVEDELAA 212
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
362-453 7.73e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 41.00  E-value: 7.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  362 LASQIQKAPDVEQAIEVLTRSSLEVELAAKEREIAQLVEDVQRLQAsltklrensasQISQLEQQLNAKNSTLKQLEEKL 441
Cdd:COG2433   382 LEELIEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERLEA-----------EVEELEAELEEKDERIERLEREL 450
                          90
                  ....*....|....
gi 601984298  442 K--GQADYEEVKKE 453
Cdd:COG2433   451 SeaRSEERREIRKD 464
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
204-518 7.87e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 41.36  E-value: 7.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  204 LHDIETENQKLRETLEEYNKEFAEVKNQEVTIKALKEkirEYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQMSTT 283
Cdd:PTZ00440 2338 LDNYQDENYGKDKNIELNNENNSYIIKTKEKINNLKE---EFSKLLKNIKRNNTLCNNNNIKDFISNIGKSVETIKQRFS 2414
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  284 SKLEEAEhklqtlqtalektrtELFDLKTKYDEettakadeIEMIMTDLERANQRAEVAQREAETLREQLSSANHslqla 363
Cdd:PTZ00440 2415 SNLPEKE---------------KLHQIEENLNE--------IKNIMNETKRISNVDAFTNKILQDIDNEKNKENN----- 2466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  364 sqIQKAPDVEQAIEVLTrsSLEVELAAKEREIAQLVEDVQRLQASLTKL----------RENSA---------------S 418
Cdd:PTZ00440 2467 --NMNAEKIDDLIENVT--SHNEKIKSELLIINDALRRVKEKKDEMNKLfnsltennnnNNNSAknivdnstyiineleS 2542
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298  419 QISQLEQQLNAKNSTLKQLEE---KLKGQADYEEVKKELNTLKSMEfapSEGAGTQDSTKPLE---VLLLEKNRSLQSEN 492
Cdd:PTZ00440 2543 HVSKLNELLSYIDNEIKELENeklKLLEKAKIEESRKERERIESET---QEDNTDEEQINRQQqerLQKEEEQKAYSQER 2619
                         330       340
                  ....*....|....*....|....*.
gi 601984298  493 ATLRISNSDLSGSARRKGRDQPESRR 518
Cdd:PTZ00440 2620 LNREVSGTDDTNKNHNTGHDESNYGR 2645
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
200-454 8.61e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 40.88  E-value: 8.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   200 KVQRLHDIETENQKLRETLEEYNKEFaevKNQEVTIKALKEKIREYEQTLKSQAETIALEKEQKLQNDFAEKERKLQETQ 279
Cdd:pfam17380  235 KMERRKESFNLAEDVTTMTPEYTVRY---NGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEKMEQERLRQEKEEKARE 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   280 MSTTSKLEEAEHKLQTL------------QTALEKTRT-ELFDLKTKYDEETTAKADEIEMIMTDLERANQRAEVAQREA 346
Cdd:pfam17380  312 VERRRKLEEAEKARQAEmdrqaaiyaeqeRMAMERERElERIRQEERKRELERIRQEEIAMEISRMRELERLQMERQQKN 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   347 ETLREQLSSANHSLQLASQIQKAPDVEQAIEVLTRSSLEvelAAKEREIAQLVEDVQRLQASLTKLRENSASQISQLEQQ 426
Cdd:pfam17380  392 ERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQE---EARQREVRRLEEERAREMERVRLEEQERQQQVERLRQQ 468
                          250       260
                   ....*....|....*....|....*...
gi 601984298   427 LNAKNSTLKQLEEKLKGQADYEEVKKEL 454
Cdd:pfam17380  469 EEERKRKKLELEKEKRDRKRAEEQRRKI 496
HAP1_N pfam04849
HAP1 N-terminal conserved region; This family represents an N-terminal conserved region found ...
196-313 8.79e-03

HAP1 N-terminal conserved region; This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologs. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34.


Pssm-ID: 461455 [Multi-domain]  Cd Length: 309  Bit Score: 40.01  E-value: 8.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   196 QLEIKVQRLHDIETENQKLRE----------TLEEYN--------KEFAEVKNQevtIKALKEKI-REYEQTLKSQAETI 256
Cdd:pfam04849  165 QLDALQEKLRGLEEENLKLRSeashlktetdTYEEKEqqlmsdcvEQLSEANQQ---MAELSEELaRKMEENLRQQEEIT 241
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298   257 AL-----EKEQKLQNDFAEKERKLQ------ETQMSTTSKLEEAEHKLQTLQTALEKTRTELFDLKTK 313
Cdd:pfam04849  242 SLlaqivDLQHKCKELGIENEELQQhlqaskEAQRQLTSELQELQDRYAECLGMLHEAQEELKELRKK 309
RNase_Y_N pfam12072
RNase Y N-terminal region;
210-358 9.56e-03

RNase Y N-terminal region;


Pssm-ID: 463456 [Multi-domain]  Cd Length: 201  Bit Score: 39.10  E-value: 9.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 601984298   210 ENQKLRETLEEynkefaEVKNQEVTIKALKEKIREYEQTLKSQAETIAlEKEQKLQndfaEKERKLQETQMSTTSKLEEA 289
Cdd:pfam12072   61 EIHKLRAEAER------ELKERRNELQRQERRLLQKEETLDRKDESLE-KKEESLE----KKEKELEAQQQQLEEKEEEL 129
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 601984298   290 EHKLQTLQTALEK----TRTE----LFD-LKTKYDEETTAKADEIEmimtdlERANQRAEVAQREAETLREQLSSANH 358
Cdd:pfam12072  130 EELIEEQRQELERisglTSEEakeiLLDeVEEELRHEAAVMIKEIE------EEAKEEADKKAKEIIALAIQRCAADH 201
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH