|
Name |
Accession |
Description |
Interval |
E-value |
| SH3_MIA2 |
cd11892 |
Src Homology 3 domain of Melanoma Inhibitory Activity 2 protein; MIA2 is expressed ... |
31-103 |
9.70e-46 |
|
Src Homology 3 domain of Melanoma Inhibitory Activity 2 protein; MIA2 is expressed specifically in hepatocytes and its expression is controlled by hepatocyte nuclear factor 1 binding sites in the MIA2 promoter. It inhibits the growth and invasion of hepatocellular carcinomas (HCC) and may act as a tumor suppressor. A mutation in MIA2 in mice resulted in reduced cholesterol and triglycerides. Since MIA2 localizes to ER exit sites, it may function as an ER-to-Golgi trafficking protein that regulates lipid metabolism. MIA2 contains an N-terminal SH3-like domain, similar to MIA. It is a member of the recently identified family that also includes MIA, MIAL, and MIA3 (also called TANGO). MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.
Pssm-ID: 212825 Cd Length: 73 Bit Score: 158.85 E-value: 9.70e-46
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1370465039 31 KCGDLECEALINRVSAMRDYRGPDCRYLNFTKGEEISVYVKLAGEREDLWAGSKGKEFGYFPRDAVQIEEVFI 103
Cdd:cd11892 1 KCGDPECERLMSRVQAIRDYRGPDCRYLSFKKGDEIIVYYKLSGKREDLWAGSTGKEFGYFPKDAVKVEEVYI 73
|
|
| SH3_MIA_like |
cd11760 |
Src Homology 3 domain of Melanoma Inhibitory Activity protein and similar proteins; MIA is a ... |
31-103 |
4.50e-36 |
|
Src Homology 3 domain of Melanoma Inhibitory Activity protein and similar proteins; MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands. MIA is a member of the recently identified family that also includes MIA-like (MIAL), MIA2, and MIA3 (also called TANGO); the biological functions of this family are not yet fully understood.
Pssm-ID: 212694 Cd Length: 76 Bit Score: 131.06 E-value: 4.50e-36
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1370465039 31 KCGDLECEALINRVSAMRDYRGPDCRYLNFTKGEEISVYVKLAGEREDLWAGSKG---KEFGYFPRDAVQIEEVFI 103
Cdd:cd11760 1 LCADAECSNPISRARALEDYHGPDCRFLNFKKGDTIYVYSKLAGERQDLWAGSVGgdaGLFGYFPKNLVQELKVYE 76
|
|
| SH3_MIA3 |
cd11893 |
Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or ... |
31-102 |
7.24e-25 |
|
Src Homology 3 domain of Melanoma Inhibitory Activity 3 protein; MIA3, also called TANGO or TANGO1, acts as a tumor suppressor of malignant melanoma. It is downregulated or lost in melanoma cells lines. Unlike other MIA family members, MIA3 is widely expressed except in hematopoietic cells. MIA3 is an ER resident transmembrane protein that is required for the loading of collagen VII into transport vesicles. SNPs in the MIA3 gene have been associated with coronary arterial disease and myocardial infarction. MIA3 contains an N-terminal SH3-like domain, similar to MIA. It is a member of the recently identified family that also includes MIA, MIAL, and MIA2. MIA is a single domain protein that adopts a SH3 domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.
Pssm-ID: 212826 Cd Length: 73 Bit Score: 99.15 E-value: 7.24e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370465039 31 KCGDLECEALINRVSAMRDYRGPDCRYLNFTKGEEISVYVKLAGEREDLWAGSKGKEFGYFPRDAVQIEEVF 102
Cdd:cd11893 1 RCADEECSMLLCRGKAVKDFTGPDCRFLSFKKGETIYVYYKLSGRRTDLWAGSVGFDFGYFPKDLLDVNHLY 72
|
|
| MIA |
cd11890 |
Melanoma Inhibitory Activity protein; MIA is a single domain protein that adopts a Src ... |
30-118 |
1.43e-14 |
|
Melanoma Inhibitory Activity protein; MIA is a single domain protein that adopts a Src Homology 3 (SH3) domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands. It binds peptide ligands with sequence similarity to type III human fibronectin repeats.
Pssm-ID: 212823 Cd Length: 98 Bit Score: 70.68 E-value: 1.43e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 30 KKCGDLECEALINRVSAMRDYRGPDCRYLNFTKGEEISVYVKLAGEREDLWAGS--------KGKEFGYFPRDAVQIEEV 101
Cdd:cd11890 2 KLCADQECSHPISIAVALQDYMAPDCRFIPIQRGQVVYVFSKLKGRGRLFWGGSvqgdyygeQAARLGYFPSSIVQEDQY 81
|
90
....*....|....*..
gi 1370465039 102 FISEEIQMSTKESDFLC 118
Cdd:cd11890 82 LKPGKVEVKTDKWDFYC 98
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
715-1061 |
1.40e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.70 E-value: 1.40e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 715 EVESSLKDASFEKEATEAQSLEfvegsqiSEATCEKLNRSNSELEDEILCLEKELKEEKSKHSEQDELMADISKRIQSLE 794
Cdd:TIGR02168 674 ERRREIEELEEKIEELEEKIAE-------LEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 795 DESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQkvtfedskvhaeqvL 874
Cdd:TIGR02168 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE--------------L 812
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 875 NDKESHIKTLTERLLKMKDWAAMLGEDITDDDNlELEMNSEsengayldnppkgalkkliHAAKLNASLKTLEGERNQIY 954
Cdd:TIGR02168 813 TLLNEEAANLRERLESLERRIAATERRLEDLEE-QIEELSE-------------------DIESLAAEIEELEELIEELE 872
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 955 IQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTEL-------YQENEMKLHRKL-TVEENYRLE 1026
Cdd:TIGR02168 873 SELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKlaqlelrLEGLEVRIDNLQeRLSEEYSLT 952
|
330 340 350
....*....|....*....|....*....|....*
gi 1370465039 1027 KEEklskVDEKISHATEELETYRKRAKDLEEELER 1061
Cdd:TIGR02168 953 LEE----AEALENKIEDDEEEARRRLKRLENKIKE 983
|
|
| MIAL |
cd11891 |
Melanoma Inhibitory Activity-Like protein; MIAL is specifically expressed in the cochlea and ... |
32-102 |
2.24e-11 |
|
Melanoma Inhibitory Activity-Like protein; MIAL is specifically expressed in the cochlea and the vestibule of the inner ear and may contribute to inner ear dysfunction in humans. MIAL is a member of the recently identified family that also includes MIA, MIA2, and MIA3 (also called TANGO); MIA is the most studied member of the family. MIA is a single domain protein that adopts a Src Homology 3 (SH3) domain-like fold; it contains an additional antiparallel beta sheet and two disulfide bonds compared to classical SH3 domains. MIA is secreted from malignant melanoma cells and it plays an important role in melanoma development and invasion. MIA is expressed by chondrocytes in normal tissues and may be important in the cartilage cell phenotype. Unlike classical SH3 domains, MIA does not bind proline-rich ligands.
Pssm-ID: 212824 Cd Length: 83 Bit Score: 61.02 E-value: 2.24e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 32 CGDLECEALINRVSAMRDYRGPDCRYLNFTKGEEISVYVKLAGERE--DLWAGSKGKE--------FGYFPRDAVQIEEV 101
Cdd:cd11891 2 CADEECVYAISLARAEDDYNAPDCRFINIKKGQLIYVYSKLVKENGagEFWSGSVYSEryvdqmgiVGYFPSNLVKEQTV 81
|
.
gi 1370465039 102 F 102
Cdd:cd11891 82 Y 82
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
682-984 |
3.06e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 65.09 E-value: 3.06e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 682 REKKLALMLSGLIEEKSKL------LEKFSLVQK---EYEGYEVESSLKDASFEKEATEAQSLEFvegsqisEATCEKLN 752
Cdd:TIGR02169 185 NIERLDLIIDEKRQQLERLrrerekAERYQALLKekrEYEGYELLKEKEALERQKEAIERQLASL-------EEELEKLT 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 753 RSNSELEDEIlclekelkeekskhSEQDELMADISKRIQSL-EDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNE 831
Cdd:TIGR02169 258 EEISELEKRL--------------EEIEQLLEELNKKIKDLgEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEER 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 832 NSQLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHI-------KTLTERLLKMKDWAAMLGEDI-- 902
Cdd:TIGR02169 324 LAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELeevdkefAETRDELKDYREKLEKLKREIne 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 903 --TDDDNLELEMNSESENGAYLDNPPKGA---LKKLIHAAK-LNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQT 976
Cdd:TIGR02169 404 lkRELDRLQEELQRLSEELADLNAAIAGIeakINELEEEKEdKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEK 483
|
....*...
gi 1370465039 977 EQASLQSE 984
Cdd:TIGR02169 484 ELSKLQRE 491
|
|
| SH3_2 |
pfam07653 |
Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in ... |
46-99 |
6.70e-10 |
|
Variant SH3 domain; SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
Pssm-ID: 429575 [Multi-domain] Cd Length: 54 Bit Score: 56.07 E-value: 6.70e-10
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1370465039 46 AMRDYRGPDCRYLNFTKGEEISVYVKlagEREDLWAGSKGKEFGYFPRDAVQIE 99
Cdd:pfam07653 4 VIFDYVGTDKNGLTLKKGDVVKVLGK---DNDGWWEGETGGRVGLVPSTAVEEI 54
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
790-1118 |
5.20e-09 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 60.80 E-value: 5.20e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 790 IQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQKVTfedskvh 869
Cdd:TIGR04523 206 LKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQ------- 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 870 AEQVLNDKESHIKTLTERLLKMK-----DWAAMLGEDITDDDNLELEMNSE-SENGAYLDNppkgaLKKLIhaAKLNASL 943
Cdd:TIGR04523 279 NNKKIKELEKQLNQLKSEISDLNnqkeqDWNKELKSELKNQEKKLEEIQNQiSQNNKIISQ-----LNEQI--SQLKKEL 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 944 KTLEGERNQIYIQLSE-------VDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQ--ENEMKLH 1014
Cdd:TIGR04523 352 TNSESENSEKQRELEEkqneiekLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKEllEKEIERL 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1015 RKLTVEENYRL--------EKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTihsyQGQIISHEKKAHDNWLAARN 1086
Cdd:TIGR04523 432 KETIIKNNSEIkdltnqdsVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQK----QKELKSKEKELKKLNEEKKE 507
|
330 340 350
....*....|....*....|....*....|..
gi 1370465039 1087 AERNLNDLRKENAhnRQKLTETELKFELLEKD 1118
Cdd:TIGR04523 508 LEEKVKDLTKKIS--SLKEKIEKLESEKKEKE 537
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
741-1059 |
1.67e-08 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 59.26 E-value: 1.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 741 SQIS--EATCEKLNRSNSELEDEILCLEKELKEEKSKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNE 818
Cdd:TIGR04523 328 NQISqnNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLN 407
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 819 ERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAAML 898
Cdd:TIGR04523 408 QQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQK 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 899 GEDITDDDNLELEMNSESENgayLDNPPKgALKKLIhaAKLNASLKTLEGERNQIYIQLSEV---------DKTKEELTE 969
Cdd:TIGR04523 488 QKELKSKEKELKKLNEEKKE---LEEKVK-DLTKKI--SSLKEKIEKLESEKKEKESKISDLedelnkddfELKKENLEK 561
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 970 HIKNLQTEQASLQSENTHFENENQKLQQKLKVMTElyqenemklhrkltveENYRLEKEekLSKVDEKISHATEELETYR 1049
Cdd:TIGR04523 562 EIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK----------------EKKDLIKE--IEEKEKKISSLEKELEKAK 623
|
330
....*....|
gi 1370465039 1050 KRAKDLEEEL 1059
Cdd:TIGR04523 624 KENEKLSSII 633
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
715-1121 |
3.89e-08 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 58.19 E-value: 3.89e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 715 EVESSLKDASFEKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDEilclekelkeekskhseqdelMADISKRIQSLE 794
Cdd:pfam05483 251 EKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKE---------------------LEDIKMSLQRSM 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 795 DESKSLKSQVAEAKMTfkIFQMNEERlkiaikdalneNSQLQESQKQLLQEAEVwkeqVSELNKQKVTFEDSKVHAEQVL 874
Cdd:pfam05483 310 STQKALEEDLQIATKT--ICQLTEEK-----------EAQMEELNKAKAAHSFV----VTEFEATTCSLEELLRTEQQRL 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 875 NDKESHIKTLTERLLKMkdwAAMLGEDITDDDNLELEMNSE----SENGAYLDNppKGALKKLIHAAK-----LNASLKT 945
Cdd:pfam05483 373 EKNEDQLKIITMELQKK---SSELEEMTKFKNNKEVELEELkkilAEDEKLLDE--KKQFEKIAEELKgkeqeLIFLLQA 447
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 946 LEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTElyQENEMKLHRKLTVEENYRL 1025
Cdd:pfam05483 448 REKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQ--EASDMTLELKKHQEDIINC 525
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1026 EKEE-----KLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYQGQIISHEKKAHDNWLAARNAERNLNDLRKENAH 1100
Cdd:pfam05483 526 KKQEermlkQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIEN 605
|
410 420
....*....|....*....|.
gi 1370465039 1101 NRQKLTETELKFELLEKDPYA 1121
Cdd:pfam05483 606 KNKNIEELHQENKALKKKGSA 626
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
694-1063 |
4.44e-08 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 57.72 E-value: 4.44e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 694 IEEKSKLLEKfslVQKEYEGYEVEssLKDASFEKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDEILCLEKELKEEK 773
Cdd:TIGR04523 365 LEEKQNEIEK---LKKENQSYKQE--IKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNN 439
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 774 SKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQV 853
Cdd:TIGR04523 440 SEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKI 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 854 SELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAAMLGEDIT------DDDNLELEmNSESEN-----GAYL 922
Cdd:TIGR04523 520 SSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKENLEKEIDEKNKEieelkqTQKSLKKK-QEEKQElidqkEKEK 598
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 923 DNPPKGALKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVM 1002
Cdd:TIGR04523 599 KDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWPEIIKKIKESKTKIDDI 678
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1370465039 1003 TELYQ--ENEMKLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTI 1063
Cdd:TIGR04523 679 IELMKdwLKELSLHYKKYITRMIRIKDLPKLEEKYKEIEKELKKLDEFSKELENIIKNFNKKF 741
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
936-1118 |
2.36e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 55.83 E-value: 2.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 936 AAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENthfENENQKLQQKLKVMTELyQENEMKLHR 1015
Cdd:TIGR02168 693 IAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV---EQLEERIAQLSKELTEL-EAEIEELEE 768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1016 KLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERT---IHSYQGQIISHEKKAHDNWLAARNAERNLN 1092
Cdd:TIGR02168 769 RLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLneeAANLRERLESLERRIAATERRLEDLEEQIE 848
|
170 180
....*....|....*....|....*.
gi 1370465039 1093 DLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:TIGR02168 849 ELSEDIESLAAEIEELEELIEELESE 874
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
786-1122 |
4.27e-07 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.68 E-value: 4.27e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 786 ISKRIQSLEDESKS------LKSQVAEAKMTFKIFQMNEERLKI-AIKDALNENSQLQESQKQLLQEAEvwkEQVSELNK 858
Cdd:TIGR02168 198 LERQLKSLERQAEKaerykeLKAELRELELALLVLRLEELREELeELQEELKEAEEELEELTAELQELE---EKLEELRL 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 859 QKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDwaamlgeditdddNLELEMNSESENGAYLDNPPKGALKKLIHAAK 938
Cdd:TIGR02168 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRE-------------RLANLERQLEELEAQLEELESKLDELAEELAE 341
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 939 LNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVM-TELYQENEMKLHRKL 1017
Cdd:TIGR02168 342 LEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLeARLERLEDRRERLQQ 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1018 TVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTihsyQGQIishekkahdnwlaaRNAERNLNDLRKE 1097
Cdd:TIGR02168 422 EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEEL----REEL--------------EEAEQALDAAERE 483
|
330 340
....*....|....*....|....*
gi 1370465039 1098 NAHNRQKLTETELKFELLEKDPYAL 1122
Cdd:TIGR02168 484 LAQLQARLDSLERLQENLEGFSEGV 508
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
778-1097 |
4.46e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 54.56 E-value: 4.46e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 778 EQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELN 857
Cdd:COG1196 257 ELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE 336
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 858 KQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKdwAAMLGEDITDDDNLELEMNSESEngayldnppkgALKKLIHAA 937
Cdd:COG1196 337 EELEELEEELEEAEEELEEAEAELAEAEEALLEAE--AELAEAEEELEELAEELLEALRA-----------AAELAAQLE 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 938 KLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQENEMKLHRkl 1017
Cdd:COG1196 404 ELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAE-- 481
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1018 tveenyRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYQGQIISHEKKAHDNWLAARNAERNLNDLRKE 1097
Cdd:COG1196 482 ------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVED 555
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
682-1118 |
9.22e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 53.53 E-value: 9.22e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 682 REKKLALMLSGLIEEKSKLLEKfSLVQKEYEGYEVESSLKDASFEKEATEAQSLEFvegsqisEATCEKLNRSNSELEDE 761
Cdd:PRK03918 179 ERLEKFIKRTENIEELIKEKEK-ELEEVLREINEISSELPELREELEKLEKEVKEL-------EELKEEIEELEKELESL 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 762 ilclekelkeeKSKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKmtfKIFQMNEE--RLKIAIKDALNENSQLQESQ 839
Cdd:PRK03918 251 -----------EGSKRKLEEKIRELEERIEELKKEIEELEEKVKELK---ELKEKAEEyiKLSEFYEEYLDELREIEKRL 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 840 KQLLQEAEVWKEQVSELNKqkvtfedskvhaeqvlndKESHIKTLTERLLKMKDWAAMLGEDITDDDNLELEMNSESENG 919
Cdd:PRK03918 317 SRLEEEINGIEERIKELEE------------------KEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLK 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 920 AYLDNPPKGALKKLIHAA------------KLNASLKTLEGERNQIYIQLSEVDKTK-------EELTEH-----IKNLQ 975
Cdd:PRK03918 379 KRLTGLTPEKLEKELEELekakeeieeeisKITARIGELKKEIKELKKAIEELKKAKgkcpvcgRELTEEhrkelLEEYT 458
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 976 TEQASLQSENTHFENENQKLQQKLKVMtELYQENEMKLHRKLTVEENYRlEKEEKLSKVD-EKISHATEELETYRKRA-- 1052
Cdd:PRK03918 459 AELKRIEKELKEIEEKERKLRKELREL-EKVLKKESELIKLKELAEQLK-ELEEKLKKYNlEELEKKAEEYEKLKEKLik 536
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1053 -----KDLEEELERtIHSYQGQIISHEKK-----------------------------------AHDNWLAARNAERNLN 1092
Cdd:PRK03918 537 lkgeiKSLKKELEK-LEELKKKLAELEKKldeleeelaellkeleelgfesveeleerlkelepFYNEYLELKDAEKELE 615
|
490 500
....*....|....*....|....*.
gi 1370465039 1093 DLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:PRK03918 616 REEKELKKLEEELDKAFEELAETEKR 641
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
819-1063 |
1.14e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.46 E-value: 1.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 819 ERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDW-AAM 897
Cdd:COG4942 30 EQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEElAEL 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 898 LGEditdddnleLEMNSESENGAYLDNP--PKGALKKLIHAAKLNASLKTlegernqiyiQLSEVDKTKEELTEHIKNLQ 975
Cdd:COG4942 110 LRA---------LYRLGRQPPLALLLSPedFLDAVRRLQYLKYLAPARRE----------QAEELRADLAELAALRAELE 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 976 TEQASLQSENTHFENENQKLQQKLKvmtelyqenemklhrkltveenyrlEKEEKLSKVDEKISHATEELETYRKRAKDL 1055
Cdd:COG4942 171 AERAELEALLAELEEERAALEALKA-------------------------ERQKLLARLEKELAELAAELAELQQEAEEL 225
|
....*...
gi 1370465039 1056 EEELERTI 1063
Cdd:COG4942 226 EALIARLE 233
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
778-1060 |
1.19e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 53.14 E-value: 1.19e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 778 EQDELMADISKRIQSLED------ESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDA-------LNENSQLQESQKQLLQ 844
Cdd:PRK03918 142 ESDESREKVVRQILGLDDyenaykNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKekeleevLREINEISSELPELRE 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 845 EAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDK---ESHIKTLTERLLKMKDWAAMLGEDITDDDNLELEMNSESENGAY 921
Cdd:PRK03918 222 ELEKLEKEVKELEELKEEIEELEKELESLEGSKrklEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEF 301
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 922 LDNppkgALKKLihaAKLNASLKTLEGERNQIYIQLSEVDKTK---EELTEHIKNLQTEQASLQSENTHFENENQKLQQK 998
Cdd:PRK03918 302 YEE----YLDEL---REIEKRLSRLEEEINGIEERIKELEEKEerlEELKKKLKELEKRLEELEERHELYEEAKAKKEEL 374
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370465039 999 LKVMTELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEKIShateELETYRKRAKDLEEELE 1060
Cdd:PRK03918 375 ERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIG----ELKKEIKELKKAIEELK 432
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
937-1061 |
2.26e-06 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 52.17 E-value: 2.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 937 AKLNASLKTLEGERNQIYIQLSEVDKTKEEltEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMtelyqENEMKLHRK 1016
Cdd:COG2433 383 EELIEKELPEEEPEAEREKEHEERELTEEE--EEIRRLEEQVERLEAEVEELEAELEEKDERIERL-----ERELSEARS 455
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1370465039 1017 ltvEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELER 1061
Cdd:COG2433 456 ---EERREIRKDREISRLDREIERLERELEEERERIEELKRKLER 497
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
825-1010 |
4.70e-06 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 50.60 E-value: 4.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 825 IKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLlkmKDWAAMLGEDITD 904
Cdd:COG3883 25 LSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREEL---GERARALYRSGGS 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 905 DDNLELEMNSESeNGAYLDNppKGALKKLIHAAklNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSE 984
Cdd:COG3883 102 VSYLDVLLGSES-FSDFLDR--LSALSKIADAD--ADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQ 176
|
170 180
....*....|....*....|....*.
gi 1370465039 985 NTHFENENQKLQQKLKVMTELYQENE 1010
Cdd:COG3883 177 QAEQEALLAQLSAEEAAAEAQLAELE 202
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
690-1117 |
5.25e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 51.27 E-value: 5.25e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 690 LSGLIEEKSKLLEKFSLVQKEYEGYEVESSLKDASFEKEATEAQSL------EFVEGSQISEATCEKLNR----SNSELE 759
Cdd:pfam15921 280 ITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQLSDLESTvsqlrsELREAKRMYEDKIEELEKqlvlANSELT 359
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 760 DEILCLEKELKEEKSKHSEQDELMADISKRIQ--SLEDE-SKSLKSQVAEAKMTFKIFQ-------MNEERLKIAIKDAL 829
Cdd:pfam15921 360 EARTERDQFSQESGNLDDQLQKLLADLHKREKelSLEKEqNKRLWDRDTGNSITIDHLRrelddrnMEVQRLEALLKAMK 439
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 830 NENSQLQESQKQLLQEAEVWKEQVSELNKQkvtFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAAMLGEDitdddnlE 909
Cdd:pfam15921 440 SECQGQMERQMAAIQGKNESLEKVSSLTAQ---LESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEK-------E 509
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 910 LEMNSESENGAYLDNPPKGALKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNL--------------Q 975
Cdd:pfam15921 510 RAIEATNAEITKLRSRVDLKLQELQHLKNEGDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMtqlvgqhgrtagamQ 589
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 976 TEQASLQSENTHFENENQKLQ----------QKLKVMTELYQENEMKL----HRKLTVEENYRLEKEEKLSKVD---EKI 1038
Cdd:pfam15921 590 VEKAQLEKEINDRRLELQEFKilkdkkdakiRELEARVSDLELEKVKLvnagSERLRAVKDIKQERDQLLNEVKtsrNEL 669
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1039 SHATEELETYRKRAKDLEEELERTIHSYQGQIisheKKAHDNWLAARNAERNLN-----------DLRKENAHNRQKLTE 1107
Cdd:pfam15921 670 NSLSEDYEVLKRNFRNKSEEMETTTNKLKMQL----KSAQSELEQTRNTLKSMEgsdghamkvamGMQKQITAKRGQIDA 745
|
490
....*....|
gi 1370465039 1108 TELKFELLEK 1117
Cdd:pfam15921 746 LQSKIQFLEE 755
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
930-1076 |
5.99e-06 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 49.15 E-value: 5.99e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 930 LKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTEL---- 1005
Cdd:COG1579 6 LRALLDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgnv 85
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370465039 1006 -----YQ--ENEM-KLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYQGQIISHEKK 1076
Cdd:COG1579 86 rnnkeYEalQKEIeSLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
686-1118 |
7.00e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.54 E-value: 7.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 686 LALMLSGLIEEKSKLL----EKFSLVQKEYEgyEVESSLKdasfEKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDE 761
Cdd:COG4717 44 RAMLLERLEKEADELFkpqgRKPELNLKELK--ELEEELK----EAEEKEEEYAELQEELEELEEELEELEAELEELREE 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 762 I--LCLEKELKEEKSKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIA-----------IKDA 828
Cdd:COG4717 118 LekLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELleqlslateeeLQDL 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 829 LNENSQLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVlndkeshiktltERLLKMKDWAAMLGEditdddNL 908
Cdd:COG4717 198 AEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALE------------ERLKEARLLLLIAAA------LL 259
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 909 ELEMNSESENGAYLDNPPKGALKK---LIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSEN 985
Cdd:COG4717 260 ALLGLGGSLLSLILTIAGVLFLVLgllALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 986 THFENENQKLQQKLKVMTELyqENEMKLhrkltveENYRLEKEEKLSKVDekishaTEELETYRKRAKDLEE--ELERTI 1063
Cdd:COG4717 340 LELLDRIEELQELLREAEEL--EEELQL-------EELEQEIAALLAEAG------VEDEEELRAALEQAEEyqELKEEL 404
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1370465039 1064 HSYQGQIISHEKKAHDNWLAA---------RNAERNLNDLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:COG4717 405 EELEEQLEELLGELEELLEALdeeeleeelEELEEELEELEEELEELREELAELEAELEQLEED 468
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
683-1117 |
1.18e-05 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 50.17 E-value: 1.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 683 EKKLALMLSGLI--EEKSKLLEKFSLVQkEYEGYEVESSLKdasfeKEATEAQSLE----FVEG------SQISE--ATC 748
Cdd:pfam01576 158 EERISEFTSNLAeeEEKAKSLSKLKNKH-EAMISDLEERLK-----KEEKGRQELEkakrKLEGestdlqEQIAElqAQI 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 749 EKLNRSNSELEDEILCLEKELKEEKSKHSEQ----DELMADISKRIQSLEDEskslKSQVAEAKMTFKIFQMNEERLKIA 824
Cdd:pfam01576 232 AELRAQLAKKEEELQAALARLEEETAQKNNAlkkiRELEAQISELQEDLESE----RAARNKAEKQRRDLGEELEALKTE 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 825 IKDALNENSQLQESQKQLLQEaevwkeqVSELnkQKVTFEDSKVHAEQVLNDKESH---IKTLTERLLKMKDWAAML--G 899
Cdd:pfam01576 308 LEDTLDTTAAQQELRSKREQE-------VTEL--KKALEEETRSHEAQLQEMRQKHtqaLEELTEQLEQAKRNKANLekA 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 900 EDITDDDNLELEMNSESENGAYLDNPPKGalkklihaaklnaslKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQA 979
Cdd:pfam01576 379 KQALESENAELQAELRTLQQAKQDSEHKR---------------KKLEGQLQELQARLSESERQRAELAEKLSKLQSELE 443
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 980 SLQSENTHFENENQKLQQKLKVMtelyqenEMKLHrklTVEENYRLEKEEKLSkVDEKISHATEELETYRKRAKDLEE-- 1057
Cdd:pfam01576 444 SVSSLLNEAEGKNIKLSKDVSSL-------ESQLQ---DTQELLQEETRQKLN-LSTRLRQLEDERNSLQEQLEEEEEak 512
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370465039 1058 -ELERTIHSYQGQIISHEKKAHDNWLAARNAERNLNDLRKENAHNRQKLTETELKFELLEK 1117
Cdd:pfam01576 513 rNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEK 573
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
681-982 |
1.63e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 49.68 E-value: 1.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 681 GREKKLALMLSGLIEEKSKLLEKFSLVQKEYEGYEVESSLKDASFEKEATEAQSLEfvegSQISEATCEKLNRSNSELED 760
Cdd:TIGR02169 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLE----ARLSHSRIPEIQAELSKLEE 805
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 761 EilclekelkeekskHSEQDELMADISKRIQS-------LEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENS 833
Cdd:TIGR02169 806 E--------------VSRIEARLREIEQKLNRltlekeyLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELE 871
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 834 QLQESQKQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLlkmkdwaAMLGEDITDDDNLELEMN 913
Cdd:TIGR02169 872 ELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKL-------EALEEELSEIEDPKGEDE 944
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370465039 914 SESENgayldNPPKGALKKLIHaaKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQ 982
Cdd:TIGR02169 945 EIPEE-----ELSLEDVQAELQ--RVEEEIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAIL 1006
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
694-1032 |
1.70e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 49.67 E-value: 1.70e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 694 IEEKSKLLEKFSLVQKEYEGYEVESSLKD--------ASFEKEATEAQSLEFVEGSQISEATcEKLNRSNS---ELEDEI 762
Cdd:TIGR02168 205 LERQAEKAERYKELKAELRELELALLVLRleelreelEELQEELKEAEEELEELTAELQELE-EKLEELRLevsELEEEI 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 763 lclEKELKEEKSKHSEQDELMADI---SKRIQSLEDESKSLKSQVAEakmtfkifqmNEERLKIAIKDAlnenSQLQESQ 839
Cdd:TIGR02168 284 ---EELQKELYALANEISRLEQQKqilRERLANLERQLEELEAQLEE----------LESKLDELAEEL----AELEEKL 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 840 KQLLQEAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAAMLGEDITD-DDNLElemNSESEN 918
Cdd:TIGR02168 347 EELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERlEDRRE---RLQQEI 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 919 GAYLDNPPKGALKKLIHA-AKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQ 997
Cdd:TIGR02168 424 EELLKKLEEAELKELQAElEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
|
330 340 350 360
....*....|....*....|....*....|....*....|.
gi 1370465039 998 KLKVMTELYQENEMK------LHRKLTVEENYRLEKEEKLS 1032
Cdd:TIGR02168 504 FSEGVKALLKNQSGLsgilgvLSELISVDEGYEAAIEAALG 544
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
788-1118 |
2.17e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 49.20 E-value: 2.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 788 KRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENsQLQESQKQLLQEAEVWKEQV-SELNKQKVTFEDS 866
Cdd:pfam02463 153 ERRLEIEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKL-QELKLKEQAKKALEYYQLKEkLELEEEYLLYLDY 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 867 KVHAEQVLNDKESHIKTLTERLLKMKdwaamlGEDITDDDNLELEMNSESENGAYLDNPPKGALKKLIHAAKLNASLKTL 946
Cdd:pfam02463 232 LKLNEERIDLLQELLRDEQEEIESSK------QEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKL 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 947 EGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQENEMKLHRKLTV------- 1019
Cdd:pfam02463 306 ERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKkkleser 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1020 -------EENYRLEKEEKLSKVDEKISHATEELETYRKRAKDL---EEELERTIHSYQGQIIshEKKAHDNWLAARNAER 1089
Cdd:pfam02463 386 lssaaklKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEEleiLEEEEESIELKQGKLT--EEKEELEKQELKLLKD 463
|
330 340
....*....|....*....|....*....
gi 1370465039 1090 NLNDLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:pfam02463 464 ELELKKSEDLLKETQLVKLQEQLELLLSR 492
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
777-1139 |
3.07e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 48.91 E-value: 3.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 777 SEQDELmADISKRIQSLEDESKSLKSQVAEAKMtfkifQMNEerLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSEL 856
Cdd:TIGR02169 671 SEPAEL-QRLRERLEGLKRELSSLQSELRRIEN-----RLDE--LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEL 742
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 857 NKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAamlgEDITDDDNLELEMNSESEngayldnppkgalkklihA 936
Cdd:TIGR02169 743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEAL----NDLEARLSHSRIPEIQAE------------------L 800
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 937 AKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENEnqklqqklkvmtelyqenemklhrk 1016
Cdd:TIGR02169 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE------------------------- 855
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1017 ltvEENYRLEKEEKLskvdekishatEELETYRKRAKDLEEELErtihSYQGQIISHEKKahdnwlaARNAERNLNDLRK 1096
Cdd:TIGR02169 856 ---IENLNGKKEELE-----------EELEELEAALRDLESRLG----DLKKERDELEAQ-------LRELERKIEELEA 910
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 1370465039 1097 ENAHNRQKLTETELKFELLEKDPYALDvPNTAFGREHSPYGPS 1139
Cdd:TIGR02169 911 QIEKKRKRLSELKAKLEALEEELSEIE-DPKGEDEEIPEEELS 952
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
788-1117 |
5.11e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 47.71 E-value: 5.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 788 KRIQSLEDESKSLKSQVAEAKMTF-----KIFQMNEERLKIAI-----KDALNEN-SQLQESQKQLLQ------------ 844
Cdd:TIGR04523 68 EKINNSNNKIKILEQQIKDLNDKLkknkdKINKLNSDLSKINSeikndKEQKNKLeVELNKLEKQKKEnkknidkfltei 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 845 -----EAEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDWAAMLGEDITDDDNLELEMN-SESEN 918
Cdd:TIGR04523 148 kkkekELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSNLKKKIQKNKSLESQISeLKKQN 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 919 gayldnppkgalkklihaAKLNASLKTLEGERNQIYIQLSEVDK----TKEELTEHIKNLQTEQASLQSENT---HFENE 991
Cdd:TIGR04523 228 ------------------NQLKDNIEKKQQEINEKTTEISNTQTqlnqLKDEQNKIKKQLSEKQKELEQNNKkikELEKQ 289
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 992 NQKLQQKLKVMTELYQENEMK-LHRKLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLE---EELERTIHSYQ 1067
Cdd:TIGR04523 290 LNQLKSEISDLNNQKEQDWNKeLKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSEsenSEKQRELEEKQ 369
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 1370465039 1068 GQIISHEKKAHDNWLAARNAERNLNDLR-------KENAHNRQKLTETELKFELLEK 1117
Cdd:TIGR04523 370 NEIEKLKKENQSYKQEIKNLESQINDLEskiqnqeKLNQQKDEQIKKLQQEKELLEK 426
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
850-1061 |
1.70e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 46.16 E-value: 1.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 850 KEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTL-------TERLLKMKDWAAMLGEDI------TDDDNLELEMNSES 916
Cdd:PHA02562 173 KDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQrkkngenIARKQNKYDELVEEAKTIkaeieeLTDELLNLVMDIED 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 917 engayldnpPKGALKKLIHA-AKLNASLKTLEGERN------------QiyiQLSEVDKTKEELTEHIKNLQTeqaSLQS 983
Cdd:PHA02562 253 ---------PSAALNKLNTAaAKIKSKIEQFQKVIKmyekggvcptctQ---QISEGPDRITKIKDKLKELQH---SLEK 317
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1370465039 984 ENTHFENENQKLQQklkvmtelYQENEMKLHrkltveenyrlEKEEKLSKVDEKIShateeleTYRKRAKDLEEELER 1061
Cdd:PHA02562 318 LDTAIDELEEIMDE--------FNEQSKKLL-----------ELKNKISTNKQSLI-------TLVDKAKKVKAAIEE 369
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
701-1116 |
2.09e-04 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 45.80 E-value: 2.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 701 LEKFSLVQKEYEGY-----EVESSLKDASFEKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDEilclekelkeeksk 775
Cdd:PRK02224 236 RDEADEVLEEHEERreeleTLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAE-------------- 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 776 hSEQDELMAD-ISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVS 854
Cdd:PRK02224 302 -AGLDDADAEaVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVE 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 855 ElnkqkvtfedskvhAEQVLNDKESHIKTLTERLlkmkdwaamlgEDITDD-DNLELEMNSESENgayldnppKGALKKL 933
Cdd:PRK02224 381 D--------------RREEIEELEEEIEELRERF-----------GDAPVDlGNAEDFLEELREE--------RDELRER 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 934 IhaAKLNASLKTLEG--ERNQiyiQLSEVDKTKE-----ELTEHIKNL---QTEQASLQSENTHFENENQKLQQKLKVMT 1003
Cdd:PRK02224 428 E--AELEATLRTARErvEEAE---ALLEAGKCPEcgqpvEGSPHVETIeedRERVEELEAELEDLEEEVEEVEERLERAE 502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1004 ELyQENEMKLHRKLTVEENYRLEKEEKLSKVDEKishaTEELETYRKRAKDLEEELERTIHSYQgqiishekKAHDNWLA 1083
Cdd:PRK02224 503 DL-VEAEDRIERLEERREDLEELIAERRETIEEK----RERAEELRERAAELEAEAEEKREAAA--------EAEEEAEE 569
|
410 420 430
....*....|....*....|....*....|...
gi 1370465039 1084 ARNAERNLNDLRKENAHNRQKLTETELKFELLE 1116
Cdd:PRK02224 570 AREEVAELNSKLAELKERIESLERIRTLLAAIA 602
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
929-1125 |
2.53e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.14 E-value: 2.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 929 ALKKLIhaAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELY-- 1006
Cdd:COG4942 31 QLQQEI--AELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELae 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1007 -------------------QENEMKLHRKLTVEE---NYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIH 1064
Cdd:COG4942 109 llralyrlgrqpplalllsPEDFLDAVRRLQYLKylaPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370465039 1065 SYQGQIISHEKKAHDNWLAARNAERNLNDLRKENAHNRQKLTETELKFELLEKDPYALDVP 1125
Cdd:COG4942 189 ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFA 249
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
694-1010 |
2.95e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 45.31 E-value: 2.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 694 IEEKSKLLEKFSLVqKEYEGYEVESSLKDAsfEKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDEIlclEKELKEEK 773
Cdd:COG1196 218 LKEELKELEAELLL-LKLRELEAELEELEA--ELEELEAELEELEAELAELEAELEELRLELEELELEL---EEAQAEEY 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 774 SKHSEQDELMADIS---KRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWK 850
Cdd:COG1196 292 ELLAELARLEQDIArleERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 851 EQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKMKDwaamlgeditDDDNLELEMNSESENgayldnppkgal 930
Cdd:COG1196 372 AELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLE----------RLERLEEELEELEEA------------ 429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 931 kklihAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQENE 1010
Cdd:COG1196 430 -----LAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYE 504
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
929-1118 |
4.14e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 44.93 E-value: 4.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 929 ALKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQK---LQQKL-KVMTE 1004
Cdd:COG1196 217 ELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELEleeAQAEEyELLAE 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1005 LYQENEMKLHRKLTVEENY--RLEKEEKLSKVDEKISHATEELETYRKRAKDLEEEL----------ERTIHSYQGQIIS 1072
Cdd:COG1196 297 LARLEQDIARLEERRRELEerLEELEEELAELEEELEELEEELEELEEELEEAEEELeeaeaelaeaEEALLEAEAELAE 376
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1370465039 1073 HEKKAHDNWLAARNAERNLNDLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:COG1196 377 AEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEE 422
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
778-1010 |
4.69e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 44.83 E-value: 4.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 778 EQDELMADISKRIQSLEDESKSLKSQVAEAKMTFK-IFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSEL 856
Cdd:pfam12128 280 ERQETSAELNQLLRTLDDQWKEKRDELNGELSAADaAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSELENL 359
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 857 NKQKVTFEDSKVHAEQVLNDKESHIKT--------LTERLLKMKDWAAMLGEDITDD-DNLELEMNSESENGAYLDNPPK 927
Cdd:pfam12128 360 EERLKALTGKHQDVTAKYNRRRSKIKEqnnrdiagIKDKLAKIREARDRQLAVAEDDlQALESELREQLEAGKLEFNEEE 439
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 928 GALKKLIHAAK--LNASLKTLEgERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTEL 1005
Cdd:pfam12128 440 YRLKSRLGELKlrLNQATATPE-LLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEER 518
|
....*
gi 1370465039 1006 YQENE 1010
Cdd:pfam12128 519 QSALD 523
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
757-1069 |
4.98e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 44.89 E-value: 4.98e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 757 ELEDEILCLEKELKEEKSKHSEQDELMADIS------KRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALN 830
Cdd:PRK01156 153 KILDEILEINSLERNYDKLKDVIDMLRAEISnidyleEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMD 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 831 ENSQLQESQKQLLQEAEVWKEQVSELNKQkvtfeDSKVHAEQVLNDKeshIKTLTERLLKMKDWAAMLG-EDITDDDNLE 909
Cdd:PRK01156 233 DYNNLKSALNELSSLEDMKNRYESEIKTA-----ESDLSMELEKNNY---YKELEERHMKIINDPVYKNrNYINDYFKYK 304
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 910 LEMNSESENGAYLDnppkGALKKLIHAAKlnaSLKTLEGERNQIYIQLSEVDKTKEELTEhiknLQTEQASLQSENTHFE 989
Cdd:PRK01156 305 NDIENKKQILSNID----AEINKYHAIIK---KLSVLQKDYNDYIKKKSRYDDLNNQILE----LEGYEMDYNSYLKSIE 373
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 990 NENQKLQQKLKVMTELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEkISHATEELETYRKRAKDLEEELERTIHSYQGQ 1069
Cdd:PRK01156 374 SLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQD-ISSKVSSLNQRIRALRENLDELSRNMEMLNGQ 452
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
783-1008 |
5.10e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.62 E-value: 5.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 783 MADISKRIQSLEDESKSLKSQVAEAkmtfkifqmnEERLKiAIKDAlNENSQLQESQKQLLQeaevwkeQVSELNKQKVT 862
Cdd:COG3206 170 REEARKALEFLEEQLPELRKELEEA----------EAALE-EFRQK-NGLVDLSEEAKLLLQ-------QLSELESQLAE 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 863 fedskvhAEQVLNDKESHIKTLTERLLKMKDWAAMLGED------ITDDDNLELEMNSESENgaYLDNPPKgalkklihA 936
Cdd:COG3206 231 -------ARAELAEAEARLAALRAQLGSGPDALPELLQSpviqqlRAQLAELEAELAELSAR--YTPNHPD--------V 293
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370465039 937 AKLNASLKTLEGERNQiyiqlsEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQE 1008
Cdd:COG3206 294 IALRAQIAALRAQLQQ------EAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLERE 359
|
|
| Spc7 |
smart00787 |
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ... |
780-892 |
8.53e-04 |
|
Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.
Pssm-ID: 197874 [Multi-domain] Cd Length: 312 Bit Score: 43.08 E-value: 8.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 780 DELMADISKRIQSLEDESKSLKSQVAEAKmtfKIFQMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQ 859
Cdd:smart00787 171 NSIKPKLRDRKDALEEELRQLKQLEDELE---DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNK 247
|
90 100 110
....*....|....*....|....*....|....*..
gi 1370465039 860 KVTFEDSKVHAEQVLND----KESHIKTLTERLLKMK 892
Cdd:smart00787 248 KSELNTEIAEAEKKLEQcrgfTFKEIEKLKEQLKLLQ 284
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
754-1103 |
9.70e-04 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 43.67 E-value: 9.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 754 SNSELEDEILCLEKELKEEKSKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIF--QMNEERLKI------AI 825
Cdd:pfam12128 598 SEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRLfdEKQSEKDKKnkalaeRK 677
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 826 KDALNENSQLQESQKQLLQEAEVWKEQVSElnkQKVTFEDSKVHAEQVL-NDKESHIKTLTERLLKMKDWAAMLGEDITD 904
Cdd:pfam12128 678 DSANERLNSLEAQLKQLDKKHQAWLEEQKE---QKREARTEKQAYWQVVeGALDAQLALLKAAIAARRSGAKAELKALET 754
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 905 DDNLELE-MNSESENGAYLDNPPKGALKKLIHAAKLNASL--------KTLEGERNQIYIQLSEVDKTKEELTEhikNLQ 975
Cdd:pfam12128 755 WYKRDLAsLGVDPDVIAKLKREIRTLERKIERIAVRRQEVlryfdwyqETWLQRRPRLATQLSNIERAISELQQ---QLA 831
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 976 TEQASLQSENTHFENENQKLQQKLKVMTElyqenemkLHRKLTVEENY--RLEKEEKLSKVDEKISHATEELETYRKRAK 1053
Cdd:pfam12128 832 RLIADTKLRRAKLEMERKASEKQQVRLSE--------NLRGLRCEMSKlaTLKEDANSEQAQGSIGERLAQLEDLKLKRD 903
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1370465039 1054 DLEEELERTIHSYQGQIISHEKKAHD-NWLAARNAERNLNDLRKENAHNRQ 1103
Cdd:pfam12128 904 YLSESVKKYVEHFKNVIADHSGSGLAeTWESLREEDHYQNDKGIRLLDYRK 954
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
776-1111 |
1.01e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 43.89 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 776 HSEQDELMADISKRIQSLEDESKSLKS-QVAEAKMTFKIFQMNEERLKIAIKdalnenSQLQESQKQLLQEAEVWKEQVS 854
Cdd:TIGR01612 488 NSKQDNTVKLILMRMKDFKDIIDFMELyKPDEVPSKNIIGFDIDQNIKAKLY------KEIEAGLKESYELAKNWKKLIH 561
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 855 ELNKQKVTFEDSKVHAEQVLNDkeshiktLTERLLKMKDwaamlgeDITDDDNLELEMNSESENGAYLDNPPKGA--LKK 932
Cdd:TIGR01612 562 EIKKELEEENEDSIHLEKEIKD-------LFDKYLEIDD-------EIIYINKLKLELKEKIKNISDKNEYIKKAidLKK 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 933 LIhaaklnaslktlegERNQIYIqlSEVDKTKE-ELTEHIKNLQTEQASLQSENTH-FENENQKLQQKLkvmTELYQENE 1010
Cdd:TIGR01612 628 II--------------ENNNAYI--DELAKISPyQVPEHLKNKDKIYSTIKSELSKiYEDDIDALYNEL---SSIVKENA 688
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1011 MKlhrklTVEENYRLEK-EEKLSKVDEKISHATEE--------LETYRKRAKDLEEELERTIHSYqgqiISHE--KKAHD 1079
Cdd:TIGR01612 689 ID-----NTEDKAKLDDlKSKIDKEYDKIQNMETAtvelhlsnIENKKNELLDIIVEIKKHIHGE----INKDlnKILED 759
|
330 340 350
....*....|....*....|....*....|...
gi 1370465039 1080 NWLAARNAERNLNDLRKENAH-NRQKLTETELK 1111
Cdd:TIGR01612 760 FKNKEKELSNKINDYAKEKDElNKYKSKISEIK 792
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
780-1080 |
1.22e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 43.50 E-value: 1.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 780 DELMADISKRiQSLEDESKSLKSQVAEAKMTFKIFQMNEERL-----KIAIKDALNEN--------SQLQESQKQLLQEA 846
Cdd:TIGR01612 1486 NELKEHIDKS-KGCKDEADKNAKAIEKNKELFEQYKKDVTELlnkysALAIKNKFAKTkkdseiiiKEIKDAHKKFILEA 1564
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 847 EVWKEQVSELNKQKVTFEDSKVH---AEQVLNDKESHIKTLTERLLKMKDWAAMLGEDITDDDNLE-----LEMNSE--- 915
Cdd:TIGR01612 1565 EKSEQKIKEIKKEKFRIEDDAAKndkSNKAAIDIQLSLENFENKFLKISDIKKKINDCLKETESIEkkissFSIDSQdte 1644
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 916 -SENGAYLD----------NPPKGALKKLIHAAKLNASLKTLEGERNQ--------IYIQLSEVDKTKEELTEHIKNL-- 974
Cdd:TIGR01612 1645 lKENGDNLNslqefleslkDQKKNIEDKKKELDELDSEIEKIEIDVDQhkknyeigIIEKIKEIAIANKEEIESIKELie 1724
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 975 QTEQASLQSENTH-FE--NENQKLQQKLKVMTELYQENeMKLHRKLTveeNYRlekeEKLSKvdEKISHatEELETYRKR 1051
Cdd:TIGR01612 1725 PTIENLISSFNTNdLEgiDPNEKLEEYNTEIGDIYEEF-IELYNIIA---GCL----ETVSK--EPITY--DEIKNTRIN 1792
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 1370465039 1052 AKD--LEE-ELERTIHSYQ--------GQIISHEKKAHDN 1080
Cdd:TIGR01612 1793 AQNefLKIiEIEKKSKSYLddieakefDRIINHFKKKLDH 1832
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
943-1067 |
1.22e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.37 E-value: 1.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 943 LKTLEGERNQIYIQLSEVDKTK---EELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQENEMKLHR---K 1016
Cdd:COG4913 663 VASAEREIAELEAELERLDASSddlAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAaedL 742
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1370465039 1017 LTVEENYRLEK---EEKLSKVDEKISHA-TEELETYRKRAKDLEEELERTIHSYQ 1067
Cdd:COG4913 743 ARLELRALLEErfaAALGDAVERELRENlEERIDALRARLNRAEEELERAMRAFN 797
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
815-1062 |
1.29e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 42.59 E-value: 1.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 815 QMNEERLKIAIKDALNENSQLQESQKQLLQEAEVWKEQVSELNKQKvtfedsKVHAEQVLNDKEShIKTLTERLLKMKDW 894
Cdd:COG1340 7 SSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQV------KELREEAQELREK-RDELNEKVKELKEE 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 895 AAMLGEDITDDDNlELEMNSESENGAYLDNPPKGALKKLIhaAKL-----NASLkTLEGERnQIYIQLSEVD------KT 963
Cdd:COG1340 80 RDELNEKLNELRE-ELDELRKELAELNKAGGSIDKLRKEI--ERLewrqqTEVL-SPEEEK-ELVEKIKELEkelekaKK 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 964 KEELTEHIKNLQTEQASLQSE-NTHFENEN---QKLQQKLKVMTELYQE-NEMK-----LHRKLtveenyrLEKEEKLSK 1033
Cdd:COG1340 155 ALEKNEKLKELRAELKELRKEaEEIHKKIKelaEEAQELHEEMIELYKEaDELRkeadeLHKEI-------VEAQEKADE 227
|
250 260
....*....|....*....|....*....
gi 1370465039 1034 VDEKISHATEELETYRKRAKDLEEELERT 1062
Cdd:COG1340 228 LHEEIIELQKELRELRKELKKLRKKQRAL 256
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
935-1118 |
1.53e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 1.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 935 HAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKL-KVMTELYQENE--M 1011
Cdd:COG4372 4 LGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELeQARSELEQLEEelE 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1012 KLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERtihsyqgqiISHEKKAHDNWLAARNAErnL 1091
Cdd:COG4372 84 ELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQ---------LEAQIAELQSEIAEREEE--L 152
|
170 180
....*....|....*....|....*..
gi 1370465039 1092 NDLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:COG4372 153 KELEEQLESLQEELAALEQELQALSEA 179
|
|
| YabA |
COG4467 |
Regulator of replication initiation timing YabA [Replication, recombination and repair]; |
949-1016 |
2.25e-03 |
|
Regulator of replication initiation timing YabA [Replication, recombination and repair];
Pssm-ID: 443564 [Multi-domain] Cd Length: 107 Bit Score: 39.08 E-value: 2.25e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1370465039 949 ERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQENEMKLHRK 1016
Cdd:COG4467 2 DKKELFDRLSELEEQLGELLKELGELKDEVAELLEENARLRIENEHLRERLEELEKKKEKKAEKDIGE 69
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
930-1116 |
2.31e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 41.83 E-value: 2.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 930 LKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKTKEELTE--HIKNLQTEQASL-----QSENTHFENENQ---KLQQKL 999
Cdd:pfam13868 11 LNSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEeeRERALEEEEEKEeerkeERKRYRQELEEQieeREQKRQ 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1000 KVMTELYQENEMKLHRKLTVEENYRLEKEEKLSKVDEK---ISHATEELETYRKRAKDLEEELERTIHSYQGQIISHEKK 1076
Cdd:pfam13868 91 EEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLreeIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEE 170
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1370465039 1077 AHDNWLAARNA-ERNLNDLRKENAHNRQKLTE-TELKFELLE 1116
Cdd:pfam13868 171 REAEREEIEEEkEREIARLRAQQEKAQDEKAErDELRAKLYQ 212
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
783-1061 |
2.38e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 42.36 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 783 MADISKRIQSLEDESKSLKSQVAEAKMTFKifqmNEERLkIAIKDALNENSQLQESQKQL--------LQEAEVWKEQVS 854
Cdd:PRK03918 461 LKRIEKELKEIEEKERKLRKELRELEKVLK----KESEL-IKLKELAEQLKELEEKLKKYnleelekkAEEYEKLKEKLI 535
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 855 ELNKQKVTFEDSKVHAEQVLNDK---ESHIKTLTERLLKMKDWAAMLGEDITDDDNLELEmNSESENGAYLDnpPKGALK 931
Cdd:PRK03918 536 KLKGEIKSLKKELEKLEELKKKLaelEKKLDELEEELAELLKELEELGFESVEELEERLK-ELEPFYNEYLE--LKDAEK 612
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 932 KLihaAKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQT-----EQASLQSENTHFENENQKLQQKLKVMTELY 1006
Cdd:PRK03918 613 EL---EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKkyseeEYEELREEYLELSRELAGLRAELEELEKRR 689
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1370465039 1007 QENEMKLhRKLTVEENYRLEKEEKLskvdEKISHATEELETYRKRAKDLEEELER 1061
Cdd:PRK03918 690 EEIKKTL-EKLKEELEEREKAKKEL----EKLEKALERVEELREKVKKYKALLKE 739
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
778-1118 |
2.53e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.50 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 778 EQDELMADISKRIQSLEDESKSlkSQVAEAKMtfkifqmnEERL--KIAIKDALNEnsQLQESQKQLLQEAEVWKEQVSE 855
Cdd:pfam10174 412 DKDKQLAGLKERVKSLQTDSSN--TDTALTTL--------EEALseKERIIERLKE--QREREDRERLEELESLKKENKD 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 856 LNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKmkdwaamlgediTDDDNLELEMNSEsengayldnppkgalKKLIH 935
Cdd:pfam10174 480 LKEKVSALQPELTEKESSLIDLKEHASSLASSGLK------------KDSKLKSLEIAVE---------------QKKEE 532
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 936 AAKLNASLKTLEgernqiyiQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMtelyqenemklhr 1015
Cdd:pfam10174 533 CSKLENQLKKAH--------NAEEAVRTNPEINDRIRLLEQEVARYKEESGKAQAEVERLLGILREV------------- 591
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1016 kltveENYRLEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTihsyQGQIISHEKKAHDNwLAARNAERNLNDLR 1095
Cdd:pfam10174 592 -----ENEKNDKDKKIAELESLTLRQMKEQNKKVANIKHGQQEMKKK----GAQLLEEARRREDN-LADNSQQLQLEELM 661
|
330 340
....*....|....*....|....*...
gi 1370465039 1096 KENAHNRQKLTETELKFE-----LLEKD 1118
Cdd:pfam10174 662 GALEKTRQELDATKARLSstqqsLAEKD 689
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
874-1079 |
2.59e-03 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 42.40 E-value: 2.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 874 LNDKESHIKTL----TERLLKMKDWAAMLGEDITDDDNLELEMNSESENGAYLdNPPKGALKKLIHAAKLN-----ASLK 944
Cdd:pfam05483 235 INDKEKQVSLLliqiTEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKEL-IEKKDHLTKELEDIKMSlqrsmSTQK 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 945 TLEgERNQIYI------------QLSEVDKTKEELTEHIKNLQTEQASLQ----SENTHFENENQKLqqKLKVMTELYQE 1008
Cdd:pfam05483 314 ALE-EDLQIATkticqlteekeaQMEELNKAKAAHSFVVTEFEATTCSLEellrTEQQRLEKNEDQL--KIITMELQKKS 390
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370465039 1009 NEMKLHRKLTVEENYRLEKEEKLSKVDEKISHATEELETyrkrakdLEEELERTIHSYQGQIISHEKKAHD 1079
Cdd:pfam05483 391 SELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEK-------IAEELKGKEQELIFLLQAREKEIHD 454
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
726-1107 |
2.99e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 42.08 E-value: 2.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 726 EKEATEAQSLEFVEGSQISEATCEKLNRSNSELEDEILCLEKELKEEKSKHSEQDELMADISKRIQSLEDESKSLKSQVA 805
Cdd:pfam01576 6 EMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLE 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 806 EakmtfkifqmNEERlkiaikdalneNSQLQESQKQLLQEAEVWKEQVSELN--KQKVTFEdsKVHAEQVLNDKESHIKT 883
Cdd:pfam01576 86 E----------EEER-----------SQQLQNEKKKMQQHIQDLEEQLDEEEaaRQKLQLE--KVTTEAKIKKLEEDILL 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 884 LTERLLKMKDWAAMLGEDITDddnLELEMNSESENGAYLDnppKGALKKLIHAAKLNASLKTLEGERNQIYIQLSEVDKT 963
Cdd:pfam01576 143 LEDQNSKLSKERKLLEERISE---FTSNLAEEEEKAKSLS---KLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGE 216
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 964 KEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKvmTELYQENE-MKLHRKLtveENYRLEKEEKLSKvdEKISHAT 1042
Cdd:pfam01576 217 STDLQEQIAELQAQIAELRAQLAKKEEELQAALARLE--EETAQKNNaLKKIREL---EAQISELQEDLES--ERAARNK 289
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1043 EEletyrKRAKDLEEELE--RT--------------IHSYQGQIISHEKKAHDNwlAARNAERNLNDLRKENAHNRQKLT 1106
Cdd:pfam01576 290 AE-----KQRRDLGEELEalKTeledtldttaaqqeLRSKREQEVTELKKALEE--ETRSHEAQLQEMRQKHTQALEELT 362
|
.
gi 1370465039 1107 E 1107
Cdd:pfam01576 363 E 363
|
|
| HMMR_N |
pfam15905 |
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ... |
788-998 |
3.34e-03 |
|
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.
Pssm-ID: 464932 [Multi-domain] Cd Length: 329 Bit Score: 41.34 E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 788 KRIQSLEDESKSLKSQVaEAKMtfKIFQMNEERLKIaikdalnensQLQESQKQLLQEaevwKEQVSELNKQKVTFEDSK 867
Cdd:pfam15905 152 KKMSSLSMELMKLRNKL-EAKM--KEVMAKQEGMEG----------KLQVTQKNLEHS----KGKVAQLEEKLVSTEKEK 214
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 868 V----HAEQVLNDKEsHIKTLTERLLKMKDWAAMLGEDI-TDDDNLELEMNSESENgayldnppKGALKKLIHaaKLNAS 942
Cdd:pfam15905 215 IeeksETEKLLEYIT-ELSCVSEQVEKYKLDIAQLEELLkEKNDEIESLKQSLEEK--------EQELSKQIK--DLNEK 283
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1370465039 943 LKTLEGERNQIyiqLSEvDKTKEEltehikNLQTEQASLQSENTHFENENQKLQQK 998
Cdd:pfam15905 284 CKLLESEKEEL---LRE-YEEKEQ------TLNAELEELKEKLTLEEQEHQKLQQK 329
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1026-1109 |
3.34e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 41.67 E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1026 EKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTihsyQGQIISHEKKAHDNWLAARNAERNLNDLRKENAHNRQKL 1105
Cdd:COG4942 24 EAEAELEQLQQEIAELEKELAALKKEEKALLKQLAAL----ERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
....
gi 1370465039 1106 TETE 1109
Cdd:COG4942 100 EAQK 103
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
690-1117 |
3.70e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 41.96 E-value: 3.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 690 LSGLIEEKSKLLEK--FSLVQKEYEGYEVESSLKDAsfekeateaqsLEFVEGSQISEATCEKL--NRSNSELEDEILCL 765
Cdd:TIGR00606 438 LGRTIELKKEILEKkqEELKFVIKELQQLEGSSDRI-----------LELDQELRKAERELSKAekNSLTETLKKEVKSL 506
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 766 EKELKEEKSKHSEQDELMADISKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNEnSQLQESQKQLLQE 845
Cdd:TIGR00606 507 QNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFPNK-KQLEDWLHSKSKE 585
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 846 AEVWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTLTERLLKmkdwaAMLGEDITDD-DNLELEMNSESENGAYLDN 924
Cdd:TIGR00606 586 INQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLFD-----VCGSQDEESDlERLKEEIEKSSKQRAMLAG 660
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 925 ppkgalkklihAAKLNASLKTLEGERNQIYIQLSEVD-KTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMT 1003
Cdd:TIGR00606 661 -----------ATAVYSQFITQLTDENQSCCPVCQRVfQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEML 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1004 ELY--QENEMKLHRKLTVEENYRLEK-EEKLSKVDEKISHATEELETY---RKRAKDLEEELErTIHSYQGQIISHEKKA 1077
Cdd:TIGR00606 730 GLApgRQSIIDLKEKEIPELRNKLQKvNRDIQRLKNDIEEQETLLGTImpeEESAKVCLTDVT-IMERFQMELKDVERKI 808
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1370465039 1078 HD--NWLAARNAERNLNDLRKENAHNRQKLTETELKFELLEK 1117
Cdd:TIGR00606 809 AQqaAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRK 850
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
937-1109 |
4.32e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 41.49 E-value: 4.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 937 AKLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKVMTELYQ--------E 1008
Cdd:TIGR00618 194 GKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQlrarieelR 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1009 NEMKLHRKLTVEENYRLEKE------EKLSKVDEKISHATEEL-ETYRKRAKDLEeelERTIHSYQGQIISHEKKAHDNW 1081
Cdd:TIGR00618 274 AQEAVLEETQERINRARKAAplaahiKAVTQIEQQAQRIHTELqSKMRSRAKLLM---KRAAHVKQQSSIEEQRRLLQTL 350
|
170 180
....*....|....*....|....*....
gi 1370465039 1082 LAARNAERNLNDLRKE-NAHNRQKLTETE 1109
Cdd:TIGR00618 351 HSQEIHIRDAHEVATSiREISCQQHTLTQ 379
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
684-1118 |
4.65e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 41.59 E-value: 4.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 684 KKLALMLSGLIEEKSKLLEKFSLVQKEYEGYE-----VESSLKDASFEKEATEAQSLEFVEgsqiSEATCEKLNRSNSEL 758
Cdd:PRK03918 289 KEKAEEYIKLSEFYEEYLDELREIEKRLSRLEeeingIEERIKELEEKEERLEELKKKLKE----LEKRLEELEERHELY 364
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 759 ED-EILCLEKELKEEKSKHSEQDELMADI---SKRIQSLEDESKSLKSQVAEAKMTFKIFQMNEERLKIAIKDALNENSQ 834
Cdd:PRK03918 365 EEaKAKKEELERLKKRLTGLTPEKLEKELeelEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRE 444
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 835 L-QESQKQLLQEaevWKEQVSELNKQKVTFEDSKVHAEQVLNDKESHIKTlTERLLKMKDWAAMLgeditdddnLELEMN 913
Cdd:PRK03918 445 LtEEHRKELLEE---YTAELKRIEKELKEIEEKERKLRKELRELEKVLKK-ESELIKLKELAEQL---------KELEEK 511
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 914 SESENGAYLDNPPKGALKKLIHAAKLNASLKTLEGERNQiyiqLSEVDKTKEELTEHIKNLQTEQASL--QSENTHFENE 991
Cdd:PRK03918 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEK----LEELKKKLAELEKKLDELEEELAELlkELEELGFESV 587
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 992 nQKLQQKLKVMTELYQE-NEMKLHRKltveenyRLE-KEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYqgq 1069
Cdd:PRK03918 588 -EELEERLKELEPFYNEyLELKDAEK-------ELErEEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKY--- 656
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1370465039 1070 iiSHEKKahdnwlaaRNAERNLNDLRKENAHNRQKLTETELKFELLEKD 1118
Cdd:PRK03918 657 --SEEEY--------EELREEYLELSRELAGLRAELEELEKRREEIKKT 695
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
938-1123 |
5.22e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.04 E-value: 5.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 938 KLNASLKTLEGERNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENENQKLQQKLKvmtELYQENEmklhrKL 1017
Cdd:COG4372 56 QAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELE---ELQKERQ-----DL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1018 TVEENyrlEKEEKLSKVDEKISHATEELETYRKRAKDLEEELERTIHSYQGQIISHEKKAHDNWLAARNAERNLNDLRKE 1097
Cdd:COG4372 128 EQQRK---QLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAE 204
|
170 180
....*....|....*....|....*.
gi 1370465039 1098 NAHNRQKLTETELKFELLEKDPYALD 1123
Cdd:COG4372 205 AEKLIESLPRELAEELLEAKDSLEAK 230
|
|
| Golgin_A5 |
pfam09787 |
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ... |
932-1108 |
6.00e-03 |
|
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.
Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 40.51 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 932 KLIHAAKLNASLKTLEGErNQIYIQLSEVDKTKEELTEHIKNLQTEQASLQSENTHFENenqklQQKLKVMTelYQENEM 1011
Cdd:pfam09787 25 KLIASLKEGSGVEGLDSS-TALTLELEELRQERDLLREEIQKLRGQIQQLRTELQELEA-----QQQEEAES--SREQLQ 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 1012 KLHRKLTVEENYRLEKEEKLSKVDEKISHateeletyrkrakdLEEELERTIHSYQGQIISHEKKAHDnwLAARNAERNL 1091
Cdd:pfam09787 97 ELEEQLATERSARREAEAELERLQEELRY--------------LEEELRRSKATLQSRIKDREAEIEK--LRNQLTSKSQ 160
|
170
....*....|....*...
gi 1370465039 1092 NDLRKENAHNRQK-LTET 1108
Cdd:pfam09787 161 SSSSQSELENRLHqLTET 178
|
|
| Mitofilin |
pfam09731 |
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ... |
826-1062 |
8.05e-03 |
|
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.
Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 40.51 E-value: 8.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 826 KDALNENSQLQESQKQLLQEAEVWKEQVSELN---KQKVTFEDSKVHAEqVLNDKESHIKTLTERLLKMKDWAAMLGEDI 902
Cdd:pfam09731 121 KSEQEKEKALEEVLKEAISKAESATAVAKEAKddaIQAVKAHTDSLKEA-SDTAEISREKATDSALQKAEALAEKLKEVI 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 903 TdddnleLEMNSESENGAYLDNPPKGALKKLIHA--------------AKLNASLKTLEGERNQIYIQlsEVDKTKEELT 968
Cdd:pfam09731 200 N------LAKQSEEEAAPPLLDAAPETPPKLPEHldnveekvekaqslAKLVDQYKELVASERIVFQQ--ELVSIFPDII 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 969 EHIKNLQ-TEQASLQSENTHFENENQKLQQKL---KVMTELYQENEMK----LHRKLTVEENYRLE--KEEKLSKVDEKI 1038
Cdd:pfam09731 272 PVLKEDNlLSNDDLNSLIAHAHREIDQLSKKLaelKKREEKHIERALEkqkeELDKLAEELSARLEevRAADEAQLRLEF 351
|
250 260
....*....|....*....|....*
gi 1370465039 1039 SHATEEL-ETYRKRakdLEEELERT 1062
Cdd:pfam09731 352 EREREEIrESYEEK---LRTELERQ 373
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
949-1107 |
9.87e-03 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 40.22 E-value: 9.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370465039 949 ERNQIYIQLSEVDKTKEELTEHIKNLQTEQAslqsenthfENENQKLQQKLKVMTELyqenemklhrkLTVEENYRLEKE 1028
Cdd:pfam06160 231 EHLNVDKEIQQLEEQLEENLALLENLELDEA---------EEALEEIEERIDQLYDL-----------LEKEVDAKKYVE 290
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1370465039 1029 EKLSKVDEKISHATEELetyrkraKDLEEELERTIHSYqgqIISHEKKAHdnwlaARNAERNLNDLRKENAHNRQKLTE 1107
Cdd:pfam06160 291 KNLPEIEDYLEHAEEQN-------KELKEELERVQQSY---TLNENELER-----VRGLEKQLEELEKRYDEIVERLEE 354
|
|
|