|
Name |
Accession |
Description |
Interval |
E-value |
| LamNT |
smart00136 |
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related ... |
42-282 |
5.54e-119 |
|
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.
Pssm-ID: 214532 Cd Length: 238 Bit Score: 373.62 E-value: 5.54e-119
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 42 GGRPQRCMPEFVNAAFNVTVVATNTCGTP-PEEYCVQTGVTGVTKSCHLCDAGQLHLQHGAAFLTDYNNQADTTWWQSQT 120
Cdd:smart00136 1 AGRPRSCYPPFVNLAFGREVTATSTCGEPgPERYCKLVGHTEQGKKCDYCDARNPRRSHPAENLTDGNNPNNPTWWQSEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 121 MLAGVQYpnsINLTLHLGKAFDITYVRLKFHTSRPeSFAIYKRTREDGPWIPYQYYSGSCENTYSKANRGFIRTGgDEQQ 200
Cdd:smart00136 81 LSNGPQN---VNLTLDLGKEFHVTYVILKFCSPRP-SLWILERSDFGKTWQPWQYFSSDCRRTFGRPPRGPITKG-NEDE 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 201 ALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQEWVTATDIRVTLNRLNTFGDEVFND-PKVLKSYYYAISDFA 279
Cdd:smart00136 156 VICTSEYSDIVPLEGGEIAFSLLEGRPSATDFDNSPVLQEWVTATNIRVRLTRLRTLGDELMDDrPEVTRRYYYAISDIA 235
|
...
gi 281371490 280 VGG 282
Cdd:smart00136 236 VGG 238
|
|
| Laminin_N |
pfam00055 |
Laminin N-terminal (Domain VI); |
48-282 |
1.13e-113 |
|
Laminin N-terminal (Domain VI);
Pssm-ID: 459653 Cd Length: 230 Bit Score: 358.43 E-value: 1.13e-113
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 48 CMPEFVNAAFNVTVVATNTCGT-PPEEYCVQTGVTGVtKSCHLCDAGQLHLQHGAAFLTDYNNQADTTWWQSQTMLagVQ 126
Cdd:pfam00055 1 CYPAFGNLAFGREVSATSTCGLnGPERYCILSGLEGG-KKCFICDSRDPHNSHPPSNLTDSNNGTNETWWQSETGV--IQ 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 127 YPNsINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGSCENTYSKANRGfiRTGGDEQQALCTDE 206
Cdd:pfam00055 78 YEN-VNLTLDLGKEFHFTYLILKFKSPRPAAMVLERSTDFGKTWQPYQYFASDCRRTFGRPSGP--SRGIKDDEVICTSE 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 207 FSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQEWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGG 282
Cdd:pfam00055 155 YSDISPLTGGEVIFSTLEGRPSANIFDYSPELQDWLTATNIRIRLLRLHTLGDELLDDPSVLRKYYYAISDISVGG 230
|
|
| LamB |
smart00281 |
Laminin B domain; |
552-676 |
4.86e-41 |
|
Laminin B domain;
Pssm-ID: 214597 Cd Length: 127 Bit Score: 147.41 E-value: 4.86e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 552 YFPRYFVAPAKFLGNQVLSYGQNLSFSFRVD--RRDTRLSAEDLVLEGAGLRVSVPliAQGNSYPSE-TTVKYIFRLHEA 628
Cdd:smart00281 2 NEPVYWVAPEQFLGDKVTSYGGKLRYTLSFDgrRGGTHVSAPDVILEGNGLRISHP--AEGPPLPDElTTVEVRFREENW 79
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 281371490 629 TDYPWRPALSPfEFQKLLNNLTSIKIRGTYSERSAG-YLDDVTLQSARP 676
Cdd:smart00281 80 QYYGGRPVTRE-DLMMVLANLTAILIRATYSQQMAGsRLSDVSLEVAVP 127
|
|
| Laminin_B |
pfam00052 |
Laminin B (Domain IV); |
556-686 |
6.55e-41 |
|
Laminin B (Domain IV);
Pssm-ID: 459652 Cd Length: 136 Bit Score: 147.42 E-value: 6.55e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 556 YFVAPAKFLGNQVLSYGQNLSFSFRVDRRD---TRLSAEDLVLEGAGLRVSVPLIAQGNSYPSETTvKYIFRLHEATdyp 632
Cdd:pfam00052 1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPgggSLNSEPDVILEGNGLRLSYSSPDQPPPDPGQEQ-TYSVRLHEEN--- 76
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 633 WR----PALSPFEFQKLLNNLTSIKIRGTYSERSAG-YLDDVTLQSARPGP-GVPATWVE 686
Cdd:pfam00052 77 WRdsdgAPVSREDFMMVLANLTAILIRATYSTGSGQvSLSNVSLDSAVPGGsGPPASWVE 136
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1034-1596 |
9.39e-18 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 90.12 E-value: 9.39e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKLQELESLIANLGTGDETVTDQA--FEDRLKEAEREVTDLLREAQEV-------KDVDQNLMDRLQRV 1104
Cdd:TIGR02168 429 KLEEAELKELQAELEELEEELEELQEELERLEEALeeLREELEEAEQALDAAERELAQLqarldslERLQENLEGFSEGV 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1105 NNSLHSQiSRLHNIrntieeTGILAERARSRvESTEQLIEIASREL------EKAKMAAANVSITQPESTG--------- 1169
Cdd:TIGR02168 509 KALLKNQ-SGLSGI------LGVLSELISVD-EGYEAAIEAALGGRlqavvvENLNAAKKAIAFLKQNELGrvtflplds 580
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1170 ------EPNNMTLLAEE--ARRLAERHKQEADDI----------VRVAK---TANETSAE-AYNLLLRTLAGE------- 1220
Cdd:TIGR02168 581 ikgteiQGNDREILKNIegFLGVAKDLVKFDPKLrkalsyllggVLVVDdldNALELAKKlRPGYRIVTLDGDlvrpggv 660
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1221 --------NQTAL----EIEELNRKYEQAKNISQDLEKQAARV---HEEAKRAGDKAVEIYASVAQLTPVDSEALESEAN 1285
Cdd:TIGR02168 661 itggsaktNSSILerrrEIEELEEKIEELEEKIAELEKALAELrkeLEELEEELEQLRKELEELSRQISALRKDLARLEA 740
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1286 KIKKEAADLDRLiDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAeeaakkgRNTLQEANDIL 1365
Cdd:TIGR02168 741 EVEQLEERIAQL-SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL-------REALDELRAEL 812
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1366 NNLKdfdRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAA---DATEAKNKAHEaERIASGVQKNATSTK 1442
Cdd:TIGR02168 813 TLLN---EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAeieELEELIEELES-ELEALLNERASLEEA 888
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1443 ADAERtfSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQ--------DMMMAGMASQAAQEAELNARKAKNSVSSLLS- 1513
Cdd:TIGR02168 889 LALLR--SELEELSEELRELESKRSELRRELEELREKLAQlelrleglEVRIDNLQERLSEEYSLTLEEAEALENKIEDd 966
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1514 ------QLTDLLEQLGQLDTVDLNKLNEIEgSLNKAKDEMK--VSDLdrkvsnleNEARKQ-EAAILDYDRDIAEIIKDI 1584
Cdd:TIGR02168 967 eeearrRLKRLENKIKELGPVNLAAIEEYE-ELKERYDFLTaqKEDL--------TEAKETlEEAIEEIDREARERFKDT 1037
|
650
....*....|..
gi 281371490 1585 hnLEDIKKTLPT 1596
Cdd:TIGR02168 1038 --FDQVNENFQR 1047
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1070-1594 |
1.68e-16 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 85.50 E-value: 1.68e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNnslhSQISRLHNIRNTIEETGILAERARSRVESTEQLI-EIASR 1148
Cdd:PRK03918 192 EELIKEKEKELEEVLREINEISSELPELREELEKLE----KEVKELEELKEEIEELEKELESLEGSKRKLEEKIrELEER 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1149 ------ELEKAKmaaanvsitqpESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQ 1222
Cdd:PRK03918 268 ieelkkEIEELE-----------EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1223 TAlEIEELNRKYEQAKNISQDLEKQaARVHEEAKRAGDKAVEIYASVAQLTPVDSEA-----------LESEANKIKKEA 1291
Cdd:PRK03918 337 EE-RLEELKKKLKELEKRLEELEER-HELYEEAKAKKEELERLKKRLTGLTPEKLEKeleelekakeeIEEEISKITARI 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1292 ADLDRLIDQKLKDYEDLREdMRGK---------EHEVKNLLEKGKAEQQTAdqllaradaakalaeeaakkgRNTLQEAN 1362
Cdd:PRK03918 415 GELKKEIKELKKAIEELKK-AKGKcpvcgreltEEHRKELLEEYTAELKRI---------------------EKELKEIE 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1363 DILNNLKDFDRRVnDNKTAAEEALRRIPAINRTIAEANEKTREaqlalgnaaADATEAKNKAHEAERI---ASGVQKNAT 1439
Cdd:PRK03918 473 EKERKLRKELREL-EKVLKKESELIKLKELAEQLKELEEKLKK---------YNLEELEKKAEEYEKLkekLIKLKGEIK 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1440 STKADAERTfsevtdldNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASqaaqEAELNAR------------KAKNS 1507
Cdd:PRK03918 543 SLKKELEKL--------EELKKKLAELEKKLDELEEELAELLKELEELGFES----VEELEERlkelepfyneylELKDA 610
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1508 VSSLLSQL-------TDLLEQLGQLDTVDlNKLNEIEGSLNKAK---DEMKVSDLDRKVSNLENEARKQEAAILDYDRDI 1577
Cdd:PRK03918 611 EKELEREEkelkkleEELDKAFEELAETE-KRLEELRKELEELEkkySEEEYEELREEYLELSRELAGLRAELEELEKRR 689
|
570
....*....|....*..
gi 281371490 1578 AEIIKDIHNLEDIKKTL 1594
Cdd:PRK03918 690 EEIKKTLEKLKEELEER 706
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
933-978 |
3.34e-14 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 68.11 E-value: 3.34e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 933 CDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGFGPEGC 978
Cdd:smart00180 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1038-1592 |
4.48e-14 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 77.45 E-value: 4.48e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1038 DKVAEHRVKLQELESLIANLGTGDETVTdQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRL-- 1115
Cdd:pfam05483 169 EKTKKYEYEREETRQVYMDLNNNIEKMI-LAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLli 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1116 --HNIRNTIEETGILAERARSRV-----------ESTEQLIEIA---SRELEKAKMAAANVSITQPESTGEPNNMT---- 1175
Cdd:pfam05483 248 qiTEKENKMKDLTFLLEESRDKAnqleektklqdENLKELIEKKdhlTKELEDIKMSLQRSMSTQKALEEDLQIATktic 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1176 -LLAEEARRLAERHKQEADDIVRVakTANETSAEAYNLLLRTlagenqtalEIEELNRKYEQAKNISQDLEKQAARVHEE 1254
Cdd:pfam05483 328 qLTEEKEAQMEELNKAKAAHSFVV--TEFEATTCSLEELLRT---------EQQRLEKNEDQLKIITMELQKKSSELEEM 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1255 AKRAGDKAVEIyasvaqltpvdsealeseaNKIKKEAADLDRLIDQKlKDYEDLREDMRGKEHEVKNLLEKGKAEqqtad 1334
Cdd:pfam05483 397 TKFKNNKEVEL-------------------EELKKILAEDEKLLDEK-KQFEKIAEELKGKEQELIFLLQAREKE----- 451
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1335 qllaradaakalaeeaakkgrntlqeandilnnLKDFDRRVNDNKTAAEEALRRIPAINRTIAeaNEKTREAQLAlGNAA 1414
Cdd:pfam05483 452 ---------------------------------IHDLEIQLTAIKTSEEHYLKEVEDLKTELE--KEKLKNIELT-AHCD 495
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1415 ADATEAKNKAHEAERIASGV---QKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDA----DQDMMMA 1487
Cdd:pfam05483 496 KLLLENKELTQEASDMTLELkkhQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVkcklDKSEENA 575
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1488 GMASQAAQEAELNARKAKNSVSSLLSQLTD---LLEQLGQLDTVDLNKLNEIEGSLNKAkdEMKVSDLDRKvsnLENEAR 1564
Cdd:pfam05483 576 RSIEYEVLKKEKQMKILENKCNNLKKQIENknkNIEELHQENKALKKKGSAENKQLNAY--EIKVNKLELE---LASAKQ 650
|
570 580 590
....*....|....*....|....*....|.
gi 281371490 1565 KQEAAILDYDRDIAeiIKDIHN---LEDIKK 1592
Cdd:pfam05483 651 KFEEIIDNYQKEIE--DKKISEeklLEEVEK 679
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
443-490 |
7.47e-14 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 66.99 E-value: 7.47e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 443 CSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNLESSNPKGC 490
Cdd:pfam00053 1 CDCNPHGSlSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
932-979 |
2.64e-13 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 65.45 E-value: 2.64e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 932 SCDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGF--GPEGCK 979
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLpsQGGGCQ 50
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
933-981 |
5.98e-13 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 64.68 E-value: 5.98e-13
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 933 CDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGFGPEGCKPC 981
Cdd:pfam00053 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
396-443 |
4.17e-12 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 62.37 E-value: 4.17e-12
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 396 CHCSPVGSLSTQCDSY-GRCSCKPGVMGDKCDRCQPGFHSLTEAGCRPC 443
Cdd:pfam00053 1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
722-761 |
6.48e-12 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 61.60 E-value: 6.48e-12
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 281371490 722 CTCNGH---SETCDPETGVCDCRDNTAGPHCEKCSDGYYGDST 761
Cdd:pfam00053 1 CDCNPHgslSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1033-1592 |
8.82e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 70.35 E-value: 8.82e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKD--KVAEHRVKLQELESLIANLgtgdetvtdQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHS 1110
Cdd:COG1196 215 YRELKEelKELEAELLLLKLRELEAEL---------EELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1111 QISRLHNIRNTIEEtgilAERARSRVEstEQLIEIASRELEkakmaaanvsitqpestgepnnmtlLAEEARRLAERHKQ 1190
Cdd:COG1196 286 AQAEEYELLAELAR----LEQDIARLE--ERRRELEERLEE-------------------------LEEELAELEEELEE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1191 EADDIVRVAKTANETSAEaynlllrtlagENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVA 1270
Cdd:COG1196 335 LEEELEELEEELEEAEEE-----------LEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLE 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1271 QLTPVDS------EALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAK 1344
Cdd:COG1196 404 ELEEAEEallerlERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELL 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1345 ALAEEAAKKgRNTLQEANDILNNLKDFDRRVNDNK--------------------TAAEEAL-RRIPAINRTIAEANEKT 1403
Cdd:COG1196 484 EELAEAAAR-LLLLLEAEADYEGFLEGVKAALLLAglrglagavavligveaayeAALEAALaAALQNIVVEDDEVAAAA 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1404 RE--AQLALGNA------AADATEAKNKAHEAERIASGVQKNATSTKADAERTF-----SEVTDLDNEVNGMLRQLEEAE 1470
Cdd:COG1196 563 IEylKAAKAGRAtflpldKIRARAALAAALARGAIGAAVDLVASDLREADARYYvlgdtLLGRTLVAARLEAALRRAVTL 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1471 NELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEMKVS 1550
Cdd:COG1196 643 AGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELE 722
|
570 580 590 600
....*....|....*....|....*....|....*....|..
gi 281371490 1551 DLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKK 1592
Cdd:COG1196 723 EEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEE 764
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
722-769 |
9.07e-12 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 61.22 E-value: 9.07e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 722 CTCNGH---SETCDPETGVCDCRDNTAGPHCEKCSDGYYGDSTLGTssDCQ 769
Cdd:cd00055 2 CDCNGHgslSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG--GCQ 50
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
442-491 |
1.14e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 60.83 E-value: 1.14e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 442 PCSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNLeSSNPKGCT 491
Cdd:cd00055 1 PCDCNGHGSlSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
395-441 |
1.70e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 60.45 E-value: 1.70e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 395 PCHCSPVGSLSTQCDSY-GRCSCKPGVMGDKCDRCQPGFHSLTEA--GCR 441
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGtGQCECKPNTTGRRCDRCAPGYYGLPSQggGCQ 50
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
882-930 |
2.07e-11 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 60.06 E-value: 2.07e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 882 CACNPYGTVQqqSSCNPVTGQCQCLPHVSGRDCGACDPGFYNLQ--SGQGC 930
Cdd:pfam00053 1 CDCNPHGSLS--DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPsdPPQGC 49
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
396-440 |
2.88e-11 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 59.63 E-value: 2.88e-11
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 396 CHCSPVGSLSTQCDS-YGRCSCKPGVMGDKCDRCQPGFHSLTEAGC 440
Cdd:smart00180 1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
443-490 |
4.05e-11 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 59.25 E-value: 4.05e-11
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 443 CSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNlesSNPKGC 490
Cdd:smart00180 1 CDCDPGGSaSGTCDPDTGQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
881-931 |
7.21e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 58.90 E-value: 7.21e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 281371490 881 ACACNPYGTVQQQssCNPVTGQCQCLPHVSGRDCGACDPGFYNLQS-GQGCE 931
Cdd:cd00055 1 PCDCNGHGSLSGQ--CDPGTGQCECKPNTTGRRCDRCAPGYYGLPSqGGGCQ 50
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
722-763 |
2.86e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 56.94 E-value: 2.86e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 281371490 722 CTCNG---HSETCDPETGVCDCRDNTAGPHCEKCSDGYYGDSTLG 763
Cdd:smart00180 1 CDCDPggsASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPG 45
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
980-1027 |
3.51e-10 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 56.59 E-value: 3.51e-10
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 980 PCDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYFYNRSWP-GCQ 1027
Cdd:cd00055 1 PCDCNGHGSLSGQCdPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGgGCQ 50
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
882-930 |
3.55e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 56.55 E-value: 3.55e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 882 CACNPYGTVQqqSSCNPVTGQCQCLPHVSGRDCGACDPGFYNlQSGQGC 930
Cdd:smart00180 1 CDCDPGGSAS--GTCDPDTGQCECKPNVTGRRCDRCAPGYYG-DGPPGC 46
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
981-1026 |
9.92e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 55.40 E-value: 9.92e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 281371490 981 CDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYfYNRSWPGC 1026
Cdd:smart00180 1 CDCDPGGSASGTCdPDTGQCECKPNVTGRRCDRCAPGY-YGDGPPGC 46
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1129-1504 |
3.32e-09 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 62.15 E-value: 3.32e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1129 AERARSRVES--------TEQLIEIASRELEKAKMAAANVSITqpeSTGEPNNMTLLAEEARRLAERHKQEADDIVRVAK 1200
Cdd:NF041483 202 AESARAEAEAilrrarkdAERLLNAASTQAQEATDHAEQLRSS---TAAESDQARRQAAELSRAAEQRMQEAEEALREAR 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1201 T-ANETSAEAYNLLLRTLAG-----ENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAK--------RAGDKAVEIY 1266
Cdd:NF041483 279 AeAEKVVAEAKEAAAKQLASaesanEQRTRTAKEEIARLVGEATKEAEALKAEAEQALADARaeaeklvaEAAEKARTVA 358
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1267 A--SVAQLTPVDSEALE-----------------SEANKIKKEA-ADLDRLIDQKlkdyEDLREDMRG------KEHEVK 1320
Cdd:NF041483 359 AedTAAQLAKAARTAEEvltkasedakattraaaEEAERIRREAeAEADRLRGEA----ADQAEQLKGaakddtKEYRAK 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1321 N--LLE-----KGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLK-DFDRRVNDNKTAAE----EALRR 1388
Cdd:NF041483 435 TveLQEearrlRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKaDADELRSTATAESErvrtEAIER 514
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1389 IPAINRTIAEANEKTREaqlalgnaaadateaknkahEAERIASGVQKNATSTKADAERTFSEVTDlDNEVNGMLRQLEE 1468
Cdd:NF041483 515 ATTLRRQAEETLERTRA--------------------EAERLRAEAEEQAEEVRAAAERAARELRE-ETERAIAARQAEA 573
|
410 420 430
....*....|....*....|....*....|....*..
gi 281371490 1469 AEnELKRKQDDADQDMMMAGMA-SQAAQEAELNARKA 1504
Cdd:NF041483 574 AE-ELTRLHTEAEERLTAAEEAlADARAEAERIRREA 609
|
|
| MA |
smart00283 |
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ... |
1089-1300 |
4.77e-09 |
|
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.
Pssm-ID: 214599 [Multi-domain] Cd Length: 262 Bit Score: 59.22 E-value: 4.77e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1089 EVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPEST 1168
Cdd:smart00283 1 DVSEAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEEGREAVEDAITAMDQIREVVEEAV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1169 G-----------------------EPNNMTLL-------------------AEEARRLAERHKQEADDIVRVAKTANETS 1206
Cdd:smart00283 81 SaveeleessdeigeivsviddiaDQTNLLALnaaieaarageagrgfavvADEVRKLAERSAESAKEIESLIKEIQEET 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1207 AEAYNLLLRTLAGENQTALEIEELNRKYE----QAKNISQDLEKQAARVHEEAKRAGD--KAVEIYASVAQLTPVDSEAL 1280
Cdd:smart00283 161 NEAVAAMEESSSEVEEGVELVEETGDALEeivdSVEEIADLVQEIAAATDEQAAGSEEvnAAIDEIAQVTQETAAMSEEI 240
|
250 260
....*....|....*....|
gi 281371490 1281 ESEANKIKKEAADLDRLIDQ 1300
Cdd:smart00283 241 SAAAEELSGLAEELDELVER 260
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1065-1504 |
6.41e-09 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 61.00 E-value: 6.41e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1065 TDQAFEDRLKEAEREVTDLLREAQEvkdvdqnlmdRLQRVNNSL---HSQISRLHniRNTIEETGIL----AERARSRVE 1137
Cdd:NF041483 562 TERAIAARQAEAAEELTRLHTEAEE----------RLTAAEEALadaRAEAERIR--REAAEETERLrteaAERIRTLQA 629
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1138 STEQLIEIASRElekakmAAANVSITQPEstGEpNNMTLLAEEARRLAERHK---QEADDIVRV-AKTANE----TSAEA 1209
Cdd:NF041483 630 QAEQEAERLRTE------AAADASAARAE--GE-NVAVRLRSEAAAEAERLKseaQESADRVRAeAAAAAErvgtEAAEA 700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1210 YNLLLRTLAGENQTALEI-----EELNRKYEQAKNISQDL-----------EKQAARVHEEA-KRAGD-------KAVEI 1265
Cdd:NF041483 701 LAAAQEEAARRRREAEETlgsarAEADQERERAREQSEELlasarkrveeaQAEAQRLVEEAdRRATElvsaaeqTAQQV 780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1266 YASVAQLTPVDSE-------ALESEANKIKKEAAD-LDRL-------IDQKLKDYEDLREDMRGKEHEVKNLLEKGKAE- 1329
Cdd:NF041483 781 RDSVAGLQEQAEEeiaglrsAAEHAAERTRTEAQEeADRVrsdayaeRERASEDANRLRREAQEETEAAKALAERTVSEa 860
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1330 --------QQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKD-----FDRRVNDNKTAAEealrRIPAINRTI 1396
Cdd:NF041483 861 iaeaerlrSDASEYAQRVRTEASDTLASAEQDAARTRADAREDANRIRSdaaaqADRLIGEATSEAE----RLTAEARAE 936
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1397 AE--ANEKTREAQLALGNAAADATEAKNKAH-EAERIASGVQKNATSTKADAERTFSE----VTDLDNEVNGML------ 1463
Cdd:NF041483 937 AErlRDEARAEAERVRADAAAQAEQLIAEATgEAERLRAEAAETVGSAQQHAERIRTEaervKAEAAAEAERLRtearee 1016
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 281371490 1464 --RQLEEAENEL-KRKQDDADQ-DMMMAGMASQAAQ---EAELNARKA 1504
Cdd:NF041483 1017 adRTLDEARKDAnKRRSEAAEQaDTLITEAAAEADQltaKAQEEALRT 1064
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
826-873 |
8.67e-09 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 52.74 E-value: 8.67e-09
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 281371490 826 CQCNDNIDPNavGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNPLAP 873
Cdd:pfam00053 1 CDCNPHGSLS--DTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSDP 45
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
981-1027 |
1.77e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 51.97 E-value: 1.77e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 281371490 981 CDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYFYNRSWPGCQ 1027
Cdd:pfam00053 1 CDCNPHGSLSDTCdPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQG 48
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1070-1503 |
3.22e-08 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 58.68 E-value: 3.22e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQE--VKDVDQNLMDRLQRVNNSlHSQISRLHNiRNTIEETGILAE----RARSRVEStEQLI 1143
Cdd:NF041483 271 EEALREARAEAEKVVAEAKEaaAKQLASAESANEQRTRTA-KEEIARLVG-EATKEAEALKAEaeqaLADARAEA-EKLV 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1144 EIASrelEKAKMAAANVSITQ---PESTGEpNNMTLLAEEARRLAERHKQEADDIVRVAKT-ANETSAEAYNLL--LRTL 1217
Cdd:NF041483 348 AEAA---EKARTVAAEDTAAQlakAARTAE-EVLTKASEDAKATTRAAAEEAERIRREAEAeADRLRGEAADQAeqLKGA 423
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1218 AGEN------QTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQ--LTPVDSEALE------SE 1283
Cdd:NF041483 424 AKDDtkeyraKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARTAEelLTKAKADADElrstatAE 503
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1284 ANKIKKEAadLDRLIDQKlKDYEDLREDMRGKEHEVKNllekgKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEAND 1363
Cdd:NF041483 504 SERVRTEA--IERATTLR-RQAEETLERTRAEAERLRA-----EAEEQAEEVRAAAERAARELREETERAIAARQAEAAE 575
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1364 ILNNLK-DFDRRVndnkTAAEEAL----------RRIPA--INRTIAEANEKTR--------EAQLALGNAAADATEAKN 1422
Cdd:NF041483 576 ELTRLHtEAEERL----TAAEEALadaraeaeriRREAAeeTERLRTEAAERIRtlqaqaeqEAERLRTEAAADASAARA 651
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1423 K------------AHEAERIASGVQKNATSTKAD----AERTFSEVTD-LDNEVNGMLRQLEEAENELKRKQDDADQDMm 1485
Cdd:NF041483 652 EgenvavrlrseaAAEAERLKSEAQESADRVRAEaaaaAERVGTEAAEaLAAAQEEAARRRREAEETLGSARAEADQER- 730
|
490
....*....|....*...
gi 281371490 1486 maGMASQAAQEAELNARK 1503
Cdd:NF041483 731 --ERAREQSEELLASARK 746
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
825-874 |
5.71e-08 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 50.43 E-value: 5.71e-08
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 825 PCQCNDNIDPNavGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNPLAPN 874
Cdd:cd00055 1 PCDCNGHGSLS--GQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGG 47
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1067-1572 |
7.05e-08 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 57.53 E-value: 7.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1067 QAFEDRL-KEAEREVTDLL----REAQEVKDVDQNLMDRLQRvnnslhsQISRLhniRNTIEETgilAERARSrvESTEQ 1141
Cdd:NF041483 752 QAEAQRLvEEADRRATELVsaaeQTAQQVRDSVAGLQEQAEE-------EIAGL---RSAAEHA---AERTRT--EAQEE 816
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1142 LIEIASRELEKAKMAAANVSITQPESTGEpnnmtllAEEARRLAERHKQEA-DDIVRVAKTANETSAEAYNLLLRTLAGE 1220
Cdd:NF041483 817 ADRVRSDAYAERERASEDANRLRREAQEE-------TEAAKALAERTVSEAiAEAERLRSDASEYAQRVRTEASDTLASA 889
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1221 NQTALEIEELNRkyEQAKNISQDLEKQAARVHEEAKRAGDKAVEiyASVAQLTPVDSEALESEANKIKKEAADLDRLIDQ 1300
Cdd:NF041483 890 EQDAARTRADAR--EDANRIRSDAAAQADRLIGEATSEAERLTA--EARAEAERLRDEARAEAERVRADAAAQAEQLIAE 965
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1301 KLKDYEDLR--------------EDMRGKEHEVKNLLEkGKAEQQTADqllaradaakalaeeAAKKGRNTLQEANdiln 1366
Cdd:NF041483 966 ATGEAERLRaeaaetvgsaqqhaERIRTEAERVKAEAA-AEAERLRTE---------------AREEADRTLDEAR---- 1025
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1367 nlKDFDRRVNDnktAAEEALRripAINRTIAEANEKTREAQLALGNAAADATEAKNKA-----HEAERIAS--------- 1432
Cdd:NF041483 1026 --KDANKRRSE---AAEQADT---LITEAAAEADQLTAKAQEEALRTTTEAEAQADTMvgaarKEAERIVAeatvegnsl 1097
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1433 -------------GVQKNATSTKADAE----RTFSEVTDLDNevngmlRQLEEAENELKRKQDDADQdMMMAGMASQAAQ 1495
Cdd:NF041483 1098 vekartdadellvGARRDATAIRERAEelrdRITGEIEELHE------RARRESAEQMKSAGERCDA-LVKAAEEQLAEA 1170
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 281371490 1496 EAelnarKAKNSVSSLLSQLTdlleqlgqldTVDLNKLNEIEGSLNKAkdEMKVSDLDRKVSNLENEARKQEAAILD 1572
Cdd:NF041483 1171 EA-----KAKELVSDANSEAS----------KVRIAAVKKAEGLLKEA--EQKKAELVREAEKIKAEAEAEAKRTVE 1230
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1049-1514 |
6.33e-07 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 54.45 E-value: 6.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1049 ELESLIANLGTGDETVTDQAFEDRLKEAEREVTDLLREAQE-VKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEE--- 1124
Cdd:NF041483 342 EAEKLVAEAAEKARTVAAEDTAAQLAKAARTAEEVLTKASEdAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEqlk 421
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1125 -------------TGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTgepnnmtllAEEARRLAERHKQE 1191
Cdd:NF041483 422 gaakddtkeyrakTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEA---------ARTAEELLTKAKAD 492
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETsaeaynllLRTLAGENQTAL--EIEE-LNRKYEQAKNISQDLEKQAARVHEEAKRAG-------DK 1261
Cdd:NF041483 493 ADELRSTATAESER--------VRTEAIERATTLrrQAEEtLERTRAEAERLRAEAEEQAEEVRAAAERAArelreetER 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1262 AVEIYASVA--QLTPVDSEALE-------------SEANKIKKEAAD-LDRLidqKLKDYEDLREDMRGKEHEVKNLLEK 1325
Cdd:NF041483 565 AIAARQAEAaeELTRLHTEAEErltaaeealadarAEAERIRREAAEeTERL---RTEAAERIRTLQAQAEQEAERLRTE 641
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1326 GKAEQQTADqllaradaakalaeeaakkgrntlQEANDILNNLKDfdrrvndnkTAAEEALRRipainRTIA-EANEKTR 1404
Cdd:NF041483 642 AAADASAAR------------------------AEGENVAVRLRS---------EAAAEAERL-----KSEAqESADRVR 683
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1405 -EAQLALGNAAADATEAKNKAH-EAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLR-QLEEAENELKRKQDDAD 1481
Cdd:NF041483 684 aEAAAAAERVGTEAAEALAAAQeEAARRRREAEETLGSARAEADQERERAREQSEELLASARkRVEEAQAEAQRLVEEAD 763
|
490 500 510
....*....|....*....|....*....|...
gi 281371490 1482 QdmmmagMASQAAQEAELNARKAKNSVSSLLSQ 1514
Cdd:NF041483 764 R------RATELVSAAEQTAQQVRDSVAGLQEQ 790
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
1192-1333 |
8.71e-07 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 52.76 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETSAEAYNLLLRtLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKaveiYASVAQ 1271
Cdd:cd22656 75 AGDIYNYAQNAGGTIDSYYAEILE-LIDDLADATDDEELEEAKKTIKALLDDLLKEAKKYQDKAAKVVDK----LTDFEN 149
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1272 LTPVDSEALESEANKIKK---------EAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTA 1333
Cdd:cd22656 150 QTEKDQTALETLEKALKDlltdeggaiARKEIKDLQKELEKLNEEYAAKLKAKIDELKALIADDEAKLAAA 220
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
283-328 |
1.20e-06 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 46.96 E-value: 1.20e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 283 RCKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRPWRR 328
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
826-870 |
7.02e-05 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 41.53 E-value: 7.02e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 281371490 826 CQCNDniDPNAVGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNP 870
Cdd:smart00180 1 CDCDP--GGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDG 42
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1071-1565 |
7.84e-05 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 47.51 E-value: 7.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1071 DRLK-EAEREvtdlLREAQ-EVKDVDQNLMDRLQRVNNSLHSQ-ISR-------LHNIRNTIE----ETGILAERARSRV 1136
Cdd:NF041483 86 DQLRaDAERE----LRDARaQTQRILQEHAEHQARLQAELHTEaVQRrqqldqeLAERRQTVEshvnENVAWAEQLRART 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1137 ES-TEQLIEIASRELEKAkMAAANVsitqpestgepnnmtllaeEARRLAERHKQeaddivRVAKTANETSAEAYNLLLR 1215
Cdd:NF041483 162 ESqARRLLDESRAEAEQA-LAAARA-------------------EAERLAEEARQ------RLGSEAESARAEAEAILRR 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1216 TLAGENQTaleieeLNRKYEQAknisqdlekQAARVHEEAKRAgdkaveiyasvaqltpvdSEALESEAnkIKKEAADLD 1295
Cdd:NF041483 216 ARKDAERL------LNAASTQA---------QEATDHAEQLRS------------------STAAESDQ--ARRQAAELS 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1296 RLIDQKLKDYEdlrEDMRGKEHEVKNLLEKGK--AEQQTAdqllaradaakalaeeaakkgrnTLQEANDilnnlkdfdr 1373
Cdd:NF041483 261 RAAEQRMQEAE---EALREARAEAEKVVAEAKeaAAKQLA-----------------------SAESANE---------- 304
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1374 rvNDNKTAAEEalrripaINRTIAEAN---EKTR-EAQLALGNAAADATEAKNKAHEAERIASGvqKNATSTKADAERTf 1449
Cdd:NF041483 305 --QRTRTAKEE-------IARLVGEATkeaEALKaEAEQALADARAEAEKLVAEAAEKARTVAA--EDTAAQLAKAART- 372
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 sevtdldnevngmlrqleeAENELKRKQDDADQdmmmagMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLgqldtvd 1529
Cdd:NF041483 373 -------------------AEEVLTKASEDAKA------TTRAAAEEAERIRREAEAEADRLRGEAADQAEQL------- 420
|
490 500 510
....*....|....*....|....*....|....*.
gi 281371490 1530 lnklneiEGSlnkAKDEMKvsDLDRKVSNLENEARK 1565
Cdd:NF041483 421 -------KGA---AKDDTK--EYRAKTVELQEEARR 444
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
340-394 |
1.31e-04 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 41.18 E-value: 1.31e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 340 CDCNGR---SQECYFdpelyrstgHGGHCTnCRDNTDGAKCERCRENFFRLGNTEACS 394
Cdd:pfam00053 1 CDCNPHgslSDTCDP---------ETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQG 48
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
770-818 |
1.53e-04 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 40.80 E-value: 1.53e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 770 PCPCPGGSSCAIVPKTKEVVCtHCPTGTAGKRCELCDDGYFGDPLGSNG 818
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQC-ECKPNTTGRRCDRCAPGYYGLPSQGGG 48
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
284-325 |
3.27e-04 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 40.03 E-value: 3.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 281371490 284 CKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRP 325
Cdd:pfam00053 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLP 42
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
339-387 |
4.29e-04 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 39.64 E-value: 4.29e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 281371490 339 PCDCNGR---SQECYFdpelyrstgHGGHCTnCRDNTDGAKCERCRENFFRL 387
Cdd:cd00055 1 PCDCNGHgslSGQCDP---------GTGQCE-CKPNTTGRRCDRCAPGYYGL 42
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
284-326 |
8.91e-03 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 35.75 E-value: 8.91e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 281371490 284 CKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRPW 326
Cdd:smart00180 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGP 43
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| LamNT |
smart00136 |
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related ... |
42-282 |
5.54e-119 |
|
Laminin N-terminal domain (domain VI); N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins.
Pssm-ID: 214532 Cd Length: 238 Bit Score: 373.62 E-value: 5.54e-119
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 42 GGRPQRCMPEFVNAAFNVTVVATNTCGTP-PEEYCVQTGVTGVTKSCHLCDAGQLHLQHGAAFLTDYNNQADTTWWQSQT 120
Cdd:smart00136 1 AGRPRSCYPPFVNLAFGREVTATSTCGEPgPERYCKLVGHTEQGKKCDYCDARNPRRSHPAENLTDGNNPNNPTWWQSEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 121 MLAGVQYpnsINLTLHLGKAFDITYVRLKFHTSRPeSFAIYKRTREDGPWIPYQYYSGSCENTYSKANRGFIRTGgDEQQ 200
Cdd:smart00136 81 LSNGPQN---VNLTLDLGKEFHVTYVILKFCSPRP-SLWILERSDFGKTWQPWQYFSSDCRRTFGRPPRGPITKG-NEDE 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 201 ALCTDEFSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQEWVTATDIRVTLNRLNTFGDEVFND-PKVLKSYYYAISDFA 279
Cdd:smart00136 156 VICTSEYSDIVPLEGGEIAFSLLEGRPSATDFDNSPVLQEWVTATNIRVRLTRLRTLGDELMDDrPEVTRRYYYAISDIA 235
|
...
gi 281371490 280 VGG 282
Cdd:smart00136 236 VGG 238
|
|
| Laminin_N |
pfam00055 |
Laminin N-terminal (Domain VI); |
48-282 |
1.13e-113 |
|
Laminin N-terminal (Domain VI);
Pssm-ID: 459653 Cd Length: 230 Bit Score: 358.43 E-value: 1.13e-113
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 48 CMPEFVNAAFNVTVVATNTCGT-PPEEYCVQTGVTGVtKSCHLCDAGQLHLQHGAAFLTDYNNQADTTWWQSQTMLagVQ 126
Cdd:pfam00055 1 CYPAFGNLAFGREVSATSTCGLnGPERYCILSGLEGG-KKCFICDSRDPHNSHPPSNLTDSNNGTNETWWQSETGV--IQ 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 127 YPNsINLTLHLGKAFDITYVRLKFHTSRPESFAIYKRTREDGPWIPYQYYSGSCENTYSKANRGfiRTGGDEQQALCTDE 206
Cdd:pfam00055 78 YEN-VNLTLDLGKEFHFTYLILKFKSPRPAAMVLERSTDFGKTWQPYQYFASDCRRTFGRPSGP--SRGIKDDEVICTSE 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 207 FSDISPLTGGNVAFSTLEGRPSAYNFDNSPVLQEWVTATDIRVTLNRLNTFGDEVFNDPKVLKSYYYAISDFAVGG 282
Cdd:pfam00055 155 YSDISPLTGGEVIFSTLEGRPSANIFDYSPELQDWLTATNIRIRLLRLHTLGDELLDDPSVLRKYYYAISDISVGG 230
|
|
| LamB |
smart00281 |
Laminin B domain; |
552-676 |
4.86e-41 |
|
Laminin B domain;
Pssm-ID: 214597 Cd Length: 127 Bit Score: 147.41 E-value: 4.86e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 552 YFPRYFVAPAKFLGNQVLSYGQNLSFSFRVD--RRDTRLSAEDLVLEGAGLRVSVPliAQGNSYPSE-TTVKYIFRLHEA 628
Cdd:smart00281 2 NEPVYWVAPEQFLGDKVTSYGGKLRYTLSFDgrRGGTHVSAPDVILEGNGLRISHP--AEGPPLPDElTTVEVRFREENW 79
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 281371490 629 TDYPWRPALSPfEFQKLLNNLTSIKIRGTYSERSAG-YLDDVTLQSARP 676
Cdd:smart00281 80 QYYGGRPVTRE-DLMMVLANLTAILIRATYSQQMAGsRLSDVSLEVAVP 127
|
|
| Laminin_B |
pfam00052 |
Laminin B (Domain IV); |
556-686 |
6.55e-41 |
|
Laminin B (Domain IV);
Pssm-ID: 459652 Cd Length: 136 Bit Score: 147.42 E-value: 6.55e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 556 YFVAPAKFLGNQVLSYGQNLSFSFRVDRRD---TRLSAEDLVLEGAGLRVSVPLIAQGNSYPSETTvKYIFRLHEATdyp 632
Cdd:pfam00052 1 YWSAPEQFLGNKLTSYGGYLTYTVRYEPLPgggSLNSEPDVILEGNGLRLSYSSPDQPPPDPGQEQ-TYSVRLHEEN--- 76
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 633 WR----PALSPFEFQKLLNNLTSIKIRGTYSERSAG-YLDDVTLQSARPGP-GVPATWVE 686
Cdd:pfam00052 77 WRdsdgAPVSREDFMMVLANLTAILIRATYSTGSGQvSLSNVSLDSAVPGGsGPPASWVE 136
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1034-1596 |
9.39e-18 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 90.12 E-value: 9.39e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKLQELESLIANLGTGDETVTDQA--FEDRLKEAEREVTDLLREAQEV-------KDVDQNLMDRLQRV 1104
Cdd:TIGR02168 429 KLEEAELKELQAELEELEEELEELQEELERLEEALeeLREELEEAEQALDAAERELAQLqarldslERLQENLEGFSEGV 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1105 NNSLHSQiSRLHNIrntieeTGILAERARSRvESTEQLIEIASREL------EKAKMAAANVSITQPESTG--------- 1169
Cdd:TIGR02168 509 KALLKNQ-SGLSGI------LGVLSELISVD-EGYEAAIEAALGGRlqavvvENLNAAKKAIAFLKQNELGrvtflplds 580
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1170 ------EPNNMTLLAEE--ARRLAERHKQEADDI----------VRVAK---TANETSAE-AYNLLLRTLAGE------- 1220
Cdd:TIGR02168 581 ikgteiQGNDREILKNIegFLGVAKDLVKFDPKLrkalsyllggVLVVDdldNALELAKKlRPGYRIVTLDGDlvrpggv 660
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1221 --------NQTAL----EIEELNRKYEQAKNISQDLEKQAARV---HEEAKRAGDKAVEIYASVAQLTPVDSEALESEAN 1285
Cdd:TIGR02168 661 itggsaktNSSILerrrEIEELEEKIEELEEKIAELEKALAELrkeLEELEEELEQLRKELEELSRQISALRKDLARLEA 740
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1286 KIKKEAADLDRLiDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAeeaakkgRNTLQEANDIL 1365
Cdd:TIGR02168 741 EVEQLEERIAQL-SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKAL-------REALDELRAEL 812
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1366 NNLKdfdRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAA---DATEAKNKAHEaERIASGVQKNATSTK 1442
Cdd:TIGR02168 813 TLLN---EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAeieELEELIEELES-ELEALLNERASLEEA 888
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1443 ADAERtfSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQ--------DMMMAGMASQAAQEAELNARKAKNSVSSLLS- 1513
Cdd:TIGR02168 889 LALLR--SELEELSEELRELESKRSELRRELEELREKLAQlelrleglEVRIDNLQERLSEEYSLTLEEAEALENKIEDd 966
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1514 ------QLTDLLEQLGQLDTVDLNKLNEIEgSLNKAKDEMK--VSDLdrkvsnleNEARKQ-EAAILDYDRDIAEIIKDI 1584
Cdd:TIGR02168 967 eeearrRLKRLENKIKELGPVNLAAIEEYE-ELKERYDFLTaqKEDL--------TEAKETlEEAIEEIDREARERFKDT 1037
|
650
....*....|..
gi 281371490 1585 hnLEDIKKTLPT 1596
Cdd:TIGR02168 1038 --FDQVNENFQR 1047
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1070-1594 |
1.68e-16 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 85.50 E-value: 1.68e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNnslhSQISRLHNIRNTIEETGILAERARSRVESTEQLI-EIASR 1148
Cdd:PRK03918 192 EELIKEKEKELEEVLREINEISSELPELREELEKLE----KEVKELEELKEEIEELEKELESLEGSKRKLEEKIrELEER 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1149 ------ELEKAKmaaanvsitqpESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQ 1222
Cdd:PRK03918 268 ieelkkEIEELE-----------EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1223 TAlEIEELNRKYEQAKNISQDLEKQaARVHEEAKRAGDKAVEIYASVAQLTPVDSEA-----------LESEANKIKKEA 1291
Cdd:PRK03918 337 EE-RLEELKKKLKELEKRLEELEER-HELYEEAKAKKEELERLKKRLTGLTPEKLEKeleelekakeeIEEEISKITARI 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1292 ADLDRLIDQKLKDYEDLREdMRGK---------EHEVKNLLEKGKAEQQTAdqllaradaakalaeeaakkgRNTLQEAN 1362
Cdd:PRK03918 415 GELKKEIKELKKAIEELKK-AKGKcpvcgreltEEHRKELLEEYTAELKRI---------------------EKELKEIE 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1363 DILNNLKDFDRRVnDNKTAAEEALRRIPAINRTIAEANEKTREaqlalgnaaADATEAKNKAHEAERI---ASGVQKNAT 1439
Cdd:PRK03918 473 EKERKLRKELREL-EKVLKKESELIKLKELAEQLKELEEKLKK---------YNLEELEKKAEEYEKLkekLIKLKGEIK 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1440 STKADAERTfsevtdldNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASqaaqEAELNAR------------KAKNS 1507
Cdd:PRK03918 543 SLKKELEKL--------EELKKKLAELEKKLDELEEELAELLKELEELGFES----VEELEERlkelepfyneylELKDA 610
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1508 VSSLLSQL-------TDLLEQLGQLDTVDlNKLNEIEGSLNKAK---DEMKVSDLDRKVSNLENEARKQEAAILDYDRDI 1577
Cdd:PRK03918 611 EKELEREEkelkkleEELDKAFEELAETE-KRLEELRKELEELEkkySEEEYEELREEYLELSRELAGLRAELEELEKRR 689
|
570
....*....|....*..
gi 281371490 1578 AEIIKDIHNLEDIKKTL 1594
Cdd:PRK03918 690 EEIKKTLEKLKEELEER 706
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1070-1588 |
1.36e-14 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 79.34 E-value: 1.36e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETGILAERARSRVESTEQLIE-IASR 1148
Cdd:PRK03918 164 YKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEeIEEL 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1149 ELEKAKmaaanvsitqpestgepnnmtlLAEEARRLAERHKQEADDIVRVAKTANETSAEAYNllLRTLAGENQTALEIE 1228
Cdd:PRK03918 244 EKELES----------------------LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKE--LKELKEKAEEYIKLS 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1229 ELNRKYEQAKNisqDLEKQAARVHEEAKragdkavEIYASVAQLtpvdsEALESEANKIKKEAADLDRLIdQKLKDYEDL 1308
Cdd:PRK03918 300 EFYEEYLDELR---EIEKRLSRLEEEIN-------GIEERIKEL-----EEKEERLEELKKKLKELEKRL-EELEERHEL 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1309 REDMRGKEHEVKNL--------LEKGKAEQQTADQLlaradaakalaeeaakkgRNTLQEA-NDILNNLKDFDRRVNDNK 1379
Cdd:PRK03918 364 YEEAKAKKEELERLkkrltgltPEKLEKELEELEKA------------------KEEIEEEiSKITARIGELKKEIKELK 425
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1380 TAAEE---ALRRIPAINRTIAEANEKT--REAQLALGNAAADATEAKNKAHEAERIASGVQK--NATSTKADAERTFSEV 1452
Cdd:PRK03918 426 KAIEElkkAKGKCPVCGRELTEEHRKEllEEYTAELKRIEKELKEIEEKERKLRKELRELEKvlKKESELIKLKELAEQL 505
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1453 TDLDNEVNGM-LRQLEEAENE---LKRKQDDADQDmmMAGMASQAAQEAELNARKA-----KNSVSSLLSQLTDLLEQLG 1523
Cdd:PRK03918 506 KELEEKLKKYnLEELEKKAEEyekLKEKLIKLKGE--IKSLKKELEKLEELKKKLAelekkLDELEEELAELLKELEELG 583
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281371490 1524 QLDTVDLN-KLNEIEG------SLNKAKDEMKvsDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLE 1588
Cdd:PRK03918 584 FESVEELEeRLKELEPfyneylELKDAEKELE--REEKELKKLEEELDKAFEELAETEKRLEELRKELEELE 653
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
933-978 |
3.34e-14 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 68.11 E-value: 3.34e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 933 CDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGFGPEGC 978
Cdd:smart00180 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1038-1592 |
4.48e-14 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 77.45 E-value: 4.48e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1038 DKVAEHRVKLQELESLIANLGTGDETVTdQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRL-- 1115
Cdd:pfam05483 169 EKTKKYEYEREETRQVYMDLNNNIEKMI-LAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLli 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1116 --HNIRNTIEETGILAERARSRV-----------ESTEQLIEIA---SRELEKAKMAAANVSITQPESTGEPNNMT---- 1175
Cdd:pfam05483 248 qiTEKENKMKDLTFLLEESRDKAnqleektklqdENLKELIEKKdhlTKELEDIKMSLQRSMSTQKALEEDLQIATktic 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1176 -LLAEEARRLAERHKQEADDIVRVakTANETSAEAYNLLLRTlagenqtalEIEELNRKYEQAKNISQDLEKQAARVHEE 1254
Cdd:pfam05483 328 qLTEEKEAQMEELNKAKAAHSFVV--TEFEATTCSLEELLRT---------EQQRLEKNEDQLKIITMELQKKSSELEEM 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1255 AKRAGDKAVEIyasvaqltpvdsealeseaNKIKKEAADLDRLIDQKlKDYEDLREDMRGKEHEVKNLLEKGKAEqqtad 1334
Cdd:pfam05483 397 TKFKNNKEVEL-------------------EELKKILAEDEKLLDEK-KQFEKIAEELKGKEQELIFLLQAREKE----- 451
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1335 qllaradaakalaeeaakkgrntlqeandilnnLKDFDRRVNDNKTAAEEALRRIPAINRTIAeaNEKTREAQLAlGNAA 1414
Cdd:pfam05483 452 ---------------------------------IHDLEIQLTAIKTSEEHYLKEVEDLKTELE--KEKLKNIELT-AHCD 495
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1415 ADATEAKNKAHEAERIASGV---QKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDA----DQDMMMA 1487
Cdd:pfam05483 496 KLLLENKELTQEASDMTLELkkhQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVkcklDKSEENA 575
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1488 GMASQAAQEAELNARKAKNSVSSLLSQLTD---LLEQLGQLDTVDLNKLNEIEGSLNKAkdEMKVSDLDRKvsnLENEAR 1564
Cdd:pfam05483 576 RSIEYEVLKKEKQMKILENKCNNLKKQIENknkNIEELHQENKALKKKGSAENKQLNAY--EIKVNKLELE---LASAKQ 650
|
570 580 590
....*....|....*....|....*....|.
gi 281371490 1565 KQEAAILDYDRDIAeiIKDIHN---LEDIKK 1592
Cdd:pfam05483 651 KFEEIIDNYQKEIE--DKKISEeklLEEVEK 679
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
443-490 |
7.47e-14 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 66.99 E-value: 7.47e-14
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 443 CSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNLESSNPKGC 490
Cdd:pfam00053 1 CDCNPHGSlSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
932-979 |
2.64e-13 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 65.45 E-value: 2.64e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 932 SCDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGF--GPEGCK 979
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLpsQGGGCQ 50
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1071-1567 |
4.36e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 74.79 E-value: 4.36e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1071 DRLKEAE-REVTDLLREAQEVKDVDQnLMDRLQRVNNSLHSQiSRLHNIRNTIEETGILAERARSRVESTEQLIEIASRE 1149
Cdd:PTZ00121 1281 DELKKAEeKKKADEAKKAEEKKKADE-AKKKAEEAKKADEAK-KKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1150 LEKAKMAAANVSITQPESTGEPNNMTLLAEEARRL------AERHKQEADDIVRVA---KTANETSAEAYNLL----LRT 1216
Cdd:PTZ00121 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKAdeakkkAEEDKKKADELKKAAaakKKADEAKKKAEEKKkadeAKK 1438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1217 LAGENQTAleiEELNRKYEQAKNiSQDLEKQA--ARVHEEAKRAGD---KAVEIYASVAQLTPVDSEALESEANKIK--- 1288
Cdd:PTZ00121 1439 KAEEAKKA---DEAKKKAEEAKK-AEEAKKKAeeAKKADEAKKKAEeakKADEAKKKAEEAKKKADEAKKAAEAKKKade 1514
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1289 ------KEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEAN 1362
Cdd:PTZ00121 1515 akkaeeAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARI 1594
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1363 DILNNLKDFD--------RRVNDNKTAA------EEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAE 1428
Cdd:PTZ00121 1595 EEVMKLYEEEkkmkaeeaKKAEEAKIKAeelkkaEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1429 RIASGVQKNATSTKADAERTFSEvtdldNEVNGMLRQLEEAENELKRKQDDADQ-DMMMAGMASQAAQEAELNARKA--- 1504
Cdd:PTZ00121 1675 KKAEEAKKAEEDEKKAAEALKKE-----AEEAKKAEELKKKEAEEKKKAEELKKaEEENKIKAEEAKKEAEEDKKKAeea 1749
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1505 ------KNSVSSLLSqltdllEQLGQLDTVDLNKLNEIEGSLNKaKDEMKVSDLDRKVSNL-ENEARKQE 1567
Cdd:PTZ00121 1750 kkdeeeKKKIAHLKK------EEEKKAEEIRKEKEAVIEEELDE-EDEKRRMEVDKKIKDIfDNFANIIE 1812
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1038-1594 |
4.38e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 74.72 E-value: 4.38e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1038 DKVAEHRVKLQELESLIANLGtGDETVtdqAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHN 1117
Cdd:TIGR02169 265 KRLEEIEQLLEELNKKIKDLG-EEEQL---RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1118 IRNTIEETGILAERARSRVESTEQLIEIASRELE---------KAKMAAANVSITQPESTGEPNNMTL--LAEEARRLAE 1186
Cdd:TIGR02169 341 LEREIEEERKRRDKLTEEYAELKEELEDLRAELEevdkefaetRDELKDYREKLEKLKREINELKRELdrLQEELQRLSE 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1187 R---HKQEADDIvRVAKTANETSAEAYNLLLRTLAGENQTALEI-EELNRKYEQAKNISQDLEKQAARVHEEAKRA---- 1258
Cdd:TIGR02169 421 EladLNAAIAGI-EAKINELEEEKEDKALEIKKQEWKLEQLAADlSKYEQELYDLKEEYDRVEKELSKLQRELAEAeaqa 499
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1259 --------GDKAVE---------IYASVAQLTPVDSE---ALESEA-----NKIKKEAADLDRLIdQKLKDYEDLR---- 1309
Cdd:TIGR02169 500 raseervrGGRAVEevlkasiqgVHGTVAQLGSVGERyatAIEVAAgnrlnNVVVEDDAVAKEAI-ELLKRRKAGRatfl 578
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1310 --EDMRGKEHEVKNLLEKGkaeqqtadqllaradaakalaeeaakkgrntlqeANDILNNLKDFDRR------------- 1374
Cdd:TIGR02169 579 plNKMRDERRDLSILSEDG----------------------------------VIGFAVDLVEFDPKyepafkyvfgdtl 624
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1375 -VNDNKTAaeealRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASgVQKNATSTKADAERTFSEVT 1453
Cdd:TIGR02169 625 vVEDIEAA-----RRLMGKYRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQR-LRERLEGLKRELSSLQSELR 698
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1454 DLDNEVNGMLRQLEEAENELKRKQDDadqdmmmagmASQAAQEAELNARKAKNSVSSL--LSQ--------LTDLLEQLG 1523
Cdd:TIGR02169 699 RIENRLDELSQELSDASRKIGEIEKE----------IEQLEQEEEKLKERLEELEEDLssLEQeienvkseLKELEARIE 768
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1524 QLdTVDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:TIGR02169 769 EL-EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
933-981 |
5.98e-13 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 64.68 E-value: 5.98e-13
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 933 CDCHALGSTNGQCDIRTGQCECQPGITGQHCERCETNHFGFGPEGCKPC 981
Cdd:pfam00053 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
396-443 |
4.17e-12 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 62.37 E-value: 4.17e-12
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 396 CHCSPVGSLSTQCDSY-GRCSCKPGVMGDKCDRCQPGFHSLTEAGCRPC 443
Cdd:pfam00053 1 CDCNPHGSLSDTCDPEtGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQGC 49
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1062-1592 |
5.27e-12 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 71.33 E-value: 5.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1062 ETVTDQAFEDRLKEAEREVTDLLREAQEVKDV-DQNLMDRLQRVNNSLHSQISRLHNIRNTIEETGiLAERARsRVESTE 1140
Cdd:PTZ00121 1105 KTETGKAEEARKAEEAKKKAEDARKAEEARKAeDARKAEEARKAEDAKRVEIARKAEDARKAEEAR-KAEDAK-KAEAAR 1182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1141 QLIEI-ASRELEKAKMAAANVSITQPESTGEPNNMTLlAEEARRL-----AERHKQEADDIVRVAKTANETSAEAYNLLL 1214
Cdd:PTZ00121 1183 KAEEVrKAEELRKAEDARKAEAARKAEEERKAEEARK-AEDAKKAeavkkAEEAKKDAEEAKKAEEERNNEEIRKFEEAR 1261
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1215 RTLAGENQTALEIEElNRKYEQAKNISQDLEKQAARVHEEAKRAGD---KAVEiyasvaqltPVDSEALESEANKIKKEA 1291
Cdd:PTZ00121 1262 MAHFARRQAAIKAEE-ARKADELKKAEEKKKADEAKKAEEKKKADEakkKAEE---------AKKADEAKKKAEEAKKKA 1331
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1292 ADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKalaeeaakkgrntlQEANDIlNNLKDF 1371
Cdd:PTZ00121 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK--------------KKAEEK-KKADEA 1396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1372 DRRVNDNKTAAEEaLRRIPAINRTIAEANEKTREAQlalgnaaaDATEAKNKAHEAERIASGVQKNATSTKADAERTFSE 1451
Cdd:PTZ00121 1397 KKKAEEDKKKADE-LKKAAAAKKKADEAKKKAEEKK--------KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1452 VTDLDNEVNGMLRQLEEAEnELKRKQDDADQDMMMAGMASQAAQEAE-----LNARKAKNSVSSLLSQLTDLLEQLGQLD 1526
Cdd:PTZ00121 1468 EAKKADEAKKKAEEAKKAD-EAKKKAEEAKKKADEAKKAAEAKKKADeakkaEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 281371490 1527 TVD-LNKLNEIEGSLNKAKDEMKVSDLDRKVSNLE--NEARKQEAAILDYDRDIAEIIKDIhNLEDIKK 1592
Cdd:PTZ00121 1547 KADeLKKAEELKKAEEKKKAEEAKKAEEDKNMALRkaEEAKKAEEARIEEVMKLYEEEKKM-KAEEAKK 1614
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
722-761 |
6.48e-12 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 61.60 E-value: 6.48e-12
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 281371490 722 CTCNGH---SETCDPETGVCDCRDNTAGPHCEKCSDGYYGDST 761
Cdd:pfam00053 1 CDCNPHgslSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPS 43
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1033-1592 |
8.82e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 70.35 E-value: 8.82e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKD--KVAEHRVKLQELESLIANLgtgdetvtdQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHS 1110
Cdd:COG1196 215 YRELKEelKELEAELLLLKLRELEAEL---------EELEAELEELEAELEELEAELAELEAELEELRLELEELELELEE 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1111 QISRLHNIRNTIEEtgilAERARSRVEstEQLIEIASRELEkakmaaanvsitqpestgepnnmtlLAEEARRLAERHKQ 1190
Cdd:COG1196 286 AQAEEYELLAELAR----LEQDIARLE--ERRRELEERLEE-------------------------LEEELAELEEELEE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1191 EADDIVRVAKTANETSAEaynlllrtlagENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVA 1270
Cdd:COG1196 335 LEEELEELEEELEEAEEE-----------LEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLE 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1271 QLTPVDS------EALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAK 1344
Cdd:COG1196 404 ELEEAEEallerlERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELL 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1345 ALAEEAAKKgRNTLQEANDILNNLKDFDRRVNDNK--------------------TAAEEAL-RRIPAINRTIAEANEKT 1403
Cdd:COG1196 484 EELAEAAAR-LLLLLEAEADYEGFLEGVKAALLLAglrglagavavligveaayeAALEAALaAALQNIVVEDDEVAAAA 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1404 RE--AQLALGNA------AADATEAKNKAHEAERIASGVQKNATSTKADAERTF-----SEVTDLDNEVNGMLRQLEEAE 1470
Cdd:COG1196 563 IEylKAAKAGRAtflpldKIRARAALAAALARGAIGAAVDLVASDLREADARYYvlgdtLLGRTLVAARLEAALRRAVTL 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1471 NELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEMKVS 1550
Cdd:COG1196 643 AGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELE 722
|
570 580 590 600
....*....|....*....|....*....|....*....|..
gi 281371490 1551 DLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKK 1592
Cdd:COG1196 723 EEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEE 764
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
722-769 |
9.07e-12 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 61.22 E-value: 9.07e-12
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 722 CTCNGH---SETCDPETGVCDCRDNTAGPHCEKCSDGYYGDSTLGTssDCQ 769
Cdd:cd00055 2 CDCNGHgslSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGG--GCQ 50
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1034-1535 |
9.21e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 70.35 E-value: 9.21e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKLQELESLIANLgtgdeTVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQIS 1113
Cdd:COG1196 305 ARLEERRRELEERLEELEEELAEL-----EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEE 379
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1114 RLHNIRNTIEEtgILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEpnnmTLLAEEARRLAERHKQEAD 1193
Cdd:COG1196 380 ELEELAEELLE--ALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELE----EEEEEEEEALEEAAEEEAE 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1194 DIVRVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKY--EQAKNISQDLEKQAARVHEEAKRA---GDKAVEIYAS 1268
Cdd:COG1196 454 LEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLllLEAEADYEGFLEGVKAALLLAGLRglaGAVAVLIGVE 533
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1269 VAQLTPVDSEALESEANKIKKEAADLDRLID------------------------QKLKDYEDLREDMRGKEHEVKNLLE 1324
Cdd:COG1196 534 AAYEAALEAALAAALQNIVVEDDEVAAAAIEylkaakagratflpldkiraraalAAALARGAIGAAVDLVASDLREADA 613
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1325 KGKAEQQT---ADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANE 1401
Cdd:COG1196 614 RYYVLGDTllgRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEEL 693
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1402 KTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGM-------LRQLEEAENELK 1474
Cdd:COG1196 694 ELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEelpeppdLEELERELERLE 773
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1475 RKqddadqdMMMAG---MAsqAAQE-AELNARKAknsvsSLLSQLTDLLEQLGQLDTVdLNKLNE 1535
Cdd:COG1196 774 RE-------IEALGpvnLL--AIEEyEELEERYD-----FLSEQREDLEEARETLEEA-IEEIDR 823
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
442-491 |
1.14e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 60.83 E-value: 1.14e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 442 PCSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNLeSSNPKGCT 491
Cdd:cd00055 1 PCDCNGHGSlSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGL-PSQGGGCQ 50
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1037-1594 |
1.52e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 69.70 E-value: 1.52e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1037 KDKVAEHRVKLQELESLIanlgtgdetvtdQAFEDRLKEAEREVTDLLREAQEVkdvdQNLMDRLQRVNNSLHSQISRlh 1116
Cdd:TIGR02168 336 AEELAELEEKLEELKEEL------------ESLEAELEELEAELEELESRLEEL----EEQLETLRSKVAQLELQIAS-- 397
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1117 nIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSItqpestgepnnmtllaEEARRLAERHKQEADDIV 1196
Cdd:TIGR02168 398 -LNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAEL----------------EELEEELEELQEELERLE 460
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1197 RVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQD---LEKQAARVHEEAKRAGD---------KAVE 1264
Cdd:TIGR02168 461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGvkaLLKNQSGLSGILGVLSElisvdegyeAAIE 540
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1265 IY--ASVAQLTPVDSEALESEANKIKKEAAD------LDRLIDQKLK-DYEDLREDMRGKEhEVKNLLEKGKAEQQTADQ 1335
Cdd:TIGR02168 541 AAlgGRLQAVVVENLNAAKKAIAFLKQNELGrvtflpLDSIKGTEIQgNDREILKNIEGFL-GVAKDLVKFDPKLRKALS 619
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1336 llaradaakalaeeaAKKGR----NTLQEANDILNNLK-----------------------------DFDRRVN--DNKT 1380
Cdd:TIGR02168 620 ---------------YLLGGvlvvDDLDNALELAKKLRpgyrivtldgdlvrpggvitggsaktnssILERRREieELEE 684
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1381 AAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVN 1460
Cdd:TIGR02168 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIE 764
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1461 GMLRQLEEAENELKRKQDDadqdmmmagmasQAAQEAELNarKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSL 1540
Cdd:TIGR02168 765 ELEERLEEAEEELAEAEAE------------IEELEAQIE--QLKEELKALREALDELRAELTLLNEEAANLRERLESLE 830
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1541 NKAKDEMK-VSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:TIGR02168 831 RRIAATERrLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
395-441 |
1.70e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 60.45 E-value: 1.70e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 395 PCHCSPVGSLSTQCDSY-GRCSCKPGVMGDKCDRCQPGFHSLTEA--GCR 441
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGtGQCECKPNTTGRRCDRCAPGYYGLPSQggGCQ 50
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
882-930 |
2.07e-11 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 60.06 E-value: 2.07e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 882 CACNPYGTVQqqSSCNPVTGQCQCLPHVSGRDCGACDPGFYNLQ--SGQGC 930
Cdd:pfam00053 1 CDCNPHGSLS--DTCDPETGQCLCKPGVTGRHCDRCKPGYYGLPsdPPQGC 49
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1034-1521 |
2.28e-11 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 68.64 E-value: 2.28e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKLQELESLIANLGTGDETVTD-QAFEDRLKEAEREVTDLlREAQEVKDVDQNLMDRLQRvnnsLHSQI 1112
Cdd:COG4717 57 ELFKPQGRKPELNLKELKELEEELKEAEEKEEEyAELQEELEELEEELEEL-EAELEELREELEKLEKLLQ----LLPLY 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1113 SRLHNIRNTIEETGILAERARSRVESTEQLI-EIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERhKQE 1191
Cdd:COG4717 132 QELEALEAELAELPERLEELEERLEELRELEeELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQR-LAE 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETSAEAYNLLlrtlagenQTALEIEELNRKYEQAKNISQDLekqAARVHEEAKRAGDKAVEIY----- 1266
Cdd:COG4717 211 LEEELEEAQEELEELEEELEQL--------ENELEAAALEERLKEARLLLLIA---AALLALLGLGGSLLSLILTiagvl 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1267 ASVAQLTPVDSEALESEANKIKKEAADLD------RLIDQKLKDY-EDLREDMRGKEHEVKNLLEKGKAEQQTADQllar 1339
Cdd:COG4717 280 FLVLGLLALLFLLLAREKASLGKEAEELQalpaleELEEEELEELlAALGLPPDLSPEELLELLDRIEELQELLRE---- 355
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1340 adaakalaeeaakkgRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALgNAAADATE 1419
Cdd:COG4717 356 ---------------AEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQL-EELLGELE 419
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1420 AKNKAHEAERIasgvqknatstKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAEL 1499
Cdd:COG4717 420 ELLEALDEEEL-----------EEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQELEELKAELREL 488
|
490 500
....*....|....*....|...
gi 281371490 1500 NARKAKNSV-SSLLSQLTDLLEQ 1521
Cdd:COG4717 489 AEEWAALKLaLELLEEAREEYRE 511
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1111-1544 |
2.40e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 68.81 E-value: 2.40e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1111 QISRL-----HNIRNTIEET-GILA-----ERARSRVESTEQLIEIAS---RELEKakmaaanvsitqpestgepnNMTL 1176
Cdd:COG1196 145 MIDRIieakpEERRAIIEEAaGISKykerkEEAERKLEATEENLERLEdilGELER--------------------QLEP 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1177 LAEEARRlAERHKQeaddivrVAKTANETSAEAYnlllrtlagenqtALEIEELNRKYEQAKNISQDLEKQAARVHEEAk 1256
Cdd:COG1196 205 LERQAEK-AERYRE-------LKEELKELEAELL-------------LLKLRELEAELEELEAELEELEAELEELEAEL- 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1257 ragdkaveiyasvaqltpvdsEALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQl 1336
Cdd:COG1196 263 ---------------------AELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEE- 320
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1337 laradaakalaeeaakkgrntLQEANDILnnlkdfdrrvndnKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAAD 1416
Cdd:COG1196 321 ---------------------LEEELAEL-------------EEELEELEEELEELEEELEEAEEELEEAEAELAEAEEA 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1417 ATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQdmmmAGMASQAAQE 1496
Cdd:COG1196 367 LLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAE----LEEEEEEEEE 442
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 281371490 1497 AELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAK 1544
Cdd:COG1196 443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA 490
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
396-440 |
2.88e-11 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 59.63 E-value: 2.88e-11
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 396 CHCSPVGSLSTQCDS-YGRCSCKPGVMGDKCDRCQPGFHSLTEAGC 440
Cdd:smart00180 1 CDCDPGGSASGTCDPdTGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1042-1325 |
3.20e-11 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 68.23 E-value: 3.20e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1042 EHRVKLQELESLIANLGTGDETVTDQafEDRLKEAEREVTDLLREAQEVKDVDQNL--MDRLQRVNNSLHSQISRLhniR 1119
Cdd:pfam05557 143 LLKAKASEAEQLRQNLEKQQSSLAEA--EQRIKELEFEIQSQEQDSEIVKNSKSELarIPELEKELERLREHNKHL---N 217
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1120 NTIEETGILAERA---RSRVESTEQL-IEIASRELEKAKMAAanvSITQPESTGEPNNMTLLAEEArrLAERHKQ-EADD 1194
Cdd:pfam05557 218 ENIENKLLLKEEVedlKRKLEREEKYrEEAATLELEKEKLEQ---ELQSWVKLAQDTGLNLRSPED--LSRRIEQlQQRE 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1195 IVRVA-KTANETSAEAYNLLLRTLAGENQTAL-EIEELNRKYEQAKNISQDLEKQAARVHEEakRAGDKA-VEIYasvaq 1271
Cdd:pfam05557 293 IVLKEeNSSLTSSARQLEKARRELEQELAQYLkKIEDLNKKLKRHKALVRRLQRRVLLLTKE--RDGYRAiLESY----- 365
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1272 ltpvDSEALESEANKIKKEaadldrlidqKLKDYEDLREDMRGKEHEVKNLLEK 1325
Cdd:pfam05557 366 ----DKELTMSNYSPQLLE----------RIEEAEDMTQKMQAHNEEMEAQLSV 405
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
443-490 |
4.05e-11 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 59.25 E-value: 4.05e-11
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 443 CSCDPSGS-TDECNVETGRCVCKDNVEGFSCERCKPGFFNlesSNPKGC 490
Cdd:smart00180 1 CDCDPGGSaSGTCDPDTGQCECKPNVTGRRCDRCAPGYYG---DGPPGC 46
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
881-931 |
7.21e-11 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 58.90 E-value: 7.21e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 281371490 881 ACACNPYGTVQQQssCNPVTGQCQCLPHVSGRDCGACDPGFYNLQS-GQGCE 931
Cdd:cd00055 1 PCDCNGHGSLSGQ--CDPGTGQCECKPNTTGRRCDRCAPGYYGLPSqGGGCQ 50
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1046-1584 |
1.24e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 66.68 E-value: 1.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1046 KLQELESLIANLGTgDETVTDQAFEDRLKEAER-------EVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRL--- 1115
Cdd:pfam15921 318 QLSDLESTVSQLRS-ELREAKRMYEDKIEELEKqlvlansELTEARTERDQFSQESGNLDDQLQKLLADLHKREKELsle 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1116 --HNIRNTIEETG--ILAERARS-------RVESTEQLIEIASRELE---KAKMAA---ANVSI-------TQPESTGEp 1171
Cdd:pfam15921 397 keQNKRLWDRDTGnsITIDHLRRelddrnmEVQRLEALLKAMKSECQgqmERQMAAiqgKNESLekvssltAQLESTKE- 475
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1172 nNMTLLAEE--ARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQTAL-EIEELNRKYEQAKNISQDLEkqA 1248
Cdd:pfam15921 476 -MLRKVVEEltAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLqELQHLKNEGDHLRNVQTECE--A 552
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1249 ARVHEEAKragDKAVEIY-ASVAQLTPVDSE------ALESEANKIKKEAADlDRLIDQKLKDYEDLRE-DMRGKEHEVK 1320
Cdd:pfam15921 553 LKLQMAEK---DKVIEILrQQIENMTQLVGQhgrtagAMQVEKAQLEKEIND-RRLELQEFKILKDKKDaKIRELEARVS 628
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1321 NL-LEKGKAEQQTADQLLARadaakalaeeaakkgRNTLQEANDILNNLKDFDRRVNdNKTAAEEALRRipAINRTIAEA 1399
Cdd:pfam15921 629 DLeLEKVKLVNAGSERLRAV---------------KDIKQERDQLLNEVKTSRNELN-SLSEDYEVLKR--NFRNKSEEM 690
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1400 NEKTREAQLALGNAAADATEAKN--KAHE-----AERIASGVQKNATS-------------------TKADAERTF---- 1449
Cdd:pfam15921 691 ETTTNKLKMQLKSAQSELEQTRNtlKSMEgsdghAMKVAMGMQKQITAkrgqidalqskiqfleeamTNANKEKHFlkee 770
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 -----SEVTDLDNEVNGMLRQLEEAENELKR-KQDDADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLG 1523
Cdd:pfam15921 771 knklsQELSTVATEKNKMAGELEVLRSQERRlKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQG 850
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1524 QLDTVDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDI 1584
Cdd:pfam15921 851 PGYTSNSSMKPRLLQPASFTRTHSNVPSSQSTASFLSHHSRKTNALKEDPTRDLKQLLQEL 911
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1027-1335 |
1.33e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 65.94 E-value: 1.33e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1027 QECPACYRLVKDKVAEHRVKLQELESLIANLGTGdETVTDQAFEDRLKEAEREVTDLLREAQEvkdvdqnLMDRLQRVNN 1106
Cdd:COG4717 142 AELPERLEELEERLEELRELEEELEELEAELAEL-QEELEELLEQLSLATEEELQDLAEELEE-------LQQRLAELEE 213
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1107 SLHSQISRLHNIRNTIE--ETGILAERARSRVESTEQLIEIAS--------------RELEKAKMAAANVSITQPESTGE 1170
Cdd:COG4717 214 ELEEAQEELEELEEELEqlENELEAAALEERLKEARLLLLIAAallallglggsllsLILTIAGVLFLVLGLLALLFLLL 293
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1171 PNNMTLLAEEARRLAERHKQEA------DDIVRVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQA---KNIS 1241
Cdd:COG4717 294 AREKASLGKEAEELQALPALEEleeeelEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEeleQEIA 373
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1242 QDLEK----------QAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALES-EANKIKKEAADLDRLIDQKLKDYEDLRE 1310
Cdd:COG4717 374 ALLAEagvedeeelrAALEQAEEYQELKEELEELEEQLEELLGELEELLEAlDEEELEEELEELEEELEELEEELEELRE 453
|
330 340
....*....|....*....|....*
gi 281371490 1311 DMRGKEHEVKNLLEKGKAEQQTADQ 1335
Cdd:COG4717 454 ELAELEAELEQLEEDGELAELLQEL 478
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
722-763 |
2.86e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 56.94 E-value: 2.86e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 281371490 722 CTCNG---HSETCDPETGVCDCRDNTAGPHCEKCSDGYYGDSTLG 763
Cdd:smart00180 1 CDCDPggsASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGPPG 45
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1072-1568 |
3.38e-10 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 65.06 E-value: 3.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1072 RLKEAEREVTDLLREAQEVKDvDQNLMDRLqrvnNSLHSQISRLHNIRNTIEETGILAERARSRVEST-----EQLIEIA 1146
Cdd:PRK02224 180 RVLSDQRGSLDQLKAQIEEKE-EKDLHERL----NGLESELAELDEEIERYEEQREQARETRDEADEVleeheERREELE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1147 SRELEKAKMAAAnvsITQPESTGEPnnmtlLAEEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENqtalE 1226
Cdd:PRK02224 255 TLEAEIEDLRET---IAETEREREE-----LAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELED----R 322
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1227 IEELNRKYEQAKNISQDLEKQAARVHEEAKRagdkaveiyasvaqltpvdseaLESEANKIKKEAADLDrlidqklKDYE 1306
Cdd:PRK02224 323 DEELRDRLEECRVAAQAHNEEAESLREDADD----------------------LEERAEELREEAAELE-------SELE 373
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1307 DLREDMRGKEHEVKNLlekgkaEQQTADQllaradaakalaeeaakkgRNTLQEANDILNNLKDFDRRVNDNKTAAEEAL 1386
Cdd:PRK02224 374 EAREAVEDRREEIEEL------EEEIEEL-------------------RERFGDAPVDLGNAEDFLEELREERDELRERE 428
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1387 RRIPAinrTIAEANEKTREAQ--LALGN--------------AAADATEAKNKAHEAERIASGVQKNATSTK----ADAE 1446
Cdd:PRK02224 429 AELEA---TLRTARERVEEAEalLEAGKcpecgqpvegsphvETIEEDRERVEELEAELEDLEEEVEEVEERleraEDLV 505
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1447 RTFSEVTDLDNEVNGMLRQLEEAENELKRKQD-------DADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLL 1519
Cdd:PRK02224 506 EAEDRIERLEERREDLEELIAERRETIEEKREraeelreRAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELK 585
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1520 EQLGQLDTV------------DLNKLNEIEGSLNKAKDEMK--VSDLDRKVSNLE---NEARKQEA 1568
Cdd:PRK02224 586 ERIESLERIrtllaaiadaedEIERLREKREALAELNDERRerLAEKRERKRELEaefDEARIEEA 651
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
980-1027 |
3.51e-10 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 56.59 E-value: 3.51e-10
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 980 PCDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYFYNRSWP-GCQ 1027
Cdd:cd00055 1 PCDCNGHGSLSGQCdPGTGQCECKPNTTGRRCDRCAPGYYGLPSQGgGCQ 50
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
882-930 |
3.55e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 56.55 E-value: 3.55e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 882 CACNPYGTVQqqSSCNPVTGQCQCLPHVSGRDCGACDPGFYNlQSGQGC 930
Cdd:smart00180 1 CDCDPGGSAS--GTCDPDTGQCECKPNVTGRRCDRCAPGYYG-DGPPGC 46
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1044-1580 |
7.30e-10 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 64.22 E-value: 7.30e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1044 RVKLQ---ELESLIANLGTGDETVTDQAFEDRLKEAEREVTDL---------LREAQEVKDVDQNLMDRLQRVNNSLHSQ 1111
Cdd:TIGR00618 196 AELLTlrsQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTqqshayltqKREAQEEQLKKQQLLKQLRARIEELRAQ 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1112 ISRLHNIRNTIEetgilaeraRSRvesteqlieiasrelEKAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERHKqe 1191
Cdd:TIGR00618 276 EAVLEETQERIN---------RAR---------------KAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMK-- 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 addivRVAKTANETSAEAYNLLLRTLAGENqtaleiEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKaveiyasvaq 1271
Cdd:TIGR00618 330 -----RAAHVKQQSSIEEQRRLLQTLHSQE------IHIRDAHEVATSIREISCQQHTLTQHIHTLQQQK---------- 388
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1272 ltpvdsEALESEANKIKKEAADLDRL---IDQKLKDYEDLREDMRGKEHEVKnlLEKGKAEQQtaDQLLARADAAKALAE 1348
Cdd:TIGR00618 389 ------TTLTQKLQSLCKELDILQREqatIDTRTSAFRDLQGQLAHAKKQQE--LQQRYAELC--AAAITCTAQCEKLEK 458
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1349 EAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAAD-----ATEAKNK 1423
Cdd:TIGR00618 459 IHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLtrrmqRGEQTYA 538
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1424 AHE-----AERIASGVQKNATSTKADAER---TFSEVTDLDNEV----NGMLRQLE------EAENELKRKQDDADQDMM 1485
Cdd:TIGR00618 539 QLEtseedVYHQLTSERKQRASLKEQMQEiqqSFSILTQCDNRSkediPNLQNITVrlqdltEKLSEAEDMLACEQHALL 618
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1486 MAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLD--------TVD-LNKLNEIEGSLNKAKDEMKVSDLDRKV 1556
Cdd:TIGR00618 619 RKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERvrehalsiRVLpKELLASRQLALQKMQSEKEQLTYWKEM 698
|
570 580
....*....|....*....|....*
gi 281371490 1557 SNLENEA-RKQEAAILDYDRDIAEI 1580
Cdd:TIGR00618 699 LAQCQTLlRELETHIEEYDREFNEI 723
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
981-1026 |
9.92e-10 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 55.40 E-value: 9.92e-10
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 281371490 981 CDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYfYNRSWPGC 1026
Cdd:smart00180 1 CDCDPGGSASGTCdPDTGQCECKPNVTGRRCDRCAPGY-YGDGPPGC 46
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1042-1594 |
1.32e-09 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 63.27 E-value: 1.32e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1042 EHRVKL----QELESLIANLgtgdetvtdqafEDRLKEAEREVTDLLRE----AQEVKDVDQNLMD------RLQRVNNS 1107
Cdd:pfam01576 61 EMRARLaarkQELEEILHEL------------ESRLEEEEERSQQLQNEkkkmQQHIQDLEEQLDEeeaarqKLQLEKVT 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1108 LHSQISRLhnirntiEETGILAERARSRVESTEQLIEIASRELEkakmaaanvsitqpestgepNNMTLLAEEARRLAE- 1186
Cdd:pfam01576 129 TEAKIKKL-------EEDILLLEDQNSKLSKERKLLEERISEFT--------------------SNLAEEEEKAKSLSKl 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1187 RHKQEA---DDIVRVAKtaNETSAEAYNLLLRTLAGE-----NQTA---LEIEELnrkyeQAKNISQDLEKQAA--RVHE 1253
Cdd:pfam01576 182 KNKHEAmisDLEERLKK--EEKGRQELEKAKRKLEGEstdlqEQIAelqAQIAEL-----RAQLAKKEEELQAAlaRLEE 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1254 E--AKRAGDKAV-EIYASVAQLtpvdSEALESEA---NKIKKEAADLDRLIDQKLKDYED------LREDMRGK-EHEV- 1319
Cdd:pfam01576 255 EtaQKNNALKKIrELEAQISEL----QEDLESERaarNKAEKQRRDLGEELEALKTELEDtldttaAQQELRSKrEQEVt 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1320 ---KNLLEKGKA-EQQTAD--QLLARADAAKALAEEAAKKGRNTLQEANDIL----NNLKDFDRRVNDNKTAAEEALRRI 1389
Cdd:pfam01576 331 elkKALEEETRShEAQLQEmrQKHTQALEELTEQLEQAKRNKANLEKAKQALesenAELQAELRTLQQAKQDSEHKRKKL 410
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1390 PA-----------INRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQknatSTKADAERTFSEVT----- 1453
Cdd:pfam01576 411 EGqlqelqarlseSERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLE----SQLQDTQELLQEETrqkln 486
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1454 ------DLDNEVNGMLRQLEEAE-----------------NELKRKQDDadqdmmMAGmASQAAQEAElnaRKAKNSVSS 1510
Cdd:pfam01576 487 lstrlrQLEDERNSLQEQLEEEEeakrnverqlstlqaqlSDMKKKLEE------DAG-TLEALEEGK---KRLQRELEA 556
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1511 LLSQLT------DLLEQL-----GQLD--TVDLNKLNEIEGSLNKAK--------DEMKVS-----DLDRKvsnlENEAR 1564
Cdd:pfam01576 557 LTQQLEekaaayDKLEKTknrlqQELDdlLVDLDHQRQLVSNLEKKQkkfdqmlaEEKAISaryaeERDRA----EAEAR 632
|
650 660 670
....*....|....*....|....*....|
gi 281371490 1565 KQEAAILDYDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:pfam01576 633 EKETRALSLARALEEALEAKEELERTNKQL 662
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1033-1483 |
2.13e-09 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 61.85 E-value: 2.13e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKDKVAEHrvkLQELESLIANLGTGDETVTD--QAFEDRLKEAEREVtDLLRE-------AQEVKDVDQNLMDRlqr 1103
Cdd:COG5278 81 YEEARAEIDEL---LAELRSLTADNPEQQARLDEleALIDQWLAELEQVI-ALRRAggleaalALVRSGEGKALMDE--- 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1104 vnnslhsqisrlhnIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARR 1183
Cdd:COG5278 154 --------------IRARLLLLALALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAAL 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1184 LAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAV 1263
Cdd:COG5278 220 LAAAAALAALAALELLAALALALALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALAL 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1264 EIYASVAQLTPVDSEALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAA 1343
Cdd:COG5278 300 AELELELLLAAAAAAAAAAAAAAAALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEV 379
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1344 KALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNK 1423
Cdd:COG5278 380 EAEGAAEAVELEVLAIAAAAAAAAAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALA 459
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1424 AHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQD 1483
Cdd:COG5278 460 AAEALAEELAAVAALAALAAAAAALAEAEAAAALAAAAALSLALALAALLLAAAEAALAA 519
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1129-1504 |
3.32e-09 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 62.15 E-value: 3.32e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1129 AERARSRVES--------TEQLIEIASRELEKAKMAAANVSITqpeSTGEPNNMTLLAEEARRLAERHKQEADDIVRVAK 1200
Cdd:NF041483 202 AESARAEAEAilrrarkdAERLLNAASTQAQEATDHAEQLRSS---TAAESDQARRQAAELSRAAEQRMQEAEEALREAR 278
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1201 T-ANETSAEAYNLLLRTLAG-----ENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAK--------RAGDKAVEIY 1266
Cdd:NF041483 279 AeAEKVVAEAKEAAAKQLASaesanEQRTRTAKEEIARLVGEATKEAEALKAEAEQALADARaeaeklvaEAAEKARTVA 358
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1267 A--SVAQLTPVDSEALE-----------------SEANKIKKEA-ADLDRLIDQKlkdyEDLREDMRG------KEHEVK 1320
Cdd:NF041483 359 AedTAAQLAKAARTAEEvltkasedakattraaaEEAERIRREAeAEADRLRGEA----ADQAEQLKGaakddtKEYRAK 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1321 N--LLE-----KGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLK-DFDRRVNDNKTAAE----EALRR 1388
Cdd:NF041483 435 TveLQEearrlRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKaDADELRSTATAESErvrtEAIER 514
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1389 IPAINRTIAEANEKTREaqlalgnaaadateaknkahEAERIASGVQKNATSTKADAERTFSEVTDlDNEVNGMLRQLEE 1468
Cdd:NF041483 515 ATTLRRQAEETLERTRA--------------------EAERLRAEAEEQAEEVRAAAERAARELRE-ETERAIAARQAEA 573
|
410 420 430
....*....|....*....|....*....|....*..
gi 281371490 1469 AEnELKRKQDDADQDMMMAGMA-SQAAQEAELNARKA 1504
Cdd:NF041483 574 AE-ELTRLHTEAEERLTAAEEAlADARAEAERIRREA 609
|
|
| MA |
smart00283 |
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ... |
1089-1300 |
4.77e-09 |
|
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.
Pssm-ID: 214599 [Multi-domain] Cd Length: 262 Bit Score: 59.22 E-value: 4.77e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1089 EVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPEST 1168
Cdd:smart00283 1 DVSEAVEEIAAGAEEQAEELEELAERMEELSASIEEVAANADEIAATAQSAAEAAEEGREAVEDAITAMDQIREVVEEAV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1169 G-----------------------EPNNMTLL-------------------AEEARRLAERHKQEADDIVRVAKTANETS 1206
Cdd:smart00283 81 SaveeleessdeigeivsviddiaDQTNLLALnaaieaarageagrgfavvADEVRKLAERSAESAKEIESLIKEIQEET 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1207 AEAYNLLLRTLAGENQTALEIEELNRKYE----QAKNISQDLEKQAARVHEEAKRAGD--KAVEIYASVAQLTPVDSEAL 1280
Cdd:smart00283 161 NEAVAAMEESSSEVEEGVELVEETGDALEeivdSVEEIADLVQEIAAATDEQAAGSEEvnAAIDEIAQVTQETAAMSEEI 240
|
250 260
....*....|....*....|
gi 281371490 1281 ESEANKIKKEAADLDRLIDQ 1300
Cdd:smart00283 241 SAAAEELSGLAEELDELVER 260
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1037-1479 |
4.84e-09 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 61.21 E-value: 4.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1037 KDKVAEHRVKLQELESLIANLGTGDETVTD-----QAFEDRLKEAEREVTDL------LREAQEVKDVDQN-LMDRLQRV 1104
Cdd:PRK02224 240 DEVLEEHEERREELETLEAEIEDLRETIAEtererEELAEEVRDLRERLEELeeerddLLAEAGLDDADAEaVEARREEL 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1105 NNSLHSQISRLHNIRNTIEETGILAERARsrvESTEQLIEIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARRL 1184
Cdd:PRK02224 320 EDRDEELRDRLEECRVAAQAHNEEAESLR---EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEEL 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1185 AERHKQEADDIVRvaktanetsAEAYNLLLRtlagenqtaleiEELNRKYEQAKNISQDLEKQAARVHE-EAKRAGDKAV 1263
Cdd:PRK02224 397 RERFGDAPVDLGN---------AEDFLEELR------------EERDELREREAELEATLRTARERVEEaEALLEAGKCP 455
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1264 EIYASVAQLTPVDS--------EALESEANKIKKEAADLDRLIDQ--KLKDYEDLREDMRGKEHEVKNLLEKGKA----E 1329
Cdd:PRK02224 456 ECGQPVEGSPHVETieedrervEELEAELEDLEEEVEEVEERLERaeDLVEAEDRIERLEERREDLEELIAERREtieeK 535
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1330 QQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAeEALRRIPAINRTIAEA-------NEK 1402
Cdd:PRK02224 536 RERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERI-ESLERIRTLLAAIADAedeierlREK 614
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1403 tREAQLALGNAAADATEAKNK-------AHEAERIASGVQK--NATSTKADAERTFSEVT----DLDNEVNGMLRQLEEA 1469
Cdd:PRK02224 615 -REALAELNDERRERLAEKRErkreleaEFDEARIEEAREDkeRAEEYLEQVEEKLDELReerdDLQAEIGAVENELEEL 693
|
490
....*....|
gi 281371490 1470 EnELKRKQDD 1479
Cdd:PRK02224 694 E-ELRERREA 702
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1089-1335 |
5.53e-09 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 59.54 E-value: 5.53e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1089 EVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNT-IEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSitqpES 1167
Cdd:COG1340 1 SKTDELSSSLEELEEKIEELREEIEELKEKRDElNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVK----EL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1168 TGEPNNmtlLAEEARRL---AERHKQEADDIVRVAKTANETSaEAYNLLLR-------TLAGENQTALEIEELNRKYEQA 1237
Cdd:COG1340 77 KEERDE---LNEKLNELreeLDELRKELAELNKAGGSIDKLR-KEIERLEWrqqtevlSPEEEKELVEKIKELEKELEKA 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1238 KNI-------------SQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTP-VDS-----EALESEANKIKKEAADLDRLI 1298
Cdd:COG1340 153 KKAlekneklkelraeLKELRKEAEEIHKKIKELAEEAQELHEEMIELYKeADElrkeaDELHKEIVEAQEKADELHEEI 232
|
250 260 270
....*....|....*....|....*....|....*..
gi 281371490 1299 DQKLKDYEDLREdmrgkehEVKNLLEKGKAEQQTADQ 1335
Cdd:COG1340 233 IELQKELRELRK-------ELKKLRKKQRALKREKEK 262
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1172-1589 |
5.66e-09 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 61.29 E-value: 5.66e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1172 NNMTLLAEEAR----RLAERHKQEADDIVRVAKTAnetsaeaynlllRTLAGENQTALEIEElnrkyEQAKNIS------ 1241
Cdd:pfam15921 256 NKIELLLQQHQdrieQLISEHEVEITGLTEKASSA------------RSQANSIQSQLEIIQ-----EQARNQNsmymrq 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1242 -QDLEKQAARVH---EEAKRAGDKAVEIYASvaQLTPVDSEALE--SEANKIKKEAADLDRLIDQKLKDYEDLREDMRGK 1315
Cdd:pfam15921 319 lSDLESTVSQLRselREAKRMYEDKIEELEK--QLVLANSELTEarTERDQFSQESGNLDDQLQKLLADLHKREKELSLE 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1316 EHEVKNLLEKGKAEQQTADQLLARADAakalaeeaakkgRNT-LQEANDILNNLKDFDRRVNDNKTAA----EEALRRIP 1390
Cdd:pfam15921 397 KEQNKRLWDRDTGNSITIDHLRRELDD------------RNMeVQRLEALLKAMKSECQGQMERQMAAiqgkNESLEKVS 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1391 AINRTIaeanEKTREAqlaLGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAE 1470
Cdd:pfam15921 465 SLTAQL----ESTKEM---LRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLK 537
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1471 NE---LKRKQDDADQ-DMMMAGmasqaaqeaelnarkaKNSVSSLLSQ----LTDLLEQLGQLDTVDLNKLNEIEGSLN- 1541
Cdd:pfam15921 538 NEgdhLRNVQTECEAlKLQMAE----------------KDKVIEILRQqienMTQLVGQHGRTAGAMQVEKAQLEKEINd 601
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*....
gi 281371490 1542 -----------KAKDEMKVSDLDRKVSNLENEARKqeaaILDYDRDIAEIIKDIHNLED 1589
Cdd:pfam15921 602 rrlelqefkilKDKKDAKIRELEARVSDLELEKVK----LVNAGSERLRAVKDIKQERD 656
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1065-1504 |
6.41e-09 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 61.00 E-value: 6.41e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1065 TDQAFEDRLKEAEREVTDLLREAQEvkdvdqnlmdRLQRVNNSL---HSQISRLHniRNTIEETGIL----AERARSRVE 1137
Cdd:NF041483 562 TERAIAARQAEAAEELTRLHTEAEE----------RLTAAEEALadaRAEAERIR--REAAEETERLrteaAERIRTLQA 629
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1138 STEQLIEIASRElekakmAAANVSITQPEstGEpNNMTLLAEEARRLAERHK---QEADDIVRV-AKTANE----TSAEA 1209
Cdd:NF041483 630 QAEQEAERLRTE------AAADASAARAE--GE-NVAVRLRSEAAAEAERLKseaQESADRVRAeAAAAAErvgtEAAEA 700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1210 YNLLLRTLAGENQTALEI-----EELNRKYEQAKNISQDL-----------EKQAARVHEEA-KRAGD-------KAVEI 1265
Cdd:NF041483 701 LAAAQEEAARRRREAEETlgsarAEADQERERAREQSEELlasarkrveeaQAEAQRLVEEAdRRATElvsaaeqTAQQV 780
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1266 YASVAQLTPVDSE-------ALESEANKIKKEAAD-LDRL-------IDQKLKDYEDLREDMRGKEHEVKNLLEKGKAE- 1329
Cdd:NF041483 781 RDSVAGLQEQAEEeiaglrsAAEHAAERTRTEAQEeADRVrsdayaeRERASEDANRLRREAQEETEAAKALAERTVSEa 860
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1330 --------QQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKD-----FDRRVNDNKTAAEealrRIPAINRTI 1396
Cdd:NF041483 861 iaeaerlrSDASEYAQRVRTEASDTLASAEQDAARTRADAREDANRIRSdaaaqADRLIGEATSEAE----RLTAEARAE 936
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1397 AE--ANEKTREAQLALGNAAADATEAKNKAH-EAERIASGVQKNATSTKADAERTFSE----VTDLDNEVNGML------ 1463
Cdd:NF041483 937 AErlRDEARAEAERVRADAAAQAEQLIAEATgEAERLRAEAAETVGSAQQHAERIRTEaervKAEAAAEAERLRtearee 1016
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 281371490 1464 --RQLEEAENEL-KRKQDDADQ-DMMMAGMASQAAQ---EAELNARKA 1504
Cdd:NF041483 1017 adRTLDEARKDAnKRRSEAAEQaDTLITEAAAEADQltaKAQEEALRT 1064
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
826-873 |
8.67e-09 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 52.74 E-value: 8.67e-09
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 281371490 826 CQCNDNIDPNavGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNPLAP 873
Cdd:pfam00053 1 CDCNPHGSLS--DTCDPETGQCL-CKPGVTGRHCDRCKPGYYGLPSDP 45
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1032-1524 |
9.09e-09 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 60.57 E-value: 9.09e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1032 CYRLVKDkVAEHRVKLQELESLIAnlgtgDETVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQ 1111
Cdd:pfam01576 456 NIKLSKD-VSSLESQLQDTQELLQ-----EETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDM 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1112 ISRLHNIRNTIEetgiLAERARSRVESTeqlIEIASRELEKAKMAAANVSITQPESTGEPNNMT---------------- 1175
Cdd:pfam01576 530 KKKLEEDAGTLE----ALEEGKKRLQRE---LEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLvdldhqrqlvsnlekk 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1176 ------LLAEE----ARRLAERHKQEADdivrvaktANETSAEAYnlllrTLAGENQTALE-IEELNRKYEQAKNISQDL 1244
Cdd:pfam01576 603 qkkfdqMLAEEkaisARYAEERDRAEAE--------AREKETRAL-----SLARALEEALEaKEELERTNKQLRAEMEDL 669
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1245 --EKQAA--RVH--EEAKRAGDKAVEiyASVAQLTPVDSEALESEANKIKKEA------ADLDRLIDQKLKDYEDLREDM 1312
Cdd:pfam01576 670 vsSKDDVgkNVHelERSKRALEQQVE--EMKTQLEELEDELQATEDAKLRLEVnmqalkAQFERDLQARDEQGEEKRRQL 747
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1313 RGKEHEVKNLLE---KGKAEQQTADQLLARADAAKALAEEAAKKGRN-TLQEANDILNNLKDFDRRVNDNKTAAEEALrr 1388
Cdd:pfam01576 748 VKQVRELEAELEderKQRAQAVAAKKKLELDLKELEAQIDAANKGREeAVKQLKKLQAQMKDLQRELEEARASRDEIL-- 825
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1389 ipainrTIAEANE---KTREAQ-LALGNAAADATEAKNKAhEAER------IASGVQKNAtSTKADAERTFSEVTDLDNE 1458
Cdd:pfam01576 826 ------AQSKESEkklKNLEAElLQLQEDLAASERARRQA-QQERdeladeIASGASGKS-ALQDEKRRLEARIAQLEEE 897
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1459 ----------VNGMLR----QLEEAENELK------RKQDDADQDMmmagmasqAAQEAELNAR----------KAKNSV 1508
Cdd:pfam01576 898 leeeqsntelLNDRLRkstlQVEQLTTELAaerstsQKSESARQQL--------ERQNKELKAKlqemegtvksKFKSSI 969
|
570
....*....|....*.
gi 281371490 1509 SSLLSQLTDLLEQLGQ 1524
Cdd:pfam01576 970 AALEAKIAQLEEQLEQ 985
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1208-1565 |
9.14e-09 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 60.46 E-value: 9.14e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1208 EAYNLLLRTLagENQTALE--IEELNRKYEQaknisqdLEKQAARVHE-EAKRAGDKAVEIYASVAQLTPVDSE--ALES 1282
Cdd:TIGR02168 176 ETERKLERTR--ENLDRLEdiLNELERQLKS-------LERQAEKAERyKELKAELRELELALLVLRLEELREEleELQE 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1283 EANKIKKEAADLDRLIDQKLKDYEDLR----EDMRGKEHEVKNLLEKgKAEQQTADQllaradaakalaeeaakkgrnTL 1358
Cdd:TIGR02168 247 ELKEAEEELEELTAELQELEEKLEELRlevsELEEEIEELQKELYAL-ANEISRLEQ---------------------QK 304
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 QEANDILNNLKDFDRRVNdnkTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNA 1438
Cdd:TIGR02168 305 QILRERLANLERQLEELE---AQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1439 TSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQD---DADQDMMMAGMASQAAQEAELN-----ARKAKNSVSS 1510
Cdd:TIGR02168 382 ETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQeieELLKKLEEAELKELQAELEELEeeleeLQEELERLEE 461
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1511 LLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEM-KVSDLDRKVSNLENEARK 1565
Cdd:TIGR02168 462 ALEELREELEEAEQALDAAERELAQLQARLDSLERLQeNLEGFSEGVKALLKNQSG 517
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
1076-1587 |
1.62e-08 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 59.53 E-value: 1.62e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1076 AEREvtDLLREAQEVKDVDQNlMDRLQRVNNSLHSQISRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKM 1155
Cdd:PRK01156 149 AQRK--KILDEILEINSLERN-YDKLKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSI 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1156 AAANVSITQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSA--EAYNLLLRTLAGENQTAL-----EIE 1228
Cdd:PRK01156 226 EYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAESDLSMELEKNNYYKEleERHMKIINDPVYKNRNYIndyfkYKN 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1229 ELNRKYEQAKNISQDLEKqaarvHEEAKRagdKAVEIyasvaqltpvdsEALESEANKIKKEAADLDRLIDQkLKDYEDl 1308
Cdd:PRK01156 306 DIENKKQILSNIDAEINK-----YHAIIK---KLSVL------------QKDYNDYIKKKSRYDDLNNQILE-LEGYEM- 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1309 reDMRGKEHEVKNLleKGKAEQQTADQLLARADAAKALAeeaakkgrntLQEAN-DILNN-LKDFDRRVNDNKTAAEEAL 1386
Cdd:PRK01156 364 --DYNSYLKSIESL--KKKIEEYSKNIERMSAFISEILK----------IQEIDpDAIKKeLNEINVKLQDISSKVSSLN 429
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1387 RRIPAInrtIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEV------- 1459
Cdd:PRK01156 430 QRIRAL---RENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHIINHYNEKKSRLEEKIREIEIEVKDIDEKIvdlkkrk 506
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1460 ----NGMLRQLEEAENELKRKQDDADQDMmmagMASQAAQEAELNARKAKNSVSS-----LLSQLTDLLEQLGQLDTVDL 1530
Cdd:PRK01156 507 eyleSEEINKSINEYNKIESARADLEDIK----IKINELKDKHDKYEEIKNRYKSlkledLDSKRTSWLNALAVISLIDI 582
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281371490 1531 NKL----NEIEGSLNKAKDEMKV---------SDLDRKVSNLENEA---RKQEAAILDYDRDIAEIIKDIHNL 1587
Cdd:PRK01156 583 ETNrsrsNEIKKQLNDLESRLQEieigfpddkSYIDKSIREIENEAnnlNNKYNEIQENKILIEKLRGKIDNY 655
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1028-1454 |
1.63e-08 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 59.69 E-value: 1.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1028 ECPACYRLVKDkvaEHRVKLqeleslianlgtgdetvtdqafedrLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNS 1107
Cdd:PRK03918 437 KCPVCGRELTE---EHRKEL-------------------------LEEYTAELKRIEKELKEIEEKERKLRKELRELEKV 488
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1108 L--HSQISRLHNIRNTIEETgilaeraRSRVES--TEQLiEIASRELEKAKMAAANVSitqpestGEPNNmtlLAEEARR 1183
Cdd:PRK03918 489 LkkESELIKLKELAEQLKEL-------EEKLKKynLEEL-EKKAEEYEKLKEKLIKLK-------GEIKS---LKKELEK 550
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1184 LAErHKQEADDIVRVAKTANETSAEAYNLLLR----TLAGENQTALEIEELNRKYEQAKNISQDLEkqaaRVHEEAKRAG 1259
Cdd:PRK03918 551 LEE-LKKKLAELEKKLDELEEELAELLKELEElgfeSVEELEERLKELEPFYNEYLELKDAEKELE----REEKELKKLE 625
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1260 DKAVEIYASVAQLtpvdsealESEANKIKKEAADLDRLIDQKlkDYEDLREDMRGKEHEVKNLLEKGKAEqqtadqllar 1339
Cdd:PRK03918 626 EELDKAFEELAET--------EKRLEELRKELEELEKKYSEE--EYEELREEYLELSRELAGLRAELEEL---------- 685
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1340 adaakalaeeaakkgRNTLQEANDILNNLKdfdrrvnDNKTAAEEALRRIPAINRTIAEAnEKTREAQLALGNaaadatE 1419
Cdd:PRK03918 686 ---------------EKRREEIKKTLEKLK-------EELEEREKAKKELEKLEKALERV-EELREKVKKYKA------L 736
|
410 420 430
....*....|....*....|....*....|....*.
gi 281371490 1420 AKNKA-HEAERIASGVqknatstkadaertFSEVTD 1454
Cdd:PRK03918 737 LKERAlSKVGEIASEI--------------FEELTE 758
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
981-1027 |
1.77e-08 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 51.97 E-value: 1.77e-08
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 281371490 981 CDCHHEGSLSLQC-KDDGRCECREGFVGNRCDQCEENYFYNRSWPGCQ 1027
Cdd:pfam00053 1 CDCNPHGSLSDTCdPETGQCLCKPGVTGRHCDRCKPGYYGLPSDPPQG 48
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1039-1428 |
2.06e-08 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 58.38 E-value: 2.06e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1039 KVAEHRVKLQELE----SLIANLgtgDETVTDQAFEdrLKEAEREVTDLLREAQEVKdvdqnlmDRLQRVNNSLHSQISR 1114
Cdd:COG4372 7 KVGKARLSLFGLRpktgILIAAL---SEQLRKALFE--LDKLQEELEQLREELEQAR-------EELEQLEEELEQARSE 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1115 LHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANvsitqpestgepnnmtlLAEEARRLAERHKQEADD 1194
Cdd:COG4372 75 LEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE-----------------LQKERQDLEQQRKQLEAQ 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1195 IVRVAKTANETSAEaynllLRTLAGE-NQTALEIEELNRKYEQAKNisQDLEKQAARVHEEAKRAGDKAVEIY-ASVAQL 1272
Cdd:COG4372 138 IAELQSEIAEREEE-----LKELEEQlESLQEELAALEQELQALSE--AEAEQALDELLKEANRNAEKEEELAeAEKLIE 210
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1273 TPVDSEALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAK 1352
Cdd:COG4372 211 SLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEE 290
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1353 KGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAE 1428
Cdd:COG4372 291 AALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAG 366
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1070-1503 |
3.22e-08 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 58.68 E-value: 3.22e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQE--VKDVDQNLMDRLQRVNNSlHSQISRLHNiRNTIEETGILAE----RARSRVEStEQLI 1143
Cdd:NF041483 271 EEALREARAEAEKVVAEAKEaaAKQLASAESANEQRTRTA-KEEIARLVG-EATKEAEALKAEaeqaLADARAEA-EKLV 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1144 EIASrelEKAKMAAANVSITQ---PESTGEpNNMTLLAEEARRLAERHKQEADDIVRVAKT-ANETSAEAYNLL--LRTL 1217
Cdd:NF041483 348 AEAA---EKARTVAAEDTAAQlakAARTAE-EVLTKASEDAKATTRAAAEEAERIRREAEAeADRLRGEAADQAeqLKGA 423
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1218 AGEN------QTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQ--LTPVDSEALE------SE 1283
Cdd:NF041483 424 AKDDtkeyraKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARTAEelLTKAKADADElrstatAE 503
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1284 ANKIKKEAadLDRLIDQKlKDYEDLREDMRGKEHEVKNllekgKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEAND 1363
Cdd:NF041483 504 SERVRTEA--IERATTLR-RQAEETLERTRAEAERLRA-----EAEEQAEEVRAAAERAARELREETERAIAARQAEAAE 575
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1364 ILNNLK-DFDRRVndnkTAAEEAL----------RRIPA--INRTIAEANEKTR--------EAQLALGNAAADATEAKN 1422
Cdd:NF041483 576 ELTRLHtEAEERL----TAAEEALadaraeaeriRREAAeeTERLRTEAAERIRtlqaqaeqEAERLRTEAAADASAARA 651
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1423 K------------AHEAERIASGVQKNATSTKAD----AERTFSEVTD-LDNEVNGMLRQLEEAENELKRKQDDADQDMm 1485
Cdd:NF041483 652 EgenvavrlrseaAAEAERLKSEAQESADRVRAEaaaaAERVGTEAAEaLAAAQEEAARRRREAEETLGSARAEADQER- 730
|
490
....*....|....*...
gi 281371490 1486 maGMASQAAQEAELNARK 1503
Cdd:NF041483 731 --ERAREQSEELLASARK 746
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1047-1605 |
4.86e-08 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 58.52 E-value: 4.86e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1047 LQELESLIANLgtgdETVTDQA-FEDRLKEAEREVTDLLREAQEVKDVDQNLmdrlqrvnNSLHSQISRLHNIRNTIEET 1125
Cdd:TIGR01612 1145 IDEIKAQINDL----EDVADKAiSNDDPEEIEKKIENIVTKIDKKKNIYDEI--------KKLLNEIAEIEKDKTSLEEV 1212
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1126 G-----------------ILAERARSR--VESTEQLIE------IASRELE---------KAKMAAANVSITQPE----- 1166
Cdd:TIGR01612 1213 KginlsygknlgklflekIDEEKKKSEhmIKAMEAYIEdldeikEKSPEIEnemgiemdiKAEMETFNISHDDDKdhhii 1292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1167 STGEPNNMTLLAEEARRLAERHKQEAD--DIVRVAKTaNETSAEAYNLLLRTLAGENQTALEIEELNRkyeqAKNISQDL 1244
Cdd:TIGR01612 1293 SKKHDENISDIREKSLKIIEDFSEESDinDIKKELQK-NLLDAQKHNSDINLYLNEIANIYNILKLNK----IKKIIDEV 1367
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1245 EKQAARVHEEAKRAGDkavEIYASVAQLTPV-DSEALESEANKIKKEAADLDrlIDQKLKDYEDLREDMRGKEHEVKNLL 1323
Cdd:TIGR01612 1368 KEYTKEIEENNKNIKD---ELDKSEKLIKKIkDDINLEECKSKIESTLDDKD--IDECIKKIKELKNHILSEESNIDTYF 1442
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1324 EKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDI-LNNLKDFDRRVNDNKTAAEEAlrripainrtiAEANEK 1402
Cdd:TIGR01612 1443 KNADENNENVLLLFKNIEMADNKSQHILKIKKDNATNDHDFnINELKEHIDKSKGCKDEADKN-----------AKAIEK 1511
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1403 TREAqlaLGNAAADATEAKNKAHEAEriasgVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQ 1482
Cdd:TIGR01612 1512 NKEL---FEQYKKDVTELLNKYSALA-----IKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSEQKIKEIKKEKFRIED 1583
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1483 DMMMAGMASQAAQEAEL-------------NARKAKN-------------SVSSLLSQLTDLLEQLGQLDTVD--LNKLN 1534
Cdd:TIGR01612 1584 DAAKNDKSNKAAIDIQLslenfenkflkisDIKKKINdclketesiekkiSSFSIDSQDTELKENGDNLNSLQefLESLK 1663
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1535 EIEGSLNKAKDEMkvSDLDRKVSNLENEARKQEAailDYDRDIAEIIKDI-----HNLEDIKKTLP------TGCFNTPS 1603
Cdd:TIGR01612 1664 DQKKNIEDKKKEL--DELDSEIEKIEIDVDQHKK---NYEIGIIEKIKEIaiankEEIESIKELIEptienlISSFNTND 1738
|
..
gi 281371490 1604 IE 1605
Cdd:TIGR01612 1739 LE 1740
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
825-874 |
5.71e-08 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 50.43 E-value: 5.71e-08
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 281371490 825 PCQCNDNIDPNavGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNPLAPN 874
Cdd:cd00055 1 PCDCNGHGSLS--GQCDPGTGQCE-CKPNTTGRRCDRCAPGYYGLPSQGG 47
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1025-1522 |
7.01e-08 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 57.75 E-value: 7.01e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1025 GCqeCPACYRLVKDKvaehrvklQELESLIANLgtgdETVTdQAFEDRLKEAEREVTDLLREAQEV---KDVDQNLMDRL 1101
Cdd:TIGR00606 678 SC--CPVCQRVFQTE--------AELQEFISDL----QSKL-RLAPDKLKSTESELKKKEKRRDEMlglAPGRQSIIDLK 742
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1102 QRvnnslhsqisRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQpestgepNNMTLLAEEA 1181
Cdd:TIGR00606 743 EK----------EIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIME-------RFQMELKDVE 805
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1182 RRLAER-HKQEADDIVRVAKTANETSAEAYNlLLRTLAGENqtaleieELNRKyeqaknISQDLEKQAARVHEEAKRAGD 1260
Cdd:TIGR00606 806 RKIAQQaAKLQGSDLDRTVQQVNQEKQEKQH-ELDTVVSKI-------ELNRK------LIQDQQEQIQHLKSKTNELKS 871
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1261 KAVEIYASVAQltpvdSEALESEANKIKKEAADLDRLIDQK----LKDYEDLREDMRGKEHEV-KNLLEKGKAEQQTADQ 1335
Cdd:TIGR00606 872 EKLQIGTNLQR-----RQQFEEQLVELSTEVQSLIREIKDAkeqdSPLETFLEKDQQEKEELIsSKETSNKKAQDKVNDI 946
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1336 LlaradaakalaeeaakkgrntlQEANDILNNLKDFDRRVNDNKtaaeealrripainrtiaEANEKTREAQLALGNAAA 1415
Cdd:TIGR00606 947 K----------------------EKVKNIHGYMKDIENKIQDGK------------------DDYLKQKETELNTVNAQL 986
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1416 DATEAKNKAHEAERiasGVQKNATSTKADAERTfsevtdLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQ 1495
Cdd:TIGR00606 987 EECEKHQEKINEDM---RLMRQDIDTQKIQERW------LQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQ 1057
|
490 500
....*....|....*....|....*..
gi 281371490 1496 EAELNARKAKNSVSSLLSQLTDLLEQL 1522
Cdd:TIGR00606 1058 KLEENIDLIKRNHVLALGRQKGYEKEI 1084
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1067-1572 |
7.05e-08 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 57.53 E-value: 7.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1067 QAFEDRL-KEAEREVTDLL----REAQEVKDVDQNLMDRLQRvnnslhsQISRLhniRNTIEETgilAERARSrvESTEQ 1141
Cdd:NF041483 752 QAEAQRLvEEADRRATELVsaaeQTAQQVRDSVAGLQEQAEE-------EIAGL---RSAAEHA---AERTRT--EAQEE 816
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1142 LIEIASRELEKAKMAAANVSITQPESTGEpnnmtllAEEARRLAERHKQEA-DDIVRVAKTANETSAEAYNLLLRTLAGE 1220
Cdd:NF041483 817 ADRVRSDAYAERERASEDANRLRREAQEE-------TEAAKALAERTVSEAiAEAERLRSDASEYAQRVRTEASDTLASA 889
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1221 NQTALEIEELNRkyEQAKNISQDLEKQAARVHEEAKRAGDKAVEiyASVAQLTPVDSEALESEANKIKKEAADLDRLIDQ 1300
Cdd:NF041483 890 EQDAARTRADAR--EDANRIRSDAAAQADRLIGEATSEAERLTA--EARAEAERLRDEARAEAERVRADAAAQAEQLIAE 965
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1301 KLKDYEDLR--------------EDMRGKEHEVKNLLEkGKAEQQTADqllaradaakalaeeAAKKGRNTLQEANdiln 1366
Cdd:NF041483 966 ATGEAERLRaeaaetvgsaqqhaERIRTEAERVKAEAA-AEAERLRTE---------------AREEADRTLDEAR---- 1025
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1367 nlKDFDRRVNDnktAAEEALRripAINRTIAEANEKTREAQLALGNAAADATEAKNKA-----HEAERIAS--------- 1432
Cdd:NF041483 1026 --KDANKRRSE---AAEQADT---LITEAAAEADQLTAKAQEEALRTTTEAEAQADTMvgaarKEAERIVAeatvegnsl 1097
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1433 -------------GVQKNATSTKADAE----RTFSEVTDLDNevngmlRQLEEAENELKRKQDDADQdMMMAGMASQAAQ 1495
Cdd:NF041483 1098 vekartdadellvGARRDATAIRERAEelrdRITGEIEELHE------RARRESAEQMKSAGERCDA-LVKAAEEQLAEA 1170
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 281371490 1496 EAelnarKAKNSVSSLLSQLTdlleqlgqldTVDLNKLNEIEGSLNKAkdEMKVSDLDRKVSNLENEARKQEAAILD 1572
Cdd:NF041483 1171 EA-----KAKELVSDANSEAS----------KVRIAAVKKAEGLLKEA--EQKKAELVREAEKIKAEAEAEAKRTVE 1230
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1040-1562 |
1.23e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 56.90 E-value: 1.23e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1040 VAEHRVKLQELESLIANLGTGDETVTDQAFED--RLKEAEREVTDL--LREAQEVKDVDQNLMdrlqrvnNSLHSQISRL 1115
Cdd:TIGR00618 333 HVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVAtsIREISCQQHTLTqhIHTLQQQKTTLTQKL-------QSLCKELDIL 405
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1116 HNIRNTIEetgilAERARSRVESTEQLIEIASRELEKAKMAAANVSITQpESTGEPNNMTLLAEEARRLAER-------- 1187
Cdd:TIGR00618 406 QREQATID-----TRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITC-TAQCEKLEKIHLQESAQSLKEReqqlqtke 479
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1188 --HKQEADDIVRVAKTANET--------------SAEAYNLLL------RTLAGENQTALEIEELNRKYEQAknisQDLE 1245
Cdd:TIGR00618 480 qiHLQETRKKAVVLARLLELqeepcplcgscihpNPARQDIDNpgpltrRMQRGEQTYAQLETSEEDVYHQL----TSER 555
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1246 KQAARVHEEAKRAgDKAVEIYASVAQLTPVDSEALESEANKIKKEAADLDRLIDQKLkdyEDLREDMRGKEHEVkNLLEK 1325
Cdd:TIGR00618 556 KQRASLKEQMQEI-QQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLA---CEQHALLRKLQPEQ-DLQDV 630
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1326 GKAEQQTADQLLARADAAKALAEeaakkgrNTLQEANDI-LNNLKDFDRRVNDNKTAAEEAL----RRIPAINRTIAEAN 1400
Cdd:TIGR00618 631 RLHLQQCSQELALKLTALHALQL-------TLTQERVREhALSIRVLPKELLASRQLALQKMqsekEQLTYWKEMLAQCQ 703
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1401 EKTR--------------EAQLALGNAAADaTEAKNKAH-----EAERIASGVQKNATSTKADA-------ERTFSEVTD 1454
Cdd:TIGR00618 704 TLLRelethieeydrefnEIENASSSLGSD-LAAREDALnqslkELMHQARTVLKARTEAHFNNneevtaaLQTGAELSH 782
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1455 LDNEVNGMLRQLEEAENELKRKQD------DADQDMMMAGMASQAAQEAELNARKAKNsvSSLLSQLTDLLEQLG----Q 1524
Cdd:TIGR00618 783 LAAEIQFFNRLREEDTHLLKTLEAeigqeiPSDEDILNLQCETLVQEEEQFLSRLEEK--SATLGEITHQLLKYEecskQ 860
|
570 580 590 600
....*....|....*....|....*....|....*....|
gi 281371490 1525 LDTVDLN--KLNEIEGSLNkAKDEMKVSDLDRKVSNLENE 1562
Cdd:TIGR00618 861 LAQLTQEqaKIIQLSDKLN-GINQIKIQFDGDALIKFLHE 899
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1395-1588 |
1.60e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 55.61 E-value: 1.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1395 TIAEANEKTREAQLALGNAAADATEAKNKAHEAeriasgvQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELK 1474
Cdd:COG3883 10 TPAFADPQIQAKQKELSELQAELEAAQAELDAL-------QAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIE 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1475 RKQDDADQDMmmagmasQAAQEAELNArkakNSVSSLLSQ--LTDLLEQLGQLDTV-------------DLNKLNEIEGS 1539
Cdd:COG3883 83 ERREELGERA-------RALYRSGGSV----SYLDVLLGSesFSDFLDRLSALSKIadadadlleelkaDKAELEAKKAE 151
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1540 LNKAKDEM--KVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLE 1588
Cdd:COG3883 152 LEAKLAELeaLKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELE 202
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1226-1510 |
2.08e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 55.22 E-value: 2.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1226 EIEELNRKYEQAKNISQDLEKQAARVHEEAkragdkaveiyasvaqltpvdsEALESEANKIKKEAADLDRLIDQKLKDY 1305
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAEL----------------------EELNEEYNELQAELEALQAEIDKLQAEI 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1306 EDLREDMRGKEHEVKNLLekgKAEQQTADQLLARadaakalaeeaakkgrNTLQEANDIlnnlKDFDRRVNDNKTAAEEA 1385
Cdd:COG3883 75 AEAEAEIEERREELGERA---RALYRSGGSVSYL----------------DVLLGSESF----SDFLDRLSALSKIADAD 131
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1386 LRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKaheaeriasgvQKNATSTKADAErtfSEVTDLDNEVNGMLRQ 1465
Cdd:COG3883 132 ADLLEELKADKAELEAKKAELEAKLAELEALKAELEAA-----------KAELEAQQAEQE---ALLAQLSAEEAAAEAQ 197
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 281371490 1466 LEEAENELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNSVSS 1510
Cdd:COG3883 198 LAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 242
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1427-1589 |
2.11e-07 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 54.91 E-value: 2.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1427 AERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQdmmmagMASQAAQ-EAELNARKAK 1505
Cdd:COG4372 1 GDRLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQ------AREELEQlEEELEQARSE 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1506 -NSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEmkVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDI 1584
Cdd:COG4372 75 lEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEE--LEELQKERQDLEQQRKQLEAQIAELQSEIAEREEEL 152
|
....*
gi 281371490 1585 HNLED 1589
Cdd:COG4372 153 KELEE 157
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1105-1594 |
2.29e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 55.80 E-value: 2.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1105 NNSLHSQISRLHNIRNTIEETgilaerarsrVESTEQLIEIASRELEKAKmaaanvsiTQPESTGEPNNmtllaEEARRL 1184
Cdd:TIGR04523 213 NKSLESQISELKKQNNQLKDN----------IEKKQQEINEKTTEISNTQ--------TQLNQLKDEQN-----KIKKQL 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1185 AERHKQeaddivrvAKTANETSAEAYNLLlrtlageNQTALEIEELNRKYEQaknisqDLEKQaarVHEEAKRAGDKAVE 1264
Cdd:TIGR04523 270 SEKQKE--------LEQNNKKIKELEKQL-------NQLKSEISDLNNQKEQ------DWNKE---LKSELKNQEKKLEE 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1265 IYASVAQLTPVDSEaLESEANKIKKEAADL-------DRLIDQKLKDYEDLREDMRGKEHEVKNLlekgkaEQQTADQll 1337
Cdd:TIGR04523 326 IQNQISQNNKIISQ-LNEQISQLKKELTNSesensekQRELEEKQNEIEKLKKENQSYKQEIKNL------ESQINDL-- 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1338 aradaakalaeeaakkgRNTLQEaNDILNNLKDfdRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAqlalgnaaada 1417
Cdd:TIGR04523 397 -----------------ESKIQN-QEKLNQQKD--EQIKKLQQEKELLEKEIERLKETIIKNNSEIKDL----------- 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1418 teaKNKAHEAERIAsgvqKNATSTKADAERTFSEVTDLDNEVNgmlRQLEEAENELKRKQDDADqdmmmagMASQAAQEA 1497
Cdd:TIGR04523 446 ---TNQDSVKELII----KNLDNTRESLETQLKVLSRSINKIK---QNLEQKQKELKSKEKELK-------KLNEEKKEL 508
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1498 ELNARKAKNSVSSLLSQLTDLLEQLGQLDT--VDLN-KLNEIEGSLNKAKDEMKVSDLDRKVS-------NLENEARKQE 1567
Cdd:TIGR04523 509 EEKVKDLTKKISSLKEKIEKLESEKKEKESkiSDLEdELNKDDFELKKENLEKEIDEKNKEIEelkqtqkSLKKKQEEKQ 588
|
490 500
....*....|....*....|....*..
gi 281371490 1568 AAILDYDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:TIGR04523 589 ELIDQKEKEKKDLIKEIEEKEKKISSL 615
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1061-1283 |
3.72e-07 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 54.62 E-value: 3.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1061 DETVTDQAFEDRLKEAE--REVTDLL-REAQEVKDVDQNLMDRLQRvnnslhsqisrlhnirntieeTGILAERARSRVE 1137
Cdd:pfam05262 179 DKKVVEALREDNEKGVNfrRDMTDLKeRESQEDAKRAQQLKEELDK---------------------KQIDADKAQQKAD 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1138 STEQLIEIASRELEKAKMAAANVSitQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSAEAynlLLRTL 1217
Cdd:pfam05262 238 FAQDNADKQRDEVRQKQQEAKNLP--KPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFD---LKQES 312
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1218 AGENQTAlEIEELNRKYEQaKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESE 1283
Cdd:pfam05262 313 KASEKEA-EDKELEAQKKR-EPVAEDLQKTKPQVEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSE 376
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1069-1592 |
3.78e-07 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 55.23 E-value: 3.78e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1069 FEDRLKEAEREVtDLLREAQevKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETgilAERARSRVESTEQLIEIASR 1148
Cdd:pfam12128 352 WQSELENLEERL-KALTGKH--QDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREA---RDRQLAVAEDDLQALESELR 425
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1149 ELEKAKMAAANVSITQPESTGEPNNMTL---LAEEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQTAL 1225
Cdd:pfam12128 426 EQLEAGKLEFNEEEYRLKSRLGELKLRLnqaTATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQAS 505
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1226 E-IEELNRKYEQAKNISQDLEKQ----AARVHE-EAKRAGDKAVEI--YASVAQL-----TPVDSEALESEANKIKKEAA 1292
Cdd:pfam12128 506 EaLRQASRRLEERQSALDELELQlfpqAGTLLHfLRKEAPDWEQSIgkVISPELLhrtdlDPEVWDGSVGGELNLYGVKL 585
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1293 DLDRLidqKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFD 1372
Cdd:pfam12128 586 DLKRI---DVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRLFDEK 662
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1373 RRVNDNKT--------AAEEALRRIPA---------------INRTIAEAN----------EKTREAQLALGNAAADATE 1419
Cdd:pfam12128 663 QSEKDKKNkalaerkdSANERLNSLEAqlkqldkkhqawleeQKEQKREARtekqaywqvvEGALDAQLALLKAAIAARR 742
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1420 AKNKAHeaeriASGVQK-NATSTKA---DAERtfseVTDLDNEVNGMLRQLEEAEnelKRKQDDADQDMMMAGMASQAAQ 1495
Cdd:pfam12128 743 SGAKAE-----LKALETwYKRDLASlgvDPDV----IAKLKREIRTLERKIERIA---VRRQEVLRYFDWYQETWLQRRP 810
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1496 EAELNARKAKNSVSSLLSQLTDLLEqlgqlDT-VDLNKLNEIEGSLNKAKDEM-----KVSDLDRKVSNLENEARKQEAA 1569
Cdd:pfam12128 811 RLATQLSNIERAISELQQQLARLIA-----DTkLRRAKLEMERKASEKQQVRLsenlrGLRCEMSKLATLKEDANSEQAQ 885
|
570 580
....*....|....*....|....*.
gi 281371490 1570 I-LDYDRDIAEIIKDIHN--LEDIKK 1592
Cdd:pfam12128 886 GsIGERLAQLEDLKLKRDylSESVKK 911
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1355-1595 |
4.54e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 54.92 E-value: 4.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1355 RNTLQEANDILNNLKDFDRRVNDNKTAAE--EALRRIPAINRTIAEANEKTREAQLALgnAAADATEAKNKAHEAERIAS 1432
Cdd:COG4913 221 PDTFEAADALVEHFDDLERAHEALEDAREqiELLEPIRELAERYAAARERLAELEYLR--AALRLWFAQRRLELLEAELE 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1433 GVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEA----ENELKRKQDDAdqdmmmagmasqaaqEAELNARKAKnsv 1508
Cdd:COG4913 299 ELRAELARLEAELERLEARLDALREELDELEAQIRGNggdrLEQLEREIERL---------------ERELEERERR--- 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1509 sslLSQLTDLLEQLGQldtvdlnKLNEIEGSLNKAKDEMK--VSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHN 1586
Cdd:COG4913 361 ---RARLEALLAALGL-------PLPASAEEFAALRAEAAalLEALEEELEALEEALAEAEAALRDLRRELRELEAEIAS 430
|
....*....
gi 281371490 1587 LEDIKKTLP 1595
Cdd:COG4913 431 LERRKSNIP 439
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1046-1458 |
4.92e-07 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 54.53 E-value: 4.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1046 KLQELESLIANLGTGDetvTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEET 1125
Cdd:COG5278 119 WLAELEQVIALRRAGG---LEAALALVRSGEGKALMDEIRARLLLLALALAALLLAAAALLLLLLALAALLALAELLLLA 195
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1126 GILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANET 1205
Cdd:COG5278 196 LARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALALLLAALLLALLAALALAALLAAALLALAALLL 275
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1206 SAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESEAN 1285
Cdd:COG5278 276 ALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAAALAALLALALATALAAAAAALALLAALLAEAA 355
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1286 KIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDIL 1365
Cdd:COG5278 356 AAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAAAEAAAAAAAAAAASAAEALELAEALAEALALA 435
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1366 NNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADA 1445
Cdd:COG5278 436 EEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAALAEAEAAAALAAAAALSLALALAALLLAAAEA 515
|
410
....*....|...
gi 281371490 1446 ERTFSEVTDLDNE 1458
Cdd:COG5278 516 ALAAALAAALASA 528
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1070-1596 |
5.01e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.64 E-value: 5.01e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREV----TDLLREAQEVKDVDQNLMDRLQRVNNS------LHSQISRLHNiRNTIEETGIlaerarsrVEST 1139
Cdd:TIGR04523 32 DTEEKQLEKKLktikNELKNKEKELKNLDKNLNKDEEKINNSnnkikiLEQQIKDLND-KLKKNKDKI--------NKLN 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1140 EQLIEIASrelekakmaaanvsitqpESTGEPNNMTLLAEEARRLaERHKQEADdivrvaKTANETSAEAYNLllrtlag 1219
Cdd:TIGR04523 103 SDLSKINS------------------EIKNDKEQKNKLEVELNKL-EKQKKENK------KNIDKFLTEIKKK------- 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1220 ENQTAL---EIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDK--AVEIYASVAQLTPVDSEALESEANKIKKEAADL 1294
Cdd:TIGR04523 151 EKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKllKLELLLSNLKKKIQKNKSLESQISELKKQNNQL 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1295 DRLIDQKLKDYEDLredmrgkehevknllekgKAEQQTADQLLAradaakalaeeaakkgrNTLQEANDILNNLKDFDRR 1374
Cdd:TIGR04523 231 KDNIEKKQQEINEK------------------TTEISNTQTQLN-----------------QLKDEQNKIKKQLSEKQKE 275
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1375 VNDNKTAAEEALRRIPAINRTIAEANeKTREAQLaLGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTD 1454
Cdd:TIGR04523 276 LEQNNKKIKELEKQLNQLKSEISDLN-NQKEQDW-NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTN 353
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1455 LDNEVNGMLRQLEEAENELKRKQDDadqdmmmagmasqaaqeaelNARKaKNSVSSLLSQLTDLLEQLGQLDTVDLNKLN 1534
Cdd:TIGR04523 354 SESENSEKQRELEEKQNEIEKLKKE--------------------NQSY-KQEIKNLESQINDLESKIQNQEKLNQQKDE 412
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 281371490 1535 EIEgSLNKAKDEmkvsdLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKKTLPT 1596
Cdd:TIGR04523 413 QIK-KLQQEKEL-----LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLET 468
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1371-1593 |
5.15e-07 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 54.40 E-value: 5.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1371 FDRRVNDNK-TAAEEALRRIpaINRTIAEANEKTREAQLalgnaaadatEAKNKAHEAeriasgvqknatstKADAERtf 1449
Cdd:PRK12704 24 VRKKIAEAKiKEAEEEAKRI--LEEAKKEAEAIKKEALL----------EAKEEIHKL--------------RNEFEK-- 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 sEVTDLDNEVNGMLRQLEEAENELKRKQDDADQdmmmagmasqaaQEAELNARkaKNSVSSLLSQLTDLLEQLGQLDTVD 1529
Cdd:PRK12704 76 -ELRERRNELQKLEKRLLQKEENLDRKLELLEK------------REEELEKK--EKELEQKQQELEKKEEELEELIEEQ 140
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1530 LNKLNEIEG-SLNKAKDEMkvsdldrkVSNLENEARkqeaaildydRDIAEIIKDIHnlEDIKKT 1593
Cdd:PRK12704 141 LQELERISGlTAEEAKEIL--------LEKVEEEAR----------HEAAVLIKEIE--EEAKEE 185
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1049-1514 |
6.33e-07 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 54.45 E-value: 6.33e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1049 ELESLIANLGTGDETVTDQAFEDRLKEAEREVTDLLREAQE-VKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEE--- 1124
Cdd:NF041483 342 EAEKLVAEAAEKARTVAAEDTAAQLAKAARTAEEVLTKASEdAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEqlk 421
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1125 -------------TGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTgepnnmtllAEEARRLAERHKQE 1191
Cdd:NF041483 422 gaakddtkeyrakTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEA---------ARTAEELLTKAKAD 492
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETsaeaynllLRTLAGENQTAL--EIEE-LNRKYEQAKNISQDLEKQAARVHEEAKRAG-------DK 1261
Cdd:NF041483 493 ADELRSTATAESER--------VRTEAIERATTLrrQAEEtLERTRAEAERLRAEAEEQAEEVRAAAERAArelreetER 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1262 AVEIYASVA--QLTPVDSEALE-------------SEANKIKKEAAD-LDRLidqKLKDYEDLREDMRGKEHEVKNLLEK 1325
Cdd:NF041483 565 AIAARQAEAaeELTRLHTEAEErltaaeealadarAEAERIRREAAEeTERL---RTEAAERIRTLQAQAEQEAERLRTE 641
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1326 GKAEQQTADqllaradaakalaeeaakkgrntlQEANDILNNLKDfdrrvndnkTAAEEALRRipainRTIA-EANEKTR 1404
Cdd:NF041483 642 AAADASAAR------------------------AEGENVAVRLRS---------EAAAEAERL-----KSEAqESADRVR 683
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1405 -EAQLALGNAAADATEAKNKAH-EAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLR-QLEEAENELKRKQDDAD 1481
Cdd:NF041483 684 aEAAAAAERVGTEAAEALAAAQeEAARRRREAEETLGSARAEADQERERAREQSEELLASARkRVEEAQAEAQRLVEEAD 763
|
490 500 510
....*....|....*....|....*....|...
gi 281371490 1482 QdmmmagMASQAAQEAELNARKAKNSVSSLLSQ 1514
Cdd:NF041483 764 R------RATELVSAAEQTAQQVRDSVAGLQEQ 790
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1034-1579 |
6.56e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 54.41 E-value: 6.56e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKLQELESLIANLGT----GDETVTDqaFEDRLKEAERevtdlLREAQEvkdvdqNLMDRLQRVNNSLH 1109
Cdd:pfam01576 155 KLLEERISEFTSNLAEEEEKAKSLSKlknkHEAMISD--LEERLKKEEK-----GRQELE------KAKRKLEGESTDLQ 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1110 SQISRLhniRNTIEEtgILAERARSRVESTEQLieiASRELEKAKMAAANVSITQPESTGEPNNMTLLAEE-ARRLAERH 1188
Cdd:pfam01576 222 EQIAEL---QAQIAE--LRAQLAKKEEELQAAL---ARLEEETAQKNNALKKIRELEAQISELQEDLESERaARNKAEKQ 293
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1189 KQEADDIVRVAKTANETSAEAYN--------------LLLRTLAGENQT------------ALEIEELNRKYEQAKNISQ 1242
Cdd:pfam01576 294 RRDLGEELEALKTELEDTLDTTAaqqelrskreqevtELKKALEEETRSheaqlqemrqkhTQALEELTEQLEQAKRNKA 373
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1243 DLEK-------------------QAARVHEEAKR-----------------------AGDKAVEIYASVAQLTPVDSEAl 1280
Cdd:pfam01576 374 NLEKakqalesenaelqaelrtlQQAKQDSEHKRkklegqlqelqarlseserqraeLAEKLSKLQSELESVSSLLNEA- 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1281 ESEANKIKKEAADLD-RLID----------QKL------KDYED----LREDMRGKEHEVKNL----------------- 1322
Cdd:pfam01576 453 EGKNIKLSKDVSSLEsQLQDtqellqeetrQKLnlstrlRQLEDernsLQEQLEEEEEAKRNVerqlstlqaqlsdmkkk 532
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1323 ----------LEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEAND-----------ILNNL----KDFDRRVND 1377
Cdd:pfam01576 533 leedagtleaLEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDdllvdldhqrqLVSNLekkqKKFDQMLAE 612
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1378 NKTA----AEEAlrripaiNRTIAEANEKTREAqLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSEVT 1453
Cdd:pfam01576 613 EKAIsaryAEER-------DRAEAEAREKETRA-LSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELE 684
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1454 D----LDNEVNGMLRQLEEAENELKrKQDDADQDMMMAGMASQAAQEAELNARKAKNSVS--SLLSQLTDLLEQLgqldt 1527
Cdd:pfam01576 685 RskraLEQQVEEMKTQLEELEDELQ-ATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKrrQLVKQVRELEAEL----- 758
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*....
gi 281371490 1528 vDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNL---ENEARKQ----EAAILDYDRDIAE 1579
Cdd:pfam01576 759 -EDERKQRAQAVAAKKKLELDLKELEAQIDAAnkgREEAVKQlkklQAQMKDLQRELEE 816
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
1192-1333 |
8.71e-07 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 52.76 E-value: 8.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETSAEAYNLLLRtLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKaveiYASVAQ 1271
Cdd:cd22656 75 AGDIYNYAQNAGGTIDSYYAEILE-LIDDLADATDDEELEEAKKTIKALLDDLLKEAKKYQDKAAKVVDK----LTDFEN 149
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1272 LTPVDSEALESEANKIKK---------EAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTA 1333
Cdd:cd22656 150 QTEKDQTALETLEKALKDlltdeggaiARKEIKDLQKELEKLNEEYAAKLKAKIDELKALIADDEAKLAAA 220
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1071-1335 |
8.88e-07 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 53.92 E-value: 8.88e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1071 DRLKEAEREVTDLLREAQEVKdvdqNLMDRLQRVNNSLHSQISRLHNIRNTIEETgilAERARSRVESTEQLIEIASREL 1150
Cdd:TIGR02169 681 ERLEGLKRELSSLQSELRRIE----NRLDELSQELSDASRKIGEIEKEIEQLEQE---EEKLKERLEELEEDLSSLEQEI 753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1151 E--KAKMAAANVSITQPESTgepnnmtlLAEEARRLAERHKQEADDIVR-VAKTANETSAEAYNLLLRTLAGENqtalEI 1227
Cdd:TIGR02169 754 EnvKSELKELEARIEELEED--------LHKLEEALNDLEARLSHSRIPeIQAELSKLEEEVSRIEARLREIEQ----KL 821
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1228 EELNRKYEQAKNISQDLEKQaaRVHEEAKRAGDKAvEIYASVAQLtpvdsEALESEANKIKKEAADLD-RLIDQKlKDYE 1306
Cdd:TIGR02169 822 NRLTLEKEYLEKEIQELQEQ--RIDLKEQIKSIEK-EIENLNGKK-----EELEEELEELEAALRDLEsRLGDLK-KERD 892
|
250 260
....*....|....*....|....*....
gi 281371490 1307 DLREDMRgkehEVKNLLEKGKAEQQTADQ 1335
Cdd:TIGR02169 893 ELEAQLR----ELERKIEELEAQIEKKRK 917
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1052-1310 |
9.13e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.23 E-value: 9.13e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1052 SLIANLGTGDETVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEETgilaer 1131
Cdd:COG4942 8 ALLLALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL------ 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1132 aRSRVESTEQLIEIASRELEKAKMAAANVSITQpESTGEPNNMTLLAeearrlaerHKQEADDIVRvaktanetSAEAYN 1211
Cdd:COG4942 82 -EAELAELEKEIAELRAELEAQKEELAELLRAL-YRLGRQPPLALLL---------SPEDFLDAVR--------RLQYLK 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1212 LLLRTLAGE-NQTALEIEELNRKYEQAKNISQDLEKQAARVhEEAKRAGDKAVEIYASVAQLTPVDSEALESEANKIKKE 1290
Cdd:COG4942 143 YLAPARREQaEELRADLAELAALRAELEAERAELEALLAEL-EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
|
250 260
....*....|....*....|
gi 281371490 1291 AADLDRLIDQKLKDYEDLRE 1310
Cdd:COG4942 222 AEELEALIARLEAEAAAAAE 241
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1359-1525 |
1.13e-06 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 52.91 E-value: 1.13e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 QEANDILNNLKDFDRRVNDNKTAAEEAL-------RRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIA 1431
Cdd:COG3883 30 AELEAAQAELDALQAELEELNEEYNELQaelealqAEIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLDVLL 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1432 sgvqkNATSTKADAER--TFSEVTDLDNEvngMLRQLEEAENELKRKQDDAD--QDMMMAGMASQAAQEAELNARKAKNS 1507
Cdd:COG3883 110 -----GSESFSDFLDRlsALSKIADADAD---LLEELKADKAELEAKKAELEakLAELEALKAELEAAKAELEAQQAEQE 181
|
170 180
....*....|....*....|
gi 281371490 1508 --VSSLLSQLTDLLEQLGQL 1525
Cdd:COG3883 182 alLAQLSAEEAAAEAQLAEL 201
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1067-1579 |
1.16e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 53.64 E-value: 1.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1067 QAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQ---RVNNSLHSQISRLHNIRNTIEEtgILAErARSRVESTEQli 1143
Cdd:pfam01576 15 QKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQaetELCAEAEEMRARLAARKQELEE--ILHE-LESRLEEEEE-- 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1144 eiASREL--EKAKMA--------------AANVSITQPESTGEP------NNMTLLAEEARRLAERHKQEADDIVRVAKT 1201
Cdd:pfam01576 90 --RSQQLqnEKKKMQqhiqdleeqldeeeAARQKLQLEKVTTEAkikkleEDILLLEDQNSKLSKERKLLEERISEFTSN 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1202 ANETSAEAYNLL-LRT------------LAGENQTALEIEELNRKYE--------------------QAKNISQDLEKQA 1248
Cdd:pfam01576 168 LAEEEEKAKSLSkLKNkheamisdleerLKKEEKGRQELEKAKRKLEgestdlqeqiaelqaqiaelRAQLAKKEEELQA 247
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1249 A--RVHEE--AKRAGDKAV-EIYASVAQLtpvdSEALESEA---NKIKKEAADLDRLIDQKLKDYED------LREDMRG 1314
Cdd:pfam01576 248 AlaRLEEEtaQKNNALKKIrELEAQISEL----QEDLESERaarNKAEKQRRDLGEELEALKTELEDtldttaAQQELRS 323
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1315 K-EHEV----KNLLEKGKA-EQQTAD--QLLARADAAKALAEEAAKKGRNTLQEANDIL----NNLKDFDRRVNDNKTAA 1382
Cdd:pfam01576 324 KrEQEVtelkKALEEETRShEAQLQEmrQKHTQALEELTEQLEQAKRNKANLEKAKQALesenAELQAELRTLQQAKQDS 403
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1383 EEALRRIPA-----------INRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQknatSTKADAERTFSE 1451
Cdd:pfam01576 404 EHKRKKLEGqlqelqarlseSERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLE----SQLQDTQELLQE 479
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1452 VT-----------DLDNEVNGMLRQLEEAE-----------------NELKRKQDDadqdmmMAGmASQAAQEAElnaRK 1503
Cdd:pfam01576 480 ETrqklnlstrlrQLEDERNSLQEQLEEEEeakrnverqlstlqaqlSDMKKKLEE------DAG-TLEALEEGK---KR 549
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1504 AKNSVSSLLSQLTdllEQLGQLDtvdlnKLNEIEGSLNKAKDEMKVsDLDRK---VSNLENEARK------QEAAI---L 1571
Cdd:pfam01576 550 LQRELEALTQQLE---EKAAAYD-----KLEKTKNRLQQELDDLLV-DLDHQrqlVSNLEKKQKKfdqmlaEEKAIsarY 620
|
....*...
gi 281371490 1572 DYDRDIAE 1579
Cdd:pfam01576 621 AEERDRAE 628
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
283-328 |
1.20e-06 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 46.96 E-value: 1.20e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 281371490 283 RCKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRPWRR 328
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQCECKPNTTGRRCDRCAPGYYGLPSQG 46
|
|
| MDN1 |
COG5271 |
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ... |
1049-1568 |
2.38e-06 |
|
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444083 [Multi-domain] Cd Length: 1028 Bit Score: 52.71 E-value: 2.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1049 ELESLIANLGTGDETVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLhSQISRLHNIRNTIEETGIL 1128
Cdd:COG5271 156 LPSLADNDEAAADEGDELAADGDDTLAVADAIEATPGGTDAVELTATLGATVTTDPGDSV-AADDDLAAEEGASAVVEEE 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1129 AERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSAE 1208
Cdd:COG5271 235 DASEDAVAAADETLLADDDDTESAGATAEVGGTPDTDDEATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAA 314
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1209 AYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESEANKIK 1288
Cdd:COG5271 315 LEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATAEDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADD 394
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1289 KEAADLDRLIDQKLKDYEDLREdmrgkEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNL 1368
Cdd:COG5271 395 SADDEEASADGGTSPTSDTDEE-----EEEADEDASAGETEDESTDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDT 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1369 KDFDRRVNDNKTAAEEALR--RIPAINRTIAEANEKTREAQLALGNA-AADATEAKNKAHEAERIASGVQKNATSTKADA 1445
Cdd:COG5271 470 ESAEEDADGDEATDEDDASddGDEEEAEEDAEAEADSDELTAEETSAdDGADTDAAADPEDSDEDALEDETEGEENAPGS 549
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1446 ERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQL 1525
Cdd:COG5271 550 DQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADESEEAEASEDEAAEEEEADDDEADADADGAADEEETE 629
|
490 500 510 520
....*....|....*....|....*....|....*....|...
gi 281371490 1526 DTVDLNKLNEIEGSLNKAKDEMkvSDLDRKVSNLENEARKQEA 1568
Cdd:COG5271 630 EEAAEDEAAEPETDASEAADED--ADAETEAEASADESEEEAE 670
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1208-1595 |
3.50e-06 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 51.90 E-value: 3.50e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1208 EAYNLLLRTLAGENQTALEIEELNRKY----EQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESE 1283
Cdd:pfam02463 173 EALKKLIEETENLAELIIDLEELKLQElklkEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEE 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1284 ANKIKKEAADLDRLIDQKLKDYEdlredmrgKEHEVKNLLEKGKAEQQTAdqllaradaakalaeeaakkgRNTLQEAND 1363
Cdd:pfam02463 253 IESSKQEIEKEEEKLAQVLKENK--------EEEKEKKLQEEELKLLAKE---------------------EEELKSELL 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1364 ILNNLKDFDRRV-----NDNKTAAEEALRRIPAINRTIAEANEKTReAQLALGNAAADATEAKNKAHEAE-RIASGVQKN 1437
Cdd:pfam02463 304 KLERRKVDDEEKlkeseKEKKKAEKELKKEKEEIEELEKELKELEI-KREAEEEEEEELEKLQEKLEQLEeELLAKKKLE 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1438 ATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNsvsSLLSQLTD 1517
Cdd:pfam02463 383 SERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKE---ELEKQELK 459
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 1518 LLEQLGQLDTVDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAeiiKDIHNLEDIKKTLP 1595
Cdd:pfam02463 460 LLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGV---GGRIISAHGRLGDL 534
|
|
| Laminin_I |
pfam06008 |
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ... |
1385-1568 |
4.30e-06 |
|
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
Pssm-ID: 310534 [Multi-domain] Cd Length: 258 Bit Score: 50.10 E-value: 4.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1385 ALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNK----AHEAERIASGVQKN---ATSTKADAERTFSEVTDLDN 1457
Cdd:pfam06008 10 ALPAPYKINYNLENLTKQLQEYLSPENAHKIQIEILEKElsslAQETEELQKKATQTlakAQQVNAESERTLGHAKELAE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1458 EVNGMLRQLEE---------------AENELKRKQDDADQdmMMAGMAS----QAAQEAELNARKAKNSVSSLLSQLTDL 1518
Cdd:pfam06008 90 AIKNLIDNIKEinekvatlgendfalPSSDLSRMLAEAQR--MLGEIRSrdfgTQLQNAEAELKAAQDLLSRIQTWFQSP 167
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1519 LEQLGQLDTVDLNKLNEIEGSLNKAKD-----EMKVSDLDRKvsNLENEARKQEA 1568
Cdd:pfam06008 168 QEENKALANALRDSLAEYEAKLSDLREllreaAAKTRDANRL--NLANQANLREF 220
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1128-1597 |
5.13e-06 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 51.19 E-value: 5.13e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1128 LAERARSRVESTEQLIEIASRE-LEKAKMAAANVSITQPEstgepnnmtLLAEEARRLAERHKQEADDIVRVAKTANETS 1206
Cdd:COG3064 31 AEQKAKEEAEEERLAELEAKRQaEEEAREAKAEAEQRAAE---------LAAEAAKKLAEAEKAAAEAEKKAAAEKAKAA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1207 AEAYNLLlrtlagENQTALEIEElNRKYEQAKnisQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESEANK 1286
Cdd:COG3064 102 KEAEAAA------AAEKAAAAAE-KEKAEEAK---RKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAA 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1287 IKKEAADLDRLIDQKLKDYEdlredmRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILN 1366
Cdd:COG3064 172 ARAAAGAAAALVAAAAAAVE------AADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAAL 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1367 NLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEkTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKADAE 1446
Cdd:COG3064 246 GGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVV-VAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAA 324
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1447 RTfsevtdlDNEVNGMLRQLEEAENELKRKQDDADQDmmmAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLD 1526
Cdd:COG3064 325 GA-------LVVRGGGAASLEAALSLLAAGAAAAAAG---AGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAA 394
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1527 TVDLNKLNeIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKKTLPTG 1597
Cdd:COG3064 395 AGGGGLLG-LRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLG 464
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1037-1584 |
5.73e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.45 E-value: 5.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1037 KDKVAEHRVKLQELESLIANLGTGDETvTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVN----------N 1106
Cdd:COG4913 351 ERELEERERRRARLEALLAALGLPLPA-SAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRrelreleaeiA 429
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1107 SLHSQIS----RLHNIRNTI-EETGILAERARSRVEsteqLIEIASRE------LEKA---------------KMAAANV 1160
Cdd:COG4913 430 SLERRKSnipaRLLALRDALaEALGLDEAELPFVGE----LIEVRPEEerwrgaIERVlggfaltllvppehyAAALRWV 505
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1161 sitqpestgEPNNM-----TLLAEEARRLAERHKQEADDIVRVAKTAnETSAEAYnllLRTLAGENQTAL---EIEELnR 1232
Cdd:COG4913 506 ---------NRLHLrgrlvYERVRTGLPDPERPRLDPDSLAGKLDFK-PHPFRAW---LEAELGRRFDYVcvdSPEEL-R 571
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1233 KYEQAknISQD-LEKQAARVHEeaKRAGDKAVEIYA----SVAQLtpvdsEALESEANKIKKEAADLDRLID-------- 1299
Cdd:COG4913 572 RHPRA--ITRAgQVKGNGTRHE--KDDRRRIRSRYVlgfdNRAKL-----AALEAELAELEEELAEAEERLEaleaelda 642
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1300 --------QKLKDYEDLREDMRGKEHEVKNLlekgkaEQQtadqllaradaakalaeeaakkgRNTLQEANDILNNLKdf 1371
Cdd:COG4913 643 lqerrealQRLAEYSWDEIDVASAEREIAEL------EAE-----------------------LERLDASSDDLAALE-- 691
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1372 dRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALgNAAADATEAKNKAHEAERIASGVQKNATSTKADAERTFSE 1451
Cdd:COG4913 692 -EQLEELEAELEELEEELDELKGEIGRLEKELEQAEEEL-DELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRE 769
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1452 vtDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAelnarkaknsvssllsqLTDLLEQLGQLDTVDL- 1530
Cdd:COG4913 770 --NLEERIDALRARLNRAEEELERAMRAFNREWPAETADLDADLES-----------------LPEYLALLDRLEEDGLp 830
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1531 NKLNEIEGSLNKAKDEmKVSDLDRKvsnLENEArkqeaaildydRDIAEIIKDI 1584
Cdd:COG4913 831 EYEERFKELLNENSIE-FVADLLSK---LRRAI-----------REIKERIDPL 869
|
|
| PRK07352 |
PRK07352 |
F0F1 ATP synthase subunit B; Validated |
1387-1483 |
5.97e-06 |
|
F0F1 ATP synthase subunit B; Validated
Pssm-ID: 180941 [Multi-domain] Cd Length: 174 Bit Score: 48.41 E-value: 5.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1387 RRIPAINRTIAEANEKTREAQLALGNAAADATEAKnkaHEAERIASGVQKNATSTKADAE-RTFSEV--------TDLDN 1457
Cdd:PRK07352 50 ERREAILQALKEAEERLRQAAQALAEAQQKLAQAQ---QEAERIRADAKARAEAIRAEIEkQAIEDMarlkqtaaADLSA 126
|
90 100 110
....*....|....*....|....*....|....*
gi 281371490 1458 E---VNGMLRQ------LEEAENELKRKQDDADQD 1483
Cdd:PRK07352 127 EqerVIAQLRReaaelaIAKAESQLPGRLDEDAQQ 161
|
|
| Tropomyosin |
pfam00261 |
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ... |
1355-1585 |
7.94e-06 |
|
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.
Pssm-ID: 459736 [Multi-domain] Cd Length: 235 Bit Score: 48.87 E-value: 7.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1355 RNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGV 1434
Cdd:pfam00261 4 QQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEELERTEERLAEALEKLEEAEKAADESERGRKVL 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1435 QknaTSTKADAERtfsevtdldnevngmLRQLEEAENELKRKQDDADQDMMMAG----MASQAAQEAELNARKAKNSVSS 1510
Cdd:pfam00261 84 E---NRALKDEEK---------------MEILEAQLKEAKEIAEEADRKYEEVArklvVVEGDLERAEERAELAESKIVE 145
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1511 LLSQLTDLLEQLGQLDtVDLNKLNEIEGSL-NKAKD--------EMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEII 1581
Cdd:pfam00261 146 LEEELKVVGNNLKSLE-ASEEKASEREDKYeEQIRFlteklkeaETRAEFAERSVQKLEKEVDRLEDELEAEKEKYKAIS 224
|
....
gi 281371490 1582 KDIH 1585
Cdd:pfam00261 225 EELD 228
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1036-1335 |
8.41e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.79 E-value: 8.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1036 VKDKVAEHRVKLQELESLIANLGT--GDETVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNlmdrlqrvnnsLHSQIS 1113
Cdd:TIGR04523 326 IQNQISQNNKIISQLNEQISQLKKelTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKN-----------LESQIN 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1114 RLhniRNTIEEtgilaerarsrVESTEQLIEIASRELEKAKmaaanvsitqpestgepnnmTLLAEEARRLAERHKQEAD 1193
Cdd:TIGR04523 395 DL---ESKIQN-----------QEKLNQQKDEQIKKLQQEK--------------------ELLEKEIERLKETIIKNNS 440
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1194 DIVRVAKTANETSAEAYNLLLRTLAGENQtaleIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLT 1273
Cdd:TIGR04523 441 EIKDLTNQDSVKELIIKNLDNTRESLETQ----LKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLT 516
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 1274 PVDS------EALESEANKIKKEAADLDRLI---DQKLKdYEDLREDMRGK-------EHEVKNLLEKGKAEQQTADQ 1335
Cdd:TIGR04523 517 KKISslkekiEKLESEKKEKESKISDLEDELnkdDFELK-KENLEKEIDEKnkeieelKQTQKSLKKKQEEKQELIDQ 593
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1040-1333 |
9.00e-06 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 50.59 E-value: 9.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1040 VAEHRVKLQELESLIAN-----LGTGDETVTDQA----FEDRLKEAEREVTDLLRE----AQEVKDVDQ---NLMDRLQr 1103
Cdd:pfam10174 347 VDALRLRLEEKESFLNKktkqlQDLTEEKSTLAGeirdLKDMLDVKERKINVLQKKienlQEQLRDKDKqlaGLKERVK- 425
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1104 vnnSLHSQISRLHNIRNTIEETgiLAE-----------RARSRVESTEQLiEIASRELEKAKmaaANVSITQPESTGEPN 1172
Cdd:pfam10174 426 ---SLQTDSSNTDTALTTLEEA--LSEkeriierlkeqREREDRERLEEL-ESLKKENKDLK---EKVSALQPELTEKES 496
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1173 NMTLLAEEARRLAERHKQEaDDIVRVAKTANETSAEAYNlllrTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVH 1252
Cdd:pfam10174 497 SLIDLKEHASSLASSGLKK-DSKLKSLEIAVEQKKEECS----KLENQLKKAHNAEEAVRTNPEINDRIRLLEQEVARYK 571
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1253 EEAKRAgdkaveiYASVAQLTPVDSEAlESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVK----NLLEKGKA 1328
Cdd:pfam10174 572 EESGKA-------QAEVERLLGILREV-ENEKNDKDKKIAELESLTLRQMKEQNKKVANIKHGQQEMKkkgaQLLEEARR 643
|
....*
gi 281371490 1329 EQQTA 1333
Cdd:pfam10174 644 REDNL 648
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1039-1606 |
1.08e-05 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.40 E-value: 1.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1039 KVAEHRVKLQELESLIANLGTGDETVTDQAFEDRLKEaereVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLhni 1118
Cdd:TIGR04523 275 ELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKE----LKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQL--- 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1119 rntieetgilaERARSRVESTeqlieiasrelekakmaaaNVSITqpestgepnnmtllaeeaRRLAERHKQeaddIVRV 1198
Cdd:TIGR04523 348 -----------KKELTNSESE-------------------NSEKQ------------------RELEEKQNE----IEKL 375
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1199 AKTANETSAEAYNLllrtlagENQtaleIEELNRKYEQAKNISQDLEKQAarvheeakragdkaveiyasvaqltpvdsE 1278
Cdd:TIGR04523 376 KKENQSYKQEIKNL-------ESQ----INDLESKIQNQEKLNQQKDEQI-----------------------------K 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1279 ALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNlLEKGKAEQQTadqllaradaakalaeeaakkgrntl 1358
Cdd:TIGR04523 416 KLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKN-LDNTRESLET-------------------------- 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 qeandilnNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKaheaeriasgvQKNA 1438
Cdd:TIGR04523 469 --------QLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEK-----------IEKL 529
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1439 TSTKADAErtfSEVTDLDNEVNGMLRQLEeaENELKRKQDDADQdmmmagmasqaaqeaELNarKAKNSVSSLLS---QL 1515
Cdd:TIGR04523 530 ESEKKEKE---SKISDLEDELNKDDFELK--KENLEKEIDEKNK---------------EIE--ELKQTQKSLKKkqeEK 587
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1516 TDLLEQLgQLDTVDLNKlnEIEgslnkaKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNledIKKTLP 1595
Cdd:TIGR04523 588 QELIDQK-EKEKKDLIK--EIE------EKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQ---IKETIK 655
|
570
....*....|.
gi 281371490 1596 TGCFNTPSIEK 1606
Cdd:TIGR04523 656 EIRNKWPEIIK 666
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1040-1583 |
1.08e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 50.30 E-value: 1.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1040 VAEHRVKLQELESLIANLGTGDETVTDQAFEDRLKEAEREVTDLLREAQEVKDvdqnLMDRLQRVNNSLHSQISRLHNIR 1119
Cdd:COG4913 257 IRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEA----ELERLEARLDALREELDELEAQI 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1120 NTIEetGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEpnnMTLLAEEARRLAERHKQEADDiVRVA 1199
Cdd:COG4913 333 RGNG--GDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEE---FAALRAEAAALLEALEEELEA-LEEA 406
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1200 KTANETSAEAYNLLLRTLAGEnqtaleIEELNRkyeQAKNISQDLekQAARvHEEAKRAGDKAVEIyasvaqltPVDSEA 1279
Cdd:COG4913 407 LAEAEAALRDLRRELRELEAE------IASLER---RKSNIPARL--LALR-DALAEALGLDEAEL--------PFVGEL 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1280 LEseankIKKEAAD-------------LDRLIDQKlkDYEDLR---EDMRGKEHEVKNLLEKGKAEQQTADqllaradaa 1343
Cdd:COG4913 467 IE-----VRPEEERwrgaiervlggfaLTLLVPPE--HYAAALrwvNRLHLRGRLVYERVRTGLPDPERPR--------- 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1344 kalaeeaakKGRNTL------------QEANDILNN------------LKDFDRRVN------DNKTAAE----EALRRI 1389
Cdd:COG4913 531 ---------LDPDSLagkldfkphpfrAWLEAELGRrfdyvcvdspeeLRRHPRAITragqvkGNGTRHEkddrRRIRSR 601
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1390 PAI---NRT-IAEANEKTREAQLALGNAAADATEAKNkaheaeriasgvQKNATSTKADAERTFSEVTDLDNEVNGMLRQ 1465
Cdd:COG4913 602 YVLgfdNRAkLAALEAELAELEEELAEAEERLEALEA------------ELDALQERREALQRLAEYSWDEIDVASAERE 669
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1466 LEEAENELKRkQDDADQDMmmagmasqaaQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDtvdlNKLNEIEGSLNKAKD 1545
Cdd:COG4913 670 IAELEAELER-LDASSDDL----------AALEEQLEELEAELEELEEELDELKGEIGRLE----KELEQAEEELDELQD 734
|
570 580 590
....*....|....*....|....*....|....*....
gi 281371490 1546 EM-KVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKD 1583
Cdd:COG4913 735 RLeAAEDLARLELRALLEERFAAALGDAVERELRENLEE 773
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1047-1326 |
1.52e-05 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 49.79 E-value: 1.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1047 LQELESLIANLGTGDETVTDQafedrLKEAEREVTDLLREAQEVKdvdqnlMDRLQRVNNSLHSQiSRLHNIRNTI---E 1123
Cdd:pfam01576 779 LKELEAQIDAANKGREEAVKQ-----LKKLQAQMKDLQRELEEAR------ASRDEILAQSKESE-KKLKNLEAELlqlQ 846
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1124 ETGILAERARSRV--ESTEQLIEIASRELEKAKMA------AANVSITQPESTGEPNNMTLLAEEARRLAerhkQEADDI 1195
Cdd:pfam01576 847 EDLAASERARRQAqqERDELADEIASGASGKSALQdekrrlEARIAQLEEELEEEQSNTELLNDRLRKST----LQVEQL 922
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1196 vrvaktANETSAEaynlllRTLAGENQTALEieELNRKYEQAKNISQDLEKQAARVHeeakRAGDKAVEiyASVAQLtpv 1275
Cdd:pfam01576 923 ------TTELAAE------RSTSQKSESARQ--QLERQNKELKAKLQEMEGTVKSKF----KSSIAALE--AKIAQL--- 979
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 281371490 1276 dSEALESEAnKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKG 1326
Cdd:pfam01576 980 -EEQLEQES-RERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQAEKG 1028
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1359-1592 |
1.53e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 49.13 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 QEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNA 1438
Cdd:COG4372 45 EELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKER 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1439 TSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQddadqdmmmAGMASQAAQEAELNARKAKNSVSSLLSQltdL 1518
Cdd:COG4372 125 QDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQ---------EELAALEQELQALSEAEAEQALDELLKE---A 192
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1519 LEQLGQLDTVDLNKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKK 1592
Cdd:COG4372 193 NRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELA 266
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
1034-1594 |
1.78e-05 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 49.83 E-value: 1.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1034 RLVKDKVAEHRVKlQELESLIANLgtgdeTVTDQAFEDRLKEAEREVTDLLReaQEVKDVDQNLMDRLQRVNNSLHSQIS 1113
Cdd:PTZ00440 985 KLDKEKDEWEHFK-SEIDKLNVNY-----NILNKKIDDLIKKQHDDIIELID--KLIKEKGKEIEEKVDQYISLLEKMKT 1056
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1114 RLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVSITQPEstgepnNMTLLAEEARRLAERHKQEAD 1193
Cdd:PTZ00440 1057 KLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHE------HVVNADKEKNKQTEHYNKKKK 1130
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1194 DIVRVAKTANETSAEaynllLRTLAGENQTALEIEELNRKYEQAKnISQDLEKqaarVHEEAKRAGDKAVEIYASVAQLT 1273
Cdd:PTZ00440 1131 SLEKIYKQMEKTLKE-----LENMNLEDITLNEVNEIEIEYERIL-IDHIVEQ----INNEAKKSKTIMEEIESYKKDID 1200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1274 PVDsealeseANKIKKEAADLDRLidqklkDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKK 1353
Cdd:PTZ00440 1201 QVK-------KNMSKERNDHLTTF------EYNAYYDKATASYENIEELTTEAKGLKGEANRSTNVDELKEIKLQVFSYL 1267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1354 GR-----NTLQEANDILNNLKDFDRRVNDNKTAAEealrripaINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAE 1428
Cdd:PTZ00440 1268 QQvikenNKMENALHEIKNMYEFLISIDSEKILKE--------ILNSTKKAEEFSNDAKKELEKTDNLIKQVEAKIEQAK 1339
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1429 RiasgvQKNATSTKADAERtfsevtdLDNEVNgMLRQLEEaenELKRKQDDadqdMMMAGMASQAAQE-AELNARKAKNS 1507
Cdd:PTZ00440 1340 E-----HKNKIYGSLEDKQ-------IDDEIK-KIEQIKE---EISNKRKE----INKYLSNIKSNKEkCDLHVRNASRG 1399
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1508 VSsllsqLTDLLEQLGQLDTVDLNKLN--EIEGSLNKAKDEMKvsdldrKVSNLENEARKQEAAILDYDRDIAEIIKDIH 1585
Cdd:PTZ00440 1400 KD-----KIDFLNKHEAIEPSNSKEVNiiKITDNINKCKQYSN------EAMETENKADENNDSIIKYEKEITNILNNSS 1468
|
....*....
gi 281371490 1586 NLEdIKKTL 1594
Cdd:PTZ00440 1469 ILG-KKTKL 1476
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
1042-1323 |
2.33e-05 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 49.19 E-value: 2.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1042 EHRVKLQELESLIANLgtgdetvtdQAFEDRLKEAEREVTDLLREaqevkDVDQN--LMDRLQRVNNSLHSQISRLH-NI 1118
Cdd:COG5185 233 EALKGFQDPESELEDL---------AQTSDKLEKLVEQNTDLRLE-----KLGENaeSSKRLNENANNLIKQFENTKeKI 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1119 RNTIEETGILAE---------RARSRVESTEQLIEIASRELE-KAKMAAANVSITQPESTGEPNNMTLLAEEARRLAERH 1188
Cdd:COG5185 299 AEYTKSIDIKKAtesleeqlaAAEAEQELEESKRETETGIQNlTAEIEQGQESLTENLEAIKEEIENIVGEVELSKSSEE 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1189 KQEADDIVRVAKTANE----TSAEAYNLLLRTLA---GENQTALE-----IEELNRKYEQAKNISQDLEKQAARVHEEAK 1256
Cdd:COG5185 379 LDSFKDTIESTKESLDeipqNQRGYAQEILATLEdtlKAADRQIEelqrqIEQATSSNEEVSKLLNELISELNKVMREAD 458
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 281371490 1257 RAGDKAVEIYASVAQLTpvdseaLESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLL 1323
Cdd:COG5185 459 EESQSRLEEAYDEINRS------VRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKL 519
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1070-1534 |
2.61e-05 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 49.19 E-value: 2.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1070 EDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQisrlhnirNTIeetgilaERARSRVEsteqliEIASRe 1149
Cdd:PRK04863 306 QYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ--------EKI-------ERYQADLE------ELEER- 363
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1150 LEKAKMAAAnvsitqpESTGEpnnmtllAEEARRLAERHKQEADDI------VRVAKTANETSAEAYN----LL--LRTL 1217
Cdd:PRK04863 364 LEEQNEVVE-------EADEQ-------QEENEARAEAAEEEVDELksqladYQQALDVQQTRAIQYQqavqALerAKQL 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1218 AGENQTALE-----IEELNRKYEQAKNISQDLEkQAARVHEEAKRAGDKAveiYASVAQLT-PVDsealESEANKIKKEA 1291
Cdd:PRK04863 430 CGLPDLTADnaedwLEEFQAKEQEATEELLSLE-QKLSVAQAAHSQFEQA---YQLVRKIAgEVS----RSEAWDVAREL 501
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1292 adLDRLIDQKLKDyeDLREDMRGKEHEvknlLEKGKAEQQTADQLlaradaakalaeeaakkgrntlqeandilnnLKDF 1371
Cdd:PRK04863 502 --LRRLREQRHLA--EQLQQLRMRLSE----LEQRLRQQQRAERL-------------------------------LAEF 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1372 DRRVNDNKTAAEEAlrripaiNRTIAEANEKTREAQLALGNAAADATEAKnkaHEAERIASGVQKNAtsTKADAERTFSE 1451
Cdd:PRK04863 543 CKRLGKNLDDEDEL-------EQLQEELEARLESLSESVSEARERRMALR---QQLEQLQARIQRLA--ARAPAWLAAQD 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1452 vtdldnevngMLRQLEEAENElkrkqDDADQDMMMAGMASQAAQEAELnaRKAKNSVSSLLSQLTDLLEQLGQLDTVDLN 1531
Cdd:PRK04863 611 ----------ALARLREQSGE-----EFEDSQDVTEYMQQLLEREREL--TVERDELAARKQALDEEIERLSQPGGSEDP 673
|
...
gi 281371490 1532 KLN 1534
Cdd:PRK04863 674 RLN 676
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
1172-1584 |
3.50e-05 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 48.42 E-value: 3.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1172 NNMTLLAEEARRLAERHKQEADDIVRVAKTaNETSAEAYNLLLRTlaGENQTaLEIEELNRKYEQAKNISQDLEKQAARv 1251
Cdd:COG5185 110 PNYEWSADILISLLYLYKSEIVALKDELIK-VEKLDEIADIEASY--GEVET-GIIKDIFGKLTQELNQNLKKLEIFGL- 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1252 hEEAKRAGDKAVEIYASVAQLT-PVDSEALESEANKIKKEAAD---LDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKgK 1327
Cdd:COG5185 185 -TLGLLKGISELKKAEPSGTVNsIKESETGNLGSESTLLEKAKeiiNIEEALKGFQDPESELEDLAQTSDKLEKLVEQ-N 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1328 AEQQTADqllaradaakalaeeaAKKGRNTLQEANDILNNLKDFDRRVND---NKTAAEEALRRIPAINRTIAEANEktr 1404
Cdd:COG5185 263 TDLRLEK----------------LGENAESSKRLNENANNLIKQFENTKEkiaEYTKSIDIKKATESLEEQLAAAEA--- 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1405 EAQLALGNAAADaTEAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEvngmlRQLEEAENELKRKQDDADQDM 1484
Cdd:COG5185 324 EQELEESKRETE-TGIQNLTAEIEQGQESLTENLEAIKEEIENIVGEVELSKSS-----EELDSFKDTIESTKESLDEIP 397
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1485 MMAgmaSQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDEMKvsdlDRKVSNLENEAR 1564
Cdd:COG5185 398 QNQ---RGYAQEILATLEDTLKAADRQIEELQRQIEQATSSNEEVSKLLNELISELNKVMREAD----EESQSRLEEAYD 470
|
410 420
....*....|....*....|
gi 281371490 1565 KQEAAILDYDRDIAEIIKDI 1584
Cdd:COG5185 471 EINRSVRSKKEDLNEELTQI 490
|
|
| ATP-synt_B |
pfam00430 |
ATP synthase B/B' CF(0); Part of the CF(0) (base unit) of the ATP synthase. The base unit is ... |
1391-1503 |
5.45e-05 |
|
ATP synthase B/B' CF(0); Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits are thought to interact with the stalk of the CF(1) subunits. This domain should not be confused with the ab CF(1) proteins (in the head of the ATP synthase) which are found in pfam00006
Pssm-ID: 425677 [Multi-domain] Cd Length: 132 Bit Score: 44.61 E-value: 5.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1391 AINRTIAEANEKTREAQLALgnaaADATEAKNKAH-EAERIASGVQKNATSTKADAertfseVTDLDNEVNgmlRQLEEA 1469
Cdd:pfam00430 34 LIADEIAEAEERRKDAAAAL----AEAEQQLKEARaEAQEIIENAKKRAEKLKEEI------VAAAEAEAE---RIIEQA 100
|
90 100 110
....*....|....*....|....*....|....
gi 281371490 1470 ENELKRKQDDAdqdmmMAGMASQAAQEAELNARK 1503
Cdd:pfam00430 101 AAEIEQEKDRA-----LAELRQQVVALAVQIAEK 129
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
826-870 |
7.02e-05 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 41.53 E-value: 7.02e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*
gi 281371490 826 CQCNDniDPNAVGNCNRLTGECLkCIYNTAGFYCDRCKEGFFGNP 870
Cdd:smart00180 1 CDCDP--GGSASGTCDPDTGQCE-CKPNVTGRRCDRCAPGYYGDG 42
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1228-1569 |
7.05e-05 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.83 E-value: 7.05e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1228 EELNRKYEQAKNI----------SQDLEKQA--------------ARVHEEAKRA-GDKAVEIYASVAQLTPVDSEALES 1282
Cdd:PTZ00121 1094 EEAFGKAEEAKKTetgkaeearkAEEAKKKAedarkaeearkaedARKAEEARKAeDAKRVEIARKAEDARKAEEARKAE 1173
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1283 EANKIK--------KEAADLDRLID----QKLKDYEDLR--EDMRGKEHEvKNLLEKGKAEQQTADQLLARADAAKALAE 1348
Cdd:PTZ00121 1174 DAKKAEaarkaeevRKAEELRKAEDarkaEAARKAEEERkaEEARKAEDA-KKAEAVKKAEEAKKDAEEAKKAEEERNNE 1252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1349 EAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALR----RIPAINRTIAEANEKTREAQLAlGNAAADATEAKNKA 1424
Cdd:PTZ00121 1253 EIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKadeaKKAEEKKKADEAKKKAEEAKKA-DEAKKKAEEAKKKA 1331
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1425 HEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAenelKRKQDDADQDMMMAGMASQAAQEAELNARKA 1504
Cdd:PTZ00121 1332 DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEA----KKKADAAKKKAEEKKKADEAKKKAEEDKKKA 1407
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 281371490 1505 -----KNSVSSLLSQLTDLLEQLGQLDtvDLNKLNEiegSLNKAKDEMKVSDLDRKVSNLEN---EARKQEAA 1569
Cdd:PTZ00121 1408 delkkAAAAKKKADEAKKKAEEKKKAD--EAKKKAE---EAKKADEAKKKAEEAKKAEEAKKkaeEAKKADEA 1475
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1212-1593 |
7.37e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 46.82 E-value: 7.37e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1212 LLLRTLAGENQTAL-EIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDkavEIYASVAQLtpvdsEALESEANKIKKE 1290
Cdd:COG4372 24 ILIAALSEQLRKALfELDKLQEELEQLREELEQAREELEQLEEELEQARS---ELEQLEEEL-----EELNEQLQAAQAE 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1291 AADLDrlidQKLKDYEDLREDMRGKehevknlLEKGKAEQQTADQLlaradaakalaeeaakkgRNTLQEANDILNN-LK 1369
Cdd:COG4372 96 LAQAQ----EELESLQEEAEELQEE-------LEELQKERQDLEQQ------------------RKQLEAQIAELQSeIA 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1370 DFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLalgnaaADATEAKNKAHEAERIASGVQKNATSTKADAERTF 1449
Cdd:COG4372 147 EREEELKELEEQLESLQEELAALEQELQALSEAEAEQAL------DELLKEANRNAEKEEELAEAEKLIESLPRELAEEL 220
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 SEVTDLDNEVNGmlRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVD 1529
Cdd:COG4372 221 LEAKDSLEAKLG--LALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLL 298
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1530 LNKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDIAEIIKDIHNLEDIKKT 1593
Cdd:COG4372 299 ALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKG 362
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1355-1569 |
7.75e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 47.07 E-value: 7.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1355 RNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALgnaaadateAKNKAHEAERIASgV 1434
Cdd:COG4942 44 AALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL---------EAQKEELAELLRA-L 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1435 QKNATSTK----------ADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDmmmagMASQAAQEAELNARKA 1504
Cdd:COG4942 114 YRLGRQPPlalllspedfLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAE-----RAELEALLAELEEERA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1505 KnsVSSLLSQLTDLLEQLGQldtvdlnKLNEIEGSLNKAKDEMKvsDLDRKVSNLENEARKQEAA 1569
Cdd:COG4942 189 A--LEALKAERQKLLARLEK-------ELAELAAELAELQQEAE--ELEALIARLEAEAAAAAER 242
|
|
| Laminin_I |
pfam06008 |
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ... |
1356-1537 |
7.83e-05 |
|
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
Pssm-ID: 310534 [Multi-domain] Cd Length: 258 Bit Score: 46.25 E-value: 7.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1356 NTLQE-ANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTreAQLALGNAAADATEAKNKAHEAERIA--- 1431
Cdd:pfam06008 57 EELQKkATQTLAKAQQVNAESERTLGHAKELAEAIKNLIDNIKEINEKV--ATLGENDFALPSSDLSRMLAEAQRMLgei 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1432 -----SGVQKNATSTKADAERTFSEVTDL----DNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQeaELNAR 1502
Cdd:pfam06008 135 rsrdfGTQLQNAEAELKAAQDLLSRIQTWfqspQEENKALANALRDSLAEYEAKLSDLRELLREAAAKTRDAN--RLNLA 212
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 281371490 1503 KAKN---------SVSSLLSQLTDLLEQLGQ-LDTVDLnKLNEIE 1537
Cdd:pfam06008 213 NQANlrefqrkkeEVSEQKNQLEETLKTARDsLDAANL-LLQEID 256
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1071-1565 |
7.84e-05 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 47.51 E-value: 7.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1071 DRLK-EAEREvtdlLREAQ-EVKDVDQNLMDRLQRVNNSLHSQ-ISR-------LHNIRNTIE----ETGILAERARSRV 1136
Cdd:NF041483 86 DQLRaDAERE----LRDARaQTQRILQEHAEHQARLQAELHTEaVQRrqqldqeLAERRQTVEshvnENVAWAEQLRART 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1137 ES-TEQLIEIASRELEKAkMAAANVsitqpestgepnnmtllaeEARRLAERHKQeaddivRVAKTANETSAEAYNLLLR 1215
Cdd:NF041483 162 ESqARRLLDESRAEAEQA-LAAARA-------------------EAERLAEEARQ------RLGSEAESARAEAEAILRR 215
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1216 TLAGENQTaleieeLNRKYEQAknisqdlekQAARVHEEAKRAgdkaveiyasvaqltpvdSEALESEAnkIKKEAADLD 1295
Cdd:NF041483 216 ARKDAERL------LNAASTQA---------QEATDHAEQLRS------------------STAAESDQ--ARRQAAELS 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1296 RLIDQKLKDYEdlrEDMRGKEHEVKNLLEKGK--AEQQTAdqllaradaakalaeeaakkgrnTLQEANDilnnlkdfdr 1373
Cdd:NF041483 261 RAAEQRMQEAE---EALREARAEAEKVVAEAKeaAAKQLA-----------------------SAESANE---------- 304
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1374 rvNDNKTAAEEalrripaINRTIAEAN---EKTR-EAQLALGNAAADATEAKNKAHEAERIASGvqKNATSTKADAERTf 1449
Cdd:NF041483 305 --QRTRTAKEE-------IARLVGEATkeaEALKaEAEQALADARAEAEKLVAEAAEKARTVAA--EDTAAQLAKAART- 372
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 sevtdldnevngmlrqleeAENELKRKQDDADQdmmmagMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLgqldtvd 1529
Cdd:NF041483 373 -------------------AEEVLTKASEDAKA------TTRAAAEEAERIRREAEAEADRLRGEAADQAEQL------- 420
|
490 500 510
....*....|....*....|....*....|....*.
gi 281371490 1530 lnklneiEGSlnkAKDEMKvsDLDRKVSNLENEARK 1565
Cdd:NF041483 421 -------KGA---AKDDTK--EYRAKTVELQEEARR 444
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1257-1592 |
1.01e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 46.43 E-value: 1.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1257 RAGDKAVEIYASVAQLTPVDSEALESEANKIKKEAADLDRLidqkLKDYEDLREDMRGKEHEVKNL---LEKGKAEQQTA 1333
Cdd:COG4372 3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKL----QEELEQLREELEQAREELEQLeeeLEQARSELEQL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1334 DQLlaradaakalaeeaakkgrntLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPA-INRTIAEANEKTREAQLALGN 1412
Cdd:COG4372 79 EEE---------------------LEELNEQLQAAQAELAQAQEELESLQEEAEELQEeLEELQKERQDLEQQRKQLEAQ 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1413 AAADATEAKNKAHEAERIASGVQKNATSTKA-DAERTFSEVTDLDNEVNGMLRQLEeaENELKRKQDDADQDMMMAGMAS 1491
Cdd:COG4372 138 IAELQSEIAEREEELKELEEQLESLQEELAAlEQELQALSEAEAEQALDELLKEAN--RNAEKEEELAEAEKLIESLPRE 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1492 QAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKAKDE-----MKVSDLDRKVSNLENEARKQ 1566
Cdd:COG4372 216 LAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEkdteeEELEIAALELEALEEAALEL 295
|
330 340
....*....|....*....|....*.
gi 281371490 1567 EAAILDYDRDIAEIIKDIHNLEDIKK 1592
Cdd:COG4372 296 KLLALLLNLAALSLIGALEDALLAAL 321
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1382-1594 |
1.20e-04 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 46.87 E-value: 1.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1382 AEEALRRIPAINRTIA---EANEKTREAQLALGNAAADATEAKNKAHEAE----RIASG---VQK--NATSTKA------ 1443
Cdd:PRK04863 340 VQTALRQQEKIERYQAdleELEERLEEQNEVVEEADEQQEENEARAEAAEeevdELKSQladYQQalDVQQTRAiqyqqa 419
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1444 ----DAERTFSEVTDLDNE-VNGMLRQLEEAENELKRKQDDADQDMMMAGMA-SQAAQEAELnARK---------AKNSV 1508
Cdd:PRK04863 420 vqalERAKQLCGLPDLTADnAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAhSQFEQAYQL-VRKiagevsrseAWDVA 498
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1509 SSLLSQLTD---LLEQLGQLDtvdlNKLNEIEGSLNK-------AKDEMKVSDLD-RKVSNLENEARKQEAAILDYDRDI 1577
Cdd:PRK04863 499 RELLRRLREqrhLAEQLQQLR----MRLSELEQRLRQqqraerlLAEFCKRLGKNlDDEDELEQLQEELEARLESLSESV 574
|
250
....*....|....*..
gi 281371490 1578 AEIIKDIHNLEDIKKTL 1594
Cdd:PRK04863 575 SEARERRMALRQQLEQL 591
|
|
| Laminin_I |
pfam06008 |
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from ... |
1106-1370 |
1.24e-04 |
|
Laminin Domain I; coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
Pssm-ID: 310534 [Multi-domain] Cd Length: 258 Bit Score: 45.48 E-value: 1.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1106 NSLHSQISRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKMAAANVsitQPESTGEPNNMTLLAEEARRLA 1185
Cdd:pfam06008 5 NSLTGALPAPYKINYNLENLTKQLQEYLSPENAHKIQIEILEKELSSLAQETEEL---QKKATQTLAKAQQVNAESERTL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1186 ERHKQEADDIVRVAKTANETSAEAYNLllrtlaGENQTALEIEELNRKYEQAKNI-----SQDLEKQaarvHEEAKRAGD 1260
Cdd:pfam06008 82 GHAKELAEAIKNLIDNIKEINEKVATL------GENDFALPSSDLSRMLAEAQRMlgeirSRDFGTQ----LQNAEAELK 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1261 KAVEIYASVAQLTPVDSEALESEANKIKKEAADLDrlidQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARA 1340
Cdd:pfam06008 152 AAQDLLSRIQTWFQSPQEENKALANALRDSLAEYE----AKLSDLRELLREAAAKTRDANRLNLANQANLREFQRKKEEV 227
|
250 260 270
....*....|....*....|....*....|
gi 281371490 1341 DAAKALAEEAAKKGRNTLQEANDILNNLKD 1370
Cdd:pfam06008 228 SEQKNQLEETLKTARDSLDAANLLLQEIDD 257
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
340-394 |
1.31e-04 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 41.18 E-value: 1.31e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 340 CDCNGR---SQECYFdpelyrstgHGGHCTnCRDNTDGAKCERCRENFFRLGNTEACS 394
Cdd:pfam00053 1 CDCNPHgslSDTCDP---------ETGQCL-CKPGVTGRHCDRCKPGYYGLPSDPPQG 48
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1226-1533 |
1.38e-04 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 45.67 E-value: 1.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1226 EIEELNRKYEQAKNISQDLEKQ------------------AARVHE---EAKRAGDKAVEIYASVAQLTPVDSEALEsEA 1284
Cdd:COG1340 9 SLEELEEKIEELREEIEELKEKrdelneelkelaekrdelNAQVKElreEAQELREKRDELNEKVKELKEERDELNE-KL 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1285 NKIKKEAADLDRLIDQKLK---DYEDLREDMRGkehevknlLEKgkaEQQTADQllaradaakalaeeaakkgrnTLQEA 1361
Cdd:COG1340 88 NELREELDELRKELAELNKaggSIDKLRKEIER--------LEW---RQQTEVL---------------------SPEEE 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1362 NDILNNLKDFDRRVNDNKTAAEEalrripaiNRTIAEANEKTREAQLALGNAAADATEAKNKAHEA-ERIASGVQKnATS 1440
Cdd:COG1340 136 KELVEKIKELEKELEKAKKALEK--------NEKLKELRAELKELRKEAEEIHKKIKELAEEAQELhEEMIELYKE-ADE 206
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1441 TKADAERTFSEVTDLDNEVNgmlrQLEEAENELKRKQDDADQDMMMagmasqaAQEAELNARKAKNSvSSLLSQLTDLLE 1520
Cdd:COG1340 207 LRKEADELHKEIVEAQEKAD----ELHEEIIELQKELRELRKELKK-------LRKKQRALKREKEK-EELEEKAEEIFE 274
|
330
....*....|....*.
gi 281371490 1521 QL---GQLDTVDLNKL 1533
Cdd:COG1340 275 KLkkgEKLTTEELKLL 290
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1413-1594 |
1.44e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 45.98 E-value: 1.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1413 AAADATEAKNKAHeAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQdmmmaGMASQ 1492
Cdd:COG3883 1 ALALALAAPTPAF-ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDK-----LQAEI 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1493 AAQEAELNARKAKnsvssLLSQLTDLLEQLGQLDTVDLnklneiegsLNKAKDemkVSDLDRKVSNLENEARKQEAAILD 1572
Cdd:COG3883 75 AEAEAEIEERREE-----LGERARALYRSGGSVSYLDV---------LLGSES---FSDFLDRLSALSKIADADADLLEE 137
|
170 180
....*....|....*....|..
gi 281371490 1573 YDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:COG3883 138 LKADKAELEAKKAELEAKLAEL 159
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1267-1502 |
1.53e-04 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 45.99 E-value: 1.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1267 ASVAQLTPVDSEALESEANKIK---KEAADLDRLIDQKL-KDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQllarada 1342
Cdd:TIGR02794 35 AEIIQAVLVDPGAVAQQANRIQqqkKPAAKKEQERQKKLeQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQA------- 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1343 akalaeeaakkgrntlQEANdilnnlkdfdRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADAT---- 1418
Cdd:TIGR02794 108 ----------------EQAA----------KQAEEKQKQAEEAKAKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKakaa 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1419 -EAKNKAHEAERIASGVQKNATSTKADAERtfsEVTDLDNEvngmlRQLEEAENELKRKQdDADQDMMMAGMASQAAQEA 1497
Cdd:TIGR02794 162 aEAKKKAEEAKKKAEAEAKAKAEAEAKAKA---EEAKAKAE-----AAKAKAAAEAAAKA-EAEAAAAAAAEAERKADEA 232
|
....*
gi 281371490 1498 ELNAR 1502
Cdd:TIGR02794 233 ELGDI 237
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
770-818 |
1.53e-04 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 40.80 E-value: 1.53e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 281371490 770 PCPCPGGSSCAIVPKTKEVVCtHCPTGTAGKRCELCDDGYFGDPLGSNG 818
Cdd:cd00055 1 PCDCNGHGSLSGQCDPGTGQC-ECKPNTTGRRCDRCAPGYYGLPSQGGG 48
|
|
| MA |
smart00283 |
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ... |
1357-1522 |
2.04e-04 |
|
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.
Pssm-ID: 214599 [Multi-domain] Cd Length: 262 Bit Score: 44.97 E-value: 2.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1357 TLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQ----------------------LALgNAA 1414
Cdd:smart00283 37 VAANADEIAATAQSAAEAAEEGREAVEDAITAMDQIREVVEEAVSAVEELEessdeigeivsviddiadqtnlLAL-NAA 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1415 ----------------AD-----ATEAKNKAHEAERIASGVQKNATSTKADAERTFSEV-----------------TDLD 1456
Cdd:smart00283 116 ieaarageagrgfavvADevrklAERSAESAKEIESLIKEIQEETNEAVAAMEESSSEVeegvelveetgdaleeiVDSV 195
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 281371490 1457 NEVNGMLRQLEEAENELKRKQDDADQDMM-MAGMASQAAQEAELNARKAKnSVSSLLSQLTDLLEQL 1522
Cdd:smart00283 196 EEIADLVQEIAAATDEQAAGSEEVNAAIDeIAQVTQETAAMSEEISAAAE-ELSGLAEELDELVERF 261
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1033-1246 |
2.21e-04 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 46.20 E-value: 2.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKDKVAEHRVKLQELESLIANLGTGDETVTDQA--FEDRLKEAEREVTDL---LREAQEVKDVDQNLMDRLQRvnnS 1107
Cdd:TIGR02168 812 LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDIESLAAEIEELeelIEELESELEALLNERASLEE---A 888
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1108 LHSQISRLHNIRNTIEETGILAERARSRVESTEQLIEIASRELEKAKM--------AAANVSITQPESTGEPNNMTLLAE 1179
Cdd:TIGR02168 889 LALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVridnlqerLSEEYSLTLEEAEALENKIEDDEE 968
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 281371490 1180 EARRLAERHKQEADDIVRVaktaNETSAEaynlllrtlagenqtalEIEELNRKYEQAKNISQDLEK 1246
Cdd:TIGR02168 969 EARRRLKRLENKIKELGPV----NLAAIE-----------------EYEELKERYDFLTAQKEDLTE 1014
|
|
| AtpF |
COG0711 |
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ... |
1388-1515 |
2.22e-04 |
|
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ATP synthase, membrane subunit b or b' is part of the Pathway/BioSystem: FoF1-type ATP synthase
Pssm-ID: 440475 [Multi-domain] Cd Length: 152 Bit Score: 43.24 E-value: 2.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1388 RIPAINRTIAEANEKTREAQLALGNAAADATEAKnkaHEAERIASGVQKNATSTKADAertfseVTDLDNEVNgmlRQLE 1467
Cdd:COG0711 32 RQEKIADGLAEAERAKEEAEAALAEYEEKLAEAR---AEAAEIIAEARKEAEAIAEEA------KAEAEAEAE---RIIA 99
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 281371490 1468 EAENELKRKQDDADQDM--MMAGMASQAAQ---EAELNARKAKNSVSSLLSQL 1515
Cdd:COG0711 100 QAEAEIEQERAKALAELraEVADLAVAIAEkilGKELDAAAQAALVDRFIAEL 152
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
1071-1304 |
2.42e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 45.78 E-value: 2.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1071 DRLKEAEREVTDL---LREAQEVKDVDQNLMDRLQRVNNSlhsQISRLHNIrntIEETGILAERARSRVES-TEQLIEIA 1146
Cdd:PHA02562 174 DKIRELNQQIQTLdmkIDHIQQQIKTYNKNIEEQRKKNGE---NIARKQNK---YDELVEEAKTIKAEIEElTDELLNLV 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1147 ------SRELEKAKMAAANVS---------------------ITQPESTgEPNNMTLLAEEARRLAERHKQEADDIVRVA 1199
Cdd:PHA02562 248 mdiedpSAALNKLNTAAAKIKskieqfqkvikmyekggvcptCTQQISE-GPDRITKIKDKLKELQHSLEKLDTAIDELE 326
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1200 KTANEtsaeaYNLLLRTlagenqtaleIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPvDSEA 1279
Cdd:PHA02562 327 EIMDE-----FNEQSKK----------LLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQD-ELDK 390
|
250 260
....*....|....*....|....*
gi 281371490 1280 LESEANKIKKEAADLDrLIDQKLKD 1304
Cdd:PHA02562 391 IVKTKSELVKEKYHRG-IVTDLLKD 414
|
|
| RecN |
COG0497 |
DNA repair ATPase RecN [Replication, recombination and repair]; |
1358-1583 |
2.65e-04 |
|
DNA repair ATPase RecN [Replication, recombination and repair];
Pssm-ID: 440263 [Multi-domain] Cd Length: 555 Bit Score: 45.45 E-value: 2.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1358 LQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAE-------ANEktrEAQLA-----LGNAAadatEAKNKAH 1425
Cdd:COG0497 157 LEEYREAYRAWRALKKELEELRADEAERARELDLLRFQLEEleaaalqPGE---EEELEeerrrLSNAE----KLREALQ 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1426 EAERIASGVQKNATSTKADAERTFSEVTDLD---NEVNGMLR----QLEEAENELKRKQDDADQDmmmagmaSQAAQEAE 1498
Cdd:COG0497 230 EALEALSGGEGGALDLLGQALRALERLAEYDpslAELAERLEsaliELEEAASELRRYLDSLEFD-------PERLEEVE 302
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1499 --LN-----ARKAKNSVSSLLSQLTDLLEQLGQLDTVDlNKLNEIEGSLNKAKDEMkvSDLDRKVSnlenEARKQEAAIL 1571
Cdd:COG0497 303 erLAllrrlARKYGVTVEELLAYAEELRAELAELENSD-ERLEELEAELAEAEAEL--LEAAEKLS----AARKKAAKKL 375
|
250
....*....|..
gi 281371490 1572 dyDRDIAEIIKD 1583
Cdd:COG0497 376 --EKAVTAELAD 385
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1037-1259 |
2.75e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 45.73 E-value: 2.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1037 KDKVAEHRVKLQELESLIANLgTGDETVTDQAFEdRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSL-HSQISRL 1115
Cdd:TIGR00618 672 KELLASRQLALQKMQSEKEQL-TYWKEMLAQCQT-LLRELETHIEEYDREFNEIENASSSLGSDLAAREDALnQSLKELM 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1116 HNIRNTIEETGILAERARSRVESTEQLIEiasrELEKAKMAAANVSITQPESTGEpnnmtlLAEearrLAERHKQEADDI 1195
Cdd:TIGR00618 750 HQARTVLKARTEAHFNNNEEVTAALQTGA----ELSHLAAEIQFFNRLREEDTHL------LKT----LEAEIGQEIPSD 815
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1196 VRVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRAG 1259
Cdd:TIGR00618 816 EDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLN 879
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1065-1578 |
2.82e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 45.58 E-value: 2.82e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1065 TDQAFEDRLKeAEREVTDLLREAQEVKDVD-------QNLMDRLQRVNNSLHS-QISRLHNIRNTIEETGILAERAR--- 1133
Cdd:pfam10174 75 TIQALQDELR-AQRDLNQLLQQDFTTSPVDgedkfstPELTEENFRRLQSEHErQAKELFLLRKTLEEMELRIETQKqtl 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1134 -SRVESTEQLIEI------ASRELE-----KAKMAAANVSITQPES---TGEPNNMTLLAEEARR---LAERHKQEADDI 1195
Cdd:pfam10174 154 gARDESIKKLLEMlqskglPKKSGEedwerTRRIAEAEMQLGHLEVlldQKEKENIHLREELHRRnqlQPDPAKTKALQT 233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1196 VRVAKTANETSAEAYNlllRTLAGENQT-----ALEIEE-----------------LNRKYEQAKnisQDLEKQAA---- 1249
Cdd:pfam10174 234 VIEMKDTKISSLERNI---RDLEDEVQMlktngLLHTEDreeeikqmevykshskfMKNKIDQLK---QELSKKESella 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1250 ---------------RVHEEAKRAGDKAVEIYASVAQlTPVDS--EALESEANKIKKEAADLDRLIDQK---------LK 1303
Cdd:pfam10174 308 lqtkletltnqnsdcKQHIEVLKESLTAKEQRAAILQ-TEVDAlrLRLEEKESFLNKKTKQLQDLTEEKstlageirdLK 386
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1304 DY---------------EDLREDMRGKEHEVKNLLEKGKAEQQ---TADQLLAradaakalaeeaakkgrnTLQEA---- 1361
Cdd:pfam10174 387 DMldvkerkinvlqkkiENLQEQLRDKDKQLAGLKERVKSLQTdssNTDTALT------------------TLEEAlsek 448
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1362 NDILNNLK---DFDRR-----VNDNKTAAEEALRRIPAINRTIAEANE-----KTREAQLALGNAAADAT------EAKN 1422
Cdd:pfam10174 449 ERIIERLKeqrEREDRerleeLESLKKENKDLKEKVSALQPELTEKESslidlKEHASSLASSGLKKDSKlksleiAVEQ 528
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1423 KAHEAERIASGVQKnaTSTKADAERTFSEVTD----LDNEVNG--------------MLRQLEEAENE--LKRKQDDADQ 1482
Cdd:pfam10174 529 KKEECSKLENQLKK--AHNAEEAVRTNPEINDrirlLEQEVARykeesgkaqaeverLLGILREVENEknDKDKKIAELE 606
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1483 DMMMAGMASQAAQEAELNA----RKAKNsvsslLSQLTD-LLEQLGQLDTVDLNKLNEIEGSLNKAK---DEMK------ 1548
Cdd:pfam10174 607 SLTLRQMKEQNKKVANIKHgqqeMKKKG-----AQLLEEaRRREDNLADNSQQLQLEELMGALEKTRqelDATKarlsst 681
|
650 660 670 680
....*....|....*....|....*....|....*....|....*
gi 281371490 1549 ---VSDLDRKVSNLENEARKQ-E-----------AAILDYDRDIA 1578
Cdd:pfam10174 682 qqsLAEKDGHLTNLRAERRKQlEeilemkqeallAAISEKDANIA 726
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1037-1591 |
2.88e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 45.81 E-value: 2.88e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1037 KDKVAEHRVKLQELESLIANlgtgdeTVTDqafeDRLKEAEREVTD---LLREAQEVKDVDQNLMDRLQRVN------NS 1107
Cdd:TIGR01612 859 KEKIDSEHEQFAELTNKIKA------EISD----DKLNDYEKKFNDsksLINEINKSIEEEYQNINTLKKVDeyikicEN 928
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1108 LHSQISRLHNIRN-----------TIEETGILAERARSRVEST----EQLIEIASRELEKAKMAAANVSITQ-----PES 1167
Cdd:TIGR01612 929 TKESIEKFHNKQNilkeilnknidTIKESNLIEKSYKDKFDNTlidkINELDKAFKDASLNDYEAKNNELIKyfndlKAN 1008
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1168 TGEPNNMTLLaeeaRRLAERHKQeADDIVRVAKTANET--------SAEAYNLLLRTlagENQTALEIEELNrkyeqaKN 1239
Cdd:TIGR01612 1009 LGKNKENMLY----HQFDEKEKA-TNDIEQKIEDANKNipnieiaiHTSIYNIIDEI---EKEIGKNIELLN------KE 1074
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1240 ISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDsealesEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEV 1319
Cdd:TIGR01612 1075 ILEEAEINITNFNEIKEKLKHYNFDDFGKEENIKYAD------EINKIKDDIKNLDQKIDHHIKALEEIKKKSENYIDEI 1148
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1320 K---NLLEKgKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRrvndNKTAAEEalrrIPAINRT- 1395
Cdd:TIGR01612 1149 KaqiNDLED-VADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAEIEK----DKTSLEE----VKGINLSy 1219
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1396 -----------IAEANEKTREAQLALGNAAADATEAKNKAHEAER---IASGVQK--NATSTKADAERTFSEVTDLDNEV 1459
Cdd:TIGR01612 1220 gknlgklflekIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENemgIEMDIKAemETFNISHDDDKDHHIISKKHDEN 1299
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1460 NGMLRqleeaENELKRKQDDADQDMMmaGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLgqldtvdlnKLNEIEGS 1539
Cdd:TIGR01612 1300 ISDIR-----EKSLKIIEDFSEESDI--NDIKKELQKNLLDAQKHNSDINLYLNEIANIYNIL---------KLNKIKKI 1363
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 281371490 1540 LNKAKDEMKvsDLDRKVSNLENEARKQEAAIldydrdiaEIIKDIHNLEDIK 1591
Cdd:TIGR01612 1364 IDEVKEYTK--EIEENNKNIKDELDKSEKLI--------KKIKDDINLEECK 1405
|
|
| Laminin_EGF |
pfam00053 |
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six. |
284-325 |
3.27e-04 |
|
Laminin EGF domain; This family is like pfam00008 but has 8 conserved cysteines instead of six.
Pssm-ID: 395007 Cd Length: 49 Bit Score: 40.03 E-value: 3.27e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 281371490 284 CKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRP 325
Cdd:pfam00053 1 CDCNPHGSLSDTCDPETGQCLCKPGVTGRHCDRCKPGYYGLP 42
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1227-1461 |
3.33e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 44.80 E-value: 3.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1227 IEELNRKYEQAKNiSQDLEKQaaRVHEEAKRAGDKAVEIYASVAQLTPVDSEALEseANKIKKEAADLDRLIDQKLKdye 1306
Cdd:PRK09510 61 VEQYNRQQQQQKS-AKRAEEQ--RKKKEQQQAEELQQKQAAEQERLKQLEKERLA--AQEQKKQAEEAAKQAALKQK--- 132
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1307 dLREDMRGKEHEVKNLleKGKAEQQTADQLLARADAAKALAEEAAKKGRNTlQEANdilnnlkdfdrrvndnKTAAEEAL 1386
Cdd:PRK09510 133 -QAEEAAAKAAAAAKA--KAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAA-AEAK----------------KKAEAEAA 192
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1387 RRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAhEAERIASGVQKNATSTKADAERTfSEVTDLDNEVNG 1461
Cdd:PRK09510 193 AKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA-AAEAKAAAEKAAAAKAAEKAAAA-KAAAEVDDLFGG 265
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1208-1432 |
3.57e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 45.29 E-value: 3.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1208 EAYNLLLRtlAGENQTALE-IEELNRKYEQAKNISQDLE--KQAARVHEEAKRAGDKAVEIyasvaqltpvdsEALESEA 1284
Cdd:COG4913 239 RAHEALED--AREQIELLEpIRELAERYAAARERLAELEylRAALRLWFAQRRLELLEAEL------------EELRAEL 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1285 NKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKgkaEQQTADqllaradaakalaeeaakkgrntlQEANDI 1364
Cdd:COG4913 305 ARLEAELERLEARLDALREELDELEAQIRGNGGDRLEQLER---EIERLE------------------------RELEER 357
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1365 LNNLKDFDRRVND-----NKTAAE-EALRRipAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAER-IAS 1432
Cdd:COG4913 358 ERRRARLEALLAAlglplPASAEEfAALRA--EAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAeIAS 430
|
|
| Taxilin |
pfam09728 |
Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain ... |
1281-1594 |
4.19e-04 |
|
Myosin-like coiled-coil protein; Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca2+-dependent exocytosis in neuroendocrine cells. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localizes to the nucleus in osteoblasts and dimerizes with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription.
Pssm-ID: 462861 [Multi-domain] Cd Length: 302 Bit Score: 44.17 E-value: 4.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1281 ESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARadaakalaeeaakkgRNTLQE 1360
Cdd:pfam09728 3 ARELMQLLNKLDSPEEKLAALCKKYAELLEEMKRLQKDLKKLKKKQDQLQKEKDQLQSE---------------LSKAIL 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1361 ANDILNNL-KDFDRRvndNKTAAEEALRRIpainrtiAEANEKTREAQLALGNAAADATEaknkaheaeRIASGVQKNAT 1439
Cdd:pfam09728 68 AKSKLEKLcRELQKQ---NKKLKEESKKLA-------KEEEEKRKELSEKFQSTLKDIQD---------KMEEKSEKNNK 128
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1440 STKADAE-----RTFSEVTDL-DNEVNGMLRQ--LEEAENELKRKQDDADQDmmmagmasQAAQEAELN-ARKAKNSVSS 1510
Cdd:pfam09728 129 LREENEElreklKSLIEQYELrELHFEKLLKTkeLEVQLAEAKLQQATEEEE--------KKAQEKEVAkARELKAQVQT 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1511 LLSQLTDLLEQL-------GQL-DTvdLNKLNEIEGSLNKAKDEM--KVSDLDRkvsnlENEA--RKQEA---AILD--- 1572
Cdd:pfam09728 201 LSETEKELREQLnlyvekfEEFqDT--LNKSNEVFTTFKKEMEKMskKIKKLEK-----ENLTwkRKWEKsnkALLEmae 273
|
330 340
....*....|....*....|....*.
gi 281371490 1573 ----YDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:pfam09728 274 erqkLKEELEKLQKKLEKLENLCRAL 299
|
|
| EGF_Lam |
cd00055 |
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous ... |
339-387 |
4.29e-04 |
|
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Pssm-ID: 238012 Cd Length: 50 Bit Score: 39.64 E-value: 4.29e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 281371490 339 PCDCNGR---SQECYFdpelyrstgHGGHCTnCRDNTDGAKCERCRENFFRL 387
Cdd:cd00055 1 PCDCNGHgslSGQCDP---------GTGQCE-CKPNTTGRRCDRCAPGYYGL 42
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
1128-1280 |
4.95e-04 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 43.95 E-value: 4.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1128 LAERARSRVESTEQLIEI--ASRELEKA-----KMAAANvsitqpestgePNNMTLLAEEARRLAERHK-QEADDIVRVA 1199
Cdd:COG2956 102 LLELDPDDAEALRLLAEIyeQEGDWEKAievleRLLKLG-----------PENAHAYCELAELYLEQGDyDEAIEALEKA 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1200 KTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKqAARVHEEAKRAgDKAVEIYASVAQLTPVDSEA 1279
Cdd:COG2956 171 LKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPR-LAELYEKLGDP-EEALELLRKALELDPSDDLL 248
|
.
gi 281371490 1280 L 1280
Cdd:COG2956 249 L 249
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1044-1333 |
5.15e-04 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 44.73 E-value: 5.15e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1044 RVKLQELESLIANLgtgDETVTDQAFEDR--LKEAEREVTDLLREAQEVKDVDQNLMDRLQRV-------NNSLHSQISR 1114
Cdd:pfam05557 1 RAELIESKARLSQL---QNEKKQMELEHKraRIELEKKASALKRQLDRESDRNQELQKRIRLLekreaeaEEALREQAEL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1115 LHNIRNTIEETGILAERARSRVESTEQLI-----EIAS--RELEKAKMAaanvsitqpestgepnnMTLLAEEARRLAER 1187
Cdd:pfam05557 78 NRLKKKYLEALNKKLNEKESQLADAREVIsclknELSElrRQIQRAELE-----------------LQSTNSELEELQER 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1188 HKQEAddivrvAKTAN-ETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRagdkaveiy 1266
Cdd:pfam05557 141 LDLLK------AKASEaEQLRQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELARIPELEKE--------- 205
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 281371490 1267 asVAQLTPVDSEALESEANK--IKKEAADLDRlidqKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTA 1333
Cdd:pfam05557 206 --LERLREHNKHLNENIENKllLKEEVEDLKR----KLEREEKYREEAATLELEKEKLEQELQSWVKLA 268
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1192-1526 |
5.29e-04 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 44.95 E-value: 5.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1192 ADDIVRVAKTANETSAEAYNL------LLRTLAGENQTAleiEELNRKYEQAKNISQDLEK--QAARVH----EEAKRAG 1259
Cdd:PRK04863 271 AADYMRHANERRVHLEEALELrrelytSRRQLAAEQYRL---VEMARELAELNEAESDLEQdyQAASDHlnlvQTALRQQ 347
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1260 DKAVEIYASVAQLTPVDSEALES--------EANKIKKEAADL--DRLIDQkLKDYE---DLREDMRGKEHEVKNLLEKG 1326
Cdd:PRK04863 348 EKIERYQADLEELEERLEEQNEVveeadeqqEENEARAEAAEEevDELKSQ-LADYQqalDVQQTRAIQYQQAVQALERA 426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1327 KAEQQTADQLlaradaakalaeeaakkgrntlqeandiLNNLKDF-DRRVNDNKTAAEEALrripainrtiaEAnektrE 1405
Cdd:PRK04863 427 KQLCGLPDLT----------------------------ADNAEDWlEEFQAKEQEATEELL-----------SL-----E 462
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1406 AQLALGNAAADATEaknKAHEA-ERIASGVQKNATSTKA-DAERTFSEVTDLDNEVNGMLRQLEEAENELkRKQDDADQD 1483
Cdd:PRK04863 463 QKLSVAQAAHSQFE---QAYQLvRKIAGEVSRSEAWDVArELLRRLREQRHLAEQLQQLRMRLSELEQRL-RQQQRAERL 538
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1484 MMMAGMAS-------------QAAQEAELNArkAKNSVSSLLSQLTDLLEQLGQLD 1526
Cdd:PRK04863 539 LAEFCKRLgknlddedeleqlQEELEARLES--LSESVSEARERRMALRQQLEQLQ 592
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1065-1256 |
6.01e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.62 E-value: 6.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1065 TDQAFEDRLKEAEREVTDLLREAQEVKdVDQNLMDrLQRVNNSLHSQISRLhniRNTIEETGILAERARSRVESTEQLIE 1144
Cdd:COG3206 176 ALEFLEEQLPELRKELEEAEAALEEFR-QKNGLVD-LSEEAKLLLQQLSEL---ESQLAEARAELAEAEARLAALRAQLG 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1145 IASRELE-----------KAKMAAANVSITQPESTGEPNN--MTLLAEEARRLAERHKQEADDI-------VRVAKTANE 1204
Cdd:COG3206 251 SGPDALPellqspviqqlRAQLAELEAELAELSARYTPNHpdVIALRAQIAALRAQLQQEAQRIlasleaeLEALQAREA 330
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1205 TSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEK--QAARVHEEAK 1256
Cdd:COG3206 331 SLQAQLAQLEARLAELPELEAELRRLEREVEVARELYESLLQrlEEARLAEALT 384
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1226-1450 |
8.23e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 43.67 E-value: 8.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1226 EIEELNRKYEQAKNISQDLEKQAARVHEEAKRA-----------GDKAVEIYASVAQLTPVD--------SEALESeank 1286
Cdd:COG3883 45 ELEELNEEYNELQAELEALQAEIDKLQAEIAEAeaeieerreelGERARALYRSGGSVSYLDvllgsesfSDFLDR---- 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1287 ikkeAADLDRLIDQ---KLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLlaradaakalaeeaakkgRNTLQEAND 1363
Cdd:COG3883 121 ----LSALSKIADAdadLLEELKADKAELEAKKAELEAKLAELEALKAELEAA------------------KAELEAQQA 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1364 ILNNLKDfdrRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKA 1443
Cdd:COG3883 179 EQEALLA---QLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAA 255
|
....*..
gi 281371490 1444 DAERTFS 1450
Cdd:COG3883 256 GAAAGSA 262
|
|
| MA |
smart00283 |
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo ... |
1136-1419 |
9.62e-04 |
|
Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer); Thought to undergo reversible methylation in response to attractants or repellants during bacterial chemotaxis.
Pssm-ID: 214599 [Multi-domain] Cd Length: 262 Bit Score: 43.04 E-value: 9.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1136 VESTEQLIEIASRELEKAKMAAanvsitqpestgepNNMTLLAEEARRLAERhkqeADDIVRVAKTANETSAEAYNLLLR 1215
Cdd:smart00283 3 SEAVEEIAAGAEEQAEELEELA--------------ERMEELSASIEEVAAN----ADEIAATAQSAAEAAEEGREAVED 64
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1216 TLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHE--------------EAKRAGDkAVEIYASVAQltpvdsE--A 1279
Cdd:smart00283 65 AITAMDQIREVVEEAVSAVEELEESSDEIGEIVSVIDDiadqtnllalnaaiEAARAGE-AGRGFAVVAD------EvrK 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1280 LeseANKIKKEAADLDRLIDQKLKDYEDLREDMRgkehEVKNLLEKGKAeqqtadqllaradaakalaeeaakKGRNTLQ 1359
Cdd:smart00283 138 L---AERSAESAKEIESLIKEIQEETNEAVAAME----ESSSEVEEGVE------------------------LVEETGD 186
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1360 EANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQlALGNAAADATE 1419
Cdd:smart00283 187 ALEEIVDSVEEIADLVQEIAAATDEQAAGSEEVNAAIDEIAQVTQETA-AMSEEISAAAE 245
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1279-1564 |
1.01e-03 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 42.50 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1279 ALESEANKIKKEAADLDRLIDQKLKDYEDlredmrgkehevknllEKGKAEQQTADQllaradaakalaeeaakkgrntl 1358
Cdd:COG1842 9 IIRANINALLDKAEDPEKMLDQAIRDMEE----------------DLVEARQALAQV----------------------- 49
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 qeandilnnlkdfdrrvndnkTAAEEALRRipAINRTIAEANEKTREAQLALgnAAAD---ATEAknkaheAERIASgVQ 1435
Cdd:COG1842 50 ---------------------IANQKRLER--QLEELEAEAEKWEEKARLAL--EKGRedlAREA------LERKAE-LE 97
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1436 KNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENElkrkqddadQDMMMAGMASQAAQEAelnARKAKNSVSSllSQL 1515
Cdd:COG1842 98 AQAEALEAQLAQLEEQVEKLKEALRQLESKLEELKAK---------KDTLKARAKAAKAQEK---VNEALSGIDS--DDA 163
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 281371490 1516 TDLLEQLgqldtvdLNKLNEIEGSLNkAKDEMKV-SDLDRKVSNLENEAR 1564
Cdd:COG1842 164 TSALERM-------EEKIEEMEARAE-AAAELAAgDSLDDELAELEADSE 205
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1039-1598 |
1.38e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 43.50 E-value: 1.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1039 KVAEHRVKLQELESLIANLgtgdetvtdqafeDRLKEAEREVTDLLREAQEVkDVDQNLMDRLQRVNNSLHSQISRLHNI 1118
Cdd:TIGR00606 452 KQEELKFVIKELQQLEGSS-------------DRILELDQELRKAERELSKA-EKNSLTETLKKEVKSLQNEKADLDRKL 517
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1119 RNTIEEtgiLAERARSRvESTEQLIEIASRELEKAKMAAANVSITQPESTGEPN---NMTLLAEEARRLAERHKQEADDI 1195
Cdd:TIGR00606 518 RKLDQE---MEQLNHHT-TTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGyfpNKKQLEDWLHSKSKEINQTRDRL 593
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1196 VRVAK--TANETSAEAYNLLLRTLagenqtaleiEELNRKYEQakNI-----SQDLEKQAARVHEEAKRAGDK------A 1262
Cdd:TIGR00606 594 AKLNKelASLEQNKNHINNELESK----------EEQLSSYED--KLfdvcgSQDEESDLERLKEEIEKSSKQramlagA 661
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1263 VEIYAS-VAQLTPVDSEA---------LESEANKIKKEAADLDRLIDQKLKDYEDLredMRGKEHEVKNLLEKGKAEQQT 1332
Cdd:TIGR00606 662 TAVYSQfITQLTDENQSCcpvcqrvfqTEAELQEFISDLQSKLRLAPDKLKSTESE---LKKKEKRRDEMLGLAPGRQSI 738
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1333 ADQLLaradaakalaeeaakkgrntlQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINrtiaeANEKTREAQLAlgn 1412
Cdd:TIGR00606 739 IDLKE---------------------KEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIM-----PEEESAKVCLT--- 789
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1413 aaaDATEAKNKAHEAERIASGV-QKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEaENELKRKqddadqdmmmagmAS 1491
Cdd:TIGR00606 790 ---DVTIMERFQMELKDVERKIaQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVS-KIELNRK-------------LI 852
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1492 QAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVDLNKLNEIEgSLNKAKDEMKVSDL-------DRKVSNLENEAR 1564
Cdd:TIGR00606 853 QDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQ-SLIREIKDAKEQDSpletfleKDQQEKEELISS 931
|
570 580 590
....*....|....*....|....*....|....*..
gi 281371490 1565 KQEAAILDYDR--DIAEIIKDIH-NLEDIKKTLPTGC 1598
Cdd:TIGR00606 932 KETSNKKAQDKvnDIKEKVKNIHgYMKDIENKIQDGK 968
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
1033-1590 |
1.47e-03 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 43.67 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKDKVAEHRVKL-QELESLIANLGTGDETVTDQ--------AFEDRLKEAEREVTD---LLREAQE-VKDVDQNLMD 1099
Cdd:PTZ00440 740 YKEEEEKLEVYKHQIiNRKNEFILHLYENDKDLPDGkntyeeflQYKDTILNKENKISNdinILKENKKnNQDLLNSYNI 819
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1100 RLQRVNNSLHSQISRLHNIRNTIEETGI---LAE------RARSRVESTEQLIEIASRELEKAKmaaaNVSITQPESTGE 1170
Cdd:PTZ00440 820 LIQKLEAHTEKNDEELKQLLQKFPTEDEnlnLKElekefnENNQIVDNIIKDIENMNKNINIIK----TLNIAINRSNSN 895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1171 PNNMTLLAEEARRLAERHKQ-----EADDIVRVAKTA------NETSAEAYNLLLRTLAgeNQTALEIEELNRKYEQAK- 1238
Cdd:PTZ00440 896 KQLVEHLLNNKIDLKNKLEQhmkiiNTDNIIQKNEKLnllnnlNKEKEKIEKQLSDTKI--NNLKMQIEKTLEYYDKSKe 973
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1239 NISQDLEKQAARVHEEAKRAGDKAVEIyasvAQLTpVDSEALESEANK-IKKEAADLDRLIDqklKDYEDLREDMRGKEH 1317
Cdd:PTZ00440 974 NINGNDGTHLEKLDKEKDEWEHFKSEI----DKLN-VNYNILNKKIDDlIKKQHDDIIELID---KLIKEKGKEIEEKVD 1045
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1318 EVKNLLEKGKAEQQTADQLLARAdaakalaeeaakkgrntLQEANDILNNLKDFDRRVNdnktaaeealrripAINRTIA 1397
Cdd:PTZ00440 1046 QYISLLEKMKTKLSSFHFNIDIK-----------------KYKNPKIKEEIKLLEEKVE--------------ALLKKID 1094
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1398 EANEKTREAQLALGNAAADATEAKNKAHEAERIAsgvQKNATSTKADAERTFSEVTDLDNEVNgMLRQLEEAENELKRK- 1476
Cdd:PTZ00440 1095 ENKNKLIEIKNKSHEHVVNADKEKNKQTEHYNKK---KKSLEKIYKQMEKTLKELENMNLEDI-TLNEVNEIEIEYERIl 1170
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1477 -QDDADQDMMMAGMASQAAQEAELNARK---AKNSVSSLLSQLTDLLEQLGQLDTVDLNKlNEIEGSLNKAKDEMKVSDL 1552
Cdd:PTZ00440 1171 iDHIVEQINNEAKKSKTIMEEIESYKKDidqVKKNMSKERNDHLTTFEYNAYYDKATASY-ENIEELTTEAKGLKGEANR 1249
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 281371490 1553 DRKVSNLE---NEARKQEAAILDYDRDIAEIIKDIHNLEDI 1590
Cdd:PTZ00440 1250 STNVDELKeikLQVFSYLQQVIKENNKMENALHEIKNMYEF 1290
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1367-1594 |
1.47e-03 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 43.51 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1367 NLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAA---DATEAKNKAHEAERIASGVQKNATSTKA 1443
Cdd:PRK03918 180 RLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevkELEELKEEIEELEKELESLEGSKRKLEE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1444 DaertFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQE-AELNARKAK-----NSVSSLLSQLTD 1517
Cdd:PRK03918 260 K----IRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDElREIEKRLSRleeeiNGIEERIKELEE 335
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 1518 LLEQLGQLdtvdLNKLNEIEGSLNKAK-DEMKVSDLDRKVSNLENeARKQEAaildyDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:PRK03918 336 KEERLEEL----KKKLKELEKRLEELEeRHELYEEAKAKKEELER-LKKRLT-----GLTPEKLEKELEELEKAKEEI 403
|
|
| PLN03188 |
PLN03188 |
kinesin-12 family protein; Provisional |
1033-1307 |
1.79e-03 |
|
kinesin-12 family protein; Provisional
Pssm-ID: 215621 [Multi-domain] Cd Length: 1320 Bit Score: 43.00 E-value: 1.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1033 YRLVKDKVAEHRVKLQ---ELESlianlgTGDETVTDQAFEDRlkeAEREVtdLLREAQEVKD-----VDQNLMDRLQRv 1104
Cdd:PLN03188 949 HKLLKEKYENHPEVLRtkiELKR------VQDELEHYRNFYDM---GEREV--LLEEIQDLRSqlqyyIDSSLPSARKR- 1016
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1105 nNSL--------HSQISRLHNIRNTIEETG-------------------ILAERARSRVESTEQLIEIASRELEKAKMAA 1157
Cdd:PLN03188 1017 -NSLlkltyscePSQAPPLNTIPESTDESPekkleqerlrwteaeskwiSLAEELRTELDASRALAEKQKHELDTEKRCA 1095
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1158 anvsitqpESTGEPNNMTL------------LAEEARRLAERHK--QEA-DDIVRVAKTANETSAEAYnlLLRTLAGEnQ 1222
Cdd:PLN03188 1096 --------EELKEAMQMAMegharmleqyadLEEKHIQLLARHRriQEGiDDVKKAAARAGVRGAESK--FINALAAE-I 1164
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1223 TALEIE-ELNRKYEQAKNISqdLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESEankikKEAADLDRLIDQK 1301
Cdd:PLN03188 1165 SALKVErEKERRYLRDENKS--LQAQLRDTAEAVQAAGELLVRLKEAEEALTVAQKRAMDAE-----QEAAEAYKQIDKL 1237
|
....*.
gi 281371490 1302 LKDYED 1307
Cdd:PLN03188 1238 KRKHEN 1243
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
1184-1415 |
2.03e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 42.02 E-value: 2.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1184 LAERHKQEADDIVRVAKTANETsAEAY----NLLLRTlaGENQTALEI-EELNRKYEQAKNISQDLekqaARVHEEAKRA 1258
Cdd:COG2956 20 LNGQPDKAIDLLEEALELDPET-VEAHlalgNLYRRR--GEYDRAIRIhQKLLERDPDRAEALLEL----AQDYLKAGLL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1259 gDKAVEIYASVAQLTPVDSEALESEANkIKKEAADLDRLIDQklkdYEDLREDMRGKEHevkNLLEKGKAEQQTADqlla 1338
Cdd:COG2956 93 -DRAEELLEKLLELDPDDAEALRLLAE-IYEQEGDWEKAIEV----LERLLKLGPENAH---AYCELAELYLEQGD---- 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1339 radaakalaeeaAKKGRNTLQEANDI-------LNNLKDFDRRVNDNKtAAEEALRRIPAINRTIAEANEKTREAQLALG 1411
Cdd:COG2956 160 ------------YDEAIEALEKALKLdpdcaraLLLLAELYLEQGDYE-EAIAALERALEQDPDYLPALPRLAELYEKLG 226
|
....
gi 281371490 1412 NAAA 1415
Cdd:COG2956 227 DPEE 230
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1359-1505 |
2.04e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.45 E-value: 2.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1359 QEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNaAADATEAKNKAHEaerIASgvQKNA 1438
Cdd:COG1579 31 AELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGN-VRNNKEYEALQKE---IES--LKRR 104
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 281371490 1439 TSTKADAERTFSE-VTDLDNEVNGMLRQLEEAENELKRKQDDADQDmmmagMASQAAQEAELNARKAK 1505
Cdd:COG1579 105 ISDLEDEILELMErIEELEEELAELEAELAELEAELEEKKAELDEE-----LAELEAELEELEAEREE 167
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1291-1607 |
2.05e-03 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 42.68 E-value: 2.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1291 AADLDRLIDQK-----LKDYE---DLREDMRG-KEHEVKNLLEKGKAEQQTADQLLaradaakalaeeaakkgrntlqea 1361
Cdd:pfam05262 171 DVDTDSISDKKvvealREDNEkgvNFRRDMTDlKERESQEDAKRAQQLKEELDKKQ------------------------ 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1362 ndilnnlKDFDRrvndnktAAEEALRRIPAINRTIAEANEKTREAQLALGNAAA-----DATEAKNKAHEAERIASGVQK 1436
Cdd:pfam05262 227 -------IDADK-------AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTsspkeDKQVAENQKREIEKAQIEIKK 292
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1437 NA-TSTKADAERTFsevtDLDNEVNGMLRQLEEAENELKRKQDDADQDM--MMAGMASQAAQEAElNARKAKNSVSSLls 1513
Cdd:pfam05262 293 NDeEALKAKDHKAF----DLKQESKASEKEAEDKELEAQKKREPVAEDLqkTKPQVEAQPTSLNE-DAIDSSNPVYGL-- 365
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1514 QLTDLLEQLGQLDTVDLNK--------LNEIEGSL--NKAKDEMKVSD---------LDRKVSNLE-----NEARKQEAA 1569
Cdd:pfam05262 366 KVVDPITNLSELVLIDLKTevrlresaQQTIRRRGlyEREKDLVAIAItsgnaklqlVDIDLKNLEvikesNFEIAKNSS 445
|
330 340 350
....*....|....*....|....*....|....*...
gi 281371490 1570 ILDYDRDIAEIIKDIHNLEDIKKTLPTGCFNTPSIEKP 1607
Cdd:pfam05262 446 LYVDSRMILVAVKDDSNKWRLAKFSPKLDEFILSENKI 483
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1047-1526 |
2.33e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 42.57 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1047 LQELESLiANLGTGDETVTD--------QAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLhni 1118
Cdd:pfam07888 3 LDELVTL-EEESHGEEGGTDmllvvpraELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRREL--- 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1119 rntieetgilaeraRSRVESTEQLIEIASRELEKAKMAAANVSITQPESTGEPNnmTLLAEEARRLAeRHKQEADDIvrv 1198
Cdd:pfam07888 79 --------------ESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKD--ALLAQRAAHEA-RIRELEEDI--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1199 aktanetsaeaynlllRTLAgenQTALEIEelnrkyeqaknisQDLEkqaaRVHEEAKRAGDKAVEIYASVAQLTpVDSE 1278
Cdd:pfam07888 139 ----------------KTLT---QRVLERE-------------TELE----RMKERAKKAGAQRKEEEAERKQLQ-AKLQ 181
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1279 ALESEANKIKKEAADLDRLIDQKLKDYEDLREDmrgkeheVKNLLEK-GKAEQQTAdqllaradaakalaeeaakkgrnt 1357
Cdd:pfam07888 182 QTEEELRSLSKEFQELRNSLAQRDTQVLQLQDT-------ITTLTQKlTTAHRKEA------------------------ 230
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1358 lqEANDILNNLKDFDRRVNDNKTAAE---EALRRIPAI-NRTIAEANEKTRE-AQLALGNAAADATEAKNKAHEA-ERia 1431
Cdd:pfam07888 231 --ENEALLEELRSLQERLNASERKVEglgEELSSMAAQrDRTQAELHQARLQaAQLTLQLADASLALREGRARWAqER-- 306
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1432 SGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRkqdDADQDMMMAGMASQAAQEAELNARKAKNSVSSL 1511
Cdd:pfam07888 307 ETLQQSAEADKDRIEKLSAELQRLEERLQEERMEREKLEVELGR---EKDCNRVQLSESRRELQELKASLRVAQKEKEQL 383
|
490
....*....|....*
gi 281371490 1512 LSQLTDLLEQLGQLD 1526
Cdd:pfam07888 384 QAEKQELLEYIRQLE 398
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1264-1581 |
2.58e-03 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 42.20 E-value: 2.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1264 EIYASVAQLtpvdsEALESEANKIKKEAADLDRLIDQKLKDYE---DLREDMRGKEHEVKNLLEKGKAEQQTADQLLARA 1340
Cdd:COG5278 87 EIDELLAEL-----RSLTADNPEQQARLDELEALIDQWLAELEqviALRRAGGLEAALALVRSGEGKALMDEIRARLLLL 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1341 DAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQLALGNAAADATEA 1420
Cdd:COG5278 162 ALALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALA 241
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1421 KNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQAAQEAELN 1500
Cdd:COG5278 242 LALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAA 321
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1501 ARKAKNSVSSLLSQLTDLLEQLGQLDTVDL---NKLNEIEGSLNKAKDEMKVSDLDRKVSNLENEARKQEAAILDYDRDI 1577
Cdd:COG5278 322 AAALAALLALALATALAAAAAALALLAALLaeaAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAA 401
|
....
gi 281371490 1578 AEII 1581
Cdd:COG5278 402 AAAE 405
|
|
| MscS_porin |
pfam12795 |
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ... |
1271-1484 |
2.58e-03 |
|
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.
Pssm-ID: 432790 [Multi-domain] Cd Length: 238 Bit Score: 41.52 E-value: 2.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1271 QLTPVDSEALES---------EANKIKKEAADLDRLIDQKLKDYEDLRED----MRGKEHEVKNLLEK---GKAEQQTAD 1334
Cdd:pfam12795 10 LDEAAKKKLLQDlqqalslldKIDASKQRAAAYQKALDDAPAELRELRQElaalQAKAEAAPKEILASlslEELEQRLLQ 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1335 QllaradaakalaeeaakkgRNTLQEANdilNNLKDFDRRVNDNKTAAE-------EALRRIPAINRTI---AEANEKTR 1404
Cdd:pfam12795 90 T-------------------SAQLQELQ---NQLAQLNSQLIELQTRPEraqqqlsEARQRLQQIRNRLngpAPPGEPLS 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1405 EAQLALGNAAADATEAKNKAHEAERIASGVQKNATSTKAD-----AERTFSEVTDLDNEVNGmlRQLEEAENELKRKQDD 1479
Cdd:pfam12795 148 EAQRWALQAELAALKAQIDMLEQELLSNNNRQDLLKARRDlltlrIQRLEQQLQALQELLNE--KRLQEAEQAVAQTEQL 225
|
....*
gi 281371490 1480 ADQDM 1484
Cdd:pfam12795 226 AEEAA 230
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
1466-1594 |
2.71e-03 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 40.89 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1466 LEEAENELKRKQDDADQDMMMAGMASQAAQEAELNARKAKnsvsslLSQLTDLLEQLGQLDTVDLNKLNEIEGSLNKA-- 1543
Cdd:cd00176 16 LSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEER------VEALNELGEQLIEEGHPDAEEIQERLEELNQRwe 89
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 281371490 1544 -------------KDEMKVSDLDRKVSNLENEARKQEAAIL--DYDRDIAEIIKDIHNLEDIKKTL 1594
Cdd:cd00176 90 elrelaeerrqrlEEALDLQQFFRDADDLEQWLEEKEAALAseDLGKDLESVEELLKKHKELEEEL 155
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1179-1409 |
2.76e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 2.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1179 EEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAKNISQDLEKQAARVHEEAKRA 1258
Cdd:COG4942 37 AELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1259 G-----------DKAVEIYASVAQLTPVdSEALESEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEvKNLLEKGK 1327
Cdd:COG4942 117 GrqpplalllspEDFLDAVRRLQYLKYL-APARREQAEELRADLAELAALRAELEAERAELEALLAELEEE-RAALEALK 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1328 AEQQTadqllaradaakalaeeaakkgrnTLQEANdilNNLKDFDRRVNDNKTAAEEALRRIPAINRTIAEANEKTREAQ 1407
Cdd:COG4942 195 AERQK------------------------LLARLE---KELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAG 247
|
..
gi 281371490 1408 LA 1409
Cdd:COG4942 248 FA 249
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1250-1485 |
3.41e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 3.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1250 RVHEEAKRAGDKaveiyasVAQLTPVdsEALESEANKIKKEAADLDRLIDqKLKDYEDLREDmrgkeHEVKNLLEKGKAE 1329
Cdd:COG4913 239 RAHEALEDAREQ-------IELLEPI--RELAERYAAARERLAELEYLRA-ALRLWFAQRRL-----ELLEAELEELRAE 303
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1330 QQTADQLLARAdaakalaeeaakkgRNTLQEANDILNNLKdfdRRVNDNKTAAEEALRRipainrTIAEANEKTREAQLA 1409
Cdd:COG4913 304 LARLEAELERL--------------EARLDALREELDELE---AQIRGNGGDRLEQLER------EIERLERELEERERR 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1410 LGNAAADAT----EAKNKAHEAERIASGVQKNATSTKADAERTFSEVTDLDNEVNGMLRQLEEAE---NELKRKQDDADQ 1482
Cdd:COG4913 361 RARLEALLAalglPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEaeiASLERRKSNIPA 440
|
...
gi 281371490 1483 DMM 1485
Cdd:COG4913 441 RLL 443
|
|
| CENP-F_leu_zip |
pfam10473 |
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ... |
1462-1568 |
3.72e-03 |
|
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.
Pssm-ID: 463102 [Multi-domain] Cd Length: 140 Bit Score: 39.59 E-value: 3.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1462 MLRQLEEAENE---LKRKQDDADQDMMMagmaSQAAQE-AELNARKAKNSVSSLLSQLTDLLEQLGQLdTVDLNKLNEIE 1537
Cdd:pfam10473 8 VLEKLKESERKadsLKDKVENLERELEM----SEENQElAILEAENSKAEVETLKAEIEEMAQNLRDL-ELDLVTLRSEK 82
|
90 100 110
....*....|....*....|....*....|...
gi 281371490 1538 GSLNKAKDEM--KVSDLDRKVSNLENEARKQEA 1568
Cdd:pfam10473 83 ENLTKELQKKqeRVSELESLNSSLENLLEEKEQ 115
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
1079-1566 |
4.35e-03 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 41.99 E-value: 4.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1079 EVTDLLREAQevKDVDQNLMDRLQRVNNSLHSQISRLH----NIRNTIEETGILAER---ARSRVESTEQLIEIASRELE 1151
Cdd:COG5022 932 RLKKLLNNID--LEEGPSIEYVKLPELNKLHEVESKLKetseEYEDLLKKSTILVREgnkANSELKNFKKELAELSKQYG 1009
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1152 KAKMAAANVSITQPESTGEPNNMTLLAEEarRLAERHKQEADDIVRVA-KTANETSAEAYNLLLRTlagENQTALEieel 1230
Cdd:COG5022 1010 ALQESTKQLKELPVEVAELQSASKIISSE--STELSILKPLQKLKGLLlLENNQLQARYKALKLRR---ENSLLDD---- 1080
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1231 nrkyeqaknisqDLEKQAARVHEEAKragdkavEIYASVAQLTPVDSEALESEANKIKKEAADLDRLID------QKLKD 1304
Cdd:COG5022 1081 ------------KQLYQLESTENLLK-------TINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEiskflsQLVNT 1141
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1305 YEDLREDMRGKEHEVKNLLEKGKAEQQTAdqllaradaakalaeeaaKKGRNTLQEANDILNNLKDF-----DRRVNDNK 1379
Cdd:COG5022 1142 LEPVFQKLSVLQLELDGLFWEANLEALPS------------------PPPFAALSEKRLYQSALYDEksklsSSEVNDLK 1203
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1380 TAAEEALRRIP-------AINRTIAEANEKTREAQLALGNAAAD---ATEAKNKAHEAERIASGVQKNATSTKADAERTF 1449
Cdd:COG5022 1204 NELIALFSKIFsgwprgdKLKKLISEGWVPTEYSTSLKGFNNLNkkfDTPASMSNEKLLSLLNSIDNLLSSYKLEEEVLP 1283
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1450 SEVTDLDNEVNGMLRQ-LEEAENELKRKQddadqdmmmagmASQAAQEAELNARKAKNS-VSSLLSQLTDLLE--QLGQL 1525
Cdd:COG5022 1284 ATINSLLQYINVGLFNaLRTKASSLRWKS------------ATEVNYNSEELDDWCREFeISDVDEELEELIQavKVLQL 1351
|
490 500 510 520
....*....|....*....|....*....|....*....|..
gi 281371490 1526 DTVDLNKLNEIEGSLNKAKD-EMKVSDLDRKVSNLENEARKQ 1566
Cdd:COG5022 1352 LKDDLNKLDELLDACYSLNPaEIQNLKSRYDPADKENNLPKE 1393
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1235-1495 |
5.05e-03 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 40.98 E-value: 5.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1235 EQAKNISQDLEKQAARVHEEAKRAGDKAVEiyasvAQltpvdsEALESEANKIKKEAADLDRliDQKLKDYEDLREDMRG 1314
Cdd:TIGR02794 50 QQANRIQQQKKPAAKKEQERQKKLEQQAEE-----AE------KQRAAEQARQKELEQRAAA--EKAAKQAEQAAKQAEE 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1315 KEHEVKNLLEKGKAEQQTADQLLARadaakalaeeaakkgrntlQEANDILNNLKDFDRRVN---DNKTAAEEALRRIPA 1391
Cdd:TIGR02794 117 KQKQAEEAKAKQAAEAKAKAEAEAE-------------------RKAKEEAAKQAEEEAKAKaaaEAKKKAEEAKKKAEA 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1392 INRTIAEAnektrEAQLALGNAAADATEAKNKAhEAERIASGVQKNATSTKADAERTFSEVTDLDnevngmLRQLEEAEN 1471
Cdd:TIGR02794 178 EAKAKAEA-----EAKAKAEEAKAKAEAAKAKA-AAEAAAKAEAEAAAAAAAEAERKADEAELGD------IFGLASGSN 245
|
250 260
....*....|....*....|....
gi 281371490 1472 elKRKQDDADQDMMMAGMASQAAQ 1495
Cdd:TIGR02794 246 --AEKQGGARGAAAGSEVDKYAAI 267
|
|
| FlgN |
pfam05130 |
FlgN protein; This family includes the FlgN protein and export chaperone involved in flagellar ... |
1048-1121 |
5.19e-03 |
|
FlgN protein; This family includes the FlgN protein and export chaperone involved in flagellar synthesis.
Pssm-ID: 428323 [Multi-domain] Cd Length: 140 Bit Score: 38.89 E-value: 5.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1048 QELESLIANLGTGDETVTDQAFE------DRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNT 1121
Cdd:pfam05130 55 KERRELLAELGLSPEEATLSELLakeeedPELRELWQELLELLERLKELNELNGELIEQSLEFNNRSLNILQGAANGSNT 134
|
|
| Lebercilin |
pfam15619 |
Ciliary protein causing Leber congenital amaurosis disease; Lebercilin is a family of ... |
1179-1296 |
5.20e-03 |
|
Ciliary protein causing Leber congenital amaurosis disease; Lebercilin is a family of eukaryotic ciliary proteins. Mutations in the gene, LCA5, are implicated in the disease Leber congenital amaurosis. In photoreceptors, lebercilin is uniquely localized at the cilium that bridges the inner and outer segments. Lebercilin functions as an integral element of selective protein transport through photoreceptor cilia. Lebercilin specifically interacts with the intraflagellar transport (IFT), and disruption of IFT can lead to Leber congenital amaurosis.
Pssm-ID: 464776 [Multi-domain] Cd Length: 193 Bit Score: 39.89 E-value: 5.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1179 EEARRLAERHKQEADDIVRVAKTANETSAEAYNLL-----LRTLAgENQTALEIEELNRK-------YEQAKNISQDLEK 1246
Cdd:pfam15619 67 EEVRVLRERLRRLQEKERDLERKLKEKEAELLRLRdqlkrLEKLS-EDKNLAEREELQKKleqleakLEDKDEKIQDLER 145
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1247 Q-----AARVHEEAkRAGDKAVEIYASVAQltpvdseaLESEANKIKKEAADLDR 1296
Cdd:pfam15619 146 KlelenKSFRRQLA-AEKKKHKEAQEEVKI--------LQEEIERLQQKLKEKER 191
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1177-1533 |
5.29e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 41.44 E-value: 5.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1177 LAEEARRLAERHKQEADDI--VRVAKTANETSAEAYNLLLRTLAGEN---QTALEIEELNRKYEQAKNISQDLEKQAARV 1251
Cdd:COG4913 615 LEAELAELEEELAEAEERLeaLEAELDALQERREALQRLAEYSWDEIdvaSAEREIAELEAELERLDASSDDLAALEEQL 694
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1252 hEEAKRAGDKAVEiyasvaqltpvDSEALESEANKIKKEAADLDRLIDQKLKDYEDLreDMRGKEHEVKNLLEKGKAEQQ 1331
Cdd:COG4913 695 -EELEAELEELEE-----------ELDELKGEIGRLEKELEQAEEELDELQDRLEAA--EDLARLELRALLEERFAAALG 760
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1332 TADQLLAradaakalaeeaakkgRNTLQEANDILNNLKDfdrrvndnktAAEEALRRIpainrtIAEANEKTREAQLALG 1411
Cdd:COG4913 761 DAVEREL----------------RENLEERIDALRARLN----------RAEEELERA------MRAFNREWPAETADLD 808
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1412 NAAADATE--------AKNKAHEAE-RIASGVQKNATSTKAD----AERTFSEVTDLDNEVNGMLRQLEEAEN-----EL 1473
Cdd:COG4913 809 ADLESLPEylalldrlEEDGLPEYEeRFKELLNENSIEFVADllskLRRAIREIKERIDPLNDSLKRIPFGPGrylrlEA 888
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 281371490 1474 KRKQDDAD---QDMMMAgmASQAAQEAELNARKAK-NSVSSLLSQLTDlleQLGQLDTVDLNKL 1533
Cdd:COG4913 889 RPRPDPEVrefRQELRA--VTSGASLFDEELSEARfAALKRLIERLRS---EEEESDRRWRARV 947
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1303-1507 |
5.75e-03 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 41.12 E-value: 5.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1303 KDYEDLREDM--RGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAakkgrnTLQEANdilnnlkdfdrrvndnKT 1380
Cdd:PRK07735 5 KDLEDLKKEAarRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDM------TIEEAK----------------RR 62
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1381 AAEEALRRIPAINRTIAEANEKTREAQLALGNAAAdATEAKNKAHEAERI-------ASGVQKNATSTKADAERTfsevt 1453
Cdd:PRK07735 63 AAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKA-AAAAKAKAAALAKQkregteeVTEEEKAAAKAKAAAAAK----- 136
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1454 dldNEVNGMLRQLEEAENELKRKQDDADQDMMMAGMASQA-AQEAELNARKAKNS 1507
Cdd:PRK07735 137 ---AKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAkAKAAALAKQKAAEA 188
|
|
| Tropomyosin |
pfam00261 |
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ... |
1067-1295 |
6.71e-03 |
|
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.
Pssm-ID: 459736 [Multi-domain] Cd Length: 235 Bit Score: 40.01 E-value: 6.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1067 QAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNNSLHSQISRLHNIRNTIEEtgilAERARSRVESTEQLIEIA 1146
Cdd:pfam00261 18 KEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEELERTEERLAEALEKLEEAEKAADE----SERGRKVLENRALKDEEK 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1147 SRELEkakmAAANVSITQPESTGEPnnmtlLAEEARRLAERhKQEADDIVRVAKTANETSAEAYNLL------LRTLAGE 1220
Cdd:pfam00261 94 MEILE----AQLKEAKEIAEEADRK-----YEEVARKLVVV-EGDLERAEERAELAESKIVELEEELkvvgnnLKSLEAS 163
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1221 NQTALEIEELNRkyEQAKNISQDLeKQAARVHEEAKRagdKAVEIYASVAQLtpvdSEALESEANKIKKEAADLD 1295
Cdd:pfam00261 164 EEKASEREDKYE--EQIRFLTEKL-KEAETRAEFAER---SVQKLEKEVDRL----EDELEAEKEKYKAISEELD 228
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1036-1344 |
6.99e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 41.18 E-value: 6.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1036 VKDKVAEHRVKLQELESLIAN----LGTGD-----ETVTDQAFEDRLKEAEREVTDLLREAQEVKDVDQNLMDRLQRVNN 1106
Cdd:PRK02224 424 LREREAELEATLRTARERVEEaealLEAGKcpecgQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAED 503
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1107 --SLHSQISRLHNIRNTIEEtgiLAERARSRVESTEQLIEiasrELEKAKmaaanvsiTQPESTGEPNNMTllAEEARRL 1184
Cdd:PRK02224 504 lvEAEDRIERLEERREDLEE---LIAERRETIEEKRERAE----ELRERA--------AELEAEAEEKREA--AAEAEEE 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1185 AERHKQEADDIVRvAKTANETSAEAYNLLLRTLAGENQTALEIEELNRKYEQAknisQDLEKQAaRVHEEAKRagDKAVE 1264
Cdd:PRK02224 567 AEEAREEVAELNS-KLAELKERIESLERIRTLLAAIADAEDEIERLREKREAL----AELNDER-RERLAEKR--ERKRE 638
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1265 IYASVaqltpvDSEALEsEANKIKKEAADLDRLIDQKLKDYEDLREDMRGKEHEVKNLLEKGKAEQQTADQLLARADAAK 1344
Cdd:PRK02224 639 LEAEF------DEARIE-EAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEELRERREALENRVEALE 711
|
|
| Tar |
COG0840 |
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms]; |
1153-1535 |
8.07e-03 |
|
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
Pssm-ID: 440602 [Multi-domain] Cd Length: 533 Bit Score: 40.77 E-value: 8.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1153 AKMAAANVSITQPESTGEPNNMTLLAEEARRLAERHKQEADDIVRVAKTANETSAEAYNLLLRTLAGENQTALEIEELnR 1232
Cdd:COG0840 6 LLLALLLALLLLALSLLALLAAALLILLALLLAALTALALLLLLSLLALLLLLLLLALALLLVLLALLLLLALVVLLA-L 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1233 KYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQLTPVDSEALESEANKIKKEAADLDRLIDQKLKDYEDLREDM 1312
Cdd:COG0840 85 LLALLLLLLALLALALAALALLAALAALLALLELLLAALLAALAIALLALAALLALAALALALLALALLAAAAAAAAALA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1313 RGKEHEVKNLLEKGKAEQQTADQLLARADAAKALAEEAAKKGRNTLQEANDILNNLKDFDRRVNDNKTAAEE--ALRRip 1390
Cdd:COG0840 165 ALLEAAALALAAAALALALLAAALLALVALAIILALLLSRSITRPLRELLEVLERIAEGDLTVRIDVDSKDEigQLAD-- 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1391 AINRTIAEANEKTREAQLALGNAAADATEAknkAHEAERIASGVQKNATSTkadaERTFSEVTDLDNEVNGMlrqleeAE 1470
Cdd:COG0840 243 AFNRMIENLRELVGQVRESAEQVASASEEL---AASAEELAAGAEEQAASL----EETAAAMEELSATVQEV------AE 309
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 281371490 1471 NelkrkqddadqdmmmAGMASQAAQEAELNARKAKNSVSSLLSQLTDLLEQLGQLDTVdLNKLNE 1535
Cdd:COG0840 310 N---------------AQQAAELAEEASELAEEGGEVVEEAVEGIEEIRESVEETAET-IEELGE 358
|
|
| EGF_Lam |
smart00180 |
Laminin-type epidermal growth factor-like domai; |
284-326 |
8.91e-03 |
|
Laminin-type epidermal growth factor-like domai;
Pssm-ID: 214543 Cd Length: 46 Bit Score: 35.75 E-value: 8.91e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 281371490 284 CKCNGHASECVKNEFGKLVCNCKHNTYGVDCEKCLPFFNDRPW 326
Cdd:smart00180 1 CDCDPGGSASGTCDPDTGQCECKPNVTGRRCDRCAPGYYGDGP 43
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1226-1335 |
9.16e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 39.52 E-value: 9.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 281371490 1226 EIEELNRKYEQAKNISQDLEKQAARVHEEAKRAGDKAVEIYASVAQL-TPVDSEALESEANKIKKEAADLDRLIDQKLKD 1304
Cdd:COG1579 39 ELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrNNKEYEALQKEIESLKRRISDLEDEILELMER 118
|
90 100 110
....*....|....*....|....*....|.
gi 281371490 1305 YEDLREDMRGKEHEVKNLLEKGKAEQQTADQ 1335
Cdd:COG1579 119 IEELEEELAELEAELAELEAELEEKKAELDE 149
|
|
|