|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2041-2261 |
7.14e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors. :
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.14e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2041 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2120
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2121 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2200
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720434366 2201 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2261
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| COG4625 super family |
cl34793 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
944-1452 |
4.30e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown]; The actual alignment was detected with superfamily member COG4625:
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 944 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1023
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1024 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1103
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1104 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1183
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1184 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1263
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1264 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1343
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1344 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1423
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 1720434366 1424 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1452
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| MSCRAMM_ClfA super family |
cl41352 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
333-745 |
4.61e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif. The actual alignment was detected with superfamily member NF033609:
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 4.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 333 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 412
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 413 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 492
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 493 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 572
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 573 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 652
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 653 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 728
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 1720434366 729 LALASSQAEALLGARNK 745
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| WEMBL super family |
cl25644 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
46-190 |
2.72e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like". The actual alignment was detected with superfamily member pfam05701:
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 42.71 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 46 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 122
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720434366 123 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 190
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2041-2261 |
7.14e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors.
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.14e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2041 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2120
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2121 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2200
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720434366 2201 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2261
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
944-1452 |
4.30e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 944 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1023
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1024 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1103
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1104 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1183
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1184 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1263
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1264 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1343
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1344 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1423
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 1720434366 1424 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1452
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
333-745 |
4.61e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 4.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 333 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 412
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 413 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 492
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 493 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 572
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 573 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 652
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 653 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 728
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 1720434366 729 LALASSQAEALLGARNK 745
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
302-651 |
4.77e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 4.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 302 VAKIETEATSSATMDGGKDANAKAMTDVNVTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPD 381
Cdd:NF033609 554 IEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 382 AGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTC 461
Cdd:NF033609 634 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 462 AQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTATTSS 541
Cdd:NF033609 714 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 542 AQTNvTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQS 621
Cdd:NF033609 794 SDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNV 872
|
330 340 350
....*....|....*....|....*....|
gi 1720434366 622 VPASQGTALTGTKTKVKGNSSAVSKPDAGA 651
Cdd:NF033609 873 VPPNSPKNGTNASNKNEAKDSKEPLPDTGS 902
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
204-540 |
6.58e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.37 E-value: 6.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 204 NDMSVVVTGVDTKSCAQSQAVAIIQNDDMAGAEADK-EDLKNMSKAGSGIDMKVPGQPHIAANNLAEAVPGAKNDAWDNA 282
Cdd:NF033609 571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSaSDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 283 KDICEAEADIRTcliQSETVAKIETEATSSATMDGGKDANAKAMTDVNV---TDTQPQAVTSDQTEAMPDAKVKGKGNAS 359
Cdd:NF033609 651 DSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSD 727
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 360 AMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTK 439
Cdd:NF033609 728 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 807
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 440 GKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTK 519
Cdd:NF033609 808 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNK 887
|
330 340
....*....|....*....|..
gi 1720434366 520 NKVKGNPNPTPKTEA-GTATTS 540
Cdd:NF033609 888 NEAKDSKEPLPDTGSeDEANTS 909
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
391-734 |
1.01e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 1.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 391 NVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGE-------ALPNTKGKARGKAKAKCKAAAGTDTKTCAQ 463
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTPTPNATSPTPAVTTPTPNATS 536
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 464 PQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQP---VANCQNEALPGTKNKVKGNPNPTPKTEAGTATTS 540
Cdd:pfam05109 537 PTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPtsaVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTP 616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 541 SAQTNVTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQ 620
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVST 696
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 621 SVPASQGTalTGTKTKVKGNSSAVSKPDAGAGTMGS----AHAKTAANSQGETLPG-SKNKVKGNSNAVPKAEAGAGTTE 695
Cdd:pfam05109 697 SSPAPRPG--TTSQASGPGNSSTSTKPGEVNVTKGTppknATSPQAPSGQKTAVPTvTSTGGKANSTTGGKHTTGHGART 774
|
330 340 350
....*....|....*....|....*....|....*....
gi 1720434366 696 PNQPQAEAllgarnkvkGNSNSVPKAESGASTILALASS 734
Cdd:pfam05109 775 STEPTTDY---------GGDSTTPRTRYNATTYLPPSTS 804
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1021-1307 |
1.71e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 1.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1021 QASGVSWVVDQATGtwtvaENQTGAVSWAgAGNIVSIGYWTgavdqTNAVSwTGTTDQVGVEVKPRFEDQASEKGSWVVA 1100
Cdd:NF033849 279 HGSTRGWSHTQSTS-----ESESTGQSSS-VGTSESQSHGT-----TEGTS-TTDSSSHSQSSSYNVSSGTGVSSSHSDG 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1101 GVQTSGETRLGSEDQSSGRSWTETVdqanaaSRLGTVDQAGGTSW-AGTGDQVGGVSTSGSADQSSSGSwagtrNLAGER 1179
Cdd:NF033849 347 TSQSTSISHSESSSESTGTSVGHST------SSSVSSSESSSRSSsSGVSGGFSGGIAGGGVTSEGLGA-----SQGGSE 415
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1180 SW-TGTGDQSDGAAKPGFENQTSDEGsWAGTIGQPSGGSKSVSEAQsaGRSWADSADQLSGgflvgpldQANGESQPVSg 1258
Cdd:NF033849 416 GWgSGDSVQSVSQSYGSSSSTGTSSG-HSDSSSHSTSSGQADSVSQ--GTSWSEGTGTSQG--------QSVGTSESWS- 483
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1720434366 1259 elaASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGG 1307
Cdd:NF033849 484 ---TSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
968-1237 |
2.84e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 968 TTDGTWVADKASAGSWTGAENQISAG---SWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTG 1044
Cdd:NF033849 281 STRGWSHTQSTSESESTGQSSSVGTSesqSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1045 AVSWAGAGNIVSIGYWTGAVDQTNAvSWtGTTDQVGVEVKP------RFEDQASEKGSWVVA-GVQTSGETRLGSEDQSS 1117
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSS-SS-GVSGGFSGGIAGggvtseGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGT 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1118 GRSWTETVDQANAASRLGTVDQAGGTSwAGTGDQVGgVSTSGSADQSSSGSWAGTRNlageRSWTGTGDQSDGAAKPGFE 1197
Cdd:NF033849 439 SSGHSDSSSHSTSSGQADSVSQGTSWS-EGTGTSQG-QSVGTSESWSTSQSETDSVG----DSTGTSESVSQGDGRSTGR 512
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 1720434366 1198 NQTSDEG-SWAGTIGQPSGGSKSVSEAQSAGRSWA---DSADQL 1237
Cdd:NF033849 513 SESQGTSlGTSGGRTSGAGGSMGLGPSISLGKSYQwedDPAILL 556
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
299-573 |
3.11e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 299 SETVAKIETEATSSATMDGGKDANAKAMTDvnvTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDA 378
Cdd:NF033609 646 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 379 LPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDT 458
Cdd:NF033609 723 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 459 KTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTAT 538
Cdd:NF033609 803 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGT 882
|
250 260 270
....*....|....*....|....*....|....*
gi 1720434366 539 TSSaqtnvtssqgettpgAKNKAKGNRNSVPKAGA 573
Cdd:NF033609 883 NAS---------------NKNEAKDSKEPLPDTGS 902
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
526-699 |
6.53e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 45.08 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 526 PNPTPKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKgnrNSVPKAGAGPDTT----GSAQ-SQTVANSHSEALPGAKNK 600
Cdd:PRK08691 394 PQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEN---NDVPPWEDAPDEAqtaaGTAQtSAKSIQTASEAETPPENQ 470
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 601 VKSNSNVVPKAEAGVGACPQSVPASqgtaltgtktkvkgnSSAVSKPdagAGTMGSAHAKTAANSQGETLPGSKNKVKGN 680
Cdd:PRK08691 471 VSKNKAADNETDAPLSEVPSENPIQ---------------ATPNDEA---VETETFAHEAPAEPFYGYGFPDNDCPPEDG 532
|
170 180
....*....|....*....|....*..
gi 1720434366 681 --------SNAVPKAEAGAGTTEPNQP 699
Cdd:PRK08691 533 aeipppdwEHAAPADTAGGGADEEAEA 559
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
46-190 |
2.72e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 42.71 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 46 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 122
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720434366 123 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 190
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Arm_2 |
pfam04826 |
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain ... |
2041-2261 |
7.14e-87 |
|
Armadillo-like; This domain contains armadillo-like repeats. Proteins containing this domain interact with numerous other proteins, through these interactions they are involved in a wide variety of processes including carcinogenesis, control of cellular ageing and survival, regulation of circadian rhythm and lysosomal sorting of G protein-coupled receptors.
Pssm-ID: 461447 [Multi-domain] Cd Length: 221 Bit Score: 283.15 E-value: 7.14e-87
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2041 LDPHDLEKLICMIEMTEDPSVHEIATNALYNSADYPYPQEIDRNIGGISVIQSLLSNPYPNVRQKALNALNNISVAAENH 2120
Cdd:pfam04826 1 LEPQELDKLLALLKSSEDPFIHEIALITLGNSAAYPFNQDIIRDLGGIPIIANLLSDSNPEVKEKALNALNNLSMNVENQ 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 2121 RTVKTYLSQVCEDTVTYPLNSNVQVAGLRLIKHLTITSEYQHMVTNYISEFLRLLALGSGETKDHVLGMLVNFSKNPSMT 2200
Cdd:pfam04826 81 KKIKVYVNQVCKDIVSSPLNSEVQLAGLKLLTNLSVTNDYHHLVVSYIRDFFSLLSQGNEKTKFQVLKVLLNLSENPAMT 160
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720434366 2201 RDLLIANAPTALINIFSKKETKENILNALLLFENINRHFKKRGKTYPQDRFSKTSLYFLFQ 2261
Cdd:pfam04826 161 RELLSAQVPSSLLSLFNSKEAKENLLRALTIFENINFHLKKEAKLFVKEHFTKGSLFSIFR 221
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
944-1452 |
4.30e-05 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 49.01 E-value: 4.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 944 NEVALGSWVSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQAS 1023
Cdd:COG4625 11 GGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGT 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1024 GVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQ 1103
Cdd:COG4625 91 GGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGG 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1104 TSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTG 1183
Cdd:COG4625 171 GGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGG 250
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1184 TGDQSDGAAKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAAS 1263
Cdd:COG4625 251 GGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 330
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1264 GVDQTSGGGcwtGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQDVGGPKPVHMNQAT 1343
Cdd:COG4625 331 GGGAGGGGG---SGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1344 GGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSrpavsedqssGGVSWGGAGGHVIGGSRTDQ 1423
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAG----------GSGGGAGAGGGSGSGAGTLT 477
|
490 500
....*....|....*....|....*....
gi 1720434366 1424 SSGTSWPGMGSQVSGGswiGSVDQTSGCT 1452
Cdd:COG4625 478 LTGNNTYTGTTTVNGG---GNYTQSAGST 503
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
333-745 |
4.61e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 4.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 333 DTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVA 412
Cdd:NF033609 553 EIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSAS 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 413 KDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKA 492
Cdd:NF033609 633 DSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 712
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 493 GAKADQKASGQpqpvancqnealpgtknkvkgnpnptpkteaGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSVPKAG 572
Cdd:NF033609 713 DSDSDSDSDSD-------------------------------SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 761
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 573 AGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQSVPASQGTALTGTKTKVKGNSSAVSKPDAGAG 652
Cdd:NF033609 762 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 841
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 653 TMGSAHAKTAANSQGETLPGSKNKVKGNSNAVPkaeagagttePNQPQAEALLGARNKVKGNSNSVP----KAESGASTI 728
Cdd:NF033609 842 SDSDSDSDSDSDSDSESDSNSDSESGSNNNVVP----------PNSPKNGTNASNKNEAKDSKEPLPdtgsEDEANTSLI 911
|
410
....*....|....*..
gi 1720434366 729 LALASSQAEALLGARNK 745
Cdd:NF033609 912 WGLLASLGSLLLFRRKK 928
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
302-651 |
4.77e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.75 E-value: 4.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 302 VAKIETEATSSATMDGGKDANAKAMTDVNVTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDALPD 381
Cdd:NF033609 554 IEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASD 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 382 AGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDTKTC 461
Cdd:NF033609 634 SDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 462 AQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTATTSS 541
Cdd:NF033609 714 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 542 AQTNvTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQS 621
Cdd:NF033609 794 SDSD-SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNV 872
|
330 340 350
....*....|....*....|....*....|
gi 1720434366 622 VPASQGTALTGTKTKVKGNSSAVSKPDAGA 651
Cdd:NF033609 873 VPPNSPKNGTNASNKNEAKDSKEPLPDTGS 902
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
204-540 |
6.58e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 48.37 E-value: 6.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 204 NDMSVVVTGVDTKSCAQSQAVAIIQNDDMAGAEADK-EDLKNMSKAGSGIDMKVPGQPHIAANNLAEAVPGAKNDAWDNA 282
Cdd:NF033609 571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSaSDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 283 KDICEAEADIRTcliQSETVAKIETEATSSATMDGGKDANAKAMTDVNV---TDTQPQAVTSDQTEAMPDAKVKGKGNAS 359
Cdd:NF033609 651 DSDSDSDSDSDS---DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSD 727
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 360 AMAKAGAKANTKTSSQTDALPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTK 439
Cdd:NF033609 728 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 807
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 440 GKARGKAKAKCKAAAGTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTK 519
Cdd:NF033609 808 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNK 887
|
330 340
....*....|....*....|..
gi 1720434366 520 NKVKGNPNPTPKTEA-GTATTS 540
Cdd:NF033609 888 NEAKDSKEPLPDTGSeDEANTS 909
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
391-734 |
1.01e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 1.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 391 NVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGE-------ALPNTKGKARGKAKAKCKAAAGTDTKTCAQ 463
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTeskapdmTSPTSAVTTPTPNATSPTPAVTTPTPNATS 536
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 464 PQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQP---VANCQNEALPGTKNKVKGNPNPTPKTEAGTATTS 540
Cdd:pfam05109 537 PTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPtsaVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTP 616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 541 SAQTNVTSSQGETTPGAKNKAKGNRNSVPKAGAGPDTTGSAQSQTVANSHSEALPGAKNKVKSNSNVVPKAEAGVGACPQ 620
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVST 696
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 621 SVPASQGTalTGTKTKVKGNSSAVSKPDAGAGTMGS----AHAKTAANSQGETLPG-SKNKVKGNSNAVPKAEAGAGTTE 695
Cdd:pfam05109 697 SSPAPRPG--TTSQASGPGNSSTSTKPGEVNVTKGTppknATSPQAPSGQKTAVPTvTSTGGKANSTTGGKHTTGHGART 774
|
330 340 350
....*....|....*....|....*....|....*....
gi 1720434366 696 PNQPQAEAllgarnkvkGNSNSVPKAESGASTILALASS 734
Cdd:pfam05109 775 STEPTTDY---------GGDSTTPRTRYNATTYLPPSTS 804
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1021-1307 |
1.71e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.92 E-value: 1.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1021 QASGVSWVVDQATGtwtvaENQTGAVSWAgAGNIVSIGYWTgavdqTNAVSwTGTTDQVGVEVKPRFEDQASEKGSWVVA 1100
Cdd:NF033849 279 HGSTRGWSHTQSTS-----ESESTGQSSS-VGTSESQSHGT-----TEGTS-TTDSSSHSQSSSYNVSSGTGVSSSHSDG 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1101 GVQTSGETRLGSEDQSSGRSWTETVdqanaaSRLGTVDQAGGTSW-AGTGDQVGGVSTSGSADQSSSGSwagtrNLAGER 1179
Cdd:NF033849 347 TSQSTSISHSESSSESTGTSVGHST------SSSVSSSESSSRSSsSGVSGGFSGGIAGGGVTSEGLGA-----SQGGSE 415
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1180 SW-TGTGDQSDGAAKPGFENQTSDEGsWAGTIGQPSGGSKSVSEAQsaGRSWADSADQLSGgflvgpldQANGESQPVSg 1258
Cdd:NF033849 416 GWgSGDSVQSVSQSYGSSSSTGTSSG-HSDSSSHSTSSGQADSVSQ--GTSWSEGTGTSQG--------QSVGTSESWS- 483
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1720434366 1259 elaASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGG 1307
Cdd:NF033849 484 ---TSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
968-1237 |
2.84e-04 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 46.15 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 968 TTDGTWVADKASAGSWTGAENQISAG---SWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTG 1044
Cdd:NF033849 281 STRGWSHTQSTSESESTGQSSSVGTSesqSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1045 AVSWAGAGNIVSIGYWTGAVDQTNAvSWtGTTDQVGVEVKP------RFEDQASEKGSWVVA-GVQTSGETRLGSEDQSS 1117
Cdd:NF033849 361 ESTGTSVGHSTSSSVSSSESSSRSS-SS-GVSGGFSGGIAGggvtseGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGT 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1118 GRSWTETVDQANAASRLGTVDQAGGTSwAGTGDQVGgVSTSGSADQSSSGSWAGTRNlageRSWTGTGDQSDGAAKPGFE 1197
Cdd:NF033849 439 SSGHSDSSSHSTSSGQADSVSQGTSWS-EGTGTSQG-QSVGTSESWSTSQSETDSVG----DSTGTSESVSQGDGRSTGR 512
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 1720434366 1198 NQTSDEG-SWAGTIGQPSGGSKSVSEAQSAGRSWA---DSADQL 1237
Cdd:NF033849 513 SESQGTSlGTSGGRTSGAGGSMGLGPSISLGKSYQwedDPAILL 556
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
299-573 |
3.11e-04 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 46.06 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 299 SETVAKIETEATSSATMDGGKDANAKAMTDvnvTDTQPQAVTSDQTEAMPDAKVKGKGNASAMAKAGAKANTKTSSQTDA 378
Cdd:NF033609 646 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 722
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 379 LPDAGDKNRSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEALPNTKGKARGKAKAKCKAAAGTDT 458
Cdd:NF033609 723 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 802
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 459 KTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGQPQPVANCQNEALPGTKNKVKGNPNPTPKTEAGTAT 538
Cdd:NF033609 803 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGT 882
|
250 260 270
....*....|....*....|....*....|....*
gi 1720434366 539 TSSaqtnvtssqgettpgAKNKAKGNRNSVPKAGA 573
Cdd:NF033609 883 NAS---------------NKNEAKDSKEPLPDTGS 902
|
|
| PRK08691 |
PRK08691 |
DNA polymerase III subunits gamma and tau; Validated |
526-699 |
6.53e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236333 [Multi-domain] Cd Length: 709 Bit Score: 45.08 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 526 PNPTPKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKgnrNSVPKAGAGPDTT----GSAQ-SQTVANSHSEALPGAKNK 600
Cdd:PRK08691 394 PQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQEN---NDVPPWEDAPDEAqtaaGTAQtSAKSIQTASEAETPPENQ 470
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 601 VKSNSNVVPKAEAGVGACPQSVPASqgtaltgtktkvkgnSSAVSKPdagAGTMGSAHAKTAANSQGETLPGSKNKVKGN 680
Cdd:PRK08691 471 VSKNKAADNETDAPLSEVPSENPIQ---------------ATPNDEA---VETETFAHEAPAEPFYGYGFPDNDCPPEDG 532
|
170 180
....*....|....*....|....*..
gi 1720434366 681 --------SNAVPKAEAGAGTTEPNQP 699
Cdd:PRK08691 533 aeipppdwEHAAPADTAGGGADEEAEA 559
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
892-1076 |
2.28e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.20 E-value: 2.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 892 DWENSVSWTEDESGATIGPWTGAANdkAGLVSSWAVACDESSIKSWTGARPENEVALGSWVSAGDQATGAIWAGAQTTDG 971
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGA--AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTST 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 972 TWVADKASAGSWTGAenqisagswvvsgnqaiagPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTGAVSWAGA 1051
Cdd:COG3469 79 TATATAAAAAATSTS-------------------ATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAS 139
|
170 180
....*....|....*....|....*
gi 1720434366 1052 GNIVSIGYWTGAVDQTNAVSWTGTT 1076
Cdd:COG3469 140 ATSSAGSTTTTTTVSGTETATGGTT 164
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
46-190 |
2.72e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 42.71 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 46 LEEEIE-TQSETsslvetmvmtEAVTLTESTSQAKEVTMKEAVTQTDAEAEavgkkeavtqtKAKAWAMAGRAEVKK--E 122
Cdd:pfam05701 354 LEAELNrTKSEI----------ALVQAKEKEAREKMVELPKQLQQAAQEAE-----------EAKSLAQAAREELRKakE 412
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720434366 123 AMTQTKAEARTlaeketeinrvtvTQSEVLAVTKEVVKIGTMNETGVVAeamIRPLEETVSVTRTQSE 190
Cdd:pfam05701 413 EAEQAKAAAST-------------VESRLEAVLKEIEAAKASEKLALAA---IKALQESESSAESTNQ 464
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
276-623 |
4.69e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 4.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 276 NDAWDNAKDICEAEADIRTCLIQSETVAKIETEATSSATMDG-GKDANAKAMTDVNVTDTQPqavTSDQTEAMPDAKVKG 354
Cdd:pfam05109 497 NGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTlGKTSPTSAVTTPTPNATSP---TPAVTTPTPNATIPT 573
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 355 KGNASAMAKAGAKANTKTSsqtdalPDAGDKNrSDNNVTAKAETGIDMVSCTQTEPVAKDNANTTSKEGAQATGQSQGEA 434
Cdd:pfam05109 574 LGKTSPTSAVTTPTPNATS------PTVGETS-PQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 435 LPNTKGKARGKAKAKCKAAAgTDTKTCAQPQAGTKTQAEALSDSKVDSKSDSNGVSKAGAKADQKASGqpqpvaNCQNEA 514
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSH-MPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPG------NSSTST 719
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 515 LPGTKNKVKGNP--NPT-PKTEAGTATTSSAQTNVTSSQGETTPGAKNKAKGNRNSV-PKAGAGPDTTgsaQSQTVANSH 590
Cdd:pfam05109 720 KPGEVNVTKGTPpkNATsPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTePTTDYGGDST---TPRTRYNAT 796
|
330 340 350
....*....|....*....|....*....|....*..
gi 1720434366 591 SEALPGAKNKVKS----NSNVVPKAEAGVGACPQSVP 623
Cdd:pfam05109 797 TYLPPSTSSKLRPrwtfTSPPVTTAQATVPVPPTSQP 833
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
997-1490 |
7.52e-03 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 41.69 E-value: 7.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 997 VSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTT 1076
Cdd:COG4625 3 GGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGG 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1077 DQVGVEVKPRFEDQASEKGSWVVAGVQTSGETRLGSEDQSSGRSWTETVDQANAASRLGTVDQAGGTSWAGTGDQVGGVS 1156
Cdd:COG4625 83 GGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGG 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1157 TSGSADQSSSGSWAGTRNLAGERSWTGTGDQSDGAAKPGFENQTSDEGSwAGTIGQPSGGSKSVSEAQSAGRSWADSADQ 1236
Cdd:COG4625 163 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGG-GGAGGGGGGGGGGGGGGGGGGGGGGGGGGG 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1237 LSGGFLVGPLDQANGESQPVSGELAASGVDQTSGGGCWTGSGDQSGGESRLGPRDQSNGESWPGTGDQTGGWYCTYTGTQ 1316
Cdd:COG4625 242 GGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1317 TIGGGPWVGVSGQDVGGPKPVHMNQATGGAWLGTGTQVSAVSWTGDQVGGCSKPGFEDQAIGGGFWAGAGDQTTGGSRPA 1396
Cdd:COG4625 322 GGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGG 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1397 VSEDQSSGGVSWGGAGGHVIGGSRTDQSSGTSWPGMGSQVSGGSWIGSVDQTSGcTKSGFEDQTCAGGSWVGTGEQTSGD 1476
Cdd:COG4625 402 GGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGG-AGGSGGGAGAGGGSGSGAGTLTLTG 480
|
490
....*....|....
gi 1720434366 1477 SWPGSRASNEASGG 1490
Cdd:COG4625 481 NNTYTGTTTVNGGG 494
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
952-1362 |
7.66e-03 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 41.47 E-value: 7.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 952 VSAGDQATGAIWAGAQTTDGTWVADKASAGSWTGAENQISAGSWVVSGNQAIAGPWAVSQVTDGSWPAVQASGVSWVVDQ 1031
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1032 ATGTWTVAENQTGAVSWAGAGNIVSIGYWTGAVDQTNAVSWTGTTDQVGVEVKPRFEDQASEKGSWVVAGVQTSGETRLG 1111
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1112 SEDQSSGRSWTETVDQANAASRLGTVDQAGGTSW--AGTGDQVGGVSTSGSADQSSSGSWAGTRNLAGERSWTGTGDQSD 1189
Cdd:COG3468 161 TGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGagGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1190 GA--AKPGFENQTSDEGSWAGTIGQPSGGSKSVSEAQSAGRSWADSADQLSGGFLVGPLDQANGESQPVSGELAASGVDQ 1267
Cdd:COG3468 241 GGgsAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGS 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720434366 1268 TSGGGCWTGSGDQSGGESR-----------------LGPRDQSNGESWPGTGDQTGGWYCTYTGTQTIGGGPWVGVSGQD 1330
Cdd:COG3468 321 NAGGGSGGGGGGGGGGGGGgttlngagsagggtgaaLAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGT 400
|
410 420 430
....*....|....*....|....*....|..
gi 1720434366 1331 VGGPKPVHMNQATGGAWLGTGTQVSAVSWTGD 1362
Cdd:COG3468 401 GNNGGGGVGGGGGGGLTLTGGTLTVNGNYTGN 432
|
|
|