NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|578809903|ref|XP_006714531|]
View 

nipped-B-like protein isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1197-2407 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


:

Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1438.63  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1197 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1276
Cdd:cd23958     3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1277 HLQNTLYPQYDPVYRLDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1356
Cdd:cd23958    79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1357 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDmdgepmYIQMVTALVLQLI 1432
Cdd:cd23958   156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1433 QCVVHLPSSEKDSN-----AEEDSNKKIDQDVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1505
Cdd:cd23958   230 QSSVKLPNLEKESSrdkslEEDSDELLEDEESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1506 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1585
Cdd:cd23958   310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1586 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKsqkdeessegthhAKEIETTGQIMHRAENRKKFLrsii 1665
Cdd:cd23958   368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEK-------------AAEEEDTILKLELSELRKKFL---- 430
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1666 kttPSQFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1745
Cdd:cd23958   431 ---DSKILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1746 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1825
Cdd:cd23958   508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1826 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1899
Cdd:cd23958   588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1900 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 1978
Cdd:cd23958   667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1979 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2058
Cdd:cd23958   742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2059 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2132
Cdd:cd23958   822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2133 AFIQHPSLMFEQEVKNLYNNILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2207
Cdd:cd23958   899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2208 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2287
Cdd:cd23958   977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2288 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTDVTM 2361
Cdd:cd23958  1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                        1210      1220      1230      1240
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 2362 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2407
Cdd:cd23958  1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
PspC_subgroup_2 super family cl41463
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832 1.60e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


The actual alignment was detected with superfamily member NF033839:

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 82.51  E-value: 1.60e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  597 PETPKKKSDPELS--KSEMKQS-ESRLAESKPN-ENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNEstiVEPK 672
Cdd:NF033839  281 QDTPKEPGNKKPSapKPGMQPSpQPEKKEVKPEpETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPE---VKPQ 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  673 qnenrLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNE 752
Cdd:NF033839  358 -----PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVK 432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  753 GRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPR------LKSERAEALKQRPDGR--SVSESLRRDhdnKQKS 824
Cdd:NF033839  433 PQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKpevkpqPEKPKPDNSKPQADDKkpSTPNNLSKD---KQPS 509

                  ....*...
gi 578809903  825 DDRGESER 832
Cdd:NF033839  510 NQASTNEK 517
PTZ00121 super family cl31754
MAEBL; Provisional
476-1098 1.85e-12

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 73.64  E-value: 1.85e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGRVDSQA 555
Cdd:PTZ00121 1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  556 SITQ-DSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENhpetPKKKSDpELSKSEMKQSESRLAESKPNENRlvetK 634
Cdd:PTZ00121 1362 AEEKaEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE----DKKKAD-ELKKAAAAKKKADEAKKKAEEKK----K 1432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  635 SSENKLETKVETQTEELKQnesRTTECKQNEStiVEPKQNENRLSDtKPNDNKQNNGRSETTKSRPETPKQKGES--RPE 712
Cdd:PTZ00121 1433 ADEAKKKAEEAKKADEAKK---KAEEAKKAEE--AKKKAEEAKKAD-EAKKKAEEAKKADEAKKKAEEAKKKADEakKAA 1506
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  713 TPKQKSDghpETPKQKgdgrpETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDS---GKPSTEKKPEVSKHKQDTKSD 789
Cdd:PTZ00121 1507 EAKKKAD---EAKKAE-----EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAeelKKAEEKKKAEEAKKAEEDKNM 1578
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  790 SPRlkseRAEALKQRPDGRSV-------------SESLRRDHDNKQKSDD--RGESERHRGDQSR------VRRPETLRS 848
Cdd:PTZ00121 1579 ALR----KAEEAKKAEEARIEevmklyeeekkmkAEEAKKAEEAKIKAEElkKAEEEKKKVEQLKkkeaeeKKKAEELKK 1654
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  849 SSRN-------------EHGIKSDSSKTDKLERKHRHESGDSRE---------RPSSGEQKSRPDSPRvKQGDSNKSRSD 906
Cdd:PTZ00121 1655 AEEEnkikaaeeakkaeEDKKKAEEAKKAEEDEKKAAEALKKEAeeakkaeelKKKEAEEKKKAEELK-KAEEENKIKAE 1733
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  907 KLGFKSPTSK----DDKRTEGNKSKVDTNKAHPDNKAEfpsyllggRSGALKNFVIPKIKRDKDgnvtqETKKMEMKGEP 982
Cdd:PTZ00121 1734 EAKKEAEEDKkkaeEAKKDEEEKKKIAHLKKEEEKKAE--------EIRKEKEAVIEEELDEED-----EKRRMEVDKKI 1800
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  983 KDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIK-------NKPSKSNKGSIDQSVLKELPPEL 1055
Cdd:PTZ00121 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEkhkfnknNENGEDGNKEADFNKEKDLKEDD 1880
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 1056 LAEIESTMplcERVKMNK----RKRSTVNEKPKYAEISSDEdNDSDE 1098
Cdd:PTZ00121 1881 EEEIEEAD---EIEKIDKddieREIPNNNMAGKNNDIIDDK-LDKDE 1923
 
Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1197-2407 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1438.63  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1197 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1276
Cdd:cd23958     3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1277 HLQNTLYPQYDPVYRLDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1356
Cdd:cd23958    79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1357 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDmdgepmYIQMVTALVLQLI 1432
Cdd:cd23958   156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1433 QCVVHLPSSEKDSN-----AEEDSNKKIDQDVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1505
Cdd:cd23958   230 QSSVKLPNLEKESSrdkslEEDSDELLEDEESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1506 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1585
Cdd:cd23958   310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1586 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKsqkdeessegthhAKEIETTGQIMHRAENRKKFLrsii 1665
Cdd:cd23958   368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEK-------------AAEEEDTILKLELSELRKKFL---- 430
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1666 kttPSQFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1745
Cdd:cd23958   431 ---DSKILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1746 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1825
Cdd:cd23958   508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1826 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1899
Cdd:cd23958   588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1900 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 1978
Cdd:cd23958   667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1979 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2058
Cdd:cd23958   742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2059 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2132
Cdd:cd23958   822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2133 AFIQHPSLMFEQEVKNLYNNILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2207
Cdd:cd23958   899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2208 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2287
Cdd:cd23958   977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2288 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTDVTM 2361
Cdd:cd23958  1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                        1210      1220      1230      1240
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 2362 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2407
Cdd:cd23958  1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
Nipped-B_C pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2209-2390 1.18e-69

Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.


Pssm-ID: 463722  Cd Length: 180  Bit Score: 232.04  E-value: 1.18e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  2209 SSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIH 2288
Cdd:pfam12830    1 CSALVQRYLKHILEICLSSDDQVRLLALEVLALILRQGLVHPKECIPTLIALETSPNPYIRKLAFELHKELHEKHESLLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  2289 MKAVAGMKMSYQVQQAINTClkdpvrgfRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTDVTML 2362
Cdd:pfam12830   81 SRYMEGIRLAFEYQRRVLSG--------ATLEPPTSFLSLLYSLLRSNKKSRKKFLKSLVKLFFDldlsseSSPSDLDFL 152
                          170       180
                   ....*....|....*....|....*...
gi 578809903  2363 LYIADNLACFPYQTQEEPLFIMHHIDIT 2390
Cdd:pfam12830  153 RFLAENLAFLPYQTQDEVLFLIHHIDRI 180
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832 1.60e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 82.51  E-value: 1.60e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  597 PETPKKKSDPELS--KSEMKQS-ESRLAESKPN-ENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNEstiVEPK 672
Cdd:NF033839  281 QDTPKEPGNKKPSapKPGMQPSpQPEKKEVKPEpETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPE---VKPQ 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  673 qnenrLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNE 752
Cdd:NF033839  358 -----PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVK 432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  753 GRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPR------LKSERAEALKQRPDGR--SVSESLRRDhdnKQKS 824
Cdd:NF033839  433 PQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKpevkpqPEKPKPDNSKPQADDKkpSTPNNLSKD---KQPS 509

                  ....*...
gi 578809903  825 DDRGESER 832
Cdd:NF033839  510 NQASTNEK 517
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
590-790 1.95e-13

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 75.96  E-value: 1.95e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  590 KSTPENHPETPKKKSDPELSKsEMKQSESRLAESKPNENRLVETKSSENKLETkvETQTEELK-QNESRTTECKQNESTi 668
Cdd:NF033839  329 KPEVKPQPEKPKPEVKPQLET-PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQP--ETPKPEVKpQPEKPKPEVKPQPEK- 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  669 vepkqnenrlsdTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPK 748
Cdd:NF033839  405 ------------PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPK 472
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578809903  749 QKNEGRPETPKhrHDNRR---DSGKPSTEKkpEVSKHKQDTKSDS 790
Cdd:NF033839  473 PEVKPQPEKPK--PDNSKpqaDDKKPSTPN--NLSKDKQPSNQAS 513
PTZ00121 PTZ00121
MAEBL; Provisional
476-1098 1.85e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 73.64  E-value: 1.85e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGRVDSQA 555
Cdd:PTZ00121 1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  556 SITQ-DSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENhpetPKKKSDpELSKSEMKQSESRLAESKPNENRlvetK 634
Cdd:PTZ00121 1362 AEEKaEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE----DKKKAD-ELKKAAAAKKKADEAKKKAEEKK----K 1432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  635 SSENKLETKVETQTEELKQnesRTTECKQNEStiVEPKQNENRLSDtKPNDNKQNNGRSETTKSRPETPKQKGES--RPE 712
Cdd:PTZ00121 1433 ADEAKKKAEEAKKADEAKK---KAEEAKKAEE--AKKKAEEAKKAD-EAKKKAEEAKKADEAKKKAEEAKKKADEakKAA 1506
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  713 TPKQKSDghpETPKQKgdgrpETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDS---GKPSTEKKPEVSKHKQDTKSD 789
Cdd:PTZ00121 1507 EAKKKAD---EAKKAE-----EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAeelKKAEEKKKAEEAKKAEEDKNM 1578
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  790 SPRlkseRAEALKQRPDGRSV-------------SESLRRDHDNKQKSDD--RGESERHRGDQSR------VRRPETLRS 848
Cdd:PTZ00121 1579 ALR----KAEEAKKAEEARIEevmklyeeekkmkAEEAKKAEEAKIKAEElkKAEEEKKKVEQLKkkeaeeKKKAEELKK 1654
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  849 SSRN-------------EHGIKSDSSKTDKLERKHRHESGDSRE---------RPSSGEQKSRPDSPRvKQGDSNKSRSD 906
Cdd:PTZ00121 1655 AEEEnkikaaeeakkaeEDKKKAEEAKKAEEDEKKAAEALKKEAeeakkaeelKKKEAEEKKKAEELK-KAEEENKIKAE 1733
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  907 KLGFKSPTSK----DDKRTEGNKSKVDTNKAHPDNKAEfpsyllggRSGALKNFVIPKIKRDKDgnvtqETKKMEMKGEP 982
Cdd:PTZ00121 1734 EAKKEAEEDKkkaeEAKKDEEEKKKIAHLKKEEEKKAE--------EIRKEKEAVIEEELDEED-----EKRRMEVDKKI 1800
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  983 KDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIK-------NKPSKSNKGSIDQSVLKELPPEL 1055
Cdd:PTZ00121 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEkhkfnknNENGEDGNKEADFNKEKDLKEDD 1880
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 1056 LAEIESTMplcERVKMNK----RKRSTVNEKPKYAEISSDEdNDSDE 1098
Cdd:PTZ00121 1881 EEEIEEAD---EIEKIDKddieREIPNNNMAGKNNDIIDDK-LDKDE 1923
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
591-943 1.20e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 67.10  E-value: 1.20e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  591 STPEN----HPETPKKKSDPELSKSEMKQSESRLAESKPNENRLVETKSSENKLE-TKVETQTEELKQNESRTTECKQ-- 663
Cdd:NF033839  162 PQPENpehqKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKILDDiQKHHLQKEKHRQIVALIKELDElk 241
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  664 --------NESTIVEPKQNENRL-SDTKPNDNKQNNGRSETT-----KSRPETPKQKGESRPETPKQKSDGHPETPKQKG 729
Cdd:NF033839  242 kqalseidNVNTKVEIENTVHKIfADMDAVVTKFKKGLTQDTpkepgNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEV 321
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  730 DGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKPstEKKPEVSKHKQDTKSDSPRLKSE-RAEALKQRPDGR 808
Cdd:NF033839  322 KPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKP--EVKPQPEKPKPEVKPQPETPKPEvKPQPEKPKPEVK 399
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  809 SVSEslrrdhdnKQKSDDRGESERHRGD---QSRVRRPETLRSSSRNEHGIKSDsSKTDKLERKHRHESGDSRERPSSge 885
Cdd:NF033839  400 PQPE--------KPKPEVKPQPEKPKPEvkpQPEKPKPEVKPQPEKPKPEVKPQ-PEKPKPEVKPQPETPKPEVKPQP-- 468
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578809903  886 QKSRPD---SPRVKQGDSNKSRSDKlgfKSPTSKDDKRTEGNKSK--VDTNKAHPDNKAEFPS 943
Cdd:NF033839  469 EKPKPEvkpQPEKPKPDNSKPQADD---KKPSTPNNLSKDKQPSNqaSTNEKATNKPKKSLPS 528
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
591-778 1.18e-07

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 56.24  E-value: 1.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   591 STPENHPETPKKKSDPELSKSEMK---QSESRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTEckqnest 667
Cdd:TIGR02223   51 SKQANEPETLQPKNQTENGETAADlppKPEERWSYIEELEAREVLINDPEEPSNGGGVEESAQLTAEQRQLLE------- 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   668 ivePKQNENRLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKsdghPETPKQKGDGRPETPKQKGESRPETP 747
Cdd:TIGR02223  124 ---QMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTP----VETEKIASKVKEAKQKQKALPKQTAE 196
                          170       180       190
                   ....*....|....*....|....*....|.
gi 578809903   748 KQKNEGRPETPkhRHDNRRDSGKPSTEKKPE 778
Cdd:TIGR02223  197 TQSNSKPIETA--PKADKADKTKPKPKEKAE 225
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
476-894 4.58e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.85  E-value: 4.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  476 EIERIERESAIERERFSKEVQDK-DKPLKKRKQDSYPQEAGG---ATGGNRPASQETGSTGNGSRPALmvsidlhqagrv 551
Cdd:PTZ00449  484 EIKKLIKKSKKKLAPIEEEDSDKhDEPPEGPEASGLPPKAPGdkeGEEGEHEDSKESDEPKEGGKPGE------------ 551
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  552 dsqasiTQDSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMKQSESRLAESK--PNENR 629
Cdd:PTZ00449  552 ------TKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLdiPKSPK 625
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  630 LVETKSSENKLETKVETQTEELKQNESRTTECKQNESTIV--EPKQNEN----------RLSDTKPNDNKQNNGRSETTK 697
Cdd:PTZ00449  626 RPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPpfDPKFKEKfyddyldaaaKSKETKTTVVLDESFESILKE 705
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  698 SRPETPKQKGES-RPETPKQKSDghPETPKQKgDGRPETPKQKGESRPETPKQKNEGRPETPKHRH------------DN 764
Cdd:PTZ00449  706 TLPETPGTPFTTpRPLPPKLPRD--EEFPFEP-IGDPDAEQPDDIEFFTPPEEERTFFHETPADTPlpdilaeefkeeDI 782
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  765 RRDSGKP-STEKKPEV-SKHKQDTKSDSPRL--KSERAEALK-QRPDGRSVSESLRRDHDNKQKSDDRGESerhRGDQSR 839
Cdd:PTZ00449  783 HAETGEPdEAMKRPDSpSEHEDKPPGDHPSLpkKRHRLDGLAlSTTDLESDAGRIAKDASGKIVKLKRSKS---FDDLTT 859
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578809903  840 VRRPETLRSSSR----NEHGIKSDSSKTDKLERKHRHESGDSR--ERPSSGEQKSRPDSPR 894
Cdd:PTZ00449  860 VEEAEEMGAEARkivvDDDGTEADDEDTHPPEEKHKSEVRRRRppKKPSKPKKPSKPKKPK 920
Caldesmon pfam02029
Caldesmon;
480-890 2.08e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 46.78  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   480 IERESAIERERFSKEVQDKdkplkkRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGrvdsqasitq 559
Cdd:pfam02029    1 IEDEEEAARERRRRAREER------RRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAF---------- 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   560 dSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMK-QSESRLAESKPNENRLVETKSSEN 638
Cdd:pfam02029   65 -LDRTAKREERRQKRLQEALERQKEFDPTIADEKESVAERKENNEEEENSSWEKEeKRDSRLGRYKEEETEIREKEYQEN 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   639 KLET--------------KVETQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPNdnkqnngrsettksrpeTPK 704
Cdd:pfam02029  144 KWSTevrqaeeegeeeedKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRG-----------------HPE 206
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   705 QKGESRPEtpKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRpetpkhrhdnRRDSGKPSTEKKPevSKHKQ 784
Cdd:pfam02029  207 VKSQNGEE--EVTKLKVTTKRRQGGLSQSQEREEEAEVFLEAEQKLEELR----------RRRQEKESEEFEK--LRQKQ 272
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   785 dtksdsPRLKSERAEALKQRPDGRSV-SESLRRDHDNKQKSDDRGESERHRGDQSRVRR-----------PETlrSSSRN 852
Cdd:pfam02029  273 ------QEAELELEELKKKREERRKLlEEEEQRRKQEEAERKLREEEEKRRMKEEIERRraeaaekrqklPED--SSSEG 344
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 578809903   853 EHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRP 890
Cdd:pfam02029  345 KKPFKCFSPKGSSLKITERAEFLNKSLQKSSSVKKTHP 382
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
475-759 1.42e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.23  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  475 AEIERIERESAIERERFSKEV--QDKDKPlKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPAlmvsidlhqagrvD 552
Cdd:NF033838  233 AEEEAKRRADAKLKEAVEKNVatSEQDKP-KRRAKRGVLGEPATPDKKENDAKSSDSSVGEETLPS-------------P 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  553 SQASITQDSDSIKKPEEI-KQCNDapvsvlQEDivgslkSTPENHPETPKKKSDPELSKSEMKQSESRLaeskpnenRLV 631
Cdd:NF033838  299 SLKPEKKVAEAEKKVEEAkKKAKD------QKE------EDRRNYPTNTYKTLELEIAESDVKVKEAEL--------ELV 358
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  632 ETKSSENKLETKVETQTEELkqnesrttECKQNESTIVEPKQNENRlsdTKPNDNKQNNGRSETTKSRP-ETPKQKGESR 710
Cdd:NF033838  359 KEEAKEPRNEEKIKQAKAKV--------ESKKAEATRLEKIKTDRK---KAEEEAKRKAAEEDKVKEKPaEQPQPAPAPQ 427
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  711 PETPKQKSDGHPETPK-QKGDGR----------PETPKQKGESRPetPKQKNEGRPETPK 759
Cdd:NF033838  428 PEKPAPKPEKPAEQPKaEKPADQqaeedyarrsEEEYNRLTQQQP--PKTEKPAQPSTPK 485
 
Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1197-2407 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1438.63  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1197 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1276
Cdd:cd23958     3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1277 HLQNTLYPQYDPVYRLDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1356
Cdd:cd23958    79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1357 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDmdgepmYIQMVTALVLQLI 1432
Cdd:cd23958   156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1433 QCVVHLPSSEKDSN-----AEEDSNKKIDQDVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1505
Cdd:cd23958   230 QSSVKLPNLEKESSrdkslEEDSDELLEDEESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1506 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1585
Cdd:cd23958   310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1586 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKsqkdeessegthhAKEIETTGQIMHRAENRKKFLrsii 1665
Cdd:cd23958   368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEK-------------AAEEEDTILKLELSELRKKFL---- 430
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1666 kttPSQFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1745
Cdd:cd23958   431 ---DSKILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1746 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1825
Cdd:cd23958   508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1826 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1899
Cdd:cd23958   588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1900 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 1978
Cdd:cd23958   667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1979 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2058
Cdd:cd23958   742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2059 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2132
Cdd:cd23958   822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2133 AFIQHPSLMFEQEVKNLYNNILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2207
Cdd:cd23958   899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2208 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2287
Cdd:cd23958   977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 2288 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTDVTM 2361
Cdd:cd23958  1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                        1210      1220      1230      1240
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 2362 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2407
Cdd:cd23958  1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
Nipped-B_C pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2209-2390 1.18e-69

Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.


Pssm-ID: 463722  Cd Length: 180  Bit Score: 232.04  E-value: 1.18e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  2209 SSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIH 2288
Cdd:pfam12830    1 CSALVQRYLKHILEICLSSDDQVRLLALEVLALILRQGLVHPKECIPTLIALETSPNPYIRKLAFELHKELHEKHESLLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  2289 MKAVAGMKMSYQVQQAINTClkdpvrgfRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTDVTML 2362
Cdd:pfam12830   81 SRYMEGIRLAFEYQRRVLSG--------ATLEPPTSFLSLLYSLLRSNKKSRKKFLKSLVKLFFDldlsseSSPSDLDFL 152
                          170       180
                   ....*....|....*....|....*...
gi 578809903  2363 LYIADNLACFPYQTQEEPLFIMHHIDIT 2390
Cdd:pfam12830  153 RFLAENLAFLPYQTQDEVLFLIHHIDRI 180
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-832 1.60e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 82.51  E-value: 1.60e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  597 PETPKKKSDPELS--KSEMKQS-ESRLAESKPN-ENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNEstiVEPK 672
Cdd:NF033839  281 QDTPKEPGNKKPSapKPGMQPSpQPEKKEVKPEpETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPE---VKPQ 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  673 qnenrLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNE 752
Cdd:NF033839  358 -----PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVK 432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  753 GRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPR------LKSERAEALKQRPDGR--SVSESLRRDhdnKQKS 824
Cdd:NF033839  433 PQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKpevkpqPEKPKPDNSKPQADDKkpSTPNNLSKD---KQPS 509

                  ....*...
gi 578809903  825 DDRGESER 832
Cdd:NF033839  510 NQASTNEK 517
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
590-790 1.95e-13

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 75.96  E-value: 1.95e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  590 KSTPENHPETPKKKSDPELSKsEMKQSESRLAESKPNENRLVETKSSENKLETkvETQTEELK-QNESRTTECKQNESTi 668
Cdd:NF033839  329 KPEVKPQPEKPKPEVKPQLET-PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQP--ETPKPEVKpQPEKPKPEVKPQPEK- 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  669 vepkqnenrlsdTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPK 748
Cdd:NF033839  405 ------------PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPK 472
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 578809903  749 QKNEGRPETPKhrHDNRR---DSGKPSTEKkpEVSKHKQDTKSDS 790
Cdd:NF033839  473 PEVKPQPEKPK--PDNSKpqaDDKKPSTPN--NLSKDKQPSNQAS 513
PTZ00121 PTZ00121
MAEBL; Provisional
476-1098 1.85e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 73.64  E-value: 1.85e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGRVDSQA 555
Cdd:PTZ00121 1282 ELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA 1361
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  556 SITQ-DSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENhpetPKKKSDpELSKSEMKQSESRLAESKPNENRlvetK 634
Cdd:PTZ00121 1362 AEEKaEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEE----DKKKAD-ELKKAAAAKKKADEAKKKAEEKK----K 1432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  635 SSENKLETKVETQTEELKQnesRTTECKQNEStiVEPKQNENRLSDtKPNDNKQNNGRSETTKSRPETPKQKGES--RPE 712
Cdd:PTZ00121 1433 ADEAKKKAEEAKKADEAKK---KAEEAKKAEE--AKKKAEEAKKAD-EAKKKAEEAKKADEAKKKAEEAKKKADEakKAA 1506
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  713 TPKQKSDghpETPKQKgdgrpETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDS---GKPSTEKKPEVSKHKQDTKSD 789
Cdd:PTZ00121 1507 EAKKKAD---EAKKAE-----EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAeelKKAEEKKKAEEAKKAEEDKNM 1578
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  790 SPRlkseRAEALKQRPDGRSV-------------SESLRRDHDNKQKSDD--RGESERHRGDQSR------VRRPETLRS 848
Cdd:PTZ00121 1579 ALR----KAEEAKKAEEARIEevmklyeeekkmkAEEAKKAEEAKIKAEElkKAEEEKKKVEQLKkkeaeeKKKAEELKK 1654
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  849 SSRN-------------EHGIKSDSSKTDKLERKHRHESGDSRE---------RPSSGEQKSRPDSPRvKQGDSNKSRSD 906
Cdd:PTZ00121 1655 AEEEnkikaaeeakkaeEDKKKAEEAKKAEEDEKKAAEALKKEAeeakkaeelKKKEAEEKKKAEELK-KAEEENKIKAE 1733
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  907 KLGFKSPTSK----DDKRTEGNKSKVDTNKAHPDNKAEfpsyllggRSGALKNFVIPKIKRDKDgnvtqETKKMEMKGEP 982
Cdd:PTZ00121 1734 EAKKEAEEDKkkaeEAKKDEEEKKKIAHLKKEEEKKAE--------EIRKEKEAVIEEELDEED-----EKRRMEVDKKI 1800
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  983 KDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIK-------NKPSKSNKGSIDQSVLKELPPEL 1055
Cdd:PTZ00121 1801 KDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEkhkfnknNENGEDGNKEADFNKEKDLKEDD 1880
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903 1056 LAEIESTMplcERVKMNK----RKRSTVNEKPKYAEISSDEdNDSDE 1098
Cdd:PTZ00121 1881 EEEIEEAD---EIEKIDKddieREIPNNNMAGKNNDIIDDK-LDKDE 1923
PTZ00121 PTZ00121
MAEBL; Provisional
470-1126 7.52e-12

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 71.71  E-value: 7.52e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  470 ELDALAEIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSID--LHQ 547
Cdd:PTZ00121 1095 EAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEArkAED 1174
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  548 AGRVDS--QASITQDSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMKQSE---SRLAE 622
Cdd:PTZ00121 1175 AKKAEAarKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEeerNNEEI 1254
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  623 SKPNENRLVETKSSENKLETKVETQTEELKQNES--RTTECKQNESTI----VEPKQNENRLSDTKPNDNKQNNGRSETT 696
Cdd:PTZ00121 1255 RKFEEARMAHFARRQAAIKAEEARKADELKKAEEkkKADEAKKAEEKKkadeAKKKAEEAKKADEAKKKAEEAKKKADAA 1334
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  697 KSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKgeSRPETPKQKNEGRPETPKHRHDNRRDSGKPSTEKK 776
Cdd:PTZ00121 1335 KKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAK--KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  777 PEVSKhkqdTKSDSPRLKSE---RAEALKQRPDGRSVSESLRRDHDNKQKSDD--RGESERHRGDQSRVRRPETLRSssr 851
Cdd:PTZ00121 1413 AAAAK----KKADEAKKKAEekkKADEAKKKAEEAKKADEAKKKAEEAKKAEEakKKAEEAKKADEAKKKAEEAKKA--- 1485
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  852 NEHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRPDSPRvKQGDSNKSRSDKLGFKSPTSKDDKRTEGNKSKVDTN 931
Cdd:PTZ00121 1486 DEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK 1564
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  932 KAHPDNKAEFPSYLLGGRSGALKNfvIPKIKRDKDGNVTQETKKMEMKGEPKDKVEKIGlVEDLNKGA---KPVVVLQKL 1008
Cdd:PTZ00121 1565 KAEEAKKAEEDKNMALRKAEEAKK--AEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIK-AEELKKAEeekKKVEQLKKK 1641
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1009 SLDDVQKLIKDREDKSRSSLKPIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVNEKPKYAEI 1088
Cdd:PTZ00121 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL 1721
                         650       660       670
                  ....*....|....*....|....*....|....*...
gi 578809903 1089 SSDEDNDSDEAFEFARKMKKkekqKKRKAYEPKLTPEE 1126
Cdd:PTZ00121 1722 KKAEEENKIKAEEAKKEAEE----DKKKAEEAKKDEEE 1755
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
591-943 1.20e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 67.10  E-value: 1.20e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  591 STPEN----HPETPKKKSDPELSKSEMKQSESRLAESKPNENRLVETKSSENKLE-TKVETQTEELKQNESRTTECKQ-- 663
Cdd:NF033839  162 PQPENpehqKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKILDDiQKHHLQKEKHRQIVALIKELDElk 241
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  664 --------NESTIVEPKQNENRL-SDTKPNDNKQNNGRSETT-----KSRPETPKQKGESRPETPKQKSDGHPETPKQKG 729
Cdd:NF033839  242 kqalseidNVNTKVEIENTVHKIfADMDAVVTKFKKGLTQDTpkepgNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEV 321
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  730 DGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKPstEKKPEVSKHKQDTKSDSPRLKSE-RAEALKQRPDGR 808
Cdd:NF033839  322 KPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKP--EVKPQPEKPKPEVKPQPETPKPEvKPQPEKPKPEVK 399
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  809 SVSEslrrdhdnKQKSDDRGESERHRGD---QSRVRRPETLRSSSRNEHGIKSDsSKTDKLERKHRHESGDSRERPSSge 885
Cdd:NF033839  400 PQPE--------KPKPEVKPQPEKPKPEvkpQPEKPKPEVKPQPEKPKPEVKPQ-PEKPKPEVKPQPETPKPEVKPQP-- 468
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 578809903  886 QKSRPD---SPRVKQGDSNKSRSDKlgfKSPTSKDDKRTEGNKSK--VDTNKAHPDNKAEFPS 943
Cdd:NF033839  469 EKPKPEvkpQPEKPKPDNSKPQADD---KKPSTPNNLSKDKQPSNqaSTNEKATNKPKKSLPS 528
PTZ00121 PTZ00121
MAEBL; Provisional
607-1227 5.74e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 5.74e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  607 ELSKSE--MKQSESRLAEskpNENRLVETKSSENKLETKVETQTEELKQN--ESRTTECKQNESTIVEPKQNENRLSDTK 682
Cdd:PTZ00121 1192 ELRKAEdaRKAEAARKAE---EERKAEEARKAEDAKKAEAVKKAEEAKKDaeEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  683 PNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDghpETPKQKGDGR-PETPKQKGES---RPETPKQKNE---GRP 755
Cdd:PTZ00121 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKAD---EAKKKAEEAKkADEAKKKAEEakkKADAAKKKAEeakKAA 1345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  756 ETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERHRG 835
Cdd:PTZ00121 1346 EAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKK 1425
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  836 DQSRVRRPETLRSSSrnEHGIKSDSSKTdKLERKHRHESGDSRERPSSGEQKSRPDSPRVKQGDSNKSRSDKLGFKSPTS 915
Cdd:PTZ00121 1426 KAEEKKKADEAKKKA--EEAKKADEAKK-KAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  916 KddKRTEGNKSKVDTNKAHPDNKAEFPSYLLGGRSGAlknfvipKIKRDKDGNVTQETKKME--MKGEPKDKVE-KIGLV 992
Cdd:PTZ00121 1503 K--KAAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD-------EAKKAEEKKKADELKKAEelKKAEEKKKAEeAKKAE 1573
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  993 EDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIKN-----------KPSKSNKGSIDQSVLKELPPELLAEIES 1061
Cdd:PTZ00121 1574 EDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKaeeakikaeelKKAEEEKKKVEQLKKKEAEEKKKAEELK 1653
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1062 TMPLCERVKMNKRKRSTVNEKPKYAEISSDEDND--SDEAFEFARKMKKKEKQKKRKAYEPKLTPEEMMDSSTFKRFTAs 1139
Cdd:PTZ00121 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEkkAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKA- 1732
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1140 iENILDNLEDMDFTA------FGDDDEIPQELLLGKHQLNELGSESAKIKAMGIMDK-----LSTDKTVKVLNILEKNIQ 1208
Cdd:PTZ00121 1733 -EEAKKEAEEDKKKAeeakkdEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEdekrrMEVDKKIKDIFDNFANII 1811
                         650       660
                  ....*....|....*....|
gi 578809903 1209 DGSKLSTL-LNHNNDTEEEE 1227
Cdd:PTZ00121 1812 EGGKEGNLvINDSKEMEDSA 1831
Cohesin_HEAT pfam12765
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most ...
1728-1769 2.93e-09

HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most frequently in sister chromatid cohesion proteins such as Nipped-B. HEAT repeats are found tandemly repeated in many proteins, and they appear to serve as flexible scaffolding on which other components can assemble.


Pssm-ID: 403845 [Multi-domain]  Cd Length: 42  Bit Score: 54.39  E-value: 2.93e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 578809903  1728 KCLSEVVAVDPSILARLDMQRGVHGRLMDNSTSVREAAVELL 1769
Cdd:pfam12765    1 KALSSLVEKDPSILDSPDVKEAISRRLTDSSPSVRDAALELL 42
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
591-778 1.18e-07

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 56.24  E-value: 1.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   591 STPENHPETPKKKSDPELSKSEMK---QSESRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTEckqnest 667
Cdd:TIGR02223   51 SKQANEPETLQPKNQTENGETAADlppKPEERWSYIEELEAREVLINDPEEPSNGGGVEESAQLTAEQRQLLE------- 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   668 ivePKQNENRLSDTKPNDNKQNNGRSETTKSRPETPKQKGESRPETPKQKsdghPETPKQKGDGRPETPKQKGESRPETP 747
Cdd:TIGR02223  124 ---QMQADMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTP----VETEKIASKVKEAKQKQKALPKQTAE 196
                          170       180       190
                   ....*....|....*....|....*....|.
gi 578809903   748 KQKNEGRPETPkhRHDNRRDSGKPSTEKKPE 778
Cdd:TIGR02223  197 TQSNSKPIETA--PKADKADKTKPKPKEKAE 225
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
476-894 4.58e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.85  E-value: 4.58e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  476 EIERIERESAIERERFSKEVQDK-DKPLKKRKQDSYPQEAGG---ATGGNRPASQETGSTGNGSRPALmvsidlhqagrv 551
Cdd:PTZ00449  484 EIKKLIKKSKKKLAPIEEEDSDKhDEPPEGPEASGLPPKAPGdkeGEEGEHEDSKESDEPKEGGKPGE------------ 551
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  552 dsqasiTQDSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMKQSESRLAESK--PNENR 629
Cdd:PTZ00449  552 ------TKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLdiPKSPK 625
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  630 LVETKSSENKLETKVETQTEELKQNESRTTECKQNESTIV--EPKQNEN----------RLSDTKPNDNKQNNGRSETTK 697
Cdd:PTZ00449  626 RPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPpfDPKFKEKfyddyldaaaKSKETKTTVVLDESFESILKE 705
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  698 SRPETPKQKGES-RPETPKQKSDghPETPKQKgDGRPETPKQKGESRPETPKQKNEGRPETPKHRH------------DN 764
Cdd:PTZ00449  706 TLPETPGTPFTTpRPLPPKLPRD--EEFPFEP-IGDPDAEQPDDIEFFTPPEEERTFFHETPADTPlpdilaeefkeeDI 782
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  765 RRDSGKP-STEKKPEV-SKHKQDTKSDSPRL--KSERAEALK-QRPDGRSVSESLRRDHDNKQKSDDRGESerhRGDQSR 839
Cdd:PTZ00449  783 HAETGEPdEAMKRPDSpSEHEDKPPGDHPSLpkKRHRLDGLAlSTTDLESDAGRIAKDASGKIVKLKRSKS---FDDLTT 859
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 578809903  840 VRRPETLRSSSR----NEHGIKSDSSKTDKLERKHRHESGDSR--ERPSSGEQKSRPDSPR 894
Cdd:PTZ00449  860 VEEAEEMGAEARkivvDDDGTEADDEDTHPPEEKHKSEVRRRRppKKPSKPKKPSKPKKPK 920
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
704-1098 8.71e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 54.70  E-value: 8.71e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  704 KQKGESRPETPKQKSDGHpetpkqkgDGRPETPKQKGESrPETPkqkneGRPETPKHRHDNRRDSGKPSTEKKPEVSKHK 783
Cdd:PTZ00449  490 KKSKKKLAPIEEEDSDKH--------DEPPEGPEASGLP-PKAP-----GDKEGEEGEHEDSKESDEPKEGGKPGETKEG 555
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  784 QDTKSDSP--RLKSERAEALKQRPDGRSVSESLRRDHDNKQksddrgeSERHRGDQSRVRRPETLRSSSrnehgikSDSS 861
Cdd:PTZ00449  556 EVGKKPGPakEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKK-------PKRPRSAQRPTRPKSPKLPEL-------LDIP 621
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  862 KTDKlerkhRHESGDSRERPSSGEQKS---RPDSPRVKQgDSNKSRSDKLGFkSPTSK----DDKRTEGNKSKVDTNKAH 934
Cdd:PTZ00449  622 KSPK-----RPESPKSPKRPPPPQRPSspeRPEGPKIIK-SPKPPKSPKPPF-DPKFKekfyDDYLDAAAKSKETKTTVV 694
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  935 PDNKAEF----PSYLLGGRSGALKNFVIPKIKRDKDGNVT-------QETKKMEMKGEPKDKVEKIGlvEDLNKGAKPVV 1003
Cdd:PTZ00449  695 LDESFESilkeTLPETPGTPFTTPRPLPPKLPRDEEFPFEpigdpdaEQPDDIEFFTPPEEERTFFH--ETPADTPLPDI 772
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1004 VLQKLSLDDVQKLIKDREDKSRSSLKPIKNKPSKSnkGSIDQSVLKELPPELLA----EIESTM------PLCERVKMNK 1073
Cdd:PTZ00449  773 LAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPP--GDHPSLPKKRHRLDGLAlsttDLESDAgriakdASGKIVKLKR 850
                         410       420       430
                  ....*....|....*....|....*....|....
gi 578809903 1074 RKR----STVNEKPKY-AEIS----SDEDNDSDE 1098
Cdd:PTZ00449  851 SKSfddlTTVEEAEEMgAEARkivvDDDGTEADD 884
PTZ00121 PTZ00121
MAEBL; Provisional
649-1125 2.47e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.61  E-value: 2.47e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  649 EELKQNESRTTECKQNEsTIVEPKQNENrlsDTKPNDNKQNNGRSETTKSRPETP-KQKGESRPETPKQKSDGHPETPKQ 727
Cdd:PTZ00121 1030 EELTEYGNNDDVLKEKD-IIDEDIDGNH---EGKAEAKAHVGQDEGLKPSYKDFDfDAKEDNRADEATEEAFGKAEEAKK 1105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  728 KGDGRPETPKQKGES--RPETPKQKNEGRP-------ETPKHRHDNRRDSGKPSTE--KKPEVSKHKQDTKSDSPRLKSE 796
Cdd:PTZ00121 1106 TETGKAEEARKAEEAkkKAEDARKAEEARKaedarkaEEARKAEDAKRVEIARKAEdaRKAEEARKAEDAKKAEAARKAE 1185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  797 ---RAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERH----------RGDQSRVRRPETLRSSSRNEHGIKSDSSKT 863
Cdd:PTZ00121 1186 evrKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKaeavkkaeeaKKDAEEAKKAEEERNNEEIRKFEEARMAHF 1265
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  864 DKLERKHRHESGDSRERPSSGEQKSRPDSPR----VKQGDSNKSRSDKlgfKSPTSKDDKRTEGNKSKVDTNKAHPDNKA 939
Cdd:PTZ00121 1266 ARRQAAIKAEEARKADELKKAEEKKKADEAKkaeeKKKADEAKKKAEE---AKKADEAKKKAEEAKKKADAAKKKAEEAK 1342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  940 EFPSYLLGGRSGALKNFVIPKIKRDKDGNVTQETKKMemKGEPKDKVEKIGLVEDLNKGAKPvvvlQKLSLDDVQKliKD 1019
Cdd:PTZ00121 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKK--ADAAKKKAEEKKKADEAKKKAEE----DKKKADELKK--AA 1414
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1020 REDKSRSSLKPIKNKPSKSNKgsidqsvLKELPPELLAEIESTMPLCERVKM-NKRKRSTVNEKPKYAEISSDEDNDSDE 1098
Cdd:PTZ00121 1415 AAKKKADEAKKKAEEKKKADE-------AKKKAEEAKKADEAKKKAEEAKKAeEAKKKAEEAKKADEAKKKAEEAKKADE 1487
                         490       500
                  ....*....|....*....|....*..
gi 578809903 1099 AFEFARKMKKKEKQKKRKAYEPKLTPE 1125
Cdd:PTZ00121 1488 AKKKAEEAKKKADEAKKAAEAKKKADE 1514
PRK12678 PRK12678
transcription termination factor Rho; Provisional
693-907 8.70e-06

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 51.44  E-value: 8.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  693 SETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKPS 772
Cdd:PRK12678   67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAG 146
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  773 TEKKPEVSKHKQDTKSDSPRLKSERaealkQRPDGRsvseslRRDHDNKQKSDDRGESERHRGDQSRVRRPETLRSSSRN 852
Cdd:PRK12678  147 EGGEQPATEARADAAERTEEEERDE-----RRRRGD------REDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRR 215
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 578809903  853 EHGiksdssktDKLERKHRHESGDSRERPSSGEQKSRPDSPRVKQGDSNKSRSDK 907
Cdd:PRK12678  216 EER--------GRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRR 262
PRK12678 PRK12678
transcription termination factor Rho; Provisional
692-889 2.14e-05

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 49.90  E-value: 2.14e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  692 RSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKP 771
Cdd:PRK12678   77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  772 STEKKPEVSKHKQDTKSD--SPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERHRGDQSRVRRPEtlRSS 849
Cdd:PRK12678  157 RADAAERTEEEERDERRRrgDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRR--RRD 234
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 578809903  850 SRNEHGiksDSSKTDKLERKHRHESGDSRERPSSGEQKSR 889
Cdd:PRK12678  235 RRDARG---DDNREDRGDRDGDDGEGRGGRRGRRFRDRDR 271
dnaA PRK14086
chromosomal replication initiator protein DnaA;
700-889 1.83e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.13  E-value: 1.83e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  700 PETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPK---QKGESRPETPKQKNEGR----PETPKHRHDNRRDSGKPS 772
Cdd:PRK14086   97 PPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPrqdQLPTARPAYPAYQQRPEpgawPRAADDYGWQQQRLGFPP 176
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  773 TEKKPEVSkhkqdtkSDSPRLKSERAEALKQRPDgrsvseslrRDHDNKQKSDDRGESERHRGDqsRVRRPETLRSSSRN 852
Cdd:PRK14086  177 RAPYASPA-------SYAPEQERDREPYDAGRPE---------YDQRRRDYDHPRPDWDRPRRD--RTDRPEPPPGAGHV 238
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 578809903  853 EHGIKSDSSKTDKLERKHR-----HESGDSRERPSSGEQKSR 889
Cdd:PRK14086  239 HRGGPGPPERDDAPVVPIRpsapgPLAAQPAPAPGPGEPTAR 280
Caldesmon pfam02029
Caldesmon;
480-890 2.08e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 46.78  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   480 IERESAIERERFSKEVQDKdkplkkRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSIDLHQAGrvdsqasitq 559
Cdd:pfam02029    1 IEDEEEAARERRRRAREER------RRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAF---------- 64
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   560 dSDSIKKPEEIKQCNDAPVSVLQEDIVGSLKSTPENHPETPKKKSDPELSKSEMK-QSESRLAESKPNENRLVETKSSEN 638
Cdd:pfam02029   65 -LDRTAKREERRQKRLQEALERQKEFDPTIADEKESVAERKENNEEEENSSWEKEeKRDSRLGRYKEEETEIREKEYQEN 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   639 KLET--------------KVETQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPNdnkqnngrsettksrpeTPK 704
Cdd:pfam02029  144 KWSTevrqaeeegeeeedKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRG-----------------HPE 206
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   705 QKGESRPEtpKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRpetpkhrhdnRRDSGKPSTEKKPevSKHKQ 784
Cdd:pfam02029  207 VKSQNGEE--EVTKLKVTTKRRQGGLSQSQEREEEAEVFLEAEQKLEELR----------RRRQEKESEEFEK--LRQKQ 272
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   785 dtksdsPRLKSERAEALKQRPDGRSV-SESLRRDHDNKQKSDDRGESERHRGDQSRVRR-----------PETlrSSSRN 852
Cdd:pfam02029  273 ------QEAELELEELKKKREERRKLlEEEEQRRKQEEAERKLREEEEKRRMKEEIERRraeaaekrqklPED--SSSEG 344
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 578809903   853 EHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRP 890
Cdd:pfam02029  345 KKPFKCFSPKGSSLKITERAEFLNKSLQKSSSVKKTHP 382
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
878-1102 3.96e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 46.19  E-value: 3.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  878 RERPSSGEQKSRPDSPRVKQGDSNKSRSDKLGFKSPTSKDDKRTEGNKSKVDTNKAHPDNK---AEFPSYLLGGRSGALK 954
Cdd:PTZ00108 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKrklDDKPDNKKSNSSGSDQ 1225
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  955 NFVIPKIKRDKDGNVTQETKKmEMKGEPKDKVEKIGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREDKSRSSLKPIKNK 1034
Cdd:PTZ00108 1226 EDDEEQKTKPKKSSVKRLKSK-KNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSS 1304
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 578809903 1035 PSKSNKGSIDQSVLKELPPELLAEIESTmplcERVKMNKR-------KRSTVNEKPKYAEISSDEDNDSDEAFEF 1102
Cdd:PTZ00108 1305 PTKKKVKKRLEGSLAALKKKKKSEKKTA----RKKKSKTRvkqasasQSSRLLRRPRKKKSDSSSEDDDDSEVDD 1375
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
590-838 3.96e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 46.19  E-value: 3.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  590 KSTPENHPETPKKKSDPELSKSEMKQSESrLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNESTIV 669
Cdd:PTZ00108 1156 QRLKSKTKGKASKLRKPKLKKKEKKKKKS-SADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTK 1234
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  670 EPKQNENRLSDTKPNDnKQNNGRSETTKSRPETPKQKGESRPEtPKQKSDGHPETPKQkgdgrPETPKQKGESRPETPKQ 749
Cdd:PTZ00108 1235 PKKSSVKRLKSKKNNS-SKSSEDNDEFSSDDLSKEGKPKNAPK-RVSAVQYSPPPPSK-----RPDGESNGGSKPSSPTK 1307
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  750 KNEGRPETPKhrhdnrrdSGKPSTEKKPEVSKHKQDtKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGE 829
Cdd:PTZ00108 1308 KKVKKRLEGS--------LAALKKKKKSEKKTARKK-KSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSED 1378

                  ....*....
gi 578809903  830 SERHRGDQS 838
Cdd:PTZ00108 1379 EDDEDDEDD 1387
PRK12678 PRK12678
transcription termination factor Rho; Provisional
692-899 4.07e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.05  E-value: 4.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  692 RSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRDSGKP 771
Cdd:PRK12678   88 RQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEE 167
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  772 STEKKPEVSKHKQDTKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKSDDRGESERHRGDQSRVRRPEtlRSSSR 851
Cdd:PRK12678  168 ERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRRRRDRRDARG--DDNRE 245
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 578809903  852 NEHGIKSDSSKTDKLERKHRHESGDSRERPSSGEQKSRPdsPRVKQGD 899
Cdd:PRK12678  246 DRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNERE--PELREDD 291
PDS5 cd19953
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. ...
1720-1816 5.21e-04

Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. Together with WapI and Scc3, it is involved in the release of the cohesin complex from chromosomes during S phase. The core of the cohesin complex consists of a coiled-coiled heterodimer of Smc1 and Smc30, together with Scc1 (also called kleisin). Pds5 interacts with Scc1 via a conserved patch on the surface of its heat repeats. Pds5 also promotes the acetylation of Smc3 that protects cohesin from releasing activity in G2 phase.


Pssm-ID: 410996 [Multi-domain]  Cd Length: 630  Bit Score: 45.59  E-value: 5.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903 1720 IAVRTKAMKCLSEVVAVDPS-ILARldmqrgVH--------GRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIER 1790
Cdd:cd19953   259 VDVRLLATKLLGKMFAEKGSaGFAQ------TYpslwkeflGRFNDKSPEVRLAWVESAKHILLNHPDLAEDILEALKKR 332
                          90       100
                  ....*....|....*....|....*.
gi 578809903 1791 ILDTGISVRKRVIKILRDICIEQPTF 1816
Cdd:cd19953   333 LLDPDEKVRLAAVKAICDLAYEDLLH 358
PDS5 pfam20168
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid ...
1719-1862 5.95e-04

Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid cohesion protein PDS5. The large PDS5 molecule is exclusively alpha helical, composed of a large number of HEAT-like repeats and helical extensions/additions that deviate from the HEAT repeat pattern.


Pssm-ID: 466319 [Multi-domain]  Cd Length: 1051  Bit Score: 45.66  E-value: 5.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  1719 AIAVRTKAMKCLSEVVAVDPSIlaRLDMQRGVHGRLMDNSTSVREAAVELLGRF-------VLCRPQLAEqyydmLIERI 1791
Cdd:pfam20168  297 SVAVRIAWVEAAKQILLNHPDL--RSEILEALKDRLLDPDEKVRLAAVKAIGDLdyetllhVVSEKLLKT-----LAERL 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  1792 LDTGISVRKRVIKILRDI-------CIEQPtfPKITEMCV----KMIR--RVNDEEgIKKLVNETFQKLWFtPTPHNDKE 1858
Cdd:pfam20168  370 RDKKPSVRKEALKTLAKLynvaygeIEEGD--EEAIEKFGwipnKILHlyYINDPE-IRALVERVLFEYLL-PALLDDEE 445

                   ....
gi 578809903  1859 AMTR 1862
Cdd:pfam20168  446 RVKR 449
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
475-759 1.42e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 44.23  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  475 AEIERIERESAIERERFSKEV--QDKDKPlKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPAlmvsidlhqagrvD 552
Cdd:NF033838  233 AEEEAKRRADAKLKEAVEKNVatSEQDKP-KRRAKRGVLGEPATPDKKENDAKSSDSSVGEETLPS-------------P 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  553 SQASITQDSDSIKKPEEI-KQCNDapvsvlQEDivgslkSTPENHPETPKKKSDPELSKSEMKQSESRLaeskpnenRLV 631
Cdd:NF033838  299 SLKPEKKVAEAEKKVEEAkKKAKD------QKE------EDRRNYPTNTYKTLELEIAESDVKVKEAEL--------ELV 358
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  632 ETKSSENKLETKVETQTEELkqnesrttECKQNESTIVEPKQNENRlsdTKPNDNKQNNGRSETTKSRP-ETPKQKGESR 710
Cdd:NF033838  359 KEEAKEPRNEEKIKQAKAKV--------ESKKAEATRLEKIKTDRK---KAEEEAKRKAAEEDKVKEKPaEQPQPAPAPQ 427
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  711 PETPKQKSDGHPETPK-QKGDGR----------PETPKQKGESRPetPKQKNEGRPETPK 759
Cdd:NF033838  428 PEKPAPKPEKPAEQPKaEKPADQqaeedyarrsEEEYNRLTQQQP--PKTEKPAQPSTPK 485
PRK08581 PRK08581
amidase domain-containing protein;
585-869 2.56e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 43.24  E-value: 2.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  585 IVGSLkSTPENHPETPKKKSDPELSKSEMKQSESRlAESKPNENRLVETKSSENKLETkvETQTEELKQNESRTTEckqn 664
Cdd:PRK08581   16 VLPTL-TSPTAYADDPQKDSTAKTTSHDSKKSNDD-ETSKDTSSKDTDKADNNNTSNQ--DNNDKKFSTIDSSTSD---- 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  665 ESTIVEPKQNENRLSDTKP--NDNKQNNGRSETTKSRPETPKQKGESRPETPKQKSDGHPETPKQKGDGRPETPKQKGES 742
Cdd:PRK08581   88 SNNIIDFIYKNLPQTNINQllTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSK 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  743 RPETPKQKNEGRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQDTKSDSPrlKSERAEALKQRPDGRSVSESlrRDHDNKQ 822
Cdd:PRK08581  168 QDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKS--SSKDNQSMSDSALDSILDQY--SEDAKKT 243
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 578809903  823 KSDDRGESERHRGDQSRVRRPEtLRSSSRNEHGIKSDSSKTDKLERK 869
Cdd:PRK08581  244 QKDYASQSKKDKTETSNTKNPQ-LPTQDELKHKSKPAQSFENDVNQS 289
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
610-919 2.68e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 43.50  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  610 KSEMKQSESRLAESKpneNRLVET--KSSENKLETKVEtQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPNDNK 687
Cdd:PTZ00108 1108 NAELEKKEKELEKLK---NTTPKDmwLEDLDKFEEALE-EQEEVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKK 1183
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  688 qnngrSETTKSRPETPKQkgesrpETPKQKSDGHPETPKQKGDGRPETPKQKGESRPETPKQKNEGRPETPKHRHDNRRD 767
Cdd:PTZ00108 1184 -----SSADKSKKASVVG------NSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSK 1252
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  768 SGKPSTEKKPEVSKHKQDTKSDSPRLKSerAEALKQRPDGRSvseslrrdhDNKQKSDDRGESERHRGDQSRVRrpetlr 847
Cdd:PTZ00108 1253 SSEDNDEFSSDDLSKEGKPKNAPKRVSA--VQYSPPPPSKRP---------DGESNGGSKPSSPTKKKVKKRLE------ 1315
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 578809903  848 sssrnehgIKSDSSKTDKLERKHRHESGDSRERP--SSGEQKSRPDSPRVKQGDSNKSRSDKLGFKSPTSKDDK 919
Cdd:PTZ00108 1316 --------GSLAALKKKKKSEKKTARKKKSKTRVkqASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDD 1381
PTZ00112 PTZ00112
origin recognition complex 1 protein; Provisional
588-785 3.03e-03

origin recognition complex 1 protein; Provisional


Pssm-ID: 240274 [Multi-domain]  Cd Length: 1164  Bit Score: 43.05  E-value: 3.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  588 SLKSTPENHPETPKKKSDPELSKSEMKQSESRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNEST 667
Cdd:PTZ00112  219 NDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKSQCLRKDSY 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  668 IVEPKQNENRLSDTKpNDNKQNNGRSETTKSRPETPKQKGESRPetpkqkSDGHPETPKQKGDGRPETPKQKgESRPETP 747
Cdd:PTZ00112  299 VYSNHQKKAKTGDPK-NIIHRNNGSSNSNNDDTSSSNHLGSNRI------SNRNPSSPYKKQTTTKHTNNTK-NNKYNKT 370
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 578809903  748 KQKNEGRPETPKHRHDNRRDSGKPSTEKKPEVSKHKQD 785
Cdd:PTZ00112  371 KTTQKFNHPLRHHATINKRSSMLPMSEQKGRGASEKSE 408
DUF612 pfam04747
Protein of unknown function, DUF612; This family includes several uncharacterized proteins ...
468-906 5.37e-03

Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.


Pssm-ID: 282585 [Multi-domain]  Cd Length: 511  Bit Score: 41.97  E-value: 5.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   468 EVELDALAEIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAGGATGGNRPASQETGSTGNGSRPALMVSidlhq 547
Cdd:pfam04747  106 EAEAKKRAAQEEEHKQWKAEQERIQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKAEKTKKASTPAPVEEEIVVK----- 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   548 agrvdsqaSITQDSDSIKKPEeikqcndapvsvlqedivgslKSTPENHPETPKKKSDP---ELSKSEMKQSESRLAESK 624
Cdd:pfam04747  181 --------KVANDRSAAPAPE---------------------PKTPTNTPAEPAEQVQEitgKKNKKNKKKSESEATAAP 231
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   625 PNENRLVETK---SSENKLETKVETQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPNDNKQNNGRSETTKSRPE 701
Cdd:pfam04747  232 ASVEQVVEQPkvvTEEPHQQAAPQEKKNKKNKRKSESENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEE 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   702 TPKQKGesrPETPKQKSDGHPE-----TPKQKGDGRP-ETPKQKGESRPETPKQKNEGRPETPkhrhdnrrdsgkPSTEK 775
Cdd:pfam04747  312 PVQAEA---PKSKKPTADDNMDfldfvTAKEEPKDEPaETPAAPVEEVVENVVENVVEKSTTP------------PATEN 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903   776 KPEVSKHKQdtKSDSPRLKSERAEALKQRPDGRSVSESLRRDHDNKQKS-DDRGESERHRGDQSRVRRPETLRSSSRNEH 854
Cdd:pfam04747  377 KKKNKKDKK--KSESEKVTEQPVESAPAPPQVEQVVETTPPASENKKKNkKDKKKSESEKAVEEPVQAAPSSKKPTADDN 454
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 578809903   855 GIKSD--SSKTDKLERKHRHESGDSRERPSSGEQKSRPDSPRVKQGDSNKSRSD 906
Cdd:pfam04747  455 MDFLDfvTAKPDKSESVEEHIAAPMIVEPAHADEETAAAAEGKKKNKKDKKKKE 508
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
337-679 6.43e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 42.02  E-value: 6.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  337 EQSEKAAMYDIISSPSKDSTKltlRLSRVRSSDMDQQEDMISGVENSNVSEN--DIPFNVQYPGQTSKTPITPQDINRPL 414
Cdd:PRK14949  473 EASSSLDADNSAVPEQIDSTA---EQSVVNPSVTDTQVDDTSASNNSAADNTvdDNYSAEDTLESNGLDEGDYAQDSAPL 549
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  415 NAAQCLSQQEQTAFLPANQVPVLQQNTSVAAKQPQTSVVQNQQQISQQGPIYDEVE------LDA-LAEierieRESAIE 487
Cdd:PRK14949  550 DAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASladddiLDAvLAA-----RDSLLS 624
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  488 rERFSKEVQDKD--KPLKKRKQDSYPQEAGGAtggnrPASQETGSTGNGSRPALMVSIDLHQAGRVDSQASITQDSDSIK 565
Cdd:PRK14949  625 -DLDALSPKEGDgkKSSADRKPKTPPSRAPPA-----SLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSAP 698
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  566 KPEEIKQCNDAPvsvlqedivgslkstPENHPETPKKKSDPELSKSEMKQSESRLAESKPNENRLvetkssENKLETKVE 645
Cdd:PRK14949  699 APPPVPDPYDRP---------------PWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAV------EQQATHQPQ 757
                         330       340       350
                  ....*....|....*....|....*....|....
gi 578809903  646 TQTEelkqnESRTTECKQNESTIVEPKQNENRLS 679
Cdd:PRK14949  758 VQAE-----AQSPASTTALTQTSSEVQDTELNLV 786
PRK12678 PRK12678
transcription termination factor Rho; Provisional
611-842 9.32e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 41.43  E-value: 9.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  611 SEMKQSE------SRLAESKPNENRLVETKSSENKLETKVETQTEELKQNESRTTECKQNESTIVEPKQNENRLSDTKPN 684
Cdd:PRK12678   44 SGMRKGEliaaikEARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPE 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  685 DNKQNNGRSETTKSRPETPKQKGESRPETPK------QKSDGHPETPKQKGDGRPETPKQKGES-----RPETPKQKNEG 753
Cdd:PRK12678  124 AAQARERRERGEAARRGAARKAGEGGEQPATearadaAERTEEEERDERRRRGDREDRQAEAERgergrREERGRDGDDR 203
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 578809903  754 RPETPKHRHDNRRDSGKpstekkpevskhkqdtksdspRLKSERAEALKQRPDGRSVSESLRRDhDNKQKSDDRGESERH 833
Cdd:PRK12678  204 DRRDRREQGDRREERGR---------------------RDGGDRRGRRRRRDRRDARGDDNRED-RGDRDGDDGEGRGGR 261

                  ....*....
gi 578809903  834 RGDQSRVRR 842
Cdd:PRK12678  262 RGRRFRDRD 270
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH