NCBI Conserved Domain Search

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

600-914

1.25e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.82 E-value: 1.25e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 673
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  674 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 749
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  750 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 829
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  830 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 909
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1907081939  910 RDLIK 914
Cdd:COG1196    497 LEAEA 501

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

588-869

1.08e-10

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.39 E-value: 1.08e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  588 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 667
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  668 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 747
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  748 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 823
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081939  824 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 869
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

652-1017

6.59e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 64.70 E-value: 6.59e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  652 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 731
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  732 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 794
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  795 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 867
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  868 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 927
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  928 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 991
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1907081939  992 NEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1857-2122

5.49e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 61.88 E-value: 5.49e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1857 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 1935
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1936 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2015
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2016 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2095
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1907081939 2096 RVKESEIQYLKQEISSLKDELQTALRD 2122
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1868-2060

1.15e-07

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 57.37 E-value: 1.15e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1868 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 1942
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1943 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2022
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081939 2023 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2060
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

594-1043

2.42e-07

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 56.34 E-value: 2.42e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  594 HELTSLLEKELEQSQKEASDLLEQNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLH 657
Cdd:pfam01576   78 HELESRLEEEEERSQQLQNEKKKMQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKER 154

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  658 RVNQDLQSELEAQCRRQELITQQIQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKV 737
Cdd:pfam01576  155 KLLEERISEFTSNLAEEEEKAKSLSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDL 220

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  738 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLM 817
Cdd:pfam01576  221 QEQIAELQAQIAELRAQLAKKEEELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNK 289

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  818 EKKLKRNYTLLLESCEQEKQALL------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLE 886
Cdd:pfam01576  290 AEKQRRDLGEELEALKTELEDTLdttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAK 369

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  887 EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLL 966
Cdd:pfam01576  370 RNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLL 449

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  967 REKEEelKHIKETHE-RVLEKKDQDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VS 1024
Cdd:pfam01576  450 NEAEG--KNIKLSKDvSSLESQLQDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLS 527

                          490
                   ....*....|....*....
gi 1907081939 1025 DSPKDAKEPLSTTEPTEEG 1043
Cdd:pfam01576  528 DMKKKLEEDAGTLEALEEG 546

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1701-2170

5.82e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 48.58 E-value: 5.82e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1701 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1772
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1773 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1852
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1853 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 1909
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1910 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 1978
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1979 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2052
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2053 ELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLRVKESE-----IQYLKQEISSLKDELQTALRDKKYAS 2127
Cdd:pfam15921  594 QLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLVNAGSerlraVKDIKQERDQLLNEVKTSRNELNSLS 673

                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*...
gi 1907081939 2128 DKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2170
Cdd:pfam15921  674 EDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

PTZ00121

MAEBL; Provisional

1658-2062

1.44e-04

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.44 E-value: 1.44e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1658 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1734
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1735 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1814
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1815 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 1894
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1895 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 1974
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1975 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2054
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773


                   ....*...
gi 1907081939 2055 NNRLAAEI 2062
Cdd:PTZ00121  1774 RKEKEAVI 1781

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1658-1949

9.73e-04

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 44.66 E-value: 9.73e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1658 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1737
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1738 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 1815
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1816 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 1888
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081939 1889 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 1949
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1562-2039

1.38e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.99 E-value: 1.38e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1562 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1641
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1642 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1721
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1722 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1801
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1802 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1881
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1882 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 1959
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1960 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2034
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1907081939 2035 QALRQ 2039
Cdd:COG4717    507 EEYRE 511

PRK03918

DNA double-strand break repair ATPase Rad50;

1578-2156

4.36e-03

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 42.36 E-value: 4.36e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1578 IQDELAQQLREKASILEEISAALPVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIR 1656
Cdd:PRK03918   191 IEELIKEKEKELEEVLREINEISSELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR 262

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1657 lEYEKELRFYKKACQEAKgaSGQKRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAEN 1736
Cdd:PRK03918   263 -ELEERIEELKKEIEELE--EKVKELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEE 335

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1737 KHSEsMFALQGRyEEEIRCMVEQLSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQE 1816
Cdd:PRK03918   336 KEER-LEELKKK-LKELEKRLEELEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITA 412

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1817 ELRALqEHYIWSLRGALSLYQPSHPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGD 1894
Cdd:PRK03918   413 RIGEL-KKEIKELKKAIEELKKAKGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKV 488

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1895 VaalqeKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEM 1972
Cdd:PRK03918   489 L-----KKESELIKLKELAEQLKELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAEL 561

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1973 ERELEKSQRsQISSINSDIEALRRQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQ 2052
Cdd:PRK03918   562 EKKLDELEE-ELAELLKELEELGFESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELA 636

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2053 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTA------LRDKKYA 2126
Cdd:PRK03918   637 ETEKRLEELRKELEELEK-----------KYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIkktlekLKEELEE 705

                          570       580       590
                   ....*....|....*....|....*....|
gi 1907081939 2127 SDKYKDIYTELSIAKAkadcDISRLKEQLK 2156
Cdd:PRK03918   706 REKAKKELEKLEKALE----RVEELREKVK 731

TOPEUc

smart00435

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...

1853-1949

8.88e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras

Pssm-ID: 214661 [Multi-domain] Cd Length: 391 Bit Score: 40.80 E-value: 8.88e-03

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  1853 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 1929
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346

                            90       100
                    ....*....|....*....|
gi 1907081939  1930 DLQRQHQReLEKLREEKDRL 1949
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365

Name

Accession

Description

Interval

E-value

PH_M-RIP

cd13275

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...

311-412

5.16e-47

Pssm-ID: 270094 Cd Length: 104 Bit Score: 164.04 E-value: 5.16e-47

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 388
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80

                           90       100
                   ....*....|....*....|....
gi 1907081939  389 SGIRRNWIQTIMKHVLPASAPDVT 412
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

311-403

4.25e-16

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 75.66 E-value: 4.25e-16

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939   311 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 384
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82

                            90
                    ....*....|....*....
gi 1907081939   385 SAMTSGIRRNWIQTIMKHV 403
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101

pfam00169

PH domain; PH stands for pleckstrin homology.

311-399

2.83e-15

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 73.75 E-value: 2.83e-15

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 381
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82

                           90
                   ....*....|....*...
gi 1907081939  382 FTLSAMTSGIRRNWIQTI 399
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

600-914

1.25e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 73.82 E-value: 1.25e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 673
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  674 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 749
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  750 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 829
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  830 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 909
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1907081939  910 RDLIK 914
Cdd:COG1196    497 LEAEA 501

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

654-1013

2.20e-12

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 72.66 E-value: 2.20e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNqDLQSELEAQcrRQELITQQIQTLKhsYGEAKDAIRHHEAEIQTLQTRlgNAAAELAIKEQALAKLKGELKME 733
Cdd:COG1196    186 ENLERLE-DILGELERQ--LEPLERQAEKAER--YRELKEELKELEAELLLLKLR--ELEAELEELEAELEELEAELEEL 258

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  734 QGKVREQLEEWQHSKAmlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV----QRLQECIAELSQQLGT 809
Cdd:COG1196    259 EAELAELEAELEELRL----ELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELeerlEELEEELAELEEELEE 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  810 SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSEtckgseqvhklEEEL 889
Cdd:COG1196    335 LEEELEELEEELEEA--------EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEA-----------LRAA 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  890 EAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREK 969
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1907081939  970 EEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1013
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519

cd00821

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...

311-399

3.34e-12

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275388 [Multi-domain] Cd Length: 92 Bit Score: 64.49 E-value: 3.34e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQ--YEDGQWKKHWFVLADQSLRYYRDSvAEEAADLDGEINLSTCYDVTEY-PVQRNYGFQIHTKEGE-FTLSA 386
Cdd:cd00821      1 KEGYLLKRggGGLKSWKKRWFVLFEGVLLYYKSK-KDSSYKPKGSIPLSGILEVEEVsPKERPHCFELVTPDGRtYYLQA 79

                           90
                   ....*....|...
gi 1907081939  387 MTSGIRRNWIQTI 399
Cdd:cd00821     80 DSEEERQEWLKAL 92

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

588-869

1.08e-10

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.39 E-value: 1.08e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  588 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 667
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  668 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 747
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  748 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 823
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081939  824 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 869
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

652-1017

6.59e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 64.70 E-value: 6.59e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  652 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 731
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  732 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 794
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  795 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 867
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  868 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 927
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  928 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 991
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1907081939  992 NEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

654-1049

9.17e-10

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 64.31 E-value: 9.17e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNQDLQ------SELEAQCRRqeLITQQIQTLKhsYGEAKDAIRHHEAEIQTLQtrlgnaaaelaiKEQALAKLK 727
Cdd:TIGR02168  179 RKLERTRENLDrledilNELERQLKS--LERQAEKAER--YKELKAELRELELALLVLR------------LEELREELE 242

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  728 gELKMEQGKVREQLEEwqhskamLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQL 807
Cdd:TIGR02168  243 -ELQEELKEAEEELEE-------LTAELQELEEKLEELRLEVSELEEEIEEL----------QKELYALANEISRLEQQK 304

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  808 GTSEQAQRLMEKKLKRnYTLLLESCEQEKQALLQNLKEVEDKasayEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEE 887
Cdd:TIGR02168  305 QILRERLANLERQLEE-LEAQLEELESKLDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEE 379

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  888 EleareasIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEklrgkevdyQNLEHSHHRVSVQLQSVRTLLR 967
Cdd:TIGR02168  380 Q-------LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ---------EIEELLKKLEEAELKELQAELE 443

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  968 EKEEELKHIKETHERV---LEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFvSDSPKDAKEPLsttEPTEEGS 1044
Cdd:TIGR02168  444 ELEEELEELQEELERLeeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENL-EGFSEGVKALL---KNQSGLS 519


                   ....*
gi 1907081939 1045 GILPL 1049
Cdd:TIGR02168  520 GILGV 524

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

602-811

1.51e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 63.78 E-value: 1.51e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  602 KELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPsgAWQRLHRVN------QDLQSELEAQCRRQE 675
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALR--LWFAQRRLElleaelEELRAELARLEAELE 312

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  676 LITQQIQTLKHSYGEAKDAIRHH--------EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHS 747
Cdd:COG4913    313 RLEARLDALREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAAL 392

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081939  748 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV-QRLQECIAELSQQLGTSE 811
Cdd:COG4913    393 LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIpARLLALRDALAEALGLDE 457

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

699-1007

2.55e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 62.77 E-value: 2.55e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  699 EAEIQTLQTRLGNAAAELAIKEQALAKLKGE---LKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE 775
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  776 LRDLETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYE 854
Cdd:TIGR02168  756 LTELEAEiEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA-----LDELRAELTLLNEEAANLRERLESLE 830

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  855 DQLQGHVQQVEALQKEKLSEtckgSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 934
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEEL----SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081939  935 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL----KHIKETHERVLEKKDQDLNEALVKMIALGSSLEE 1007
Cdd:TIGR02168  907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeySLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

719-1017

2.79e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 62.77 E-value: 2.79e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  719 KEQALAKLKGELKMEQGKVREQLEEwqhsKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQqaLQRDRQkEVQRLQE 798
Cdd:TIGR02168  675 RRREIEELEEKIEELEEKIAELEKA----LAELRKELEELEEELEQLRKELEELSRQISALRKD--LARLEA-EVEQLEE 747

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  799 CIAELSQQLgtSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEkLSETckg 878
Cdd:TIGR02168  748 RIAQLSKEL--TELEAEIEELEER------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE-LTLL--- 815

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  879 SEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQ 958
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081939  959 LQSVRTLLREKE----------EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:TIGR02168  896 LEELSEELRELEskrselrrelEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIE 964

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

736-1020

4.02e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.26 E-value: 4.02e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  736 KVREQLEEWQHSKAMLsgQLRASEQKLRSTEARLLEKTQELRDLETQQALqrdRQKEVQRLQECIAELSQQLGTSEQAQR 815
Cdd:COG1196    217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAE---LEAELEELRLELEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  816 LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREAS 895
Cdd:COG1196    292 ELLAELAR--------LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELE----EELEEAEEELEEAEAE 359

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  896 IRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKH 975
Cdd:COG1196    360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907081939  976 IKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLR 1020
Cdd:COG1196    440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1857-2122

5.49e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 61.88 E-value: 5.49e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1857 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 1935
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1936 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2015
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2016 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2095
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1907081939 2096 RVKESEIQYLKQEISSLKDELQTALRD 2122
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

654-1015

1.25e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.47 E-value: 1.25e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKme 733
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK-- 761

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  734 qgKVREQLEEWQHSKAMLSGQLRASEQKLRstEARLLEKTQELRDLEtqqalqrdrqKEVQRLQECIAELSQQLGTSEQA 813
Cdd:TIGR02169  762 --ELEARIEELEEDLHKLEEALNDLEARLS--HSRIPEIQAELSKLE----------EEVSRIEARLREIEQKLNRLTLE 827

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  814 QRLMEKKLkrnytlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEARE 893
Cdd:TIGR02169  828 KEYLEKEI------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  894 ASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE-E 972
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPvN 974

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081939  973 LKHIKEtHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEK 1015
Cdd:TIGR02169  975 MLAIQE-YEEVLKRLD-ELKEKRAKLEEERKAILERIEEYEKK 1015

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

738-1042

1.87e-08

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 60.08 E-value: 1.87e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  738 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQalqRDRQKEVQRLQEciaelsqqlgtSEQAQRLM 817
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKI---GEIEKEIEQLEQ-----------EEEKLKER 738

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  818 EKKLKRNytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL-QKEKLSETCKGSEQVHKLEEELEAREASI 896
Cdd:TIGR02169  739 LEELEED----LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeARLSHSRIPEIQAELSKLEEEVSRIEARL 814

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  897 RQLAQHVQSLHDERDLIKHQFQELMERVatsdgDVAELQEKLRGKEVDyqNLEHSHHRVSVQLQSVRTLLREKEEELKHI 976
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQR-----IDLKEQIKSIEKEIE--NLNGKKEELEEELEELEAALRDLESRLGDL 887

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081939  977 K------ETHERVLEKKDQDLNEALVKmiaLGSSLEETEIKLQEKEECLRRFvsDSPKDAKEPLSTTEPTEE 1042
Cdd:TIGR02169  888 KkerdelEAQLRELERKIEELEAQIEK---KRKRLSELKAKLEALEEELSEI--EDPKGEDEEIPEEELSLE 954

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

654-870

2.74e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 58.24 E-value: 2.74e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNQDL---QSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGEl 730
Cdd:COG4942     27 AELEQLQQEIaelEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE- 105

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  731 kmeqgkvreqleewqhskamLSGQLRASEQKLRSTEARLL----EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQ 806
Cdd:COG4942    106 --------------------LAELLRALYRLGRQPPLALLlspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAAL 165

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939  807 LGTSEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE 870
Cdd:COG4942    166 RAELEAERAELEALLAEL--------EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

PH2_MyoX

cd13296

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...

311-411

3.74e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270108 Cd Length: 103 Bit Score: 53.24 E-value: 3.74e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYEDG------QWKKHWFVLADQSLRYYRDsvAEEAADLDGEINLSTCYDVTEYPVQRNyGFQIHTKEGEFTL 384
Cdd:cd13296      1 KSGWLTKKGGGSstlsrrNWKSRWFVLRDTVLKYYEN--DQEGEKLLGTIDIRSAKEIVDNDPKEN-RLSITTEERTYHL 77

                           90       100
                   ....*....|....*....|....*..
gi 1907081939  385 SAMTSGIRRNWIQtIMKHVLPASAPDV 411
Cdd:cd13296     78 VAESPEDASQWVN-VLTRVISATDLEL 103

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

700-871

6.78e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 57.86 E-value: 6.78e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  700 AEIQTLQTRLGNAAAELAIKEQALAKLKgELKMEQGKVREQLEEWQHSKAMLSGQLRASE--QKLRSTEARLLEKTQELR 777
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELE-ELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLE 149

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  778 DLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQL 857
Cdd:COG4717    150 ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229

                          170
                   ....*....|....
gi 1907081939  858 QGHVQQVEALQKEK 871
Cdd:COG4717    230 EQLENELEAAALEE 243

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

756-1017

8.14e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.02 E-value: 8.14e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  756 RASEQKLRSTEARLL-------EKTQELRDLETQ-------QALQ---RDRQKEVQRLQecIAELSQQLGTSEQAQRLME 818
Cdd:COG1196    175 EEAERKLEATEENLErledilgELERQLEPLERQaekaeryRELKeelKELEAELLLLK--LRELEAELEELEAELEELE 252

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  819 KKLKRnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREASIRQ 898
Cdd:COG1196    253 AELEE-LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEELAE 327

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  899 LAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEvdyQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 978
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE---AELAEAEEELEELAEELLEALRAAAELAAQLEE 404

                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1907081939  979 ThERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:COG1196    405 L-EEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

679-869

1.14e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 57.23 E-value: 1.14e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  679 QQIQTLKHSYGEAKDAIRHHEA--EIQTLQTRLGNAAAELAIKEQALAKLK--------GELKMEQGKVREQLEEWQHSK 748
Cdd:COG4913    232 EHFDDLERAHEALEDAREQIELlePIRELAERYAAARERLAELEYLRAALRlwfaqrrlELLEAELEELRAELARLEAEL 311

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  749 AMLSGQLRASEQKLRSTEARLLE-KTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL 827
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGnGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAA 391

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907081939  828 LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 869
Cdd:COG4913    392 LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1868-2060

1.15e-07

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 57.37 E-value: 1.15e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1868 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 1942
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1943 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2022
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907081939 2023 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2060
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1856-2122

1.81e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 56.85 E-value: 1.81e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1856 PAAKDEAESMSGLRERIQELEAQMGVMREELGHkeLEgDVAALQEKYQRDFESLKA--TCERGFAAmeETHQKKIEDLQR 1933
Cdd:COG4913    221 PDTFEAADALVEHFDDLERAHEALEDAREQIEL--LE-PIRELAERYAAARERLAEleYLRAALRL--WFAQRRLELLEA 295

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1934 qhqrELEKLREEKDRLLAEETAATISAIEAmkNAHREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLS 2013
Cdd:COG4913    296 ----ELEELRAELARLEAELERLEARLDAL--REELDELEAQIRGNGGDRLEQLEREIERLER----ELEERERRRARLE 365

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2014 EQysqkcLENAHLAQALEAE--RQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqgkdayEL 2091
Cdd:COG4913    366 AL-----LAALGLPLPASAEefAALRAEAAALLEALEEELEALEEALAEAEAALR-----------------------DL 417

                          250       260       270
                   ....*....|....*....|....*....|.
gi 1907081939 2092 EVLLRVKESEIQYLKQEISSLKDELQTALRD 2122
Cdd:COG4913    418 RRELRELEAEIASLERRKSNIPARLLALRDA 448

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

594-1043

2.42e-07

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 56.34 E-value: 2.42e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  594 HELTSLLEKELEQSQKEASDLLEQNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLH 657
Cdd:pfam01576   78 HELESRLEEEEERSQQLQNEKKKMQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKER 154

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  658 RVNQDLQSELEAQCRRQELITQQIQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKV 737
Cdd:pfam01576  155 KLLEERISEFTSNLAEEEEKAKSLSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDL 220

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  738 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLM 817
Cdd:pfam01576  221 QEQIAELQAQIAELRAQLAKKEEELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNK 289

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  818 EKKLKRNYTLLLESCEQEKQALL------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLE 886
Cdd:pfam01576  290 AEKQRRDLGEELEALKTELEDTLdttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAK 369

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  887 EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLL 966
Cdd:pfam01576  370 RNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLL 449

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  967 REKEEelKHIKETHE-RVLEKKDQDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VS 1024
Cdd:pfam01576  450 NEAEG--KNIKLSKDvSSLESQLQDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLS 527

                          490
                   ....*....|....*....
gi 1907081939 1025 DSPKDAKEPLSTTEPTEEG 1043
Cdd:pfam01576  528 DMKKKLEEDAGTLEALEEG 546

PH_AtPH1

cd13276

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...

311-407

2.63e-07

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270095 Cd Length: 106 Bit Score: 50.78 E-value: 2.63e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVT--EYPVQRNYGFQIHTKEGEFTLSAM 387
Cdd:cd13276      1 KAGWLEKQGEFiKTWRRRWFVLKQGKLFWFKEPDVTPYSKPRGVIDLSKCLTVKsaEDATNKENAFELSTPEETFYFIAD 80

                           90       100
                   ....*....|....*....|
gi 1907081939  388 TSGIRRNWIQTIMKHVLPAS 407
Cdd:cd13276     81 NEKEKEEWIGAIGRAIVKHS 100

PH-GRAM1_AGT26

cd13215

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, ...

310-403

2.64e-07

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, repeat 1; ATG26 (also called UGT51/UDP-glycosyltransferase 51), a member of the glycosyltransferase 28 family, resulting in the biosynthesis of sterol glucoside. ATG26 in decane metabolism and autophagy. There are 32 known autophagy-related (ATG) proteins, 17 are components of the core autophagic machinery essential for all autophagy-related pathways and 15 are the additional components required only for certain pathways or species. The core autophagic machinery includes 1) the ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and ATG27), 2) the phosphatidylinositol 3-kinase complex (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7, ATG8, ATG10, ATG12, and ATG16). Less is known about how the core machinery is adapted or modulated with additional components to accommodate the nonselective sequestration of bulk cytosol (autophagosome formation) or selective sequestration of specific cargos (Cvt vesicle, pexophagosome, or bacteria-containing autophagosome formation). The pexophagosome-specific additions include the ATG30-ATG11-ATG17 receptor-adaptors complex, the coiled-coil protein ATG25, and the sterol glucosyltransferase ATG26. ATG26 is necessary for the degradation of medium peroxisomes. It contains 2 GRAM domains and a single PH domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275402 Cd Length: 116 Bit Score: 51.08 E-value: 2.64e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  310 FKKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSvaeeaADL---DGEINLSTCY--DVTEYPVQRNYGFQIHTKEGEFT 383
Cdd:cd13215     22 IKSGYLSKRsKRTLRYTRYWFVLKGDTLSWYNSS-----TDLyfpAGTIDLRYATsiELSKSNGEATTSFKIVTNSRTYK 96

                           90       100
                   ....*....|....*....|
gi 1907081939  384 LSAMTSGIRRNWIQTIMKHV 403
Cdd:cd13215     97 FKADSETSADEWVKALKKQI 116

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1874-2168

2.65e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 56.23 E-value: 2.65e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1874 ELEAQMGVMREELGhkELEGDVAALQEKyqrdFESLKATCERGFAAMEETHqKKIEDLQRQHQRELEKLREEKDRLlaEE 1953
Cdd:TIGR02169  671 SEPAELQRLRERLE--GLKRELSSLQSE----LRRIENRLDELSQELSDAS-RKIGEIEKEIEQLEQEEEKLKERL--EE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1954 TAATISAIEAMKNAHREEMErELEK---SQRSQISSINSDIEALRRQYLEE-LQSVQRELEVLSEQYSQKCLENAHLAQA 2029
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELK-ELEArieELEEDLHKLEEALNDLEARLSHSrIPEIQAELSKLEEEVSRIEARLREIEQK 820

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2030 LEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEStglpltqgkDAYELEVLLRVKESEIQYLKQEI 2109
Cdd:TIGR02169  821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE---------ELEELEAALRDLESRLGDLKKER 891

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081939 2110 SSLKDELQTALRDKKYASDKYKDIYTELSIAKAKAdcdiSRLKEQLKAATEALGEKSPE 2168
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKL----EALEEELSEIEDPKGEDEEI 946

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1683-2044

3.47e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.84 E-value: 3.47e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1683 QAVGALKEEYEELLhkqksEYQKVITliEKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSH 1762
Cdd:TIGR02169  198 QQLERLRREREKAE-----RYQALLK--EKREYEGYELLKEKEALERQKEAIERQLASL--------EEELEKLTEEISE 262

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1763 TENTLqAERSRVLSQLDASVKDRQAMEQHHVQ--------QMKMLEDRFQLKVRELQ------AVHQEELRALQEHyIWS 1828
Cdd:TIGR02169  263 LEKRL-EEIEQLLEELNKKIKDLGEEEQLRVKekigeleaEIASLERSIAEKERELEdaeerlAKLEAEIDKLLAE-IEE 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1829 LRGALSLYQpshpdsslapgpSEPRAVPAA-KDEAESMSGLRERIQELEAQMGVMREELghkelegdvaalqEKYQRDFE 1907
Cdd:TIGR02169  341 LEREIEEER------------KRRDKLTEEyAELKEELEDLRAELEEVDKEFAETRDEL-------------KDYREKLE 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1908 SLKatcergfaameethqKKIEDLQRQHQRELEKLREEKDRLlaEETAATISAIEAMKNAHREEME--RELEKSQRSQIS 1985
Cdd:TIGR02169  396 KLK---------------REINELKRELDRLQEELQRLSEEL--ADLNAAIAGIEAKINELEEEKEdkALEIKKQEWKLE 458

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081939 1986 SINSDIEALRRQYL---EELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQREN 2044
Cdd:TIGR02169  459 QLAADLSKYEQELYdlkEEYDRVEKELSKLQRELAE-----------AEAQARASEERVRGG 509

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

555-800

4.04e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.46 E-value: 4.04e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  555 VEIEQRWHQVET--TPLREEKQVPIAPLHLSLEdrSERLSTHELTSLLEKELEQSQKE-ASDLLEQNRLLQD--QLRVAL 629
Cdd:TIGR02169  268 EEIEQLLEELNKkiKDLGEEEQLRVKEKIGELE--AEIASLERSIAEKERELEDAEERlAKLEAEIDKLLAEieELEREI 345

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  630 GREQSAREGyvLQTEVATSPsgawQRLHRVNQDLQS-ELEAQCRRQEL--ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQ 706
Cdd:TIGR02169  346 EEERKRRDK--LTEEYAELK----EELEDLRAELEEvDKEFAETRDELkdYREKLEKLKREINELKRELDRLQEELQRLS 419

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  707 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG---QLRASEQKLRSTEARLLEKTQELRDLETQQ 783
Cdd:TIGR02169  420 EELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKyeqELYDLKEEYDRVEKELSKLQRELAEAEAQA 499

                          250
                   ....*....|....*..
gi 1907081939  784 ALQRDRQKEVQRLQECI 800
Cdd:TIGR02169  500 RASEERVRGGRAVEEVL 516

PH_CNK_mammalian-like

cd01260

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; ...

313-357

4.94e-07

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; CNK family members function as protein scaffolds, regulating the activity and the subcellular localization of RAS activated RAF. There is a single CNK protein present in Drosophila and Caenorhabditis elegans in contrast to mammals which have 3 CNK proteins (CNK1, CNK2, and CNK3). All of the CNK members contain a sterile a motif (SAM), a conserved region in CNK (CRIC) domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with the exception of CNK3, a PH domain. A CNK2 splice variant CNK2A also has a PDZ domain-binding motif at its C terminus and Drosophila CNK (D-CNK) also has a domain known as the Raf-interacting region (RIR) that mediates binding of the Drosophila Raf kinase. This cd contains CNKs from mammals, chickens, amphibians, fish, and crustacea. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269962 Cd Length: 114 Bit Score: 50.48 E-value: 4.94e-07

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081939  313 GWLTKQYEDG-----QWKKHWFVLADQSLRYYRDSVAEEAadlDGEINLS 357
Cdd:cd01260     17 GWLWKKKEAKsffgqKWKKYWFVLKGSSLYWYSNQQDEKA---EGFINLP 63

PH1_ARAP

cd13253

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, ...

311-404

5.28e-07

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, repeat 1; ARAP proteins (also called centaurin delta) are phosphatidylinositol 3,4,5-trisphosphate-dependent GTPase-activating proteins that modulate actin cytoskeleton remodeling by regulating ARF and RHO family members. They bind phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3) and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4,5)P2) binding. There are 3 mammalian ARAP proteins: ARAP1, ARAP2, and ARAP3. All ARAP proteins contain a N-terminal SAM (sterile alpha motif) domain, 5 PH domains, an ArfGAP domain, 2 ankyrin domain, A RhoGap domain, and a Ras-associating domain. This hierarchy contains the first PH domain in ARAP. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270073 Cd Length: 94 Bit Score: 49.69 E-value: 5.28e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYEDGQ---WKKHWFVLADQSLRYYRdsvAEEAADLDGEINLSTcydVTEYPVQRNYGFQIHTKEGEFTLSAM 387
Cdd:cd13253      2 KSGYLDKQGGQGNnkgFQKRWVVFDGLSLRYFD---SEKDAYSKRIIPLSA---ISTVRAVGDNKFELVTTNRTFVFRAE 75

                           90
                   ....*....|....*..
gi 1907081939  388 TSGIRRNWIQTIMKHVL 404
Cdd:cd13253     76 SDDERNLWCSTLQAAIS 92

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1750-2047

5.43e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.07 E-value: 5.43e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1750 EEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLE--DRFQLKVRELQAVHQEELRALQehyiw 1827
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEqlEQEEEKLKERLEELEEDLSSLE----- 750

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1828 slrgalslyqpshpdsslapgpsepRAVPAAKDEaesMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKyQRDFE 1907
Cdd:TIGR02169  751 -------------------------QEIENVKSE---LKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI-QAELS 801

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1908 SLKATCERGFAAMEETHQKkiedLQRQHQRE--LEKLREEK--DRLLAEETAATISAIEAMKNAHREEMERELEKSQRS- 1982
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQK----LNRLTLEKeyLEKEIQELqeQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAl 877

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081939 1983 -QISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQEL 2047
Cdd:TIGR02169  878 rDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1866-2164

7.03e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 54.94 E-value: 7.03e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1866 SGLRERIQELEAQMGVMREELG-----HKELEGDVAALQE---------KYQRDFESLKAtcergfaameETHQKKIEDL 1931
Cdd:COG1196    168 SKYKERKEEAERKLEATEENLErlediLGELERQLEPLERqaekaeryrELKEELKELEA----------ELLLLKLREL 237

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1932 QRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEMERELEKSQR-----SQISSINSDIEAL---RRQYLEE 2001
Cdd:COG1196    238 EAELEELEAELEELEAELeeLEAELAELEAELEELRLELEELELELEEAQAEeyellAELARLEQDIARLeerRRELEER 317

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2002 LQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDgggestglp 2081
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL--------- 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2082 LTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2161
Cdd:COG1196    389 LEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468


                   ...
gi 1907081939 2162 LGE 2164
Cdd:COG1196    469 LEE 471

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

700-1040

8.07e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 54.69 E-value: 8.07e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  700 AEIQTLQTRLGNAAAELAIKEQALAKLKGElkmeqgkvREQLEEWQHskamLSGQLRASEQKLRSTEARLLEKTQE--LR 777
Cdd:TIGR02169  177 EELEEVEENIERLDLIIDEKRQQLERLRRE--------REKAERYQA----LLKEKREYEGYELLKEKEALERQKEaiER 244

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  778 DLETQQALQRDRQKEVQRLQECIAELSQQLgtSEQAQRLMEKKLKRNYTLllesceQEKQALLQ-NLKEVEDKASAYEDQ 856
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLL--EELNKKIKDLGEEEQLRV------KEKIGELEaEIASLERSIAEKERE 316

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  857 LQghvqQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQE 936
Cdd:TIGR02169  317 LE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  937 KLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETH---ERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1013
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKInelEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

                          330       340
                   ....*....|....*....|....*..
gi 1907081939 1014 EKEECLRRfVSDSPKDAKEPLSTTEPT 1040
Cdd:TIGR02169  473 DLKEEYDR-VEKELSKLQRELAEAEAQ 498

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

572-977

8.63e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 54.74 E-value: 8.63e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  572 EKQVPIAPLHLSlEDRSER----LSTHELTSLLEK---ELEQSQKEASDLLEQNRLLQDQ-LRVALGREQSAREGYVLQT 643
Cdd:pfam15921  348 EKQLVLANSELT-EARTERdqfsQESGNLDDQLQKllaDLHKREKELSLEKEQNKRLWDRdTGNSITIDHLRRELDDRNM 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  644 EVatspsgawQRLHRVNQDLQSELEAQCRRQ--------------ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRL 709
Cdd:pfam15921  427 EV--------QRLEALLKAMKSECQGQMERQmaaiqgkneslekvSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTV 498

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  710 GNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHskamlsgqLRASEQKLRSTEArllektqELRDLETQQAlQRDR 789
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQH--------LKNEGDHLRNVQT-------ECEALKLQMA-EKDK 562

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  790 QKEVQRLQ-ECIAELSQQLGTSEQAQRLMEKKLKRNYtlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEaLQ 868
Cdd:pfam15921  563 VIEILRQQiENMTQLVGQHGRTAGAMQVEKAQLEKEI--------NDRRLELQEFKILKDKKDAKIRELEARVSDLE-LE 633

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  869 KEKLSETckGSEQVhkleeeleareasirqlaQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQN- 947
Cdd:pfam15921  634 KVKLVNA--GSERL------------------RAVKDIKQERD-------QLLNEVKTSRNELNSLSEDYEVLKRNFRNk 686

                          410       420       430
                   ....*....|....*....|....*....|...
gi 1907081939  948 ---LEHSHHRVSVQLQSVRTLLREKEEELKHIK 977
Cdd:pfam15921  687 seeMETTTNKLKMQLKSAQSELEQTRNTLKSME 719

PH_PEPP1_2_3

cd13248

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; ...

311-399

1.09e-06

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; PEPP1 (also called PLEKHA4/PH domain-containing family A member 4 and RHOXF1/Rhox homeobox family member 1), and related homologs PEPP2 (also called PLEKHA5/PH domain-containing family A member 5) and PEPP3 (also called PLEKHA6/PH domain-containing family A member 6), have PH domains that interact specifically with PtdIns(3,4)P3. Other proteins that bind PtdIns(3,4)P3 specifically are: TAPP1 (tandem PH-domain-containing protein-1) and TAPP2], PtdIns3P AtPH1, and Ptd- Ins(3,5)P2 (centaurin-beta2). All of these proteins contain at least 5 of the 6 conserved amino acids that make up the putative phosphatidylinositol 3,4,5- trisphosphate-binding motif (PPBM) located at their N-terminus. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270068 Cd Length: 104 Bit Score: 49.19 E-value: 1.09e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTcYDVT----EYPVQRNYGFQIhTKEGEFT- 383
Cdd:cd13248      9 MSGWLHKQGGSGlkNWRKRWFVLKDNCLYYYKD---PEEEKALGSILLPS-YTISpappSDEISRKFAFKA-EHANMRTy 83

                           90
                   ....*....|....*..
gi 1907081939  384 -LSAMTSGIRRNWIQTI 399
Cdd:cd13248     84 yFAADTAEEMEQWMNAM 100

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1791-2117

1.12e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 54.29 E-value: 1.12e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1791 HHVQQMKMLEDRFQLKVRELQAVhQEELRALQEhyiwSLRGALSLYQPSHPDSSLAPGPSEpRAVPAAKDEAESMSGLRE 1870
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAEL-RKELEELEE----ELEQLRKELEELSRQISALRKDLA-RLEAEVEQLEERIAQLSK 754

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1871 RIQELEAQMGVMREELGH-----KELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQRELEKLREE 1945
Cdd:TIGR02168  755 ELTELEAEIEELEERLEEaeeelAEAEAEIEELEAQIEQLKEELKALREA------------LDELRAELTLLNEEAANL 822

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1946 KDRLLAEETAAtisaieAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQSvqrELEVLSEQYSQKCLENAH 2025
Cdd:TIGR02168  823 RERLESLERRI------AATERRLEDLEEQIEE-LSEDIESLAAEIEELEEL-IEELES---ELEALLNERASLEEALAL 891

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2026 LAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLltgdgggESTGLPLTQ-----GKDAYE-LEVLLRVKE 2099
Cdd:TIGR02168  892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQErlseeYSLTLEeAEALENKIE 964

                          330
                   ....*....|....*...
gi 1907081939 2100 SEIQYLKQEISSLKDELQ 2117
Cdd:TIGR02168  965 DDEEEARRRLKRLENKIK 982

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1711-2096

1.34e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 53.79 E-value: 1.34e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1711 EKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQ 1790
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLE-LEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1791 HHVQQMKMLEDRFQLKVRELQAVHQEELRALQEhyiwslrgalslyqpshpdsslapgpsepravpaakdEAEsmsgLRE 1870
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEERLEELEEE-------------------------------------LAE----LEE 330

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1871 RIQELEAQMGVMREELghKELEGDVAALQEKYQRdfeslkatcergfaamEETHQKKIEDLQRQHQRELEKLREEKDRLL 1950
Cdd:COG1196    331 ELEELEEELEELEEEL--EEAEEELEEAEAELAE----------------AEEALLEAEAELAEAEEELEELAEELLEAL 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1951 AEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQAL 2030
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081939 2031 EAERQALRQCQRENQELNA-----HNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLR 2096
Cdd:COG1196    473 ALLEAALAELLEELAEAAArllllLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA 543

PH2_ADAP

cd01251

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called ...

309-399

2.04e-06

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called centaurin alpha) is a phophatidlyinositide binding protein consisting of an N-terminal ArfGAP domain and two PH domains. In response to growth factor activation, PI3K phosphorylates phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 1 is recruited to the plasma membrane following growth factor stimulation by specific binding of its PH domain to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 2 is constitutively bound to the plasma membrane since it binds phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate with equal affinity. This cd contains the second PH domain repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241282 Cd Length: 105 Bit Score: 48.35 E-value: 2.04e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  309 NFKK-GWLTK----QYEdgQWKKHWFVLADQSLRYYRDSvaeeaadLD----GEINLSTC---YDVTE-----YPVQRNY 371
Cdd:cd01251      1 DFLKeGYLEKtgpkQTD--GFRKRWFTLDDRRLMYFKDP-------LDafpkGEIFIGSKeegYSVREglppgIKGHWGF 71

                           90       100
                   ....*....|....*....|....*...
gi 1907081939  372 GFQIHTKEGEFTLSAMTSGIRRNWIQTI 399
Cdd:cd01251     72 GFTLVTPDRTFLLSAETEEERREWITAI 99

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

600-810

2.58e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 52.71 E-value: 2.58e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRL--LQDQLRVALGREQSAREgyvLQTEVATSPSGAWQRLHRVNQDLQSELEAQcrRQELI 677
Cdd:COG3206    187 LRKELEEAEAALEEFRQKNGLvdLSEEAKLLLQQLSELES---QLAEARAELAEAEARLAALRAQLGSGPDAL--PELLQ 261

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  678 TQQIQTLKHSYGEAkdairhhEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLE-EWQHSKAMLSgQLR 756
Cdd:COG3206    262 SPVIQQLRAQLAEL-------EAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEaELEALQAREA-SLQ 333

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907081939  757 ASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKE-VQRLQEciAELSQQLGTS 810
Cdd:COG3206    334 AQLAQLEARLAELPELEAELRRLEREVEVARELYESlLQRLEE--ARLAEALTVG 386

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

556-1021

2.74e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 53.02 E-value: 2.74e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  556 EIEQRWHQVETTPLREEKQvpIAPLHLSLEDRSERL-STHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQS 634
Cdd:COG1196    285 EAQAEEYELLAELARLEQD--IARLEERRRELEERLeELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  635 AREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAA 714
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  715 ELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgQLRASEQKLRSTEARLLE--KTQELRDLETQQALQRDRQKE 792
Cdd:COG1196    443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA-ELLEELAEAAARLLLLLEaeADYEGFLEGVKAALLLAGLRG 521

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  793 VQR---------------LQECIAELSQQLGT------SEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKAS 851
Cdd:COG1196    522 LAGavavligveaayeaaLEAALAAALQNIVVeddevaAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAV 601

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  852 AYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELME---RVATSD 928
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAallEAEAEL 681

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  929 GDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEET 1008
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761

                          490
                   ....*....|...
gi 1907081939 1009 EIKLQEKEECLRR 1021
Cdd:COG1196    762 LEELERELERLER 774

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

600-972

3.15e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 52.46 E-value: 3.15e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSARegyvLQTEVATSPSG---------AWQRLHRVNQDLQSEL-EA 669
Cdd:COG4717    100 LEEELEELEAELEELREELEKLEKLLQLLPLYQELEA----LEAELAELPERleeleerleELRELEEELEELEAELaEL 175

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  670 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgELKMEQGKVREQLEEWQHSKA 749
Cdd:COG4717    176 QEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL--ENELEAAALEERLKEARLLLL 253

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  750 MLSGQL------------------------------RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQEC 799
Cdd:COG4717    254 IAAALLallglggsllsliltiagvlflvlgllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  800 I--AELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQ-----NLKEVEDKASAYED--QLQGHVQQVEALQKE 870
Cdd:COG4717    334 LspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAeagveDEEELRAALEQAEEyqELKEELEELEEQLEE 413

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  871 KLSETCKGSEQVHKLE--EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMErvatsDGDVAELQEKLRGKEVDYQNL 948
Cdd:COG4717    414 LLGELEELLEALDEEEleEELEELEEELEELEEELEELREELAELEAELEQLEE-----DGELAELLQELEELKAELREL 488

                          410       420
                   ....*....|....*....|....
gi 1907081939  949 EHSHHRVSVQLQSVRTLLREKEEE 972
Cdd:COG4717    489 AEEWAALKLALELLEEAREEYREE 512

SMC_N

pfam02463

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...

715-1016

3.17e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.

Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 52.67 E-value: 3.17e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  715 ELAIKEQALAKL------KGELKMEQGKVREQL--EEWQHSKAMLSGQLRASEQKLRSTEARLLEKT---QELRDLETQQ 783
Cdd:pfam02463  167 LKRKKKEALKKLieetenLAELIIDLEELKLQElkLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKlneERIDLLQELL 246

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  784 ALQRDRQKEVQRLQECIAELSQQ----LGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAY 853
Cdd:pfam02463  247 RDEQEEIESSKQEIEKEEEKLAQvlkeNKEEEKEKKLQEEELKLLAKEEeelkseLLKLERRKVDDEEKLKESEKEKKKA 326

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  854 EDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQ---FQELMERVATSDGD 930
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAaklKEEELELKSEEEKE 406

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  931 VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEI 1010
Cdd:pfam02463  407 AQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQL 486


                   ....*.
gi 1907081939 1011 KLQEKE 1016
Cdd:pfam02463  487 ELLLSR 492

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

712-942

3.54e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 51.69 E-value: 3.54e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  712 AAAELAIKEQALAKLKGELKmeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ 790
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIA----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAElAELEKEIA 93

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  791 KEVQRLQECIAELSQQLgtseqaqRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQVEALQ 868
Cdd:COG4942     94 ELRAELEAQKEELAELL-------RALYRLGRQPPLALLLSPEDFLDAVrrLQYLKYLAPARREQAEELRADLAELAALR 166

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939  869 KEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKE 942
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1936-2164

5.31e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 51.98 E-value: 5.31e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1936 QRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR---ELEVL 2012
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQleeRIAQL 752

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2013 SEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTL-----LTGDGGGESTGLPLTQGKD 2087
Cdd:TIGR02168  753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELraeltLLNEEAANLRERLESLERR 832

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081939 2088 AYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE 2164
Cdd:TIGR02168  833 IAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

679-1009

6.19e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 51.56 E-value: 6.19e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  679 QQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAI---KEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQL 755
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  756 RASEQKLRSTEA-----RLLEKtqELRDLETQQA-LQRDRQKEVQRLQECIAELSQqlgTSEQAQRLMEKKLKRNYTLll 829
Cdd:TIGR04523  197 LKLELLLSNLKKkiqknKSLES--QISELKKQNNqLKDNIEKKQQEINEKTTEISN---TQTQLNQLKDEQNKIKKQL-- 269

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  830 esceQEKQallQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKG--------SEQVHKLEEELEAREASIRQLAQ 901
Cdd:TIGR04523  270 ----SEKQ---KELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNE 342

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  902 HVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKhIKETHE 981
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIK-KLQQEK 421

                          330       340
                   ....*....|....*....|....*...
gi 1907081939  982 RVLEKKDQDLNEALVKMIALGSSLEETE 1009
Cdd:TIGR04523  422 ELLEKEIERLKETIIKNNSEIKDLTNQD 449

PH_DAPP1

cd10573

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...

308-399

6.39e-06

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269977 [Multi-domain] Cd Length: 96 Bit Score: 46.55 E-value: 6.39e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  308 LNFKKGWLTKQyedGQ----WKKHWFVLADQSLRYYRDSVAEEAADldgEINLSTCYDVTEYPVQ-RNYGFQIHTKEGEF 382
Cdd:cd10573      2 LGSKEGYLTKL---GGivknWKTRWFVLRRNELKYFKTRGDTKPIR---VLDLRECSSVQRDYSQgKVNCFCLVFPERTF 75

                           90
                   ....*....|....*..
gi 1907081939  383 TLSAMTSGIRRNWIQTI 399
Cdd:cd10573     76 YMYANTEEEADEWVKLL 92

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

585-914

7.12e-06

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 51.61 E-value: 7.12e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  585 EDRSERLSTheLTSLLEKELEQSQKEASDLLEQNrllqdqlrvALGREQSAREGYVLqtevatspSGAWQRLHRVNQDLQ 664
Cdd:TIGR02169  183 EENIERLDL--IIDEKRQQLERLRREREKAERYQ---------ALLKEKREYEGYEL--------LKEKEALERQKEAIE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  665 SELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQ--------TLQTRLGNAAAELAIKEQALAKLKGELKMEQGK 736
Cdd:TIGR02169  244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgeeeqlRVKEKIGELEAEIASLERSIAEKERELEDAEER 323

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  737 VR---EQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----QALQRDRQKEVQRlQECIAELSQQLG 808
Cdd:TIGR02169  324 LAkleAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdKEFAETRDELKDY-REKLEKLKREIN 402

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  809 TSEQAQ-RLMEKKLKRNYTLL-----LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKL---SETCKGS 879
Cdd:TIGR02169  403 ELKRELdRLQEELQRLSEELAdlnaaIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYdlkEEYDRVE 482

                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1907081939  880 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIK 914
Cdd:TIGR02169  483 KELSKLQRELAEAEAQARASEERVRGGRAVEEVLK 517

PH_TBC1D2A

cd01265

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...

324-402

7.29e-06

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269966 Cd Length: 102 Bit Score: 46.55 E-value: 7.29e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  324 WKKHWFVLADQS--LRYYRDSvaeeaADLD--GEINLSTCydVTEYPVQRNYG-FQIHTKEGEFTLSAMTSGIRRNWIQT 398
Cdd:cd01265     19 WKRRWFVLDESKcqLYYYRSP-----QDATplGSIDLSGA--AFSYDPEAEPGqFEIHTPGRVHILKASTRQAMLYWLQA 91


                   ....
gi 1907081939  399 IMKH 402
Cdd:cd01265     92 LQSK 95

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

738-974

7.93e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.45 E-value: 7.93e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  738 REQLEEWQHSKAMLSgQLRASEQKLRSTEARLlEKTQELRD----------LETQQALQRDRQKEVQRLQECIAELSQQL 807
Cdd:COG4913    241 HEALEDAREQIELLE-PIRELAERYAAARERL-AELEYLRAalrlwfaqrrLELLEAELEELRAELARLEAELERLEARL 318

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  808 GTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQghvqqvealqkeKLSETCKGSEQVHKLEE 887
Cdd:COG4913    319 DALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLA------------ALGLPLPASAEEFAALR 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  888 eleareasiRQLAQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLR 967
Cdd:COG4913    387 ---------AEAAALLEALEEELE-------ALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALA 450

                          250
                   ....*....|.
gi 1907081939  968 E----KEEELK 974
Cdd:COG4913    451 EalglDEAELP 461

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

600-1017

9.23e-06

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 51.17 E-value: 9.23e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRLLQDQLRVaLGREQSAREGYVLQTEvatspsgaWQRLhRVNQDLqSELEAQCRRQELITQ 679
Cdd:TIGR04523  150 KEKELEKLNNKYNDLKKQKEELENELNL-LEKEKLNIQKNIDKIK--------NKLL-KLELLL-SNLKKKIQKNKSLES 218

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  680 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHskamlsgQLRASE 759
Cdd:TIGR04523  219 QISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLN-----------QLKDEQNKIKKQLSEKQK-------ELEQNN 280

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  760 QKLRSTEARLLEKTQELRDL--ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQA-QRLMEK--KLKRNytllLESCEQ 834
Cdd:TIGR04523  281 KKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIiSQLNEQisQLKKE----LTNSES 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  835 EKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL--QKEKLSETCKGSEQVHKleeeleareasirQLAQHVQSLHDERDL 912
Cdd:TIGR04523  357 ENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLesQINDLESKIQNQEKLNQ-------------QKDEQIKKLQQEKEL 423

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  913 IKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNL--------------EHSHHRVSVQLQSVRTLLREKEEELKHIKE 978
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntresletqlkvlSRSINKIKQNLEQKQKELKSKEKELKKLNE 503

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1907081939  979 tHERVLEKKDQDLN----EALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:TIGR04523  504 -EKKELEEKVKDLTkkisSLKEKIEKLESEKKEKESKISDLED 545

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1895-2204

1.22e-05

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 50.84 E-value: 1.22e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1895 VAALQEKYQRDFESLKATCERgfaamEETHQKKIEDLQRQhqreLEKLREEKDRLL--------AEETAATI--SAIEAM 1964
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEEN-----IERLDLIIDEKRQQ----LERLRREREKAEryqallkeKREYEGYEllKEKEAL 235

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1965 K------NAHREEMERELEKSQRsQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKCLENAHLAQA-LEAERQAL 2037
Cdd:TIGR02169  236 ErqkeaiERQLASLEEELEKLTE-EISELEKRLEEIEQ----LLEELNKKIKDLGEEEQLRVKEKIGELEAeIASLERSI 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2038 RQCQRENQELNAHNQELN---NRLAAEITRLRTLLtgdgggESTGLPLTQGKDAY-ELEVLLRVKESEIQYLKQEISSLK 2113
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEaeiDKLLAEIEELEREI------EEERKRRDKLTEEYaELKEELEDLRAELEEVDKEFAETR 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2114 DELQTALRDKKYASDKYKDIYTELSI---AKAKADCDISRLKEQLKAATEALGEkSPEGTTVSGYDIMKSKSNPDFLKKD 2190
Cdd:TIGR02169  385 DELKDYREKLEKLKREINELKRELDRlqeELQRLSEELADLNAAIAGIEAKINE-LEEEKEDKALEIKKQEWKLEQLAAD 463

                          330
                   ....*....|....
gi 1907081939 2191 RSCVTRQLRNIRSK 2204
Cdd:TIGR02169  464 LSKYEQELYDLKEE 477

CALCOCO1

pfam07888

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...

599-846

1.28e-05

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.

Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 50.28 E-value: 1.28e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  599 LLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQ----CRRQ 674
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKykelSASS 110

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  675 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHSKAMLSGQ 754
Cdd:pfam07888  111 EELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELE-----------RMKERAKKAGAQRKEEEAERKQLQAK 179

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  755 LRASEQKLRSTEARLlektQELRDLETQQALQrdrqkeVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL--LLESC 832
Cdd:pfam07888  180 LQQTEEELRSLSKEF----QELRNSLAQRDTQ------VLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLqeRLNAS 249

                          250
                   ....*....|....
gi 1907081939  833 EQEKQALLQNLKEV 846
Cdd:pfam07888  250 ERKVEGLGEELSSM 263

PH1_PLEKHH1_PLEKHH2

cd13282

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 ...

311-399

1.55e-05

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 (PLEKHH1) PH domain, repeat 1; PLEKHH1 and PLEKHH2 (also called PLEKHH1L) are thought to function in phospholipid binding and signal transduction. There are 3 Human PLEKHH genes: PLEKHH1, PLEKHH2, and PLEKHH3. There are many isoforms, the longest of which contain a FERM domain, a MyTH4 domain, two PH domains, a peroximal domain, a vacuolar domain, and a coiled coil stretch. The FERM domain has a cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241436 Cd Length: 96 Bit Score: 45.37 E-value: 1.55e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRD--SVAEEAAdldGEINLSTCYDVTeyPVQRNYGFQIHTKEGEFTLS 385
Cdd:cd13282      1 KAGYLTKL--GGKvktWKRRWFVLKNGELFYYKSpnDVIRKPQ---GQIALDGSCEIA--RAEGAQTFEIVTEKRTYYLT 73

                           90
                   ....*....|....
gi 1907081939  386 AMTSGIRRNWIQTI 399
Cdd:cd13282     74 ADSENDLDEWIRVI 87

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1679-2112

2.06e-05

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 49.94 E-value: 2.06e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1679 QKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAEN--KHSESMFALQGRYEEEIRCM 1756
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEleEAEEALLERLERLEEELEEL 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1757 VEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKmLEDRFQLKVRELQAVHQEELRALQEHyiWSLRGALSLY 1836
Cdd:COG1196    427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE-LLEEAALLEAALAELLEELAEAAARL--LLLLEAEADY 503

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1837 QPSHPDSSLAPGPSEPRAVPAAKDEAesMSGLRERIQELEAQMGVMREELGHKELEgdVAALQEKYQRDFESLKAT---- 1912
Cdd:COG1196    504 EGFLEGVKAALLLAGLRGLAGAVAVL--IGVEAAYEAALEAALAAALQNIVVEDDE--VAAAAIEYLKAAKAGRATflpl 579

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1913 --CERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMERELEKSQRSQISSI 1987
Cdd:COG1196    580 dkIRARAALAAALARGAIGAAVDLVASDLREADARYYVLgdtLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1988 NSDIEALRRQYLEELQSVQRELEVLSEQ--YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRL 2065
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERlaEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081939 2066 RTLltgdgggESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSL 2112
Cdd:COG1196    740 ELL-------EEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779

PH_Osh1p_Osh2p_yeast

cd13292

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p ...

312-403

2.86e-05

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p is proposed to function in postsynthetic sterol regulation, piecemeal microautophagy of the nucleus, and cell polarity establishment. Yeast Osh2p is proposed to function in sterol metabolism and cell polarity establishment. Both Osh1p and Osh2p contain 3 N-terminal ankyrin repeats, a PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBP andOsh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. In general OSBPs and ORPs have been found to be involved in the transport and metabolism of cholesterol and related lipids in eukaryotes. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241446 Cd Length: 103 Bit Score: 44.99 E-value: 2.86e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  312 KGWLTK--QYEDGqWKKHWFVLADQSLRYYRDSVAEEAAdLDGEINLSTCYDVteYPVQRNYGFQIHTKEG---EFTLSA 386
Cdd:cd13292      5 KGYLKKwtNYAKG-YKTRWFVLEDGVLSYYRHQDDEGSA-CRGSINMKNARLV--SDPSEKLRFEVSSKTSgspKWYLKA 80

                           90
                   ....*....|....*..
gi 1907081939  387 MTSGIRRNWIQTIMKHV 403
Cdd:cd13292     81 NHPVEAARWIQALQKAI 97

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

767-1017

2.88e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 2.88e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  767 ARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytlllescEQEKQALLQNLKEV 846
Cdd:COG4913    194 LRLLHKTQSFKPIGDLDDFVREYMLEEPDTFEAADALVEHFDDLERAHEALED-------------AREQIELLEPIREL 260

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  847 EDKASAYEDQLQGHVQQVEALQKEKlsetckGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVAT 926
Cdd:COG4913    261 AERYAAARERLAELEYLRAALRLWF------AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRG 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  927 SDGD-VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSL 1005
Cdd:COG4913    335 NGGDrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414

                          250
                   ....*....|..
gi 1907081939 1006 EETEIKLQEKEE 1017
Cdd:COG4913    415 RDLRRELRELEA 426

mukB

PRK04863

chromosome partition protein MukB;

584-924

2.92e-05

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 49.57 E-value: 2.92e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  584 LEDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQtevatspsgawQRLHRVNQDL 663
Cdd:PRK04863   289 LELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ-----------EKIERYQADL 357

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  664 QsELEAQCRRQELITQQIQTLKHSYGEAKDAIrhhEAEIQTLQTRLGNAAAELAIKE----------QALAKLKGELK-- 731
Cdd:PRK04863   358 E-ELEERLEEQNEVVEEADEQQEENEARAEAA---EEEVDELKSQLADYQQALDVQQtraiqyqqavQALERAKQLCGlp 433

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  732 -MEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEA--RLLEKTQEL-----------------RDLETQQALQRDRQK 791
Cdd:PRK04863   434 dLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahSQFEQAYQLvrkiagevsrseawdvaRELLRRLREQRHLAE 513

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  792 EVQRLQECIAELSQQLGTSEQAQRLM---EKKLKRNYTL--LLESCEQEKQALLQNLKE----VEDKASAYEDQLQGHVQ 862
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLaefCKRLGKNLDDedELEQLQEELEARLESLSEsvseARERRMALRQQLEQLQA 593

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081939  863 QVEALQK---------EKLSETCkgsEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERV 924
Cdd:PRK04863   594 RIQRLAArapawlaaqDALARLR---EQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEI 661

PH_SWAP-70

cd13273

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called ...

311-399

3.19e-05

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called Differentially expressed in FDCP 6/DEF-6 or IRF4-binding protein) functions in cellular signal transduction pathways (in conjunction with Rac), regulates cell motility through actin rearrangement, and contributes to the transformation and invasion activity of mouse embryo fibroblasts. Metazoan SWAP-70 is found in B lymphocytes, mast cells, and in a variety of organs. Metazoan SWAP-70 contains an N-terminal EF-hand motif, a centrally located PH domain, and a C-terminal coiled-coil domain. The PH domain of Metazoan SWAP-70 contains a phosphoinositide-binding site and a nuclear localization signal (NLS), which localize SWAP-70 to the plasma membrane and nucleus, respectively. The NLS is a sequence of four Lys residues located at the N-terminus of the C-terminal a-helix; this is a unique characteristic of the Metazoan SWAP-70 PH domain. The SWAP-70 PH domain binds PtdIns(3,4,5)P3 and PtdIns(4,5)P2 embedded in lipid bilayer vesicles. There are additional plant SWAP70 proteins, but these are not included in this hierarchy. Rice SWAP70 (OsSWAP70) exhibits GEF activity toward the its Rho GTPase, OsRac1, and regulates chitin-induced production of reactive oxygen species and defense gene expression in rice. Arabidopsis SWAP70 (AtSWAP70) plays a role in both PAMP- and effector-triggered immunity. Plant SWAP70 contains both DH and PH domains, but their arrangement is the reverse of that in typical DH-PH-type Rho GEFs, wherein the DH domain is flanked by a C-terminal PH domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270092 Cd Length: 110 Bit Score: 44.98 E-value: 3.19e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYrdsVAEEAADLDGEINL--STCYDVTEYPVQRNYGFQIHTKEGEFTLSAM 387
Cdd:cd13273     10 KKGYLWKKgHLLPTWTERWFVLKPNSLSYY---KSEDLKEKKGEIALdsNCCVESLPDREGKKCRFLVKTPDKTYELSAS 86

                           90
                   ....*....|..
gi 1907081939  388 TSGIRRNWIQTI 399
Cdd:cd13273     87 DHKTRQEWIAAI 98

PH_Gab-like

cd13324

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...

313-399

3.88e-05

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270133 Cd Length: 112 Bit Score: 44.71 E-value: 3.88e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  313 GWLTK-----QYEDGQWKKHWFVL------ADQS-LRYYRDsvaEEAADLDGEINLSTCYDVT-----EYPVQRN-YGFQ 374
Cdd:cd13324      5 GWLTKsppekKIWRAAWRRRWFVLrsgrlsGGQDvLEYYTD---DHCKKLKGIIDLDQCEQVDagltfEKKKFKNqFIFD 81

                           90       100
                   ....*....|....*....|....*
gi 1907081939  375 IHTKEGEFTLSAMTSGIRRNWIQTI 399
Cdd:cd13324     82 IRTPKRTYYLVAETEEEMNKWVRCI 106

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

660-1016

4.47e-05

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.86 E-value: 4.47e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  660 NQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGElkmEQGKvRE 739
Cdd:TIGR04523  309 NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE---NQSY-KQ 384

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  740 QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQ----ALQRDRQKEVQRLQECIAELSQQLGTSEQAQR 815
Cdd:TIGR04523  385 EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIerlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRE 464

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  816 LMEKKLKrnytLLLESCEQEKQALLQNLKEVEDKASAYeDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREAS 895
Cdd:TIGR04523  465 SLETQLK----VLSRSINKIKQNLEQKQKELKSKEKEL-KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESK 539

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  896 IRQLAQHVQSLHDE--RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL 973
Cdd:TIGR04523  540 ISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKEL 619

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1907081939  974 KHIKETHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEKE 1016
Cdd:TIGR04523  620 EKAKKENEKLSSIIK-NIKSKKNKLKQEVKQIKETIKEIRNKW 661

PRK02224

DNA double-strand break repair Rad50 ATPase;

585-1017

4.57e-05

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.88 E-value: 4.57e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  585 EDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYvlqTEVATSPSGAWQRLHRVNQDLQ 664
Cdd:PRK02224   293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEELREEAAELE 369

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  665 SELEAqCRRQelitqqiqtlkhsYGEAKDAIRHHEAEIQTLQTRLGNAAAELaikeQALAKLKGELKMEQGKVREQLEEw 744
Cdd:PRK02224   370 SELEE-AREA-------------VEDRREEIEELEEEIEELRERFGDAPVDL----GNAEDFLEELREERDELREREAE- 430

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  745 qhskamLSGQLRASEQKLRSTEaRLLEK------TQELRDLETQQALQRDRQKevqrlqecIAELSQQLGTSEQAQRLME 818
Cdd:PRK02224   431 ------LEATLRTARERVEEAE-ALLEAgkcpecGQPVEGSPHVETIEEDRER--------VEELEAELEDLEEEVEEVE 495

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  819 KKLKRNYTllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQvhklEEELEAREASIRQ 898
Cdd:PRK02224   496 ERLERAED--LVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEK----REAAAEAEEEAEE 569

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  899 LAQHVQSLHDERDLIKHQFQELmERVATSDGDVAELQ---EKLRGKEVDYQNLE-HSHHRVSVQLQSVRTLLREKE---- 970
Cdd:PRK02224   570 AREEVAELNSKLAELKERIESL-ERIRTLLAAIADAEdeiERLREKREALAELNdERRERLAEKRERKRELEAEFDeari 648

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081939  971 EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:PRK02224   649 EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695

Mplasa_alph_rch

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

661-1017

4.99e-05

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 48.48 E-value: 4.99e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  661 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDA-------IRHHEAEIQTLQTRLGNAAAELAI----KEQALAK-LKG 728
Cdd:TIGR04523  235 EKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQlsekqkeLEQNNKKIKELEKQLNQLKSEISDlnnqKEQDWNKeLKS 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  729 ELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ---KEVQRLQECIA 801
Cdd:TIGR04523  315 ELKNQEKKLEEiqnQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEiEKLKKENQsykQEIKNLESQIN 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  802 ELSQQLGTSEQAQRLME---KKLKRNYTLLLESCEQEKQALLQN---LKEVEDKASAYEDQLQGHVQQVEAlQKEKLSEt 875
Cdd:TIGR04523  395 DLESKIQNQEKLNQQKDeqiKKLQQEKELLEKEIERLKETIIKNnseIKDLTNQDSVKELIIKNLDNTRES-LETQLKV- 472

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  876 ckgseqvhkleeeleaREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 955
Cdd:TIGR04523  473 ----------------LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907081939  956 SVQLQSVRTLLREKEEELKhiKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1017
Cdd:TIGR04523  537 ESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

600-813

5.39e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 47.84 E-value: 5.39e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRLLQDQLrvalgrEQSAREGYVLQTEVATspsgAWQRLHRVNQDLQSELEAQCRRQELITQ 679
Cdd:COG4942     39 LEKELAALKKEEKALLKQLAALERRI------AALARRIRALEQELAA----LEAELAELEKEIAELRAELEAQKEELAE 108

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  680 QIQTL----KHSY-------GEAKDAIRHHEAeIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSK 748
Cdd:COG4942    109 LLRALyrlgRQPPlalllspEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEER 187

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907081939  749 AMLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQLGTSEQA 813
Cdd:COG4942    188 AALEALKAERQKLLARLEKELAELAAELAEL----------QQEAEELEALIARLEAEAAAAAER 242

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1701-2170

5.82e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 48.58 E-value: 5.82e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1701 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1772
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1773 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 1852
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1853 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 1909
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1910 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 1978
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1979 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2052
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2053 ELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLRVKESE-----IQYLKQEISSLKDELQTALRDKKYAS 2127
Cdd:pfam15921  594 QLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLVNAGSerlraVKDIKQERDQLLNEVKTSRNELNSLS 673

                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*...
gi 1907081939 2128 DKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2170
Cdd:pfam15921  674 EDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

COG1340

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

675-986

5.91e-05

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 47.21 E-value: 5.91e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  675 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtrlgNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhskamLSGQ 754
Cdd:COG1340      4 DELSSSLEELEEKIEELREEIEELKEKRDELN----EELKELAEKRDELNAQVKELREEAQELREKRDE-------LNEK 72

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  755 LRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS----EQAQRLMEK--KLKRNYTLL 828
Cdd:COG1340     73 VKELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEvlspEEEKELVEKikELEKELEKA 152

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  829 LESCEQEKQallqnLKEVEDKASAYEDQLQGHVQQVEALQKEklsetckgSEQVHKleeeleareaSIRQLAQHVQSLHD 908
Cdd:COG1340    153 KKALEKNEK-----LKELRAELKELRKEAEEIHKKIKELAEE--------AQELHE----------EMIELYKEADELRK 209

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081939  909 ERDLIKHQFQELMERvatsdgdVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRtllreKEEELKHIKETHERVLEK 986
Cdd:COG1340    210 EADELHKEIVEAQEK-------ADELHEEIIELQKELRELRKELKKLRKKQRALK-----REKEKEELEEKAEEIFEK 275

PH_evt

cd13265

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also ...

310-362

7.32e-05

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also called pleckstrin homology domain containing, family B): evt-1 (also called PLEKHB1) and evt-2 (also called PLEKHB2). evt-1 is specific to the nervous system, where it is expressed in photoreceptors and myelinating glia. evt-2 is widely expressed in both neural and nonneural tissues. Evectins possess a single N-terminal PH domain and a C-terminal hydrophobic region. evt-1 is thought to function as a mediator of post-Golgi trafficking in cells that produce large membrane-rich organelles. It is a candidate gene for the inherited human retinopathy autosomal dominant familial exudative vitreoretinopathy and a susceptibility gene for multiple sclerosis. evt-2 is essential for retrograde endosomal membrane transport from the plasma membrane (PM) to the Golgi. Two membrane trafficking pathways pass through recycling endosomes: a recycling pathway and a retrograde pathway that links the PM to the Golgi/ER. Its PH domain that is unique in that it specifically recognizes phosphatidylserine (PS), but not polyphosphoinositides. PS is an anionic phospholipid class in eukaryotic biomembranes, is highly enriched in the PM, and plays key roles in various physiological processes such as the coagulation cascade, recruitment and activation of signaling molecules, and clearance of apoptotic cells. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270085 Cd Length: 108 Bit Score: 43.83 E-value: 7.32e-05

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081939  310 FKKGWLTKQYE-DGQWKKHWFVL-ADQSLRYYRDsvaEEAADLDGEINL-STCYDV 362
Cdd:cd13265      4 VKSGWLLRQSTiLKRWKKNWFVLyGDGNLVYYED---ETRREVEGRINMpRECRNI 56

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

565-1031

1.01e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 47.66 E-value: 1.01e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  565 ETTPLREEkQVPIAPLHLSLEDRSERLSTHELTSLLEKELEQ-----SQKEASDLLEQNRLLQDQLRVALGREQSAREGY 639
Cdd:TIGR00618  401 ELDILQRE-QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  640 VLQTEVATSPSGAWQRLHRVnQDLQSELEAQCRRQELITQQIQTL------------KHSY-GEAKDAIRHHEAEIQTLQ 706
Cdd:TIGR00618  480 QIHLQETRKKAVVLARLLEL-QEEPCPLCGSCIHPNPARQDIDNPgpltrrmqrgeqTYAQlETSEEDVYHQLTSERKQR 558

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  707 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgqlraseqklRSTEARLLEKTQELRDLETQQALQ 786
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLS----------EAEDMLACEQHALLRKLQPEQDLQ 628

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  787 RDRQKEvqrlQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQN-LKEVEDKASAYEDQLQGHVQQVE 865
Cdd:TIGR00618  629 DVRLHL----QQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLaLQKMQSEKEQLTYWKEMLAQCQT 704

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  866 ALQKEKLSETcKGSEQVHKLEEELEAREASIR-QLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVD 944
Cdd:TIGR00618  705 LLRELETHIE-EYDREFNEIENASSSLGSDLAaREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHL 783

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  945 YQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFVS 1024
Cdd:TIGR00618  784 AAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQ 863


                   ....*..
gi 1907081939 1025 DSPKDAK 1031
Cdd:TIGR00618  864 LTQEQAK 870

HCR

pfam07111

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...

559-1014

1.03e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.

Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 47.44 E-value: 1.03e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  559 QRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLE-KELEQSQKEASDLLEQNRLLQDQLRVALGREQSARE 637
Cdd:pfam07111  146 QRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLETKRAGEaKQLAEAQKEAELLRKQLSKTQEELEAQVTLVESLRK 225

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  638 gYVLQTEVATSPSGAWqrlhrvnqdlqsELEaqcrRQELItQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELA 717
Cdd:pfam07111  226 -YVGEQVPPEVHSQTW------------ELE----RQELL-DTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELT 287

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  718 IKEQALAKLKGELKMeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA-----LQR---DR 789
Cdd:pfam07111  288 RKIQPSDSLEPEFPK---KCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQVAELQEQVTSQSqeqaiLQRalqDK 364

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  790 QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 863
Cdd:pfam07111  365 AAEVEVERMSAKGLQMELSRAQEARRRQQQQTASAEEQLkfvvnaMSSTQIWLETTMTRVEQAVARIPSLSNRLSYAVRK 444

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  864 V---EALQKEKLS------ETCKGSEqvhKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERvATSDGDVAEL 934
Cdd:pfam07111  445 VhtiKGLMARKVAlaqlrqESCPPPP---PAPPVDADLSLELEQLREERNRLDAELQLSAHLIQQEVGR-AREQGEAERQ 520

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  935 Q--EKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKM-IALGSSLEETEIK 1011
Cdd:pfam07111  521 QlsEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELTQQQEIYGQALQEKVAEVeTRLREQLSDTKRR 600


                   ...
gi 1907081939 1012 LQE 1014
Cdd:pfam07111  601 LNE 603

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

1845-2075

1.16e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 1.16e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1845 LAPGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLKATcERGFAAMEeth 1924
Cdd:COG4942     10 LLALAAAAQADAAAEAEAE-LEQLQQEIAELEKELAALKKE--EKALLKQLAALERRIAALARRIRAL-EQELAALE--- 82

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1925 qKKIEDLQRQH---QRELEKLREEKDRLLA--------EETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEA 1993
Cdd:COG4942     83 -AELAELEKEIaelRAELEAQKEELAELLRalyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1994 LRR------QYLEELQSVQRELE----VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEIT 2063
Cdd:COG4942    162 LAAlraeleAERAELEALLAELEeeraALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241

                          250
                   ....*....|..
gi 1907081939 2064 RLRTLLTGDGGG 2075
Cdd:COG4942    242 RTPAAGFAALKG 253

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

777-1025

1.36e-04

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 47.36 E-value: 1.36e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  777 RDLETQQALqRDRQKEVQRLQECIAELSQQLGT----SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASA 852
Cdd:TIGR02168  173 RRKETERKL-ERTRENLDRLEDILNELERQLKSlerqAEKAERYKELKAELR--------ELELALLVLRLEELREELEE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  853 YEDQLQGHVQQVEALQKEKlsetckgseqvhkleeelEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVA 932
Cdd:TIGR02168  244 LQEELKEAEEELEELTAEL------------------QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  933 ELQEKLRgkevdyqNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHErVLEKKDQDLNEALVKMIALgssLEETEIKL 1012
Cdd:TIGR02168  306 ILRERLA-------NLERQLEELEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAE---LEELESRL 374

                          250
                   ....*....|...
gi 1907081939 1013 QEKEECLRRFVSD 1025
Cdd:TIGR02168  375 EELEEQLETLRSK 387

PTZ00121

MAEBL; Provisional

1658-2062

1.44e-04

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 47.44 E-value: 1.44e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1658 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1734
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1735 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 1814
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1815 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 1894
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1895 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 1974
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1975 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2054
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773


                   ....*...
gi 1907081939 2055 NNRLAAEI 2062
Cdd:PTZ00121  1774 RKEKEAVI 1781

DUF5401

pfam17380

Family of unknown function (DUF5401); This is a family of unknown function found in ...

1870-2046

1.54e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.

Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 47.04 E-value: 1.54e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1870 ERIQELEA-QMGVMRE-ELGHKELEG--DVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKLREE 1945
Cdd:pfam17380  375 SRMRELERlQMERQQKnERVRQELEAarKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE 454

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1946 KdrLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISS---INSDIEALRRQYLEELQS---VQRELE-----VLSE 2014
Cdd:pfam17380  455 E--QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRrkiLEKELEERKQAMIEEERKrklLEKEMEerqkaIYEE 532

                          170       180       190
                   ....*....|....*....|....*....|..
gi 1907081939 2015 QYSQKCLENAHLAQALEAERQALRQCQRENQE 2046
Cdd:pfam17380  533 ERRREAEEERRKQQEMEERRRIQEQMRKATEE 564

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1863-2128

1.63e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.98 E-value: 1.63e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1863 ESMSGLRERIQELEAQMGVMREELGH------KELEGDVAALQEKYqRDFESLKATCERGFAAMEEtHQKKIEDLQRQHQ 1936
Cdd:TIGR02169  251 EELEKLTEEISELEKRLEEIEQLLEElnkkikDLGEEEQLRVKEKI-GELEAEIASLERSIAEKER-ELEDAEERLAKLE 328

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1937 RELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMEReleksqRSQISSINSDIEALRR---QYLEELQSVQRELE 2010
Cdd:TIGR02169  329 AEIDKLLAEIEELereIEEERKRRDKLTEEYAELKEELEDL------RAELEEVDKEFAETRDelkDYREKLEKLKREIN 402

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2011 VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTlltgdgggestglpLTQGKDAYE 2090
Cdd:TIGR02169  403 ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--------------LAADLSKYE 468

                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907081939 2091 LEvlLRVKESEIQYLKQEISSLKDELQTALRDKKYASD 2128
Cdd:TIGR02169  469 QE--LYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

760-963

1.64e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.22 E-value: 1.64e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  760 QKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQ-QLGTSEQAQRLMEKKLKRNyTLLLESCEQEKQA 838
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAAlRLWFAQRRLELLEAELEEL-RAELARLEAELER 313

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  839 LLQNLKEVEDKASAYEDQLQGH-VQQVEALQKEklsetckgseqVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF 917
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNgGDRLEQLERE-----------IERLERELEERERRRARLEALLAALGLPLPASAEEF 382

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907081939  918 QEL----MERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVR 963
Cdd:COG4913    383 AALraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

1868-2066

1.95e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 46.55 E-value: 1.95e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1868 LRERIQELEAQMGVMREELghKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQhqreLEKLREEKD 1947
Cdd:COG3206    173 ARKALEFLEEQLPELRKEL--EEAEAALEEFRQKNG----------LVDLSEEAKLLLQQLSELESQ----LAEARAELA 236

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1948 RLLAEETAATiSAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEE---LQSVQRELEVLSEQYSQkclENA 2024
Cdd:COG3206    237 EAEARLAALR-AQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQ---EAQ 312

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907081939 2025 HLAQALEAERQALRQCQRE-NQELNAHNQELN--NRLAAEITRLR 2066
Cdd:COG3206    313 RILASLEAELEALQAREASlQAQLAQLEARLAelPELEAELRRLE 357

PH_Boi

cd13316

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally ...

312-401

2.04e-04

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally redundant and important for cell growth with Boi mutants displaying defects in bud formation and in the maintenance of cell polarity.They appear to be linked to Rho-type GTPase, Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid interactions with the GTP-bound ("active") form of Cdc42, while Rho3 can suppress of the lethality caused by deletion of Boi1 and Boi2. These findings suggest that Boi1 and Boi2 are targets of Cdc42 that promote cell growth in a manner that is regulated by Rho3. Boi proteins contain a N-terminal SH3 domain, followed by a SAM (sterile alpha motif) domain, a proline-rich region, which mediates binding to the second SH3 domain of Bem1, and C-terminal PH domain. The PH domain is essential for its function in cell growth and is important for localization to the bud, while the SH3 domain is needed for localization to the neck. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270126 Cd Length: 97 Bit Score: 42.36 E-value: 2.04e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  312 KGWLTKQYED-GQWKKHWFVLADQSLRYYRdsvAEEAADLDGEINLsTCYDVT----EYPVQRNYGFQI--HTKEGEFTL 384
Cdd:cd13316      3 SGWMKKRGERyGTWKTRYFVLKGTRLYYLK---SENDDKEKGLIDL-TGHRVVpddsNSPFRGSYGFKLvpPAVPKVHYF 78

                           90
                   ....*....|....*..
gi 1907081939  385 SAMTSGIRRNWIQTIMK 401
Cdd:cd13316     79 AVDEKEELREWMKALMK 95

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

661-875

2.04e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 45.98 E-value: 2.04e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  661 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgelkmeqgkvREQ 740
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-----------REE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  741 LEEWQHSKAMLSGQLRASEQKLRSTE-ARLLEKTQELRDL-ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLME 818
Cdd:COG3883     88 LGERARALYRSGGSVSYLDVLLGSESfSDFLDRLSALSKIaDADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081939  819 KKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 875
Cdd:COG3883    168 AAKAE-----LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAA 219

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1857-2014

2.13e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 46.83 E-value: 2.13e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1857 AAKDEAES-MSGLRERIQELEAQM---GVMREElghkELEGDVAALQEKY---QRDFESLKATCER-GFAAmeETHQKKI 1928
Cdd:COG4913    309 AELERLEArLDALREELDELEAQIrgnGGDRLE----QLEREIERLERELeerERRRARLEALLAAlGLPL--PASAEEF 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1929 EDLQRQHQRELEKLREEKDRLLAEETAAtISAIEAMKNAHREeMERELEkSQRSQISSINSDIEALRRQYLEELQSVQRE 2008
Cdd:COG4913    383 AALRAEAAALLEALEEELEALEEALAEA-EAALRDLRRELRE-LEAEIA-SLERRKSNIPARLLALRDALAEALGLDEAE 459


                   ....*.
gi 1907081939 2009 LEVLSE 2014
Cdd:COG4913    460 LPFVGE 465

rad50

TIGR00606

rad50; All proteins in this family for which functions are known are involvedin recombination, ...

654-1019

2.71e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).

Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 46.58 E-value: 2.71e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNQDLQSELEAQCRRQELITQQIQTLK----------HSYGEAKDAIRHHEAEIQTLQtrlgnaAAELAIKEQAL 723
Cdd:TIGR00606  754 QKVNRDIQRLKNDIEEQETLLGTIMPEEESAKvcltdvtimeRFQMELKDVERKIAQQAAKLQ------GSDLDRTVQQV 827

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  724 AKLKGELKMEQGKVREQLEEWQHskamLSGQLRASEQKLRSTEARL-LEKTQELRDLETQQALQRDRQKEVQRLQECIAE 802
Cdd:TIGR00606  828 NQEKQEKQHELDTVVSKIELNRK----LIQDQQEQIQHLKSKTNELkSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIRE 903

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  803 LSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQ-VEALQKEKLSETCKGS 879
Cdd:TIGR00606  904 IKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQdkVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVN 983

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  880 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF---------QELMERVATSDGDVAELQekLRGKEVDYQNLEH 950
Cdd:TIGR00606  984 AQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLtlrkrenelKEVEEELKQHLKEMGQMQ--VLQMKQEHQKLEE 1061

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907081939  951 SHHRVSVQLQSVRTLLREKEEELKHIKethERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECL 1019
Cdd:TIGR00606 1062 NIDLIKRNHVLALGRQKGYEKEIKHFK---KELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL 1127

PRK03918

DNA double-strand break repair ATPase Rad50;

1687-2116

2.91e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.21 E-value: 2.91e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1687 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1764
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1765 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 1842
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1843 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 1922
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1923 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 1989
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1990 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2069
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1907081939 2070 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDEL 2116
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579

DUF3584

pfam12128

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...

1751-2163

3.03e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.

Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 46.37 E-value: 3.03e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1751 EEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHyIWSLR 1830
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLESAELR-LSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGE-LSAAD 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1831 GALSLYQpSHPDSSlapgpsEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLK 1910
Cdd:pfam12128  315 AAVAKDR-SELEAL------EDQHGAFLDADIETAAADQEQLPSWQSELENLEER--LKALTGKHQDVTAKYNRRRSKIK 385

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1911 ATCERGFAAMEEthqkkiedlqrqhqrELEKLREEKDRLLAEETAatisAIEAMKNAHREEME------RELEKSQRSQI 1984
Cdd:pfam12128  386 EQNNRDIAGIKD---------------KLAKIREARDRQLAVAED----DLQALESELREQLEagklefNEEEYRLKSRL 446

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1985 SSIN---------SDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELN 2055
Cdd:pfam12128  447 GELKlrlnqatatPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELE 526

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2056 NRLAAEITRLRTLLTGDGGG--ESTG----------------LPLTQGKDAYEL-EVLLRVKESEIQ---YLKQEISSLK 2113
Cdd:pfam12128  527 LQLFPQAGTLLHFLRKEAPDweQSIGkvispellhrtdldpeVWDGSVGGELNLyGVKLDLKRIDVPewaASEEELRERL 606

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907081939 2114 DELQTALRDkkyASDKYKDIYTELSIAKA---KADCDISRLKEQLKAATEALG 2163
Cdd:pfam12128  607 DKAEEALQS---AREKQAAAEEQLVQANGeleKASREETFARTALKNARLDLR 656

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

655-829

3.28e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 3.28e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  655 RLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKE--QALAKLKGELKM 732
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAE 143

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  733 EQGKVRE------QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE-----LRDLETQQALQRDRQKEVQRLQECIA 801
Cdd:COG4717    144 LPERLEEleerleELRELEEELEELEAELAELQEELEELLEQLSLATEEelqdlAEELEELQQRLAELEEELEEAQEELE 223

                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081939  802 ELSQQLGTSEQAQRL--MEKKLKRNYTLLL 829
Cdd:COG4717    224 ELEEELEQLENELEAaaLEERLKEARLLLL 253

PRK03918

DNA double-strand break repair ATPase Rad50;

584-1014

3.51e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.83 E-value: 3.51e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  584 LEDRSERLSTheltslLEKELEQSQKEASDLLEQNRLLQD--QLRVALGREQSAREGYVLQtevatspsgawqrlhrvnq 661
Cdd:PRK03918   333 LEEKEERLEE------LKKKLKELEKRLEELEERHELYEEakAKKEELERLKKRLTGLTPE------------------- 387

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  662 DLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNA---AAELAikEQALAKLKGELKMEQGKVR 738
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcGRELT--EEHRKELLEEYTAELKRIE 465

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  739 EQLEEWQHSKAMLSGQLRASEQKLR--STEARLLEKTQELRDLETQqaLQRDRQKEVQRLQECIAELSQQLGTSEQAQRL 816
Cdd:PRK03918   466 KELKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEK--LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKS 543

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  817 MEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEalqkEKLSETCKGSEQVHKLEEELEAREASI 896
Cdd:PRK03918   544 LKKELEK-----LEELKKKLAELEKKLDELEEELAELLKELEELGFESV----EELEERLKELEPFYNEYLELKDAEKEL 614

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  897 RQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL-----QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE 971
Cdd:PRK03918   615 EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAELEELEKRREEIKK 694

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1907081939  972 ELKHIKETHERVLEKKDQ--DLNEALVKMIALGSSLEETEIKLQE 1014
Cdd:PRK03918   695 TLEKLKEELEEREKAKKEleKLEKALERVEELREKVKKYKALLKE 739

PH_GRP1-like

cd01252

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 ...

311-399

4.16e-04

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 and the related proteins ARNO (ARF nucleotide-binding site opener)/cytohesin-2 and cytohesin-1 are ARF exchange factors that contain a pleckstrin homology (PH) domain thought to target these proteins to cell membranes through binding polyphosphoinositides. The PH domains of all three proteins exhibit relatively high affinity for PtdIns(3,4,5)P3. Within the Grp1 family, diglycine (2G) and triglycine (3G) splice variants, differing only in the number of glycine residues in the PH domain, strongly influence the affinity and specificity for phosphoinositides. The 2G variants selectively bind PtdIns(3,4,5)P3 with high affinity,the 3G variants bind PtdIns(3,4,5)P3 with about 30-fold lower affinity and require the polybasic region for plasma membrane targeting. These ARF-GEFs share a common, tripartite structure consisting of an N-terminal coiled-coil domain, a central domain with homology to the yeast protein Sec7, a PH domain, and a C-terminal polybasic region. The Sec7 domain is autoinhibited by conserved elements proximal to the PH domain. GRP1 binds to the DNA binding domain of certain nuclear receptors (TRalpha, TRbeta, AR, ER, but not RXR), and can repress thyroid hormone receptor (TR)-mediated transactivation by decreasing TR-complex formation on thyroid hormone response elements. ARNO promotes sequential activation of Arf6, Cdc42 and Rac1 and insulin secretion. Cytohesin acts as a PI 3-kinase effector mediating biological responses including cell spreading and adhesion, chemotaxis, protein trafficking, and cytoskeletal rearrangements, only some of which appear to depend on their ability to activate ARFs. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269954 Cd Length: 119 Bit Score: 41.92 E-value: 4.16e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRDSvaeEAADLDGEI---NLStcydVTEYP-VQRNYGFQIH------- 376
Cdd:cd01252      5 REGWLLKL--GGRvksWKRRWFILTDNCLYYFEYT---TDKEPRGIIpleNLS----VREVEdKKKPFCFELYspsngqv 75

                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1907081939  377 -----------TKEGEFT---LSAMTSGIRRNWIQTI 399
Cdd:cd01252     76 ikacktdsdgkVVEGNHTvyrISAASEEERDEWIKSI 112

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

556-858

4.29e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.83 E-value: 4.29e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  556 EIEQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLEK--ELEQSQKEASDLLEQNRLLQDQLRVALGREQ 633
Cdd:TIGR02169  699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDlsSLEQEIENVKSELKELEARIEELEEDLHKLE 778

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  634 SAREGyvLQTEVATSPsgaWQRLhrvnQDLQSELEAQCRRQELITQQIQ------TLKHSYgeAKDAIRHHEAEIQTLQT 707
Cdd:TIGR02169  779 EALND--LEARLSHSR---IPEI----QAELSKLEEEVSRIEARLREIEqklnrlTLEKEY--LEKEIQELQEQRIDLKE 847

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  708 RLGNAAAELAIKEQALAKLKGELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA 784
Cdd:TIGR02169  848 QIKSIEKEIENLNGKKEELEEELEELEAALRDlesRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939  785 LQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQ 858
Cdd:TIGR02169  928 ALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEE-----IRALEPVNMLAIQEYEEVLKRLDELKEKRA 996

PRK03918

DNA double-strand break repair ATPase Rad50;

1864-2221

4.52e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.44 E-value: 4.52e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1864 SMSGLRERIQELEAQMGVMREELghKELEGDVAALQE------------KYQRDFESLKATCERGFAAMEETH---QKKI 1928
Cdd:PRK03918   253 SKRKLEEKIRELEERIEELKKEI--EELEEKVKELKElkekaeeyiklsEFYEEYLDELREIEKRLSRLEEEIngiEERI 330

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1929 EDLQRQHQR--ELEKLREEKDRLLA--EETAATISAIEAMKnahrEEMERELEKSQRSQISSINSDIEALRRQYLEelqs 2004
Cdd:PRK03918   331 KELEEKEERleELKKKLKELEKRLEelEERHELYEEAKAKK----EELERLKKRLTGLTPEKLEKELEELEKAKEE---- 402

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2005 VQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNA-HNQELNNRLAAEITRLRTLLTGDGGGESTGLplt 2083
Cdd:PRK03918   403 IEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEeHRKELLEEYTAELKRIEKELKEIEEKERKLR--- 479

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2084 qgKDAYELEVLLRvKESEIQYLKQ---EISSLKDELqtalrdKKYASDKYKDIYTELSIAKAKAD---CDISRLKEQLKA 2157
Cdd:PRK03918   480 --KELRELEKVLK-KESELIKLKElaeQLKELEEKL------KKYNLEELEKKAEEYEKLKEKLIklkGEIKSLKKELEK 550

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939 2158 ATEALGEKSpegttvsgydimKSKSNPDFLKKDRSCVTRQLRNIRSKSLKEgltVQERLKLFES 2221
Cdd:PRK03918   551 LEELKKKLA------------ELEKKLDELEEELAELLKELEELGFESVEE---LEERLKELEP 599

PH_OSBP_ORP4

cd13284

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; ...

311-397

4.74e-04

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; Human OSBP is proposed to function is sterol-dependent regulation of ERK dephosphorylation and sphingomyelin synthesis as well as modulation of insulin signaling and hepatic lipogenesis. It contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBPs and Osh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. ORP4 is proposed to function in Vimentin-dependent sterol transport and/or signaling. Human ORP4 has 2 forms, a long (ORP4L) and a short (ORP4S). ORP4L contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. ORP4S is truncated and contains only an OSBP-related domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270101 Cd Length: 99 Bit Score: 41.21 E-value: 4.74e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTK--QYEDGqWKKHWFVLADQSLRYYRdSVAEEAADLDGEINLSTCYDVTEYPVQrnygFQIHT-KEGEFTLSAM 387
Cdd:cd13284      1 MKGWLLKwtNYIKG-YQRRWFVLSNGLLSYYR-NQAEMAHTCRGTINLAGAEIHTEDSCN----FVISNgGTQTFHLKAS 74

                           90
                   ....*....|
gi 1907081939  388 TSGIRRNWIQ 397
Cdd:cd13284     75 SEVERQRWVT 84

PRK03918

DNA double-strand break repair ATPase Rad50;

1937-2204

5.05e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.44 E-value: 5.05e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1937 RELEKLREEKDRLLAEEtaatiSAIEAMKnahrEEMERELEKSQRsQISSINSDIEALRRQyLEELQSVQRELEVLSEQY 2016
Cdd:PRK03918   172 KEIKRRIERLEKFIKRT-----ENIEELI----KEKEKELEEVLR-EINEISSELPELREE-LEKLEKEVKELEELKEEI 240

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2017 SQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRlAAEITRLRTLltgdgggestglpltqgKDAY-ELEVLL 2095
Cdd:PRK03918   241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK-VKELKELKEK-----------------AEEYiKLSEFY 302

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2096 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIyTELSIAKAKADCDISRLKEQLKAATEA---LGEKSPEGTTV 2172
Cdd:PRK03918   303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEKRLEELEERHELYEEAkakKEELERLKKRL 381

                          250       260       270
                   ....*....|....*....|....*....|..
gi 1907081939 2173 SGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSK 2204
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITAR 413

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

1669-2153

5.51e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 45.10 E-value: 5.51e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1669 ACQEAKGASGQKRAQAVGALKEEYEELLHKQKsEYQKVITLIEKENTELKAKVSqmdhqqrclqEAENKHSESMFALqgr 1748
Cdd:pfam05483  198 AFEELRVQAENARLEMHFKLKEDHEKIQHLEE-EYKKEINDKEKQVSLLLIQIT----------EKENKMKDLTFLL--- 263

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1749 yeEEIRCMVEQLSHtENTLQAERSRVLSQ----LDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEH 1824
Cdd:pfam05483  264 --EESRDKANQLEE-KTKLQDENLKELIEkkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEEL 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1825 YIWSLRGALSLYQPSHPDSSLAPG-PSEPRAVPAAKD-------EAESMSGLRERIQELEAQMGVMREELgHKELEGDVA 1896
Cdd:pfam05483  341 NKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKNEDqlkiitmELQKKSSELEEMTKFKNNKEVELEEL-KKILAEDEK 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1897 ALQEKYQRD--FESLKATcERGFAAMEETHQKKIEDLQRQ----------HQRELEKLREE--KDRLLAEETAATISAIE 1962
Cdd:pfam05483  420 LLDEKKQFEkiAEELKGK-EQELIFLLQAREKEIHDLEIQltaiktseehYLKEVEDLKTEleKEKLKNIELTAHCDKLL 498

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1963 AMKNAHREE---MERELEKSQRSQISSINSDIEALRR-QYLEELQSVQR-ELEVLSEQYSQ-----KCLENAHLAQALEA 2032
Cdd:pfam05483  499 LENKELTQEasdMTLELKKHQEDIINCKKQEERMLKQiENLEEKEMNLRdELESVREEFIQkgdevKCKLDKSEENARSI 578

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2033 ERQALRQCQRENQELNAHNQ-----ELNNRLAAEITRLRTLLTGDGGGESTGLpltqgkDAYELEVllRVKESEIQYLKQ 2107
Cdd:pfam05483  579 EYEVLKKEKQMKILENKCNNlkkqiENKNKNIEELHQENKALKKKGSAENKQL------NAYEIKV--NKLELELASAKQ 650

                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081939 2108 EISSLKDELQTALRDKKYASDKykdIYTELSIAKAKADCDISRLKE 2153
Cdd:pfam05483  651 KFEEIIDNYQKEIEDKKISEEK---LLEEVEKAKAIADEAVKLQKE 693

PH_Btk

cd01238

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of ...

311-399

5.69e-04

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of cytoplasmic protein tyrosine kinases that includes BMX, IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a role in the maturation of B cells. Tec proteins general have an N-terminal PH domain, followed by a Tek homology (TH) domain, a SH3 domain, a SH2 domain and a kinase domain. The Btk PH domain binds phosphatidylinositol 3,4,5-trisphosphate and responds to signalling via phosphatidylinositol 3-kinase. The PH domain is also involved in membrane anchoring which is confirmed by the discovery of a mutation of a critical arginine residue in the BTK PH domain. This results in severe human immunodeficiency known as X-linked agammaglobulinemia (XLA) in humans and a related disorder is mice.PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269944 [Multi-domain] Cd Length: 140 Bit Score: 42.22 E-value: 5.69e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQyedGQ---------WKKHWFVLADQSLRYYrDSVAEEAADLDGEINLSTCYDV----TEYPVQRNYGFQIHT 377
Cdd:cd01238      1 LEGLLVKR---SQgkkrfgpvnYKERWFVLTKSSLSYY-EGDGEKRGKEKGSIDLSKVRCVeevkDEAFFERKYPFQVVY 76

                           90       100
                   ....*....|....*....|..
gi 1907081939  378 KEGEFTLSAMTSGIRRNWIQTI 399
Cdd:cd01238     77 DDYTLYVFAPSEEDRDEWIAAL 98

PH_Gab2_2

cd13384

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...

310-399

6.31e-04

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241535 Cd Length: 115 Bit Score: 41.66 E-value: 6.31e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  310 FKKGWLTK-----QYEDGQWKKHWFVLADQS------LRYYRDsvaEEAADLDGEINLSTCYDV-----TEYPVQRNYG- 372
Cdd:cd13384      4 VYEGWLTKsppekRIWRAKWRRRYFVLRQSEipgqyfLEYYTD---RTCRKLKGSIDLDQCEQVdagltFETKNKLKDQh 80

                           90       100
                   ....*....|....*....|....*...
gi 1907081939  373 -FQIHTKEGEFTLSAMTSGIRRNWIQTI 399
Cdd:cd13384     81 iFDIRTPKRTYYLVADTEDEMNKWVNCI 108

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1723-2141

6.66e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 45.11 E-value: 6.66e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1723 QMDHQQRCLQEAENKHSESMFAlqgRYEEEIRCMVEQLSHTeNTLQAERSRVLSQldaSVKDRQAmeqhHVQQMKMLEDR 1802
Cdd:pfam15921   60 ELDSPRKIIAYPGKEHIERVLE---EYSHQVKDLQRRLNES-NELHEKQKFYLRQ---SVIDLQT----KLQEMQMERDA 128

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1803 FqLKVRELQAVHQEELRALQEHYIWSLRGALSLYQPSHPDSSLAPGP------------SEPRAVPAAKDEAESmsglrE 1870
Cdd:pfam15921  129 M-ADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQlrkmmlshegvlQEIRSILVDFEEASG-----K 202

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1871 RIQELEAQMGVMREELGH------KELEGDVAALQEK---YQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEK 1941
Cdd:pfam15921  203 KIYEHDSMSTMHFRSLGSaiskilRELDTEISYLKGRifpVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITG 282

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1942 LREEKdrllaeetaatiSAIEAMKNAHREEME--RELEKSQRSQISSINSDIEALRRQYLEELQSVQRELE-VLSEQYSQ 2018
Cdd:pfam15921  283 LTEKA------------SSARSQANSIQSQLEiiQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdKIEELEKQ 350

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2019 KCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVll 2095
Cdd:pfam15921  351 LVLANSELTEARTERDQFSQESGNLDDQLQKLLADLHKRekeLSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV-- 428

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907081939 2096 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAK 2141
Cdd:pfam15921  429 QRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTK 474

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

716-972

6.84e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.37 E-value: 6.84e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  716 LAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQrdrQKEVQR 795
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL---EAELAE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  796 LQECIAELSQQLGTSEQ--AQRL--MEKKLKRNYTLLLESCEqekqallqNLKEVEDKASAYEDQLQGHVQQVEALqkek 871
Cdd:COG4942     88 LEKEIAELRAELEAQKEelAELLraLYRLGRQPPLALLLSPE--------DFLDAVRRLQYLKYLAPARREQAEEL---- 155

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  872 lsetckgseqvhklEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHS 951
Cdd:COG4942    156 --------------RADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

                          250       260
                   ....*....|....*....|.
gi 1907081939  952 HHRVSVQLQSVRTLLREKEEE 972
Cdd:COG4942    222 AEELEALIARLEAEAAAAAER 242

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1859-2039

7.22e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 44.76 E-value: 7.22e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1859 KDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQrE 1938
Cdd:COG4717     91 AELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPER------------LEELEERLE-E 157

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1939 LEKLREEKDRLLAEetaatisaiEAMKNAHREEMERELEKSQRSQISSINSDIEALR---RQYLEELQSVQRELEVLSEQ 2015
Cdd:COG4717    158 LRELEEELEELEAE---------LAELQEELEELLEQLSLATEEELQDLAEELEELQqrlAELEEELEEAQEELEELEEE 228

                          170       180
                   ....*....|....*....|....
gi 1907081939 2016 YSQkcLENAHLAQALEAERQALRQ 2039
Cdd:COG4717    229 LEQ--LENELEAAALEERLKEARL 250

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

600-995

7.42e-04

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.96 E-value: 7.42e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSA-------REGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCR 672
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLkqlrariEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  673 RQ-ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGN-----AAAELAIKEQALAKLKGELKMEQGKVREQLEEWQH 746
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLlqtlhSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQ 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  747 SKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 823
Cdd:TIGR00618  387 QKTTLTQKLQSLCKeldILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  824 NYTLLLEScEQEKQALLQNLKEVEDKASAYEDQLQG------------HVQQVEALQKEKLSETCKGSEQVHKLEEELEA 891
Cdd:TIGR00618  467 SLKEREQQ-LQTKEQIHLQETRKKAVVLARLLELQEepcplcgscihpNPARQDIDNPGPLTRRMQRGEQTYAQLETSEE 545

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  892 REASIRQ-LAQHVQSLHDERDLIKHQFQELmervATSDGDVAELQEKLRGKEVDYQNL--EHSHHRVSVQLQSVRTLLRE 968
Cdd:TIGR00618  546 DVYHQLTsERKQRASLKEQMQEIQQSFSIL----TQCDNRSKEDIPNLQNITVRLQDLteKLSEAEDMLACEQHALLRKL 621

                          410       420
                   ....*....|....*....|....*..
gi 1907081939  969 KEEELKHIKETHERVLEKKDQDLNEAL 995
Cdd:TIGR00618  622 QPEQDLQDVRLHLQQCSQELALKLTAL 648

HEC1

COG5185

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...

1857-2153

8.49e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 44.56 E-value: 8.49e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1857 AAKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCE-----RGFAAMEETHQKKIEDL 1931
Cdd:COG5185    269 KLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEAEQEleeskRETETGIQNLTAEIEQG 348

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1932 QRQHQRELEKLREEKDRLLAEETAATisaieamknahREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEV 2011
Cdd:COG5185    349 QESLTENLEAIKEEIENIVGEVELSK-----------SSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2012 LSEQysqkclenahlaqaLEAERQALRQCQRENQElnahNQELNNRLAAEITRLRTLLTGDGggeSTGLPLTQGKDAYEL 2091
Cdd:COG5185    418 ADRQ--------------IEELQRQIEQATSSNEE----VSKLLNELISELNKVMREADEES---QSRLEEAYDEINRSV 476

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939 2092 EVLLRVKESEIQYLKQEISSLKDELQT--ALRDKKYASDKYKDIYTELSIAKAKADCDISRLKE 2153
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLKATLEKlrAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILA 540

PRK09039

peptidoglycan -binding protein;

1919-2071

8.86e-04

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.80 E-value: 8.86e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1919 AMEETHQKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEM--ERELEKSQRSQISSINSDIEAL 1994
Cdd:PRK09039    70 SLERQGNQDLQDSVANLRASLSAAEAERSRLqaLLAELAGAGAAAEGRAGELAQELdsEKQVSARALAQVELLNQQIAAL 149

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081939 1995 RRQyleeLQSVQRELEVlSEQYSQkclenahlaqalEAERQALRQCQRENQELNAHNQELnNRLAAE-ITRLRTLLTG 2071
Cdd:PRK09039   150 RRQ----LAALEAALDA-SEKRDR------------ESQAKIADLGRRLNVALAQRVQEL-NRYRSEfFGRLREILGD 209

PHA02562

endonuclease subunit; Provisional

656-881

9.17e-04

endonuclease subunit; Provisional

Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 44.23 E-value: 9.17e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  656 LHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKlkgeLKMEQG 735
Cdd:PHA02562   190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNK----LNTAAA 265

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  736 KVREQLEEWQHSKAMLS--GQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQKEVQRLQEcIAELSQQLgtseq 812
Cdd:PHA02562   266 KIKSKIEQFQKVIKMYEkgGVCPTCTQQISEGPDRITKIKDKLKELQHSlEKLDTAIDELEEIMDE-FNEQSKKL----- 339

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907081939  813 aqrlmeKKLKRNYtlllESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE--KLSETCKGSEQ 881
Cdd:PHA02562   340 ------LELKNKI----STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEldKIVKTKSELVK 400

SMC_prok_B

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1658-1949

9.73e-04

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 44.66 E-value: 9.73e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1658 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1737
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1738 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 1815
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1816 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 1888
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907081939 1889 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 1949
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984

PRK02224

DNA double-strand break repair Rad50 ATPase;

1857-2119

1.01e-03

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 44.26 E-value: 1.01e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1857 AAKDEAEsmsgLRERIQELEAQMGVMREELGHKELEGDVAALQ-----------EKYQRDFESLKATCERGFAAMEETHQ 1925
Cdd:PRK02224   197 EEKEEKD----LHERLNGLESELAELDEEIERYEEQREQARETrdeadevleehEERREELETLEAEIEDLRETIAETER 272

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1926 KK--IEDLQRQHQRELEKLREEKDRLLAEE--TAATISAIEAMKN---AHREEMERELEKsQRSQISSINSDIEALRrqy 1998
Cdd:PRK02224   273 EReeLAEEVRDLRERLEELEEERDDLLAEAglDDADAEAVEARREeleDRDEELRDRLEE-CRVAAQAHNEEAESLR--- 348

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1999 leelqsvqRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAaeitrlrtlltgdgggest 2078
Cdd:PRK02224   349 --------EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFG------------------- 401

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1907081939 2079 GLPLTQGKDAYELEVLLrvkeSEIQYLKQEISSLKDELQTA 2119
Cdd:PRK02224   402 DAPVDLGNAEDFLEELR----EERDELREREAELEATLRTA 438

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

1862-2062

1.15e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.14 E-value: 1.15e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1862 AESMSGLRERIQELEAQMGVMR--------------EELGHKELEGDVAALQEKYQR------DFESLK---ATCERGFA 1918
Cdd:COG4913    623 EEELAEAEERLEALEAELDALQerrealqrlaeyswDEIDVASAEREIAELEAELERldassdDLAALEeqlEELEAELE 702

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1919 AMEETHQKKIEDlQRQHQRELEKLREEKDRLLAEETAATISAI------------EAMKNAHREEMERELEKSQ---RSQ 1983
Cdd:COG4913    703 ELEEELDELKGE-IGRLEKELEQAEEELDELQDRLEAAEDLARlelralleerfaAALGDAVERELRENLEERIdalRAR 781

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1984 ISSINSDIEALRRQYLEE----LQSVQRELEVLSEqYSQKC--LENAHLAQALEAERQALRQCQRENQElnahnqELNNR 2057
Cdd:COG4913    782 LNRAEEELERAMRAFNREwpaeTADLDADLESLPE-YLALLdrLEEDGLPEYEERFKELLNENSIEFVA------DLLSK 854


                   ....*
gi 1907081939 2058 LAAEI 2062
Cdd:COG4913    855 LRRAI 859

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1562-2039

1.38e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.99 E-value: 1.38e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1562 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1641
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1642 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1721
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1722 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1801
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1802 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 1881
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1882 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 1959
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1960 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2034
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1907081939 2035 QALRQ 2039
Cdd:COG4717    507 EEYRE 511

EnvC

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

598-767

1.42e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 43.21 E-value: 1.42e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  598 SLLEKELEQSQKEAS----DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRR 673
Cdd:COG4942     79 AALEAELAELEKEIAelraELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRAD 158

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  674 QELITQQIQTLKhsygEAKDAIRHHEAEIQTLQTRLgnaAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG 753
Cdd:COG4942    159 LAELAALRAELE----AERAELEALLAELEEERAAL---EALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231

                          170
                   ....*....|....
gi 1907081939  754 QLRASEQKLRSTEA 767
Cdd:COG4942    232 LEAEAAAAAERTPA 245

COG5022

Myosin heavy chain [General function prediction only];

660-1126

1.63e-03

Myosin heavy chain [General function prediction only];

Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 43.91 E-value: 1.63e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  660 NQDLQSELEAQCRRQELITQQI----QTLKHSYGEAkdAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKmEQG 735
Cdd:COG5022    819 IIKLQKTIKREKKLRETEEVEFslkaEVLIQKFGRS--LKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVK-SIS 895

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  736 KVREQLEEWQHSKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseq 812
Cdd:COG5022    896 SLKLVNLELESEIIELKKSLSSDLIenlEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS----- 970

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  813 aqrlmekklkrnytlllesceQEKQALLqnlkeveDKASAYEDQLQGHVQQVEALQKEkLSETCKGSEQVHKLEEELEAR 892
Cdd:COG5022    971 ---------------------EEYEDLL-------KKSTILVREGNKANSELKNFKKE-LAELSKQYGALQESTKQLKEL 1021

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  893 EASIRQLAQHVQSLHDERDlIKHQFQELMERVATSDGDVAELQEKLrgKEVDYQN-LEHSHHRVSVQLQSVRTLlrEKEE 971
Cdd:COG5022   1022 PVEVAELQSASKIISSEST-ELSILKPLQKLKGLLLLENNQLQARY--KALKLRReNSLLDDKQLYQLESTENL--LKTI 1096

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  972 ELKHIKETHERVLEKKdqdlnEALVKMIALGSSLEEteikLQEKEECLRRFVSDSPkDAKEPLSTTEPTEEGSGILPLGS 1051
Cdd:COG5022   1097 NVKDLEVTNRNLVKPA-----NVLQFIVAQMIKLNL----LQEISKFLSQLVNTLE-PVFQKLSVLQLELDGLFWEANLE 1166

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1052 VTRVFPGFpHSQPEDEDPSAGLGEEGSSGS-------------LSREENTILPKSADMPE--REG-HLQSTSKSDPGapI 1115
Cdd:COG5022   1167 ALPSPPPF-AALSEKRLYQSALYDEKSKLSssevndlkneliaLFSKIFSGWPRGDKLKKliSEGwVPTEYSTSLKG--F 1243

                          490
                   ....*....|.
gi 1907081939 1116 KRPRIRFSTIQ 1126
Cdd:COG5022   1244 NNLNKKFDTPA 1254

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

549-974

1.64e-03

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 43.56 E-value: 1.64e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  549 KTQNVHVEIEQRWHQVETT--PLREEKQVPIAPLHLSLEDRSERLstHELTSLLEKELEQSQKEASDLLEQNRLLQDQLR 626
Cdd:pfam05483  180 ETRQVYMDLNNNIEKMILAfeELRVQAENARLEMHFKLKEDHEKI--QHLEEEYKKEINDKEKQVSLLLIQITEKENKMK 257

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  627 VALGREQSAREGyVLQTEVATSpsgawqrlhrvnqdLQSE-LEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTL 705
Cdd:pfam05483  258 DLTFLLEESRDK-ANQLEEKTK--------------LQDEnLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIA 322

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  706 QTRLGNAAAELAIKEQALAKLKGELKMeqgkvreQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR----DLET 781
Cdd:pfam05483  323 TKTICQLTEEKEAQMEELNKAKAAHSF-------VVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQkkssELEE 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  782 QQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytllLESCEQEKQALLQNL-KEVED---KASAYEDQL 857
Cdd:pfam05483  396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEE---------LKGKEQELIFLLQAReKEIHDleiQLTAIKTSE 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  858 QGHVQQVEALQKEKLSETCKGSEqvhkleeeleareasirqLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEK 937
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIE------------------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ 528

                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1907081939  938 LRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELK 974
Cdd:pfam05483  529 EERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVK 565

PH_RhoGap25-like

cd13263

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; ...

311-401

1.80e-03

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; RhoGAP25 (also called ArhGap25) like other RhoGaps are involved in cell polarity, cell morphology and cytoskeletal organization. They act as GTPase activators for the Rac-type GTPases by converting them to an inactive GDP-bound state and control actin remodeling by inactivating Rac downstream of Rho leading to suppress leading edge protrusion and promotes cell retraction to achieve cellular polarity and are able to suppress RAC1 and CDC42 activity in vitro. Overexpression of these proteins induces cell rounding with partial or complete disruption of actin stress fibers and formation of membrane ruffles, lamellipodia, and filopodia. This hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25. Members here contain an N-terminal PH domain followed by a RhoGAP domain and either a BAR or TATA Binding Protein (TBP) Associated Factor 4 (TAF4) domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270083 Cd Length: 114 Bit Score: 40.06 E-value: 1.80e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCyDVTEYPVQRN----YGFQIHTKEGE---- 381
Cdd:cd13263      5 KSGWLKKQGSIvKNWQQRWFVLRGDQLYYYKD---EDDTKPQGTIPLPGN-KVKEVPFNPEepgkFLFEIIPGGGGdrmt 80

                           90       100
                   ....*....|....*....|....*
gi 1907081939  382 -----FTLSAMTSGIRRNWIQTIMK 401
Cdd:cd13263     81 snhdsYLLMANSQAEMEEWVKVIRR 105

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

600-1019

1.80e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 43.57 E-value: 1.80e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNrllQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQelITQ 679
Cdd:pfam15921  247 LEALKSESQNKIELLLQQH---QDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQ--LSD 321

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  680 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQAlaklKGELKMEQGKVREQLEEwqhskamLSGQLRASE 759
Cdd:pfam15921  322 LESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQK-------LLADLHKRE 390

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  760 QKLRstearlLEKTQELR--DLETQQA-----LQR---DRQKEVQRLQ--------ECIAELSQQLGTSEQAQRLMEKkl 821
Cdd:pfam15921  391 KELS------LEKEQNKRlwDRDTGNSitidhLRReldDRNMEVQRLEallkamksECQGQMERQMAAIQGKNESLEK-- 462

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  822 krnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSE--QVHKLEEELEAREASIRQL 899
Cdd:pfam15921  463 ---VSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEitKLRSRVDLKLQELQHLKNE 539

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  900 AQHVQSLHDERDLIKHQFQELMERVatsdgdvaelqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLrEKE--------E 971
Cdd:pfam15921  540 GDHLRNVQTECEALKLQMAEKDKVI-----------EILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEindrrlelQ 607

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907081939  972 ELKHIKETHE---RVLEKKDQDLNEALVKMIALGSS-LEETEIKLQEKEECL 1019
Cdd:pfam15921  608 EFKILKDKKDakiRELEARVSDLELEKVKLVNAGSErLRAVKDIKQERDQLL 659

PRK02224

DNA double-strand break repair Rad50 ATPase;

1862-2047

1.81e-03

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 43.49 E-value: 1.81e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1862 AESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQekyQRDFESLKATCERgfaAMEETHQKkiedlQRQHQRELEK 1941
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEAR---REELEDRDEELRD---RLEECRVA-----AQAHNEEAES 346

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1942 LREEKDRLlaEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY---LEELQSVQRELEVLSEQysq 2018
Cdd:PRK02224   347 LREDADDL--EERAEELREEAAELESELEEAREAVED-RREEIEELEEEIEELRERFgdaPVDLGNAEDFLEELREE--- 420

                          170       180       190
                   ....*....|....*....|....*....|
gi 1907081939 2019 kcLENAHLAQA-LEAERQALRQCQRENQEL 2047
Cdd:PRK02224   421 --RDELREREAeLEATLRTARERVEEAEAL 448

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

712-987

1.88e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 43.42 E-value: 1.88e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  712 AAAELAIKEQALAK---LKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRD 788
Cdd:TIGR00618  182 ALMEFAKKKSLHGKaelLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQL 261

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  789 RQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQ 868
Cdd:TIGR00618  262 LKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  869 KEKLSETCKGSEQVH------------KLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSD-------- 928
Cdd:TIGR00618  342 EQRRLLQTLHSQEIHirdahevatsirEISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDtrtsafrd 421

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907081939  929 --GDVAEL-------QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKK 987
Cdd:TIGR00618  422 lqGQLAHAkkqqelqQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKK 489

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

656-1021

1.94e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.27 E-value: 1.94e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  656 LHRVNQDLQSELEAQCRRQELITQQIQT--------------LKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAE------ 715
Cdd:pfam10174  245 LERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevykshskfMKNKIDQLKQELSKKESELLALQTKLETLTNQnsdckq 324

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  716 --------LAIKEQALAKLKGE-----LKMEQ-----GKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR 777
Cdd:pfam10174  325 hievlkesLTAKEQRAAILQTEvdalrLRLEEkesflNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  778 DLETQqalQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDkasaYEDQL 857
Cdd:pfam10174  405 NLQEQ---LRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELES----LKKEN 477

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  858 QGHVQQVEALQKEKLSETCKGS---EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDvAEL 934
Cdd:pfam10174  478 KDLKEKVSALQPELTEKESSLIdlkEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVRTN-PEI 556

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  935 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEElKHIKETHERVLEK------KDQDLNEALVKMialgSSLEET 1008
Cdd:pfam10174  557 NDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENE-KNDKDKKIAELESltlrqmKEQNKKVANIKH----GQQEMK 631

                          410
                   ....*....|...
gi 1907081939 1009 EIKLQEKEECLRR 1021
Cdd:pfam10174  632 KKGAQLLEEARRR 644

CCDC158

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

565-980

1.94e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 43.57 E-value: 1.94e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  565 ETTPLREEKQVPIAPLH-----LSLE-DRSERLSTHELTSL-----LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQ 633
Cdd:pfam15921  371 ESGNLDDQLQKLLADLHkrekeLSLEkEQNKRLWDRDTGNSitidhLRRELDDRNMEVQRLEALLKAMKSECQGQMERQM 450

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  634 SAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGnaa 713
Cdd:pfam15921  451 AAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD--- 527

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  714 aelaIKEQALAKLKGE---------------LKM-EQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLlEKTQELR 777
Cdd:pfam15921  528 ----LKLQELQHLKNEgdhlrnvqtecealkLQMaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEINDR 602

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  778 DLETQQ--ALQRDRQKEVQRLQECIAEL-----------SQQLGT-----SEQAQRLMEKKLKRNYtllLESCEQEKQAL 839
Cdd:pfam15921  603 RLELQEfkILKDKKDAKIRELEARVSDLelekvklvnagSERLRAvkdikQERDQLLNEVKTSRNE---LNSLSEDYEVL 679

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  840 LQNLKEVEDKASAYEDQLQGHVQ--QVEALQKEKLSETCKGSE------------QVHKLEEELEAREASIRQLAQHVQS 905
Cdd:pfam15921  680 KRNFRNKSEEMETTTNKLKMQLKsaQSELEQTRNTLKSMEGSDghamkvamgmqkQITAKRGQIDALQSKIQFLEEAMTN 759

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  906 LHDERDLIKHQFQEL---MERVATSDGDVAELQEKLRGKEVDYQ----NLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 978
Cdd:pfam15921  760 ANKEKHFLKEEKNKLsqeLSTVATEKNKMAGELEVLRSQERRLKekvaNMEVALDKASLQFAECQDIIQRQEQESVRLKL 839


                   ..
gi 1907081939  979 TH 980
Cdd:pfam15921  840 QH 841

YhaN

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1681-2143

2.02e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.22 E-value: 2.02e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1681 RAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQEAENKHSESMFALQGRYEE---EIRCMV 1757
Cdd:COG4717     44 RAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAE-LQEELEELEEELEELEAELEElreELEKLE 122

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1758 EQLSHTENTLQAER-SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQlKVRELQAVHQEELRALQEHYIWSLRGALSLY 1836
Cdd:COG4717    123 KLLQLLPLYQELEAlEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQLSLATEEELQDLAEEL 201

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1837 QpshpdsslapgpsepravpAAKDEAESmsgLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCER- 1915
Cdd:COG4717    202 E-------------------ELQQRLAE---LEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALl 259

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1916 GFAAMEETHQKKIEDLQRQHQ-----RELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSD 1990
Cdd:COG4717    260 ALLGLGGSLLSLILTIAGVLFlvlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1991 IEALRRqyLEELQSVQRELEVLSEQYSQKCLE---NAHLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRT 2067
Cdd:COG4717    340 LELLDR--IEELQELLREAEELEEELQLEELEqeiAALLAEAGVEDEEELRAALEQAEEY----QELKEELEELEEQLEE 413

                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081939 2068 LLTGDGGGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYAsdkykDIYTELSIAKAK 2143
Cdd:COG4717    414 LLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELA-----ELLQELEELKAE 484

Smc

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1654-2047

2.20e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 43.39 E-value: 2.20e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1654 RIRLEYEKELRFYKKACQEAKGASGQKRAQAvgALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQE 1733
Cdd:COG1196    380 ELEELAEELLEALRAAAELAAQLEELEEAEE--ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE-LEE 456

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1734 AENKHSESMFALQGRYEEEIrcmvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAV 1813
Cdd:COG1196    457 EEEALLELLAELLEEAALLE----AALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV 532

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1814 HQEELRALQEhyiwsLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElGHKELEG 1893
Cdd:COG1196    533 EAAYEAALEA-----ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA-AVDLVAS 606

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1894 DVAALQEKYQRDFESL------KATCERGFAAMEETHQKKIE---DLQRQHQRELEKLREEKDRLLAEETAATISAIEAM 1964
Cdd:COG1196    607 DLREADARYYVLGDTLlgrtlvAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAE 686

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1965 KNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAE----------R 2034
Cdd:COG1196    687 RLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEElpeppdleelE 766

                          410
                   ....*....|...
gi 1907081939 2035 QALRQCQRENQEL 2047
Cdd:COG1196    767 RELERLEREIEAL 779

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

1978-2185

2.49e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 42.51 E-value: 2.49e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1978 KSQRSQISSINSDIEALRrqylEELQSVQRELEVLSEQYSQKcleNAHLAQALEAERQALRQCQRENQELNAHNQELNNR 2057
Cdd:COG3883     19 QAKQKELSELQAELEAAQ----AELDALQAELEELNEEYNEL---QAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2058 LAA------EITRLRTLLTGDGGGE--------STGLPLTQG--KDAYELEVLLRVKESEIQYLKQEISSLKDELQTALR 2121
Cdd:COG3883     92 ARAlyrsggSVSYLDVLLGSESFSDfldrlsalSKIADADADllEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907081939 2122 DKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPEGTTVSGYDIMKSKSNPD 2185
Cdd:COG3883    172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAA 235

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

1862-2031

2.75e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 42.70 E-value: 2.75e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1862 AESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATCErgfAAMEETHQKKIEDLQRQHQRELEK 1941
Cdd:COG3206    211 SEEAKLLLQQLSELESQLAEARAEL--AEAEARLAALRAQLGSGPDALPELLQ---SPVIQQLRAQLAELEAELAELSAR 285

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1942 LREEKDRL--LAEETAATISAIEAMKNAHREEMERELEkSQRSQISSINSDIEALRRQYLE------ELQSVQRELEVLS 2013
Cdd:COG3206    286 YTPNHPDViaLRAQIAALRAQLQQEAQRILASLEAELE-ALQAREASLQAQLAQLEARLAElpeleaELRRLEREVEVAR 364

                          170       180
                   ....*....|....*....|
gi 1907081939 2014 EQYSQ--KCLENAHLAQALE 2031
Cdd:COG3206    365 ELYESllQRLEEARLAEALT 384

SMC_prok_A

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1932-2204

3.08e-03

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 42.75 E-value: 3.08e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1932 QRQHQRELEKLREEKDRLLAEEtAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALrrqyLEELQSVQRELEV 2011
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKREL-SSLQSELRRIEN-RLDELSQELSDASR-KIGEIEKEIEQL----EQEEEKLKERLEE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2012 LSEQYSQkclenahLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYEL 2091
Cdd:TIGR02169  742 LEEDLSS-------LEQEIENVKSELKELEARIEEL----EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 2092 EVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKadcdISRLKEQLKAATEALgekspegtt 2171
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAAL--------- 877

                          250       260       270
                   ....*....|....*....|....*....|...
gi 1907081939 2172 vsgYDIMKSKSNpdfLKKDRSCVTRQLRNIRSK 2204
Cdd:TIGR02169  878 ---RDLESRLGD---LKKERDELEAQLRELERK 904

PRK10246

exonuclease subunit SbcC; Provisional

600-863

3.09e-03

exonuclease subunit SbcC; Provisional

Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 42.87 E-value: 3.09e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  600 LEKELEQSQKEASDLLEQNRLLQDQLRvalgREQSAREGYVLQTEVATSpsgAWQRL-------HRVNQDLQSELEAQCR 672
Cdd:PRK10246   535 LEKEVKKLGEEGAALRGQLDALTKQLQ----RDESEAQSLRQEEQALTQ---QWQAVcaslnitLQPQDDIQPWLDAQEE 607

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  673 RQELITQ--QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIK------EQA-LAKLKGELKMEQGKVREQ--L 741
Cdd:PRK10246   608 HERQLRLlsQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTlpqedeEASwLATRQQEAQSWQQRQNELtaL 687

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  742 EEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD----LETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQR- 815
Cdd:PRK10246   688 QNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHEqclsLHSQlQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQq 767

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907081939  816 ------LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 863
Cdd:PRK10246   768 aflaalLDEETLTQ--------LEQLKQNLENQRQQAQTLVTQTAQALAQHQQH 813

sbcc

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

654-808

3.25e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 42.65 E-value: 3.25e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  654 QRLHRVNQDLQSELEA-QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtRLGNAAAELAIKEQALAKLKGELKM 732
Cdd:TIGR00618  725 NASSSLGSDLAAREDAlNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGA-ELSHLAAEIQFFNRLREEDTHLLKT 803

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081939  733 EQGKVREQLEewqHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLG 808
Cdd:TIGR00618  804 LEAEIGQEIP---SDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

754-924

3.48e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.59 E-value: 3.48e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  754 QLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKK---LKRNYTL--- 827
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELerlDASSDDLaal 690

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  828 --LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQS 905
Cdd:COG4913    691 eeQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELREN 770

                          170
                   ....*....|....*....
gi 1907081939  906 LHDERDLIKHQFQELMERV 924
Cdd:COG4913    771 LEERIDALRARLNRAEEEL 789

mukB

PRK04863

chromosome partition protein MukB;

1671-2068

3.54e-03

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 42.64 E-value: 3.54e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1671 QEAKGASGQ-KRAQA-VGALKEEYEE------LLHKQKSEYQKVITLIEKENTELKAKVSqmDHQQRcLQEAENKhsesm 1742
Cdd:PRK04863   341 QTALRQQEKiERYQAdLEELEERLEEqnevveEADEQQEENEARAEAAEEEVDELKSQLA--DYQQA-LDVQQTR----- 412

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1743 fALQGRyeeeircmveqlshteNTLQA-ERSRVLSQLDA----SVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEE 1817
Cdd:PRK04863   413 -AIQYQ----------------QAVQAlERAKQLCGLPDltadNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQF 475

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1818 LRALQehyiwSLRgalslyqpshpdsSLAPGPSEPRAVPAAKD---EAESMSGLRERIQELEAQmgvmreelgHKELEGD 1894
Cdd:PRK04863   476 EQAYQ-----LVR-------------KIAGEVSRSEAWDVAREllrRLREQRHLAEQLQQLRMR---------LSELEQR 528

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1895 VAAlqekyQRDFESLKATCERGFAAMEEThQKKIEDLQRQHQRELEKLREEKDRLlaeetaatisaieamkNAHREEMER 1974
Cdd:PRK04863   529 LRQ-----QQRAERLLAEFCKRLGKNLDD-EDELEQLQEELEARLESLSESVSEA----------------RERRMALRQ 586

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1975 ELEKsqrsqissINSDIEALRRQYLEELQSvQRELEVLSEQY------SQKCLEnaHLAQALEAERQALR---QCQRENQ 2045
Cdd:PRK04863   587 QLEQ--------LQARIQRLAARAPAWLAA-QDALARLREQSgeefedSQDVTE--YMQQLLERERELTVerdELAARKQ 655

                          410       420
                   ....*....|....*....|...
gi 1907081939 2046 ELNAHNQELNNRLAAEITRLRTL 2068
Cdd:PRK04863   656 ALDEEIERLSQPGGSEDPRLNAL 678

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

1925-2068

3.66e-03

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.50 E-value: 3.66e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1925 QKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISA--------------IEAMKNAH-REEMER--ELEkSQRSQIS 1985
Cdd:pfam10174  400 QKKIENLQEQLRDKDKQLAGLKERVksLQTDSSNTDTAlttleealsekeriIERLKEQReREDRERleELE-SLKKENK 478

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939 1986 SINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQR-ENQELNAHNQELNNRLAAEIT- 2063
Cdd:pfam10174  479 DLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKlENQLKKAHNAEEAVRTNPEINd 558


                   ....*
gi 1907081939 2064 RLRTL 2068
Cdd:pfam10174  559 RIRLL 563

PRK11281

mechanosensitive channel MscK;

661-869

3.81e-03

mechanosensitive channel MscK;

Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 42.59 E-value: 3.81e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  661 QDLQSELEAQCRRQELITQQ---IQTLKHSYgEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKG--------- 728
Cdd:PRK11281    39 ADVQAQLDALNKQKLLEAEDklvQQDLEQTL-ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDdndeetret 117

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  729 -------ELKMEQGKVREQLEEWQHSKAMLSGQL--------RASEQkLRSTEARLLEKTQELRDLETQQALQRDRQKev 793
Cdd:PRK11281   118 lstlslrQLESRLAQTLDQLQNAQNDLAEYNSQLvslqtqpeRAQAA-LYANSQRLQQIRNLLKGGKVGGKALRPSQR-- 194

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  794 QRLQECIAELSQQ-------LGTSEQAQRLMEKklKRNYTLLLESCEQEKQALLQNLkeVEDKasaYEDQLQGHVQQVEA 866
Cdd:PRK11281   195 VLLQAEQALLNAQndlqrksLEGNTQLQDLLQK--QRDYLTARIQRLEHQLQLLQEA--INSK---RLTLSEKTVQEAQS 267


                   ...
gi 1907081939  867 LQK 869
Cdd:PRK11281   268 QDE 270

PH_ACAP

cd13250

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP ...

311-399

3.81e-03

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP (also called centaurin beta) functions both as a Rab35 effector and as an Arf6-GTPase-activating protein (GAP) by which it controls actin remodeling and membrane trafficking. ACAP contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain, a phospholipid-binding domain, a PH domain, a GAP domain, and four ankyrin repeats. The AZAPs constitute a family of Arf GAPs that are characterized by an NH2-terminal pleckstrin homology (PH) domain and a central Arf GAP domain followed by two or more ankyrin repeats. On the basis of sequence and domain organization, the AZAP family is further subdivided into four subfamilies: 1) the ACAPs contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain (a phospholipid-binding domain that is thought to sense membrane curvature), a single PH domain followed by the GAP domain, and four ankyrin repeats; 2) the ASAPs also contain an NH2-terminal BAR domain, the tandem PH domain/GAP domain, three ankyrin repeats, two proline-rich regions, and a COOH-terminal Src homology 3 domain; 3) the AGAPs contain an NH2-terminal GTPase-like domain (GLD), a split PH domain, and the GAP domain followed by four ankyrin repeats; and 4) the ARAPs contain both an Arf GAP domain and a Rho GAP domain, as well as an NH2-terminal sterile-a motif (SAM), a proline-rich region, a GTPase-binding domain, and five PH domains. PMID 18003747 and 19055940 Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270070 Cd Length: 98 Bit Score: 38.74 E-value: 3.81e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  311 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLdgEINLSTCYDVTEYPVQRNYGFQIHTKEGEFTLSAMT 388
Cdd:cd13250      1 KEGYLFKRSSNafKTWKRRWFSLQNGQLYYQKRDKKDEPTVM--VEDLRLCTVKPTEDSDRRFCFEVISPTKSYMLQAES 78

                           90
                   ....*....|.
gi 1907081939  389 SGIRRNWIQTI 399
Cdd:cd13250     79 EEDRQAWIQAI 89

PRK09039

peptidoglycan -binding protein;

701-843

3.96e-03

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 41.88 E-value: 3.96e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907081939  701 EIQTLQTRLGNAAAELAIKEQALA---KLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELr 777
Cdd:PRK09039    47 EISGKDSALDRLNSQIAELADLLSlerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL- 125

                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907081939  778 DLETQQALQRDRQkeVQRLQECIAELSQQLGTSEQAQRLMEKKlkrnytlllescEQEKQALLQNL 843
Cdd:PRK09039   126 DSEKQVSARALAQ--VELLNQQIAALRRQLAALEAALDASEKR------------DRESQAKIADL 177

SMC_prok_B