NCBI Conserved Domain Search

Conserved domains on [gi|1039737300|ref|XP_017170057|]

View

myosin phosphatase Rho-interacting protein isoform X1 [Mus musculus]

Protein Classification

PH_RIP and PH_M-RIP domain-containing protein( domain architecture ID 12913567)

protein containing domains PH_RIP, PH_M-RIP, Smc, and SMC_prok_B

Graphical summary

Zoom to residue level

show extra options »

Show site features Horizontal zoom: ×

List of domain hits

Name

Accession

Description

Interval

E-value

PH_RIP

cd01236

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...

16-151

1.34e-79

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen for proteins that bind to wild-type RhoA. RIP2, RIP3, and RIP4 were isolated from cDNA libraries with constitutively active V14RhoA (containing the C190R mutation). RIP2 represents a novel GDP/GTP exchange factor (RhoGEF), while RIP3 (p116Rip) and RIP4 are thought to be structural proteins. RhoGEF contains a Dbl(DH)/PH region, a a zinc finger motif, a leucine-rich domain, and a coiled-coil region. The last 2 domains are thought to be involved in mediating protein-protein interactions. RIP3 is a negative regulator of RhoA signaling that inhibits, either directly or indirectly, RhoA-stimulated actomyosin contractility. In plants RIP3 is localized at microtubules and interacts with the kinesin-13 family member AtKinesin-13A, suggesting a role for RIP3 in microtubule reorganization and a possible function in Rho proteins of plants (ROP)-regulated polar growth. It has a PH domain, two proline-rich regions which are putative binding sites for SH3 domains, and a COOH-terminal coiled-coil region which overlaps with the RhoA-binding region. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269942 Cd Length: 136 Bit Score: 258.52 E-value: 1.34e-79

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80

                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136

PH_M-RIP

cd13275

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...

507-608

5.60e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270094 Cd Length: 104 Bit Score: 164.04 E-value: 5.60e-47

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80

                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104

Smc super family

cl34174

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

796-1110

6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.97 E-value: 6.01e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501

Smc super family

cl34174

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

2053-2318

3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.65 E-value: 3.08e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

SMC_prok_B super family

cl37069

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1758-2360

1.48e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]

The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.91 E-value: 1.48e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902


                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905

Name

Accession

Description

Interval

E-value

PH_RIP

cd01236

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...

16-151

1.34e-79

Pssm-ID: 269942 Cd Length: 136 Bit Score: 258.52 E-value: 1.34e-79

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80

                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136

PH_M-RIP

cd13275

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...

507-608

5.60e-47

Pssm-ID: 270094 Cd Length: 104 Bit Score: 164.04 E-value: 5.60e-47

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80

                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

507-599

4.02e-16

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 76.05 E-value: 4.02e-16

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 580
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82

                            90
                    ....*....|....*....
gi 1039737300   581 SAMTSGIRRNWIQTIMKHV 599
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101

pfam00169

PH domain; PH stands for pleckstrin homology.

507-595

2.89e-15

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 73.75 E-value: 2.89e-15

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 577
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82

                           90
                   ....*....|....*...
gi 1039737300  578 FTLSAMTSGIRRNWIQTI 595
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

796-1110

6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.97 E-value: 6.01e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

784-1065

4.89e-11

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.54 E-value: 4.89e-11

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  784 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 863
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  864 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 943
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1020 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

848-1213

1.32e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 67.01 E-value: 1.32e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  848 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 927
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 990
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 1063
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1064 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1123
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1124 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1187
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1039737300 1188 NEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

2053-2318

3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.65 E-value: 3.08e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

44-145

1.68e-08

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 54.09 E-value: 1.68e-08

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300    44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCT-DVVDGEARTGQKFSLCI 121
Cdd:smart00233    1 VIKEGWLYKKSGG--------GKKSWKKRYFVLFNSTLLYYKSKKDKKSYkPKGSIDLSGCTvREAPDPDSSKKPHCFEI 72

                            90       100
                    ....*....|....*....|....*
gi 1039737300   122 LTPDKE-HFIRAETKEIISGWLEML 145
Cdd:smart00233   73 KTSDRKtLLLQAESEEEREKWVEAL 97

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

744-1239

2.17e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 59.80 E-value: 2.17e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  744 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 812
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  813 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 876
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  877 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 956
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  957 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 1036
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1037 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1106 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1184
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470

                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 1185 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1239
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

2064-2256

6.31e-08

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.53 E-value: 6.31e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039737300 2219 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2256
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

pfam00169

PH domain; PH stands for pleckstrin homology.

44-145

4.37e-07

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 50.25 E-value: 4.37e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCTDV-VDGEARTGQKFSLCI 121
Cdd:pfam00169    1 VVKEGWLLKKGGG--------KKKSWKKRYFVLFDGSLLYYKDDKSGKSKePKGSISLSGCEVVeVVASDSPKRKFCFEL 72

                           90       100
                   ....*....|....*....|....*...
gi 1039737300  122 LTPD----KEHFIRAETKEIISGWLEML 145
Cdd:pfam00169   73 RTGErtgkRTYLLQAESEEERKDWIKAI 100

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1758-2360

1.48e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.91 E-value: 1.48e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902


                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1897-2366

8.51e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 51.27 E-value: 8.51e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1897 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1968
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1969 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 2048
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2049 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2105
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2106 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2174
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2175 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2248
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2249 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2312
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662

                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2313 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2366
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

PRK03918

DNA double-strand break repair ATPase Rad50;

1883-2364

4.69e-05

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 48.91 E-value: 4.69e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1883 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1960
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1961 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 2038
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2039 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2118
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2119 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2185
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2186 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2265
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2266 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2331
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612

                          490       500       510
                   ....*....|....*....|....*....|...
gi 1039737300 2332 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2364
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1758-2235

3.11e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 3.11e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1837
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1838 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1917
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1918 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1997
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1998 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 2077
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2078 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2155
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2156 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2230
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1039737300 2231 QALRQ 2235
Cdd:COG4717    507 EEYRE 511

PRK03918

DNA double-strand break repair ATPase Rad50;

1717-2352

4.60e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.83 E-value: 4.60e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1717 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1796
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1797 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1875
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1876 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1955
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1956 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 2035
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2036 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2113
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2114 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2191
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2192 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2268
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2269 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2347
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726


                   ....*
gi 1039737300 2348 KEQLK 2352
Cdd:PRK03918   727 REKVK 731

TOPEUc

smart00435

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...

2049-2145

8.27e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras

Pssm-ID: 214661 [Multi-domain] Cd Length: 391 Bit Score: 41.18 E-value: 8.27e-03

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  2049 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2125
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346

                            90       100
                    ....*....|....*....|
gi 1039737300  2126 DLQRQHQReLEKLREEKDRL 2145
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365

Name

Accession

Description

Interval

E-value

PH_RIP

cd01236

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...

16-151

1.34e-79

Pssm-ID: 269942 Cd Length: 136 Bit Score: 258.52 E-value: 1.34e-79

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80

                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136

PH_M-RIP

cd13275

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...

507-608

5.60e-47

Pssm-ID: 270094 Cd Length: 104 Bit Score: 164.04 E-value: 5.60e-47

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80

                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

507-599

4.02e-16

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 76.05 E-value: 4.02e-16

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 580
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82

                            90
                    ....*....|....*....
gi 1039737300   581 SAMTSGIRRNWIQTIMKHV 599
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101

pfam00169

PH domain; PH stands for pleckstrin homology.

507-595

2.89e-15

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 73.75 E-value: 2.89e-15

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 577
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82

                           90
                   ....*....|....*...
gi 1039737300  578 FTLSAMTSGIRRNWIQTI 595
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

796-1110

6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.97 E-value: 6.01e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496


                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

850-1209

9.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 74.20 E-value: 9.17e-13

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNqDLQSELEAQcrRQELITQQIQTLKhsYGEAKDAIRHHEAEIQTLQTRlgNAAAELAIKEQALAKLKGELKME 929
Cdd:COG1196    186 ENLERLE-DILGELERQ--LEPLERQAEKAER--YRELKEELKELEAELLLLKLR--ELEAELEELEAELEELEAELEEL 258

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  930 QGKVREQLEEWQHSKAmlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV----QRLQECIAELSQQLGT 1005
Cdd:COG1196    259 EAELAELEAELEELRL----ELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELeerlEELEEELAELEEELEE 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSEtckgseqvhklEEEL 1085
Cdd:COG1196    335 LEEELEELEEELEEA--------EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEA-----------LRAA 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1086 EAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREK 1165
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1039737300 1166 EEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1209
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519

cd00821

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...

507-595

2.76e-12

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275388 [Multi-domain] Cd Length: 92 Bit Score: 64.87 E-value: 2.76e-12

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ--YEDGQWKKHWFVLADQSLRYYRDSvAEEAADLDGEINLSTCYDVTEY-PVQRNYGFQIHTKEGE-FTLSA 582
Cdd:cd00821      1 KEGYLLKRggGGLKSWKKRWFVLFEGVLLYYKSK-KDSSYKPKGSIPLSGILEVEEVsPKERPHCFELVTPDGRtYYLQA 79

                           90
                   ....*....|...
gi 1039737300  583 MTSGIRRNWIQTI 595
Cdd:cd00821     80 DSEEERQEWLKAL 92

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

784-1065

4.89e-11

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.54 E-value: 4.89e-11

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  784 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 863
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  864 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 943
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1020 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492

PRK03918

DNA double-strand break repair ATPase Rad50;

848-1213

1.32e-10

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 67.01 E-value: 1.32e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  848 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 927
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 990
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 1063
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1064 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1123
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1124 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1187
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544

                          410       420
                   ....*....|....*....|....*.
gi 1039737300 1188 NEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

850-1245

2.47e-10

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 66.23 E-value: 2.47e-10

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQ------SELEAQCRRqeLITQQIQTLKhsYGEAKDAIRHHEAEIQTLQtrlgnaaaelaiKEQALAKLK 923
Cdd:TIGR02168  179 RKLERTRENLDrledilNELERQLKS--LERQAEKAER--YKELKAELRELELALLVLR------------LEELREELE 242

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  924 gELKMEQGKVREQLEEwqhskamLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQL 1003
Cdd:TIGR02168  243 -ELQEELKEAEEELEE-------LTAELQELEEKLEELRLEVSELEEEIEEL----------QKELYALANEISRLEQQK 304

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1004 GTSEQAQRLMEKKLKRnYTLLLESCEQEKQALLQNLKEVEDKasayEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEE 1083
Cdd:TIGR02168  305 QILRERLANLERQLEE-LEAQLEELESKLDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEE 379

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1084 EleareasIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEklrgkevdyQNLEHSHHRVSVQLQSVRTLLR 1163
Cdd:TIGR02168  380 Q-------LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ---------EIEELLKKLEEAELKELQAELE 443

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1164 EKEEELKHIKETHERV---LEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFvSDSPKDAKEPLsttEPTEEGS 1240
Cdd:TIGR02168  444 ELEEELEELQEELERLeeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENL-EGFSEGVKALL---KNQSGLS 519


                   ....*
gi 1039737300 1241 GILPL 1245
Cdd:TIGR02168  520 GILGV 524

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

915-1213

1.05e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 64.31 E-value: 1.05e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  915 KEQALAKLKGELKMEQGKVREQLEEwqhsKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQqaLQRDRQkEVQRLQE 994
Cdd:TIGR02168  675 RRREIEELEEKIEELEEKIAELEKA----LAELRKELEELEEELEQLRKELEELSRQISALRKD--LARLEA-EVEQLEE 747

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  995 CIAELSQQLgtSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEkLSETckg 1074
Cdd:TIGR02168  748 RIAQLSKEL--TELEAEIEELEER------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE-LTLL--- 815

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1075 SEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQ 1154
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 1155 LQSVRTLLREKE----------EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR02168  896 LEELSEELRELEskrselrrelEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIE 964

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

798-1007

1.18e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 64.17 E-value: 1.18e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  798 KELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPsgAWQRLHRVN------QDLQSELEAQCRRQE 871
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALR--LWFAQRRLElleaelEELRAELARLEAELE 312

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  872 LITQQIQTLKHSYGEAKDAIRHH--------EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHS 943
Cdd:COG4913    313 RLEARLDALREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAAL 392

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV-QRLQECIAELSQQLGTSE 1007
Cdd:COG4913    393 LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIpARLLALRDALAEALGLDE 457

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

895-1203

1.24e-09

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.92 E-value: 1.24e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  895 EAEIQTLQTRLGNAAAELAIKEQALAKLKGE---LKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE 971
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  972 LRDLETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYE 1050
Cdd:TIGR02168  756 LTELEAEiEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA-----LDELRAELTLLNEEAANLRERLESLE 830

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1051 DQLQGHVQQVEALQKEKLSEtckgSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1130
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEEL----SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL----KHIKETHERVLEKKDQDLNEALVKMIALGSSLEE 1203
Cdd:TIGR02168  907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeySLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

932-1216

2.62e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 63.03 E-value: 2.62e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLsgQLRASEQKLRSTEARLLEKTQELRDLETQQALqrdRQKEVQRLQECIAELSQQLGTSEQAQR 1011
Cdd:COG1196    217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAE---LEAELEELRLELEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1012 LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREAS 1091
Cdd:COG1196    292 ELLAELAR--------LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELE----EELEEAEEELEEAEAE 359

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1092 IRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKH 1171
Cdd:COG1196    360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1039737300 1172 IKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLR 1216
Cdd:COG1196    440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484

cd00821

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...

48-145

2.74e-09

Pssm-ID: 275388 [Multi-domain] Cd Length: 92 Bit Score: 56.01 E-value: 2.74e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTdfdnpvhrSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTINMNQCTDVVDGEaRTGQKFSLCILTPDKE 127
Cdd:cd00821      3 GYLLKRGGGG--------LKSWKKRWFVLFEGVLLYYKSKKDSSYKPKGSIPLSGILEVEEVS-PKERPHCFELVTPDGR 73

                           90
                   ....*....|....*....
gi 1039737300  128 HF-IRAETKEIISGWLEML 145
Cdd:cd00821     74 TYyLQADSEEERQEWLKAL 92

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

2053-2318

3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 62.65 E-value: 3.08e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458

                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

850-1211

5.63e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 62.01 E-value: 5.63e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKme 929
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK-- 761

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  930 qgKVREQLEEWQHSKAMLSGQLRASEQKLRstEARLLEKTQELRDLEtqqalqrdrqKEVQRLQECIAELSQQLGTSEQA 1009
Cdd:TIGR02169  762 --ELEARIEELEEDLHKLEEALNDLEARLS--HSRIPEIQAELSKLE----------EEVSRIEARLREIEQKLNRLTLE 827

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1010 QRLMEKKLkrnytlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEARE 1089
Cdd:TIGR02169  828 KEYLEKEI------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1090 ASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE-E 1168
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPvN 974

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1169 LKHIKEtHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEK 1211
Cdd:TIGR02169  975 MLAIQE-YEEVLKRLD-ELKEKRAKLEEERKAILERIEEYEKK 1015

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

934-1238

7.90e-09

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 61.24 E-value: 7.90e-09

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  934 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQalqRDRQKEVQRLQEciaelsqqlgtSEQAQRLM 1013
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKI---GEIEKEIEQLEQ-----------EEEKLKER 738

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1014 EKKLKRNytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL-QKEKLSETCKGSEQVHKLEEELEAREASI 1092
Cdd:TIGR02169  739 LEELEED----LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeARLSHSRIPEIQAELSKLEEEVSRIEARL 814

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1093 RQLAQHVQSLHDERDLIKHQFQELMERVatsdgDVAELQEKLRGKEVDyqNLEHSHHRVSVQLQSVRTLLREKEEELKHI 1172
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQR-----IDLKEQIKSIEKEIE--NLNGKKEELEEELEELEAALRDLESRLGDL 887

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1173 K------ETHERVLEKKDQDLNEALVKmiaLGSSLEETEIKLQEKEECLRRFvsDSPKDAKEPLSTTEPTEE 1238
Cdd:TIGR02169  888 KkerdelEAQLRELERKIEELEAQIEK---KRKRLSELKAKLEALEEELSEI--EDPKGEDEEIPEEELSLE 954

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

850-1066

1.23e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 59.78 E-value: 1.23e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDL---QSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGEl 926
Cdd:COG4942     27 AELEQLQQEIaelEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE- 105

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  927 kmeqgkvreqleewqhskamLSGQLRASEQKLRSTEARLL----EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQ 1002
Cdd:COG4942    106 --------------------LAELLRALYRLGRQPPLALLlspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAAL 165

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1003 LGTSEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE 1066
Cdd:COG4942    166 RAELEAERAELEALLAEL--------EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

smart00233

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...

44-145

1.68e-08

Pssm-ID: 214574 [Multi-domain] Cd Length: 102 Bit Score: 54.09 E-value: 1.68e-08

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300    44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCT-DVVDGEARTGQKFSLCI 121
Cdd:smart00233    1 VIKEGWLYKKSGG--------GKKSWKKRYFVLFNSTLLYYKSKKDKKSYkPKGSIDLSGCTvREAPDPDSSKKPHCFEI 72

                            90       100
                    ....*....|....*....|....*
gi 1039737300   122 LTPDKE-HFIRAETKEIISGWLEML 145
Cdd:smart00233   73 KTSDRKtLLLQAESEEEREKWVEAL 97

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

744-1239

2.17e-08

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 59.80 E-value: 2.17e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  744 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 812
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  813 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 876
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  877 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 956
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  957 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 1036
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1037 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1106 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1184
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470

                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 1185 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1239
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

896-1067

2.87e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 59.01 E-value: 2.87e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  896 AEIQTLQTRLGNAAAELAIKEQALAKLKgELKMEQGKVREQLEEWQHSKAMLSGQLRASE--QKLRSTEARLLEKTQELR 973
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELE-ELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLE 149

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQL 1053
Cdd:COG4717    150 ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229

                          170
                   ....*....|....
gi 1039737300 1054 QGHVQQVEALQKEK 1067
Cdd:COG4717    230 EQLENELEAAALEE 243

PH2_MyoX

cd13296

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...

507-607

4.29e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270108 Cd Length: 103 Bit Score: 53.24 E-value: 4.29e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDG------QWKKHWFVLADQSLRYYRDsvAEEAADLDGEINLSTCYDVTEYPVQRNyGFQIHTKEGEFTL 580
Cdd:cd13296      1 KSGWLTKKGGGSstlsrrNWKSRWFVLRDTVLKYYEN--DQEGEKLLGTIDIRSAKEIVDNDPKEN-RLSITTEERTYHL 77

                           90       100
                   ....*....|....*....|....*..
gi 1039737300  581 SAMTSGIRRNWIQtIMKHVLPASAPDV 607
Cdd:cd13296     78 VAESPEDASQWVN-VLTRVISATDLEL 103

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

952-1213

4.45e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 58.79 E-value: 4.45e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  952 RASEQKLRSTEARLL-------EKTQELRDLETQ-------QALQ---RDRQKEVQRLQecIAELSQQLGTSEQAQRLME 1014
Cdd:COG1196    175 EEAERKLEATEENLErledilgELERQLEPLERQaekaeryRELKeelKELEAELLLLK--LRELEAELEELEAELEELE 252

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1015 KKLKRnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREASIRQ 1094
Cdd:COG1196    253 AELEE-LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEELAE 327

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1095 LAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEvdyQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE---AELAEAEEELEELAEELLEALRAAAELAAQLEE 404

                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1039737300 1175 ThERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:COG1196    405 L-EEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

2064-2256

6.31e-08

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 58.53 E-value: 6.31e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455

                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039737300 2219 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2256
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1879-2240

8.52e-08

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 58.16 E-value: 8.52e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1879 QAVGALKEEYEELLhkqksEYQKVITliEKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSH 1958
Cdd:TIGR02169  198 QQLERLRREREKAE-----RYQALLK--EKREYEGYELLKEKEALERQKEAIERQLASL--------EEELEKLTEEISE 262

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1959 TENTLqAERSRVLSQLDASVKDRQAMEQHHVQ--------QMKMLEDRFQLKVRELQ------AVHQEELRALQEHyIWS 2024
Cdd:TIGR02169  263 LEKRL-EEIEQLLEELNKKIKDLGEEEQLRVKekigeleaEIASLERSIAEKERELEdaeerlAKLEAEIDKLLAE-IEE 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2025 LRGALSLYQpshpdsslapgpSEPRAVPAA-KDEAESMSGLRERIQELEAQMGVMREElgHKELegdvaalqekyqrdfe 2103
Cdd:TIGR02169  341 LEREIEEER------------KRRDKLTEEyAELKEELEDLRAELEEVDKEFAETRDE--LKDY---------------- 390

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2104 slkatcergfaameethQKKIEDLQRQH---QRELEKLREEKDRLLAE--ETAATISAIEAMKNAHREEME--RELEKSQ 2176
Cdd:TIGR02169  391 -----------------REKLEKLKREInelKRELDRLQEELQRLSEElaDLNAAIAGIEAKINELEEEKEdkALEIKKQ 453

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 2177 RSQISSINSDIEALRRQYL---EELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQREN 2240
Cdd:TIGR02169  454 EWKLEQLAADLSKYEQELYdlkEEYDRVEKELSKLQRELAE-----------AEAQARASEERVRGG 509

PH2_MyoX

cd13296

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...

48-145

8.66e-08

Pssm-ID: 270108 Cd Length: 103 Bit Score: 52.08 E-value: 8.66e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTdfdNPVHRsRKWQRRFFILYEHGLLRYALDEmPTTLPQGTINMNQCTDVVDgeaRTGQKFSLCILTPDKE 127
Cdd:cd13296      3 GWLTKKGGGS---STLSR-RNWKSRWFVLRDTVLKYYENDQ-EGEKLLGTIDIRSAKEIVD---NDPKENRLSITTEERT 74

                           90
                   ....*....|....*...
gi 1039737300  128 HFIRAETKEIISGWLEML 145
Cdd:cd13296     75 YHLVAESPEDASQWVNVL 92

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

875-1065

9.38e-08

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 58.00 E-value: 9.38e-08

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  875 QQIQTLKHSYGEAKDAIRHHEA--EIQTLQTRLGNAAAELAIKEQALAKLK--------GELKMEQGKVREQLEEWQHSK 944
Cdd:COG4913    232 EHFDDLERAHEALEDAREQIELlePIRELAERYAAARERLAELEYLRAALRlwfaqrrlELLEAELEELRAELARLEAEL 311

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  945 AMLSGQLRASEQKLRSTEARLLE-KTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL 1023
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGnGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAA 391

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039737300 1024 LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:COG4913    392 LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2070-2364

1.08e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 57.77 E-value: 1.08e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2070 ELEAQMGVMREELGhkELEGDVAALQEKyqrdFESLKATCERGFAAMEETHqKKIEDLQRQHQRELEKLREEKDRLlaEE 2149
Cdd:TIGR02169  671 SEPAELQRLRERLE--GLKRELSSLQSE----LRRIENRLDELSQELSDAS-RKIGEIEKEIEQLEQEEEKLKERL--EE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2150 TAATISAIEAMKNAHREEMErELEK---SQRSQISSINSDIEALRRQYLEE-LQSVQRELEVLSEQYSQKCLENAHLAQA 2225
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELK-ELEArieELEEDLHKLEEALNDLEARLSHSrIPEIQAELSKLEEEVSRIEARLREIEQK 820

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2226 LEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEStglpltqgkDAYELEVLLRVKESEIQYLKQEI 2305
Cdd:TIGR02169  821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE---------ELEELEAALRDLESRLGDLKKER 891

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2306 SSLKDELQTALRDKKYASDKYKDIYTELSIAKAKAdcdiSRLKEQLKAATEALGEKSPE 2364
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKL----EALEEELSEIEDPKGEDEEI 946

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

2052-2318

1.16e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 57.62 E-value: 1.16e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2052 PAAKDEAESMSGLRERIQELEAQMGVMREELGHkeLEgDVAALQEKYQRDFESLKA--TCERGFAAmeETHQKKIEDLQR 2129
Cdd:COG4913    221 PDTFEAADALVEHFDDLERAHEALEDAREQIEL--LE-PIRELAERYAAARERLAEleYLRAALRL--WFAQRRLELLEA 295

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2130 qhqrELEKLREEKDRLLAEETAATISAIEAmkNAHREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLS 2209
Cdd:COG4913    296 ----ELEELRAELARLEAELERLEARLDAL--REELDELEAQIRGNGGDRLEQLEREIERLER----ELEERERRRARLE 365

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2210 EQysqkcLENAHLAQALEAE--RQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqgkdayEL 2287
Cdd:COG4913    366 AL-----LAALGLPLPASAEefAALRAEAAALLEALEEELEALEEALAEAEAALR-----------------------DL 417

                          250       260       270
                   ....*....|....*....|....*....|.
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG4913    418 RRELRELEAEIASLERRKSNIPARLLALRDA 448

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

768-1173

1.25e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 57.44 E-value: 1.25e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  768 EKQVPIAPLHLSlEDRSER----LSTHELTSLLEK---ELEQSQKEASDLLEQNRLLQDQ-LRVALGREQSAREGYVLQT 839
Cdd:pfam15921  348 EKQLVLANSELT-EARTERdqfsQESGNLDDQLQKllaDLHKREKELSLEKEQNKRLWDRdTGNSITIDHLRRELDDRNM 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  840 EVatspsgawQRLHRVNQDLQSELEAQCRRQ--------------ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRL 905
Cdd:pfam15921  427 EV--------QRLEALLKAMKSECQGQMERQmaaiqgkneslekvSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTV 498

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  906 GNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHskamlsgqLRASEQKLRSTEArllektqELRDLETQQAlQRDR 985
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQH--------LKNEGDHLRNVQT-------ECEALKLQMA-EKDK 562

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  986 QKEVQRLQ-ECIAELSQQLGTSEQAQRLMEKKLKRNYtlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEaLQ 1064
Cdd:pfam15921  563 VIEILRQQiENMTQLVGQHGRTAGAMQVEKAQLEKEI--------NDRRLELQEFKILKDKKDAKIRELEARVSDLE-LE 633

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1065 KEKLSETckGSEQVhkleeeleareasirqlaQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQN- 1143
Cdd:pfam15921  634 KVKLVNA--GSERL------------------RAVKDIKQERD-------QLLNEVKTSRNELNSLSEDYEVLKRNFRNk 686

                          410       420       430
                   ....*....|....*....|....*....|...
gi 1039737300 1144 ---LEHSHHRVSVQLQSVRTLLREKEEELKHIK 1173
Cdd:pfam15921  687 seeMETTTNKLKMQLKSAQSELEQTRNTLKSME 719

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

751-996

1.63e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 57.00 E-value: 1.63e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  751 VEIEQRWHQVET--TPLREEKQVPIAPLHLSLEdrSERLSTHELTSLLEKELEQSQKE-ASDLLEQNRLLQD--QLRVAL 825
Cdd:TIGR02169  268 EEIEQLLEELNKkiKDLGEEEQLRVKEKIGELE--AEIASLERSIAEKERELEDAEERlAKLEAEIDKLLAEieELEREI 345

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  826 GREQSAREGyvLQTEVATSPsgawQRLHRVNQDLQS-ELEAQCRRQEL--ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQ 902
Cdd:TIGR02169  346 EEERKRRDK--LTEEYAELK----EELEDLRAELEEvDKEFAETRDELkdYREKLEKLKREINELKRELDRLQEELQRLS 419

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  903 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG---QLRASEQKLRSTEARLLEKTQELRDLETQQ 979
Cdd:TIGR02169  420 EELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKyeqELYDLKEEYDRVEKELSKLQRELAEAEAQA 499

                          250
                   ....*....|....*..
gi 1039737300  980 ALQRDRQKEVQRLQECI 996
Cdd:TIGR02169  500 RASEERVRGGRAVEEVL 516

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

1946-2243

2.25e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 56.61 E-value: 2.25e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1946 EEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLE--DRFQLKVRELQAVHQEELRALQehyiw 2023
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEqlEQEEEKLKERLEELEEDLSSLE----- 750

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2024 slrgalslyqpshpdsslapgpsepRAVPAAKDEaesMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKyQRDFE 2103
Cdd:TIGR02169  751 -------------------------QEIENVKSE---LKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI-QAELS 801

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2104 SLKATCERGFAAMEETHQKkiedLQRQHQRE--LEKLREEK--DRLLAEETAATISAIEAMKNAHREEMERELEKSQRS- 2178
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQK----LNRLTLEKeyLEKEIQELqeQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAl 877

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 2179 -QISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQEL 2243
Cdd:TIGR02169  878 rDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943

PH-GRAM1_AGT26

cd13215

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, ...

506-599

2.86e-07

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, repeat 1; ATG26 (also called UGT51/UDP-glycosyltransferase 51), a member of the glycosyltransferase 28 family, resulting in the biosynthesis of sterol glucoside. ATG26 in decane metabolism and autophagy. There are 32 known autophagy-related (ATG) proteins, 17 are components of the core autophagic machinery essential for all autophagy-related pathways and 15 are the additional components required only for certain pathways or species. The core autophagic machinery includes 1) the ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and ATG27), 2) the phosphatidylinositol 3-kinase complex (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7, ATG8, ATG10, ATG12, and ATG16). Less is known about how the core machinery is adapted or modulated with additional components to accommodate the nonselective sequestration of bulk cytosol (autophagosome formation) or selective sequestration of specific cargos (Cvt vesicle, pexophagosome, or bacteria-containing autophagosome formation). The pexophagosome-specific additions include the ATG30-ATG11-ATG17 receptor-adaptors complex, the coiled-coil protein ATG25, and the sterol glucosyltransferase ATG26. ATG26 is necessary for the degradation of medium peroxisomes. It contains 2 GRAM domains and a single PH domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 275402 Cd Length: 116 Bit Score: 51.08 E-value: 2.86e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSvaeeaADL---DGEINLSTCY--DVTEYPVQRNYGFQIHTKEGEFT 579
Cdd:cd13215     22 IKSGYLSKRsKRTLRYTRYWFVLKGDTLSWYNSS-----TDLyfpAGTIDLRYATsiELSKSNGEATTSFKIVTNSRTYK 96

                           90       100
                   ....*....|....*....|
gi 1039737300  580 LSAMTSGIRRNWIQTIMKHV 599
Cdd:cd13215     97 FKADSETSADEWVKALKKQI 116

PH_AtPH1

cd13276

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...

507-603

3.24e-07

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270095 Cd Length: 106 Bit Score: 50.78 E-value: 3.24e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVT--EYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13276      1 KAGWLEKQGEFiKTWRRRWFVLKQGKLFWFKEPDVTPYSKPRGVIDLSKCLTVKsaEDATNKENAFELSTPEETFYFIAD 80

                           90       100
                   ....*....|....*....|
gi 1039737300  584 TSGIRRNWIQTIMKHVLPAS 603
Cdd:cd13276     81 NEKEKEEWIGAIGRAIVKHS 100

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

2062-2360

3.32e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.10 E-value: 3.32e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2062 SGLRERIQELEAQMGVMREELG-----HKELEGDVAALQE---------KYQRDFESLKAtcergfaameETHQKKIEDL 2127
Cdd:COG1196    168 SKYKERKEEAERKLEATEENLErlediLGELERQLEPLERqaekaeryrELKEELKELEA----------ELLLLKLREL 237

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEMERELEKSQR-----SQISSINSDIEAL---RRQYLEE 2197
Cdd:COG1196    238 EAELEELEAELEELEAELeeLEAELAELEAELEELRLELEELELELEEAQAEeyellAELARLEQDIARLeerRRELEER 317

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2198 LQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDgggestglp 2277
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL--------- 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:COG1196    389 LEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468


                   ...
gi 1039737300 2358 LGE 2360
Cdd:COG1196    469 LEE 471

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1987-2313

3.58e-07

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 55.83 E-value: 3.58e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 HHVQQMKMLEDRFQLKVRELQAVhQEELRALQEhyiwSLRGALSLYQPSHPDSSLAPGPSEpRAVPAAKDEAESMSGLRE 2066
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAEL-RKELEELEE----ELEQLRKELEELSRQISALRKDLA-RLEAEVEQLEERIAQLSK 754

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELGH-----KELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQRELEKLREE 2141
Cdd:TIGR02168  755 ELTELEAEIEELEERLEEaeeelAEAEAEIEELEAQIEQLKEELKALREA------------LDELRAELTLLNEEAANL 822

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2142 KDRLLAEETAAtisaieAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQSvqrELEVLSEQYSQKCLENAH 2221
Cdd:TIGR02168  823 RERLESLERRI------AATERRLEDLEEQIEE-LSEDIESLAAEIEELEEL-IEELES---ELEALLNERASLEEALAL 891

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2222 LAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLltgdgggESTGLPLTQ-----GKDAYE-LEVLLRVKE 2295
Cdd:TIGR02168  892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQErlseeYSLTLEeAEALENKIE 964

                          330
                   ....*....|....*...
gi 1039737300 2296 SEIQYLKQEISSLKDELQ 2313
Cdd:TIGR02168  965 DDEEEARRRLKRLENKIK 982

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

896-1236

3.79e-07

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 55.84 E-value: 3.79e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  896 AEIQTLQTRLGNAAAELAIKEQALAKLKGElkmeqgkvREQLEEWQHskamLSGQLRASEQKLRSTEARLLEKTQE--LR 973
Cdd:TIGR02169  177 EELEEVEENIERLDLIIDEKRQQLERLRRE--------REKAERYQA----LLKEKREYEGYELLKEKEALERQKEaiER 244

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQALQRDRQKEVQRLQECIAELSQQLgtSEQAQRLMEKKLKRNYTLllesceQEKQALLQ-NLKEVEDKASAYEDQ 1052
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLL--EELNKKIKDLGEEEQLRV------KEKIGELEaEIASLERSIAEKERE 316

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1053 LQghvqQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQE 1132
Cdd:TIGR02169  317 LE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1133 KLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETH---ERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1209
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKInelEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

                          330       340
                   ....*....|....*....|....*..
gi 1039737300 1210 EKEECLRRfVSDSPKDAKEPLSTTEPT 1236
Cdd:TIGR02169  473 DLKEEYDR-VEKELSKLQRELAEAEAQ 498

pfam00169

PH domain; PH stands for pleckstrin homology.

44-145

4.37e-07

PH domain; PH stands for pleckstrin homology.

Pssm-ID: 459697 [Multi-domain] Cd Length: 105 Bit Score: 50.25 E-value: 4.37e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCTDV-VDGEARTGQKFSLCI 121
Cdd:pfam00169    1 VVKEGWLLKKGGG--------KKKSWKKRYFVLFDGSLLYYKDDKSGKSKePKGSISLSGCEVVeVVASDSPKRKFCFEL 72

                           90       100
                   ....*....|....*....|....*...
gi 1039737300  122 LTPD----KEHFIRAETKEIISGWLEML 145
Cdd:pfam00169   73 RTGErtgkRTYLLQAESEEERKDWIKAI 100

PH_CNK_mammalian-like

cd01260

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; ...

509-553

5.73e-07

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; CNK family members function as protein scaffolds, regulating the activity and the subcellular localization of RAS activated RAF. There is a single CNK protein present in Drosophila and Caenorhabditis elegans in contrast to mammals which have 3 CNK proteins (CNK1, CNK2, and CNK3). All of the CNK members contain a sterile a motif (SAM), a conserved region in CNK (CRIC) domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with the exception of CNK3, a PH domain. A CNK2 splice variant CNK2A also has a PDZ domain-binding motif at its C terminus and Drosophila CNK (D-CNK) also has a domain known as the Raf-interacting region (RIR) that mediates binding of the Drosophila Raf kinase. This cd contains CNKs from mammals, chickens, amphibians, fish, and crustacea. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269962 Cd Length: 114 Bit Score: 50.10 E-value: 5.73e-07

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300  509 GWLTKQYEDG-----QWKKHWFVLADQSLRYYRDSVAEEAadlDGEINLS 553
Cdd:cd01260     17 GWLWKKKEAKsffgqKWKKYWFVLKGSSLYWYSNQQDEKA---EGFINLP 63

PH1_ARAP

cd13253

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, ...

507-600

5.78e-07

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, repeat 1; ARAP proteins (also called centaurin delta) are phosphatidylinositol 3,4,5-trisphosphate-dependent GTPase-activating proteins that modulate actin cytoskeleton remodeling by regulating ARF and RHO family members. They bind phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3) and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4,5)P2) binding. There are 3 mammalian ARAP proteins: ARAP1, ARAP2, and ARAP3. All ARAP proteins contain a N-terminal SAM (sterile alpha motif) domain, 5 PH domains, an ArfGAP domain, 2 ankyrin domain, A RhoGap domain, and a Ras-associating domain. This hierarchy contains the first PH domain in ARAP. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270073 Cd Length: 94 Bit Score: 49.69 E-value: 5.78e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDGQ---WKKHWFVLADQSLRYYRdsvAEEAADLDGEINLSTcydVTEYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13253      2 KSGYLDKQGGQGNnkgFQKRWVVFDGLSLRYFD---SEKDAYSKRIIPLSA---ISTVRAVGDNKFELVTTNRTFVFRAE 75

                           90
                   ....*....|....*..
gi 1039737300  584 TSGIRRNWIQTIMKHVL 600
Cdd:cd13253     76 SDDERNLWCSTLQAAIS 92

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1907-2292

5.95e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 55.33 E-value: 5.95e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQ 1986
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLE-LEELELELEEAQAEEY 291

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 HHVQQMKMLEDRFQLKVRELQAVHQEELRALQEhyiwslrgalslyqpshpdsslapgpsepravpaakdEAEsmsgLRE 2066
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEERLEELEEE-------------------------------------LAE----LEE 330

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELghKELEGDVAALQEKYQRdfeslkatcergfaamEETHQKKIEDLQRQHQRELEKLREEKDRLL 2146
Cdd:COG1196    331 ELEELEEELEELEEEL--EEAEEELEEAEAELAE----------------AEEALLEAEAELAEAEEELEELAEELLEAL 392

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2147 AEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQAL 2226
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 2227 EAERQALRQCQRENQELNA-----HNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLR 2292
Cdd:COG1196    473 ALLEAALAELLEELAEAAArllllLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA 543

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

796-1168

7.37e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 54.77 E-value: 7.37e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSARegyvLQTEVATSPSG---------AWQRLHRVNQDLQSEL-EA 865
Cdd:COG4717    100 LEEELEELEAELEELREELEKLEKLLQLLPLYQELEA----LEAELAELPERleeleerleELRELEEELEELEAELaEL 175

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  866 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgELKMEQGKVREQLEEWQHSKA 945
Cdd:COG4717    176 QEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL--ENELEAAALEERLKEARLLLL 253

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQL------------------------------RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQEC 995
Cdd:COG4717    254 IAAALLallglggsllsliltiagvlflvlgllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  996 I--AELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQ-----NLKEVEDKASAYED--QLQGHVQQVEALQKE 1066
Cdd:COG4717    334 LspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAeagveDEEELRAALEQAEEyqELKEELEELEEQLEE 413

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1067 KLSETCKGSEQVHKLE--EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMErvatsDGDVAELQEKLRGKEVDYQNL 1144
Cdd:COG4717    414 LLGELEELLEALDEEEleEELEELEEELEELEEELEELREELAELEAELEQLEE-----DGELAELLQELEELKAELREL 488

                          410       420
                   ....*....|....*....|....
gi 1039737300 1145 EHSHHRVSVQLQSVRTLLREKEEE 1168
Cdd:COG4717    489 AEEWAALKLALELLEEAREEYREE 512

Mplasa_alph_rch

TIGR04523

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

796-1213

7.64e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.64 E-value: 7.64e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVaLGREQSAREGYVLQTEvatspsgaWQRLhRVNQDLqSELEAQCRRQELITQ 875
Cdd:TIGR04523  150 KEKELEKLNNKYNDLKKQKEELENELNL-LEKEKLNIQKNIDKIK--------NKLL-KLELLL-SNLKKKIQKNKSLES 218

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHskamlsgQLRASE 955
Cdd:TIGR04523  219 QISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLN-----------QLKDEQNKIKKQLSEKQK-------ELEQNN 280

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRSTEARLLEKTQELRDL--ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQA-QRLMEK--KLKRNytllLESCEQ 1030
Cdd:TIGR04523  281 KKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIiSQLNEQisQLKKE----LTNSES 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1031 EKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL--QKEKLSETCKGSEQVHKleeeleareasirQLAQHVQSLHDERDL 1108
Cdd:TIGR04523  357 ENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLesQINDLESKIQNQEKLNQ-------------QKDEQIKKLQQEKEL 423

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1109 IKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNL--------------EHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntresletqlkvlSRSINKIKQNLEQKQKELKSKEKELKKLNE 503

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1175 tHERVLEKKDQDLN----EALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR04523  504 -EKKELEEKVKDLTkkisSLKEKIEKLESEKKEKESKISDLED 545

SMC_N

pfam02463

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...

911-1212

8.25e-07

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.

Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 54.59 E-value: 8.25e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  911 ELAIKEQALAKL------KGELKMEQGKVREQL--EEWQHSKAMLSGQLRASEQKLRSTEARLLEKT---QELRDLETQQ 979
Cdd:pfam02463  167 LKRKKKEALKKLieetenLAELIIDLEELKLQElkLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKlneERIDLLQELL 246

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  980 ALQRDRQKEVQRLQECIAELSQQ----LGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAY 1049
Cdd:pfam02463  247 RDEQEEIESSKQEIEKEEEKLAQvlkeNKEEEKEKKLQEEELKLLAKEEeelkseLLKLERRKVDDEEKLKESEKEKKKA 326

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1050 EDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQ---FQELMERVATSDGD 1126
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAaklKEEELELKSEEEKE 406

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1127 VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEI 1206
Cdd:pfam02463  407 AQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQL 486


                   ....*.
gi 1039737300 1207 KLQEKE 1212
Cdd:pfam02463  487 ELLLSR 492

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

796-1006

8.35e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 54.64 E-value: 8.35e-07

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRL--LQDQLRVALGREQSAREGYV-LQTEVAtspsGAWQRLHRVNQDLQSELEAQcrRQEL 872
Cdd:COG3206    187 LRKELEEAEAALEEFRQKNGLvdLSEEAKLLLQQLSELESQLAeARAELA----EAEARLAALRAQLGSGPDAL--PELL 260

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  873 ITQQIQTLKHSYGEAkdairhhEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLE-EWQHSKAMLSgQL 951
Cdd:COG3206    261 QSPVIQQLRAQLAEL-------EAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEaELEALQAREA-SL 332

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  952 RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKE-VQRLQEciAELSQQLGTS 1006
Cdd:COG3206    333 QAQLAQLEARLAELPELEAELRRLEREVEVARELYESlLQRLEE--ARLAEALTVG 386

Mplasa_alph_rch

TIGR04523

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

875-1205

1.00e-06

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.26 E-value: 1.00e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  875 QQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAI---KEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQL 951
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  952 RASEQKLRSTEA-----RLLEKtqELRDLETQQA-LQRDRQKEVQRLQECIAELSQqlgTSEQAQRLMEKKLKRNYTLll 1025
Cdd:TIGR04523  197 LKLELLLSNLKKkiqknKSLES--QISELKKQNNqLKDNIEKKQQEINEKTTEISN---TQTQLNQLKDEQNKIKKQL-- 269

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 esceQEKQallQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKG--------SEQVHKLEEELEAREASIRQLAQ 1097
Cdd:TIGR04523  270 ----SEKQ---KELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNE 342

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1098 HVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKhIKETHE 1177
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIK-KLQQEK 421

                          330       340
                   ....*....|....*....|....*...
gi 1039737300 1178 RVLEKKDQDLNEALVKMIALGSSLEETE 1205
Cdd:TIGR04523  422 ELLEKEIERLKETIIKNNSEIKDLTNQD 449

PH_PEPP1_2_3

cd13248

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; ...

507-595

1.18e-06

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; PEPP1 (also called PLEKHA4/PH domain-containing family A member 4 and RHOXF1/Rhox homeobox family member 1), and related homologs PEPP2 (also called PLEKHA5/PH domain-containing family A member 5) and PEPP3 (also called PLEKHA6/PH domain-containing family A member 6), have PH domains that interact specifically with PtdIns(3,4)P3. Other proteins that bind PtdIns(3,4)P3 specifically are: TAPP1 (tandem PH-domain-containing protein-1) and TAPP2], PtdIns3P AtPH1, and Ptd- Ins(3,5)P2 (centaurin-beta2). All of these proteins contain at least 5 of the 6 conserved amino acids that make up the putative phosphatidylinositol 3,4,5- trisphosphate-binding motif (PPBM) located at their N-terminus. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270068 Cd Length: 104 Bit Score: 49.19 E-value: 1.18e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTcYDVT----EYPVQRNYGFQIhTKEGEFT- 579
Cdd:cd13248      9 MSGWLHKQGGSGlkNWRKRWFVLKDNCLYYYKD---PEEEKALGSILLPS-YTISpappSDEISRKFAFKA-EHANMRTy 83

                           90
                   ....*....|....*..
gi 1039737300  580 -LSAMTSGIRRNWIQTI 595
Cdd:cd13248     84 yFAADTAEEMEQWMNAM 100

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

752-1217

1.38e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 53.79 E-value: 1.38e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  752 EIEQRWHQVETTPLREEKQvpIAPLHLSLEDRSERL-STHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQS 830
Cdd:COG1196    285 EAQAEEYELLAELARLEQD--IARLEERRRELEERLeELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  831 AREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAA 910
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  911 ELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgQLRASEQKLRSTEARLLE--KTQELRDLETQQALQRDRQKE 988
Cdd:COG1196    443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA-ELLEELAEAAARLLLLLEaeADYEGFLEGVKAALLLAGLRG 521

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  989 VQR---------------LQECIAELSQQLGT------SEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKAS 1047
Cdd:COG1196    522 LAGavavligveaayeaaLEAALAAALQNIVVeddevaAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAV 601

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1048 AYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELME---RVATSD 1124
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAallEAEAEL 681

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1125 GDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEET 1204
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761

                          490
                   ....*....|...
gi 1039737300 1205 EIKLQEKEECLRR 1217
Cdd:COG1196    762 LEELERELERLER 774

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1758-2360

1.48e-06

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.91 E-value: 1.48e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902


                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

908-1138

2.03e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.46 E-value: 2.03e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  908 AAAELAIKEQALAKLKGELKmeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ 986
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIA----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAElAELEKEIA 93

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  987 KEVQRLQECIAELSQQLgtseqaqRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQVEALQ 1064
Cdd:COG4942     94 ELRAELEAQKEELAELL-------RALYRLGRQPPLALLLSPEDFLDAVrrLQYLKYLAPARREQAEELRADLAELAALR 166

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1065 KEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKE 1138
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240

PH2_ADAP

cd01251

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called ...

505-595

2.21e-06

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called centaurin alpha) is a phophatidlyinositide binding protein consisting of an N-terminal ArfGAP domain and two PH domains. In response to growth factor activation, PI3K phosphorylates phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 1 is recruited to the plasma membrane following growth factor stimulation by specific binding of its PH domain to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 2 is constitutively bound to the plasma membrane since it binds phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate with equal affinity. This cd contains the second PH domain repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241282 Cd Length: 105 Bit Score: 48.35 E-value: 2.21e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  505 NFKK-GWLTK----QYEdgQWKKHWFVLADQSLRYYRDSvaeeaadLD----GEINLSTC---YDVTE-----YPVQRNY 567
Cdd:cd01251      1 DFLKeGYLEKtgpkQTD--GFRKRWFTLDDRRLMYFKDP-------LDafpkGEIFIGSKeegYSVREglppgIKGHWGF 71

                           90       100
                   ....*....|....*....|....*...
gi 1039737300  568 GFQIHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd01251     72 GFTLVTPDRTFLLSAETEEERREWITAI 99

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

781-1110

2.94e-06

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.76 E-value: 2.94e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  781 EDRSERLSTheLTSLLEKELEQSQKEASDLLEQNrllqdqlrvALGREQSAREGYVLqtevatspSGAWQRLHRVNQDLQ 860
Cdd:TIGR02169  183 EENIERLDL--IIDEKRQQLERLRREREKAERYQ---------ALLKEKREYEGYEL--------LKEKEALERQKEAIE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  861 SELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQ--------TLQTRLGNAAAELAIKEQALAKLKGELKMEQGK 932
Cdd:TIGR02169  244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgeeeqlRVKEKIGELEAEIASLERSIAEKERELEDAEER 323

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  933 VR---EQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----QALQRDRQKEVQRlQECIAELSQQLG 1004
Cdd:TIGR02169  324 LAkleAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdKEFAETRDELKDY-REKLEKLKREIN 402

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1005 TSEQAQ-RLMEKKLKRNYTLL-----LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKL---SETCKGS 1075
Cdd:TIGR02169  403 ELKRELdRLQEELQRLSEELAdlnaaIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYdlkEEYDRVE 482

                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1039737300 1076 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIK 1110
Cdd:TIGR02169  483 KELSKLQRELAEAEAQARASEERVRGGRAVEEVLK 517

PH_Gab-like

cd13324

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...

45-141

3.25e-06

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270133 Cd Length: 112 Bit Score: 48.18 E-value: 3.25e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   45 IYGGWLLLAPdgtdfdnPVHR--SRKWQRRFFILYEHGL------LRYALDEMPTTlPQGTINMNQCTDVVDGEARTGQK 116
Cdd:cd13324      2 VYEGWLTKSP-------PEKKiwRAAWRRRWFVLRSGRLsggqdvLEYYTDDHCKK-LKGIIDLDQCEQVDAGLTFEKKK 73

                           90       100
                   ....*....|....*....|....*....
gi 1039737300  117 FS----LCILTPDKEHFIRAETKEIISGW 141
Cdd:cd13324     74 FKnqfiFDIRTPKRTYYLVAETEEEMNKW 102

PH_PLEKHD1

cd13281

Pleckstrin homology (PH) domain containing, family D (with coiled-coil domains) member 1 PH ...

64-145

3.62e-06

Pleckstrin homology (PH) domain containing, family D (with coiled-coil domains) member 1 PH domain; Human PLEKHD1 (also called UPF0639, pleckstrin homology domain containing, family D (with M protein repeats) member 1) is a single transcript and contains a single PH domain. PLEKHD1 is conserved in human, chimpanzee, , dog, cow, mouse, chicken, zebrafish, and Caenorhabditis elegans. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270099 Cd Length: 139 Bit Score: 48.47 E-value: 3.62e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   64 HRSRKWQRRFFILYEHGLLRYALDEMPT--------TLPQGTINMNQCTDVVDGEarTGQKFSLCILTPD--KEHFIRAE 133
Cdd:cd13281     25 HQSAKWSKRFFIIKEGFLLYYSESEKKDfektrhfnIHPKGVIPLGGCSIEAVED--PGKPYAISISHSDfkGNIILAAD 102

                           90
                   ....*....|..
gi 1039737300  134 TKEIISGWLEML 145
Cdd:cd13281    103 SEFEQEKWLDML 114

CALCOCO1

pfam07888

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...

795-1042

4.02e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.

Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 51.82 E-value: 4.02e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  795 LLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQ----CRRQ 870
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKykelSASS 110

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  871 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHSKAMLSGQ 950
Cdd:pfam07888  111 EELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELE-----------RMKERAKKAGAQRKEEEAERKQLQAK 179

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  951 LRASEQKLRSTEARLlektQELRDLETQQALQrdrqkeVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL--LLESC 1028
Cdd:pfam07888  180 LQQTEEELRSLSKEF----QELRNSLAQRDTQ------VLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLqeRLNAS 249

                          250
                   ....*....|....
gi 1039737300 1029 EQEKQALLQNLKEV 1042
Cdd:pfam07888  250 ERKVEGLGEELSSM 263

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2091-2400

4.26e-06

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 52.38 E-value: 4.26e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAALQEKYQRDFESLKATCERgfaamEETHQKKIEDLQRQhqreLEKLREEKDRLL--------AEETAATI--SAIEAM 2160
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEEN-----IERLDLIIDEKRQQ----LERLRREREKAEryqallkeKREYEGYEllKEKEAL 235

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2161 K------NAHREEMERELEKSQRsQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKCLENAHLAQA-LEAERQAL 2233
Cdd:TIGR02169  236 ErqkeaiERQLASLEEELEKLTE-EISELEKRLEEIEQ----LLEELNKKIKDLGEEEQLRVKEKIGELEAeIASLERSI 310

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2234 RQCQRENQELNAHNQELN---NRLAAEITRLRTLLtgdgggESTGLPLTQGKDAY-ELEVLLRVKESEIQYLKQEISSLK 2309
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEaeiDKLLAEIEELEREI------EEERKRRDKLTEEYaELKEELEDLRAELEEVDKEFAETR 384

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2310 DELQTALRDKKYASDKYKDIYTELSI---AKAKADCDISRLKEQLKAATEALGEkSPEGTTVSGYDIMKSKSNPDFLKKD 2386
Cdd:TIGR02169  385 DELKDYREKLEKLKREINELKRELDRlqeELQRLSEELADLNAAIAGIEAKINE-LEEEKEDKALEIKKQEWKLEQLAAD 463

                          330
                   ....*....|....
gi 1039737300 2387 RSCVTRQLRNIRSK 2400
Cdd:TIGR02169  464 LSKYEQELYDLKEE 477

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

934-1170

6.17e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 51.84 E-value: 6.17e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  934 REQLEEWQHSKAMLSgQLRASEQKLRSTEARLlEKTQELRD----------LETQQALQRDRQKEVQRLQECIAELSQQL 1003
Cdd:COG4913    241 HEALEDAREQIELLE-PIRELAERYAAARERL-AELEYLRAalrlwfaqrrLELLEAELEELRAELARLEAELERLEARL 318

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1004 GTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQghvqqvealqkeKLSETCKGSEQVHKLEE 1083
Cdd:COG4913    319 DALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLA------------ALGLPLPASAEEFAALR 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1084 eleareasiRQLAQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLR 1163
Cdd:COG4913    387 ---------AEAAALLEALEEELE-------ALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALA 450

                          250
                   ....*....|.
gi 1039737300 1164 E----KEEELK 1170
Cdd:COG4913    451 EalglDEAELP 461

Mplasa_alph_rch

TIGR04523

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

857-1213

6.70e-06

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 51.56 E-value: 6.70e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDA-------IRHHEAEIQTLQTRLGNAAAELAI----KEQALAK-LKG 924
Cdd:TIGR04523  235 EKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQlsekqkeLEQNNKKIKELEKQLNQLKSEISDlnnqKEQDWNKeLKS 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  925 ELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ---KEVQRLQECIA 997
Cdd:TIGR04523  315 ELKNQEKKLEEiqnQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEiEKLKKENQsykQEIKNLESQIN 394

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  998 ELSQQLGTSEQAQRLME---KKLKRNYTLLLESCEQEKQALLQN---LKEVEDKASAYEDQLQGHVQQVEAlQKEKLSEt 1071
Cdd:TIGR04523  395 DLESKIQNQEKLNQQKDeqiKKLQQEKELLEKEIERLKETIIKNnseIKDLTNQDSVKELIIKNLDNTRES-LETQLKV- 472

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1072 ckgseqvhkleeeleaREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1151
Cdd:TIGR04523  473 ----------------LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1152 SVQLQSVRTLLREKEEELKhiKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR04523  537 ESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596

PH_DAPP1

cd10573

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...

504-595

6.93e-06

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269977 [Multi-domain] Cd Length: 96 Bit Score: 46.55 E-value: 6.93e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  504 LNFKKGWLTKQyedGQ----WKKHWFVLADQSLRYYRDSVAEEAADldgEINLSTCYDVTEYPVQ-RNYGFQIHTKEGEF 578
Cdd:cd10573      2 LGSKEGYLTKL---GGivknWKTRWFVLRRNELKYFKTRGDTKPIR---VLDLRECSSVQRDYSQgKVNCFCLVFPERTF 75

                           90
                   ....*....|....*..
gi 1039737300  579 TLSAMTSGIRRNWIQTI 595
Cdd:cd10573     76 YMYANTEEEADEWVKLL 92

PH_Gab2_2

cd13384

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...

45-141

7.35e-06

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241535 Cd Length: 115 Bit Score: 47.05 E-value: 7.35e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   45 IYGGWLLLAPdgtdfdnPVHR--SRKWQRRFFILY-----EHGLLRYALDEMPTTLpQGTINMNQCTDV-----VDGEAR 112
Cdd:cd13384      4 VYEGWLTKSP-------PEKRiwRAKWRRRYFVLRqseipGQYFLEYYTDRTCRKL-KGSIDLDQCEQVdagltFETKNK 75

                           90       100
                   ....*....|....*....|....*....
gi 1039737300  113 TGQKFSLCILTPDKEHFIRAETKEIISGW 141
Cdd:cd13384     76 LKDQHIFDIRTPKRTYYLVADTEDEMNKW 104

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1897-2366

8.51e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 51.27 E-value: 8.51e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1897 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1968
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1969 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 2048
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2049 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2105
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2106 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2174
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2175 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2248
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2249 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2312
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662

                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2313 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2366
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721

PH_TBC1D2A

cd01265

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...

520-598

8.54e-06

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269966 Cd Length: 102 Bit Score: 46.55 E-value: 8.54e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  520 WKKHWFVLADQS--LRYYRDSvaeeaADLD--GEINLSTCydVTEYPVQRNYG-FQIHTKEGEFTLSAMTSGIRRNWIQT 594
Cdd:cd01265     19 WKRRWFVLDESKcqLYYYRSP-----QDATplGSIDLSGA--AFSYDPEAEPGqFEIHTPGRVHILKASTRQAMLYWLQA 91


                   ....
gi 1039737300  595 IMKH 598
Cdd:cd01265     92 LQSK 95

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1875-2308

8.88e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.48 E-value: 8.88e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1875 QKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAEN--KHSESMFALQGRYEEEIRCM 1952
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEleEAEEALLERLERLEEELEEL 426

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1953 VEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKmLEDRFQLKVRELQAVHQEELRALQEHyiWSLRGALSLY 2032
Cdd:COG1196    427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE-LLEEAALLEAALAELLEELAEAAARL--LLLLEAEADY 503

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2033 QPSHPDSSLAPGPSEPRAVPAAKDEAesMSGLRERIQELEAQMGVMREELGHKELEgdVAALQEKYQRDFESLKAT---- 2108
Cdd:COG1196    504 EGFLEGVKAALLLAGLRGLAGAVAVL--IGVEAAYEAALEAALAAALQNIVVEDDE--VAAAAIEYLKAAKAGRATflpl 579

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2109 --CERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMERELEKSQRSQISSI 2183
Cdd:COG1196    580 dkIRARAALAAALARGAIGAAVDLVASDLREADARYYVLgdtLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2184 NSDIEALRRQYLEELQSVQRELEVLSEQ--YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRL 2261
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERlaEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 2262 RTLltgdgggESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSL 2308
Cdd:COG1196    740 ELL-------EEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779

COG1340

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

871-1182

9.95e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];

Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 49.91 E-value: 9.95e-06

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  871 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtrlgNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhskamLSGQ 950
Cdd:COG1340      4 DELSSSLEELEEKIEELREEIEELKEKRDELN----EELKELAEKRDELNAQVKELREEAQELREKRDE-------LNEK 72

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  951 LRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS----EQAQRLMEK--KLKRNYTLL 1024
Cdd:COG1340     73 VKELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEvlspEEEKELVEKikELEKELEKA 152

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1025 LESCEQEKQallqnLKEVEDKASAYEDQLQGHVQQVEALQKEklsetckgSEQVHKleeeleareaSIRQLAQHVQSLHD 1104
Cdd:COG1340    153 KKALEKNEK-----LKELRAELKELRKEAEEIHKKIKELAEE--------AQELHE----------EMIELYKEADELRK 209

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1105 ERDLIKHQFQELMERvatsdgdVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRtllreKEEELKHIKETHERVLEK 1182
Cdd:COG1340    210 EADELHKEIVEAQEK-------ADELHEEIIELQKELRELRKELKKLRKKQRALK-----REKEKEELEEKAEEIFEK 275

Mplasa_alph_rch

TIGR04523

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...

856-1212

1.11e-05

Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.79 E-value: 1.11e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  856 NQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGElkmEQGKvRE 935
Cdd:TIGR04523  309 NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE---NQSY-KQ 384

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  936 QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQ----ALQRDRQKEVQRLQECIAELSQQLGTSEQAQR 1011
Cdd:TIGR04523  385 EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIerlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRE 464

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1012 LMEKKLKrnytLLLESCEQEKQALLQNLKEVEDKASAYeDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREAS 1091
Cdd:TIGR04523  465 SLETQLK----VLSRSINKIKQNLEQKQKELKSKEKEL-KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESK 539

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1092 IRQLAQHVQSLHDE--RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL 1169
Cdd:TIGR04523  540 ISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKEL 619

                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1170 KHIKETHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEKE 1212
Cdd:TIGR04523  620 EKAKKENEKLSSIIK-NIKSKKNKLKQEVKQIKETIKEIRNKW 661

PH1_PLEKHH1_PLEKHH2

cd13282

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 ...

507-595

1.56e-05

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 (PLEKHH1) PH domain, repeat 1; PLEKHH1 and PLEKHH2 (also called PLEKHH1L) are thought to function in phospholipid binding and signal transduction. There are 3 Human PLEKHH genes: PLEKHH1, PLEKHH2, and PLEKHH3. There are many isoforms, the longest of which contain a FERM domain, a MyTH4 domain, two PH domains, a peroximal domain, a vacuolar domain, and a coiled coil stretch. The FERM domain has a cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241436 Cd Length: 96 Bit Score: 45.75 E-value: 1.56e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRD--SVAEEAAdldGEINLSTCYDVTeyPVQRNYGFQIHTKEGEFTLS 581
Cdd:cd13282      1 KAGYLTKL--GGKvktWKRRWFVLKNGELFYYKSpnDVIRKPQ---GQIALDGSCEIA--RAEGAQTFEIVTEKRTYYLT 73

                           90
                   ....*....|....
gi 1039737300  582 AMTSGIRRNWIQTI 595
Cdd:cd13282     74 ADSENDLDEWIRVI 87

mukB

PRK04863

chromosome partition protein MukB;

780-1120

1.61e-05

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 50.73 E-value: 1.61e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  780 LEDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQtevatspsgawQRLHRVNQDL 859
Cdd:PRK04863   289 LELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ-----------EKIERYQADL 357

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  860 QsELEAQCRRQELITQQIQTLKHSYGEAKDAIrhhEAEIQTLQTRLGNAAAELAIKE----------QALAKLKGELK-- 927
Cdd:PRK04863   358 E-ELEERLEEQNEVVEEADEQQEENEARAEAA---EEEVDELKSQLADYQQALDVQQtraiqyqqavQALERAKQLCGlp 433

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 -MEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEA--RLLEKTQEL-----------------RDLETQQALQRDRQK 987
Cdd:PRK04863   434 dLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahSQFEQAYQLvrkiagevsrseawdvaRELLRRLREQRHLAE 513

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  988 EVQRLQECIAELSQQLGTSEQAQRLM---EKKLKRNYTL--LLESCEQEKQALLQNLKE----VEDKASAYEDQLQGHVQ 1058
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLaefCKRLGKNLDDedELEQLQEELEARLESLSEsvseARERRMALRQQLEQLQA 593

                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 1059 QVEALQK---------EKLSETCkgsEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERV 1120
Cdd:PRK04863   594 RIQRLAArapawlaaqDALARLR---EQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEI 661

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

761-1227

1.89e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 50.35 E-value: 1.89e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  761 ETTPLREEkQVPIAPLHLSLEDRSERLSTHELTSLLEKELEQ-----SQKEASDLLEQNRLLQDQLRVALGREQSAREGY 835
Cdd:TIGR00618  401 ELDILQRE-QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  836 VLQTEVATSPSGAWQRLHRVnQDLQSELEAQCRRQELITQQIQTL------------KHSY-GEAKDAIRHHEAEIQTLQ 902
Cdd:TIGR00618  480 QIHLQETRKKAVVLARLLEL-QEEPCPLCGSCIHPNPARQDIDNPgpltrrmqrgeqTYAQlETSEEDVYHQLTSERKQR 558

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  903 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgqlraseqklRSTEARLLEKTQELRDLETQQALQ 982
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLS----------EAEDMLACEQHALLRKLQPEQDLQ 628

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  983 RDRQKEvqrlQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQN-LKEVEDKASAYEDQLQGHVQQVE 1061
Cdd:TIGR00618  629 DVRLHL----QQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLaLQKMQSEKEQLTYWKEMLAQCQT 704

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1062 ALQKEKLSETcKGSEQVHKLEEELEAREASIR-QLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVD 1140
Cdd:TIGR00618  705 LLRELETHIE-EYDREFNEIENASSSLGSDLAaREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHL 783

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1141 YQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFVS 1220
Cdd:TIGR00618  784 AAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQ 863


                   ....*..
gi 1039737300 1221 DSPKDAK 1227
Cdd:TIGR00618  864 LTQEQAK 870

HCR

pfam07111

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...

755-1210

1.97e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.

Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 50.14 E-value: 1.97e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  755 QRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLE-KELEQSQKEASDLLEQNRLLQDQLRVALGREQSARE 833
Cdd:pfam07111  146 QRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLETKRAGEaKQLAEAQKEAELLRKQLSKTQEELEAQVTLVESLRK 225

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  834 gYVLQTEVATSPSGAWqrlhrvnqdlqsELEaqcrRQELItQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELA 913
Cdd:pfam07111  226 -YVGEQVPPEVHSQTW------------ELE----RQELL-DTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELT 287

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  914 IKEQALAKLKGELKMeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA-----LQR---DR 985
Cdd:pfam07111  288 RKIQPSDSLEPEFPK---KCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQVAELQEQVTSQSqeqaiLQRalqDK 364

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  986 QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 1059
Cdd:pfam07111  365 AAEVEVERMSAKGLQMELSRAQEARRRQQQQTASAEEQLkfvvnaMSSTQIWLETTMTRVEQAVARIPSLSNRLSYAVRK 444

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1060 V---EALQKEKLS------ETCKGSEqvhKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERvATSDGDVAEL 1130
Cdd:pfam07111  445 VhtiKGLMARKVAlaqlrqESCPPPP---PAPPVDADLSLELEQLREERNRLDAELQLSAHLIQQEVGR-AREQGEAERQ 520

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 Q--EKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKM-IALGSSLEETEIK 1207
Cdd:pfam07111  521 QlsEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELTQQQEIYGQALQEKVAEVeTRLREQLSDTKRR 600


                   ...
gi 1039737300 1208 LQE 1210
Cdd:pfam07111  601 LNE 603

PRK02224

DNA double-strand break repair Rad50 ATPase;

781-1213

2.10e-05

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 50.04 E-value: 2.10e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  781 EDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYvlqTEVATSPSGAWQRLHRVNQDLQ 860
Cdd:PRK02224   293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEELREEAAELE 369

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  861 SELEAqCRRQelitqqiqtlkhsYGEAKDAIRHHEAEIQTLQTRLGNAAAELaikeQALAKLKGELKMEQGKVREQLEEw 940
Cdd:PRK02224   370 SELEE-AREA-------------VEDRREEIEELEEEIEELRERFGDAPVDL----GNAEDFLEELREERDELREREAE- 430

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  941 qhskamLSGQLRASEQKLRSTEaRLLEK------TQELRDLETQQALQRDRQKevqrlqecIAELSQQLGTSEQAQRLME 1014
Cdd:PRK02224   431 ------LEATLRTARERVEEAE-ALLEAgkcpecGQPVEGSPHVETIEEDRER--------VEELEAELEDLEEEVEEVE 495

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1015 KKLKRNYTllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQvhklEEELEAREASIRQ 1094
Cdd:PRK02224   496 ERLERAED--LVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEK----REAAAEAEEEAEE 569

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1095 LAQHVQSLHDERDLIKHQFQELmERVATSDGDVAELQ---EKLRGKEVDYQNLE-HSHHRVSVQLQSVRTLLREKE---- 1166
Cdd:PRK02224   570 AREEVAELNSKLAELKERIESL-ERIRTLLAAIADAEdeiERLREKREALAELNdERRERLAEKRERKRELEAEFDeari 648

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 1167 EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK02224   649 EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

796-1009

2.22e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 49.38 E-value: 2.22e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLrvalgrEQSAREGYVLQTEVATspsgAWQRLHRVNQDLQSELEAQCRRQELITQ 875
Cdd:COG4942     39 LEKELAALKKEEKALLKQLAALERRI------AALARRIRALEQELAA----LEAELAELEKEIAELRAELEAQKEELAE 108

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTL----KHSY-------GEAKDAIRHHEAeIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSK 944
Cdd:COG4942    109 LLRALyrlgRQPPlalllspEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEER 187

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300  945 AMLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQLGTSEQA 1009
Cdd:COG4942    188 AALEALKAERQKLLARLEKELAELAAELAEL----------QQEAEELEALIARLEAEAAAAAER 242

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

963-1213

2.52e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.91 E-value: 2.52e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  963 ARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytlllescEQEKQALLQNLKEV 1042
Cdd:COG4913    194 LRLLHKTQSFKPIGDLDDFVREYMLEEPDTFEAADALVEHFDDLERAHEALED-------------AREQIELLEPIREL 260

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1043 EDKASAYEDQLQGHVQQVEALQKEKlsetckGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVAT 1122
Cdd:COG4913    261 AERYAAARERLAELEYLRAALRLWF------AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRG 334

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1123 SDGD-VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSL 1201
Cdd:COG4913    335 NGGDrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414

                          250
                   ....*....|..
gi 1039737300 1202 EETEIKLQEKEE 1213
Cdd:COG4913    415 RDLRRELRELEA 426

PH_SWAP-70

cd13273

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called ...

507-595

3.10e-05

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called Differentially expressed in FDCP 6/DEF-6 or IRF4-binding protein) functions in cellular signal transduction pathways (in conjunction with Rac), regulates cell motility through actin rearrangement, and contributes to the transformation and invasion activity of mouse embryo fibroblasts. Metazoan SWAP-70 is found in B lymphocytes, mast cells, and in a variety of organs. Metazoan SWAP-70 contains an N-terminal EF-hand motif, a centrally located PH domain, and a C-terminal coiled-coil domain. The PH domain of Metazoan SWAP-70 contains a phosphoinositide-binding site and a nuclear localization signal (NLS), which localize SWAP-70 to the plasma membrane and nucleus, respectively. The NLS is a sequence of four Lys residues located at the N-terminus of the C-terminal a-helix; this is a unique characteristic of the Metazoan SWAP-70 PH domain. The SWAP-70 PH domain binds PtdIns(3,4,5)P3 and PtdIns(4,5)P2 embedded in lipid bilayer vesicles. There are additional plant SWAP70 proteins, but these are not included in this hierarchy. Rice SWAP70 (OsSWAP70) exhibits GEF activity toward the its Rho GTPase, OsRac1, and regulates chitin-induced production of reactive oxygen species and defense gene expression in rice. Arabidopsis SWAP70 (AtSWAP70) plays a role in both PAMP- and effector-triggered immunity. Plant SWAP70 contains both DH and PH domains, but their arrangement is the reverse of that in typical DH-PH-type Rho GEFs, wherein the DH domain is flanked by a C-terminal PH domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270092 Cd Length: 110 Bit Score: 44.98 E-value: 3.10e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYrdsVAEEAADLDGEINL--STCYDVTEYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13273     10 KKGYLWKKgHLLPTWTERWFVLKPNSLSYY---KSEDLKEKKGEIALdsNCCVESLPDREGKKCRFLVKTPDKTYELSAS 86

                           90
                   ....*....|..
gi 1039737300  584 TSGIRRNWIQTI 595
Cdd:cd13273     87 DHKTRQEWIAAI 98

PH_Osh1p_Osh2p_yeast

cd13292

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p ...

508-599

3.13e-05

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p is proposed to function in postsynthetic sterol regulation, piecemeal microautophagy of the nucleus, and cell polarity establishment. Yeast Osh2p is proposed to function in sterol metabolism and cell polarity establishment. Both Osh1p and Osh2p contain 3 N-terminal ankyrin repeats, a PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBP andOsh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. In general OSBPs and ORPs have been found to be involved in the transport and metabolism of cholesterol and related lipids in eukaryotes. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 241446 Cd Length: 103 Bit Score: 44.99 E-value: 3.13e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  508 KGWLTK--QYEDGqWKKHWFVLADQSLRYYRDSVAEEAAdLDGEINLSTCYDVteYPVQRNYGFQIHTKEG---EFTLSA 582
Cdd:cd13292      5 KGYLKKwtNYAKG-YKTRWFVLEDGVLSYYRHQDDEGSA-CRGSINMKNARLV--SDPSEKLRFEVSSKTSgspKWYLKA 80

                           90
                   ....*....|....*..
gi 1039737300  583 MTSGIRRNWIQTIMKHV 599
Cdd:cd13292     81 NHPVEAARWIQALQKAI 97

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

2041-2271

4.53e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.22 E-value: 4.53e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2041 LAPGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLKATcERGFAAMEeth 2120
Cdd:COG4942     10 LLALAAAAQADAAAEAEAE-LEQLQQEIAELEKELAALKKE--EKALLKQLAALERRIAALARRIRAL-EQELAALE--- 82

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2121 qKKIEDLQRQH---QRELEKLREEKDRLLA--------EETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEA 2189
Cdd:COG4942     83 -AELAELEKEIaelRAELEAQKEELAELLRalyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2190 LRR------QYLEELQSVQRELE----VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEIT 2259
Cdd:COG4942    162 LAAlraeleAERAELEALLAELEeeraALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241

                          250
                   ....*....|..
gi 1039737300 2260 RLRTLLTGDGGG 2271
Cdd:COG4942    242 RTPAAGFAALKG 253

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

1865-2349

4.65e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 48.95 E-value: 4.65e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1865 ACQEAKGASGQKRAQAVGALKEEYEELLHKQKsEYQKVITLIEKENTELKAKVSqmdhqqrclqEAENKHSESMFALqgr 1944
Cdd:pfam05483  198 AFEELRVQAENARLEMHFKLKEDHEKIQHLEE-EYKKEINDKEKQVSLLLIQIT----------EKENKMKDLTFLL--- 263

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1945 yeEEIRCMVEQLSHtENTLQAERSRVLSQ----LDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEH 2020
Cdd:pfam05483  264 --EESRDKANQLEE-KTKLQDENLKELIEkkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEEL 340

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2021 YIWSLRGALSLYQPSHPDSSLAPG-PSEPRAVPAAKD-------EAESMSGLRERIQELEAQMGVMREELgHKELEGDVA 2092
Cdd:pfam05483  341 NKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKNEDqlkiitmELQKKSSELEEMTKFKNNKEVELEEL-KKILAEDEK 419

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2093 ALQEKYQRD--FESLKATcERGFAAMEETHQKKIEDLQRQ----------HQRELEKLREE--KDRLLAEETAATISAIE 2158
Cdd:pfam05483  420 LLDEKKQFEkiAEELKGK-EQELIFLLQAREKEIHDLEIQltaiktseehYLKEVEDLKTEleKEKLKNIELTAHCDKLL 498

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2159 AMKNAHREE---MERELEKSQRSQISSINSDIEALRR-QYLEELQSVQR-ELEVLSEQYSQ-----KCLENAHLAQALEA 2228
Cdd:pfam05483  499 LENKELTQEasdMTLELKKHQEDIINCKKQEERMLKQiENLEEKEMNLRdELESVREEFIQkgdevKCKLDKSEENARSI 578

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2229 ERQALRQCQRENQELNAHNQ-----ELNNRLAAEITRLRTLLTGDGGGESTGLpltqgkDAYELEVllRVKESEIQYLKQ 2303
Cdd:pfam05483  579 EYEVLKKEKQMKILENKCNNlkkqiENKNKNIEELHQENKALKKKGSAENKQL------NAYEIKV--NKLELELASAKQ 650

                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 2304 EISSLKDELQTALRDKKYASDKykdIYTELSIAKAKADCDISRLKE 2349
Cdd:pfam05483  651 KFEEIIDNYQKEIEDKKISEEK---LLEEVEKAKAIADEAVKLQKE 693

PRK03918

DNA double-strand break repair ATPase Rad50;

1883-2364

4.69e-05

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 48.91 E-value: 4.69e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1883 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1960
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1961 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 2038
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2039 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2118
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2119 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2185
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2186 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2265
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2266 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2331
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612

                          490       500       510
                   ....*....|....*....|....*....|...
gi 1039737300 2332 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2364
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641

PH_Gab-like

cd13324

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...

509-595

4.92e-05

Pssm-ID: 270133 Cd Length: 112 Bit Score: 44.71 E-value: 4.92e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  509 GWLTK-----QYEDGQWKKHWFVL------ADQS-LRYYRDsvaEEAADLDGEINLSTCYDVT-----EYPVQRN-YGFQ 570
Cdd:cd13324      5 GWLTKsppekKIWRAAWRRRWFVLrsgrlsGGQDvLEYYTD---DHCKKLKGIIDLDQCEQVDagltfEKKKFKNqFIFD 81

                           90       100
                   ....*....|....*....|....*
gi 1039737300  571 IHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd13324     82 IRTPKRTYYLVAETEEEMNKWVRCI 106

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

973-1221

5.78e-05

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.51 E-value: 5.78e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  973 RDLETQQALqRDRQKEVQRLQECIAELSQQLGT----SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASA 1048
Cdd:TIGR02168  173 RRKETERKL-ERTRENLDRLEDILNELERQLKSlerqAEKAERYKELKAELR--------ELELALLVLRLEELREELEE 243

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1049 YEDQLQGHVQQVEALQKEKlsetckgseqvhkleeelEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVA 1128
Cdd:TIGR02168  244 LQEELKEAEEELEELTAEL------------------QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1129 ELQEKLRgkevdyqNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHErVLEKKDQDLNEALVKMIALgssLEETEIKL 1208
Cdd:TIGR02168  306 ILRERLA-------NLERQLEELEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAE---LEELESRL 374

                          250
                   ....*....|...
gi 1039737300 1209 QEKEECLRRFVSD 1221
Cdd:TIGR02168  375 EELEEQLETLRSK 387

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

2064-2262

6.97e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 6.97e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELghKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQhqreLEKLREEKD 2143
Cdd:COG3206    173 ARKALEFLEEQLPELRKEL--EEAEAALEEFRQKNG----------LVDLSEEAKLLLQQLSELESQ----LAEARAELA 236

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2144 RLLAEETAATiSAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEE---LQSVQRELEVLSEQYSQkclENA 2220
Cdd:COG3206    237 EAEARLAALR-AQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQ---EAQ 312

                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1039737300 2221 HLAQALEAERQALRQCQRE-NQELNAHNQELN--NRLAAEITRLR 2262
Cdd:COG3206    313 RILASLEAELEALQAREASlQAQLAQLEARLAelPELEAELRRLE 357

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2059-2324

7.19e-05

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 48.53 E-value: 7.19e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2059 ESMSGLRERIQELEAQMGVMREELGH------KELEGDVAALQEKYqRDFESLKATCERGFAAMEEtHQKKIEDLQRQHQ 2132
Cdd:TIGR02169  251 EELEKLTEEISELEKRLEEIEQLLEElnkkikDLGEEEQLRVKEKI-GELEAEIASLERSIAEKER-ELEDAEERLAKLE 328

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMEReleksqRSQISSINSDIEALRR---QYLEELQSVQRELE 2206
Cdd:TIGR02169  329 AEIDKLLAEIEELereIEEERKRRDKLTEEYAELKEELEDL------RAELEEVDKEFAETRDelkDYREKLEKLKREIN 402

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2207 VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTlltgdgggestglpLTQGKDAYE 2286
Cdd:TIGR02169  403 ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--------------LAADLSKYE 468

                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1039737300 2287 LEvlLRVKESEIQYLKQEISSLKDELQTALRDKKYASD 2324
Cdd:TIGR02169  469 QE--LYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

857-1071

7.33e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 47.52 E-value: 7.33e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgelkmeqgkvREQ 936
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-----------REE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  937 LEEWQHSKAMLSGQLRASEQKLRSTE-ARLLEKTQELRDL-ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLME 1014
Cdd:COG3883     88 LGERARALYRSGGSVSYLDVLLGSESfSDFLDRLSALSKIaDADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 1015 KKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 1071
Cdd:COG3883    168 AAKAE-----LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAA 219

rad50

TIGR00606

rad50; All proteins in this family for which functions are known are involvedin recombination, ...

766-1215

7.47e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).

Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.50 E-value: 7.47e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  766 REEKQVPIAPLHLSLEDRSERlSTHELTSLLEKELEQSQKEASDLLEQNRLLQdqlrVALGREQSARegyVLQTEVAtsp 845
Cdd:TIGR00606  724 RRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQKVNRDIQRLKNDIEEQETLLG----TIMPEEESAK---VCLTDVT--- 792

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  846 sgawqrlhrVNQDLQSELEAQCRRqelITQQIQTLKHSygeakDAIRHHEAEIQTLQTrlgnaaaelaiKEQALAKLKGE 925
Cdd:TIGR00606  793 ---------IMERFQMELKDVERK---IAQQAAKLQGS-----DLDRTVQQVNQEKQE-----------KQHELDTVVSK 844

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  926 LKMEQGKVREQLEEWQHskamlsgqLRASEQKLRStearllEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGT 1005
Cdd:TIGR00606  845 IELNRKLIQDQQEQIQH--------LKSKTNELKS------EKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQ-VEALQKEKLSETCKGSEQVHKLE 1082
Cdd:TIGR00606  911 DSPLETFLEKDQQEKEELISSKETSNKKAQdkVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECE 990

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1083 EELEAREASIRQLAQHVQSLHDERDLIKHQF---------QELMERVATSDGDVAELQekLRGKEVDYQNLEHSHHRVSV 1153
Cdd:TIGR00606  991 KHQEKINEDMRLMRQDIDTQKIQERWLQDNLtlrkrenelKEVEEELKQHLKEMGQMQ--VLQMKQEHQKLEENIDLIKR 1068

                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1154 QLQSVRTLLREKEEELKHIKethERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECL 1215
Cdd:TIGR00606 1069 NHVLALGRQKGYEKEIKHFK---KELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL 1127

PH_DOCK-D

cd13267

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...

48-145

8.14e-05

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also called Zizimin subfamily) consists of Dock9/Zizimin1, Dock10/Zizimin3, and Dock11/Zizimin2. DOCK-D has a N-terminal DUF3398 domain, a PH-like domain, a Dock Homology Region 1, DHR1 (also called CZH1), a C2 domain, and a C-terminal DHR2 domain (also called CZH2). Zizimin1 is enriched in the brain, lung, and kidney; zizimin2 is found in B and T lymphocytes, and zizimin3 is enriched in brain, lung, spleen and thymus. Zizimin1 functions in autoinhibition and membrane targeting. Zizimin2 is an immune-related and age-regulated guanine nucleotide exchange factor, which facilitates filopodial formation through activation of Cdc42, which results in activation of cell migration. No function has been determined for Zizimin3 to date. The N-terminal half of zizimin1 binds to the GEF domain through three distinct areas, including CZH1, to inhibit the interaction with Cdc42. In addition its PH domain binds phosphoinositides and mediates zizimin1 membrane targeting. DOCK is a family of proteins involved in intracellular signalling networks. They act as guanine nucleotide exchange factors for small G proteins of the Rho family, such as Rac and Cdc42. There are 4 subfamilies of DOCK family proteins based on their sequence homology: A-D. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270087 Cd Length: 126 Bit Score: 44.24 E-value: 8.14e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHG----LLRYALDEMPTTlPQGTINMNQCTDVVDGEARTGQKFSLCILT 123
Cdd:cd13267     10 GYLYKGPENSSDSFISLAMKSFKRRFFHLKQLVdgsyILEFYKDEKKKE-AKGTIFLDSCTGVVQNSKRRKFCFELRMQD 88

                           90       100
                   ....*....|....*....|..
gi 1039737300  124 PdKEHFIRAETKEIISGWLEML 145
Cdd:cd13267     89 K-KSYVLAAESEAEMDEWISKL 109

PH_evt

cd13265

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also ...

506-558

8.75e-05

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also called pleckstrin homology domain containing, family B): evt-1 (also called PLEKHB1) and evt-2 (also called PLEKHB2). evt-1 is specific to the nervous system, where it is expressed in photoreceptors and myelinating glia. evt-2 is widely expressed in both neural and nonneural tissues. Evectins possess a single N-terminal PH domain and a C-terminal hydrophobic region. evt-1 is thought to function as a mediator of post-Golgi trafficking in cells that produce large membrane-rich organelles. It is a candidate gene for the inherited human retinopathy autosomal dominant familial exudative vitreoretinopathy and a susceptibility gene for multiple sclerosis. evt-2 is essential for retrograde endosomal membrane transport from the plasma membrane (PM) to the Golgi. Two membrane trafficking pathways pass through recycling endosomes: a recycling pathway and a retrograde pathway that links the PM to the Golgi/ER. Its PH domain that is unique in that it specifically recognizes phosphatidylserine (PS), but not polyphosphoinositides. PS is an anionic phospholipid class in eukaryotic biomembranes, is highly enriched in the PM, and plays key roles in various physiological processes such as the coagulation cascade, recruitment and activation of signaling molecules, and clearance of apoptotic cells. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270085 Cd Length: 108 Bit Score: 43.83 E-value: 8.75e-05

                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  506 FKKGWLTKQYE-DGQWKKHWFVL-ADQSLRYYRDsvaEEAADLDGEINL-STCYDV 558
Cdd:cd13265      4 VKSGWLLRQSTiLKRWKKNWFVLyGDGNLVYYED---ETRREVEGRINMpRECRNI 56

PRK03918

DNA double-strand break repair ATPase Rad50;

780-1217

9.31e-05

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 9.31e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  780 LEDRSERLSTheltslLEKELEQSQKEASDLLEQNRLLQD--QLRVALGREQSAREGYVLQtevatspsgawqrlhrvnq 857
Cdd:PRK03918   333 LEEKEERLEE------LKKKLKELEKRLEELEERHELYEEakAKKEELERLKKRLTGLTPE------------------- 387

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  858 DLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNA---AAELAikEQALAKLKGELKMEQGKVR 934
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcGRELT--EEHRKELLEEYTAELKRIE 465

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  935 EQLEEWQHSKAMLSGQLRASEQKLR--STEARLLEKTQELRDLETQqaLQRDRQKEVQRLQECIAELSQQLGTSEQAQRL 1012
Cdd:PRK03918   466 KELKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEK--LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKS 543

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1013 MEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEalqkEKLSETCKGSEQVHKLEEELEAREASI 1092
Cdd:PRK03918   544 LKKELEK-----LEELKKKLAELEKKLDELEEELAELLKELEELGFESV----EELEERLKELEPFYNEYLELKDAEKEL 614

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1093 RQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL-----QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE 1167
Cdd:PRK03918   615 EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAELEELEKRREEIKK 694

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1168 ELKHIKETHERVLEKKD--QDLNEALVKMIALGSSLEetEIKLQEKEECLRR 1217
Cdd:PRK03918   695 TLEKLKEELEEREKAKKelEKLEKALERVEELREKVK--KYKALLKERALSK 744

PTZ00121

MAEBL; Provisional

1854-2258

9.73e-05

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 48.21 E-value: 9.73e-05

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1854 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1930
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1931 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 2010
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2011 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2090
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2170
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2171 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2250
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773


                   ....*...
gi 1039737300 2251 NNRLAAEI 2258
Cdd:PTZ00121  1774 RKEKEAVI 1781

DUF3584

pfam12128

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...

1947-2359

1.06e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.

Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 47.91 E-value: 1.06e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1947 EEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHyIWSLR 2026
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLESAELR-LSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGE-LSAAD 314

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2027 GALSLYQpSHPDSSlapgpsEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLK 2106
Cdd:pfam12128  315 AAVAKDR-SELEAL------EDQHGAFLDADIETAAADQEQLPSWQSELENLEER--LKALTGKHQDVTAKYNRRRSKIK 385

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2107 ATCERGFAAMEEthqkkiedlqrqhqrELEKLREEKDRLLAEETAatisAIEAMKNAHREEME------RELEKSQRSQI 2180
Cdd:pfam12128  386 EQNNRDIAGIKD---------------KLAKIREARDRQLAVAED----DLQALESELREQLEagklefNEEEYRLKSRL 446

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2181 SSIN---------SDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELN 2251
Cdd:pfam12128  447 GELKlrlnqatatPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELE 526

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2252 NRLAAEITRLRTLLTGDGGG--ESTG----------------LPLTQGKDAYEL-EVLLRVKESEIQ---YLKQEISSLK 2309
Cdd:pfam12128  527 LQLFPQAGTLLHFLRKEAPDweQSIGkvispellhrtdldpeVWDGSVGGELNLyGVKLDLKRIDVPewaASEEELRERL 606

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2310 DELQTALRDkkyASDKYKDIYTELSIAKA---KADCDISRLKEQLKAATEALG 2359
Cdd:pfam12128  607 DKAEEALQS---AREKQAAAEEQLVQANGeleKASREETFARTALKNARLDLR 656

DUF5401

pfam17380

Family of unknown function (DUF5401); This is a family of unknown function found in ...

2066-2242

1.09e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.

Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 47.43 E-value: 1.09e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2066 ERIQELEA-QMGVMRE-ELGHKELEG--DVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKLREE 2141
Cdd:pfam17380  375 SRMRELERlQMERQQKnERVRQELEAarKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE 454

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2142 KdrLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISS---INSDIEALRRQYLEELQS---VQRELE-----VLSE 2210
Cdd:pfam17380  455 E--QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRrkiLEKELEERKQAMIEEERKrklLEKEMEerqkaIYEE 532

                          170       180       190
                   ....*....|....*....|....*....|..
gi 1039737300 2211 QYSQKCLENAHLAQALEAERQALRQCQRENQE 2242
Cdd:pfam17380  533 ERRREAEEERRKQQEMEERRRIQEQMRKATEE 564

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

851-1217

1.15e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 1.15e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  851 RLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKE--QALAKLKGELKM 928
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAE 143

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  929 EQGKV---REQLEEWQHskamLSGQLRASEQKLRSTEARlLEKTQELRDLETQQALQrDRQKEVQRLQECIAELSQQLGT 1005
Cdd:COG4717    144 LPERLeelEERLEELRE----LEEELEELEAELAELQEE-LEELLEQLSLATEEELQ-DLAEELEELQQRLAELEEELEE 217

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNYTLLLESCEQEKQ--------------ALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 1071
Cdd:COG4717    218 AQEELEELEEELEQLENELEAAALEERLkearlllliaaallALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREK 297

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1072 CKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1151
Cdd:COG4717    298 ASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLA 377

                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1152 SVQLQSvRTLLREKEEELKHIKETHERVLEKKDQdLNEALVKMIALGSSLEETEIK--LQEKEECLRR 1217
Cdd:COG4717    378 EAGVED-EEELRAALEQAEEYQELKEELEELEEQ-LEELLGELEELLEALDEEELEeeLEELEEELEE 443

PH_TBC1D2A

cd01265

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...

67-145

1.21e-04

Pssm-ID: 269966 Cd Length: 102 Bit Score: 43.08 E-value: 1.21e-04

                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300   67 RKWQRRFFILYEHGLLRYALDEMPTTLPQGTINMNQCTDVVDGEARTGQkFSlcILTPDKEHFIRAETKEIISGWLEML 145
Cdd:cd01265     17 KGWKRRWFVLDESKCQLYYYRSPQDATPLGSIDLSGAAFSYDPEAEPGQ-FE--IHTPGRVHILKASTRQAMLYWLQAL 92

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

1919-2337

1.37e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 47.42 E-value: 1.37e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1919 QMDHQQRCLQEAENKHSESMFAlqgRYEEEIRCMVEQLSHTeNTLQAERSRVLSQldaSVKDRQAmeqhHVQQMKMLEDR 1998
Cdd:pfam15921   60 ELDSPRKIIAYPGKEHIERVLE---EYSHQVKDLQRRLNES-NELHEKQKFYLRQ---SVIDLQT----KLQEMQMERDA 128

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1999 FqLKVRELQAVHQEELRALQEHYIWSLRGALSLYQPSHPDSSLAPGP------------SEPRAVPAAKDEAESmsglrE 2066
Cdd:pfam15921  129 M-ADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQlrkmmlshegvlQEIRSILVDFEEASG-----K 202

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELGH------KELEGDVAALQEK---YQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEK 2137
Cdd:pfam15921  203 KIYEHDSMSTMHFRSLGSaiskilRELDTEISYLKGRifpVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITG 282

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKdrllaeetaatiSAIEAMKNAHREEME--RELEKSQRSQISSINSDIEALRRQYLEELQSVQRELE-VLSEQYSQ 2214
Cdd:pfam15921  283 LTEKA------------SSARSQANSIQSQLEiiQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdKIEELEKQ 350

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2215 KCLENAHLAQAlEAERQALRQ----CQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVl 2290
Cdd:pfam15921  351 LVLANSELTEA-RTERDQFSQesgnLDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV- 428

                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 2291 lRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAK 2337
Cdd:pfam15921  429 -QRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTK 474

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

796-1217

1.41e-04

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 47.27 E-value: 1.41e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSA-------REGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCR 868
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLkqlrariEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  869 RQ-ELITQQIQTLKHSYGEAKDAIRHHEAEIQTL--QTRLGN---AAAELAIKEQALAKLKGELKMEQGKVREQLEEWQH 942
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIeeQRRLLQtlhSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQ 386

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  943 SKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR00618  387 QKTTLTQKLQSLCKeldILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1020 NYTLLLEScEQEKQALLQNLKEVEDKASAYEDQLQG------------HVQQVEALQKEKLSETCKGSEQVHKLEEELEA 1087
Cdd:TIGR00618  467 SLKEREQQ-LQTKEQIHLQETRKKAVVLARLLELQEepcplcgscihpNPARQDIDNPGPLTRRMQRGEQTYAQLETSEE 545

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1088 REASIRQ-LAQHVQSLHDERDLIKHQFQELmervATSDGDVAELQEKLRGKEVDYQNL--EHSHHRVSVQLQSVRTLLRE 1164
Cdd:TIGR00618  546 DVYHQLTsERKQRASLKEQMQEIQQSFSIL----TQCDNRSKEDIPNLQNITVRLQDLteKLSEAEDMLACEQHALLRKL 621

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 1165 KEEELKHIKETHERVLEKKDQDLNEALVKmialgsslEETEIKLQEKEECLRR 1217
Cdd:TIGR00618  622 QPEQDLQDVRLHLQQCSQELALKLTALHA--------LQLTLTQERVREHALS 666

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

956-1159

1.52e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.22 E-value: 1.52e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQ-QLGTSEQAQRLMEKKLKRNyTLLLESCEQEKQA 1034
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAAlRLWFAQRRLELLEAELEEL-RAELARLEAELER 313

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1035 LLQNLKEVEDKASAYEDQLQGH-VQQVEALQKEklsetckgseqVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF 1113
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNgGDRLEQLERE-----------IERLERELEERERRRARLEALLAALGLPLPASAEEF 382

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1114 QEL----MERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVR 1159
Cdd:COG4913    383 AALraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432

PH_AtPH1

cd13276

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...

67-142

1.57e-04

Pssm-ID: 270095 Cd Length: 106 Bit Score: 43.07 E-value: 1.57e-04

                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300   67 RKWQRRFFILYEHGLLRY-ALDEMPTTLPQGTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWL 142
Cdd:cd13276     13 KTWRRRWFVLKQGKLFWFkEPDVTPYSKPRGVIDLSKCLTVKSAEDATNKENAFELSTPEETFYFIADNEKEKEEWI 89

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

2053-2210

1.60e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.22 E-value: 1.60e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAES-MSGLRERIQELEAQM---GVMREElghkELEGDVAALQEKY---QRDFESLKATCER-GFAAmeETHQKKI 2124
Cdd:COG4913    309 AELERLEArLDALREELDELEAQIrgnGGDRLE----QLEREIERLERELeerERRRARLEALLAAlGLPL--PASAEEF 382

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2125 EDLQRQHQRELEKLREEKDRLLAEETAAtISAIEAMKNAHREeMERELEkSQRSQISSINSDIEALRRQYLEELQSVQRE 2204
Cdd:COG4913    383 AALRAEAAALLEALEEELEALEEALAEA-EAALRDLRRELRE-LEAEIA-SLERRKSNIPARLLALRDALAEALGLDEAE 459


                   ....*.
gi 1039737300 2205 LEVLSE 2210
Cdd:COG4913    460 LPFVGE 465

PRK03918

DNA double-strand break repair ATPase Rad50;

2133-2400

1.80e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 46.98 E-value: 1.80e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEKDRLLAEEtaatiSAIEAMKnahrEEMERELEKSQRsQISSINSDIEALRRQyLEELQSVQRELEVLSEQY 2212
Cdd:PRK03918   172 KEIKRRIERLEKFIKRT-----ENIEELI----KEKEKELEEVLR-EINEISSELPELREE-LEKLEKEVKELEELKEEI 240

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2213 SQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRlAAEITRLRTLltgdgggestglpltqgKDAY-ELEVLL 2291
Cdd:PRK03918   241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK-VKELKELKEK-----------------AEEYiKLSEFY 302

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIyTELSIAKAKADCDISRLKEQLKAATEA---LGEKSPEGTTV 2368
Cdd:PRK03918   303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEKRLEELEERHELYEEAkakKEELERLKKRL 381

                          250       260       270
                   ....*....|....*....|....*....|..
gi 1039737300 2369 SGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSK 2400
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITAR 413

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

752-1054

2.10e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 46.98 E-value: 2.10e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  752 EIEQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLEK--ELEQSQKEASDLLEQNRLLQDQLRVALGREQ 829
Cdd:TIGR02169  699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDlsSLEQEIENVKSELKELEARIEELEEDLHKLE 778

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  830 SAREGyvLQTEVATSPsgaWQRLhrvnQDLQSELEAQCRRQELITQQIQ------TLKHSYgeAKDAIRHHEAEIQTLQT 903
Cdd:TIGR02169  779 EALND--LEARLSHSR---IPEI----QAELSKLEEEVSRIEARLREIEqklnrlTLEKEY--LEKEIQELQEQRIDLKE 847

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  904 RLGNAAAELAIKEQALAKLKGELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA 980
Cdd:TIGR02169  848 QIKSIEKEIENLNGKKEELEEELEELEAALRDlesRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300  981 LQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQ 1054
Cdd:TIGR02169  928 ALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEE-----IRALEPVNMLAIQEYEEVLKRLDELKEKRA 996

PH_Boi

cd13316

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally ...

508-597

2.21e-04

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally redundant and important for cell growth with Boi mutants displaying defects in bud formation and in the maintenance of cell polarity.They appear to be linked to Rho-type GTPase, Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid interactions with the GTP-bound ("active") form of Cdc42, while Rho3 can suppress of the lethality caused by deletion of Boi1 and Boi2. These findings suggest that Boi1 and Boi2 are targets of Cdc42 that promote cell growth in a manner that is regulated by Rho3. Boi proteins contain a N-terminal SH3 domain, followed by a SAM (sterile alpha motif) domain, a proline-rich region, which mediates binding to the second SH3 domain of Bem1, and C-terminal PH domain. The PH domain is essential for its function in cell growth and is important for localization to the bud, while the SH3 domain is needed for localization to the neck. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270126 Cd Length: 97 Bit Score: 42.36 E-value: 2.21e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  508 KGWLTKQYED-GQWKKHWFVLADQSLRYYRdsvAEEAADLDGEINLsTCYDVT----EYPVQRNYGFQI--HTKEGEFTL 580
Cdd:cd13316      3 SGWMKKRGERyGTWKTRYFVLKGTRLYYLK---SENDDKEKGLIDL-TGHRVVpddsNSPFRGSYGFKLvpPAVPKVHYF 78

                           90
                   ....*....|....*..
gi 1039737300  581 SAMTSGIRRNWIQTIMK 597
Cdd:cd13316     79 AVDEKEELREWMKALMK 95

PH_3BP2

cd13308

SH3 domain-binding protein 2 Pleckstrin homology (PH) domain; SH3BP2 (the gene that encodes ...

66-145

2.53e-04

SH3 domain-binding protein 2 Pleckstrin homology (PH) domain; SH3BP2 (the gene that encodes the adaptor protein 3BP2), HD, ITU, IT10C3, and ADD1 are located near the Huntington's Disease Gene on Human Chromosome 4pl6.3. SH3BP2 lies in a region that is often missing in individuals with Wolf-Hirschhorn syndrome (WHS). Gain of function mutations in SH3BP2 causes enhanced B-cell antigen receptor (BCR)-mediated activation of nuclear factor of activated T cells (NFAT), resulting in a rare, genetic disorder called cherubism. This results in an increase in the signaling complex formation with Syk, phospholipase C-gamma2 (PLC-gamma2), and Vav1. It was recently discovered that Tankyrase regulates 3BP2 stability through ADP-ribosylation and ubiquitylation by the E3-ubiquitin ligase. Cherubism mutations uncouple 3BP2 from Tankyrase-mediated protein destruction, which results in its stabilization and subsequent hyperactivation of the Src, Syk, and Vav signaling pathways. SH3BP2 is also a potential negative regulator of the abl oncogene. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270118 Cd Length: 113 Bit Score: 42.78 E-value: 2.53e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   66 SRKWQRRFFILYEHGLLrYALDEMPTTlPQGTINMNQCTDVVDGEARTGQKFSLCILTPDKEH---FIRAETKEIISGWL 142
Cdd:cd13308     25 LQNWQLRYVIIHQGCVY-YYKNDQSAK-PKGVFSLNGYNRRAAEERTSKLKFVFKIIHLSPDHrtwYFAAKSEDEMSEWM 102


                   ...
gi 1039737300  143 EML 145
Cdd:cd13308    103 EYI 105

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

2055-2235

2.84e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 46.30 E-value: 2.84e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2055 KDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQrE 2134
Cdd:COG4717     91 AELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPER------------LEELEERLE-E 157

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2135 LEKLREEKDRLLAEetaatisaiEAMKNAHREEMERELEKSQRSQISSINSDIEALR---RQYLEELQSVQRELEVLSEQ 2211
Cdd:COG4717    158 LRELEEELEELEAE---------LAELQEELEELLEQLSLATEEELQDLAEELEELQqrlAELEEELEEAQEELEELEEE 228

                          170       180
                   ....*....|....*....|....
gi 1039737300 2212 YSQkcLENAHLAQALEAERQALRQ 2235
Cdd:COG4717    229 LEQ--LENELEAAALEERLKEARL 250

SCP-1

pfam05483

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...

745-1170

2.87e-04

Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 46.25 E-value: 2.87e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  745 KTQNVHVEIEQRWHQVETT--PLREEKQVPIAPLHLSLEDRSERLstHELTSLLEKELEQSQKEASDLLEQNRLLQDQLR 822
Cdd:pfam05483  180 ETRQVYMDLNNNIEKMILAfeELRVQAENARLEMHFKLKEDHEKI--QHLEEEYKKEINDKEKQVSLLLIQITEKENKMK 257

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  823 VALGREQSAREGyVLQTEVATSpsgawqrlhrvnqdLQSE-LEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTL 901
Cdd:pfam05483  258 DLTFLLEESRDK-ANQLEEKTK--------------LQDEnLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIA 322

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  902 QTRLGNAAAELAIKEQALAKLKGELKMeqgkvreQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR----DLET 977
Cdd:pfam05483  323 TKTICQLTEEKEAQMEELNKAKAAHSF-------VVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQkkssELEE 395

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  978 QQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytllLESCEQEKQALLQNL-KEVED---KASAYEDQL 1053
Cdd:pfam05483  396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEE---------LKGKEQELIFLLQAReKEIHDleiQLTAIKTSE 466

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1054 QGHVQQVEALQKEKLSETCKGSEqvhkleeeleareasirqLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEK 1133
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIE------------------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ 528

                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1039737300 1134 LRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELK 1170
Cdd:pfam05483  529 EERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVK 565

HEC1

COG5185

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...

2053-2349

2.88e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 46.10 E-value: 2.88e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCE-----RGFAAMEETHQKKIEDL 2127
Cdd:COG5185    269 KLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEAEQEleeskRETETGIQNLTAEIEQG 348

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRLLAEETAATisaieamknahREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEV 2207
Cdd:COG5185    349 QESLTENLEAIKEEIENIVGEVELSK-----------SSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2208 LSEQysqkclenahlaqaLEAERQALRQCQRENQElnahNQELNNRLAAEITRLRTLLTGDGggeSTGLPLTQGKDAYEL 2287
Cdd:COG5185    418 ADRQ--------------IEELQRQIEQATSSNEE----VSKLLNELISELNKVMREADEES---QSRLEEAYDEINRSV 476

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQT--ALRDKKYASDKYKDIYTELSIAKAKADCDISRLKE 2349
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLKATLEKlrAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILA 540

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1758-2235

3.11e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 3.11e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1837
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1838 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1917
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1918 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1997
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1998 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 2077
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2078 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2155
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2156 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2230
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506


                   ....*
gi 1039737300 2231 QALRQ 2235
Cdd:COG4717    507 EEYRE 511

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

796-1215

3.42e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 46.26 E-value: 3.42e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNrllQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQelITQ 875
Cdd:pfam15921  247 LEALKSESQNKIELLLQQH---QDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQ--LSD 321

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQAlaklKGELKMEQGKVREQLEEwqhskamLSGQLRASE 955
Cdd:pfam15921  322 LESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQK-------LLADLHKRE 390

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRstearlLEKTQELR--DLETQQA-----LQR---DRQKEVQRLQ--------ECIAELSQQLGTSEQAQRLMEKkl 1017
Cdd:pfam15921  391 KELS------LEKEQNKRlwDRDTGNSitidhLRReldDRNMEVQRLEallkamksECQGQMERQMAAIQGKNESLEK-- 462

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1018 krnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSE--QVHKLEEELEAREASIRQL 1095
Cdd:pfam15921  463 ---VSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEitKLRSRVDLKLQELQHLKNE 539

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1096 AQHVQSLHDERDLIKHQFQELMERVatsdgdvaelqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLrEKE--------E 1167
Cdd:pfam15921  540 GDHLRNVQTECEALKLQMAEKDKVI-----------EILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEindrrlelQ 607

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1168 ELKHIKETHE---RVLEKKDQDLNEALVKMIALGSS-LEETEIKLQEKEECL 1215
Cdd:pfam15921  608 EFKILKDKKDakiRELEARVSDLELEKVKLVNAGSErLRAVKDIKQERDQLL 659

CCDC158

pfam15921

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...

761-1176

3.48e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.

Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 46.26 E-value: 3.48e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  761 ETTPLREEKQVPIAPLH-----LSLE-DRSERLSTHELTSL-----LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQ 829
Cdd:pfam15921  371 ESGNLDDQLQKLLADLHkrekeLSLEkEQNKRLWDRDTGNSitidhLRRELDDRNMEVQRLEALLKAMKSECQGQMERQM 450

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  830 SAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGnaa 909
Cdd:pfam15921  451 AAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD--- 527

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  910 aelaIKEQALAKLKGE---------------LKM-EQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLlEKTQELR 973
Cdd:pfam15921  528 ----LKLQELQHLKNEgdhlrnvqtecealkLQMaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEINDR 602

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQ--ALQRDRQKEVQRLQECIAEL-----------SQQLGT-----SEQAQRLMEKKLKRNYtllLESCEQEKQAL 1035
Cdd:pfam15921  603 RLELQEfkILKDKKDAKIRELEARVSDLelekvklvnagSERLRAvkdikQERDQLLNEVKTSRNE---LNSLSEDYEVL 679

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1036 LQNLKEVEDKASAYEDQLQGHVQ--QVEALQKEKLSETCKGSE------------QVHKLEEELEAREASIRQLAQHVQS 1101
Cdd:pfam15921  680 KRNFRNKSEEMETTTNKLKMQLKsaQSELEQTRNTLKSMEGSDghamkvamgmqkQITAKRGQIDALQSKIQFLEEAMTN 759

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1102 LHDERDLIKHQFQEL---MERVATSDGDVAELQEKLRGKEVDYQ----NLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:pfam15921  760 ANKEKHFLKEEKNKLsqeLSTVATEKNKMAGELEVLRSQERRLKekvaNMEVALDKASLQFAECQDIIQRQEQESVRLKL 839


                   ..
gi 1039737300 1175 TH 1176
Cdd:pfam15921  840 QH 841

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

912-1168

3.63e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 45.53 E-value: 3.63e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  912 LAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQrdrQKEVQR 991
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL---EAELAE 87

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  992 LQECIAELSQQLGTSEQ--AQRL--MEKKLKRNYTLLLESCEqekqallqNLKEVEDKASAYEDQLQGHVQQVEALqkek 1067
Cdd:COG4942     88 LEKEIAELRAELEAQKEelAELLraLYRLGRQPPLALLLSPE--------DFLDAVRRLQYLKYLAPARREQAEEL---- 155

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1068 lsetckgseqvhklEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHS 1147
Cdd:COG4942    156 --------------RADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221

                          250       260
                   ....*....|....*....|.
gi 1039737300 1148 HHRVSVQLQSVRTLLREKEEE 1168
Cdd:COG4942    222 AEELEALIARLEAEAAAAAER 242

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

1854-2145

3.63e-04

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 46.20 E-value: 3.63e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1854 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1933
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1934 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 2011
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2012 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 2084
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 2085 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 2145
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

852-1217

4.33e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 45.58 E-value: 4.33e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  852 LHRVNQDLQSELEAQCRRQELITQQIQT--------------LKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAE------ 911
Cdd:pfam10174  245 LERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevykshskfMKNKIDQLKQELSKKESELLALQTKLETLTNQnsdckq 324

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  912 --------LAIKEQALAKLKGE-----LKMEQ-----GKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR 973
Cdd:pfam10174  325 hievlkesLTAKEQRAAILQTEvdalrLRLEEkesflNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQqalQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDkasaYEDQL 1053
Cdd:pfam10174  405 NLQEQ---LRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELES----LKKEN 477

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1054 QGHVQQVEALQKEKLSETCKGS---EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDvAEL 1130
Cdd:pfam10174  478 KDLKEKVSALQPELTEKESSLIdlkEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVRTN-PEI 556

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEElKHIKETHERVLEK------KDQDLNEALVKMialgSSLEET 1204
Cdd:pfam10174  557 NDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENE-KNDKDKKIAELESltlrqmKEQNKKVANIKH----GQQEMK 631

                          410
                   ....*....|...
gi 1039737300 1205 EIKLQEKEECLRR 1217
Cdd:pfam10174  632 KKGAQLLEEARRR 644

PRK03918

DNA double-strand break repair ATPase Rad50;

1717-2352

4.60e-04

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.83 E-value: 4.60e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1717 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1796
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1797 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1875
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1876 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1955
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1956 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 2035
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2036 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2113
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2114 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2191
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2192 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2268
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659

                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2269 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2347
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726


                   ....*
gi 1039737300 2348 KEQLK 2352
Cdd:PRK03918   727 REKVK 731

YhaN

COG4717

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

1877-2339

4.68e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];

Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.53 E-value: 4.68e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1877 RAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQEAENKHSESMFALQGRYEE---EIRCMV 1953
Cdd:COG4717     44 RAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAE-LQEELEELEEELEELEAELEElreELEKLE 122

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1954 EQLSHTENTLQAER-SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQlKVRELQAVHQEELRALQEHYIWSLRGALSLY 2032
Cdd:COG4717    123 KLLQLLPLYQELEAlEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQLSLATEEELQDLAEEL 201

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2033 QpshpdsslapgpsepravpAAKDEAESmsgLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCER- 2111
Cdd:COG4717    202 E-------------------ELQQRLAE---LEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALl 259

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2112 GFAAMEETHQKKIEDLQRQHQ-----RELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSD 2186
Cdd:COG4717    260 ALLGLGGSLLSLILTIAGVLFlvlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2187 IEALRRqyLEELQSVQRELEVLSEQYSQKCLE---NAHLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRT 2263
Cdd:COG4717    340 LELLDR--IEELQELLREAEELEEELQLEELEqeiAALLAEAGVEDEEELRAALEQAEEY----QELKEELEELEEQLEE 413

                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 2264 LLTGDGGGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYAsdkykDIYTELSIAKAK 2339
Cdd:COG4717    414 LLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELA-----ELLQELEELKAE 484

PH_OSBP_ORP4

cd13284

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; ...

507-593

5.04e-04

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; Human OSBP is proposed to function is sterol-dependent regulation of ERK dephosphorylation and sphingomyelin synthesis as well as modulation of insulin signaling and hepatic lipogenesis. It contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBPs and Osh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. ORP4 is proposed to function in Vimentin-dependent sterol transport and/or signaling. Human ORP4 has 2 forms, a long (ORP4L) and a short (ORP4S). ORP4L contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. ORP4S is truncated and contains only an OSBP-related domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270101 Cd Length: 99 Bit Score: 41.59 E-value: 5.04e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTK--QYEDGqWKKHWFVLADQSLRYYRdSVAEEAADLDGEINLSTCYDVTEYPVQrnygFQIHT-KEGEFTLSAM 583
Cdd:cd13284      1 MKGWLLKwtNYIKG-YQRRWFVLSNGLLSYYR-NQAEMAHTCRGTINLAGAEIHTEDSCN----FVISNgGTQTFHLKAS 74

                           90
                   ....*....|
gi 1039737300  584 TSGIRRNWIQ 593
Cdd:cd13284     75 SEVERQRWVT 84

PH_GRP1-like

cd01252

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 ...

507-542

5.27e-04

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 and the related proteins ARNO (ARF nucleotide-binding site opener)/cytohesin-2 and cytohesin-1 are ARF exchange factors that contain a pleckstrin homology (PH) domain thought to target these proteins to cell membranes through binding polyphosphoinositides. The PH domains of all three proteins exhibit relatively high affinity for PtdIns(3,4,5)P3. Within the Grp1 family, diglycine (2G) and triglycine (3G) splice variants, differing only in the number of glycine residues in the PH domain, strongly influence the affinity and specificity for phosphoinositides. The 2G variants selectively bind PtdIns(3,4,5)P3 with high affinity,the 3G variants bind PtdIns(3,4,5)P3 with about 30-fold lower affinity and require the polybasic region for plasma membrane targeting. These ARF-GEFs share a common, tripartite structure consisting of an N-terminal coiled-coil domain, a central domain with homology to the yeast protein Sec7, a PH domain, and a C-terminal polybasic region. The Sec7 domain is autoinhibited by conserved elements proximal to the PH domain. GRP1 binds to the DNA binding domain of certain nuclear receptors (TRalpha, TRbeta, AR, ER, but not RXR), and can repress thyroid hormone receptor (TR)-mediated transactivation by decreasing TR-complex formation on thyroid hormone response elements. ARNO promotes sequential activation of Arf6, Cdc42 and Rac1 and insulin secretion. Cytohesin acts as a PI 3-kinase effector mediating biological responses including cell spreading and adhesion, chemotaxis, protein trafficking, and cytoskeletal rearrangements, only some of which appear to depend on their ability to activate ARFs. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269954 Cd Length: 119 Bit Score: 41.92 E-value: 5.27e-04

                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1039737300  507 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRDSVAEE 542
Cdd:cd01252      5 REGWLLKL--GGRvksWKRRWFILTDNCLYYFEYTTDKE 41

PRK02224

DNA double-strand break repair Rad50 ATPase;

2053-2315

5.45e-04

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 45.42 E-value: 5.45e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAEsmsgLRERIQELEAQMGVMREELGHKELEGDVAALQ-----------EKYQRDFESLKATCERGFAAMEETHQ 2121
Cdd:PRK02224   197 EEKEEKD----LHERLNGLESELAELDEEIERYEEQREQARETrdeadevleehEERREELETLEAEIEDLRETIAETER 272

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2122 KK--IEDLQRQHQRELEKLREEKDRLLAEE--TAATISAIEAMKN---AHREEMERELEKsQRSQISSINSDIEALRrqy 2194
Cdd:PRK02224   273 EReeLAEEVRDLRERLEELEEERDDLLAEAglDDADAEAVEARREeleDRDEELRDRLEE-CRVAAQAHNEEAESLR--- 348

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2195 leelqsvqRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAaeitrlrtlltgdgggest 2274
Cdd:PRK02224   349 --------EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFG------------------- 401

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1039737300 2275 GLPLTQGKDAYELEVLLrvkeSEIQYLKQEISSLKDELQTA 2315
Cdd:PRK02224   402 DAPVDLGNAEDFLEELR----EERDELREREAELEATLRTA 438

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

908-1183

5.91e-04

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 45.34 E-value: 5.91e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  908 AAAELAIKEQALAK---LKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRD 984
Cdd:TIGR00618  182 ALMEFAKKKSLHGKaelLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQL 261

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  985 RQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQ 1064
Cdd:TIGR00618  262 LKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1065 KEKLSETCKGSEQVH------------KLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSD-------- 1124
Cdd:TIGR00618  342 EQRRLLQTLHSQEIHirdahevatsirEISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDtrtsafrd 421

                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1125 --GDVAEL-------QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKK 1183
Cdd:TIGR00618  422 lqGQLAHAkkqqelqQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKK 489

SMC_prok_A

TIGR02169

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...

2128-2409

5.91e-04

Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 45.44 E-value: 5.91e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRLLAEEtAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALrrqyLEELQSVQRELEV 2207
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKREL-SSLQSELRRIEN-RLDELSQELSDASR-KIGEIEKEIEQL----EQEEEKLKERLEE 741

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2208 LSEQYSQkclenahLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYEL 2287
Cdd:TIGR02169  742 LEEDLSS-------LEQEIENVKSELKELEARIEEL----EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKadcdISRLKEQLKAATEALgekspegtt 2367
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAAL--------- 877

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1039737300 2368 vsgYDIMKSKSNpdfLKKDRSCVTRQLRNIRSKsvIEQVSWD 2409
Cdd:TIGR02169  878 ---RDLESRLGD---LKKERDELEAQLRELERK--IEELEAQ 911

PHA02562

endonuclease subunit; Provisional

852-1077

6.25e-04

endonuclease subunit; Provisional

Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 45.01 E-value: 6.25e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  852 LHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKlkgeLKMEQG 931
Cdd:PHA02562   190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNK----LNTAAA 265

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLS--GQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQKEVQRLQEcIAELSQQLgtseq 1008
Cdd:PHA02562   266 KIKSKIEQFQKVIKMYEkgGVCPTCTQQISEGPDRITKIKDKLKELQHSlEKLDTAIDELEEIMDE-FNEQSKKL----- 339

                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 1009 aqrlmeKKLKRNYtlllESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE--KLSETCKGSEQ 1077
Cdd:PHA02562   340 ------LELKNKI----STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEldKIVKTKSELVK 400

PH_Btk

cd01238

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of ...

507-595

6.67e-04

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of cytoplasmic protein tyrosine kinases that includes BMX, IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a role in the maturation of B cells. Tec proteins general have an N-terminal PH domain, followed by a Tek homology (TH) domain, a SH3 domain, a SH2 domain and a kinase domain. The Btk PH domain binds phosphatidylinositol 3,4,5-trisphosphate and responds to signalling via phosphatidylinositol 3-kinase. The PH domain is also involved in membrane anchoring which is confirmed by the discovery of a mutation of a critical arginine residue in the BTK PH domain. This results in severe human immunodeficiency known as X-linked agammaglobulinemia (XLA) in humans and a related disorder is mice.PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269944 [Multi-domain] Cd Length: 140 Bit Score: 42.22 E-value: 6.67e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyedGQ---------WKKHWFVLADQSLRYYrDSVAEEAADLDGEINLSTCYDV----TEYPVQRNYGFQIHT 573
Cdd:cd01238      1 LEGLLVKR---SQgkkrfgpvnYKERWFVLTKSSLSYY-EGDGEKRGKEKGSIDLSKVRCVeevkDEAFFERKYPFQVVY 76

                           90       100
                   ....*....|....*....|..
gi 1039737300  574 KEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd01238     77 DDYTLYVFAPSEEDRDEWIAAL 98

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

1868-2255

6.80e-04

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 45.17 E-value: 6.80e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1868 EAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKEN------TELKAkVSQMDHQQRCLQEAENKHSESMFAL 1941
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqmlAEEKA-ISARYAEERDRAEAEAREKETRALS 640

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1942 QGRYEEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKD-------RQAMEQhHVQQMKM----LEDrfqlkvrELQAVH 2010
Cdd:pfam01576  641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNvhelersKRALEQ-QVEEMKTqleeLED-------ELQATE 712

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2011 QEELR------ALQEHYIWSLRgalslyqpshpdsslapgpsepravpaAKDEA--ESMSGLRERIQELEAQMGVMREEl 2082
Cdd:pfam01576  713 DAKLRlevnmqALKAQFERDLQ---------------------------ARDEQgeEKRRQLVKQVRELEAELEDERKQ- 764

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2083 ghkelEGDVAALQEKYQRDFESLKATCERGFAAMEET--HQKKIEDLQRQHQRELEKLREEKDRLLA--EETAATISAIE 2158
Cdd:pfam01576  765 -----RAQAVAAKKKLELDLKELEAQIDAANKGREEAvkQLKKLQAQMKDLQRELEEARASRDEILAqsKESEKKLKNLE 839

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2159 A-----------------MKNAHREEMERELEK--SQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLEN 2219
Cdd:pfam01576  840 AellqlqedlaaserarrQAQQERDELADEIASgaSGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQV 919

                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1039737300 2220 AHLAQALEAERQALRQCQRENQELNAHNQELNNRLA 2255
Cdd:pfam01576  920 EQLTTELAAERSTSQKSESARQQLERQNKELKAKLQ 955

PRK09039

peptidoglycan -binding protein;

2115-2267

7.05e-04

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 44.19 E-value: 7.05e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2115 AMEETHQKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEM--ERELEKSQRSQISSINSDIEAL 2190
Cdd:PRK09039    70 SLERQGNQDLQDSVANLRASLSAAEAERSRLqaLLAELAGAGAAAEGRAGELAQELdsEKQVSARALAQVELLNQQIAAL 149

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2191 RRQyleeLQSVQRELEVlSEQYSQkclenahlaqalEAERQALRQCQRENQELNAHNQELnNRLAAE-ITRLRTLLTG 2267
Cdd:PRK09039   150 RRQ----LAALEAALDA-SEKRDR------------ESQAKIADLGRRLNVALAQRVQEL-NRYRSEfFGRLREILGD 209

PH_Gab2_2

cd13384

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...

506-595

7.61e-04

Pssm-ID: 241535 Cd Length: 115 Bit Score: 41.27 E-value: 7.61e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTK-----QYEDGQWKKHWFVLADQS------LRYYRDsvaEEAADLDGEINLSTCYDV-----TEYPVQRNYG- 568
Cdd:cd13384      4 VYEGWLTKsppekRIWRAKWRRRYFVLRQSEipgqyfLEYYTD---RTCRKLKGSIDLDQCEQVdagltFETKNKLKDQh 80

                           90       100
                   ....*....|....*....|....*...
gi 1039737300  569 -FQIHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd13384     81 iFDIRTPKRTYYLVADTEDEMNKWVNCI 108

EnvC

COG4942

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...

794-963

8.19e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.37 E-value: 8.19e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  794 SLLEKELEQSQKEAS----DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRR 869
Cdd:COG4942     79 AALEAELAELEKEIAelraELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRAD 158

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QELITQQIQTLKhsygEAKDAIRHHEAEIQTLQTRLgnaAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG 949
Cdd:COG4942    159 LAELAALRAELE----AERAELEALLAELEEERAAL---EALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231

                          170
                   ....*....|....
gi 1039737300  950 QLRASEQKLRSTEA 963
Cdd:COG4942    232 LEAEAAAAAERTPA 245

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

2174-2381

8.21e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 44.44 E-value: 8.21e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2174 KSQRSQISSINSDIEALRrqylEELQSVQRELEVLSEQYSQKcleNAHLAQALEAERQALRQCQRENQELNAHNQELNNR 2253
Cdd:COG3883     19 QAKQKELSELQAELEAAQ----AELDALQAELEELNEEYNEL---QAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2254 LAA------EITRLRTLLTGDGGGE--------STGLPLTQG--KDAYELEVLLRVKESEIQYLKQEISSLKDELQTALR 2317
Cdd:COG3883     92 ARAlyrsggSVSYLDVLLGSESFSDfldrlsalSKIADADADllEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 2318 DKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPEGTTVSGYDIMKSKSNPD 2381
Cdd:COG3883    172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAA 235

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

2058-2258

8.22e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 44.91 E-value: 8.22e-04

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMR--------------EELGHKELEGDVAALQEKYQR------DFESLK---ATCERGFA 2114
Cdd:COG4913    623 EEELAEAEERLEALEAELDALQerrealqrlaeyswDEIDVASAEREIAELEAELERldassdDLAALEeqlEELEAELE 702

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2115 AMEEtHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAI------------EAMKNAHREEMERELEKSQ---RSQ 2179
Cdd:COG4913    703 ELEE-ELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARlelralleerfaAALGDAVERELRENLEERIdalRAR 781

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2180 ISSINSDIEALRRQYLEE----LQSVQRELEVLSEqYSQKC--LENAHLAQALEAERQALRQCQRENQElnahnqELNNR 2253
Cdd:COG4913    782 LNRAEEELERAMRAFNREwpaeTADLDADLESLPE-YLALLdrLEEDGLPEYEERFKELLNENSIEFVA------DLLSK 854


                   ....*
gi 1039737300 2254 LAAEI 2258
Cdd:COG4913    855 LRRAI 859

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1850-2243

1.01e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 44.54 E-value: 1.01e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1850 RIRLEYEKELRFYKKACQEAKGASGQKRAQAvgALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQE 1929
Cdd:COG1196    380 ELEELAEELLEALRAAAELAAQLEELEEAEE--ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE-LEE 456

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1930 AENKHSESMFALQGRYEEEIrcmvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAV 2009
Cdd:COG1196    457 EEEALLELLAELLEEAALLE----AALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV 532

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2010 HQEELRALQEhyiwsLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElGHKELEG 2089
Cdd:COG1196    533 EAAYEAALEA-----ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA-AVDLVAS 606

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2090 DVAALQEKYQRDFESL------KATCERGFAAMEETHQKKIE---DLQRQHQRELEKLREEKDRLLAEETAATISAIEAM 2160
Cdd:COG1196    607 DLREADARYYVLGDTLlgrtlvAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAE 686

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2161 KNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAE----------R 2230
Cdd:COG1196    687 RLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEElpeppdleelE 766

                          410
                   ....*....|...
gi 1039737300 2231 QALRQCQRENQEL 2243
Cdd:COG1196    767 RELERLEREIEAL 779

COG5022

Myosin heavy chain [General function prediction only];

856-1322

1.01e-03

Myosin heavy chain [General function prediction only];

Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 44.68 E-value: 1.01e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  856 NQDLQSELEAQCRRQELITQQI----QTLKHSYGEAkdAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKmEQG 931
Cdd:COG5022    819 IIKLQKTIKREKKLRETEEVEFslkaEVLIQKFGRS--LKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVK-SIS 895

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseq 1008
Cdd:COG5022    896 SLKLVNLELESEIIELKKSLSSDLIenlEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS----- 970

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1009 aqrlmekklkrnytlllesceQEKQALLqnlkeveDKASAYEDQLQGHVQQVEALQKEkLSETCKGSEQVHKLEEELEAR 1088
Cdd:COG5022    971 ---------------------EEYEDLL-------KKSTILVREGNKANSELKNFKKE-LAELSKQYGALQESTKQLKEL 1021

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1089 EASIRQLAQHVQSLHDERDlIKHQFQELMERVATSDGDVAELQEKLrgKEVDYQN-LEHSHHRVSVQLQSVRTLlrEKEE 1167
Cdd:COG5022   1022 PVEVAELQSASKIISSEST-ELSILKPLQKLKGLLLLENNQLQARY--KALKLRReNSLLDDKQLYQLESTENL--LKTI 1096

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1168 ELKHIKETHERVLEKKdqdlnEALVKMIALGSSLEEteikLQEKEECLRRFVSDSPkDAKEPLSTTEPTEEGSGILPLGS 1247
Cdd:COG5022   1097 NVKDLEVTNRNLVKPA-----NVLQFIVAQMIKLNL----LQEISKFLSQLVNTLE-PVFQKLSVLQLELDGLFWEANLE 1166

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1248 VTRVFPGFpHSQPEDEDPSAGLGEEGSSGS-------------LSREENTILPKSADMPE--REG-HLQSTSKSDPGapI 1311
Cdd:COG5022   1167 ALPSPPPF-AALSEKRLYQSALYDEKSKLSssevndlkneliaLFSKIFSGWPRGDKLKKliSEGwVPTEYSTSLKG--F 1243

                          490
                   ....*....|.
gi 1039737300 1312 KRPRIRFSTIQ 1322
Cdd:COG5022   1244 NNLNKKFDTPA 1254

PRK02224

DNA double-strand break repair Rad50 ATPase;

2058-2243

1.05e-03

DNA double-strand break repair Rad50 ATPase;

Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 44.65 E-value: 1.05e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQekyQRDFESLKATCERgfaAMEETHQKkiedlQRQHQRELEK 2137
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEAR---REELEDRDEELRD---RLEECRVA-----AQAHNEEAES 346

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKDRLlaEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY---LEELQSVQRELEVLSEQysq 2214
Cdd:PRK02224   347 LREDADDL--EERAEELREEAAELESELEEAREAVED-RREEIEELEEEIEELRERFgdaPVDLGNAEDFLEELREE--- 420

                          170       180       190
                   ....*....|....*....|....*....|
gi 1039737300 2215 kcLENAHLAQA-LEAERQALRQCQRENQEL 2243
Cdd:PRK02224   421 --RDELREREAeLEATLRTARERVEEAEAL 448

PRK10246

exonuclease subunit SbcC; Provisional

796-1059

1.22e-03

exonuclease subunit SbcC; Provisional

Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 44.41 E-value: 1.22e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRvalgREQSAREGYVLQTEVATSpsgAWQRL-------HRVNQDLQSELEAQCR 868
Cdd:PRK10246   535 LEKEVKKLGEEGAALRGQLDALTKQLQ----RDESEAQSLRQEEQALTQ---QWQAVcaslnitLQPQDDIQPWLDAQEE 607

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  869 RQELITQ--QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIK------EQA-LAKLKGELKMEQGKVREQ--L 937
Cdd:PRK10246   608 HERQLRLlsQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTlpqedeEASwLATRQQEAQSWQQRQNELtaL 687

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  938 EEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD----LETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQR- 1011
Cdd:PRK10246   688 QNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHEqclsLHSQlQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQq 767

                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1012 ------LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 1059
Cdd:PRK10246   768 aflaalLDEETLTQ--------LEQLKQNLENQRQQAQTLVTQTAQALAQHQQH 813

GumC

COG3206

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

2058-2227

1.24e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];

Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 44.24 E-value: 1.24e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATCErgfAAMEETHQKKIEDLQRQHQRELEK 2137
Cdd:COG3206    211 SEEAKLLLQQLSELESQLAEARAEL--AEAEARLAALRAQLGSGPDALPELLQ---SPVIQQLRAQLAELEAELAELSAR 285

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKDRL--LAEETAATISAIEAMKNAHREEMERELEkSQRSQISSINSDIEALRRQYLE------ELQSVQRELEVLS 2209
Cdd:COG3206    286 YTPNHPDViaLRAQIAALRAQLQQEAQRILASLEAELE-ALQAREASLQAQLAQLEARLAElpeleaELRRLEREVEVAR 364

                          170       180
                   ....*....|....*....|
gi 1039737300 2210 EQYSQ--KCLENAHLAQALE 2227
Cdd:COG3206    365 ELYESllQRLEEARLAEALT 384

mukB

PRK04863

chromosome partition protein MukB;

1867-2264

1.27e-03

chromosome partition protein MukB;

Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 44.18 E-value: 1.27e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1867 QEAKGASGQ-KRAQA-VGALKEEYEE------LLHKQKSEYQKVITLIEKENTELKAKVSqmDHQQRcLQEAENKhsesm 1938
Cdd:PRK04863   341 QTALRQQEKiERYQAdLEELEERLEEqnevveEADEQQEENEARAEAAEEEVDELKSQLA--DYQQA-LDVQQTR----- 412

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1939 fALQGRyeeeircmveqlshteNTLQA-ERSRVLSQLDA----SVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEE 2013
Cdd:PRK04863   413 -AIQYQ----------------QAVQAlERAKQLCGLPDltadNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQF 475

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2014 LRALQehyiwSLRgalslyqpshpdsSLAPGPSEPRAVPAAKD---EAESMSGLRERIQELEAQmgvmreelgHKELEGD 2090
Cdd:PRK04863   476 EQAYQ-----LVR-------------KIAGEVSRSEAWDVAREllrRLREQRHLAEQLQQLRMR---------LSELEQR 528

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAAlqekyQRDFESLKATCERGFAAMEEThQKKIEDLQRQHQRELEKLREEKDRLlaeetaatisaieamkNAHREEMER 2170
Cdd:PRK04863   529 LRQ-----QQRAERLLAEFCKRLGKNLDD-EDELEQLQEELEARLESLSESVSEA----------------RERRMALRQ 586

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2171 ELEKsqrsqissINSDIEALRRQYLEELQSvQRELEVLSEQY------SQKCLEnaHLAQALEAERQALR---QCQRENQ 2241
Cdd:PRK04863   587 QLEQ--------LQARIQRLAARAPAWLAA-QDALARLREQSgeefedSQDVTE--YMQQLLERERELTVerdELAARKQ 655

                          410       420
                   ....*....|....*....|...
gi 1039737300 2242 ELNAHNQELNNRLAAEITRLRTL 2264
Cdd:PRK04863   656 ALDEEIERLSQPGGSEDPRLNAL 678

Cast

pfam10174

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...

2121-2264

1.37e-03

Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 44.04 E-value: 1.37e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2121 QKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISA--------------IEAMKNAH-REEMER--ELEkSQRSQIS 2181
Cdd:pfam10174  400 QKKIENLQEQLRDKDKQLAGLKERVksLQTDSSNTDTAlttleealsekeriIERLKEQReREDRERleELE-SLKKENK 478

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2182 SINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQR-ENQELNAHNQELNNRLAAEIT- 2259
Cdd:pfam10174  479 DLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKlENQLKKAHNAEEAVRTNPEINd 558


                   ....*
gi 1039737300 2260 RLRTL 2264
Cdd:pfam10174  559 RIRLL 563

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

850-1004

1.37e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 44.19 E-value: 1.37e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQSELEA-QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtRLGNAAAELAIKEQALAKLKGELKM 928
Cdd:TIGR00618  725 NASSSLGSDLAAREDAlNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGA-ELSHLAAEIQFFNRLREEDTHLLKT 803

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  929 EQGKVREQLEewqHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLG 1004
Cdd:TIGR00618  804 LEAEIGQEIP---SDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876

Myosin_tail_1

pfam01576

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...

1875-2315

1.69e-03

Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 44.01 E-value: 1.69e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1875 QKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV-----SQMDHQQR------CLQEAENKHSESMFALQG 1943
Cdd:pfam01576  352 QKHTQALEELTEQLEQA-KRNKANLEKAKQALESENAELQAELrtlqqAKQDSEHKrkklegQLQELQARLSESERQRAE 430

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1944 RYEEEIRCMVEQLSHTENTLQAER-----SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRfqlkVRELQavhqEELRALQ 2018
Cdd:pfam01576  431 LAEKLSKLQSELESVSSLLNEAEGkniklSKDVSSLESQLQDTQELLQEETRQKLNLSTR----LRQLE----DERNSLQ 502

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2019 EHYiwslrgalslyqpshpdsslapgpsepravpaaKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQE-- 2096
Cdd:pfam01576  503 EQL---------------------------------EEEEEAKRNVERQLSTLQAQLSDMKKKL--EEDAGTLEALEEgk 547

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2097 -KYQRDFESLKATCERGFAAMEETH------QKKIEDL------QRQHQRELEKLREEKDRLLAEETAATISAIEAMKNA 2163
Cdd:pfam01576  548 kRLQRELEALTQQLEEKAAAYDKLEktknrlQQELDDLlvdldhQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRA 627

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2164 HREEMERELEksqrsqissinsdiealrrqyleelqsvqreleVLSeqysqkclenahLAQALEAERQALRQCQRENQEL 2243
Cdd:pfam01576  628 EAEAREKETR---------------------------------ALS------------LARALEEALEAKEELERTNKQL 662

                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 2244 NAHNQELNNRLAAeitrlrtlltgdgggestglpltQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTA 2315
Cdd:pfam01576  663 RAEMEDLVSSKDD-----------------------VGKNVHELERSKRALEQQVEEMKTQLEELEDELQAT 711

SMC_prok_B

TIGR02168

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...

2056-2320

1.74e-03

Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 43.89 E-value: 1.74e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2056 DEAESMSGLRERIQELEAQMGVMREELG-----HKELEGDVAALQ------EKYQRDFESLKATcERGFAAME-ETHQKK 2123
Cdd:TIGR02168  162 EEAAGISKYKERRKETERKLERTRENLDrlediLNELERQLKSLErqaekaERYKELKAELREL-ELALLVLRlEELREE 240

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2124 IEDLQRQhQRELEKLREEKDRLLAEETAAtisaIEAMKNAHREeMERELEKSQRS--QISSINSDIEALRRQYLEELQSV 2201
Cdd:TIGR02168  241 LEELQEE-LKEAEEELEELTAELQELEEK----LEELRLEVSE-LEEEIEELQKElyALANEISRLEQQKQILRERLANL 314

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2202 QRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqg 2281
Cdd:TIGR02168  315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE------------------- 375

                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1039737300 2282 kdayELEVLLRVKESEIQYLKQEISSLKDELQTALRDKK 2320
Cdd:TIGR02168  376 ----ELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410

PH_RhoGap25-like

cd13263

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; ...

507-597

1.95e-03

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; RhoGAP25 (also called ArhGap25) like other RhoGaps are involved in cell polarity, cell morphology and cytoskeletal organization. They act as GTPase activators for the Rac-type GTPases by converting them to an inactive GDP-bound state and control actin remodeling by inactivating Rac downstream of Rho leading to suppress leading edge protrusion and promotes cell retraction to achieve cellular polarity and are able to suppress RAC1 and CDC42 activity in vitro. Overexpression of these proteins induces cell rounding with partial or complete disruption of actin stress fibers and formation of membrane ruffles, lamellipodia, and filopodia. This hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25. Members here contain an N-terminal PH domain followed by a RhoGAP domain and either a BAR or TATA Binding Protein (TBP) Associated Factor 4 (TAF4) domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270083 Cd Length: 114 Bit Score: 40.06 E-value: 1.95e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCyDVTEYPVQRN----YGFQIHTKEGE---- 577
Cdd:cd13263      5 KSGWLKKQGSIvKNWQQRWFVLRGDQLYYYKD---EDDTKPQGTIPLPGN-KVKEVPFNPEepgkFLFEIIPGGGGdrmt 80

                           90       100
                   ....*....|....*....|....*
gi 1039737300  578 -----FTLSAMTSGIRRNWIQTIMK 597
Cdd:cd13263     81 snhdsYLLMANSQAEMEEWVKVIRR 105

MukB

COG3096

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...

2063-2351

1.97e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 43.79 E-value: 1.97e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2063 GLRERIQELEAQMGVMREElgHKELEGDVAALQE----------KYQRDFESLKATceRGFAAMEETHQKKIEDLQRQHQ 2132
Cdd:COG3096    372 EAAEQLAEAEARLEAAEEE--VDSLKSQLADYQQaldvqqtraiQYQQAVQALEKA--RALCGLPDLTPENAEDYLAAFR 447

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEkdrLLAEETAATISaiEAMKNAHREEME------------------RELEKSQRSQiSSINSDIEALRRQY 2194
Cdd:COG3096    448 AKEQQATEE---VLELEQKLSVA--DAARRQFEKAYElvckiageversqawqtaRELLRRYRSQ-QALAQRLQQLRAQL 521

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2195 --LEELQSVQRELEVLSEQYSQ---KCLENA----HLAQALEAERQAL-----------RQCQRENQELNAHNQELNNRL 2254
Cdd:COG3096    522 aeLEQRLRQQQNAERLLEEFCQrigQQLDAAeeleELLAELEAQLEELeeqaaeaveqrSELRQQLEQLRARIKELAARA 601

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2255 AAEIT---RLRTLltgdggGESTGLPLT--QGKDAYeLEVLLRvKESEIQYLKQEISSLKDELQTALRdkkyasdkykdi 2329
Cdd:COG3096    602 PAWLAaqdALERL------REQSGEALAdsQEVTAA-MQQLLE-REREATVERDELAARKQALESQIE------------ 661

                          330       340
                   ....*....|....*....|..
gi 1039737300 2330 ytELSIAKAKADCDISRLKEQL 2351
Cdd:COG3096    662 --RLSQPGGAEDPRLLALAERL 681

Apolipoprotein

pfam01442

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...

2059-2235

2.05e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.

Pssm-ID: 460211 [Multi-domain] Cd Length: 175 Bit Score: 41.48 E-value: 2.05e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2059 ESMSGLRERIQELEAQMGVMREELgHKELEGDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:pfam01442    4 DSLDELSTYAEELQEQLGPVAQEL-VDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEEL 82

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 ReekdrllaEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:pfam01442   83 R--------KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLAPYAEELRQKLAERLEELKESLAPYAEEVQAQLSQ 154

                          170
                   ....*....|....*...
gi 1039737300 2219 NA-HLAQALEAERQALRQ 2235
Cdd:pfam01442  155 RLqELREKLEPQAEDLRE 172

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

1775-2360

2.28e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 43.42 E-value: 2.28e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1775 QDELAQQLREKASILEEISAALPVLPPTEPLGgcqrLLRMSQHLSYESCLEGLGQYSSLLVQDAIIQAQVCYAACRIRLE 1854
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLA----LMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEK 226

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1855 YEKELRFYKKACQEAKGASGQKRAQAVGALKEEYE-ELLHKQKSEYQKVITLIEKENTELK------------AKVSQMD 1921
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLlKQLRARIEELRAQEAVLEETQERINrarkaaplaahiKAVTQIE 306

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1922 HQ-QRCLQEAENKHSESMFALQGRYE-EEIRCMVEQLSHTENTLQAERSRVLSQLDA-----SVKDRQAMEQHHV----Q 1990
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAhVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVatsirEISCQQHTLTQHIhtlqQ 386

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1991 QMKMLEDRFQLKVRELQAVHQE---------ELRALQEHYIwSLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESM 2061
Cdd:TIGR00618  387 QKTTLTQKLQSLCKELDILQREqatidtrtsAFRDLQGQLA-HAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESA 465

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2062 SGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFES--LKATCERGFAAMEETHQKK---IEDLQRQHQRELE 2136
Cdd:TIGR00618  466 QSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGscIHPNPARQDIDNPGPLTRRmqrGEQTYAQLETSEE 545

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2137 KLREEKDRLLaeETAATISAieamknahREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKC 2216
Cdd:TIGR00618  546 DVYHQLTSER--KQRASLKE--------QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSEAEDMLA 611

                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2217 LEN-AHL-----AQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTgdgggestglplTQGKDAYELEVL 2290
Cdd:TIGR00618  612 CEQhALLrklqpEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSI------------RVLPKELLASRQ 679

                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2291 LrvKESEIQYLKQEISSLKDEL---QTALRDKKYASDKYKDIYTELSIAKAKAdcdISRLKEQLKAATEALGE 2360
Cdd:TIGR00618  680 L--ALQKMQSEKEQLTYWKEMLaqcQTLLRELETHIEEYDREFNEIENASSSL---GSDLAAREDALNQSLKE 747

PRK11281

mechanosensitive channel MscK;

857-1065

2.46e-03

mechanosensitive channel MscK;

Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 43.36 E-value: 2.46e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQ---IQTLKHSYgEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKG--------- 924
Cdd:PRK11281    39 ADVQAQLDALNKQKLLEAEDklvQQDLEQTL-ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDdndeetret 117

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  925 -------ELKMEQGKVREQLEEWQHSKAMLSGQL--------RASEQkLRSTEARLLEKTQELRDLETQQALQRDRQKev 989
Cdd:PRK11281   118 lstlslrQLESRLAQTLDQLQNAQNDLAEYNSQLvslqtqpeRAQAA-LYANSQRLQQIRNLLKGGKVGGKALRPSQR-- 194

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  990 QRLQECIAELSQQ-------LGTSEQAQRLMEKklKRNYTLLLESCEQEKQALLQNLkeVEDKasaYEDQLQGHVQQVEA 1062
Cdd:PRK11281   195 VLLQAEQALLNAQndlqrksLEGNTQLQDLLQK--QRDYLTARIQRLEHQLQLLQEA--INSK---RLTLSEKTVQEAQS 267


                   ...
gi 1039737300 1063 LQK 1065
Cdd:PRK11281   268 QDE 270

DR0291

COG1579

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...

2067-2211

2.89e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];

Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.83 E-value: 2.89e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELghKELEGDVAALQ---EKYQRDFESLKATCERgFAAMEETHQKKIEDLQRQH-----QRELEKL 2138
Cdd:COG1579     18 ELDRLEHRLKELPAEL--AELEDELAALEarlEAAKTELEDLEKEIKR-LELEIEEVEARIKKYEEQLgnvrnNKEYEAL 94

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2139 REEKDRLLAEETAATISAIEAMKNahREEMERELEKSQrSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMER--IEELEEELAELE-AELAELEAELEEKKAELDEELAELEAELEELEAE 164

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

950-1120

2.94e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.98 E-value: 2.94e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  950 QLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKK--LKRNYTLL--- 1024
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELerLDASSDDLaal 690

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1025 ---LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQS 1101
Cdd:COG4913    691 eeqLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELREN 770

                          170
                   ....*....|....*....
gi 1039737300 1102 LHDERDLIKHQFQELMERV 1120
Cdd:COG4913    771 LEERIDALRARLNRAEEEL 789

PRK09039

peptidoglycan -binding protein;

897-1039

3.23e-03

peptidoglycan -binding protein;

Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 42.26 E-value: 3.23e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  897 EIQTLQTRLGNAAAELAIKEQALA---KLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELr 973
Cdd:PRK09039    47 EISGKDSALDRLNSQIAELADLLSlerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL- 125

                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  974 DLETQQALQRDRQkeVQRLQECIAELSQQLGTSEQAQRLMEKKlkrnytlllescEQEKQALLQNL 1039
Cdd:PRK09039   126 DSEKQVSARALAQ--VELLNQQIAALRRQLAALEAALDASEKR------------DRESQAKIADL 177

COG4372

Uncharacterized protein, contains DUF3084 domain [Function unknown];

895-1213

3.34e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];

Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.20 E-value: 3.34e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  895 EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD 974
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  975 LETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLEScEQEKQALLQNLKEVEDKASAYE 1050
Cdd:COG4372     92 AQAELAQAQEEleslQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER-EEELKELEEQLESLQEELAALE 170

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1051 DQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1130
Cdd:COG4372    171 QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEE 250

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQE 1210
Cdd:COG4372    251 LLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLEL 330


                   ...
gi 1039737300 1211 KEE 1213
Cdd:COG4372    331 ALA 333

PH_DAPP1

cd10573

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...

69-145

3.85e-03

Pssm-ID: 269977 [Multi-domain] Cd Length: 96 Bit Score: 38.84 E-value: 3.85e-03

                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300   69 WQRRFFILYEHgLLRYALDEMPTTlPQGTINMNQCTDVvDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEML 145
Cdd:cd10573     19 WKTRWFVLRRN-ELKYFKTRGDTK-PIRVLDLRECSSV-QRDYSQGKVNCFCLVFPERTFYMYANTEEEADEWVKLL 92

PTZ00121

MAEBL; Provisional

910-1187

3.89e-03

MAEBL; Provisional

Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.82 E-value: 3.89e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  910 AELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV 989
Cdd:PTZ00121  1542 AEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM---ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  990 QRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLS 1069
Cdd:PTZ00121  1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1070 ETCKGSEQVHKLEEELEAREASIRQlaqhvqslHDERDLIKhqfQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHH 1149
Cdd:PTZ00121  1699 EEAKKAEELKKKEAEEKKKAEELKK--------AEEENKIK---AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767

                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1039737300 1150 RVSVQLQSVRTLLreKEEELKHIKETHERVLEKKDQDL 1187
Cdd:PTZ00121  1768 KKAEEIRKEKEAV--IEEELDEEDEKRRMEVDKKIKDI 1803

DUF4618

pfam15397

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins ...

2054-2199

3.91e-03

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins in this family are typically between 238 and 363 amino acids in length. There are two conserved sequence motifs: EYP and KCTPD.

Pssm-ID: 464704 [Multi-domain] Cd Length: 258 Bit Score: 41.48 E-value: 3.91e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2054 AKDEAESMSGLRERIQELEAQMGVMREELG----HKELEGDVAALQ-EKYQRDFESLKatcergfaameETHQKKIEDLQ 2128
Cdd:pfam15397   76 EEKEESKLNKLEQQLEQLNAKIQKTQEELNflstYKDKEYPVKAVQiANLVRQLQQLK-----------DSQQDELDELE 144

                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2129 RQHQRELEKL----REEKDRLL---AEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQ 2199
Cdd:pfam15397  145 EMRRMVLESLsrkiQKKKEKILsslAEKTLSPYQESLLQKTRDNQVMLKEIEQ-FREFIDELEEEIPKLKAE-VQQLQ 220

MukB

COG3096

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...

796-1070

4.06e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 42.63 E-value: 4.06e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVA--TSPSGAWQRlhrvnqdlQSELEAQCRRQELI 873
Cdd:COG3096    439 AEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYELVCKIAgeVERSQAWQT--------ARELLRRYRSQQAL 510

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  874 TQQIQTLKHSYGEA-KDAIRHHEAEiqtlqtrlgnaaaelaikeqalaKLKGELKMEQGKVREQLEEWQHSKAMLSGQLR 952
Cdd:COG3096    511 AQRLQQLRAQLAELeQRLRQQQNAE-----------------------RLLEEFCQRIGQQLDAAEELEELLAELEAQLE 567

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  953 ASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS--------EQAQRLMEKklKRNYTLL 1024
Cdd:COG3096    568 ELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALERLREQSGEAladsqevtAAMQQLLER--EREATVE 645

                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1025 LESCEQEKQALLQNLKEVEDKASAYEDQLqghVQQVEALQKEKLSE 1070
Cdd:COG3096    646 RDELAARKQALESQIERLSQPGGAEDPRL---LALAERLGGVLLSE 688

PH_ACAP

cd13250

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP ...

507-595

4.21e-03

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP (also called centaurin beta) functions both as a Rab35 effector and as an Arf6-GTPase-activating protein (GAP) by which it controls actin remodeling and membrane trafficking. ACAP contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain, a phospholipid-binding domain, a PH domain, a GAP domain, and four ankyrin repeats. The AZAPs constitute a family of Arf GAPs that are characterized by an NH2-terminal pleckstrin homology (PH) domain and a central Arf GAP domain followed by two or more ankyrin repeats. On the basis of sequence and domain organization, the AZAP family is further subdivided into four subfamilies: 1) the ACAPs contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain (a phospholipid-binding domain that is thought to sense membrane curvature), a single PH domain followed by the GAP domain, and four ankyrin repeats; 2) the ASAPs also contain an NH2-terminal BAR domain, the tandem PH domain/GAP domain, three ankyrin repeats, two proline-rich regions, and a COOH-terminal Src homology 3 domain; 3) the AGAPs contain an NH2-terminal GTPase-like domain (GLD), a split PH domain, and the GAP domain followed by four ankyrin repeats; and 4) the ARAPs contain both an Arf GAP domain and a Rho GAP domain, as well as an NH2-terminal sterile-a motif (SAM), a proline-rich region, a GTPase-binding domain, and five PH domains. PMID 18003747 and 19055940 Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270070 Cd Length: 98 Bit Score: 38.74 E-value: 4.21e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLdgEINLSTCYDVTEYPVQRNYGFQIHTKEGEFTLSAMT 584
Cdd:cd13250      1 KEGYLFKRSSNafKTWKRRWFSLQNGQLYYQKRDKKDEPTVM--VEDLRLCTVKPTEDSDRRFCFEVISPTKSYMLQAES 78

                           90
                   ....*....|.
gi 1039737300  585 SGIRRNWIQTI 595
Cdd:cd13250     79 EEDRQAWIQAI 89

PH_TAAP2-like

cd13255

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 ...

507-595

4.35e-03

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 (also called PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4, 5)P3, function as negative regulators of insulin and PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin complex). TAPP2 contains two sequential PH domains in which the C-terminal PH domain specifically binds PtdIns(3,4)P2 with high affinity. The N-terminal PH domain does not interact with any phosphoinositide tested. They also contain a C-terminal PDZ-binding motif that interacts with several PDZ-binding proteins, including PTPN13 (known previously as PTPL1 or FAP-1) as well as the scaffolding proteins MUPP1 (multiple PDZ-domain-containing protein 1), syntrophin and utrophin. The members here are most sequence similar to TAPP2 proteins, but may not be actual TAPP2 proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270075 Cd Length: 110 Bit Score: 38.93 E-value: 4.35e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDGQ-WKKHWFVLADQSLRYYRDSVAEEAADLdgeINLSTCYDVTEYPVQRN-YGFQIHTKEGEFTLSAMT 584
Cdd:cd13255      8 KAGYLEKKGERRKtWKKRWFVLRPTKLAYYKNDKEYRLLRL---IDLTDIHTCTEVQLKKHdNTFGIVTPARTFYVQADS 84

                           90
                   ....*....|.
gi 1039737300  585 SGIRRNWIQTI 595
Cdd:cd13255     85 KAEMESWISAI 95

PH_DOCK-D

cd13267

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...

506-597

4.64e-03

Pssm-ID: 270087 Cd Length: 126 Bit Score: 39.23 E-value: 4.64e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQYEDGQ----------WKKHWFVL---ADQS--LRYYRDsvaEEAADLDGEINLSTCYDVTEYPVQRNYGFQ 570
Cdd:cd13267      7 TKEGYLYKGPENSSdsfislamksFKRRFFHLkqlVDGSyiLEFYKD---EKKKEAKGTIFLDSCTGVVQNSKRRKFCFE 83

                           90       100
                   ....*....|....*....|....*...
gi 1039737300  571 IHTKEGE-FTLSAMTSGIRRNWIQTIMK 597
Cdd:cd13267     84 LRMQDKKsYVLAAESEAEMDEWISKLNK 111

Mitofilin

pfam09731

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...

887-1068

4.86e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.

Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 42.05 E-value: 4.86e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  887 AKDAIRHHEAEIQTLQTRlGNAAAELAIKEQALAKLKGELKmeqgkVREQLEEwqhskamlsgQLRASEQKLRSTEARLL 966
Cdd:pfam09731  292 AHREIDQLSKKLAELKKR-EEKHIERALEKQKEELDKLAEE-----LSARLEE----------VRAADEAQLRLEFERER 355

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  967 EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseqaQRLMEKKLKrnytlllESCEQEKQALLQNLKEVEDKA 1046
Cdd:pfam09731  356 EEIRESYEEKLRTELERQAEAHEEHLKDVLVEQEIEL------QREFLQDIK-------EKVEEERAGRLLKLNELLANL 422

                          170       180
                   ....*....|....*....|...
gi 1039737300 1047 SAYEDQLQGHVQQV-EALQKEKL 1068
Cdd:pfam09731  423 KGLEKATSSHSEVEdENRKAQQL 445

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

796-961

5.17e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 5.17e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGY---VLQTEVAtspsgAWQ-RLHRVNQD------LQSELEA 865
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIdvaSAEREIA-----ELEaELERLDASsddlaaLEEQLEE 696

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  866 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKA 945
Cdd:COG4913    697 LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID 776

                          170
                   ....*....|....*.
gi 1039737300  946 MLSGQLRASEQKLRST 961
Cdd:COG4913    777 ALRARLNRAEEELERA 792

PH_KIFIA_KIFIB

cd01233

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA ...

507-595

5.23e-03

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA (Caenorhabditis elegans homolog unc-104) and KIFIB transport synaptic vesicle precursors that contain synaptic vesicle proteins, such as synaptophysin, synaptotagmin and the small GTPase RAB3A, but they do not transport organelles that contain plasma membrane proteins. They have a N-terminal motor domain, followed by a coiled-coil domain, and a C-terminal PH domain. KIF1A adopts a monomeric form in vitro, but acts as a processive dimer in vivo. KIF1B has alternatively spliced isoforms distinguished by the presence or absence of insertion sequences in the conserved amino-terminal region of the protein; this results in their different motor activities. KIF1A and KIF1B bind to RAB3 proteins through the adaptor protein mitogen-activated protein kinase (MAPK) -activating death domain (MADD; also calledDENN), which was first identified as a RAB3 guanine nucleotide exchange factor (GEF). PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 269939 Cd Length: 103 Bit Score: 38.73 E-value: 5.23e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyEDG--QWKKHWFVLADQSLRYYRDSvaeeaADLD--GEINLSTC---YDV-TEYPVQRNYGFQIHTKEGEF 578
Cdd:cd01233      8 KRGYLLFL-EDAtdGWVRRWVVLRRPYLHIYSSE-----KDGDerGVINLSTArveYSPdQEALLGRPNVFAVYTPTNSY 81

                           90
                   ....*....|....*..
gi 1039737300  579 TLSAMTSGIRRNWIQTI 595
Cdd:cd01233     82 LLQARSEKEMQDWLYAI 98

PH1_PH_fungal

cd13298

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal ...

506-595

5.41e-03

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal proteins are unknown, but they all contain 2 PH domains. This cd represents the first PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.

Pssm-ID: 270110 Cd Length: 106 Bit Score: 38.76 E-value: 5.41e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCYDVTEYPV-QRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13298      7 LKSGYLLKRSRKtKNWKKRWVVLRPCQLSYYKD---EKEYKLRRVINLSELLAVAPLKDkKRKNVFGIYTPSKNLHFRAT 83

                           90
                   ....*....|..
gi 1039737300  584 TSGIRRNWIQTI 595
Cdd:cd13298     84 SEKDANEWVEAL 95

CwlO1

COG3883

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...

796-971

5.69e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];

Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 41.74 E-value: 5.69e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQ---DQLRVALGR------EQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEaq 866
Cdd:COG3883     56 LQAELEALQAEIDKLQAEIAEAEaeiEERREELGEraralyRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADADAD-- 133

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  867 crrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAM 946
Cdd:COG3883    134 ------LLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAA 207

                          170       180
                   ....*....|....*....|....*
gi 1039737300  947 LSGQLRASEQKLRSTEARLLEKTQE 971
Cdd:COG3883    208 AEAAAAAAAAAAAAAAAAAAAAAAA 232

Yuri_gagarin

pfam15934

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.

754-982

6.57e-03

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.

Pssm-ID: 318204 [Multi-domain] Cd Length: 234 Bit Score: 40.71 E-value: 6.57e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  754 EQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLstheltslleKELEQSQKEASDLLEQNRLlqdqlrvalgreqsARE 833
Cdd:pfam15934   45 EQEQQLKEFTVQNQRLACQIDNLHETLKDRDHQI----------KQLQSMITGYSDISENNRL--------------KEE 100

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  834 GYVLQTEVATspsgawqrLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNaaaela 913
Cdd:pfam15934  101 IHDLKQKNCV--------QARVVRKMGLELKGQEEQRVELCDKYESLLGSFEEQCQELKRANRRVQSLQTRLSQ------ 166

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300  914 ikeqaLAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQ 982
Cdd:pfam15934  167 -----VEKLQEELRTERKILREEVIALKEKDAKSNGRERALQDQLKCCQTEIEKSRTLIRNMQSHLQLE 230

Golgin_A5

pfam09787

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...

2153-2266

6.91e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.

Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 40.90 E-value: 6.91e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2153 TISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQA 2232
Cdd:pfam09787   43 TALTLELEELRQERDLLREEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122

                           90       100       110
                   ....*....|....*....|....*....|....
gi 1039737300 2233 LRQCQRENQELNAHNQELNNRLAAEITRLRTLLT 2266
Cdd:pfam09787  123 LRYLEEELRRSKATLQSRIKDREAEIEKLRNQLT 156

CALCOCO1

pfam07888

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...

991-1203

7.60e-03

Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 41.42 E-value: 7.60e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQLGTSEQAQRLMEKKlKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQlqghVQQVEALQKEKLSE 1070
Cdd:pfam07888   35 RLEECLQERAELLQAQEAANRQREKE-KERYKRDREQWERQRRELESRVAELKEELRQSREK----HEELEEKYKELSAS 109

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1071 TCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHR 1150
Cdd:pfam07888  110 SEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRS 189

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 1151 VSVQLQSVRTLLREKEEELKHIKEThervLEKKDQDLNEALVKMIALGSSLEE 1203
Cdd:pfam07888  190 LSKEFQELRNSLAQRDTQVLQLQDT----ITTLTQKLTTAHRKEAENEALLEE 238

DR0291

COG1579

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...

796-974

8.22e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];

Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 40.29 E-value: 8.22e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVA------LGREQSAREGYVLQTEvatspsgawQRLHRVNQDLQS-----ELE 864
Cdd:COG1579     22 LEHRLKELPAELAELEDELAALEARLEAAkteledLEKEIKRLELEIEEVE---------ARIKKYEEQLGNvrnnkEYE 92

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  865 AqcrrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhsk 944
Cdd:COG1579     93 A-------LQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEE----- 160

                          170       180       190
                   ....*....|....*....|....*....|.
gi 1039737300  945 amlsgqLRASEQKLRST-EARLLEKTQELRD 974
Cdd:COG1579    161 ------LEAEREELAAKiPPELLALYERIRK 185

TOPEUc

smart00435

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...

2049-2145

8.27e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras

Pssm-ID: 214661 [Multi-domain] Cd Length: 391 Bit Score: 41.18 E-value: 8.27e-03

                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  2049 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2125
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346

                            90       100
                    ....*....|....*....|
gi 1039737300  2126 DLQRQHQReLEKLREEKDRL 2145
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365

sbcc

TIGR00618

exonuclease SbcC; All proteins in this family for which functions are known are part of an ...

1861-2362

8.43e-03

Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 41.49 E-value: 8.43e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1861 FYKKACQEAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAENKHSESMFA 1940
Cdd:TIGR00618  186 FAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQL 265

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1941 LqgRYEEEIRCMVEQLSHTENTLQAERSrvlsqldasvKDRQAMEQHHV----QQMKMLEDRFQLKVREL-QAVHQEELR 2015
Cdd:TIGR00618  266 R--ARIEELRAQEAVLEETQERINRARK----------AAPLAAHIKAVtqieQQAQRIHTELQSKMRSRaKLLMKRAAH 333

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2016 ALQEHYIWSLRGALSLYQPSHPDSslapgpsepravpaaKDEAESMSGLREriqELEAQmgvmreelghKELEGDVAALQ 2095
Cdd:TIGR00618  334 VKQQSSIEEQRRLLQTLHSQEIHI---------------RDAHEVATSIRE---ISCQQ----------HTLTQHIHTLQ 385

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2096 EKYQRDFESLKATCERGFAAMEETHQKKIEDLQR----------QHQRELEKLREEKDRLLAEETAAtisaIEAMKNAHR 2165
Cdd:TIGR00618  386 QQKTTLTQKLQSLCKELDILQREQATIDTRTSAFrdlqgqlahaKKQQELQQRYAELCAAAITCTAQ----CEKLEKIHL 461

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2166 EEM-----ERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEvlseqysQKCLE-NAHLAQALEAERQALRQCQRE 2239
Cdd:TIGR00618  462 QESaqslkEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLC-------GSCIHpNPARQDIDNPGPLTRRMQRGE 534

                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2240 NqELNAHNQELNN---RLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEV-LLRVKESEIQYLKQEISSLKD----E 2311
Cdd:TIGR00618  535 Q-TYAQLETSEEDvyhQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIpNLQNITVRLQDLTEKLSEAEDmlacE 613

                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 2312 LQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKS 2362
Cdd:TIGR00618  614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHA 664

PRK03918

DNA double-strand break repair ATPase Rad50;

2054-2266

9.25e-03

DNA double-strand break repair ATPase Rad50;

Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 41.20 E-value: 9.25e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2054 AKDEAESMSGLRERIQELEAQMGVMREELGH-KELEGDVAALQEKYQRDFESL----KATCERGFAAMEEThQKKIEDLQ 2128
Cdd:PRK03918   520 LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKlEELKKKLAELEKKLDELEEELaellKELEELGFESVEEL-EERLKELE 598

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2129 RQHQRELE------KLREEKDRL-LAEETAATISAIEAMKNAHREEMERELEKSQRsqiSSINSDIEALRRQYLE---EL 2198
Cdd:PRK03918   599 PFYNEYLElkdaekELEREEKELkKLEEELDKAFEELAETEKRLEELRKELEELEK---KYSEEEYEELREEYLElsrEL 675

                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2199 QSVQRELEVLSEQYSqkclENAHLAQALEAERQALRQCQRENQELNAHNQELnNRLAAEITRLRTLLT 2266
Cdd:PRK03918   676 AGLRAELEELEKRRE----EIKKTLEKLKEELEEREKAKKELEKLEKALERV-EELREKVKKYKALLK 738

Smc

COG1196

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...

1867-2318

9.26e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];

Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 41.46 E-value: 9.26e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1867 QEAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAENKHSESMFALQGRYE 1946
Cdd:COG1196    258 LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEE 337

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1947 EEIRCMvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSLR 2026
Cdd:COG1196    338 ELEELE-EELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERL 416

                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2027 GALSLYQPSHPDSSLApgpSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLK 2106
Cdd:COG1196    417 ERLEEELEELEEALAE---LEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELL--EEAALLEAALAELLEELAEAAA 491

                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2107 AtcERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSd 2186
Cdd:COG1196    492 R--LLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA- 568

                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2187 iEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQREN---QELNAHNQELNNRLAAEITRLRT 2263
Cdd:COG1196    569 -AKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTllgRTLVAARLEAALRRAVTLAGRLR 647

                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300 2264 LLTGDGGGESTGLPLTQGKDAyELEVLLRVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    648 EVTLEGEGGSAGGSLTGGSRR-ELLAALLEAEAELEELAERLAEEELELEEALLA 701

COG4913

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

2120-2318

9.33e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];

Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 41.44 E-value: 9.33e-03

                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2120 HQKKIEDLQRQH-------QRELEKLREEKDRLlAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRR 2192
Cdd:COG4913    590 HEKDDRRRIRSRyvlgfdnRAKLAALEAELAEL-EEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAER 668

                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2193 qyleELQSVQRELEVLSEqySQKCLEnaHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA---EITRLRTLLtgDG 2269
Cdd:COG4913    669 ----EIAELEAELERLDA--SSDDLA--ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQaeeELDELQDRL--EA 738

                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2270 GGESTGLPLTQGKDAYELEVLLRVKESEIQY-LKQEISSLKDELQTALRD 2318
Cdd:COG4913    739 AEDLARLELRALLEERFAAALGDAVERELREnLEERIDALRARLNRAEEE 788

Blast search parameters

Data Source:	Precalculated data, version = cdd.v.3.21
Preset Options:	Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01