NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1039737300|ref|XP_017170057|]
View 

myosin phosphatase Rho-interacting protein isoform X1 [Mus musculus]

Protein Classification

PH_RIP and PH_M-RIP domain-containing protein( domain architecture ID 12913567)

protein containing domains PH_RIP, PH_M-RIP, Smc, and SMC_prok_B

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PH_RIP cd01236
Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...
16-151 1.34e-79

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen for proteins that bind to wild-type RhoA. RIP2, RIP3, and RIP4 were isolated from cDNA libraries with constitutively active V14RhoA (containing the C190R mutation). RIP2 represents a novel GDP/GTP exchange factor (RhoGEF), while RIP3 (p116Rip) and RIP4 are thought to be structural proteins. RhoGEF contains a Dbl(DH)/PH region, a a zinc finger motif, a leucine-rich domain, and a coiled-coil region. The last 2 domains are thought to be involved in mediating protein-protein interactions. RIP3 is a negative regulator of RhoA signaling that inhibits, either directly or indirectly, RhoA-stimulated actomyosin contractility. In plants RIP3 is localized at microtubules and interacts with the kinesin-13 family member AtKinesin-13A, suggesting a role for RIP3 in microtubule reorganization and a possible function in Rho proteins of plants (ROP)-regulated polar growth. It has a PH domain, two proline-rich regions which are putative binding sites for SH3 domains, and a COOH-terminal coiled-coil region which overlaps with the RhoA-binding region. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


:

Pssm-ID: 269942  Cd Length: 136  Bit Score: 258.52  E-value: 1.34e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
507-608 5.60e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


:

Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.60e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
796-1110 6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.97  E-value: 6.01e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2053-2318 3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.65  E-value: 3.08e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1758-2360 1.48e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.91  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902

                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905
 
Name Accession Description Interval E-value
PH_RIP cd01236
Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...
16-151 1.34e-79

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen for proteins that bind to wild-type RhoA. RIP2, RIP3, and RIP4 were isolated from cDNA libraries with constitutively active V14RhoA (containing the C190R mutation). RIP2 represents a novel GDP/GTP exchange factor (RhoGEF), while RIP3 (p116Rip) and RIP4 are thought to be structural proteins. RhoGEF contains a Dbl(DH)/PH region, a a zinc finger motif, a leucine-rich domain, and a coiled-coil region. The last 2 domains are thought to be involved in mediating protein-protein interactions. RIP3 is a negative regulator of RhoA signaling that inhibits, either directly or indirectly, RhoA-stimulated actomyosin contractility. In plants RIP3 is localized at microtubules and interacts with the kinesin-13 family member AtKinesin-13A, suggesting a role for RIP3 in microtubule reorganization and a possible function in Rho proteins of plants (ROP)-regulated polar growth. It has a PH domain, two proline-rich regions which are putative binding sites for SH3 domains, and a COOH-terminal coiled-coil region which overlaps with the RhoA-binding region. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269942  Cd Length: 136  Bit Score: 258.52  E-value: 1.34e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
507-608 5.60e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.60e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
507-599 4.02e-16

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 76.05  E-value: 4.02e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 580
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82
                            90
                    ....*....|....*....
gi 1039737300   581 SAMTSGIRRNWIQTIMKHV 599
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101
PH pfam00169
PH domain; PH stands for pleckstrin homology.
507-595 2.89e-15

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 73.75  E-value: 2.89e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 577
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82
                           90
                   ....*....|....*...
gi 1039737300  578 FTLSAMTSGIRRNWIQTI 595
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
796-1110 6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.97  E-value: 6.01e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
784-1065 4.89e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.54  E-value: 4.89e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  784 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 863
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  864 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 943
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1020 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
848-1213 1.32e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 67.01  E-value: 1.32e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  848 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 927
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 990
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 1063
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1064 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1123
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1124 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1187
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544
                          410       420
                   ....*....|....*....|....*.
gi 1039737300 1188 NEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2053-2318 3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.65  E-value: 3.08e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
44-145 1.68e-08

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 54.09  E-value: 1.68e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300    44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCT-DVVDGEARTGQKFSLCI 121
Cdd:smart00233    1 VIKEGWLYKKSGG--------GKKSWKKRYFVLFNSTLLYYKSKKDKKSYkPKGSIDLSGCTvREAPDPDSSKKPHCFEI 72
                            90       100
                    ....*....|....*....|....*
gi 1039737300   122 LTPDKE-HFIRAETKEIISGWLEML 145
Cdd:smart00233   73 KTSDRKtLLLQAESEEEREKWVEAL 97
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
744-1239 2.17e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 59.80  E-value: 2.17e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  744 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 812
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  813 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 876
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  877 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 956
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  957 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 1036
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1037 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1106 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1184
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 1185 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1239
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2064-2256 6.31e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.53  E-value: 6.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039737300 2219 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2256
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
PH pfam00169
PH domain; PH stands for pleckstrin homology.
44-145 4.37e-07

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 50.25  E-value: 4.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCTDV-VDGEARTGQKFSLCI 121
Cdd:pfam00169    1 VVKEGWLLKKGGG--------KKKSWKKRYFVLFDGSLLYYKDDKSGKSKePKGSISLSGCEVVeVVASDSPKRKFCFEL 72
                           90       100
                   ....*....|....*....|....*...
gi 1039737300  122 LTPD----KEHFIRAETKEIISGWLEML 145
Cdd:pfam00169   73 RTGErtgkRTYLLQAESEEERKDWIKAI 100
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1758-2360 1.48e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.91  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902

                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1897-2366 8.51e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.27  E-value: 8.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1897 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1968
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1969 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 2048
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2049 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2105
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2106 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2174
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2175 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2248
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2249 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2312
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2313 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2366
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1883-2364 4.69e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 4.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1883 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1960
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1961 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 2038
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2039 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2118
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2119 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2185
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2186 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2265
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2266 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2331
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612
                          490       500       510
                   ....*....|....*....|....*....|...
gi 1039737300 2332 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2364
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1758-2235 3.11e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.91  E-value: 3.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1837
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1838 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1917
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1918 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1997
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1998 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 2077
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2078 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2155
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2156 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2230
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506

                   ....*
gi 1039737300 2231 QALRQ 2235
Cdd:COG4717    507 EEYRE 511
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1717-2352 4.60e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.83  E-value: 4.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1717 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1796
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1797 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1875
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1876 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1955
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1956 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 2035
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2036 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2113
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2114 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2191
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2192 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2268
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2269 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2347
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726

                   ....*
gi 1039737300 2348 KEQLK 2352
Cdd:PRK03918   727 REKVK 731
TOPEUc smart00435
DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...
2049-2145 8.27e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras


Pssm-ID: 214661 [Multi-domain]  Cd Length: 391  Bit Score: 41.18  E-value: 8.27e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  2049 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2125
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346
                            90       100
                    ....*....|....*....|
gi 1039737300  2126 DLQRQHQReLEKLREEKDRL 2145
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365
 
Name Accession Description Interval E-value
PH_RIP cd01236
Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen ...
16-151 1.34e-79

Rho-Interacting Protein Pleckstrin homology (PH) domain; RIP1-RhoGDI2 was obtained in a screen for proteins that bind to wild-type RhoA. RIP2, RIP3, and RIP4 were isolated from cDNA libraries with constitutively active V14RhoA (containing the C190R mutation). RIP2 represents a novel GDP/GTP exchange factor (RhoGEF), while RIP3 (p116Rip) and RIP4 are thought to be structural proteins. RhoGEF contains a Dbl(DH)/PH region, a a zinc finger motif, a leucine-rich domain, and a coiled-coil region. The last 2 domains are thought to be involved in mediating protein-protein interactions. RIP3 is a negative regulator of RhoA signaling that inhibits, either directly or indirectly, RhoA-stimulated actomyosin contractility. In plants RIP3 is localized at microtubules and interacts with the kinesin-13 family member AtKinesin-13A, suggesting a role for RIP3 in microtubule reorganization and a possible function in Rho proteins of plants (ROP)-regulated polar growth. It has a PH domain, two proline-rich regions which are putative binding sites for SH3 domains, and a COOH-terminal coiled-coil region which overlaps with the RhoA-binding region. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269942  Cd Length: 136  Bit Score: 258.52  E-value: 1.34e-79
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   16 IFNKSKCQNCFKPRESHLLNDEDLTQAKPIYGGWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHGLLRYALDEMPTTLPQ 95
Cdd:cd01236      1 NKSKCKCCFCFRPRHSHLALEEARMQRKVIYCGWLYVAPPGTDFSNPSHRSKRWQRRWFVLYDDGELTYALDEMPDTLPQ 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300   96 GTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEMLMVYPRT 151
Cdd:cd01236     81 GSIDMSQCTEVTDAEARTGHPHSLAITTPERIHFVKADSKEEIRWWLELLAVYPRT 136
PH_M-RIP cd13275
Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed ...
507-608 5.60e-47

Myosin phosphatase-RhoA Interacting Protein Pleckstrin homology (PH) domain; M-RIP is proposed to play a role in myosin phosphatase regulation by RhoA. M-RIP contains 2 PH domains followed by a Rho binding domain (Rho-BD), and a C-terminal myosin binding subunit (MBS) binding domain (MBS-BD). The amino terminus of M-RIP with its adjacent PH domains and polyproline motifs mediates binding to both actin and Galpha. M-RIP brings RhoA and MBS into close proximity where M-RIP can target RhoA to the myosin phosphatase complex to regulate the myosin phosphorylation state. M-RIP does this via its C-terminal coiled-coil domain which interacts with the MBS leucine zipper domain of myosin phosphatase, while its Rho-BD, directly binds RhoA in a nucleotide-independent manner. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270094  Cd Length: 104  Bit Score: 164.04  E-value: 5.60e-47
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVTEYPVQRNYGFQIHTKEGE-FTLSAMT 584
Cdd:cd13275      1 KKGWLMKQgSRQGEWSKHWFVLRGAALKYYRDPSAEEAGELDGVIDLSSCTEVTELPVSRNYGFQVKTWDGKvYVLSAMT 80
                           90       100
                   ....*....|....*....|....
gi 1039737300  585 SGIRRNWIQTIMKHVLPASAPDVT 608
Cdd:cd13275     81 SGIRTNWIQALRKAAGLPSPPALP 104
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
507-599 4.02e-16

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 76.05  E-value: 4.02e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTC---YDVTEYPVQRNYGFQIHTKEGE-FTL 580
Cdd:smart00233    3 KEGWLYKKSGGGkkSWKKRYFVLFNSTLLYYKSKKDKKSYKPKGSIDLSGCtvrEAPDPDSSKKPHCFEIKTSDRKtLLL 82
                            90
                    ....*....|....*....
gi 1039737300   581 SAMTSGIRRNWIQTIMKHV 599
Cdd:smart00233   83 QAESEEEREKWVEALRKAI 101
PH pfam00169
PH domain; PH stands for pleckstrin homology.
507-595 2.89e-15

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 73.75  E-value: 2.89e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDV---TEYPVQRNYGFQIHTKEG----E 577
Cdd:pfam00169    3 KEGWLLKKGGGkkKSWKKRYFVLFDGSLLYYKDDKSGKSKEPKGSISLSGCEVVevvASDSPKRKFCFELRTGERtgkrT 82
                           90
                   ....*....|....*...
gi 1039737300  578 FTLSAMTSGIRRNWIQTI 595
Cdd:pfam00169   83 YLLQAESEEERKDWIKAI 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
796-1110 6.01e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.97  E-value: 6.01e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEAS------DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATspsgawQRLHRVNQDLQSELEAQCRR 869
Cdd:COG1196    198 LERQLEPLERQAEkaeryrELKEELKELEAELLLLKLRELEAELEELEAELEEL------EAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QEL----ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQgkvrEQLEEWQHSKA 945
Cdd:COG1196    272 LRLeleeLELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELE----EELEELEEELE 347
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQLRASEQKLRSTEARLLEKTQELrdLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytlll 1025
Cdd:COG1196    348 EAEEELEEAEAELAEAEEALLEAEAEL--AEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL------ 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 escEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:COG1196    420 ---EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496

                   ....*
gi 1039737300 1106 RDLIK 1110
Cdd:COG1196    497 LEAEA 501
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
850-1209 9.17e-13

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 74.20  E-value: 9.17e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNqDLQSELEAQcrRQELITQQIQTLKhsYGEAKDAIRHHEAEIQTLQTRlgNAAAELAIKEQALAKLKGELKME 929
Cdd:COG1196    186 ENLERLE-DILGELERQ--LEPLERQAEKAER--YRELKEELKELEAELLLLKLR--ELEAELEELEAELEELEAELEEL 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  930 QGKVREQLEEWQHSKAmlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV----QRLQECIAELSQQLGT 1005
Cdd:COG1196    259 EAELAELEAELEELRL----ELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELeerlEELEEELAELEEELEE 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSEtckgseqvhklEEEL 1085
Cdd:COG1196    335 LEEELEELEEELEEA--------EEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEA-----------LRAA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1086 EAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREK 1165
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1039737300 1166 EEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1209
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGL 519
PH cd00821
Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...
507-595 2.76e-12

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 275388 [Multi-domain]  Cd Length: 92  Bit Score: 64.87  E-value: 2.76e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ--YEDGQWKKHWFVLADQSLRYYRDSvAEEAADLDGEINLSTCYDVTEY-PVQRNYGFQIHTKEGE-FTLSA 582
Cdd:cd00821      1 KEGYLLKRggGGLKSWKKRWFVLFEGVLLYYKSK-KDSSYKPKGSIPLSGILEVEEVsPKERPHCFELVTPDGRtYYLQA 79
                           90
                   ....*....|...
gi 1039737300  583 MTSGIRRNWIQTI 595
Cdd:cd00821     80 DSEEERQEWLKAL 92
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
784-1065 4.89e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.54  E-value: 4.89e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  784 SERLSTHELtSLLEKELEQSQKEASDLLEQNRLLQDQLRvALGREQSAREGYVLQTEVATSPsgawqrLHRVNQDLQSEL 863
Cdd:TIGR02168  219 KAELRELEL-ALLVLRLEELREELEELQEELKEAEEELE-ELTAELQELEEKLEELRLEVSE------LEEEIEELQKEL 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  864 EAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEqgkvREQLEEWQHS 943
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESL----EAELEELEAE 366
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR02168  367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARlerlEDRRERLQQEIEELLKKLEEAELKELQAELEELE 446
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1020 NYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:TIGR02168  447 EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
848-1213 1.32e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 67.01  E-value: 1.32e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  848 AWQRLHRVNQDLQSELEaqcRRQELITQQiqtlkhsyGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELK 927
Cdd:PRK03918   163 AYKNLGEVIKEIKRRIE---RLEKFIKRT--------ENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 mEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----------------QALQRDRQKEVQ 990
Cdd:PRK03918   232 -ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKvkelkelkekaeeyiklSEFYEEYLDELR 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQL-GTSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYED------QLQGHVQQVEAL 1063
Cdd:PRK03918   311 EIEKRLSRLEEEInGIEERIKELEEKEER------LEELKKKLKELEKRLEELEERHELYEEakakkeELERLKKRLTGL 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1064 QKEKLSETCKGSEQVHKLEEELEAREAS-IRQLAQHVQSLHDE---------------RDLIKHQFQELMER----VATS 1123
Cdd:PRK03918   385 TPEKLEKELEELEKAKEEIEEEISKITArIGELKKEIKELKKAieelkkakgkcpvcgRELTEEHRKELLEEytaeLKRI 464
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1124 DGDVAELQEKLRGKEVDYQNLEHSHHRVS--VQLQSVRTLLREKEEELKHI------------KETHERV--LEKKDQDL 1187
Cdd:PRK03918   465 EKELKEIEEKERKLRKELRELEKVLKKESelIKLKELAEQLKELEEKLKKYnleelekkaeeyEKLKEKLikLKGEIKSL 544
                          410       420
                   ....*....|....*....|....*.
gi 1039737300 1188 NEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK03918   545 KKELEKLEELKKKLAELEKKLDELEE 570
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
850-1245 2.47e-10

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 66.23  E-value: 2.47e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQ------SELEAQCRRqeLITQQIQTLKhsYGEAKDAIRHHEAEIQTLQtrlgnaaaelaiKEQALAKLK 923
Cdd:TIGR02168  179 RKLERTRENLDrledilNELERQLKS--LERQAEKAER--YKELKAELRELELALLVLR------------LEELREELE 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  924 gELKMEQGKVREQLEEwqhskamLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQL 1003
Cdd:TIGR02168  243 -ELQEELKEAEEELEE-------LTAELQELEEKLEELRLEVSELEEEIEEL----------QKELYALANEISRLEQQK 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1004 GTSEQAQRLMEKKLKRnYTLLLESCEQEKQALLQNLKEVEDKasayEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEE 1083
Cdd:TIGR02168  305 QILRERLANLERQLEE-LEAQLEELESKLDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEE 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1084 EleareasIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEklrgkevdyQNLEHSHHRVSVQLQSVRTLLR 1163
Cdd:TIGR02168  380 Q-------LETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ---------EIEELLKKLEEAELKELQAELE 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1164 EKEEELKHIKETHERV---LEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFvSDSPKDAKEPLsttEPTEEGS 1240
Cdd:TIGR02168  444 ELEEELEELQEELERLeeaLEELREELEEAEQALDAAERELAQLQARLDSLERLQENL-EGFSEGVKALL---KNQSGLS 519

                   ....*
gi 1039737300 1241 GILPL 1245
Cdd:TIGR02168  520 GILGV 524
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
915-1213 1.05e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 64.31  E-value: 1.05e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  915 KEQALAKLKGELKMEQGKVREQLEEwqhsKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQqaLQRDRQkEVQRLQE 994
Cdd:TIGR02168  675 RRREIEELEEKIEELEEKIAELEKA----LAELRKELEELEEELEQLRKELEELSRQISALRKD--LARLEA-EVEQLEE 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  995 CIAELSQQLgtSEQAQRLMEKKLKrnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEkLSETckg 1074
Cdd:TIGR02168  748 RIAQLSKEL--TELEAEIEELEER------LEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE-LTLL--- 815
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1075 SEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQ 1154
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSE 895
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 1155 LQSVRTLLREKE----------EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR02168  896 LEELSEELRELEskrselrrelEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIE 964
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
798-1007 1.18e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 64.17  E-value: 1.18e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  798 KELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPsgAWQRLHRVN------QDLQSELEAQCRRQE 871
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALR--LWFAQRRLElleaelEELRAELARLEAELE 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  872 LITQQIQTLKHSYGEAKDAIRHH--------EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHS 943
Cdd:COG4913    313 RLEARLDALREELDELEAQIRGNggdrleqlEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAAL 392
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300  944 KAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV-QRLQECIAELSQQLGTSE 1007
Cdd:COG4913    393 LEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIpARLLALRDALAEALGLDE 457
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
895-1203 1.24e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.92  E-value: 1.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  895 EAEIQTLQTRLGNAAAELAIKEQALAKLKGE---LKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQE 971
Cdd:TIGR02168  676 RREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKE 755
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  972 LRDLETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYE 1050
Cdd:TIGR02168  756 LTELEAEiEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREA-----LDELRAELTLLNEEAANLRERLESLE 830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1051 DQLQGHVQQVEALQKEKLSEtckgSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1130
Cdd:TIGR02168  831 RRIAATERRLEDLEEQIEEL----SEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL----KHIKETHERVLEKKDQDLNEALVKMIALGSSLEE 1203
Cdd:TIGR02168  907 ESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLseeySLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
932-1216 2.62e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 63.03  E-value: 2.62e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLsgQLRASEQKLRSTEARLLEKTQELRDLETQQALqrdRQKEVQRLQECIAELSQQLGTSEQAQR 1011
Cdd:COG1196    217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAE---LEAELEELRLELEELELELEEAQAEEY 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1012 LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREAS 1091
Cdd:COG1196    292 ELLAELAR--------LEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELE----EELEEAEEELEEAEAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1092 IRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKH 1171
Cdd:COG1196    360 LAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEE 439
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1039737300 1172 IKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLR 1216
Cdd:COG1196    440 EEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLE 484
PH cd00821
Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are ...
48-145 2.74e-09

Pleckstrin homology (PH) domain; PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 275388 [Multi-domain]  Cd Length: 92  Bit Score: 56.01  E-value: 2.74e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTdfdnpvhrSRKWQRRFFILYEHGLLRYALDEMPTTLPQGTINMNQCTDVVDGEaRTGQKFSLCILTPDKE 127
Cdd:cd00821      3 GYLLKRGGGG--------LKSWKKRWFVLFEGVLLYYKSKKDSSYKPKGSIPLSGILEVEEVS-PKERPHCFELVTPDGR 73
                           90
                   ....*....|....*....
gi 1039737300  128 HF-IRAETKEIISGWLEML 145
Cdd:cd00821     74 TYyLQADSEEERQEWLKAL 92
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2053-2318 3.08e-09

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 62.65  E-value: 3.08e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQH- 2131
Cdd:COG1196    247 ELEELEAELEELEAELAELEAELEELRLE--LEELELELEEAQAEEY----------ELLAELARLEQDIARLEERRREl 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2132 QRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2212 YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLtgdgggestglpltqgKDAYELEVLL 2291
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAA----------------EEEAELEEEE 458
                          250       260
                   ....*....|....*....|....*..
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    459 EALLELLAELLEEAALLEAALAELLEE 485
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
850-1211 5.63e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 62.01  E-value: 5.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKme 929
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELK-- 761
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  930 qgKVREQLEEWQHSKAMLSGQLRASEQKLRstEARLLEKTQELRDLEtqqalqrdrqKEVQRLQECIAELSQQLGTSEQA 1009
Cdd:TIGR02169  762 --ELEARIEELEEDLHKLEEALNDLEARLS--HSRIPEIQAELSKLE----------EEVSRIEARLREIEQKLNRLTLE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1010 QRLMEKKLkrnytlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEARE 1089
Cdd:TIGR02169  828 KEYLEKEI------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELE 895
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1090 ASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE-E 1168
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI-EDPKGEDEEIPEEELSLEDVQAELQRVEEEIRALEPvN 974
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1169 LKHIKEtHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEK 1211
Cdd:TIGR02169  975 MLAIQE-YEEVLKRLD-ELKEKRAKLEEERKAILERIEEYEKK 1015
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
934-1238 7.90e-09

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 61.24  E-value: 7.90e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  934 REQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQalqRDRQKEVQRLQEciaelsqqlgtSEQAQRLM 1013
Cdd:TIGR02169  673 PAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKI---GEIEKEIEQLEQ-----------EEEKLKER 738
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1014 EKKLKRNytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL-QKEKLSETCKGSEQVHKLEEELEAREASI 1092
Cdd:TIGR02169  739 LEELEED----LSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLeARLSHSRIPEIQAELSKLEEEVSRIEARL 814
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1093 RQLAQHVQSLHDERDLIKHQFQELMERVatsdgDVAELQEKLRGKEVDyqNLEHSHHRVSVQLQSVRTLLREKEEELKHI 1172
Cdd:TIGR02169  815 REIEQKLNRLTLEKEYLEKEIQELQEQR-----IDLKEQIKSIEKEIE--NLNGKKEELEEELEELEAALRDLESRLGDL 887
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1173 K------ETHERVLEKKDQDLNEALVKmiaLGSSLEETEIKLQEKEECLRRFvsDSPKDAKEPLSTTEPTEE 1238
Cdd:TIGR02169  888 KkerdelEAQLRELERKIEELEAQIEK---KRKRLSELKAKLEALEEELSEI--EDPKGEDEEIPEEELSLE 954
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
850-1066 1.23e-08

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 59.78  E-value: 1.23e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDL---QSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGEl 926
Cdd:COG4942     27 AELEQLQQEIaelEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEE- 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  927 kmeqgkvreqleewqhskamLSGQLRASEQKLRSTEARLL----EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQ 1002
Cdd:COG4942    106 --------------------LAELLRALYRLGRQPPLALLlspeDFLDAVRRLQYLKYLAPARREQAEELRADLAELAAL 165
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1003 LGTSEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE 1066
Cdd:COG4942    166 RAELEAERAELEALLAEL--------EEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
PH smart00233
Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The ...
44-145 1.68e-08

Pleckstrin homology domain; Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.


Pssm-ID: 214574 [Multi-domain]  Cd Length: 102  Bit Score: 54.09  E-value: 1.68e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300    44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCT-DVVDGEARTGQKFSLCI 121
Cdd:smart00233    1 VIKEGWLYKKSGG--------GKKSWKKRYFVLFNSTLLYYKSKKDKKSYkPKGSIDLSGCTvREAPDPDSSKKPHCFEI 72
                            90       100
                    ....*....|....*....|....*
gi 1039737300   122 LTPDKE-HFIRAETKEIISGWLEML 145
Cdd:smart00233   73 KTSDRKtLLLQAESEEEREKWVEAL 97
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
744-1239 2.17e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 59.80  E-value: 2.17e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  744 LKTQNVHVEIEQRWHQV--ETTPLREEKQVPiAPLHLSLEDRSERLST---------HELTSLLEKELEQSQKEASDLLE 812
Cdd:pfam01576   22 QKAESELKELEKKHQQLceEKNALQEQLQAE-TELCAEAEEMRARLAArkqeleeilHELESRLEEEEERSQQLQNEKKK 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  813 QNRLLQDqLRVALGREQSAREGyvLQTEVATSPS----------------GAWQRLHRVNQDLQSELEAQCRRQELITQQ 876
Cdd:pfam01576  101 MQQHIQD-LEEQLDEEEAARQK--LQLEKVTTEAkikkleedillledqnSKLSKERKLLEERISEFTSNLAEEEEKAKS 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  877 IQTLKHSygeakdairhHEAEIQTLQTRLGNAAAelaiKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQ 956
Cdd:pfam01576  178 LSKLKNK----------HEAMISDLEERLKKEEK----GRQELEKAKRKLEGESTDLQEQIAELQAQIAELRAQLAKKEE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  957 KLRSTEARLlektqelrdlETQQALQRDRQKEVQRLQECIAELSQQLgTSEQAQRLMEKKLKRNYTLLLESCEQEKQALL 1036
Cdd:pfam01576  244 ELQAALARL----------EEETAQKNNALKKIRELEAQISELQEDL-ESERAARNKAEKQRRDLGEELEALKTELEDTL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1037 ------QNLK-----EVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDE 1105
Cdd:pfam01576  313 dttaaqQELRskreqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAE 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1106 RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEelKHIKETHE-RVLEKKD 1184
Cdd:pfam01576  393 LRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEG--KNIKLSKDvSSLESQL 470
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 1185 QDLNEALV----KMIALGSSLEETEI-------KLQEKEECLRRF----------VSDSPKDAKEPLSTTEPTEEG 1239
Cdd:pfam01576  471 QDTQELLQeetrQKLNLSTRLRQLEDernslqeQLEEEEEAKRNVerqlstlqaqLSDMKKKLEEDAGTLEALEEG 546
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
896-1067 2.87e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 59.01  E-value: 2.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  896 AEIQTLQTRLGNAAAELAIKEQALAKLKgELKMEQGKVREQLEEWQHSKAMLSGQLRASE--QKLRSTEARLLEKTQELR 973
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEELE-ELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLE 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQL 1053
Cdd:COG4717    150 ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEEL 229
                          170
                   ....*....|....
gi 1039737300 1054 QGHVQQVEALQKEK 1067
Cdd:COG4717    230 EQLENELEAAALEE 243
PH2_MyoX cd13296
Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...
507-607 4.29e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270108  Cd Length: 103  Bit Score: 53.24  E-value: 4.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDG------QWKKHWFVLADQSLRYYRDsvAEEAADLDGEINLSTCYDVTEYPVQRNyGFQIHTKEGEFTL 580
Cdd:cd13296      1 KSGWLTKKGGGSstlsrrNWKSRWFVLRDTVLKYYEN--DQEGEKLLGTIDIRSAKEIVDNDPKEN-RLSITTEERTYHL 77
                           90       100
                   ....*....|....*....|....*..
gi 1039737300  581 SAMTSGIRRNWIQtIMKHVLPASAPDV 607
Cdd:cd13296     78 VAESPEDASQWVN-VLTRVISATDLEL 103
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
952-1213 4.45e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 58.79  E-value: 4.45e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  952 RASEQKLRSTEARLL-------EKTQELRDLETQ-------QALQ---RDRQKEVQRLQecIAELSQQLGTSEQAQRLME 1014
Cdd:COG1196    175 EEAERKLEATEENLErledilgELERQLEPLERQaekaeryRELKeelKELEAELLLLK--LRELEAELEELEAELEELE 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1015 KKLKRnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETckgsEQVHKLEEELEAREASIRQ 1094
Cdd:COG1196    253 AELEE-LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE----ERRRELEERLEELEEELAE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1095 LAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEvdyQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:COG1196    328 LEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE---AELAEAEEELEELAEELLEALRAAAELAAQLEE 404
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1039737300 1175 ThERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:COG1196    405 L-EEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2064-2256 6.31e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 58.53  E-value: 6.31e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELGH-----KELEGDVAALQEKyqrdFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:TIGR02168  307 LRERLANLERQLEELEAQLEElesklDELAEELAELEEK----LEELKEELESLEAELEELEAELEELESRLEELEEQLE 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 REEKDRLLAEETAATIsaieamkNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:TIGR02168  383 TLRSKVAQLELQIASL-------NNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEE 455
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1039737300 2219 NAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA 2256
Cdd:TIGR02168  456 LERLEEALEELREELEEAEQALDAAERELAQLQARLDS 493
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1879-2240 8.52e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 58.16  E-value: 8.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1879 QAVGALKEEYEELLhkqksEYQKVITliEKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSH 1958
Cdd:TIGR02169  198 QQLERLRREREKAE-----RYQALLK--EKREYEGYELLKEKEALERQKEAIERQLASL--------EEELEKLTEEISE 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1959 TENTLqAERSRVLSQLDASVKDRQAMEQHHVQ--------QMKMLEDRFQLKVRELQ------AVHQEELRALQEHyIWS 2024
Cdd:TIGR02169  263 LEKRL-EEIEQLLEELNKKIKDLGEEEQLRVKekigeleaEIASLERSIAEKERELEdaeerlAKLEAEIDKLLAE-IEE 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2025 LRGALSLYQpshpdsslapgpSEPRAVPAA-KDEAESMSGLRERIQELEAQMGVMREElgHKELegdvaalqekyqrdfe 2103
Cdd:TIGR02169  341 LEREIEEER------------KRRDKLTEEyAELKEELEDLRAELEEVDKEFAETRDE--LKDY---------------- 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2104 slkatcergfaameethQKKIEDLQRQH---QRELEKLREEKDRLLAE--ETAATISAIEAMKNAHREEME--RELEKSQ 2176
Cdd:TIGR02169  391 -----------------REKLEKLKREInelKRELDRLQEELQRLSEElaDLNAAIAGIEAKINELEEEKEdkALEIKKQ 453
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 2177 RSQISSINSDIEALRRQYL---EELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQREN 2240
Cdd:TIGR02169  454 EWKLEQLAADLSKYEQELYdlkEEYDRVEKELSKLQRELAE-----------AEAQARASEERVRGG 509
PH2_MyoX cd13296
Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular ...
48-145 8.66e-08

Myosin X Pleckstrin homology (PH) domain, repeat 2; MyoX, a MyTH-FERM myosin, is a molecular motor that has crucial functions in the transport and/or tethering of integrins in the actin-based extensions known as filopodia, microtubule binding, and in netrin-mediated axon guidance. It functions as a dimer. MyoX walks on bundles of actin, rather than single filaments, unlike the other unconventional myosins. MyoX is present in organisms ranging from humans to choanoflagellates, but not in Drosophila and Caenorhabditis elegans.MyoX consists of a N-terminal motor/head region, a neck made of 3 IQ motifs, and a tail consisting of a coiled-coil domain, a PEST region, 3 PH domains, a myosin tail homology 4 (MyTH4), and a FERM domain at its very C-terminus. The first PH domain in the MyoX tail is a split-PH domain, interupted by the second PH domain such that PH 1a and PH 1b flanks PH 2. The third PH domain (PH 3) follows the PH 1b domain. This cd contains the second PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270108  Cd Length: 103  Bit Score: 52.08  E-value: 8.66e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTdfdNPVHRsRKWQRRFFILYEHGLLRYALDEmPTTLPQGTINMNQCTDVVDgeaRTGQKFSLCILTPDKE 127
Cdd:cd13296      3 GWLTKKGGGS---STLSR-RNWKSRWFVLRDTVLKYYENDQ-EGEKLLGTIDIRSAKEIVD---NDPKENRLSITTEERT 74
                           90
                   ....*....|....*...
gi 1039737300  128 HFIRAETKEIISGWLEML 145
Cdd:cd13296     75 YHLVAESPEDASQWVNVL 92
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
875-1065 9.38e-08

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 58.00  E-value: 9.38e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  875 QQIQTLKHSYGEAKDAIRHHEA--EIQTLQTRLGNAAAELAIKEQALAKLK--------GELKMEQGKVREQLEEWQHSK 944
Cdd:COG4913    232 EHFDDLERAHEALEDAREQIELlePIRELAERYAAARERLAELEYLRAALRlwfaqrrlELLEAELEELRAELARLEAEL 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  945 AMLSGQLRASEQKLRSTEARLLE-KTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL 1023
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGnGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAA 391
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1039737300 1024 LLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQK 1065
Cdd:COG4913    392 LLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLER 433
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2070-2364 1.08e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.77  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2070 ELEAQMGVMREELGhkELEGDVAALQEKyqrdFESLKATCERGFAAMEETHqKKIEDLQRQHQRELEKLREEKDRLlaEE 2149
Cdd:TIGR02169  671 SEPAELQRLRERLE--GLKRELSSLQSE----LRRIENRLDELSQELSDAS-RKIGEIEKEIEQLEQEEEKLKERL--EE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2150 TAATISAIEAMKNAHREEMErELEK---SQRSQISSINSDIEALRRQYLEE-LQSVQRELEVLSEQYSQKCLENAHLAQA 2225
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELK-ELEArieELEEDLHKLEEALNDLEARLSHSrIPEIQAELSKLEEEVSRIEARLREIEQK 820
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2226 LEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGEStglpltqgkDAYELEVLLRVKESEIQYLKQEI 2305
Cdd:TIGR02169  821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEE---------ELEELEAALRDLESRLGDLKKER 891
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2306 SSLKDELQTALRDKKYASDKYKDIYTELSIAKAKAdcdiSRLKEQLKAATEALGEKSPE 2364
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQIEKKRKRLSELKAKL----EALEEELSEIEDPKGEDEEI 946
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2052-2318 1.16e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 57.62  E-value: 1.16e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2052 PAAKDEAESMSGLRERIQELEAQMGVMREELGHkeLEgDVAALQEKYQRDFESLKA--TCERGFAAmeETHQKKIEDLQR 2129
Cdd:COG4913    221 PDTFEAADALVEHFDDLERAHEALEDAREQIEL--LE-PIRELAERYAAARERLAEleYLRAALRL--WFAQRRLELLEA 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2130 qhqrELEKLREEKDRLLAEETAATISAIEAmkNAHREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLS 2209
Cdd:COG4913    296 ----ELEELRAELARLEAELERLEARLDAL--REELDELEAQIRGNGGDRLEQLEREIERLER----ELEERERRRARLE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2210 EQysqkcLENAHLAQALEAE--RQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqgkdayEL 2287
Cdd:COG4913    366 AL-----LAALGLPLPASAEefAALRAEAAALLEALEEELEALEEALAEAEAALR-----------------------DL 417
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG4913    418 RRELRELEAEIASLERRKSNIPARLLALRDA 448
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
768-1173 1.25e-07

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 57.44  E-value: 1.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  768 EKQVPIAPLHLSlEDRSER----LSTHELTSLLEK---ELEQSQKEASDLLEQNRLLQDQ-LRVALGREQSAREGYVLQT 839
Cdd:pfam15921  348 EKQLVLANSELT-EARTERdqfsQESGNLDDQLQKllaDLHKREKELSLEKEQNKRLWDRdTGNSITIDHLRRELDDRNM 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  840 EVatspsgawQRLHRVNQDLQSELEAQCRRQ--------------ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRL 905
Cdd:pfam15921  427 EV--------QRLEALLKAMKSECQGQMERQmaaiqgkneslekvSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTV 498
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  906 GNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHskamlsgqLRASEQKLRSTEArllektqELRDLETQQAlQRDR 985
Cdd:pfam15921  499 SDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQH--------LKNEGDHLRNVQT-------ECEALKLQMA-EKDK 562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  986 QKEVQRLQ-ECIAELSQQLGTSEQAQRLMEKKLKRNYtlllesceQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEaLQ 1064
Cdd:pfam15921  563 VIEILRQQiENMTQLVGQHGRTAGAMQVEKAQLEKEI--------NDRRLELQEFKILKDKKDAKIRELEARVSDLE-LE 633
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1065 KEKLSETckGSEQVhkleeeleareasirqlaQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQN- 1143
Cdd:pfam15921  634 KVKLVNA--GSERL------------------RAVKDIKQERD-------QLLNEVKTSRNELNSLSEDYEVLKRNFRNk 686
                          410       420       430
                   ....*....|....*....|....*....|...
gi 1039737300 1144 ---LEHSHHRVSVQLQSVRTLLREKEEELKHIK 1173
Cdd:pfam15921  687 seeMETTTNKLKMQLKSAQSELEQTRNTLKSME 719
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
751-996 1.63e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.00  E-value: 1.63e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  751 VEIEQRWHQVET--TPLREEKQVPIAPLHLSLEdrSERLSTHELTSLLEKELEQSQKE-ASDLLEQNRLLQD--QLRVAL 825
Cdd:TIGR02169  268 EEIEQLLEELNKkiKDLGEEEQLRVKEKIGELE--AEIASLERSIAEKERELEDAEERlAKLEAEIDKLLAEieELEREI 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  826 GREQSAREGyvLQTEVATSPsgawQRLHRVNQDLQS-ELEAQCRRQEL--ITQQIQTLKHSYGEAKDAIRHHEAEIQTLQ 902
Cdd:TIGR02169  346 EEERKRRDK--LTEEYAELK----EELEDLRAELEEvDKEFAETRDELkdYREKLEKLKREINELKRELDRLQEELQRLS 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  903 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG---QLRASEQKLRSTEARLLEKTQELRDLETQQ 979
Cdd:TIGR02169  420 EELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKyeqELYDLKEEYDRVEKELSKLQRELAEAEAQA 499
                          250
                   ....*....|....*..
gi 1039737300  980 ALQRDRQKEVQRLQECI 996
Cdd:TIGR02169  500 RASEERVRGGRAVEEVL 516
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1946-2243 2.25e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 56.61  E-value: 2.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1946 EEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLE--DRFQLKVRELQAVHQEELRALQehyiw 2023
Cdd:TIGR02169  676 LQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEqlEQEEEKLKERLEELEEDLSSLE----- 750
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2024 slrgalslyqpshpdsslapgpsepRAVPAAKDEaesMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKyQRDFE 2103
Cdd:TIGR02169  751 -------------------------QEIENVKSE---LKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEI-QAELS 801
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2104 SLKATCERGFAAMEETHQKkiedLQRQHQRE--LEKLREEK--DRLLAEETAATISAIEAMKNAHREEMERELEKSQRS- 2178
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQK----LNRLTLEKeyLEKEIQELqeQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAl 877
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 2179 -QISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQEL 2243
Cdd:TIGR02169  878 rDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGED 943
PH-GRAM1_AGT26 cd13215
Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, ...
506-599 2.86e-07

Autophagy-related protein 26/Sterol 3-beta-glucosyltransferase Pleckstrin homology (PH) domain, repeat 1; ATG26 (also called UGT51/UDP-glycosyltransferase 51), a member of the glycosyltransferase 28 family, resulting in the biosynthesis of sterol glucoside. ATG26 in decane metabolism and autophagy. There are 32 known autophagy-related (ATG) proteins, 17 are components of the core autophagic machinery essential for all autophagy-related pathways and 15 are the additional components required only for certain pathways or species. The core autophagic machinery includes 1) the ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and ATG27), 2) the phosphatidylinositol 3-kinase complex (ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7, ATG8, ATG10, ATG12, and ATG16). Less is known about how the core machinery is adapted or modulated with additional components to accommodate the nonselective sequestration of bulk cytosol (autophagosome formation) or selective sequestration of specific cargos (Cvt vesicle, pexophagosome, or bacteria-containing autophagosome formation). The pexophagosome-specific additions include the ATG30-ATG11-ATG17 receptor-adaptors complex, the coiled-coil protein ATG25, and the sterol glucosyltransferase ATG26. ATG26 is necessary for the degradation of medium peroxisomes. It contains 2 GRAM domains and a single PH domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 275402  Cd Length: 116  Bit Score: 51.08  E-value: 2.86e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQ-YEDGQWKKHWFVLADQSLRYYRDSvaeeaADL---DGEINLSTCY--DVTEYPVQRNYGFQIHTKEGEFT 579
Cdd:cd13215     22 IKSGYLSKRsKRTLRYTRYWFVLKGDTLSWYNSS-----TDLyfpAGTIDLRYATsiELSKSNGEATTSFKIVTNSRTYK 96
                           90       100
                   ....*....|....*....|
gi 1039737300  580 LSAMTSGIRRNWIQTIMKHV 599
Cdd:cd13215     97 FKADSETSADEWVKALKKQI 116
PH_AtPH1 cd13276
Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...
507-603 3.24e-07

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270095  Cd Length: 106  Bit Score: 50.78  E-value: 3.24e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDSVAEEAADLDGEINLSTCYDVT--EYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13276      1 KAGWLEKQGEFiKTWRRRWFVLKQGKLFWFKEPDVTPYSKPRGVIDLSKCLTVKsaEDATNKENAFELSTPEETFYFIAD 80
                           90       100
                   ....*....|....*....|
gi 1039737300  584 TSGIRRNWIQTIMKHVLPAS 603
Cdd:cd13276     81 NEKEKEEWIGAIGRAIVKHS 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2062-2360 3.32e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 56.10  E-value: 3.32e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2062 SGLRERIQELEAQMGVMREELG-----HKELEGDVAALQE---------KYQRDFESLKAtcergfaameETHQKKIEDL 2127
Cdd:COG1196    168 SKYKERKEEAERKLEATEENLErlediLGELERQLEPLERqaekaeryrELKEELKELEA----------ELLLLKLREL 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEMERELEKSQR-----SQISSINSDIEAL---RRQYLEE 2197
Cdd:COG1196    238 EAELEELEAELEELEAELeeLEAELAELEAELEELRLELEELELELEEAQAEeyellAELARLEQDIARLeerRRELEER 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2198 LQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTGDgggestglp 2277
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEEL--------- 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:COG1196    389 LEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAEL 468

                   ...
gi 1039737300 2358 LGE 2360
Cdd:COG1196    469 LEE 471
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1987-2313 3.58e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 55.83  E-value: 3.58e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 HHVQQMKMLEDRFQLKVRELQAVhQEELRALQEhyiwSLRGALSLYQPSHPDSSLAPGPSEpRAVPAAKDEAESMSGLRE 2066
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAEL-RKELEELEE----ELEQLRKELEELSRQISALRKDLA-RLEAEVEQLEERIAQLSK 754
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELGH-----KELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQRELEKLREE 2141
Cdd:TIGR02168  755 ELTELEAEIEELEERLEEaeeelAEAEAEIEELEAQIEQLKEELKALREA------------LDELRAELTLLNEEAANL 822
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2142 KDRLLAEETAAtisaieAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQSvqrELEVLSEQYSQKCLENAH 2221
Cdd:TIGR02168  823 RERLESLERRI------AATERRLEDLEEQIEE-LSEDIESLAAEIEELEEL-IEELES---ELEALLNERASLEEALAL 891
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2222 LAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLltgdgggESTGLPLTQ-----GKDAYE-LEVLLRVKE 2295
Cdd:TIGR02168  892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGL-------EVRIDNLQErlseeYSLTLEeAEALENKIE 964
                          330
                   ....*....|....*...
gi 1039737300 2296 SEIQYLKQEISSLKDELQ 2313
Cdd:TIGR02168  965 DDEEEARRRLKRLENKIK 982
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
896-1236 3.79e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 55.84  E-value: 3.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  896 AEIQTLQTRLGNAAAELAIKEQALAKLKGElkmeqgkvREQLEEWQHskamLSGQLRASEQKLRSTEARLLEKTQE--LR 973
Cdd:TIGR02169  177 EELEEVEENIERLDLIIDEKRQQLERLRRE--------REKAERYQA----LLKEKREYEGYELLKEKEALERQKEaiER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQALQRDRQKEVQRLQECIAELSQQLgtSEQAQRLMEKKLKRNYTLllesceQEKQALLQ-NLKEVEDKASAYEDQ 1052
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLL--EELNKKIKDLGEEEQLRV------KEKIGELEaEIASLERSIAEKERE 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1053 LQghvqQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQE 1132
Cdd:TIGR02169  317 LE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYRE 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1133 KLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETH---ERVLEKKDQDLNEALVKMIALGSSLEETEIKLQ 1209
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKInelEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472
                          330       340
                   ....*....|....*....|....*..
gi 1039737300 1210 EKEECLRRfVSDSPKDAKEPLSTTEPT 1236
Cdd:TIGR02169  473 DLKEEYDR-VEKELSKLQRELAEAEAQ 498
PH pfam00169
PH domain; PH stands for pleckstrin homology.
44-145 4.37e-07

PH domain; PH stands for pleckstrin homology.


Pssm-ID: 459697 [Multi-domain]  Cd Length: 105  Bit Score: 50.25  E-value: 4.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   44 PIYGGWLLLAPDGtdfdnpvhRSRKWQRRFFILYEHGLLRYALDEMPTTL-PQGTINMNQCTDV-VDGEARTGQKFSLCI 121
Cdd:pfam00169    1 VVKEGWLLKKGGG--------KKKSWKKRYFVLFDGSLLYYKDDKSGKSKePKGSISLSGCEVVeVVASDSPKRKFCFEL 72
                           90       100
                   ....*....|....*....|....*...
gi 1039737300  122 LTPD----KEHFIRAETKEIISGWLEML 145
Cdd:pfam00169   73 RTGErtgkRTYLLQAESEEERKDWIKAI 100
PH_CNK_mammalian-like cd01260
Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; ...
509-553 5.73e-07

Connector enhancer of KSR (Kinase suppressor of ras) (CNK) pleckstrin homology (PH) domain; CNK family members function as protein scaffolds, regulating the activity and the subcellular localization of RAS activated RAF. There is a single CNK protein present in Drosophila and Caenorhabditis elegans in contrast to mammals which have 3 CNK proteins (CNK1, CNK2, and CNK3). All of the CNK members contain a sterile a motif (SAM), a conserved region in CNK (CRIC) domain, and a PSD-95/DLG-1/ZO-1 (PDZ) domain, and, with the exception of CNK3, a PH domain. A CNK2 splice variant CNK2A also has a PDZ domain-binding motif at its C terminus and Drosophila CNK (D-CNK) also has a domain known as the Raf-interacting region (RIR) that mediates binding of the Drosophila Raf kinase. This cd contains CNKs from mammals, chickens, amphibians, fish, and crustacea. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269962  Cd Length: 114  Bit Score: 50.10  E-value: 5.73e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300  509 GWLTKQYEDG-----QWKKHWFVLADQSLRYYRDSVAEEAadlDGEINLS 553
Cdd:cd01260     17 GWLWKKKEAKsffgqKWKKYWFVLKGSSLYWYSNQQDEKA---EGFINLP 63
PH1_ARAP cd13253
ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, ...
507-600 5.78e-07

ArfGAP with RhoGAP domain, ankyrin repeat and PH domain Pleckstrin homology (PH) domain, repeat 1; ARAP proteins (also called centaurin delta) are phosphatidylinositol 3,4,5-trisphosphate-dependent GTPase-activating proteins that modulate actin cytoskeleton remodeling by regulating ARF and RHO family members. They bind phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3) and phosphatidylinositol 3,4-bisphosphate (PtdIns(3,4,5)P2) binding. There are 3 mammalian ARAP proteins: ARAP1, ARAP2, and ARAP3. All ARAP proteins contain a N-terminal SAM (sterile alpha motif) domain, 5 PH domains, an ArfGAP domain, 2 ankyrin domain, A RhoGap domain, and a Ras-associating domain. This hierarchy contains the first PH domain in ARAP. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270073  Cd Length: 94  Bit Score: 49.69  E-value: 5.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDGQ---WKKHWFVLADQSLRYYRdsvAEEAADLDGEINLSTcydVTEYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13253      2 KSGYLDKQGGQGNnkgFQKRWVVFDGLSLRYFD---SEKDAYSKRIIPLSA---ISTVRAVGDNKFELVTTNRTFVFRAE 75
                           90
                   ....*....|....*..
gi 1039737300  584 TSGIRRNWIQTIMKHVL 600
Cdd:cd13253     76 SDDERNLWCSTLQAAIS 92
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1907-2292 5.95e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 55.33  E-value: 5.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKHSESmfalqgryEEEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQ 1986
Cdd:COG1196    221 ELKELEAELLLLKLRELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLE-LEELELELEEAQAEEY 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 HHVQQMKMLEDRFQLKVRELQAVHQEELRALQEhyiwslrgalslyqpshpdsslapgpsepravpaakdEAEsmsgLRE 2066
Cdd:COG1196    292 ELLAELARLEQDIARLEERRRELEERLEELEEE-------------------------------------LAE----LEE 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELghKELEGDVAALQEKYQRdfeslkatcergfaamEETHQKKIEDLQRQHQRELEKLREEKDRLL 2146
Cdd:COG1196    331 ELEELEEELEELEEEL--EEAEEELEEAEAELAE----------------AEEALLEAEAELAEAEEELEELAEELLEAL 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2147 AEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQAL 2226
Cdd:COG1196    393 RAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA 472
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 2227 EAERQALRQCQRENQELNA-----HNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVLLR 2292
Cdd:COG1196    473 ALLEAALAELLEELAEAAArllllLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAA 543
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
796-1168 7.37e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 54.77  E-value: 7.37e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSARegyvLQTEVATSPSG---------AWQRLHRVNQDLQSEL-EA 865
Cdd:COG4717    100 LEEELEELEAELEELREELEKLEKLLQLLPLYQELEA----LEAELAELPERleeleerleELRELEEELEELEAELaEL 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  866 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgELKMEQGKVREQLEEWQHSKA 945
Cdd:COG4717    176 QEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL--ENELEAAALEERLKEARLLLL 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  946 MLSGQL------------------------------RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQEC 995
Cdd:COG4717    254 IAAALLallglggsllsliltiagvlflvlgllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  996 I--AELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQ-----NLKEVEDKASAYED--QLQGHVQQVEALQKE 1066
Cdd:COG4717    334 LspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAeagveDEEELRAALEQAEEyqELKEELEELEEQLEE 413
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1067 KLSETCKGSEQVHKLE--EELEAREASIRQLAQHVQSLHDERDLIKHQFQELMErvatsDGDVAELQEKLRGKEVDYQNL 1144
Cdd:COG4717    414 LLGELEELLEALDEEEleEELEELEEELEELEEELEELREELAELEAELEQLEE-----DGELAELLQELEELKAELREL 488
                          410       420
                   ....*....|....*....|....
gi 1039737300 1145 EHSHHRVSVQLQSVRTLLREKEEE 1168
Cdd:COG4717    489 AEEWAALKLALELLEEAREEYREE 512
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
796-1213 7.64e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.64  E-value: 7.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVaLGREQSAREGYVLQTEvatspsgaWQRLhRVNQDLqSELEAQCRRQELITQ 875
Cdd:TIGR04523  150 KEKELEKLNNKYNDLKKQKEELENELNL-LEKEKLNIQKNIDKIK--------NKLL-KLELLL-SNLKKKIQKNKSLES 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHskamlsgQLRASE 955
Cdd:TIGR04523  219 QISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLN-----------QLKDEQNKIKKQLSEKQK-------ELEQNN 280
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRSTEARLLEKTQELRDL--ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQA-QRLMEK--KLKRNytllLESCEQ 1030
Cdd:TIGR04523  281 KKIKELEKQLNQLKSEISDLnnQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIiSQLNEQisQLKKE----LTNSES 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1031 EKQALLQNLKEVEDKASAYEDQLQGHVQQVEAL--QKEKLSETCKGSEQVHKleeeleareasirQLAQHVQSLHDERDL 1108
Cdd:TIGR04523  357 ENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLesQINDLESKIQNQEKLNQ-------------QKDEQIKKLQQEKEL 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1109 IKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNL--------------EHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLdntresletqlkvlSRSINKIKQNLEQKQKELKSKEKELKKLNE 503
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1175 tHERVLEKKDQDLN----EALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR04523  504 -EKKELEEKVKDLTkkisSLKEKIEKLESEKKEKESKISDLED 545
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
911-1212 8.25e-07

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 54.59  E-value: 8.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  911 ELAIKEQALAKL------KGELKMEQGKVREQL--EEWQHSKAMLSGQLRASEQKLRSTEARLLEKT---QELRDLETQQ 979
Cdd:pfam02463  167 LKRKKKEALKKLieetenLAELIIDLEELKLQElkLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKlneERIDLLQELL 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  980 ALQRDRQKEVQRLQECIAELSQQ----LGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAY 1049
Cdd:pfam02463  247 RDEQEEIESSKQEIEKEEEKLAQvlkeNKEEEKEKKLQEEELKLLAKEEeelkseLLKLERRKVDDEEKLKESEKEKKKA 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1050 EDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQ---FQELMERVATSDGD 1126
Cdd:pfam02463  327 EKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAaklKEEELELKSEEEKE 406
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1127 VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEI 1206
Cdd:pfam02463  407 AQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQL 486

                   ....*.
gi 1039737300 1207 KLQEKE 1212
Cdd:pfam02463  487 ELLLSR 492
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
796-1006 8.35e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 54.64  E-value: 8.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRL--LQDQLRVALGREQSAREGYV-LQTEVAtspsGAWQRLHRVNQDLQSELEAQcrRQEL 872
Cdd:COG3206    187 LRKELEEAEAALEEFRQKNGLvdLSEEAKLLLQQLSELESQLAeARAELA----EAEARLAALRAQLGSGPDAL--PELL 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  873 ITQQIQTLKHSYGEAkdairhhEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLE-EWQHSKAMLSgQL 951
Cdd:COG3206    261 QSPVIQQLRAQLAEL-------EAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILASLEaELEALQAREA-SL 332
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  952 RASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKE-VQRLQEciAELSQQLGTS 1006
Cdd:COG3206    333 QAQLAQLEARLAELPELEAELRRLEREVEVARELYESlLQRLEE--ARLAEALTVG 386
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
875-1205 1.00e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 54.26  E-value: 1.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  875 QQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAI---KEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQL 951
Cdd:TIGR04523  117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKlnnKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  952 RASEQKLRSTEA-----RLLEKtqELRDLETQQA-LQRDRQKEVQRLQECIAELSQqlgTSEQAQRLMEKKLKRNYTLll 1025
Cdd:TIGR04523  197 LKLELLLSNLKKkiqknKSLES--QISELKKQNNqLKDNIEKKQQEINEKTTEISN---TQTQLNQLKDEQNKIKKQL-- 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1026 esceQEKQallQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKG--------SEQVHKLEEELEAREASIRQLAQ 1097
Cdd:TIGR04523  270 ----SEKQ---KELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNE 342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1098 HVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKhIKETHE 1177
Cdd:TIGR04523  343 QISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIK-KLQQEK 421
                          330       340
                   ....*....|....*....|....*...
gi 1039737300 1178 RVLEKKDQDLNEALVKMIALGSSLEETE 1205
Cdd:TIGR04523  422 ELLEKEIERLKETIIKNNSEIKDLTNQD 449
PH_PEPP1_2_3 cd13248
Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; ...
507-595 1.18e-06

Phosphoinositol 3-phosphate binding proteins 1, 2, and 3 pleckstrin homology (PH) domain; PEPP1 (also called PLEKHA4/PH domain-containing family A member 4 and RHOXF1/Rhox homeobox family member 1), and related homologs PEPP2 (also called PLEKHA5/PH domain-containing family A member 5) and PEPP3 (also called PLEKHA6/PH domain-containing family A member 6), have PH domains that interact specifically with PtdIns(3,4)P3. Other proteins that bind PtdIns(3,4)P3 specifically are: TAPP1 (tandem PH-domain-containing protein-1) and TAPP2], PtdIns3P AtPH1, and Ptd- Ins(3,5)P2 (centaurin-beta2). All of these proteins contain at least 5 of the 6 conserved amino acids that make up the putative phosphatidylinositol 3,4,5- trisphosphate-binding motif (PPBM) located at their N-terminus. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270068  Cd Length: 104  Bit Score: 49.19  E-value: 1.18e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDG--QWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTcYDVT----EYPVQRNYGFQIhTKEGEFT- 579
Cdd:cd13248      9 MSGWLHKQGGSGlkNWRKRWFVLKDNCLYYYKD---PEEEKALGSILLPS-YTISpappSDEISRKFAFKA-EHANMRTy 83
                           90
                   ....*....|....*..
gi 1039737300  580 -LSAMTSGIRRNWIQTI 595
Cdd:cd13248     84 yFAADTAEEMEQWMNAM 100
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
752-1217 1.38e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 53.79  E-value: 1.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  752 EIEQRWHQVETTPLREEKQvpIAPLHLSLEDRSERL-STHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQS 830
Cdd:COG1196    285 EAQAEEYELLAELARLEQD--IARLEERRRELEERLeELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  831 AREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAA 910
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEE 442
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  911 ELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgQLRASEQKLRSTEARLLE--KTQELRDLETQQALQRDRQKE 988
Cdd:COG1196    443 ALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALA-ELLEELAEAAARLLLLLEaeADYEGFLEGVKAALLLAGLRG 521
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  989 VQR---------------LQECIAELSQQLGT------SEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKAS 1047
Cdd:COG1196    522 LAGavavligveaayeaaLEAALAAALQNIVVeddevaAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAV 601
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1048 AYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELME---RVATSD 1124
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAallEAEAEL 681
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1125 GDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEET 1204
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
                          490
                   ....*....|...
gi 1039737300 1205 EIKLQEKEECLRR 1217
Cdd:COG1196    762 LEELERELERLER 774
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1758-2360 1.48e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 53.91  E-value: 1.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLppTEPLGGCQ------RLLRMSQHLSYESCLEGLGQYS 1831
Cdd:TIGR02168  308 RERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEEL--KEELESLEaeleelEAELEELESRLEELEEQLETLR 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1832 S----LLVQDAIIQAQVCYAACRI-RLEYEKELRFYKKACQEAKGASGQKraQAVGALKEEYEELLHKQKSEYQKVITLI 1906
Cdd:TIGR02168  386 SkvaqLELQIASLNNEIERLEARLeRLEDRRERLQQEIEELLKKLEEAEL--KELQAELEELEEELEELQEELERLEEAL 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1907 EKENTELKAKVSQMDHQQRCLQEAENKhSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEq 1986
Cdd:TIGR02168  464 EELREELEEAEQALDAAERELAQLQAR-LDSLERLQENLEGFSE-GVKALLKNQSGLSGILGVLSELISVDEGYEAAIE- 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1987 hhvqqmKMLEDRFQ-LKVRELQAVhqeelRALQEHYIWSLRG-----ALSLYQPSHPDSSLAPG-PSEPRAVPAAKDEAE 2059
Cdd:TIGR02168  541 ------AALGGRLQaVVVENLNAA-----KKAIAFLKQNELGrvtflPLDSIKGTEIQGNDREIlKNIEGFLGVAKDLVK 609
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2060 SMSGLRERIQELEAQMGV---------MREELGHKE----LEGDVAAlqekyqrdfeslkatceRGFAAMEETHQKKIED 2126
Cdd:TIGR02168  610 FDPKLRKALSYLLGGVLVvddldnaleLAKKLRPGYrivtLDGDLVR-----------------PGGVITGGSAKTNSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2127 LQRQhqRELEKLREEKDRLLAEETAATISAIEAMKNahREEMERELEKSQRsQISSINSDIEALRRQYLEELQSVQR--- 2203
Cdd:TIGR02168  673 LERR--REIEELEEKIEELEEKIAELEKALAELRKE--LEELEEELEQLRK-ELEELSRQISALRKDLARLEAEVEQlee 747
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2204 ELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLR------TLLTGDGGGESTGLP 2277
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDelraelTLLNEEAANLRERLE 827
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2278 LTQgKDAYELEVLLRVKESEIQYLKQEISSLKDElqtaLRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEA 2357
Cdd:TIGR02168  828 SLE-RRIAATERRLEDLEEQIEELSEDIESLAAE----IEELEELIEELESELEALLNERASLEEALALLRSELEELSEE 902

                   ...
gi 1039737300 2358 LGE 2360
Cdd:TIGR02168  903 LRE 905
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
908-1138 2.03e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.46  E-value: 2.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  908 AAAELAIKEQALAKLKGELKmeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ 986
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIA----ELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAElAELEKEIA 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  987 KEVQRLQECIAELSQQLgtseqaqRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQVEALQ 1064
Cdd:COG4942     94 ELRAELEAQKEELAELL-------RALYRLGRQPPLALLLSPEDFLDAVrrLQYLKYLAPARREQAEELRADLAELAALR 166
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1065 KEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKE 1138
Cdd:COG4942    167 AELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAA 240
PH2_ADAP cd01251
ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called ...
505-595 2.21e-06

ArfGAP with dual PH domains Pleckstrin homology (PH) domain, repeat 2; ADAP (also called centaurin alpha) is a phophatidlyinositide binding protein consisting of an N-terminal ArfGAP domain and two PH domains. In response to growth factor activation, PI3K phosphorylates phosphatidylinositol 4,5-bisphosphate to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 1 is recruited to the plasma membrane following growth factor stimulation by specific binding of its PH domain to phosphatidylinositol 3,4,5-trisphosphate. Centaurin alpha 2 is constitutively bound to the plasma membrane since it binds phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate with equal affinity. This cd contains the second PH domain repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241282  Cd Length: 105  Bit Score: 48.35  E-value: 2.21e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  505 NFKK-GWLTK----QYEdgQWKKHWFVLADQSLRYYRDSvaeeaadLD----GEINLSTC---YDVTE-----YPVQRNY 567
Cdd:cd01251      1 DFLKeGYLEKtgpkQTD--GFRKRWFTLDDRRLMYFKDP-------LDafpkGEIFIGSKeegYSVREglppgIKGHWGF 71
                           90       100
                   ....*....|....*....|....*...
gi 1039737300  568 GFQIHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd01251     72 GFTLVTPDRTFLLSAETEEERREWITAI 99
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
781-1110 2.94e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 52.76  E-value: 2.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  781 EDRSERLSTheLTSLLEKELEQSQKEASDLLEQNrllqdqlrvALGREQSAREGYVLqtevatspSGAWQRLHRVNQDLQ 860
Cdd:TIGR02169  183 EENIERLDL--IIDEKRQQLERLRREREKAERYQ---------ALLKEKREYEGYEL--------LKEKEALERQKEAIE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  861 SELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQ--------TLQTRLGNAAAELAIKEQALAKLKGELKMEQGK 932
Cdd:TIGR02169  244 RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKdlgeeeqlRVKEKIGELEAEIASLERSIAEKERELEDAEER 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  933 VR---EQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-----QALQRDRQKEVQRlQECIAELSQQLG 1004
Cdd:TIGR02169  324 LAkleAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEleevdKEFAETRDELKDY-REKLEKLKREIN 402
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1005 TSEQAQ-RLMEKKLKRNYTLL-----LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKL---SETCKGS 1075
Cdd:TIGR02169  403 ELKRELdRLQEELQRLSEELAdlnaaIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYdlkEEYDRVE 482
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1039737300 1076 EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIK 1110
Cdd:TIGR02169  483 KELSKLQRELAEAEAQARASEERVRGGRAVEEVLK 517
PH_Gab-like cd13324
Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...
45-141 3.25e-06

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270133  Cd Length: 112  Bit Score: 48.18  E-value: 3.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   45 IYGGWLLLAPdgtdfdnPVHR--SRKWQRRFFILYEHGL------LRYALDEMPTTlPQGTINMNQCTDVVDGEARTGQK 116
Cdd:cd13324      2 VYEGWLTKSP-------PEKKiwRAAWRRRWFVLRSGRLsggqdvLEYYTDDHCKK-LKGIIDLDQCEQVDAGLTFEKKK 73
                           90       100
                   ....*....|....*....|....*....
gi 1039737300  117 FS----LCILTPDKEHFIRAETKEIISGW 141
Cdd:cd13324     74 FKnqfiFDIRTPKRTYYLVAETEEEMNKW 102
PH_PLEKHD1 cd13281
Pleckstrin homology (PH) domain containing, family D (with coiled-coil domains) member 1 PH ...
64-145 3.62e-06

Pleckstrin homology (PH) domain containing, family D (with coiled-coil domains) member 1 PH domain; Human PLEKHD1 (also called UPF0639, pleckstrin homology domain containing, family D (with M protein repeats) member 1) is a single transcript and contains a single PH domain. PLEKHD1 is conserved in human, chimpanzee, , dog, cow, mouse, chicken, zebrafish, and Caenorhabditis elegans. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270099  Cd Length: 139  Bit Score: 48.47  E-value: 3.62e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   64 HRSRKWQRRFFILYEHGLLRYALDEMPT--------TLPQGTINMNQCTDVVDGEarTGQKFSLCILTPD--KEHFIRAE 133
Cdd:cd13281     25 HQSAKWSKRFFIIKEGFLLYYSESEKKDfektrhfnIHPKGVIPLGGCSIEAVED--PGKPYAISISHSDfkGNIILAAD 102
                           90
                   ....*....|..
gi 1039737300  134 TKEIISGWLEML 145
Cdd:cd13281    103 SEFEQEKWLDML 114
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
795-1042 4.02e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 51.82  E-value: 4.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  795 LLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQ----CRRQ 870
Cdd:pfam07888   31 LLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKykelSASS 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  871 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAikeqalaklkgELKMEQGKVREQLEEWQHSKAMLSGQ 950
Cdd:pfam07888  111 EELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELE-----------RMKERAKKAGAQRKEEEAERKQLQAK 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  951 LRASEQKLRSTEARLlektQELRDLETQQALQrdrqkeVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTL--LLESC 1028
Cdd:pfam07888  180 LQQTEEELRSLSKEF----QELRNSLAQRDTQ------VLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLqeRLNAS 249
                          250
                   ....*....|....
gi 1039737300 1029 EQEKQALLQNLKEV 1042
Cdd:pfam07888  250 ERKVEGLGEELSSM 263
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2091-2400 4.26e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 52.38  E-value: 4.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAALQEKYQRDFESLKATCERgfaamEETHQKKIEDLQRQhqreLEKLREEKDRLL--------AEETAATI--SAIEAM 2160
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEEN-----IERLDLIIDEKRQQ----LERLRREREKAEryqallkeKREYEGYEllKEKEAL 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2161 K------NAHREEMERELEKSQRsQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKCLENAHLAQA-LEAERQAL 2233
Cdd:TIGR02169  236 ErqkeaiERQLASLEEELEKLTE-EISELEKRLEEIEQ----LLEELNKKIKDLGEEEQLRVKEKIGELEAeIASLERSI 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2234 RQCQRENQELNAHNQELN---NRLAAEITRLRTLLtgdgggESTGLPLTQGKDAY-ELEVLLRVKESEIQYLKQEISSLK 2309
Cdd:TIGR02169  311 AEKERELEDAEERLAKLEaeiDKLLAEIEELEREI------EEERKRRDKLTEEYaELKEELEDLRAELEEVDKEFAETR 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2310 DELQTALRDKKYASDKYKDIYTELSI---AKAKADCDISRLKEQLKAATEALGEkSPEGTTVSGYDIMKSKSNPDFLKKD 2386
Cdd:TIGR02169  385 DELKDYREKLEKLKREINELKRELDRlqeELQRLSEELADLNAAIAGIEAKINE-LEEEKEDKALEIKKQEWKLEQLAAD 463
                          330
                   ....*....|....
gi 1039737300 2387 RSCVTRQLRNIRSK 2400
Cdd:TIGR02169  464 LSKYEQELYDLKEE 477
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
934-1170 6.17e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 51.84  E-value: 6.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  934 REQLEEWQHSKAMLSgQLRASEQKLRSTEARLlEKTQELRD----------LETQQALQRDRQKEVQRLQECIAELSQQL 1003
Cdd:COG4913    241 HEALEDAREQIELLE-PIRELAERYAAARERL-AELEYLRAalrlwfaqrrLELLEAELEELRAELARLEAELERLEARL 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1004 GTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQghvqqvealqkeKLSETCKGSEQVHKLEE 1083
Cdd:COG4913    319 DALREELDELEAQIRGNGGDRLEQLEREIERLERELEERERRRARLEALLA------------ALGLPLPASAEEFAALR 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1084 eleareasiRQLAQHVQSLHDERDlikhqfqELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLR 1163
Cdd:COG4913    387 ---------AEAAALLEALEEELE-------ALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALA 450
                          250
                   ....*....|.
gi 1039737300 1164 E----KEEELK 1170
Cdd:COG4913    451 EalglDEAELP 461
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
857-1213 6.70e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 51.56  E-value: 6.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDA-------IRHHEAEIQTLQTRLGNAAAELAI----KEQALAK-LKG 924
Cdd:TIGR04523  235 EKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQlsekqkeLEQNNKKIKELEKQLNQLKSEISDlnnqKEQDWNKeLKS 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  925 ELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQ---KEVQRLQECIA 997
Cdd:TIGR04523  315 ELKNQEKKLEEiqnQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEiEKLKKENQsykQEIKNLESQIN 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  998 ELSQQLGTSEQAQRLME---KKLKRNYTLLLESCEQEKQALLQN---LKEVEDKASAYEDQLQGHVQQVEAlQKEKLSEt 1071
Cdd:TIGR04523  395 DLESKIQNQEKLNQQKDeqiKKLQQEKELLEKEIERLKETIIKNnseIKDLTNQDSVKELIIKNLDNTRES-LETQLKV- 472
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1072 ckgseqvhkleeeleaREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1151
Cdd:TIGR04523  473 ----------------LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEK 536
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1152 SVQLQSVRTLLREKEEELKhiKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:TIGR04523  537 ESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEK 596
PH_DAPP1 cd10573
Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...
504-595 6.93e-06

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269977 [Multi-domain]  Cd Length: 96  Bit Score: 46.55  E-value: 6.93e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  504 LNFKKGWLTKQyedGQ----WKKHWFVLADQSLRYYRDSVAEEAADldgEINLSTCYDVTEYPVQ-RNYGFQIHTKEGEF 578
Cdd:cd10573      2 LGSKEGYLTKL---GGivknWKTRWFVLRRNELKYFKTRGDTKPIR---VLDLRECSSVQRDYSQgKVNCFCLVFPERTF 75
                           90
                   ....*....|....*..
gi 1039737300  579 TLSAMTSGIRRNWIQTI 595
Cdd:cd10573     76 YMYANTEEEADEWVKLL 92
PH_Gab2_2 cd13384
Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...
45-141 7.35e-06

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241535  Cd Length: 115  Bit Score: 47.05  E-value: 7.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   45 IYGGWLLLAPdgtdfdnPVHR--SRKWQRRFFILY-----EHGLLRYALDEMPTTLpQGTINMNQCTDV-----VDGEAR 112
Cdd:cd13384      4 VYEGWLTKSP-------PEKRiwRAKWRRRYFVLRqseipGQYFLEYYTDRTCRKL-KGSIDLDQCEQVdagltFETKNK 75
                           90       100
                   ....*....|....*....|....*....
gi 1039737300  113 TGQKFSLCILTPDKEHFIRAETKEIISGW 141
Cdd:cd13384     76 LKDQHIFDIRTPKRTYYLVADTEDEMNKW 104
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1897-2366 8.51e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.27  E-value: 8.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1897 SEYQKVITLIEKENTELKAKVSQMDHQQRCLQ-EAENK-------HSESMFALQGRYEEEIRCMVEQLSHTENTLQAERS 1968
Cdd:pfam15921  220 SAISKILRELDTEISYLKGRIFPVEDQLEALKsESQNKielllqqHQDRIEQLISEHEVEITGLTEKASSARSQANSIQS 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1969 RvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSlrgalslyqpshpDSSLAPGPSEp 2048
Cdd:pfam15921  300 Q-LEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLA-------------NSELTEARTE- 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2049 ravpaaKDEAESMSG-LRERIQELEAQMGVMREELG---------------------HKELEGDVAALQ-EKYQRDFESL 2105
Cdd:pfam15921  365 ------RDQFSQESGnLDDQLQKLLADLHKREKELSlekeqnkrlwdrdtgnsitidHLRRELDDRNMEvQRLEALLKAM 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2106 KATC----ERGFAAMEETHQ--KKIEDLQRQHQRELEKLREEKDRLLA-----EETAATISAIeamkNAHREEMERELEK 2174
Cdd:pfam15921  439 KSECqgqmERQMAAIQGKNEslEKVSSLTAQLESTKEMLRKVVEELTAkkmtlESSERTVSDL----TASLQEKERAIEA 514
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2175 SQrSQISSINS--DIEALRRQYL----EELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQ 2248
Cdd:pfam15921  515 TN-AEITKLRSrvDLKLQELQHLknegDHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKA 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2249 ELNNRLAAEITRLRTLLTgdgggestglpLTQGKDAYELEVLLRVKESEIQY----------------LKQEISSLKDEL 2312
Cdd:pfam15921  594 QLEKEINDRRLELQEFKI-----------LKDKKDAKIRELEARVSDLELEKvklvnagserlravkdIKQERDQLLNEV 662
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300 2313 QTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGE-----KSPEGT 2366
Cdd:pfam15921  663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQtrntlKSMEGS 721
PH_TBC1D2A cd01265
TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...
520-598 8.54e-06

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269966  Cd Length: 102  Bit Score: 46.55  E-value: 8.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  520 WKKHWFVLADQS--LRYYRDSvaeeaADLD--GEINLSTCydVTEYPVQRNYG-FQIHTKEGEFTLSAMTSGIRRNWIQT 594
Cdd:cd01265     19 WKRRWFVLDESKcqLYYYRSP-----QDATplGSIDLSGA--AFSYDPEAEPGqFEIHTPGRVHILKASTRQAMLYWLQA 91

                   ....
gi 1039737300  595 IMKH 598
Cdd:cd01265     92 LQSK 95
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1875-2308 8.88e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 51.48  E-value: 8.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1875 QKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAEN--KHSESMFALQGRYEEEIRCM 1952
Cdd:COG1196    347 EEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEleEAEEALLERLERLEEELEEL 426
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1953 VEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKmLEDRFQLKVRELQAVHQEELRALQEHyiWSLRGALSLY 2032
Cdd:COG1196    427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAE-LLEEAALLEAALAELLEELAEAAARL--LLLLEAEADY 503
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2033 QPSHPDSSLAPGPSEPRAVPAAKDEAesMSGLRERIQELEAQMGVMREELGHKELEgdVAALQEKYQRDFESLKAT---- 2108
Cdd:COG1196    504 EGFLEGVKAALLLAGLRGLAGAVAVL--IGVEAAYEAALEAALAAALQNIVVEDDE--VAAAAIEYLKAAKAGRATflpl 579
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2109 --CERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMERELEKSQRSQISSI 2183
Cdd:COG1196    580 dkIRARAALAAALARGAIGAAVDLVASDLREADARYYVLgdtLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2184 NSDIEALRRQYLEELQSVQRELEVLSEQ--YSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRL 2261
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERlaEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 2262 RTLltgdgggESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSL 2308
Cdd:COG1196    740 ELL-------EEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
871-1182 9.95e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 49.91  E-value: 9.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  871 ELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtrlgNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhskamLSGQ 950
Cdd:COG1340      4 DELSSSLEELEEKIEELREEIEELKEKRDELN----EELKELAEKRDELNAQVKELREEAQELREKRDE-------LNEK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  951 LRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS----EQAQRLMEK--KLKRNYTLL 1024
Cdd:COG1340     73 VKELKEERDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRQQTEvlspEEEKELVEKikELEKELEKA 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1025 LESCEQEKQallqnLKEVEDKASAYEDQLQGHVQQVEALQKEklsetckgSEQVHKleeeleareaSIRQLAQHVQSLHD 1104
Cdd:COG1340    153 KKALEKNEK-----LKELRAELKELRKEAEEIHKKIKELAEE--------AQELHE----------EMIELYKEADELRK 209
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1105 ERDLIKHQFQELMERvatsdgdVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRtllreKEEELKHIKETHERVLEK 1182
Cdd:COG1340    210 EADELHKEIVEAQEK-------ADELHEEIIELQKELRELRKELKKLRKKQRALK-----REKEKEELEEKAEEIFEK 275
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
856-1212 1.11e-05

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 50.79  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  856 NQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGElkmEQGKvRE 935
Cdd:TIGR04523  309 NKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE---NQSY-KQ 384
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  936 QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQ----ALQRDRQKEVQRLQECIAELSQQLGTSEQAQR 1011
Cdd:TIGR04523  385 EIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIerlkETIIKNNSEIKDLTNQDSVKELIIKNLDNTRE 464
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1012 LMEKKLKrnytLLLESCEQEKQALLQNLKEVEDKASAYeDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREAS 1091
Cdd:TIGR04523  465 SLETQLK----VLSRSINKIKQNLEQKQKELKSKEKEL-KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESK 539
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1092 IRQLAQHVQSLHDE--RDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEEL 1169
Cdd:TIGR04523  540 ISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKEL 619
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 1039737300 1170 KHIKETHERVLEKKDqDLNEALVKMIALGSSLEETEIKLQEKE 1212
Cdd:TIGR04523  620 EKAKKENEKLSSIIK-NIKSKKNKLKQEVKQIKETIKEIRNKW 661
PH1_PLEKHH1_PLEKHH2 cd13282
Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 ...
507-595 1.56e-05

Pleckstrin homology (PH) domain containing, family H (with MyTH4 domain) members 1 and 2 (PLEKHH1) PH domain, repeat 1; PLEKHH1 and PLEKHH2 (also called PLEKHH1L) are thought to function in phospholipid binding and signal transduction. There are 3 Human PLEKHH genes: PLEKHH1, PLEKHH2, and PLEKHH3. There are many isoforms, the longest of which contain a FERM domain, a MyTH4 domain, two PH domains, a peroximal domain, a vacuolar domain, and a coiled coil stretch. The FERM domain has a cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain is part of the PH domain family. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241436  Cd Length: 96  Bit Score: 45.75  E-value: 1.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRD--SVAEEAAdldGEINLSTCYDVTeyPVQRNYGFQIHTKEGEFTLS 581
Cdd:cd13282      1 KAGYLTKL--GGKvktWKRRWFVLKNGELFYYKSpnDVIRKPQ---GQIALDGSCEIA--RAEGAQTFEIVTEKRTYYLT 73
                           90
                   ....*....|....
gi 1039737300  582 AMTSGIRRNWIQTI 595
Cdd:cd13282     74 ADSENDLDEWIRVI 87
mukB PRK04863
chromosome partition protein MukB;
780-1120 1.61e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 50.73  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  780 LEDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQtevatspsgawQRLHRVNQDL 859
Cdd:PRK04863   289 LELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQAASDHLNLVQTALRQQ-----------EKIERYQADL 357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  860 QsELEAQCRRQELITQQIQTLKHSYGEAKDAIrhhEAEIQTLQTRLGNAAAELAIKE----------QALAKLKGELK-- 927
Cdd:PRK04863   358 E-ELEERLEEQNEVVEEADEQQEENEARAEAA---EEEVDELKSQLADYQQALDVQQtraiqyqqavQALERAKQLCGlp 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  928 -MEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEA--RLLEKTQEL-----------------RDLETQQALQRDRQK 987
Cdd:PRK04863   434 dLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahSQFEQAYQLvrkiagevsrseawdvaRELLRRLREQRHLAE 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  988 EVQRLQECIAELSQQLGTSEQAQRLM---EKKLKRNYTL--LLESCEQEKQALLQNLKE----VEDKASAYEDQLQGHVQ 1058
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLaefCKRLGKNLDDedELEQLQEELEARLESLSEsvseARERRMALRQQLEQLQA 593
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 1059 QVEALQK---------EKLSETCkgsEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERV 1120
Cdd:PRK04863   594 RIQRLAArapawlaaqDALARLR---EQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEI 661
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
761-1227 1.89e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 50.35  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  761 ETTPLREEkQVPIAPLHLSLEDRSERLSTHELTSLLEKELEQ-----SQKEASDLLEQNRLLQDQLRVALGREQSAREGY 835
Cdd:TIGR00618  401 ELDILQRE-QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAElcaaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTKE 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  836 VLQTEVATSPSGAWQRLHRVnQDLQSELEAQCRRQELITQQIQTL------------KHSY-GEAKDAIRHHEAEIQTLQ 902
Cdd:TIGR00618  480 QIHLQETRKKAVVLARLLEL-QEEPCPLCGSCIHPNPARQDIDNPgpltrrmqrgeqTYAQlETSEEDVYHQLTSERKQR 558
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  903 TRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSgqlraseqklRSTEARLLEKTQELRDLETQQALQ 982
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLS----------EAEDMLACEQHALLRKLQPEQDLQ 628
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  983 RDRQKEvqrlQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQN-LKEVEDKASAYEDQLQGHVQQVE 1061
Cdd:TIGR00618  629 DVRLHL----QQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLaLQKMQSEKEQLTYWKEMLAQCQT 704
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1062 ALQKEKLSETcKGSEQVHKLEEELEAREASIR-QLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVD 1140
Cdd:TIGR00618  705 LLRELETHIE-EYDREFNEIENASSSLGSDLAaREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAELSHL 783
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1141 YQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECLRRFVS 1220
Cdd:TIGR00618  784 AAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQ 863

                   ....*..
gi 1039737300 1221 DSPKDAK 1227
Cdd:TIGR00618  864 LTQEQAK 870
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
755-1210 1.97e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 50.14  E-value: 1.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  755 QRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLE-KELEQSQKEASDLLEQNRLLQDQLRVALGREQSARE 833
Cdd:pfam07111  146 QRLHQEQLSSLTQAHEEALSSLTSKAEGLEKSLNSLETKRAGEaKQLAEAQKEAELLRKQLSKTQEELEAQVTLVESLRK 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  834 gYVLQTEVATSPSGAWqrlhrvnqdlqsELEaqcrRQELItQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELA 913
Cdd:pfam07111  226 -YVGEQVPPEVHSQTW------------ELE----RQELL-DTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELT 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  914 IKEQALAKLKGELKMeqgKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA-----LQR---DR 985
Cdd:pfam07111  288 RKIQPSDSLEPEFPK---KCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQVAELQEQVTSQSqeqaiLQRalqDK 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  986 QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLL------LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 1059
Cdd:pfam07111  365 AAEVEVERMSAKGLQMELSRAQEARRRQQQQTASAEEQLkfvvnaMSSTQIWLETTMTRVEQAVARIPSLSNRLSYAVRK 444
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1060 V---EALQKEKLS------ETCKGSEqvhKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERvATSDGDVAEL 1130
Cdd:pfam07111  445 VhtiKGLMARKVAlaqlrqESCPPPP---PAPPVDADLSLELEQLREERNRLDAELQLSAHLIQQEVGR-AREQGEAERQ 520
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 Q--EKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKM-IALGSSLEETEIK 1207
Cdd:pfam07111  521 QlsEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELTQQQEIYGQALQEKVAEVeTRLREQLSDTKRR 600

                   ...
gi 1039737300 1208 LQE 1210
Cdd:pfam07111  601 LNE 603
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
781-1213 2.10e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 50.04  E-value: 2.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  781 EDRSERLSTHELTSLLEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYvlqTEVATSPSGAWQRLHRVNQDLQ 860
Cdd:PRK02224   293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEELREEAAELE 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  861 SELEAqCRRQelitqqiqtlkhsYGEAKDAIRHHEAEIQTLQTRLGNAAAELaikeQALAKLKGELKMEQGKVREQLEEw 940
Cdd:PRK02224   370 SELEE-AREA-------------VEDRREEIEELEEEIEELRERFGDAPVDL----GNAEDFLEELREERDELREREAE- 430
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  941 qhskamLSGQLRASEQKLRSTEaRLLEK------TQELRDLETQQALQRDRQKevqrlqecIAELSQQLGTSEQAQRLME 1014
Cdd:PRK02224   431 ------LEATLRTARERVEEAE-ALLEAgkcpecGQPVEGSPHVETIEEDRER--------VEELEAELEDLEEEVEEVE 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1015 KKLKRNYTllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQvhklEEELEAREASIRQ 1094
Cdd:PRK02224   496 ERLERAED--LVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEK----REAAAEAEEEAEE 569
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1095 LAQHVQSLHDERDLIKHQFQELmERVATSDGDVAELQ---EKLRGKEVDYQNLE-HSHHRVSVQLQSVRTLLREKE---- 1166
Cdd:PRK02224   570 AREEVAELNSKLAELKERIESL-ERIRTLLAAIADAEdeiERLREKREALAELNdERRERLAEKRERKRELEAEFDeari 648
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 1167 EELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEE 1213
Cdd:PRK02224   649 EEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEE 695
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
796-1009 2.22e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.38  E-value: 2.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLrvalgrEQSAREGYVLQTEVATspsgAWQRLHRVNQDLQSELEAQCRRQELITQ 875
Cdd:COG4942     39 LEKELAALKKEEKALLKQLAALERRI------AALARRIRALEQELAA----LEAELAELEKEIAELRAELEAQKEELAE 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTL----KHSY-------GEAKDAIRHHEAeIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSK 944
Cdd:COG4942    109 LLRALyrlgRQPPlalllspEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEER 187
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300  945 AMLSGQLRASEQKLRSTEARLLEKTQELRDLetqqalqrdrQKEVQRLQECIAELSQQLGTSEQA 1009
Cdd:COG4942    188 AALEALKAERQKLLARLEKELAELAAELAEL----------QQEAEELEALIARLEAEAAAAAER 242
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
963-1213 2.52e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 49.91  E-value: 2.52e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  963 ARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytlllescEQEKQALLQNLKEV 1042
Cdd:COG4913    194 LRLLHKTQSFKPIGDLDDFVREYMLEEPDTFEAADALVEHFDDLERAHEALED-------------AREQIELLEPIREL 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1043 EDKASAYEDQLQGHVQQVEALQKEKlsetckGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVAT 1122
Cdd:COG4913    261 AERYAAARERLAELEYLRAALRLWF------AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRG 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1123 SDGD-VAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSL 1201
Cdd:COG4913    335 NGGDrLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
                          250
                   ....*....|..
gi 1039737300 1202 EETEIKLQEKEE 1213
Cdd:COG4913    415 RDLRRELRELEA 426
PH_SWAP-70 cd13273
Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called ...
507-595 3.10e-05

Switch-associated protein-70 Pleckstrin homology (PH) domain; SWAP-70 (also called Differentially expressed in FDCP 6/DEF-6 or IRF4-binding protein) functions in cellular signal transduction pathways (in conjunction with Rac), regulates cell motility through actin rearrangement, and contributes to the transformation and invasion activity of mouse embryo fibroblasts. Metazoan SWAP-70 is found in B lymphocytes, mast cells, and in a variety of organs. Metazoan SWAP-70 contains an N-terminal EF-hand motif, a centrally located PH domain, and a C-terminal coiled-coil domain. The PH domain of Metazoan SWAP-70 contains a phosphoinositide-binding site and a nuclear localization signal (NLS), which localize SWAP-70 to the plasma membrane and nucleus, respectively. The NLS is a sequence of four Lys residues located at the N-terminus of the C-terminal a-helix; this is a unique characteristic of the Metazoan SWAP-70 PH domain. The SWAP-70 PH domain binds PtdIns(3,4,5)P3 and PtdIns(4,5)P2 embedded in lipid bilayer vesicles. There are additional plant SWAP70 proteins, but these are not included in this hierarchy. Rice SWAP70 (OsSWAP70) exhibits GEF activity toward the its Rho GTPase, OsRac1, and regulates chitin-induced production of reactive oxygen species and defense gene expression in rice. Arabidopsis SWAP70 (AtSWAP70) plays a role in both PAMP- and effector-triggered immunity. Plant SWAP70 contains both DH and PH domains, but their arrangement is the reverse of that in typical DH-PH-type Rho GEFs, wherein the DH domain is flanked by a C-terminal PH domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270092  Cd Length: 110  Bit Score: 44.98  E-value: 3.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQ-YEDGQWKKHWFVLADQSLRYYrdsVAEEAADLDGEINL--STCYDVTEYPVQRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13273     10 KKGYLWKKgHLLPTWTERWFVLKPNSLSYY---KSEDLKEKKGEIALdsNCCVESLPDREGKKCRFLVKTPDKTYELSAS 86
                           90
                   ....*....|..
gi 1039737300  584 TSGIRRNWIQTI 595
Cdd:cd13273     87 DHKTRQEWIAAI 98
PH_Osh1p_Osh2p_yeast cd13292
Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p ...
508-599 3.13e-05

Yeast oxysterol binding protein homologs 1 and 2 Pleckstrin homology (PH) domain; Yeast Osh1p is proposed to function in postsynthetic sterol regulation, piecemeal microautophagy of the nucleus, and cell polarity establishment. Yeast Osh2p is proposed to function in sterol metabolism and cell polarity establishment. Both Osh1p and Osh2p contain 3 N-terminal ankyrin repeats, a PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBP andOsh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. In general OSBPs and ORPs have been found to be involved in the transport and metabolism of cholesterol and related lipids in eukaryotes. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241446  Cd Length: 103  Bit Score: 44.99  E-value: 3.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  508 KGWLTK--QYEDGqWKKHWFVLADQSLRYYRDSVAEEAAdLDGEINLSTCYDVteYPVQRNYGFQIHTKEG---EFTLSA 582
Cdd:cd13292      5 KGYLKKwtNYAKG-YKTRWFVLEDGVLSYYRHQDDEGSA-CRGSINMKNARLV--SDPSEKLRFEVSSKTSgspKWYLKA 80
                           90
                   ....*....|....*..
gi 1039737300  583 MTSGIRRNWIQTIMKHV 599
Cdd:cd13292     81 NHPVEAARWIQALQKAI 97
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2041-2271 4.53e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.22  E-value: 4.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2041 LAPGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLKATcERGFAAMEeth 2120
Cdd:COG4942     10 LLALAAAAQADAAAEAEAE-LEQLQQEIAELEKELAALKKE--EKALLKQLAALERRIAALARRIRAL-EQELAALE--- 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2121 qKKIEDLQRQH---QRELEKLREEKDRLLA--------EETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEA 2189
Cdd:COG4942     83 -AELAELEKEIaelRAELEAQKEELAELLRalyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAE 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2190 LRR------QYLEELQSVQRELE----VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEIT 2259
Cdd:COG4942    162 LAAlraeleAERAELEALLAELEeeraALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
                          250
                   ....*....|..
gi 1039737300 2260 RLRTLLTGDGGG 2271
Cdd:COG4942    242 RTPAAGFAALKG 253
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1865-2349 4.65e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 48.95  E-value: 4.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1865 ACQEAKGASGQKRAQAVGALKEEYEELLHKQKsEYQKVITLIEKENTELKAKVSqmdhqqrclqEAENKHSESMFALqgr 1944
Cdd:pfam05483  198 AFEELRVQAENARLEMHFKLKEDHEKIQHLEE-EYKKEINDKEKQVSLLLIQIT----------EKENKMKDLTFLL--- 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1945 yeEEIRCMVEQLSHtENTLQAERSRVLSQ----LDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEH 2020
Cdd:pfam05483  264 --EESRDKANQLEE-KTKLQDENLKELIEkkdhLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEEL 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2021 YIWSLRGALSLYQPSHPDSSLAPG-PSEPRAVPAAKD-------EAESMSGLRERIQELEAQMGVMREELgHKELEGDVA 2092
Cdd:pfam05483  341 NKAKAAHSFVVTEFEATTCSLEELlRTEQQRLEKNEDqlkiitmELQKKSSELEEMTKFKNNKEVELEEL-KKILAEDEK 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2093 ALQEKYQRD--FESLKATcERGFAAMEETHQKKIEDLQRQ----------HQRELEKLREE--KDRLLAEETAATISAIE 2158
Cdd:pfam05483  420 LLDEKKQFEkiAEELKGK-EQELIFLLQAREKEIHDLEIQltaiktseehYLKEVEDLKTEleKEKLKNIELTAHCDKLL 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2159 AMKNAHREE---MERELEKSQRSQISSINSDIEALRR-QYLEELQSVQR-ELEVLSEQYSQ-----KCLENAHLAQALEA 2228
Cdd:pfam05483  499 LENKELTQEasdMTLELKKHQEDIINCKKQEERMLKQiENLEEKEMNLRdELESVREEFIQkgdevKCKLDKSEENARSI 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2229 ERQALRQCQRENQELNAHNQ-----ELNNRLAAEITRLRTLLTGDGGGESTGLpltqgkDAYELEVllRVKESEIQYLKQ 2303
Cdd:pfam05483  579 EYEVLKKEKQMKILENKCNNlkkqiENKNKNIEELHQENKALKKKGSAENKQL------NAYEIKV--NKLELELASAKQ 650
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 2304 EISSLKDELQTALRDKKYASDKykdIYTELSIAKAKADCDISRLKE 2349
Cdd:pfam05483  651 KFEEIIDNYQKEIEDKKISEEK---LLEEVEKAKAIADEAVKLQKE 693
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1883-2364 4.69e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 4.69e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1883 ALKEEYEELLhKQKSEYQKVITLIEKENTELKAKVSQmdhqqrcLQEAENKHSESMFALQGRYE--EEIRCMVEQLSHTE 1960
Cdd:PRK03918   176 RRIERLEKFI-KRTENIEELIKEKEKELEEVLREINE-------ISSELPELREELEKLEKEVKelEELKEEIEELEKEL 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1961 NTLQAErsrvLSQLDASVKDRQAMEQHHVQQMKMLEDrfqlKVRELqavhqEELRALQEHYIwSLRGALSLY--QPSHPD 2038
Cdd:PRK03918   248 ESLEGS----KRKLEEKIRELEERIEELKKEIEELEE----KVKEL-----KELKEKAEEYI-KLSEFYEEYldELREIE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2039 SSLAPGPSEPRAVPAAKDEAESMSglrERIQELEAQMGVMREELGhkELEGDVAALQEKYQRDFESLKATCERGFAAMEE 2118
Cdd:PRK03918   314 KRLSRLEEEINGIEERIKELEEKE---ERLEELKKKLKELEKRLE--ELEERHELYEEAKAKKEELERLKKRLTGLTPEK 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2119 ThQKKIEDLQRQH---QRELEKLREEKDRLLAEEtAATISAIEAMKNAHR----------EEMERELEKSQRSQISSINS 2185
Cdd:PRK03918   389 L-EKELEELEKAKeeiEEEISKITARIGELKKEI-KELKKAIEELKKAKGkcpvcgreltEEHRKELLEEYTAELKRIEK 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2186 DIEALRRQyLEELQSVQRELEVLSEQYSQkclenahlaqaLEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLL 2265
Cdd:PRK03918   467 ELKEIEEK-ERKLRKELRELEKVLKKESE-----------LIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKL 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2266 TGDGGgESTGLpLTQGKDAYELEVLLRVKESEIQYLKQEISSLK-----------DELQTALRDKKYASDKY---KDIYT 2331
Cdd:PRK03918   535 IKLKG-EIKSL-KKELEKLEELKKKLAELEKKLDELEEELAELLkeleelgfesvEELEERLKELEPFYNEYlelKDAEK 612
                          490       500       510
                   ....*....|....*....|....*....|...
gi 1039737300 2332 ELSIAKAKadcdISRLKEQLKAATEALGEKSPE 2364
Cdd:PRK03918   613 ELEREEKE----LKKLEEELDKAFEELAETEKR 641
PH_Gab-like cd13324
Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are ...
509-595 4.92e-05

Grb2-associated binding protein family Pleckstrin homology (PH) domain; Gab proteins are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. There are 3 families: Gab1, Gab2, and Gab3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270133  Cd Length: 112  Bit Score: 44.71  E-value: 4.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  509 GWLTK-----QYEDGQWKKHWFVL------ADQS-LRYYRDsvaEEAADLDGEINLSTCYDVT-----EYPVQRN-YGFQ 570
Cdd:cd13324      5 GWLTKsppekKIWRAAWRRRWFVLrsgrlsGGQDvLEYYTD---DHCKKLKGIIDLDQCEQVDagltfEKKKFKNqFIFD 81
                           90       100
                   ....*....|....*....|....*
gi 1039737300  571 IHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd13324     82 IRTPKRTYYLVAETEEEMNKWVRCI 106
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
973-1221 5.78e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 48.51  E-value: 5.78e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  973 RDLETQQALqRDRQKEVQRLQECIAELSQQLGT----SEQAQRLMEKKLKRNytlllescEQEKQALLQNLKEVEDKASA 1048
Cdd:TIGR02168  173 RRKETERKL-ERTRENLDRLEDILNELERQLKSlerqAEKAERYKELKAELR--------ELELALLVLRLEELREELEE 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1049 YEDQLQGHVQQVEALQKEKlsetckgseqvhkleeelEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVA 1128
Cdd:TIGR02168  244 LQEELKEAEEELEELTAEL------------------QELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1129 ELQEKLRgkevdyqNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHErVLEKKDQDLNEALVKMIALgssLEETEIKL 1208
Cdd:TIGR02168  306 ILRERLA-------NLERQLEELEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAE---LEELESRL 374
                          250
                   ....*....|...
gi 1039737300 1209 QEKEECLRRFVSD 1221
Cdd:TIGR02168  375 EELEEQLETLRSK 387
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
2064-2262 6.97e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.09  E-value: 6.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2064 LRERIQELEAQMGVMREELghKELEGDVAALQEKYQrdfeslkatcERGFAAMEETHQKKIEDLQRQhqreLEKLREEKD 2143
Cdd:COG3206    173 ARKALEFLEEQLPELRKEL--EEAEAALEEFRQKNG----------LVDLSEEAKLLLQQLSELESQ----LAEARAELA 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2144 RLLAEETAATiSAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEE---LQSVQRELEVLSEQYSQkclENA 2220
Cdd:COG3206    237 EAEARLAALR-AQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQ---EAQ 312
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1039737300 2221 HLAQALEAERQALRQCQRE-NQELNAHNQELN--NRLAAEITRLR 2262
Cdd:COG3206    313 RILASLEAELEALQAREASlQAQLAQLEARLAelPELEAELRRLE 357
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2059-2324 7.19e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 48.53  E-value: 7.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2059 ESMSGLRERIQELEAQMGVMREELGH------KELEGDVAALQEKYqRDFESLKATCERGFAAMEEtHQKKIEDLQRQHQ 2132
Cdd:TIGR02169  251 EELEKLTEEISELEKRLEEIEQLLEElnkkikDLGEEEQLRVKEKI-GELEAEIASLERSIAEKER-ELEDAEERLAKLE 328
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEKDRL---LAEETAATISAIEAMKNAHREEMEReleksqRSQISSINSDIEALRR---QYLEELQSVQRELE 2206
Cdd:TIGR02169  329 AEIDKLLAEIEELereIEEERKRRDKLTEEYAELKEELEDL------RAELEEVDKEFAETRDelkDYREKLEKLKREIN 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2207 VLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTlltgdgggestglpLTQGKDAYE 2286
Cdd:TIGR02169  403 ELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ--------------LAADLSKYE 468
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1039737300 2287 LEvlLRVKESEIQYLKQEISSLKDELQTALRDKKYASD 2324
Cdd:TIGR02169  469 QE--LYDLKEEYDRVEKELSKLQRELAEAEAQARASEE 504
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
857-1071 7.33e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 47.52  E-value: 7.33e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLkgelkmeqgkvREQ 936
Cdd:COG3883     19 QAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER-----------REE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  937 LEEWQHSKAMLSGQLRASEQKLRSTE-ARLLEKTQELRDL-ETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLME 1014
Cdd:COG3883     88 LGERARALYRSGGSVSYLDVLLGSESfSDFLDRLSALSKIaDADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 1015 KKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 1071
Cdd:COG3883    168 AAKAE-----LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAA 219
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
766-1215 7.47e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.50  E-value: 7.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  766 REEKQVPIAPLHLSLEDRSERlSTHELTSLLEKELEQSQKEASDLLEQNRLLQdqlrVALGREQSARegyVLQTEVAtsp 845
Cdd:TIGR00606  724 RRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQKVNRDIQRLKNDIEEQETLLG----TIMPEEESAK---VCLTDVT--- 792
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  846 sgawqrlhrVNQDLQSELEAQCRRqelITQQIQTLKHSygeakDAIRHHEAEIQTLQTrlgnaaaelaiKEQALAKLKGE 925
Cdd:TIGR00606  793 ---------IMERFQMELKDVERK---IAQQAAKLQGS-----DLDRTVQQVNQEKQE-----------KQHELDTVVSK 844
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  926 LKMEQGKVREQLEEWQHskamlsgqLRASEQKLRStearllEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGT 1005
Cdd:TIGR00606  845 IELNRKLIQDQQEQIQH--------LKSKTNELKS------EKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNYTLLLESCEQEKQAL--LQNLKEVEDKASAYEDQLQGHVQQ-VEALQKEKLSETCKGSEQVHKLE 1082
Cdd:TIGR00606  911 DSPLETFLEKDQQEKEELISSKETSNKKAQdkVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECE 990
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1083 EELEAREASIRQLAQHVQSLHDERDLIKHQF---------QELMERVATSDGDVAELQekLRGKEVDYQNLEHSHHRVSV 1153
Cdd:TIGR00606  991 KHQEKINEDMRLMRQDIDTQKIQERWLQDNLtlrkrenelKEVEEELKQHLKEMGQMQ--VLQMKQEHQKLEENIDLIKR 1068
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1154 QLQSVRTLLREKEEELKHIKethERVLEKKDQDLNEALVKMIALGSSLEETEIKLQEKEECL 1215
Cdd:TIGR00606 1069 NHVLALGRQKGYEKEIKHFK---KELREPQFRDAEEKYREMMIVMRTTELVNKDLDIYYKTL 1127
PH_DOCK-D cd13267
Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...
48-145 8.14e-05

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also called Zizimin subfamily) consists of Dock9/Zizimin1, Dock10/Zizimin3, and Dock11/Zizimin2. DOCK-D has a N-terminal DUF3398 domain, a PH-like domain, a Dock Homology Region 1, DHR1 (also called CZH1), a C2 domain, and a C-terminal DHR2 domain (also called CZH2). Zizimin1 is enriched in the brain, lung, and kidney; zizimin2 is found in B and T lymphocytes, and zizimin3 is enriched in brain, lung, spleen and thymus. Zizimin1 functions in autoinhibition and membrane targeting. Zizimin2 is an immune-related and age-regulated guanine nucleotide exchange factor, which facilitates filopodial formation through activation of Cdc42, which results in activation of cell migration. No function has been determined for Zizimin3 to date. The N-terminal half of zizimin1 binds to the GEF domain through three distinct areas, including CZH1, to inhibit the interaction with Cdc42. In addition its PH domain binds phosphoinositides and mediates zizimin1 membrane targeting. DOCK is a family of proteins involved in intracellular signalling networks. They act as guanine nucleotide exchange factors for small G proteins of the Rho family, such as Rac and Cdc42. There are 4 subfamilies of DOCK family proteins based on their sequence homology: A-D. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270087  Cd Length: 126  Bit Score: 44.24  E-value: 8.14e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   48 GWLLLAPDGTDFDNPVHRSRKWQRRFFILYEHG----LLRYALDEMPTTlPQGTINMNQCTDVVDGEARTGQKFSLCILT 123
Cdd:cd13267     10 GYLYKGPENSSDSFISLAMKSFKRRFFHLKQLVdgsyILEFYKDEKKKE-AKGTIFLDSCTGVVQNSKRRKFCFELRMQD 88
                           90       100
                   ....*....|....*....|..
gi 1039737300  124 PdKEHFIRAETKEIISGWLEML 145
Cdd:cd13267     89 K-KSYVLAAESEAEMDEWISKL 109
PH_evt cd13265
Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also ...
506-558 8.75e-05

Evectin Pleckstrin homology (PH) domain; There are 2 members of the evectin family (also called pleckstrin homology domain containing, family B): evt-1 (also called PLEKHB1) and evt-2 (also called PLEKHB2). evt-1 is specific to the nervous system, where it is expressed in photoreceptors and myelinating glia. evt-2 is widely expressed in both neural and nonneural tissues. Evectins possess a single N-terminal PH domain and a C-terminal hydrophobic region. evt-1 is thought to function as a mediator of post-Golgi trafficking in cells that produce large membrane-rich organelles. It is a candidate gene for the inherited human retinopathy autosomal dominant familial exudative vitreoretinopathy and a susceptibility gene for multiple sclerosis. evt-2 is essential for retrograde endosomal membrane transport from the plasma membrane (PM) to the Golgi. Two membrane trafficking pathways pass through recycling endosomes: a recycling pathway and a retrograde pathway that links the PM to the Golgi/ER. Its PH domain that is unique in that it specifically recognizes phosphatidylserine (PS), but not polyphosphoinositides. PS is an anionic phospholipid class in eukaryotic biomembranes, is highly enriched in the PM, and plays key roles in various physiological processes such as the coagulation cascade, recruitment and activation of signaling molecules, and clearance of apoptotic cells. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270085  Cd Length: 108  Bit Score: 43.83  E-value: 8.75e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  506 FKKGWLTKQYE-DGQWKKHWFVL-ADQSLRYYRDsvaEEAADLDGEINL-STCYDV 558
Cdd:cd13265      4 VKSGWLLRQSTiLKRWKKNWFVLyGDGNLVYYED---ETRREVEGRINMpRECRNI 56
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
780-1217 9.31e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 9.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  780 LEDRSERLSTheltslLEKELEQSQKEASDLLEQNRLLQD--QLRVALGREQSAREGYVLQtevatspsgawqrlhrvnq 857
Cdd:PRK03918   333 LEEKEERLEE------LKKKLKELEKRLEELEERHELYEEakAKKEELERLKKRLTGLTPE------------------- 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  858 DLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNA---AAELAikEQALAKLKGELKMEQGKVR 934
Cdd:PRK03918   388 KLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCpvcGRELT--EEHRKELLEEYTAELKRIE 465
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  935 EQLEEWQHSKAMLSGQLRASEQKLR--STEARLLEKTQELRDLETQqaLQRDRQKEVQRLQECIAELSQQLGTSEQAQRL 1012
Cdd:PRK03918   466 KELKEIEEKERKLRKELRELEKVLKkeSELIKLKELAEQLKELEEK--LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKS 543
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1013 MEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEalqkEKLSETCKGSEQVHKLEEELEAREASI 1092
Cdd:PRK03918   544 LKKELEK-----LEELKKKLAELEKKLDELEEELAELLKELEELGFESV----EELEERLKELEPFYNEYLELKDAEKEL 614
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1093 RQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL-----QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEE 1167
Cdd:PRK03918   615 EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELekkysEEEYEELREEYLELSRELAGLRAELEELEKRREEIKK 694
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1168 ELKHIKETHERVLEKKD--QDLNEALVKMIALGSSLEetEIKLQEKEECLRR 1217
Cdd:PRK03918   695 TLEKLKEELEEREKAKKelEKLEKALERVEELREKVK--KYKALLKERALSK 744
PTZ00121 PTZ00121
MAEBL; Provisional
1854-2258 9.73e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 48.21  E-value: 9.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1854 EYEKELRFYKKACQEAKGASGQKRAQAVGALKEE---YEELlhKQKSEYQKVITLIEKENTELKAKVSQMdhqqRCLQEA 1930
Cdd:PTZ00121  1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEakkADEA--KKKAEEAKKADEAKKKAEEAKKKADEA----KKAAEA 1508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1931 ENKHSESMFALQGRYEEEIRcMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRElqavh 2010
Cdd:PTZ00121  1509 KKKADEAKKAEEAKKADEAK-KAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRK----- 1582
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2011 QEELRALQEHYIwslrgalslyqpshpdsslapgpsepravpaakdeaesmsglrERIQELEAQMGVMREELGHKELEGD 2090
Cdd:PTZ00121  1583 AEEAKKAEEARI-------------------------------------------EEVMKLYEEEKKMKAEEAKKAEEAK 1619
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAALQEKYQrdfESLKATCERgFAAMEETHQKKIEDLQRQHqrELEKLREEKDRLLAEETAATISAIEAMKNAHREEMER 2170
Cdd:PTZ00121  1620 IKAEELKKA---EEEKKKVEQ-LKKKEAEEKKKAEELKKAE--EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA 1693
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2171 ELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQEL 2250
Cdd:PTZ00121  1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773

                   ....*...
gi 1039737300 2251 NNRLAAEI 2258
Cdd:PTZ00121  1774 RKEKEAVI 1781
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1947-2359 1.06e-04

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 47.91  E-value: 1.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1947 EEIRCMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHyIWSLR 2026
Cdd:pfam12128  237 MKIRPEFTKLQQEFNTLESAELR-LSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGE-LSAAD 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2027 GALSLYQpSHPDSSlapgpsEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElgHKELEGDVAALQEKYQRDFESLK 2106
Cdd:pfam12128  315 AAVAKDR-SELEAL------EDQHGAFLDADIETAAADQEQLPSWQSELENLEER--LKALTGKHQDVTAKYNRRRSKIK 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2107 ATCERGFAAMEEthqkkiedlqrqhqrELEKLREEKDRLLAEETAatisAIEAMKNAHREEME------RELEKSQRSQI 2180
Cdd:pfam12128  386 EQNNRDIAGIKD---------------KLAKIREARDRQLAVAED----DLQALESELREQLEagklefNEEEYRLKSRL 446
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2181 SSIN---------SDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELN 2251
Cdd:pfam12128  447 GELKlrlnqatatPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELE 526
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2252 NRLAAEITRLRTLLTGDGGG--ESTG----------------LPLTQGKDAYEL-EVLLRVKESEIQ---YLKQEISSLK 2309
Cdd:pfam12128  527 LQLFPQAGTLLHFLRKEAPDweQSIGkvispellhrtdldpeVWDGSVGGELNLyGVKLDLKRIDVPewaASEEELRERL 606
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2310 DELQTALRDkkyASDKYKDIYTELSIAKA---KADCDISRLKEQLKAATEALG 2359
Cdd:pfam12128  607 DKAEEALQS---AREKQAAAEEQLVQANGeleKASREETFARTALKNARLDLR 656
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2066-2242 1.09e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 47.43  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2066 ERIQELEA-QMGVMRE-ELGHKELEG--DVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKLREE 2141
Cdd:pfam17380  375 SRMRELERlQMERQQKnERVRQELEAarKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLE 454
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2142 KdrLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISS---INSDIEALRRQYLEELQS---VQRELE-----VLSE 2210
Cdd:pfam17380  455 E--QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRrkiLEKELEERKQAMIEEERKrklLEKEMEerqkaIYEE 532
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1039737300 2211 QYSQKCLENAHLAQALEAERQALRQCQRENQE 2242
Cdd:pfam17380  533 ERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
851-1217 1.15e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.45  E-value: 1.15e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  851 RLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKE--QALAKLKGELKM 928
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  929 EQGKV---REQLEEWQHskamLSGQLRASEQKLRSTEARlLEKTQELRDLETQQALQrDRQKEVQRLQECIAELSQQLGT 1005
Cdd:COG4717    144 LPERLeelEERLEELRE----LEEELEELEAELAELQEE-LEELLEQLSLATEEELQ-DLAEELEELQQRLAELEEELEE 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1006 SEQAQRLMEKKLKRNYTLLLESCEQEKQ--------------ALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSET 1071
Cdd:COG4717    218 AQEELEELEEELEQLENELEAAALEERLkearlllliaaallALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREK 297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1072 CKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRV 1151
Cdd:COG4717    298 ASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLA 377
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1152 SVQLQSvRTLLREKEEELKHIKETHERVLEKKDQdLNEALVKMIALGSSLEETEIK--LQEKEECLRR 1217
Cdd:COG4717    378 EAGVED-EEELRAALEQAEEYQELKEELEELEEQ-LEELLGELEELLEALDEEELEeeLEELEEELEE 443
PH_TBC1D2A cd01265
TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1 ...
67-145 1.21e-04

TBC1 domain family member 2A pleckstrin homology (PH) domain; TBC1D2A (also called PARIS-1/Prostate antigen recognized and identified by SEREX 1 and ARMUS) contains a PH domain and a TBC-type GTPase catalytic domain. TBC1D2A integrates signaling between Arf6, Rac1, and Rab7 during junction disassembly. Activated Rac1 recruits TBC1D2A to locally inactivate Rab7 via its C-terminal TBC/RabGAP domain and facilitate E-cadherin degradation in lysosomes. The TBC1D2A PH domain mediates localization at cell-cell contacts and coprecipitates with cadherin complexes. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269966  Cd Length: 102  Bit Score: 43.08  E-value: 1.21e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300   67 RKWQRRFFILYEHGLLRYALDEMPTTLPQGTINMNQCTDVVDGEARTGQkFSlcILTPDKEHFIRAETKEIISGWLEML 145
Cdd:cd01265     17 KGWKRRWFVLDESKCQLYYYRSPQDATPLGSIDLSGAAFSYDPEAEPGQ-FE--IHTPGRVHILKASTRQAMLYWLQAL 92
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1919-2337 1.37e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 47.42  E-value: 1.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1919 QMDHQQRCLQEAENKHSESMFAlqgRYEEEIRCMVEQLSHTeNTLQAERSRVLSQldaSVKDRQAmeqhHVQQMKMLEDR 1998
Cdd:pfam15921   60 ELDSPRKIIAYPGKEHIERVLE---EYSHQVKDLQRRLNES-NELHEKQKFYLRQ---SVIDLQT----KLQEMQMERDA 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1999 FqLKVRELQAVHQEELRALQEHYIWSLRGALSLYQPSHPDSSLAPGP------------SEPRAVPAAKDEAESmsglrE 2066
Cdd:pfam15921  129 M-ADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQlrkmmlshegvlQEIRSILVDFEEASG-----K 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELGH------KELEGDVAALQEK---YQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEK 2137
Cdd:pfam15921  203 KIYEHDSMSTMHFRSLGSaiskilRELDTEISYLKGRifpVEDQLEALKSESQNKIELLLQQHQDRIEQLISEHEVEITG 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKdrllaeetaatiSAIEAMKNAHREEME--RELEKSQRSQISSINSDIEALRRQYLEELQSVQRELE-VLSEQYSQ 2214
Cdd:pfam15921  283 LTEKA------------SSARSQANSIQSQLEiiQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdKIEELEKQ 350
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2215 KCLENAHLAQAlEAERQALRQ----CQRENQELNAHNQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEVl 2290
Cdd:pfam15921  351 LVLANSELTEA-RTERDQFSQesgnLDDQLQKLLADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEV- 428
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1039737300 2291 lRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAK 2337
Cdd:pfam15921  429 -QRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLESTK 474
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
796-1217 1.41e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 47.27  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSA-------REGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCR 868
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLkqlrariEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIE 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  869 RQ-ELITQQIQTLKHSYGEAKDAIRHHEAEIQTL--QTRLGN---AAAELAIKEQALAKLKGELKMEQGKVREQLEEWQH 942
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIeeQRRLLQtlhSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQ 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  943 SKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKR 1019
Cdd:TIGR00618  387 QKTTLTQKLQSLCKeldILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1020 NYTLLLEScEQEKQALLQNLKEVEDKASAYEDQLQG------------HVQQVEALQKEKLSETCKGSEQVHKLEEELEA 1087
Cdd:TIGR00618  467 SLKEREQQ-LQTKEQIHLQETRKKAVVLARLLELQEepcplcgscihpNPARQDIDNPGPLTRRMQRGEQTYAQLETSEE 545
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1088 REASIRQ-LAQHVQSLHDERDLIKHQFQELmervATSDGDVAELQEKLRGKEVDYQNL--EHSHHRVSVQLQSVRTLLRE 1164
Cdd:TIGR00618  546 DVYHQLTsERKQRASLKEQMQEIQQSFSIL----TQCDNRSKEDIPNLQNITVRLQDLteKLSEAEDMLACEQHALLRKL 621
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 1165 KEEELKHIKETHERVLEKKDQDLNEALVKmialgsslEETEIKLQEKEECLRR 1217
Cdd:TIGR00618  622 QPEQDLQDVRLHLQQCSQELALKLTALHA--------LQLTLTQERVREHALS 666
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
956-1159 1.52e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.22  E-value: 1.52e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQ-QLGTSEQAQRLMEKKLKRNyTLLLESCEQEKQA 1034
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAAlRLWFAQRRLELLEAELEEL-RAELARLEAELER 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1035 LLQNLKEVEDKASAYEDQLQGH-VQQVEALQKEklsetckgseqVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQF 1113
Cdd:COG4913    314 LEARLDALREELDELEAQIRGNgGDRLEQLERE-----------IERLERELEERERRRARLEALLAALGLPLPASAEEF 382
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1114 QEL----MERVATSDGDVAELQEKLRGKEVDYQNLEHSHHRVSVQLQSVR 1159
Cdd:COG4913    383 AALraeaAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLE 432
PH_AtPH1 cd13276
Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all ...
67-142 1.57e-04

Arabidopsis thaliana Pleckstrin homolog (PH) 1 (AtPH1) PH domain; AtPH1 is expressed in all plant tissue and is proposed to be the plant homolog of human pleckstrin. Pleckstrin consists of two PH domains separated by a linker region, while AtPH has a single PH domain with a short N-terminal extension. AtPH1 binds PtdIns3P specifically and is thought to be an adaptor molecule since it has no obvious catalytic functions. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270095  Cd Length: 106  Bit Score: 43.07  E-value: 1.57e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300   67 RKWQRRFFILYEHGLLRY-ALDEMPTTLPQGTINMNQCTDVVDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWL 142
Cdd:cd13276     13 KTWRRRWFVLKQGKLFWFkEPDVTPYSKPRGVIDLSKCLTVKSAEDATNKENAFELSTPEETFYFIADNEKEKEEWI 89
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2053-2210 1.60e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.22  E-value: 1.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAES-MSGLRERIQELEAQM---GVMREElghkELEGDVAALQEKY---QRDFESLKATCER-GFAAmeETHQKKI 2124
Cdd:COG4913    309 AELERLEArLDALREELDELEAQIrgnGGDRLE----QLEREIERLERELeerERRRARLEALLAAlGLPL--PASAEEF 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2125 EDLQRQHQRELEKLREEKDRLLAEETAAtISAIEAMKNAHREeMERELEkSQRSQISSINSDIEALRRQYLEELQSVQRE 2204
Cdd:COG4913    383 AALRAEAAALLEALEEELEALEEALAEA-EAALRDLRRELRE-LEAEIA-SLERRKSNIPARLLALRDALAEALGLDEAE 459

                   ....*.
gi 1039737300 2205 LEVLSE 2210
Cdd:COG4913    460 LPFVGE 465
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2133-2400 1.80e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.98  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEKDRLLAEEtaatiSAIEAMKnahrEEMERELEKSQRsQISSINSDIEALRRQyLEELQSVQRELEVLSEQY 2212
Cdd:PRK03918   172 KEIKRRIERLEKFIKRT-----ENIEELI----KEKEKELEEVLR-EINEISSELPELREE-LEKLEKEVKELEELKEEI 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2213 SQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRlAAEITRLRTLltgdgggestglpltqgKDAY-ELEVLL 2291
Cdd:PRK03918   241 EELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEK-VKELKELKEK-----------------AEEYiKLSEFY 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2292 RVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIyTELSIAKAKADCDISRLKEQLKAATEA---LGEKSPEGTTV 2368
Cdd:PRK03918   303 EEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERL-EELKKKLKELEKRLEELEERHELYEEAkakKEELERLKKRL 381
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1039737300 2369 SGYDIMKSKSNPDFLKKDRSCVTRQLRNIRSK 2400
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITAR 413
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
752-1054 2.10e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 46.98  E-value: 2.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  752 EIEQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLSTHELTSLLEK--ELEQSQKEASDLLEQNRLLQDQLRVALGREQ 829
Cdd:TIGR02169  699 RIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDlsSLEQEIENVKSELKELEARIEELEEDLHKLE 778
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  830 SAREGyvLQTEVATSPsgaWQRLhrvnQDLQSELEAQCRRQELITQQIQ------TLKHSYgeAKDAIRHHEAEIQTLQT 903
Cdd:TIGR02169  779 EALND--LEARLSHSR---IPEI----QAELSKLEEEVSRIEARLREIEqklnrlTLEKEY--LEKEIQELQEQRIDLKE 847
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  904 RLGNAAAELAIKEQALAKLKGELKMEQGKVRE---QLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQA 980
Cdd:TIGR02169  848 QIKSIEKEIENLNGKKEELEEELEELEAALRDlesRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300  981 LQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRnytllLESCEQEKQALLQNLKEVEDKASAYEDQLQ 1054
Cdd:TIGR02169  928 ALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEE-----IRALEPVNMLAIQEYEEVLKRLDELKEKRA 996
PH_Boi cd13316
Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally ...
508-597 2.21e-04

Boi family Pleckstrin homology domain; Yeast Boi proteins Boi1 and Boi2 are functionally redundant and important for cell growth with Boi mutants displaying defects in bud formation and in the maintenance of cell polarity.They appear to be linked to Rho-type GTPase, Cdc42 and Rho3. Boi1 and Boi2 display two-hybrid interactions with the GTP-bound ("active") form of Cdc42, while Rho3 can suppress of the lethality caused by deletion of Boi1 and Boi2. These findings suggest that Boi1 and Boi2 are targets of Cdc42 that promote cell growth in a manner that is regulated by Rho3. Boi proteins contain a N-terminal SH3 domain, followed by a SAM (sterile alpha motif) domain, a proline-rich region, which mediates binding to the second SH3 domain of Bem1, and C-terminal PH domain. The PH domain is essential for its function in cell growth and is important for localization to the bud, while the SH3 domain is needed for localization to the neck. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270126  Cd Length: 97  Bit Score: 42.36  E-value: 2.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  508 KGWLTKQYED-GQWKKHWFVLADQSLRYYRdsvAEEAADLDGEINLsTCYDVT----EYPVQRNYGFQI--HTKEGEFTL 580
Cdd:cd13316      3 SGWMKKRGERyGTWKTRYFVLKGTRLYYLK---SENDDKEKGLIDL-TGHRVVpddsNSPFRGSYGFKLvpPAVPKVHYF 78
                           90
                   ....*....|....*..
gi 1039737300  581 SAMTSGIRRNWIQTIMK 597
Cdd:cd13316     79 AVDEKEELREWMKALMK 95
PH_3BP2 cd13308
SH3 domain-binding protein 2 Pleckstrin homology (PH) domain; SH3BP2 (the gene that encodes ...
66-145 2.53e-04

SH3 domain-binding protein 2 Pleckstrin homology (PH) domain; SH3BP2 (the gene that encodes the adaptor protein 3BP2), HD, ITU, IT10C3, and ADD1 are located near the Huntington's Disease Gene on Human Chromosome 4pl6.3. SH3BP2 lies in a region that is often missing in individuals with Wolf-Hirschhorn syndrome (WHS). Gain of function mutations in SH3BP2 causes enhanced B-cell antigen receptor (BCR)-mediated activation of nuclear factor of activated T cells (NFAT), resulting in a rare, genetic disorder called cherubism. This results in an increase in the signaling complex formation with Syk, phospholipase C-gamma2 (PLC-gamma2), and Vav1. It was recently discovered that Tankyrase regulates 3BP2 stability through ADP-ribosylation and ubiquitylation by the E3-ubiquitin ligase. Cherubism mutations uncouple 3BP2 from Tankyrase-mediated protein destruction, which results in its stabilization and subsequent hyperactivation of the Src, Syk, and Vav signaling pathways. SH3BP2 is also a potential negative regulator of the abl oncogene. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270118  Cd Length: 113  Bit Score: 42.78  E-value: 2.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300   66 SRKWQRRFFILYEHGLLrYALDEMPTTlPQGTINMNQCTDVVDGEARTGQKFSLCILTPDKEH---FIRAETKEIISGWL 142
Cdd:cd13308     25 LQNWQLRYVIIHQGCVY-YYKNDQSAK-PKGVFSLNGYNRRAAEERTSKLKFVFKIIHLSPDHrtwYFAAKSEDEMSEWM 102

                   ...
gi 1039737300  143 EML 145
Cdd:cd13308    103 EYI 105
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2055-2235 2.84e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 46.30  E-value: 2.84e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2055 KDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCERgfaameethqkkIEDLQRQHQrE 2134
Cdd:COG4717     91 AELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPER------------LEELEERLE-E 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2135 LEKLREEKDRLLAEetaatisaiEAMKNAHREEMERELEKSQRSQISSINSDIEALR---RQYLEELQSVQRELEVLSEQ 2211
Cdd:COG4717    158 LRELEEELEELEAE---------LAELQEELEELLEQLSLATEEELQDLAEELEELQqrlAELEEELEEAQEELEELEEE 228
                          170       180
                   ....*....|....*....|....
gi 1039737300 2212 YSQkcLENAHLAQALEAERQALRQ 2235
Cdd:COG4717    229 LEQ--LENELEAAALEERLKEARL 250
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
745-1170 2.87e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 46.25  E-value: 2.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  745 KTQNVHVEIEQRWHQVETT--PLREEKQVPIAPLHLSLEDRSERLstHELTSLLEKELEQSQKEASDLLEQNRLLQDQLR 822
Cdd:pfam05483  180 ETRQVYMDLNNNIEKMILAfeELRVQAENARLEMHFKLKEDHEKI--QHLEEEYKKEINDKEKQVSLLLIQITEKENKMK 257
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  823 VALGREQSAREGyVLQTEVATSpsgawqrlhrvnqdLQSE-LEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTL 901
Cdd:pfam05483  258 DLTFLLEESRDK-ANQLEEKTK--------------LQDEnLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIA 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  902 QTRLGNAAAELAIKEQALAKLKGELKMeqgkvreQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR----DLET 977
Cdd:pfam05483  323 TKTICQLTEEKEAQMEELNKAKAAHSF-------VVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQkkssELEE 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  978 QQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKklkrnytllLESCEQEKQALLQNL-KEVED---KASAYEDQL 1053
Cdd:pfam05483  396 MTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEE---------LKGKEQELIFLLQAReKEIHDleiQLTAIKTSE 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1054 QGHVQQVEALQKEKLSETCKGSEqvhkleeeleareasirqLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEK 1133
Cdd:pfam05483  467 EHYLKEVEDLKTELEKEKLKNIE------------------LTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQ 528
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1039737300 1134 LRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELK 1170
Cdd:pfam05483  529 EERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVK 565
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
2053-2349 2.88e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 46.10  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCE-----RGFAAMEETHQKKIEDL 2127
Cdd:COG5185    269 KLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEAEQEleeskRETETGIQNLTAEIEQG 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRLLAEETAATisaieamknahREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEV 2207
Cdd:COG5185    349 QESLTENLEAIKEEIENIVGEVELSK-----------SSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2208 LSEQysqkclenahlaqaLEAERQALRQCQRENQElnahNQELNNRLAAEITRLRTLLTGDGggeSTGLPLTQGKDAYEL 2287
Cdd:COG5185    418 ADRQ--------------IEELQRQIEQATSSNEE----VSKLLNELISELNKVMREADEES---QSRLEEAYDEINRSV 476
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQT--ALRDKKYASDKYKDIYTELSIAKAKADCDISRLKE 2349
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLKATLEKlrAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILA 540
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1758-2235 3.11e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.91  E-value: 3.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1758 QEPLQALHQSPEVLAAIQDELAQQLREKASILEEISAALPVLPPTEPLGGCQRLLRMSQHlSYESCLEGLGQYSSLLVQd 1837
Cdd:COG4717     87 EEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAELPE-RLEELEERLEELRELEEE- 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1838 aiiqaqvcyaacriRLEYEKELRFYKKACQEAKGASGQKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV 1917
Cdd:COG4717    165 --------------LEELEAELAELQEELEELLEQLSLATEEELQDLAEELEEL-QQRLAELEEELEEAQEELEELEEEL 229
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1918 SQMDHQQRCLQEAENKHSESMFALqgryeeeIRCMVEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLED 1997
Cdd:COG4717    230 EQLENELEAAALEERLKEARLLLL-------IAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1998 RFQlkvrelQAVHQEELRALQEHYIWSLRGALSLyqpshpdsslaPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMgv 2077
Cdd:COG4717    303 EAE------ELQALPALEELEEEELEELLAALGL-----------PPDLSPEELLELLDRIEELQELLREAEELEEEL-- 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2078 mreelghkelegDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQR--QHQRELEKLREEKDRLLAEETAATIS 2155
Cdd:COG4717    364 ------------QLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEEleELEEQLEELLGELEELLEALDEEELE 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2156 AIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY-----LEELQSVQRELEVLSEQYSQKCLenahLAQALEAER 2230
Cdd:COG4717    432 EELEELEEELEELEEELEE-LREELAELEAELEQLEEDGelaelLQELEELKAELRELAEEWAALKL----ALELLEEAR 506

                   ....*
gi 1039737300 2231 QALRQ 2235
Cdd:COG4717    507 EEYRE 511
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
796-1215 3.42e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 46.26  E-value: 3.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNrllQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQelITQ 875
Cdd:pfam15921  247 LEALKSESQNKIELLLQQH---QDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQ--LSD 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  876 QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQAlaklKGELKMEQGKVREQLEEwqhskamLSGQLRASE 955
Cdd:pfam15921  322 LESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTE----RDQFSQESGNLDDQLQK-------LLADLHKRE 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  956 QKLRstearlLEKTQELR--DLETQQA-----LQR---DRQKEVQRLQ--------ECIAELSQQLGTSEQAQRLMEKkl 1017
Cdd:pfam15921  391 KELS------LEKEQNKRlwDRDTGNSitidhLRReldDRNMEVQRLEallkamksECQGQMERQMAAIQGKNESLEK-- 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1018 krnYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSE--QVHKLEEELEAREASIRQL 1095
Cdd:pfam15921  463 ---VSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEitKLRSRVDLKLQELQHLKNE 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1096 AQHVQSLHDERDLIKHQFQELMERVatsdgdvaelqEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLrEKE--------E 1167
Cdd:pfam15921  540 GDHLRNVQTECEALKLQMAEKDKVI-----------EILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEindrrlelQ 607
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 1168 ELKHIKETHE---RVLEKKDQDLNEALVKMIALGSS-LEETEIKLQEKEECL 1215
Cdd:pfam15921  608 EFKILKDKKDakiRELEARVSDLELEKVKLVNAGSErLRAVKDIKQERDQLL 659
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
761-1176 3.48e-04

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 46.26  E-value: 3.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  761 ETTPLREEKQVPIAPLH-----LSLE-DRSERLSTHELTSL-----LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQ 829
Cdd:pfam15921  371 ESGNLDDQLQKLLADLHkrekeLSLEkEQNKRLWDRDTGNSitidhLRRELDDRNMEVQRLEALLKAMKSECQGQMERQM 450
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  830 SAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGnaa 909
Cdd:pfam15921  451 AAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD--- 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  910 aelaIKEQALAKLKGE---------------LKM-EQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLlEKTQELR 973
Cdd:pfam15921  528 ----LKLQELQHLKNEgdhlrnvqtecealkLQMaEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQL-EKEINDR 602
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQQ--ALQRDRQKEVQRLQECIAEL-----------SQQLGT-----SEQAQRLMEKKLKRNYtllLESCEQEKQAL 1035
Cdd:pfam15921  603 RLELQEfkILKDKKDAKIRELEARVSDLelekvklvnagSERLRAvkdikQERDQLLNEVKTSRNE---LNSLSEDYEVL 679
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1036 LQNLKEVEDKASAYEDQLQGHVQ--QVEALQKEKLSETCKGSE------------QVHKLEEELEAREASIRQLAQHVQS 1101
Cdd:pfam15921  680 KRNFRNKSEEMETTTNKLKMQLKsaQSELEQTRNTLKSMEGSDghamkvamgmqkQITAKRGQIDALQSKIQFLEEAMTN 759
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1102 LHDERDLIKHQFQEL---MERVATSDGDVAELQEKLRGKEVDYQ----NLEHSHHRVSVQLQSVRTLLREKEEELKHIKE 1174
Cdd:pfam15921  760 ANKEKHFLKEEKNKLsqeLSTVATEKNKMAGELEVLRSQERRLKekvaNMEVALDKASLQFAECQDIIQRQEQESVRLKL 839

                   ..
gi 1039737300 1175 TH 1176
Cdd:pfam15921  840 QH 841
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
912-1168 3.63e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 45.53  E-value: 3.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  912 LAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQrdrQKEVQR 991
Cdd:COG4942     11 LALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL---EAELAE 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  992 LQECIAELSQQLGTSEQ--AQRL--MEKKLKRNYTLLLESCEqekqallqNLKEVEDKASAYEDQLQGHVQQVEALqkek 1067
Cdd:COG4942     88 LEKEIAELRAELEAQKEelAELLraLYRLGRQPPLALLLSPE--------DFLDAVRRLQYLKYLAPARREQAEEL---- 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1068 lsetckgseqvhklEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHS 1147
Cdd:COG4942    156 --------------RADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQE 221
                          250       260
                   ....*....|....*....|.
gi 1039737300 1148 HHRVSVQLQSVRTLLREKEEE 1168
Cdd:COG4942    222 AEELEALIARLEAEAAAAAER 242
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1854-2145 3.63e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 3.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1854 EYEKELRFYKKACQEAKGASGQKRaQAVGALKEEYEELLHKQKSEYQKvITLIEKENTELKAKVSQMDHQQRCLQEAENK 1933
Cdd:TIGR02168  702 ELRKELEELEEELEQLRKELEELS-RQISALRKDLARLEAEVEQLEER-IAQLSKELTELEAEIEELEERLEEAEEELAE 779
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1934 HSESMFALQGRYEEeircMVEQLSHTENTLQAERSRvLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVREL--QAVHQ 2011
Cdd:TIGR02168  780 AEAEIEELEAQIEQ----LKEELKALREALDELRAE-LTLLNEEAANLRERLESLERRIAATERRLEDLEEQIeeLSEDI 854
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2012 EELRALQEHYiWSLRGALSlyqpSHPDSSLAPGPSEPRAVPAAKDEAESMSG----LRERIQELEAQMGVMREELGH--- 2084
Cdd:TIGR02168  855 ESLAAEIEEL-EELIEELE----SELEALLNERASLEEALALLRSELEELSEelreLESKRSELRRELEELREKLAQlel 929
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300 2085 --KELEGDVAALQEK----YQRDFEslkatcergfaaMEETHQKKIEDLQRQHQRELEKLREEKDRL 2145
Cdd:TIGR02168  930 rlEGLEVRIDNLQERlseeYSLTLE------------EAEALENKIEDDEEEARRRLKRLENKIKEL 984
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
852-1217 4.33e-04

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.58  E-value: 4.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  852 LHRVNQDLQSELEAQCRRQELITQQIQT--------------LKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAE------ 911
Cdd:pfam10174  245 LERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevykshskfMKNKIDQLKQELSKKESELLALQTKLETLTNQnsdckq 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  912 --------LAIKEQALAKLKGE-----LKMEQ-----GKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELR 973
Cdd:pfam10174  325 hievlkesLTAKEQRAAILQTEvdalrLRLEEkesflNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  974 DLETQqalQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDkasaYEDQL 1053
Cdd:pfam10174  405 NLQEQ---LRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELES----LKKEN 477
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1054 QGHVQQVEALQKEKLSETCKGS---EQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDvAEL 1130
Cdd:pfam10174  478 KDLKEKVSALQPELTEKESSLIdlkEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKLENQLKKAHNAEEAVRTN-PEI 556
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEElKHIKETHERVLEK------KDQDLNEALVKMialgSSLEET 1204
Cdd:pfam10174  557 NDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENE-KNDKDKKIAELESltlrqmKEQNKKVANIKH----GQQEMK 631
                          410
                   ....*....|...
gi 1039737300 1205 EIKLQEKEECLRR 1217
Cdd:pfam10174  632 KKGAQLLEEARRR 644
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1717-2352 4.60e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.83  E-value: 4.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1717 ERVIQQILetlrhptgREDQVQTSWDQnpLGEILRPgTDGSQEPLQALHQSPEvlaAIQDELAQQLREKASILEEISAAL 1796
Cdd:PRK03918   148 EKVVRQIL--------GLDDYENAYKN--LGEVIKE-IKRRIERLEKFIKRTE---NIEELIKEKEKELEEVLREINEIS 213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1797 PVLPPT-EPLGGCQRLLRmsqhlSYESCLEGLgqySSLLVQDAIIQAQVCYAACRIRlEYEKELRFYKKACQEAKgaSGQ 1875
Cdd:PRK03918   214 SELPELrEELEKLEKEVK-----ELEELKEEI---EELEKELESLEGSKRKLEEKIR-ELEERIEELKKEIEELE--EKV 282
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1876 KRAQAVGALKEEYEElLHKQKSEYQKVITLIEKENTELKAKVSQMdhqQRCLQEAENKHSEsMFALQGRyEEEIRCMVEQ 1955
Cdd:PRK03918   283 KELKELKEKAEEYIK-LSEFYEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEER-LEELKKK-LKELEKRLEE 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1956 LSHTENTLQAERsRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALqEHYIWSLRGALSLYQPS 2035
Cdd:PRK03918   357 LEERHELYEEAK-AKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL-KKEIKELKKAIEELKKA 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2036 HPDSSL--APGPSEPRAVPAAKDEAEsMSGLRERIQELEAQMGVMREELghKELEGDVaalqeKYQRDFESLKATCERGF 2113
Cdd:PRK03918   435 KGKCPVcgRELTEEHRKELLEEYTAE-LKRIEKELKEIEEKERKLRKEL--RELEKVL-----KKESELIKLKELAEQLK 506
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2114 AAMEETHQKKIEDLQRQhQRELEKLREEKDRLLAE--ETAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALR 2191
Cdd:PRK03918   507 ELEEKLKKYNLEELEKK-AEEYEKLKEKLIKLKGEikSLKKELEKLEELKK-KLAELEKKLDELEE-ELAELLKELEELG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2192 RQYLEELQSVQRELEVLSEQYsqkcLENAHLAQALEAERQALRQCQRENQELNAHNQELNNR---LAAEITRLRTLLTGD 2268
Cdd:PRK03918   584 FESVEELEERLKELEPFYNEY----LELKDAEKELEREEKELKKLEEELDKAFEELAETEKRleeLRKELEELEKKYSEE 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2269 GGGESTGLPLtqgkdayELEVLLRVKESEIQYLKqeisSLKDELQTALRDKKYASDKYKDIYTEL-SIAKAKAdcDISRL 2347
Cdd:PRK03918   660 EYEELREEYL-------ELSRELAGLRAELEELE----KRREEIKKTLEKLKEELEEREKAKKELeKLEKALE--RVEEL 726

                   ....*
gi 1039737300 2348 KEQLK 2352
Cdd:PRK03918   727 REKVK 731
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1877-2339 4.68e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 45.53  E-value: 4.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1877 RAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQEAENKHSESMFALQGRYEE---EIRCMV 1953
Cdd:COG4717     44 RAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAE-LQEELEELEEELEELEAELEElreELEKLE 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1954 EQLSHTENTLQAER-SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQlKVRELQAVHQEELRALQEHYIWSLRGALSLY 2032
Cdd:COG4717    123 KLLQLLPLYQELEAlEAELAELPERLEELEERLEELRELEEELEELEA-ELAELQEELEELLEQLSLATEEELQDLAEEL 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2033 QpshpdsslapgpsepravpAAKDEAESmsgLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFESLKATCER- 2111
Cdd:COG4717    202 E-------------------ELQQRLAE---LEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALl 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2112 GFAAMEETHQKKIEDLQRQHQ-----RELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSD 2186
Cdd:COG4717    260 ALLGLGGSLLSLILTIAGVLFlvlglLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2187 IEALRRqyLEELQSVQRELEVLSEQYSQKCLE---NAHLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRT 2263
Cdd:COG4717    340 LELLDR--IEELQELLREAEELEEELQLEELEqeiAALLAEAGVEDEEELRAALEQAEEY----QELKEELEELEEQLEE 413
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300 2264 LLTGDGGGESTGLPLTQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTALRDKKYAsdkykDIYTELSIAKAK 2339
Cdd:COG4717    414 LLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELA-----ELLQELEELKAE 484
PH_OSBP_ORP4 cd13284
Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; ...
507-593 5.04e-04

Human Oxysterol binding protein and OSBP-related protein 4 Pleckstrin homology (PH) domain; Human OSBP is proposed to function is sterol-dependent regulation of ERK dephosphorylation and sphingomyelin synthesis as well as modulation of insulin signaling and hepatic lipogenesis. It contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. OSBPs and Osh1p PH domains specifically localize to the Golgi apparatus in a PtdIns4P-dependent manner. ORP4 is proposed to function in Vimentin-dependent sterol transport and/or signaling. Human ORP4 has 2 forms, a long (ORP4L) and a short (ORP4S). ORP4L contains a N-terminal PH domain, a FFAT motif (two phenylalanines in an acidic tract), and a C-terminal OSBP-related domain. ORP4S is truncated and contains only an OSBP-related domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. They are members of the oxysterol binding protein (OSBP) family which includes OSBP, OSBP-related proteins (ORP), Goodpasture antigen binding protein (GPBP), and Four phosphate adaptor protein 1 (FAPP1). They have a wide range of purported functions including sterol transport, cell cycle control, pollen development and vessicle transport from Golgi recognize both PI lipids and ARF proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270101  Cd Length: 99  Bit Score: 41.59  E-value: 5.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTK--QYEDGqWKKHWFVLADQSLRYYRdSVAEEAADLDGEINLSTCYDVTEYPVQrnygFQIHT-KEGEFTLSAM 583
Cdd:cd13284      1 MKGWLLKwtNYIKG-YQRRWFVLSNGLLSYYR-NQAEMAHTCRGTINLAGAEIHTEDSCN----FVISNgGTQTFHLKAS 74
                           90
                   ....*....|
gi 1039737300  584 TSGIRRNWIQ 593
Cdd:cd13284     75 SEVERQRWVT 84
PH_GRP1-like cd01252
General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 ...
507-542 5.27e-04

General Receptor for Phosphoinositides-1-like Pleckstrin homology (PH) domain; GRP1/cytohesin3 and the related proteins ARNO (ARF nucleotide-binding site opener)/cytohesin-2 and cytohesin-1 are ARF exchange factors that contain a pleckstrin homology (PH) domain thought to target these proteins to cell membranes through binding polyphosphoinositides. The PH domains of all three proteins exhibit relatively high affinity for PtdIns(3,4,5)P3. Within the Grp1 family, diglycine (2G) and triglycine (3G) splice variants, differing only in the number of glycine residues in the PH domain, strongly influence the affinity and specificity for phosphoinositides. The 2G variants selectively bind PtdIns(3,4,5)P3 with high affinity,the 3G variants bind PtdIns(3,4,5)P3 with about 30-fold lower affinity and require the polybasic region for plasma membrane targeting. These ARF-GEFs share a common, tripartite structure consisting of an N-terminal coiled-coil domain, a central domain with homology to the yeast protein Sec7, a PH domain, and a C-terminal polybasic region. The Sec7 domain is autoinhibited by conserved elements proximal to the PH domain. GRP1 binds to the DNA binding domain of certain nuclear receptors (TRalpha, TRbeta, AR, ER, but not RXR), and can repress thyroid hormone receptor (TR)-mediated transactivation by decreasing TR-complex formation on thyroid hormone response elements. ARNO promotes sequential activation of Arf6, Cdc42 and Rac1 and insulin secretion. Cytohesin acts as a PI 3-kinase effector mediating biological responses including cell spreading and adhesion, chemotaxis, protein trafficking, and cytoskeletal rearrangements, only some of which appear to depend on their ability to activate ARFs. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269954  Cd Length: 119  Bit Score: 41.92  E-value: 5.27e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1039737300  507 KKGWLTKQyeDGQ---WKKHWFVLADQSLRYYRDSVAEE 542
Cdd:cd01252      5 REGWLLKL--GGRvksWKRRWFILTDNCLYYFEYTTDKE 41
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2053-2315 5.45e-04

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.42  E-value: 5.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2053 AAKDEAEsmsgLRERIQELEAQMGVMREELGHKELEGDVAALQ-----------EKYQRDFESLKATCERGFAAMEETHQ 2121
Cdd:PRK02224   197 EEKEEKD----LHERLNGLESELAELDEEIERYEEQREQARETrdeadevleehEERREELETLEAEIEDLRETIAETER 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2122 KK--IEDLQRQHQRELEKLREEKDRLLAEE--TAATISAIEAMKN---AHREEMERELEKsQRSQISSINSDIEALRrqy 2194
Cdd:PRK02224   273 EReeLAEEVRDLRERLEELEEERDDLLAEAglDDADAEAVEARREeleDRDEELRDRLEE-CRVAAQAHNEEAESLR--- 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2195 leelqsvqRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAaeitrlrtlltgdgggest 2274
Cdd:PRK02224   349 --------EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFG------------------- 401
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1039737300 2275 GLPLTQGKDAYELEVLLrvkeSEIQYLKQEISSLKDELQTA 2315
Cdd:PRK02224   402 DAPVDLGNAEDFLEELR----EERDELREREAELEATLRTA 438
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
908-1183 5.91e-04

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 45.34  E-value: 5.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  908 AAAELAIKEQALAK---LKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRD 984
Cdd:TIGR00618  182 ALMEFAKKKSLHGKaelLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQL 261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  985 RQKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQ 1064
Cdd:TIGR00618  262 LKQLRARIEELRAQEAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIE 341
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1065 KEKLSETCKGSEQVH------------KLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSD-------- 1124
Cdd:TIGR00618  342 EQRRLLQTLHSQEIHirdahevatsirEISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDtrtsafrd 421
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 1125 --GDVAEL-------QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKK 1183
Cdd:TIGR00618  422 lqGQLAHAkkqqelqQRYAELCAAAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEQIHLQETRKK 489
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2128-2409 5.91e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.44  E-value: 5.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2128 QRQHQRELEKLREEKDRLLAEEtAATISAIEAMKNaHREEMERELEKSQRsQISSINSDIEALrrqyLEELQSVQRELEV 2207
Cdd:TIGR02169  669 SRSEPAELQRLRERLEGLKREL-SSLQSELRRIEN-RLDELSQELSDASR-KIGEIEKEIEQL----EQEEEKLKERLEE 741
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2208 LSEQYSQkclenahLAQALEAERQALRQCQRENQELnahnQELNNRLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYEL 2287
Cdd:TIGR02169  742 LEEDLSS-------LEQEIENVKSELKELEARIEEL----EEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRI 810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2288 EVLLRVKESEIQYLKQEISSLKDELQTALRDKKYASDKYKDIYTELSIAKAKadcdISRLKEQLKAATEALgekspegtt 2367
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK----KEELEEELEELEAAL--------- 877
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1039737300 2368 vsgYDIMKSKSNpdfLKKDRSCVTRQLRNIRSKsvIEQVSWD 2409
Cdd:TIGR02169  878 ---RDLESRLGD---LKKERDELEAQLRELERK--IEELEAQ 911
46 PHA02562
endonuclease subunit; Provisional
852-1077 6.25e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.01  E-value: 6.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  852 LHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKlkgeLKMEQG 931
Cdd:PHA02562   190 IDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNK----LNTAAA 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLS--GQLRASEQKLRSTEARLLEKTQELRDLETQ-QALQRDRQKEVQRLQEcIAELSQQLgtseq 1008
Cdd:PHA02562   266 KIKSKIEQFQKVIKMYEkgGVCPTCTQQISEGPDRITKIKDKLKELQHSlEKLDTAIDELEEIMDE-FNEQSKKL----- 339
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 1009 aqrlmeKKLKRNYtlllESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKE--KLSETCKGSEQ 1077
Cdd:PHA02562   340 ------LELKNKI----STNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDEldKIVKTKSELVK 400
PH_Btk cd01238
Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of ...
507-595 6.67e-04

Bruton's tyrosine kinase pleckstrin homology (PH) domain; Btk is a member of the Tec family of cytoplasmic protein tyrosine kinases that includes BMX, IL2-inducible T-cell kinase (Itk) and Tec. Btk plays a role in the maturation of B cells. Tec proteins general have an N-terminal PH domain, followed by a Tek homology (TH) domain, a SH3 domain, a SH2 domain and a kinase domain. The Btk PH domain binds phosphatidylinositol 3,4,5-trisphosphate and responds to signalling via phosphatidylinositol 3-kinase. The PH domain is also involved in membrane anchoring which is confirmed by the discovery of a mutation of a critical arginine residue in the BTK PH domain. This results in severe human immunodeficiency known as X-linked agammaglobulinemia (XLA) in humans and a related disorder is mice.PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269944 [Multi-domain]  Cd Length: 140  Bit Score: 42.22  E-value: 6.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyedGQ---------WKKHWFVLADQSLRYYrDSVAEEAADLDGEINLSTCYDV----TEYPVQRNYGFQIHT 573
Cdd:cd01238      1 LEGLLVKR---SQgkkrfgpvnYKERWFVLTKSSLSYY-EGDGEKRGKEKGSIDLSKVRCVeevkDEAFFERKYPFQVVY 76
                           90       100
                   ....*....|....*....|..
gi 1039737300  574 KEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd01238     77 DDYTLYVFAPSEEDRDEWIAAL 98
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1868-2255 6.80e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 45.17  E-value: 6.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1868 EAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKEN------TELKAkVSQMDHQQRCLQEAENKHSESMFAL 1941
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqmlAEEKA-ISARYAEERDRAEAEAREKETRALS 640
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1942 QGRYEEEIRCMVEQLSHTENTLQAERSRVLSQLDASVKD-------RQAMEQhHVQQMKM----LEDrfqlkvrELQAVH 2010
Cdd:pfam01576  641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNvhelersKRALEQ-QVEEMKTqleeLED-------ELQATE 712
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2011 QEELR------ALQEHYIWSLRgalslyqpshpdsslapgpsepravpaAKDEA--ESMSGLRERIQELEAQMGVMREEl 2082
Cdd:pfam01576  713 DAKLRlevnmqALKAQFERDLQ---------------------------ARDEQgeEKRRQLVKQVRELEAELEDERKQ- 764
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2083 ghkelEGDVAALQEKYQRDFESLKATCERGFAAMEET--HQKKIEDLQRQHQRELEKLREEKDRLLA--EETAATISAIE 2158
Cdd:pfam01576  765 -----RAQAVAAKKKLELDLKELEAQIDAANKGREEAvkQLKKLQAQMKDLQRELEEARASRDEILAqsKESEKKLKNLE 839
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2159 A-----------------MKNAHREEMERELEK--SQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLEN 2219
Cdd:pfam01576  840 AellqlqedlaaserarrQAQQERDELADEIASgaSGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQV 919
                          410       420       430
                   ....*....|....*....|....*....|....*.
gi 1039737300 2220 AHLAQALEAERQALRQCQRENQELNAHNQELNNRLA 2255
Cdd:pfam01576  920 EQLTTELAAERSTSQKSESARQQLERQNKELKAKLQ 955
PRK09039 PRK09039
peptidoglycan -binding protein;
2115-2267 7.05e-04

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 44.19  E-value: 7.05e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2115 AMEETHQKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISAIEAMKNAHREEM--ERELEKSQRSQISSINSDIEAL 2190
Cdd:PRK09039    70 SLERQGNQDLQDSVANLRASLSAAEAERSRLqaLLAELAGAGAAAEGRAGELAQELdsEKQVSARALAQVELLNQQIAAL 149
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2191 RRQyleeLQSVQRELEVlSEQYSQkclenahlaqalEAERQALRQCQRENQELNAHNQELnNRLAAE-ITRLRTLLTG 2267
Cdd:PRK09039   150 RRQ----LAALEAALDA-SEKRDR------------ESQAKIADLGRRLNVALAQRVQEL-NRYRSEfFGRLREILGD 209
PH_Gab2_2 cd13384
Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily ...
506-595 7.61e-04

Grb2-associated binding protein family pleckstrin homology (PH) domain; The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. Members here include insect, nematodes, and crustacean Gab2s. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 241535  Cd Length: 115  Bit Score: 41.27  E-value: 7.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTK-----QYEDGQWKKHWFVLADQS------LRYYRDsvaEEAADLDGEINLSTCYDV-----TEYPVQRNYG- 568
Cdd:cd13384      4 VYEGWLTKsppekRIWRAKWRRRYFVLRQSEipgqyfLEYYTD---RTCRKLKGSIDLDQCEQVdagltFETKNKLKDQh 80
                           90       100
                   ....*....|....*....|....*...
gi 1039737300  569 -FQIHTKEGEFTLSAMTSGIRRNWIQTI 595
Cdd:cd13384     81 iFDIRTPKRTYYLVADTEDEMNKWVNCI 108
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
794-963 8.19e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 44.37  E-value: 8.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  794 SLLEKELEQSQKEAS----DLLEQNRLLQDQLRVALGREQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEAQCRR 869
Cdd:COG4942     79 AALEAELAELEKEIAelraELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRAD 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  870 QELITQQIQTLKhsygEAKDAIRHHEAEIQTLQTRLgnaAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSG 949
Cdd:COG4942    159 LAELAALRAELE----AERAELEALLAELEEERAAL---EALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231
                          170
                   ....*....|....
gi 1039737300  950 QLRASEQKLRSTEA 963
Cdd:COG4942    232 LEAEAAAAAERTPA 245
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2174-2381 8.21e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 44.44  E-value: 8.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2174 KSQRSQISSINSDIEALRrqylEELQSVQRELEVLSEQYSQKcleNAHLAQALEAERQALRQCQRENQELNAHNQELNNR 2253
Cdd:COG3883     19 QAKQKELSELQAELEAAQ----AELDALQAELEELNEEYNEL---QAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2254 LAA------EITRLRTLLTGDGGGE--------STGLPLTQG--KDAYELEVLLRVKESEIQYLKQEISSLKDELQTALR 2317
Cdd:COG3883     92 ARAlyrsggSVSYLDVLLGSESFSDfldrlsalSKIADADADllEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 2318 DKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKSPEGTTVSGYDIMKSKSNPD 2381
Cdd:COG3883    172 ELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAA 235
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2058-2258 8.22e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.91  E-value: 8.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMR--------------EELGHKELEGDVAALQEKYQR------DFESLK---ATCERGFA 2114
Cdd:COG4913    623 EEELAEAEERLEALEAELDALQerrealqrlaeyswDEIDVASAEREIAELEAELERldassdDLAALEeqlEELEAELE 702
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2115 AMEEtHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAI------------EAMKNAHREEMERELEKSQ---RSQ 2179
Cdd:COG4913    703 ELEE-ELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARlelralleerfaAALGDAVERELRENLEERIdalRAR 781
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2180 ISSINSDIEALRRQYLEE----LQSVQRELEVLSEqYSQKC--LENAHLAQALEAERQALRQCQRENQElnahnqELNNR 2253
Cdd:COG4913    782 LNRAEEELERAMRAFNREwpaeTADLDADLESLPE-YLALLdrLEEDGLPEYEERFKELLNENSIEFVA------DLLSK 854

                   ....*
gi 1039737300 2254 LAAEI 2258
Cdd:COG4913    855 LRRAI 859
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1850-2243 1.01e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 44.54  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1850 RIRLEYEKELRFYKKACQEAKGASGQKRAQAvgALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRcLQE 1929
Cdd:COG1196    380 ELEELAEELLEALRAAAELAAQLEELEEAEE--ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAE-LEE 456
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1930 AENKHSESMFALQGRYEEEIrcmvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAV 2009
Cdd:COG1196    457 EEEALLELLAELLEEAALLE----AALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGV 532
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2010 HQEELRALQEhyiwsLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREElGHKELEG 2089
Cdd:COG1196    533 EAAYEAALEA-----ALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGA-AVDLVAS 606
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2090 DVAALQEKYQRDFESL------KATCERGFAAMEETHQKKIE---DLQRQHQRELEKLREEKDRLLAEETAATISAIEAM 2160
Cdd:COG1196    607 DLREADARYYVLGDTLlgrtlvAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAE 686
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2161 KNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAE----------R 2230
Cdd:COG1196    687 RLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEElpeppdleelE 766
                          410
                   ....*....|...
gi 1039737300 2231 QALRQCQRENQEL 2243
Cdd:COG1196    767 RELERLEREIEAL 779
COG5022 COG5022
Myosin heavy chain [General function prediction only];
856-1322 1.01e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 44.68  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  856 NQDLQSELEAQCRRQELITQQI----QTLKHSYGEAkdAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKmEQG 931
Cdd:COG5022    819 IIKLQKTIKREKKLRETEEVEFslkaEVLIQKFGRS--LKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVK-SIS 895
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  932 KVREQLEEWQHSKAMLSGQLRASEQ---KLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseq 1008
Cdd:COG5022    896 SLKLVNLELESEIIELKKSLSSDLIenlEFKTELIARLKKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS----- 970
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1009 aqrlmekklkrnytlllesceQEKQALLqnlkeveDKASAYEDQLQGHVQQVEALQKEkLSETCKGSEQVHKLEEELEAR 1088
Cdd:COG5022    971 ---------------------EEYEDLL-------KKSTILVREGNKANSELKNFKKE-LAELSKQYGALQESTKQLKEL 1021
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1089 EASIRQLAQHVQSLHDERDlIKHQFQELMERVATSDGDVAELQEKLrgKEVDYQN-LEHSHHRVSVQLQSVRTLlrEKEE 1167
Cdd:COG5022   1022 PVEVAELQSASKIISSEST-ELSILKPLQKLKGLLLLENNQLQARY--KALKLRReNSLLDDKQLYQLESTENL--LKTI 1096
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1168 ELKHIKETHERVLEKKdqdlnEALVKMIALGSSLEEteikLQEKEECLRRFVSDSPkDAKEPLSTTEPTEEGSGILPLGS 1247
Cdd:COG5022   1097 NVKDLEVTNRNLVKPA-----NVLQFIVAQMIKLNL----LQEISKFLSQLVNTLE-PVFQKLSVLQLELDGLFWEANLE 1166
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1248 VTRVFPGFpHSQPEDEDPSAGLGEEGSSGS-------------LSREENTILPKSADMPE--REG-HLQSTSKSDPGapI 1311
Cdd:COG5022   1167 ALPSPPPF-AALSEKRLYQSALYDEKSKLSssevndlkneliaLFSKIFSGWPRGDKLKKliSEGwVPTEYSTSLKG--F 1243
                          490
                   ....*....|.
gi 1039737300 1312 KRPRIRFSTIQ 1322
Cdd:COG5022   1244 NNLNKKFDTPA 1254
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2058-2243 1.05e-03

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 44.65  E-value: 1.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMREELGHKELEGDVAALQekyQRDFESLKATCERgfaAMEETHQKkiedlQRQHQRELEK 2137
Cdd:PRK02224   278 AEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEAR---REELEDRDEELRD---RLEECRVA-----AQAHNEEAES 346
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKDRLlaEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQY---LEELQSVQRELEVLSEQysq 2214
Cdd:PRK02224   347 LREDADDL--EERAEELREEAAELESELEEAREAVED-RREEIEELEEEIEELRERFgdaPVDLGNAEDFLEELREE--- 420
                          170       180       190
                   ....*....|....*....|....*....|
gi 1039737300 2215 kcLENAHLAQA-LEAERQALRQCQRENQEL 2243
Cdd:PRK02224   421 --RDELREREAeLEATLRTARERVEEAEAL 448
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
796-1059 1.22e-03

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 44.41  E-value: 1.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRvalgREQSAREGYVLQTEVATSpsgAWQRL-------HRVNQDLQSELEAQCR 868
Cdd:PRK10246   535 LEKEVKKLGEEGAALRGQLDALTKQLQ----RDESEAQSLRQEEQALTQ---QWQAVcaslnitLQPQDDIQPWLDAQEE 607
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  869 RQELITQ--QIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIK------EQA-LAKLKGELKMEQGKVREQ--L 937
Cdd:PRK10246   608 HERQLRLlsQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTlpqedeEASwLATRQQEAQSWQQRQNELtaL 687
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  938 EEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD----LETQ-QALQRDRQKEVQRLQECIAELSQQLGTSEQAQR- 1011
Cdd:PRK10246   688 QNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHEqclsLHSQlQTLQQQDVLEAQRLQKAQAQFDTALQASVFDDQq 767
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1039737300 1012 ------LMEKKLKRnytlllesCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQ 1059
Cdd:PRK10246   768 aflaalLDEETLTQ--------LEQLKQNLENQRQQAQTLVTQTAQALAQHQQH 813
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
2058-2227 1.24e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 44.24  E-value: 1.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2058 AESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLKATCErgfAAMEETHQKKIEDLQRQHQRELEK 2137
Cdd:COG3206    211 SEEAKLLLQQLSELESQLAEARAEL--AEAEARLAALRAQLGSGPDALPELLQ---SPVIQQLRAQLAELEAELAELSAR 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2138 LREEKDRL--LAEETAATISAIEAMKNAHREEMERELEkSQRSQISSINSDIEALRRQYLE------ELQSVQRELEVLS 2209
Cdd:COG3206    286 YTPNHPDViaLRAQIAALRAQLQQEAQRILASLEAELE-ALQAREASLQAQLAQLEARLAElpeleaELRRLEREVEVAR 364
                          170       180
                   ....*....|....*....|
gi 1039737300 2210 EQYSQ--KCLENAHLAQALE 2227
Cdd:COG3206    365 ELYESllQRLEEARLAEALT 384
mukB PRK04863
chromosome partition protein MukB;
1867-2264 1.27e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 44.18  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1867 QEAKGASGQ-KRAQA-VGALKEEYEE------LLHKQKSEYQKVITLIEKENTELKAKVSqmDHQQRcLQEAENKhsesm 1938
Cdd:PRK04863   341 QTALRQQEKiERYQAdLEELEERLEEqnevveEADEQQEENEARAEAAEEEVDELKSQLA--DYQQA-LDVQQTR----- 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1939 fALQGRyeeeircmveqlshteNTLQA-ERSRVLSQLDA----SVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEE 2013
Cdd:PRK04863   413 -AIQYQ----------------QAVQAlERAKQLCGLPDltadNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQF 475
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2014 LRALQehyiwSLRgalslyqpshpdsSLAPGPSEPRAVPAAKD---EAESMSGLRERIQELEAQmgvmreelgHKELEGD 2090
Cdd:PRK04863   476 EQAYQ-----LVR-------------KIAGEVSRSEAWDVAREllrRLREQRHLAEQLQQLRMR---------LSELEQR 528
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2091 VAAlqekyQRDFESLKATCERGFAAMEEThQKKIEDLQRQHQRELEKLREEKDRLlaeetaatisaieamkNAHREEMER 2170
Cdd:PRK04863   529 LRQ-----QQRAERLLAEFCKRLGKNLDD-EDELEQLQEELEARLESLSESVSEA----------------RERRMALRQ 586
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2171 ELEKsqrsqissINSDIEALRRQYLEELQSvQRELEVLSEQY------SQKCLEnaHLAQALEAERQALR---QCQRENQ 2241
Cdd:PRK04863   587 QLEQ--------LQARIQRLAARAPAWLAA-QDALARLREQSgeefedSQDVTE--YMQQLLERERELTVerdELAARKQ 655
                          410       420
                   ....*....|....*....|...
gi 1039737300 2242 ELNAHNQELNNRLAAEITRLRTL 2264
Cdd:PRK04863   656 ALDEEIERLSQPGGSEDPRLNAL 678
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2121-2264 1.37e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 44.04  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2121 QKKIEDLQRQHQRELEKLREEKDRL--LAEETAATISA--------------IEAMKNAH-REEMER--ELEkSQRSQIS 2181
Cdd:pfam10174  400 QKKIENLQEQLRDKDKQLAGLKERVksLQTDSSNTDTAlttleealsekeriIERLKEQReREDRERleELE-SLKKENK 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2182 SINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQR-ENQELNAHNQELNNRLAAEIT- 2259
Cdd:pfam10174  479 DLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKlENQLKKAHNAEEAVRTNPEINd 558

                   ....*
gi 1039737300 2260 RLRTL 2264
Cdd:pfam10174  559 RIRLL 563
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
850-1004 1.37e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 44.19  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  850 QRLHRVNQDLQSELEA-QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQtRLGNAAAELAIKEQALAKLKGELKM 928
Cdd:TIGR00618  725 NASSSLGSDLAAREDAlNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGA-ELSHLAAEIQFFNRLREEDTHLLKT 803
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  929 EQGKVREQLEewqHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLG 1004
Cdd:TIGR00618  804 LEAEIGQEIP---SDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSD 876
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1875-2315 1.69e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 44.01  E-value: 1.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1875 QKRAQAVGALKEEYEELlHKQKSEYQKVITLIEKENTELKAKV-----SQMDHQQR------CLQEAENKHSESMFALQG 1943
Cdd:pfam01576  352 QKHTQALEELTEQLEQA-KRNKANLEKAKQALESENAELQAELrtlqqAKQDSEHKrkklegQLQELQARLSESERQRAE 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1944 RYEEEIRCMVEQLSHTENTLQAER-----SRVLSQLDASVKDRQAMEQHHVQQMKMLEDRfqlkVRELQavhqEELRALQ 2018
Cdd:pfam01576  431 LAEKLSKLQSELESVSSLLNEAEGkniklSKDVSSLESQLQDTQELLQEETRQKLNLSTR----LRQLE----DERNSLQ 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2019 EHYiwslrgalslyqpshpdsslapgpsepravpaaKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQE-- 2096
Cdd:pfam01576  503 EQL---------------------------------EEEEEAKRNVERQLSTLQAQLSDMKKKL--EEDAGTLEALEEgk 547
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2097 -KYQRDFESLKATCERGFAAMEETH------QKKIEDL------QRQHQRELEKLREEKDRLLAEETAATISAIEAMKNA 2163
Cdd:pfam01576  548 kRLQRELEALTQQLEEKAAAYDKLEktknrlQQELDDLlvdldhQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRA 627
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2164 HREEMERELEksqrsqissinsdiealrrqyleelqsvqreleVLSeqysqkclenahLAQALEAERQALRQCQRENQEL 2243
Cdd:pfam01576  628 EAEAREKETR---------------------------------ALS------------LARALEEALEAKEELERTNKQL 662
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1039737300 2244 NAHNQELNNRLAAeitrlrtlltgdgggestglpltQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTA 2315
Cdd:pfam01576  663 RAEMEDLVSSKDD-----------------------VGKNVHELERSKRALEQQVEEMKTQLEELEDELQAT 711
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2056-2320 1.74e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 43.89  E-value: 1.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2056 DEAESMSGLRERIQELEAQMGVMREELG-----HKELEGDVAALQ------EKYQRDFESLKATcERGFAAME-ETHQKK 2123
Cdd:TIGR02168  162 EEAAGISKYKERRKETERKLERTRENLDrlediLNELERQLKSLErqaekaERYKELKAELREL-ELALLVLRlEELREE 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2124 IEDLQRQhQRELEKLREEKDRLLAEETAAtisaIEAMKNAHREeMERELEKSQRS--QISSINSDIEALRRQYLEELQSV 2201
Cdd:TIGR02168  241 LEELQEE-LKEAEEELEELTAELQELEEK----LEELRLEVSE-LEEEIEELQKElyALANEISRLEQQKQILRERLANL 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2202 QRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRtlltgdgggestglpltqg 2281
Cdd:TIGR02168  315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE------------------- 375
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1039737300 2282 kdayELEVLLRVKESEIQYLKQEISSLKDELQTALRDKK 2320
Cdd:TIGR02168  376 ----ELEEQLETLRSKVAQLELQIASLNNEIERLEARLE 410
PH_RhoGap25-like cd13263
Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; ...
507-597 1.95e-03

Rho GTPase activating protein 25 and related proteins Pleckstrin homology (PH) domain; RhoGAP25 (also called ArhGap25) like other RhoGaps are involved in cell polarity, cell morphology and cytoskeletal organization. They act as GTPase activators for the Rac-type GTPases by converting them to an inactive GDP-bound state and control actin remodeling by inactivating Rac downstream of Rho leading to suppress leading edge protrusion and promotes cell retraction to achieve cellular polarity and are able to suppress RAC1 and CDC42 activity in vitro. Overexpression of these proteins induces cell rounding with partial or complete disruption of actin stress fibers and formation of membrane ruffles, lamellipodia, and filopodia. This hierarchy contains RhoGAP22, RhoGAP24, and RhoGAP25. Members here contain an N-terminal PH domain followed by a RhoGAP domain and either a BAR or TATA Binding Protein (TBP) Associated Factor 4 (TAF4) domain. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270083  Cd Length: 114  Bit Score: 40.06  E-value: 1.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCyDVTEYPVQRN----YGFQIHTKEGE---- 577
Cdd:cd13263      5 KSGWLKKQGSIvKNWQQRWFVLRGDQLYYYKD---EDDTKPQGTIPLPGN-KVKEVPFNPEepgkFLFEIIPGGGGdrmt 80
                           90       100
                   ....*....|....*....|....*
gi 1039737300  578 -----FTLSAMTSGIRRNWIQTIMK 597
Cdd:cd13263     81 snhdsYLLMANSQAEMEEWVKVIRR 105
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
2063-2351 1.97e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 43.79  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2063 GLRERIQELEAQMGVMREElgHKELEGDVAALQE----------KYQRDFESLKATceRGFAAMEETHQKKIEDLQRQHQ 2132
Cdd:COG3096    372 EAAEQLAEAEARLEAAEEE--VDSLKSQLADYQQaldvqqtraiQYQQAVQALEKA--RALCGLPDLTPENAEDYLAAFR 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2133 RELEKLREEkdrLLAEETAATISaiEAMKNAHREEME------------------RELEKSQRSQiSSINSDIEALRRQY 2194
Cdd:COG3096    448 AKEQQATEE---VLELEQKLSVA--DAARRQFEKAYElvckiageversqawqtaRELLRRYRSQ-QALAQRLQQLRAQL 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2195 --LEELQSVQRELEVLSEQYSQ---KCLENA----HLAQALEAERQAL-----------RQCQRENQELNAHNQELNNRL 2254
Cdd:COG3096    522 aeLEQRLRQQQNAERLLEEFCQrigQQLDAAeeleELLAELEAQLEELeeqaaeaveqrSELRQQLEQLRARIKELAARA 601
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2255 AAEIT---RLRTLltgdggGESTGLPLT--QGKDAYeLEVLLRvKESEIQYLKQEISSLKDELQTALRdkkyasdkykdi 2329
Cdd:COG3096    602 PAWLAaqdALERL------REQSGEALAdsQEVTAA-MQQLLE-REREATVERDELAARKQALESQIE------------ 661
                          330       340
                   ....*....|....*....|..
gi 1039737300 2330 ytELSIAKAKADCDISRLKEQL 2351
Cdd:COG3096    662 --RLSQPGGAEDPRLLALAERL 681
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
2059-2235 2.05e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 41.48  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2059 ESMSGLRERIQELEAQMGVMREELgHKELEGDVAALQEKYQRDFESLKATCERGFAAMEETHQKKIEDLQRQHQRELEKL 2138
Cdd:pfam01442    4 DSLDELSTYAEELQEQLGPVAQEL-VDRLEKETEALRERLQKDLEEVRAKLEPYLEELQAKLGQNVEELRQRLEPYTEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2139 ReekdrllaEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLE 2218
Cdd:pfam01442   83 R--------KRLNADAEELQEKLAPYGEELRERLEQNVDALRARLAPYAEELRQKLAERLEELKESLAPYAEEVQAQLSQ 154
                          170
                   ....*....|....*...
gi 1039737300 2219 NA-HLAQALEAERQALRQ 2235
Cdd:pfam01442  155 RLqELREKLEPQAEDLRE 172
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1775-2360 2.28e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.42  E-value: 2.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1775 QDELAQQLREKASILEEISAALPVLPPTEPLGgcqrLLRMSQHLSYESCLEGLGQYSSLLVQDAIIQAQVCYAACRIRLE 1854
Cdd:TIGR00618  151 QGEFAQFLKAKSKEKKELLMNLFPLDQYTQLA----LMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEK 226
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1855 YEKELRFYKKACQEAKGASGQKRAQAVGALKEEYE-ELLHKQKSEYQKVITLIEKENTELK------------AKVSQMD 1921
Cdd:TIGR00618  227 ELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLlKQLRARIEELRAQEAVLEETQERINrarkaaplaahiKAVTQIE 306
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1922 HQ-QRCLQEAENKHSESMFALQGRYE-EEIRCMVEQLSHTENTLQAERSRVLSQLDA-----SVKDRQAMEQHHV----Q 1990
Cdd:TIGR00618  307 QQaQRIHTELQSKMRSRAKLLMKRAAhVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVatsirEISCQQHTLTQHIhtlqQ 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1991 QMKMLEDRFQLKVRELQAVHQE---------ELRALQEHYIwSLRGALSLYQPSHPDSSLAPGPSEPRAVPAAKDEAESM 2061
Cdd:TIGR00618  387 QKTTLTQKLQSLCKELDILQREqatidtrtsAFRDLQGQLA-HAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESA 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2062 SGLRERIQELEAQMGVMREELGHKELEGDVAALQEKYQRDFES--LKATCERGFAAMEETHQKK---IEDLQRQHQRELE 2136
Cdd:TIGR00618  466 QSLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGscIHPNPARQDIDNPGPLTRRmqrGEQTYAQLETSEE 545
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2137 KLREEKDRLLaeETAATISAieamknahREEMERELEKSQRSQISSINSDIEALRRqyleELQSVQRELEVLSEQYSQKC 2216
Cdd:TIGR00618  546 DVYHQLTSER--KQRASLKE--------QMQEIQQSFSILTQCDNRSKEDIPNLQN----ITVRLQDLTEKLSEAEDMLA 611
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2217 LEN-AHL-----AQALEAERQALRQCQRENQELNAHNQELNNRLAAEITRLRTLLTgdgggestglplTQGKDAYELEVL 2290
Cdd:TIGR00618  612 CEQhALLrklqpEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSI------------RVLPKELLASRQ 679
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2291 LrvKESEIQYLKQEISSLKDEL---QTALRDKKYASDKYKDIYTELSIAKAKAdcdISRLKEQLKAATEALGE 2360
Cdd:TIGR00618  680 L--ALQKMQSEKEQLTYWKEMLaqcQTLLRELETHIEEYDREFNEIENASSSL---GSDLAAREDALNQSLKE 747
PRK11281 PRK11281
mechanosensitive channel MscK;
857-1065 2.46e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 43.36  E-value: 2.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  857 QDLQSELEAQCRRQELITQQ---IQTLKHSYgEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKG--------- 924
Cdd:PRK11281    39 ADVQAQLDALNKQKLLEAEDklvQQDLEQTL-ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDdndeetret 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  925 -------ELKMEQGKVREQLEEWQHSKAMLSGQL--------RASEQkLRSTEARLLEKTQELRDLETQQALQRDRQKev 989
Cdd:PRK11281   118 lstlslrQLESRLAQTLDQLQNAQNDLAEYNSQLvslqtqpeRAQAA-LYANSQRLQQIRNLLKGGKVGGKALRPSQR-- 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  990 QRLQECIAELSQQ-------LGTSEQAQRLMEKklKRNYTLLLESCEQEKQALLQNLkeVEDKasaYEDQLQGHVQQVEA 1062
Cdd:PRK11281   195 VLLQAEQALLNAQndlqrksLEGNTQLQDLLQK--QRDYLTARIQRLEHQLQLLQEA--INSK---RLTLSEKTVQEAQS 267

                   ...
gi 1039737300 1063 LQK 1065
Cdd:PRK11281   268 QDE 270
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
2067-2211 2.89e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.83  E-value: 2.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2067 RIQELEAQMGVMREELghKELEGDVAALQ---EKYQRDFESLKATCERgFAAMEETHQKKIEDLQRQH-----QRELEKL 2138
Cdd:COG1579     18 ELDRLEHRLKELPAEL--AELEDELAALEarlEAAKTELEDLEKEIKR-LELEIEEVEARIKKYEEQLgnvrnNKEYEAL 94
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 2139 REEKDRLLAEETAATISAIEAMKNahREEMERELEKSQrSQISSINSDIEALRRQYLEELQSVQRELEVLSEQ 2211
Cdd:COG1579     95 QKEIESLKRRISDLEDEILELMER--IEELEEELAELE-AELAELEAELEEKKAELDEELAELEAELEELEAE 164
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
950-1120 2.94e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.98  E-value: 2.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  950 QLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTSEQAQRLMEKK--LKRNYTLL--- 1024
Cdd:COG4913    611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELerLDASSDDLaal 690
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1025 ---LESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQS 1101
Cdd:COG4913    691 eeqLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELREN 770
                          170
                   ....*....|....*....
gi 1039737300 1102 LHDERDLIKHQFQELMERV 1120
Cdd:COG4913    771 LEERIDALRARLNRAEEEL 789
PRK09039 PRK09039
peptidoglycan -binding protein;
897-1039 3.23e-03

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 42.26  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  897 EIQTLQTRLGNAAAELAIKEQALA---KLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELr 973
Cdd:PRK09039    47 EISGKDSALDRLNSQIAELADLLSlerQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQEL- 125
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1039737300  974 DLETQQALQRDRQkeVQRLQECIAELSQQLGTSEQAQRLMEKKlkrnytlllescEQEKQALLQNL 1039
Cdd:PRK09039   126 DSEKQVSARALAQ--VELLNQQIAALRRQLAALEAALDASEKR------------DRESQAKIADL 177
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
895-1213 3.34e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.20  E-value: 3.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  895 EAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRD 974
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  975 LETQQALQRDR----QKEVQRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLEScEQEKQALLQNLKEVEDKASAYE 1050
Cdd:COG4372     92 AQAELAQAQEEleslQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER-EEELKELEEQLESLQEELAALE 170
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1051 DQLQGHVQQVEALQKEKLSETCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAEL 1130
Cdd:COG4372    171 QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEE 250
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1131 QEKLRGKEVDYQNLEHSHHRVSVQLQSVRTLLREKEEELKHIKETHERVLEKKDQDLNEALVKMIALGSSLEETEIKLQE 1210
Cdd:COG4372    251 LLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLEL 330

                   ...
gi 1039737300 1211 KEE 1213
Cdd:COG4372    331 ALA 333
PH_DAPP1 cd10573
Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; ...
69-145 3.85e-03

Dual Adaptor for Phosphotyrosine and 3-Phosphoinositides Pleckstrin homology (PH) domain; DAPP1 (also known as PHISH/3' phosphoinositide-interacting SH2 domain-containing protein or Bam32) plays a role in B-cell activation and has potential roles in T-cell and mast cell function. DAPP1 promotes B cell receptor (BCR) induced activation of Rho GTPases Rac1 and Cdc42, which feed into mitogen-activated protein kinases (MAPK) activation pathways and affect cytoskeletal rearrangement. DAPP1can also regulate BCR-induced activation of extracellular signal-regulated kinase (ERK), and c-jun NH2-terminal kinase (JNK). DAPP1 contains an N-terminal SH2 domain and a C-terminal pleckstrin homology (PH) domain with a single tyrosine phosphorylation site located centrally. DAPP1 binds strongly to both PtdIns(3,4,5)P3 and PtdIns(3,4)P2. The PH domain is essential for plasma membrane recruitment of PI3K upon cell activation. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269977 [Multi-domain]  Cd Length: 96  Bit Score: 38.84  E-value: 3.85e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039737300   69 WQRRFFILYEHgLLRYALDEMPTTlPQGTINMNQCTDVvDGEARTGQKFSLCILTPDKEHFIRAETKEIISGWLEML 145
Cdd:cd10573     19 WKTRWFVLRRN-ELKYFKTRGDTK-PIRVLDLRECSSV-QRDYSQGKVNCFCLVFPERTFYMYANTEEEADEWVKLL 92
PTZ00121 PTZ00121
MAEBL; Provisional
910-1187 3.89e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 42.82  E-value: 3.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  910 AELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAMlsgQLRASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEV 989
Cdd:PTZ00121  1542 AEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM---ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  990 QRLQECIAELSQQLGTSEQAQRLMEKKLKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQLQGHVQQVEALQKEKLS 1069
Cdd:PTZ00121  1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEA 1698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1070 ETCKGSEQVHKLEEELEAREASIRQlaqhvqslHDERDLIKhqfQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHH 1149
Cdd:PTZ00121  1699 EEAKKAEELKKKEAEEKKKAEELKK--------AEEENKIK---AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEE 1767
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1039737300 1150 RVSVQLQSVRTLLreKEEELKHIKETHERVLEKKDQDL 1187
Cdd:PTZ00121  1768 KKAEEIRKEKEAV--IEEELDEEDEKRRMEVDKKIKDI 1803
DUF4618 pfam15397
Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins ...
2054-2199 3.91e-03

Domain of unknown function (DUF4618); This family of proteins is found in eukaryotes. Proteins in this family are typically between 238 and 363 amino acids in length. There are two conserved sequence motifs: EYP and KCTPD.


Pssm-ID: 464704 [Multi-domain]  Cd Length: 258  Bit Score: 41.48  E-value: 3.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2054 AKDEAESMSGLRERIQELEAQMGVMREELG----HKELEGDVAALQ-EKYQRDFESLKatcergfaameETHQKKIEDLQ 2128
Cdd:pfam15397   76 EEKEESKLNKLEQQLEQLNAKIQKTQEELNflstYKDKEYPVKAVQiANLVRQLQQLK-----------DSQQDELDELE 144
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2129 RQHQRELEKL----REEKDRLL---AEETAATISAIEAMKNAHREEMERELEKsQRSQISSINSDIEALRRQyLEELQ 2199
Cdd:pfam15397  145 EMRRMVLESLsrkiQKKKEKILsslAEKTLSPYQESLLQKTRDNQVMLKEIEQ-FREFIDELEEEIPKLKAE-VQQLQ 220
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
796-1070 4.06e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 42.63  E-value: 4.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGYVLQTEVA--TSPSGAWQRlhrvnqdlQSELEAQCRRQELI 873
Cdd:COG3096    439 AEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYELVCKIAgeVERSQAWQT--------ARELLRRYRSQQAL 510
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  874 TQQIQTLKHSYGEA-KDAIRHHEAEiqtlqtrlgnaaaelaikeqalaKLKGELKMEQGKVREQLEEWQHSKAMLSGQLR 952
Cdd:COG3096    511 AQRLQQLRAQLAELeQRLRQQQNAE-----------------------RLLEEFCQRIGQQLDAAEELEELLAELEAQLE 567
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  953 ASEQKLRSTEARLLEKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLGTS--------EQAQRLMEKklKRNYTLL 1024
Cdd:COG3096    568 ELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALERLREQSGEAladsqevtAAMQQLLER--EREATVE 645
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1039737300 1025 LESCEQEKQALLQNLKEVEDKASAYEDQLqghVQQVEALQKEKLSE 1070
Cdd:COG3096    646 RDELAARKQALESQIERLSQPGGAEDPRL---LALAERLGGVLLSE 688
PH_ACAP cd13250
ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP ...
507-595 4.21e-03

ArfGAP with coiled-coil, ankyrin repeat and PH domains Pleckstrin homology (PH) domain; ACAP (also called centaurin beta) functions both as a Rab35 effector and as an Arf6-GTPase-activating protein (GAP) by which it controls actin remodeling and membrane trafficking. ACAP contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain, a phospholipid-binding domain, a PH domain, a GAP domain, and four ankyrin repeats. The AZAPs constitute a family of Arf GAPs that are characterized by an NH2-terminal pleckstrin homology (PH) domain and a central Arf GAP domain followed by two or more ankyrin repeats. On the basis of sequence and domain organization, the AZAP family is further subdivided into four subfamilies: 1) the ACAPs contain an NH2-terminal bin/amphiphysin/Rvs (BAR) domain (a phospholipid-binding domain that is thought to sense membrane curvature), a single PH domain followed by the GAP domain, and four ankyrin repeats; 2) the ASAPs also contain an NH2-terminal BAR domain, the tandem PH domain/GAP domain, three ankyrin repeats, two proline-rich regions, and a COOH-terminal Src homology 3 domain; 3) the AGAPs contain an NH2-terminal GTPase-like domain (GLD), a split PH domain, and the GAP domain followed by four ankyrin repeats; and 4) the ARAPs contain both an Arf GAP domain and a Rho GAP domain, as well as an NH2-terminal sterile-a motif (SAM), a proline-rich region, a GTPase-binding domain, and five PH domains. PMID 18003747 and 19055940 Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270070  Cd Length: 98  Bit Score: 38.74  E-value: 4.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYED--GQWKKHWFVLADQSLRYYRDSVAEEAADLdgEINLSTCYDVTEYPVQRNYGFQIHTKEGEFTLSAMT 584
Cdd:cd13250      1 KEGYLFKRSSNafKTWKRRWFSLQNGQLYYQKRDKKDEPTVM--VEDLRLCTVKPTEDSDRRFCFEVISPTKSYMLQAES 78
                           90
                   ....*....|.
gi 1039737300  585 SGIRRNWIQTI 595
Cdd:cd13250     79 EEDRQAWIQAI 89
PH_TAAP2-like cd13255
Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 ...
507-595 4.35e-03

Tandem PH-domain-containing protein 2 Pleckstrin homology (PH) domain; The binding of TAPP2 (also called PLEKHA2) adaptors to PtdIns(3,4)P(2), but not PI(3,4, 5)P3, function as negative regulators of insulin and PI3K signalling pathways (i.e. TAPP/utrophin/syntrophin complex). TAPP2 contains two sequential PH domains in which the C-terminal PH domain specifically binds PtdIns(3,4)P2 with high affinity. The N-terminal PH domain does not interact with any phosphoinositide tested. They also contain a C-terminal PDZ-binding motif that interacts with several PDZ-binding proteins, including PTPN13 (known previously as PTPL1 or FAP-1) as well as the scaffolding proteins MUPP1 (multiple PDZ-domain-containing protein 1), syntrophin and utrophin. The members here are most sequence similar to TAPP2 proteins, but may not be actual TAPP2 proteins. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270075  Cd Length: 110  Bit Score: 38.93  E-value: 4.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQYEDGQ-WKKHWFVLADQSLRYYRDSVAEEAADLdgeINLSTCYDVTEYPVQRN-YGFQIHTKEGEFTLSAMT 584
Cdd:cd13255      8 KAGYLEKKGERRKtWKKRWFVLRPTKLAYYKNDKEYRLLRL---IDLTDIHTCTEVQLKKHdNTFGIVTPARTFYVQADS 84
                           90
                   ....*....|.
gi 1039737300  585 SGIRRNWIQTI 595
Cdd:cd13255     85 KAEMESWISAI 95
PH_DOCK-D cd13267
Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also ...
506-597 4.64e-03

Dedicator of cytokinesis-D subfamily Pleckstrin homology (PH) domain; DOCK-D subfamily (also called Zizimin subfamily) consists of Dock9/Zizimin1, Dock10/Zizimin3, and Dock11/Zizimin2. DOCK-D has a N-terminal DUF3398 domain, a PH-like domain, a Dock Homology Region 1, DHR1 (also called CZH1), a C2 domain, and a C-terminal DHR2 domain (also called CZH2). Zizimin1 is enriched in the brain, lung, and kidney; zizimin2 is found in B and T lymphocytes, and zizimin3 is enriched in brain, lung, spleen and thymus. Zizimin1 functions in autoinhibition and membrane targeting. Zizimin2 is an immune-related and age-regulated guanine nucleotide exchange factor, which facilitates filopodial formation through activation of Cdc42, which results in activation of cell migration. No function has been determined for Zizimin3 to date. The N-terminal half of zizimin1 binds to the GEF domain through three distinct areas, including CZH1, to inhibit the interaction with Cdc42. In addition its PH domain binds phosphoinositides and mediates zizimin1 membrane targeting. DOCK is a family of proteins involved in intracellular signalling networks. They act as guanine nucleotide exchange factors for small G proteins of the Rho family, such as Rac and Cdc42. There are 4 subfamilies of DOCK family proteins based on their sequence homology: A-D. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270087  Cd Length: 126  Bit Score: 39.23  E-value: 4.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQYEDGQ----------WKKHWFVL---ADQS--LRYYRDsvaEEAADLDGEINLSTCYDVTEYPVQRNYGFQ 570
Cdd:cd13267      7 TKEGYLYKGPENSSdsfislamksFKRRFFHLkqlVDGSyiLEFYKD---EKKKEAKGTIFLDSCTGVVQNSKRRKFCFE 83
                           90       100
                   ....*....|....*....|....*...
gi 1039737300  571 IHTKEGE-FTLSAMTSGIRRNWIQTIMK 597
Cdd:cd13267     84 LRMQDKKsYVLAAESEAEMDEWISKLNK 111
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
887-1068 4.86e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 42.05  E-value: 4.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  887 AKDAIRHHEAEIQTLQTRlGNAAAELAIKEQALAKLKGELKmeqgkVREQLEEwqhskamlsgQLRASEQKLRSTEARLL 966
Cdd:pfam09731  292 AHREIDQLSKKLAELKKR-EEKHIERALEKQKEELDKLAEE-----LSARLEE----------VRAADEAQLRLEFERER 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  967 EKTQELRDLETQQALQRDRQKEVQRLQECIAELSQQLgtseqaQRLMEKKLKrnytlllESCEQEKQALLQNLKEVEDKA 1046
Cdd:pfam09731  356 EEIRESYEEKLRTELERQAEAHEEHLKDVLVEQEIEL------QREFLQDIK-------EKVEEERAGRLLKLNELLANL 422
                          170       180
                   ....*....|....*....|...
gi 1039737300 1047 SAYEDQLQGHVQQV-EALQKEKL 1068
Cdd:pfam09731  423 KGLEKATSSHSEVEdENRKAQQL 445
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
796-961 5.17e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.21  E-value: 5.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVALGREQSAREGY---VLQTEVAtspsgAWQ-RLHRVNQD------LQSELEA 865
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIdvaSAEREIA-----ELEaELERLDASsddlaaLEEQLEE 696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  866 QCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKA 945
Cdd:COG4913    697 LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID 776
                          170
                   ....*....|....*.
gi 1039737300  946 MLSGQLRASEQKLRST 961
Cdd:COG4913    777 ALRARLNRAEEELERA 792
PH_KIFIA_KIFIB cd01233
KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA ...
507-595 5.23e-03

KIFIA and KIFIB protein pleckstrin homology (PH) domain; The kinesin-3 family motors KIFIA (Caenorhabditis elegans homolog unc-104) and KIFIB transport synaptic vesicle precursors that contain synaptic vesicle proteins, such as synaptophysin, synaptotagmin and the small GTPase RAB3A, but they do not transport organelles that contain plasma membrane proteins. They have a N-terminal motor domain, followed by a coiled-coil domain, and a C-terminal PH domain. KIF1A adopts a monomeric form in vitro, but acts as a processive dimer in vivo. KIF1B has alternatively spliced isoforms distinguished by the presence or absence of insertion sequences in the conserved amino-terminal region of the protein; this results in their different motor activities. KIF1A and KIF1B bind to RAB3 proteins through the adaptor protein mitogen-activated protein kinase (MAPK) -activating death domain (MADD; also calledDENN), which was first identified as a RAB3 guanine nucleotide exchange factor (GEF). PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 269939  Cd Length: 103  Bit Score: 38.73  E-value: 5.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  507 KKGWLTKQyEDG--QWKKHWFVLADQSLRYYRDSvaeeaADLD--GEINLSTC---YDV-TEYPVQRNYGFQIHTKEGEF 578
Cdd:cd01233      8 KRGYLLFL-EDAtdGWVRRWVVLRRPYLHIYSSE-----KDGDerGVINLSTArveYSPdQEALLGRPNVFAVYTPTNSY 81
                           90
                   ....*....|....*..
gi 1039737300  579 TLSAMTSGIRRNWIQTI 595
Cdd:cd01233     82 LLQARSEKEMQDWLYAI 98
PH1_PH_fungal cd13298
Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal ...
506-595 5.41e-03

Fungal proteins Pleckstrin homology (PH) domain, repeat 1; The functions of these fungal proteins are unknown, but they all contain 2 PH domains. This cd represents the first PH repeat. PH domains have diverse functions, but in general are involved in targeting proteins to the appropriate cellular location or in the interaction with a binding partner. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. Less than 10% of PH domains bind phosphoinositide phosphates (PIPs) with high affinity and specificity. PH domains are distinguished from other PIP-binding domains by their specific high-affinity binding to PIPs with two vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or PtdIns(3,4,5)P3 which results in targeting some PH domain proteins to the plasma membrane. A few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.


Pssm-ID: 270110  Cd Length: 106  Bit Score: 38.76  E-value: 5.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  506 FKKGWLTKQYED-GQWKKHWFVLADQSLRYYRDsvaEEAADLDGEINLSTCYDVTEYPV-QRNYGFQIHTKEGEFTLSAM 583
Cdd:cd13298      7 LKSGYLLKRSRKtKNWKKRWVVLRPCQLSYYKD---EKEYKLRRVINLSELLAVAPLKDkKRKNVFGIYTPSKNLHFRAT 83
                           90
                   ....*....|..
gi 1039737300  584 TSGIRRNWIQTI 595
Cdd:cd13298     84 SEKDANEWVEAL 95
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
796-971 5.69e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 41.74  E-value: 5.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQ---DQLRVALGR------EQSAREGYVLQTEVATSPSGAWQRLHRVNQDLQSELEaq 866
Cdd:COG3883     56 LQAELEALQAEIDKLQAEIAEAEaeiEERREELGEraralyRSGGSVSYLDVLLGSESFSDFLDRLSALSKIADADAD-- 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  867 crrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEWQHSKAM 946
Cdd:COG3883    134 ------LLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAA 207
                          170       180
                   ....*....|....*....|....*
gi 1039737300  947 LSGQLRASEQKLRSTEARLLEKTQE 971
Cdd:COG3883    208 AEAAAAAAAAAAAAAAAAAAAAAAA 232
Yuri_gagarin pfam15934
Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.
754-982 6.57e-03

Yuri gagarin; The yuri gagarin protein found in Drosophila, it plays roles in spermatogenesis.


Pssm-ID: 318204 [Multi-domain]  Cd Length: 234  Bit Score: 40.71  E-value: 6.57e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  754 EQRWHQVETTPLREEKQVPIAPLHLSLEDRSERLstheltslleKELEQSQKEASDLLEQNRLlqdqlrvalgreqsARE 833
Cdd:pfam15934   45 EQEQQLKEFTVQNQRLACQIDNLHETLKDRDHQI----------KQLQSMITGYSDISENNRL--------------KEE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  834 GYVLQTEVATspsgawqrLHRVNQDLQSELEAQCRRQELITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNaaaela 913
Cdd:pfam15934  101 IHDLKQKNCV--------QARVVRKMGLELKGQEEQRVELCDKYESLLGSFEEQCQELKRANRRVQSLQTRLSQ------ 166
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039737300  914 ikeqaLAKLKGELKMEQGKVREQLEEWQHSKAMLSGQLRASEQKLRSTEARLLEKTQELRDLETQQALQ 982
Cdd:pfam15934  167 -----VEKLQEELRTERKILREEVIALKEKDAKSNGRERALQDQLKCCQTEIEKSRTLIRNMQSHLQLE 230
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
2153-2266 6.91e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 40.90  E-value: 6.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2153 TISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQA 2232
Cdd:pfam09787   43 TALTLELEELRQERDLLREEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1039737300 2233 LRQCQRENQELNAHNQELNNRLAAEITRLRTLLT 2266
Cdd:pfam09787  123 LRYLEEELRRSKATLQSRIKDREAEIEKLRNQLT 156
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
991-1203 7.60e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 41.42  E-value: 7.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  991 RLQECIAELSQQLGTSEQAQRLMEKKlKRNYTLLLESCEQEKQALLQNLKEVEDKASAYEDQlqghVQQVEALQKEKLSE 1070
Cdd:pfam07888   35 RLEECLQERAELLQAQEAANRQREKE-KERYKRDREQWERQRRELESRVAELKEELRQSREK----HEELEEKYKELSAS 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1071 TCKGSEQVHKLEEELEAREASIRQLAQHVQSLHDERDLIKHQFQELMERVATSDGDVAELQEKLRGKEVDYQNLEHSHHR 1150
Cdd:pfam07888  110 SEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRS 189
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1039737300 1151 VSVQLQSVRTLLREKEEELKHIKEThervLEKKDQDLNEALVKMIALGSSLEE 1203
Cdd:pfam07888  190 LSKEFQELRNSLAQRDTQVLQLQDT----ITTLTQKLTTAHRKEAENEALLEE 238
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
796-974 8.22e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 40.29  E-value: 8.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  796 LEKELEQSQKEASDLLEQNRLLQDQLRVA------LGREQSAREGYVLQTEvatspsgawQRLHRVNQDLQS-----ELE 864
Cdd:COG1579     22 LEHRLKELPAELAELEDELAALEARLEAAkteledLEKEIKRLELEIEEVE---------ARIKKYEEQLGNvrnnkEYE 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  865 AqcrrqelITQQIQTLKHSYGEAKDAIRHHEAEIQTLQTRLGNAAAELAIKEQALAKLKGELKMEQGKVREQLEEwqhsk 944
Cdd:COG1579     93 A-------LQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEE----- 160
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1039737300  945 amlsgqLRASEQKLRST-EARLLEKTQELRD 974
Cdd:COG1579    161 ------LEAEREELAAKiPPELLALYERIRK 185
TOPEUc smart00435
DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina ...
2049-2145 8.27e-03

DNA Topoisomerase I (eukaryota); DNA Topoisomerase I (eukaryota), DNA topoisomerase V, Vaccina virus topoisomerase, Variola virus topoisomerase, Shope fibroma virus topoisomeras


Pssm-ID: 214661 [Multi-domain]  Cd Length: 391  Bit Score: 41.18  E-value: 8.27e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300  2049 RAVPaaKDEAESMSGLRERIQELEAQMGVMREELGHKELEGDV-AALQEKYQRDFESLKATCERGFAAM--EETHQKKIE 2125
Cdd:smart00435  269 RTVS--KTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLkRKLKSKFERDNEKLDAEVKEKKKEKkkEEKKKKQIE 346
                            90       100
                    ....*....|....*....|
gi 1039737300  2126 DLQRQHQReLEKLREEKDRL 2145
Cdd:smart00435  347 RLEERIEK-LEVQATDKEEN 365
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1861-2362 8.43e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 41.49  E-value: 8.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1861 FYKKACQEAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAENKHSESMFA 1940
Cdd:TIGR00618  186 FAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKREAQEEQLKKQQLLKQL 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1941 LqgRYEEEIRCMVEQLSHTENTLQAERSrvlsqldasvKDRQAMEQHHV----QQMKMLEDRFQLKVREL-QAVHQEELR 2015
Cdd:TIGR00618  266 R--ARIEELRAQEAVLEETQERINRARK----------AAPLAAHIKAVtqieQQAQRIHTELQSKMRSRaKLLMKRAAH 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2016 ALQEHYIWSLRGALSLYQPSHPDSslapgpsepravpaaKDEAESMSGLREriqELEAQmgvmreelghKELEGDVAALQ 2095
Cdd:TIGR00618  334 VKQQSSIEEQRRLLQTLHSQEIHI---------------RDAHEVATSIRE---ISCQQ----------HTLTQHIHTLQ 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2096 EKYQRDFESLKATCERGFAAMEETHQKKIEDLQR----------QHQRELEKLREEKDRLLAEETAAtisaIEAMKNAHR 2165
Cdd:TIGR00618  386 QQKTTLTQKLQSLCKELDILQREQATIDTRTSAFrdlqgqlahaKKQQELQQRYAELCAAAITCTAQ----CEKLEKIHL 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2166 EEM-----ERELEKSQRSQISSINSDIEALRRQYLEELQSVQRELEvlseqysQKCLE-NAHLAQALEAERQALRQCQRE 2239
Cdd:TIGR00618  462 QESaqslkEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLC-------GSCIHpNPARQDIDNPGPLTRRMQRGE 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2240 NqELNAHNQELNN---RLAAEITRLRTLLTGDGGGESTGLPLTQGKDAYELEV-LLRVKESEIQYLKQEISSLKD----E 2311
Cdd:TIGR00618  535 Q-TYAQLETSEEDvyhQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIpNLQNITVRLQDLTEKLSEAEDmlacE 613
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1039737300 2312 LQTALRDKKYASDKYKDIYTELSIAKAKADCDISRLKEQLKAATEALGEKS 2362
Cdd:TIGR00618  614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHA 664
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2054-2266 9.25e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 41.20  E-value: 9.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2054 AKDEAESMSGLRERIQELEAQMGVMREELGH-KELEGDVAALQEKYQRDFESL----KATCERGFAAMEEThQKKIEDLQ 2128
Cdd:PRK03918   520 LEKKAEEYEKLKEKLIKLKGEIKSLKKELEKlEELKKKLAELEKKLDELEEELaellKELEELGFESVEEL-EERLKELE 598
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2129 RQHQRELE------KLREEKDRL-LAEETAATISAIEAMKNAHREEMERELEKSQRsqiSSINSDIEALRRQYLE---EL 2198
Cdd:PRK03918   599 PFYNEYLElkdaekELEREEKELkKLEEELDKAFEELAETEKRLEELRKELEELEK---KYSEEEYEELREEYLElsrEL 675
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1039737300 2199 QSVQRELEVLSEQYSqkclENAHLAQALEAERQALRQCQRENQELNAHNQELnNRLAAEITRLRTLLT 2266
Cdd:PRK03918   676 AGLRAELEELEKRRE----EIKKTLEKLKEELEEREKAKKELEKLEKALERV-EELREKVKKYKALLK 738
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1867-2318 9.26e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 41.46  E-value: 9.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1867 QEAKGASGQKRAQAVGALKEEYEELLHKQKSEYQKVITLIEKENTELKAKVSQMDHQQRCLQEAENKHSESMFALQGRYE 1946
Cdd:COG1196    258 LEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEE 337
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 1947 EEIRCMvEQLSHTENTLQAERSRVLSQLDASVKDRQAMEQHHVQQMKMLEDRFQLKVRELQAVHQEELRALQEHYIWSLR 2026
Cdd:COG1196    338 ELEELE-EELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERL 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2027 GALSLYQPSHPDSSLApgpSEPRAVPAAKDEAESMSGLRERIQELEAQMGVMREELghKELEGDVAALQEKYQRDFESLK 2106
Cdd:COG1196    417 ERLEEELEELEEALAE---LEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELL--EEAALLEAALAELLEELAEAAA 491
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2107 AtcERGFAAMEETHQKKIEDLQRQHQRELEKLREEKDRLLAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSd 2186
Cdd:COG1196    492 R--LLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA- 568
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2187 iEALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQREN---QELNAHNQELNNRLAAEITRLRT 2263
Cdd:COG1196    569 -AKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTllgRTLVAARLEAALRRAVTLAGRLR 647
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1039737300 2264 LLTGDGGGESTGLPLTQGKDAyELEVLLRVKESEIQYLKQEISSLKDELQTALRD 2318
Cdd:COG1196    648 EVTLEGEGGSAGGSLTGGSRR-ELLAALLEAEAELEELAERLAEEELELEEALLA 701
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2120-2318 9.33e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.44  E-value: 9.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2120 HQKKIEDLQRQH-------QRELEKLREEKDRLlAEETAATISAIEAMKNAHREEMERELEKSQRSQISSINSDIEALRR 2192
Cdd:COG4913    590 HEKDDRRRIRSRyvlgfdnRAKLAALEAELAEL-EEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAER 668
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2193 qyleELQSVQRELEVLSEqySQKCLEnaHLAQALEAERQALRQCQRENQELNAHNQELNNRLAA---EITRLRTLLtgDG 2269
Cdd:COG4913    669 ----EIAELEAELERLDA--SSDDLA--ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQaeeELDELQDRL--EA 738
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1039737300 2270 GGESTGLPLTQGKDAYELEVLLRVKESEIQY-LKQEISSLKDELQTALRD 2318
Cdd:COG4913    739 AEDLARLELRALLEERFAAALGDAVERELREnLEERIDALRARLNRAEEE 788
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH