NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907082211|ref|XP_036012688|]
View 

unconventional myosin-XVIIIa isoform X3 [Mus musculus]

Protein Classification

class XVIII myosin( domain architecture ID 12910318)

class XVIII myosin such as human unconventional myosin-XVIIIa that may link Golgi membranes to the cytoskeleton and participate in the tensile force required for vesicle budding from the Golgi, and contains a myosin head/motor domain with ATP- and actin-binding sites but does not have ATPase activity

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
467-1217 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


:

Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 706
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  707 QKTCWLILASIYHLGAAGATKeAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQGPEESGL 786
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATK-AASAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQESPARSS 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  787 GEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHNYAQDRLQ 866
Cdd:cd01386    319 SGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHNYAQERLQ 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  867 RLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGATEDALLDR 944
Cdd:cd01386    399 LLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSSDDTFLER 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  945 LFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnlflgrags 1024
Cdd:cd01386    479 LFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE------------ 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1025 atvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGEPRs 1104
Cdd:cd01386    546 --------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNAGKDERS- 592
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1105 assrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHGRNY 1184
Cdd:cd01386    593 ----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLTKKLGLNS 656
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082211 1185 IVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd01386    657 EVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
COG5022 super family cl34868
Myosin heavy chain [General function prediction only];
429-1943 9.27e-109

Myosin heavy chain [General function prediction only];


The actual alignment was detected with superfamily member COG5022:

Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 383.66  E-value: 9.27e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  429 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 508
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  509 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 586
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  587 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 663
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  664 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLG--AAGATKEaaeaGRKQFA 738
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGniEFKEDRN----GAAIFS 349
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  739 RHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISL 816
Cdd:COG5022    350 DNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFDWIVDR 415
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  817 VNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDL 893
Cdd:COG5022    416 INKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfIDYFDN 489
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  894 EPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKprhFLL 973
Cdd:COG5022    490 QPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK---FVV 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  974 GHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATSMrktf 1053
Cdd:COG5022    551 KHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGSR---- 610
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1054 ttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDV 1133
Cdd:COG5022    611 ---------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW-----------------------------TFDN 646
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1134 SLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSR 1213
Cdd:COG5022    647 QMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQIGNTK 725
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1214 VFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTVRPLIQ 1292
Cdd:COG5022    726 VFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKLQPLLS 804
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1293 VQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAERLRTEK 1368
Cdd:COG5022    805 LLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRVELAER 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1369 EMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELEDKMEVE 1444
Cdd:COG5022    883 QLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELNKLHEV 962
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQREKLQ 1524
Cdd:COG5022    963 ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASKIISSE 1038
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---VKDQEE 1600
Cdd:COG5022   1039 STELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleVTNRNL 1108
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1601 ELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREKRELES 1678
Cdd:COG5022   1109 VKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEKRLYQS 1185
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1679 KL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAAVKark 1751
Cdd:COG5022   1186 ALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPASMS--- 1257
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1752 ameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQAQIEES 1830
Cdd:COG5022   1258 -----NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNSEELDD 1325
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1831 NKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRDQRAAA 1909
Cdd:COG5022   1326 WCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RYDPADK 1386
                         1530      1540      1550
                   ....*....|....*....|....*....|....*
gi 1907082211 1910 ENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1943
Cdd:COG5022   1387 ENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
PDZ_MYO18-like cd06747
PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein ...
220-308 1.31e-51

PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MYO18 protein and related domains. MYO18 (also known as myosin XVIIIA, KIAA0216, MysPDZ), a member of the myosin superfamily, is involved in regulating cell protrusion and migration, and Golgi trafficking and morphology, and is required for myoblast adhesion and muscle integrity. The MYO18A/MRCK/LRAP35a complex regulates actomyosin retrograde flow in cell protrusion and migration; the PtdIns(4)P/GOLPH3/MYO18A/F-actin complex is a hub for signals that regulate Golgi trafficking function. The MYO18A PDZ domain binds p190Rho-guanine nucleotide exchange factor (p190RhoGEF), Golgin45, and leucine repeat adaptor protein 1 (Lurap1, also known as Lrap35a). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MYO18-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta- strand F.


:

Pssm-ID: 467229 [Multi-domain]  Cd Length: 90  Bit Score: 176.73  E-value: 1.31e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMLDRAPE-GQAYRRVVHFAEPGAGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQ 298
Cdd:cd06747      1 EITLKRQPTGDFGFSLRRGTIVERGPDdGQELKRTVHFAEPGAGTKNLATGLLPGDRLIEVNGVNVENASRDEIIEMIRK 80
                           90
                   ....*....|
gi 1907082211  299 SGDSVRLKVQ 308
Cdd:cd06747     81 SGDTVTLKVQ 90
PRK03918 super family cl35229
DNA double-strand break repair ATPase Rad50;
1868-2033 1.09e-04

DNA double-strand break repair ATPase Rad50;


The actual alignment was detected with superfamily member PRK03918:

Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1868 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1947
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1948 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 2027
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082211 2028 KSWLSK 2033
Cdd:PRK03918   327 EERIKE 332
 
Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
467-1217 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 706
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  707 QKTCWLILASIYHLGAAGATKeAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQGPEESGL 786
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATK-AASAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQESPARSS 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  787 GEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHNYAQDRLQ 866
Cdd:cd01386    319 SGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHNYAQERLQ 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  867 RLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGATEDALLDR 944
Cdd:cd01386    399 LLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSSDDTFLER 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  945 LFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnlflgrags 1024
Cdd:cd01386    479 LFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE------------ 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1025 atvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGEPRs 1104
Cdd:cd01386    546 --------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNAGKDERS- 592
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1105 assrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHGRNY 1184
Cdd:cd01386    593 ----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLTKKLGLNS 656
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082211 1185 IVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd01386    657 EVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
MYSc smart00242
Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical ...
448-1229 7.74e-114

Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.


Pssm-ID: 214580 [Multi-domain]  Cd Length: 677  Bit Score: 378.81  E-value: 7.74e-114
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   448 NAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQ 527
Cdd:smart00242    1 NPPKFEGVEDLVLLTYLNEPAVLHNLKKRYLKDLIYTYIGLVLVAVNPYKQLPIYTDEVIKKYRGKSRGELPPHVFAIAD 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   528 TAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFS 605
Cdd:smart00242   81 NAYRNMLNDKENQSIIISGESGAGKTENTKKIMQYLASVSG-SNTEVGSVED-QILESnpILEAFGNAKTLRNNNSSRFG 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   606 QILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFgivpLSKPEEKQK 685
Cdd:smart00242  159 KFIEIHFDAKGKIIGAKIETYLLEKSRVVSQAKGERNYHIFYQLLAGASEELKKELGLKS-PEDYRY----LNQGGCLTV 233
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   686 A----AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAI 761
Cdd:smart00242  234 DgiddAEEFKETLNAMRVLGFSEEEQESIFKILAAILHLGNIEFEEGRNDNAASTVKDKEELSNAAELLGVDPEELEKAL 313
                           330       340       350       360       370       380       390       400
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   762 fkhqlkggtLQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ- 840
Cdd:smart00242  314 ---------TKRKIKTGGEVITKPL---NVEQALDARDALAKALYSRLFDWLVKRINQSLSFKDGSTYFIGVLDIYGFEi 381
                           410       420       430       440       450       460       470       480
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   841 ---NpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVRS- 913
Cdd:smart00242  382 fevN----------SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDwtfIDFFDNQDCID-----------LIEKk 440
                           490       500       510       520       530       540       550       560
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   914 ----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNV 986
Cdd:smart00242  441 ppgiLSLLDEE----------CRFPKGTDQTFLEKLNQHH--------KKHPHFSKPKKKgrtEFIIKHYAGD--VTYDV 500
                           570       580       590       600       610       620       630       640
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   987 AGWLnytKQN--PATQNAPRLLQDSQKKIISNLFlgragsatvlsGSIAGleggsqlalrRATSMRKTFTTGMaavkkks 1064
Cdd:smart00242  501 TGFL---EKNkdTLSDDLIELLQSSKNPLIASLF-----------PSGVS----------NAGSKKRFQTVGS------- 549
                           650       660       670       680       690       700       710       720
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1065 lciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSR 1144
Cdd:smart00242  550 ---QFKEQLNELMDTLNSTNPHFIRCIKPNEEKKPG-----------------------------DFDSSLVLHQLRYLG 597
                           730       740       750       760       770       780       790       800
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1145 LLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFRAGTLARL 1224
Cdd:smart00242  598 VLENIRIRRAGFPYRLPFDEFLQRYRVLLPD-TWPPWGG----DAKKACEALLQSLGLDEDEYQLGKTKVFLRPGQLAEL 672

                    ....*
gi 1907082211  1225 EEQRD 1229
Cdd:smart00242  673 EELRE 677
COG5022 COG5022
Myosin heavy chain [General function prediction only];
429-1943 9.27e-109

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 383.66  E-value: 9.27e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  429 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 508
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  509 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 586
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  587 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 663
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  664 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLG--AAGATKEaaeaGRKQFA 738
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGniEFKEDRN----GAAIFS 349
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  739 RHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISL 816
Cdd:COG5022    350 DNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFDWIVDR 415
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  817 VNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDL 893
Cdd:COG5022    416 INKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfIDYFDN 489
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  894 EPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKprhFLL 973
Cdd:COG5022    490 QPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK---FVV 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  974 GHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATSMrktf 1053
Cdd:COG5022    551 KHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGSR---- 610
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1054 ttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDV 1133
Cdd:COG5022    611 ---------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW-----------------------------TFDN 646
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1134 SLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSR 1213
Cdd:COG5022    647 QMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQIGNTK 725
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1214 VFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTVRPLIQ 1292
Cdd:COG5022    726 VFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKLQPLLS 804
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1293 VQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAERLRTEK 1368
Cdd:COG5022    805 LLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRVELAER 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1369 EMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELEDKMEVE 1444
Cdd:COG5022    883 QLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELNKLHEV 962
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQREKLQ 1524
Cdd:COG5022    963 ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASKIISSE 1038
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---VKDQEE 1600
Cdd:COG5022   1039 STELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleVTNRNL 1108
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1601 ELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREKRELES 1678
Cdd:COG5022   1109 VKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEKRLYQS 1185
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1679 KL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAAVKark 1751
Cdd:COG5022   1186 ALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPASMS--- 1257
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1752 ameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQAQIEES 1830
Cdd:COG5022   1258 -----NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNSEELDD 1325
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1831 NKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRDQRAAA 1909
Cdd:COG5022   1326 WCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RYDPADK 1386
                         1530      1540      1550
                   ....*....|....*....|....*....|....*
gi 1907082211 1910 ENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1943
Cdd:COG5022   1387 ENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
Myosin_head pfam00063
Myosin head (motor domain);
456-1217 1.69e-108

Myosin head (motor domain);


Pssm-ID: 395017 [Multi-domain]  Cd Length: 674  Bit Score: 363.14  E-value: 1.69e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  456 EDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLM 535
Cdd:pfam00063    2 EDMVELSYLNEPSVLHNLKKRYKSDLIYTYSGLVLVAVNPYKQLPIYSEDMIKAYRGKRRGELPPHIFAIADEAYRSMLQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  536 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGT-SGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDF 612
Cdd:pfam00063   82 DKENQSILISGESGAGKTENTKKIMQYLASVSGSgSAGNVGRLEE-QILQSnpILEAFGNAKTVRNNNSSRFGKYIEIQF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  613 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL---------NHLAENNVFGIvplskpeek 683
Cdd:pfam00063  161 DAKGDIVGGKIETYLLEKSRVVYQAEGERNYHIFYQLLAGASAQLKKELRLtnpkdyhylSQSGCYTIDGI--------- 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  684 qKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKqFARHEWAQKAAYLLGCSLEELSSAIFK 763
Cdd:pfam00063  232 -DDSEEFKITDKAMDILGFSDEEQMGIFRIVAAILHLGNIEFKKERNDEQAV-PDDTENLQKAASLLGIDSTELEKALCK 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  764 HQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNP 842
Cdd:pfam00063  310 RRIKTGRETVSKPQ------------NVEQANYARDALAKAIYSRLFDWLVDRINKSLDVKTIEKASFIgVLDIYGFEIF 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  843 EWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVRS-----L 914
Cdd:pfam00063  378 EKN------SFEQLCINYVNEKLQQFFNHHMFKLEQEEYVREGIEwtfIDFGDNQPCID-----------LIEKkplgiL 440
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  915 AHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLN 991
Cdd:pfam00063  441 SLLDEE----------CLFPKATDQTFLDKLYSTF--------SKHPHFQKPRLQgetHFIIKHYAGD--VEYNVEGFLE 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  992 YTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvLSGSIAGLEGGSQLALRRATSMRKtfTTGMaavkkkslciQIKL 1071
Cdd:pfam00063  501 KNK-DPLNDDLVSLLKSSSDPLLAELFPDYE-----TAESAAANESGKSTPKRTKKKRFI--TVGS----------QFKE 562
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1072 QVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRM 1151
Cdd:pfam00063  563 SLGELMKTLNSTNPHYIRCIKPNEKKRAG-----------------------------VFDNSLVLHQLRCNGVLEGIRI 613
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1152 YRQGYPDHMVFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:pfam00063  614 RRAGFPNRITFQEFVQRYRILAPKTWPK-----WKGDAKKGCEAILQSLNLDKEEYQFGKTKIFFR 674
PDZ_MYO18-like cd06747
PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein ...
220-308 1.31e-51

PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MYO18 protein and related domains. MYO18 (also known as myosin XVIIIA, KIAA0216, MysPDZ), a member of the myosin superfamily, is involved in regulating cell protrusion and migration, and Golgi trafficking and morphology, and is required for myoblast adhesion and muscle integrity. The MYO18A/MRCK/LRAP35a complex regulates actomyosin retrograde flow in cell protrusion and migration; the PtdIns(4)P/GOLPH3/MYO18A/F-actin complex is a hub for signals that regulate Golgi trafficking function. The MYO18A PDZ domain binds p190Rho-guanine nucleotide exchange factor (p190RhoGEF), Golgin45, and leucine repeat adaptor protein 1 (Lurap1, also known as Lrap35a). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MYO18-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta- strand F.


Pssm-ID: 467229 [Multi-domain]  Cd Length: 90  Bit Score: 176.73  E-value: 1.31e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMLDRAPE-GQAYRRVVHFAEPGAGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQ 298
Cdd:cd06747      1 EITLKRQPTGDFGFSLRRGTIVERGPDdGQELKRTVHFAEPGAGTKNLATGLLPGDRLIEVNGVNVENASRDEIIEMIRK 80
                           90
                   ....*....|
gi 1907082211  299 SGDSVRLKVQ 308
Cdd:cd06747     81 SGDTVTLKVQ 90
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1268-1962 5.22e-44

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 176.13  E-value: 5.22e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1268 NIKKNKGVKDWPWWKLFTTVRPLIQVQLSEEQIRNKDE-EIQQLRSKLEKVEKERNELrlssdrlETRISELTSELtder 1346
Cdd:pfam01576  374 NLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESERQRAEL-------AEKLSKLQSEL---- 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1347 ntgESASQLLDAETAERLRTEKEMKELQTQ-YDALKKQMEVMEMEVMEARLIRA--AEINGEVDDDDAGGEWRLKYERAV 1423
Cdd:pfam01576  443 ---ESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQleDERNSLQEQLEEEEEAKRNVERQL 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1424 REVDF----TKKRLQQELEdKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE 1499
Cdd:pfam01576  520 STLQAqlsdMKKKLEEDAG-TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSN 598
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1500 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSqeSKDEA 1579
Cdd:pfam01576  599 LEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVS--SKDDV 676
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 --SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEV 1657
Cdd:pfam01576  677 gkNVHELERSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLVKQVRELEA 756
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1658 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrKDLKRTKALLADAQIMLDHLKNN-----APSKREIAQL 1732
Cdd:pfam01576  757 ELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAV-KQLKKLQAQMKDLQRELEEARASrdeilAQSKESEKKL 835
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KN------QLEESeftCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1806
Cdd:pfam01576  836 KNleaellQLQED---LAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRL 912
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 KAAVAQ---------ASRDMAQMND-LQAQIEESNKekqELQEKLQALQSQVEFLEqsmvdKSLVSRQEAKIRELETRLE 1876
Cdd:pfam01576  913 RKSTLQveqlttelaAERSTSQKSEsARQQLERQNK---ELKAKLQEMEGTVKSKF-----KSSIAALEAKIAQLEEQLE 984
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1877 FEKTQ----VKRLENLASRLKETMEKLTEER---DQRAAAENREKEQNKRLQRQLRDTKEEMSelarkEAEASRKKheLE 1949
Cdd:pfam01576  985 QESRErqaaNKLVRRTEKKLKEVLLQVEDERrhaDQYKDQAEKGNSRMKQLKRQLEEAEEEAS-----RANAARRK--LQ 1057
                          730
                   ....*....|...
gi 1907082211 1950 MDLESLEAANQSL 1962
Cdd:pfam01576 1058 RELDDATESNESM 1070
PTZ00014 PTZ00014
myosin-A; Provisional
402-1273 2.20e-35

myosin-A; Provisional


Pssm-ID: 240229 [Multi-domain]  Cd Length: 821  Bit Score: 147.48  E-value: 2.20e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  402 LVHRDgfSLASQLKSEELSLPEGKarvkldhdgaILDVDEDDIEKANAP-SCDRLEDLASLVYLNESSVLHTLRQRYGAS 480
Cdd:PTZ00014    56 LVLPG--STGEKLTLKQIDPPTNS----------TFEVKPEHAFNANSQiDPMTYGDIGLLPHTNIPCVLDFLKHRYLKN 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  481 LLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHL 559
Cdd:PTZ00014   124 QIYTTADPLLVAINPFKDLGNTTNDWIRRYRDAKDsDKLPPHVFTTARRALENLHGVKKSQTIIVSGESGAGKTEATKQI 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  560 VQYLAtiAGTSGTKVFSVEK--WQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRP 637
Cdd:PTZ00014   204 MRYFA--SSKSGNMDLKIQNaiMAA-NPVLEAFGNAKTIRNNNSSRFGRFMQLQLGEEGGIRYGSIVAFLLEKSRVVTQE 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  638 ASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASI 717
Cdd:PTZ00014   281 DDERSYHIFYQLLKGANDEMKEKYKLKSLEEYKY--INPKCLDVPGIDDVKDFEEVMESFDSMGLSESQIEDIFSILSGV 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  718 YHLG-------AAGATKEAAEAGRKQFarhEWAQKAAYLLGCSLEELS-SAIFKHQLKGGTLQRStsfRQGPEESglgEG 789
Cdd:PTZ00014   359 LLLGnveiegkEEGGLTDAAAISDESL---EVFNEACELLFLDYESLKkELTVKVTYAGNQKIEG---PWSKDES---EM 429
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  790 TKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----NpewggsargaSFEELCHNYAQDRL 865
Cdd:PTZ00014   430 LKDS-------LSKAVYEKLFLWIIRNLNATIEPPGGFKVFIGMLDIFGFEvfknN----------SLEQLFINITNEML 492
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  866 QRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV--DQashlvrSLAhadeargllwlleeealvP 934
Cdd:PTZ00014   493 QKNFVDIVFERESKLYKDEGIsteELEYTSNESVIDllcgkgKSVLSIleDQ------CLA------------------P 548
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  935 GATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKII 1014
Cdd:PTZ00014   549 GGTDEKFVSSCNTNLKNNPKYKPA-----KVDSNKNFVIKHTIGD--IQYCASGFLFKNK-DVLRPELVEVVKASPNPLV 620
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1015 SNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavkkkslciqiklQVDALIDTIKRSKMHFVHCFLP 1093
Cdd:PTZ00014   621 RDLFEG----VEVEKGKLAkGQLIGSQF-------LN---------------------QLDSLMSLINSTEPHFIRCIKP 668
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1094 vaegwpgeprsassrrvssssELDLPPGDPCEAGLLqldvsllrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1173
Cdd:PTZ00014   669 ---------------------NENKKPLDWNSSKVL--------IQLHSLSILEALQLRQLGFSYRRTFAEFLSQFKYLD 719
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1174 PHLTKKHGrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR---AGTLARLEEQRDEQTSRHLTLFQAACRGYLARQ 1250
Cdd:PTZ00014   720 LAVSNDSS-----LDPKEKAEKLLERSGLPKDSYAIGKTMVFLKkdaAKELTQIQREKLAAWEPLVSVLEALILKIKKKR 794
                          890       900
                   ....*....|....*....|...
gi 1907082211 1251 HFKKRkIQDLAIrcVQKNIKKNK 1273
Cdd:PTZ00014   795 KVRKN-IKSLVR--IQAHLRRHL 814
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1307-2025 1.55e-32

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 139.05  E-value: 1.55e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1307 IQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLL----DAETAERLR----TEKEMKELQT 1375
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQlerLRREREKAERYQALLkekrEYEGYELLKekeaLERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALKKQMEVMEMEVM------EARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTK-KRLQQELEDKMEVEQQSR 1448
Cdd:TIGR02169  245 QLASLEEELEKLTEEISelekrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASlERSIAEKERELEDAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1528
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1529 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDE-------ASLAKVKKQLRDLEAKVKDQEEE 1601
Cdd:TIGR02169  405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEwkleqlaADLSKYEQELYDLKEEYDRVEKE 484
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSIQMLEqAKLRLEMEMERMRQTHSKEMESRDEEV---------------------------------EEARQSC 1648
Cdd:TIGR02169  485 LSKLQRELAEAE-AQARASEERVRGGRAVEEVLKASIQGVhgtvaqlgsvgeryataievaagnrlnnvvvedDAVAKEA 563
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQME----------------------------------VQLEEEYEDKQK----------ALREKREL--ESKLST 1682
Cdd:TIGR02169  564 IELLKRRKagratflplnkmrderrdlsilsedgvigfavdlVEFDPKYEPAFKyvfgdtlvveDIEAARRLmgKYRMVT 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1683 L-------------------SDQVNQR-DFESEKRLRKDLKRTKALLADAQIMLDHLKNNA--------PSKREIAQLKN 1734
Cdd:TIGR02169  644 LegelfeksgamtggsraprGGILFSRsEPAELQRLRERLEGLKRELSSLQSELRRIENRLdelsqelsDASRKIGEIEK 723
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1735 QLEESEFTcAAAVKAR--------KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI-----QNRLEEDQEDMNE 1801
Cdd:TIGR02169  724 EIEQLEQE-EEKLKERleeleedlSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSK 802
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1802 LMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLE----F 1877
Cdd:TIGR02169  803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE------IENLNGKKEELEEELEeleaA 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1878 EKTQVKRLENLAS---RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHElEMDLES 1954
Cdd:TIGR02169  877 LRDLESRLGDLKKerdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLED 955
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1955 LEAANQSLQADLklafKRIGDLQAAIEDEMEsdENEDLINSLQDmvtkyqkKKNKLEGDSDvdsELEDRVD 2025
Cdd:TIGR02169  956 VQAELQRVEEEI----RALEPVNMLAIQEYE--EVLKRLDELKE-------KRAKLEEERK---AILERIE 1010
PTZ00121 PTZ00121
MAEBL; Provisional
1295-1990 7.10e-26

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 117.93  E-value: 7.10e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1295 LSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL--ETRISELTSELTDERNTgESASQLLDAETAERLRTEKEMKE 1372
Cdd:PTZ00121  1099 KAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKaeDARKAEEARKAEDAKRV-EIARKAEDARKAEEARKAEDAKK 1177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1373 LQTQYDALKKQMEVMEMEVMEARLIRAAEingevddddaggewRLKYERAVREVdftkkrlqQELEDKMEVEQqSRRQLE 1452
Cdd:PTZ00121  1178 AEAARKAEEVRKAEELRKAEDARKAEAAR--------------KAEEERKAEEA--------RKAEDAKKAEA-VKKAEE 1234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1453 RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKK--QRRFDSELSQAHEETQREKLQREKLQR 1530
Cdd:PTZ00121  1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKaeEKKKADEAKKAEEKKKADEAKKKAEEA 1314
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1531 EKdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-AKVKDQEEELDEQAGSI 1609
Cdd:PTZ00121  1315 KK----ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKkEEAKKKADAAKKKAEEK 1390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1610 QMLEQAKLRLEMEMERMRQTHSKEMESRdeEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1689
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAAKK--KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1690 RDFESEKRLRKDLKR----TKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCA-AAVKARKAMEVEMEDLHLQI 1764
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKkadeAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAkKAEEAKKADEAKKAEEKKKA 1548
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1765 DDIAKA---KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1841
Cdd:PTZ00121  1549 DELKKAeelKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1842 QALQSQVEFLEQSMVDKslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQ 1921
Cdd:PTZ00121  1629 EEEKKKVEQLKKKEAEE---KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1922 R-------------QLRDTKEE----MSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEM 1984
Cdd:PTZ00121  1706 ElkkkeaeekkkaeELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

                   ....*.
gi 1907082211 1985 ESDENE 1990
Cdd:PTZ00121  1786 DEEDEK 1791
PDZ smart00228
Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF ...
220-309 2.24e-11

Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.


Pssm-ID: 214570 [Multi-domain]  Cd Length: 85  Bit Score: 62.01  E-value: 2.24e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   220 ELELQRRPTGdFGFSLRRttmlDRAPEGQAYrrVVHFAEPGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:smart00228    4 LVELEKGGGG-LGFSLVG----GKDEGGGVV--VSSVVPGSPAAKA---GLRVGDVILEVNGTSVEGLTHLEAVDLLKKA 73
                            90
                    ....*....|
gi 1907082211   300 GDSVRLKVQP 309
Cdd:smart00228   74 GGKVTLTVLR 83
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
220-308 9.68e-09

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 54.21  E-value: 9.68e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRttMLDRAPEGQAYRRVVHFaepGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:pfam00595    1 QVTLEKDGRGGLGFSLKG--GSDQGDPGIFVSEVLPG---GAAEAG---GLKVGDRILSINGQDVENMTHEEAVLALKGS 72

                   ....*....
gi 1907082211  300 GDSVRLKVQ 308
Cdd:pfam00595   73 GGKVTLTIL 81
CtpA COG0793
C-terminal processing protease CtpA/Prc, contains a PDZ domain [Posttranslational modification, ...
269-309 2.65e-05

C-terminal processing protease CtpA/Prc, contains a PDZ domain [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440556 [Multi-domain]  Cd Length: 341  Bit Score: 48.71  E-value: 2.65e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIR-QSGDSVRLKVQP 309
Cdd:COG0793     88 GIKPGDIILAIDGKSVAGLTLDDAVKLLRgKAGTKVTLTIKR 129
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1438-1842 5.05e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.86  E-value: 5.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQ--------RALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQRrfdS 1509
Cdd:NF033838    54 ESQKEHAKEVESHLEKILSEIQKSLDKRKhtqnvalnKKLSDIKTEYLYELNVLKEKS---------EAELTSKTK---K 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDMLLAEA-FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESkdEASLAKVKKQL 1588
Cdd:NF033838   122 ELDAAFEQFKKDTLEPGKKVAEATKKVEEAeKKAKDQKEEDRRNYPTNTYKTLELEIAESDVEVKKA--ELELVKEEAKE 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1589 RDLEAKVKDQEEELDEQAGSIQMLEQAKlrlememermrqTHSKEMEsrdeevEEARQSCQKKLKQ-MEVQLEEEYEDKQ 1667
Cdd:NF033838   200 PRDEEKIKQAKAKVESKKAEATRLEKIK------------TDREKAE------EEAKRRADAKLKEaVEKNVATSEQDKP 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1668 KAlREKRELESKLSTlsdqvnqrdfesekrlrKDLKRTKALLADAQIMLDHLKNnaPSKREiaqlKNQLEESEFTCAAAV 1747
Cdd:NF033838   262 KR-RAKRGVLGEPAT-----------------PDKKENDAKSSDSSVGEETLPS--PSLKP----EKKVAEAEKKVEEAK 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1748 KARKAMEVE---------MEDLHLQiddIAKAKTALEE-QLSRLQREKNEIQNrleedQEDMNELMKKHKAAVAQASRdM 1817
Cdd:NF033838   318 KKAKDQKEEdrrnyptntYKTLELE---IAESDVKVKEaELELVKEEAKEPRN-----EEKIKQAKAKVESKKAEATR-L 388
                          410       420
                   ....*....|....*....|....*
gi 1907082211 1818 AQMNDLQAQIEESNKEKQELQEKLQ 1842
Cdd:NF033838   389 EKIKTDRKKAEEEAKRKAAEEDKVK 413
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1868-2033 1.09e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1868 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1947
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1948 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 2027
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082211 2028 KSWLSK 2033
Cdd:PRK03918   327 EERIKE 332
growth_prot_Scy NF041483
polarized growth protein Scy;
1493-1967 1.51e-04

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 47.51  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1493 QQVRNHELEKKQRRFDSE--LSQAHEETQR----EKLQREKLQREkdmLLAEAFSLKQQMEEK------------DLDIA 1554
Cdd:NF041483    76 QLLRNAQIQADQLRADAEreLRDARAQTQRilqeHAEHQARLQAE---LHTEAVQRRQQLDQElaerrqtveshvNENVA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1555 GFTQKVVSLEAE---LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM-------- 1623
Cdd:NF041483   153 WAEQLRARTESQarrLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKDAERLLnaastqaq 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1624 ------ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesEKR 1697
Cdd:NF041483   233 eatdhaEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESAN------EQR 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1698 LRKdlkrtkalladaqimldhlknnapSKREIAQLKNQ-LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1776
Cdd:NF041483   307 TRT------------------------AKEEIARLVGEaTKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDT 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1777 --QLSRLQREKNEIQNRLEEDQEDM-----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQ--------ELQEKL 1841
Cdd:NF041483   363 aaQLAKAARTAEEVLTKASEDAKATtraaaEEAERIRREAEAEADRLRGEAADQAEQLKGAAKDDTkeyraktvELQEEA 442
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1842 QALQSQVEFLEQSMVDKSLVSRQEAKireletrlefeKTQVKRLENLASRLKETMEKLTEERDQ--RAAAENREKEQNKR 1919
Cdd:NF041483   443 RRLRGEAEQLRAEAVAEGERIRGEAR-----------REAVQQIEEAARTAEELLTKAKADADElrSTATAESERVRTEA 511
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1920 LQR--QLRDTKEEMSELARKEAEasRKKHELEMDLESLEAANQSLQADLK 1967
Cdd:NF041483   512 IERatTLRRQAEETLERTRAEAE--RLRAEAEEQAEEVRAAAERAARELR 559
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1673-1847 3.13e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 45.39  E-value: 3.13e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1673 KRELESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQleeseftcaaavkarka 1752
Cdd:smart00787  146 KEGLDENLEGLK--------EDYKLLMKELELLNSIKPKLRDRKDALE------EELRQLKQL----------------- 194
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1753 mEVEMEDLHLQIDDIAKAKtaleeqLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEE--- 1829
Cdd:smart00787  195 -EDELEDCDPTELDRAKEK------LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQcrg 267
                           170
                    ....*....|....*....
gi 1907082211  1830 -SNKEKQELQEKLQALQSQ 1847
Cdd:smart00787  268 fTFKEIEKLKEQLKLLQSL 286
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1799-2015 4.38e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 45.05  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1799 MNELMKKHKAAVAQASRDMAqMNDLQAQIEESNKEKQELQEKLQALQSQV-EFLEQSMVDKSlvsrqeaKIRELETRLEf 1877
Cdd:cd22656     96 ILELIDDLADATDDEELEEA-KKTIKALLDDLLKEAKKYQDKAAKVVDKLtDFENQTEKDQT-------ALETLEKALK- 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1878 ektqvkrlenlasrlketmEKLTEErdqraaAENREKEQNKRLQRQLrdtKEEMSELARKEAEASRKKHELEMDLESLEA 1957
Cdd:cd22656    167 -------------------DLLTDE------GGAIARKEIKDLQKEL---EKLNEEYAAKLKAKIDELKALIADDEAKLA 218
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1958 ANQSLQADLKLAFKRIGDLQAAIEDEMESDENedLINSLQDMVTKYQKKKNKLEGDSD 2015
Cdd:cd22656    219 AALRLIADLTAADTDLDNLLALIGPAIPALEK--LQGAWQAIATDLDSLKDLLEDDIS 274
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1918-2012 7.62e-04

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 41.79  E-value: 7.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1918 KRLQRQLRDTKEEM-SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEdLINSL 1996
Cdd:pfam03938   18 KAAQAQLEKKFKKRqAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQE-LLQPI 96
                           90
                   ....*....|....*.
gi 1907082211 1997 QDMVTKYQKKKNKLEG 2012
Cdd:pfam03938   97 QDKINKAIKEVAKEKG 112
 
Name Accession Description Interval E-value
MYSc_Myo18 cd01386
class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain ...
467-1217 0e+00

class XVIII myosin, motor domain; Many members of this class contain a N-terminal PDZ domain which is commonly found in proteins establishing molecular complexes. The motor domain itself does not exhibit ATPase activity, suggesting that it functions as an actin tether protein. It also has two IQ domains that probably bind light chains or related calmodulins and a C-terminal tail with two sections of coiled-coil domains, which are thought to mediate homodimerization. The function of these myosins are largely unknown. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276837 [Multi-domain]  Cd Length: 689  Bit Score: 1131.26  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01386      1 SSVLHTLRQRYGANLIHTYAGPSLIVINPRHPLAVYSEKVAKMFKGCRREDMPPHIYASAQSAYRAMLMSRRDQSIVLLG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGtKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd01386     81 RSGSGKTTNCRHILEYLVTAAGSVG-GVLSVEKLNAALTVLEAFGNVRTALNGNATRFSQLFSLDFDQAGQLASASIQTL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 706
Cdd:cd01386    160 LLERSRVARRPEGESNFNVFYYLLAGADAALRTELHLNQLAESNSFGIVPLQKPEDKQKAAAAFSKLQAAMKTLGISEEE 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  707 QKTCWLILASIYHLGAAGATKeAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRQGPEESGL 786
Cdd:cd01386    240 QRAIWSILAAIYHLGAAGATK-AASAGRKQFARPEWAQRAAYLLGCTLEELSSAIFKHHLSGGPQQSTTSSGQESPARSS 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  787 GEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGSARGASFEELCHNYAQDRLQ 866
Cdd:cd01386    319 SGGPKLTGVEALEGFAAGLYSELFAAVVSLINRSLSSSHHSTSSITIVDTPGFQNPAHSGSQRGATFEDLCHNYAQERLQ 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  867 RLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASH--LVRSLAHADEARGLLWLLEEEALVPGATEDALLDR 944
Cdd:cd01386    399 LLFHERTFVAPLERYKQENVEVDFDLPELSPGALVALIDQAPQqaLVRSDLRDEDRRGLLWLLDEEALYPGSSDDTFLER 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  945 LFSYYGPQEGDkKGQSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNPATQNAPRLLQDSQKKiisnlflgrags 1024
Cdd:cd01386    479 LFSHYGDKEGG-KGHSLLRRSEGPLQFVLGHLLGTNPVEYDVSGWLKAAKENPSAQNATQLLQESQKE------------ 545
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1025 atvlsgsiagleggsqlalrratsmrktfttgMAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGEPRs 1104
Cdd:cd01386    546 --------------------------------TAAVKRKSPCLQIKFQVDALIDTLRRTGLHFVHCLLPQHNAGKDERS- 592
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1105 assrrvsssseldlppGDPCEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHGRNY 1184
Cdd:cd01386    593 ----------------TSSPAAGDELLDVPLLRSQLRGSQLLDALRLYRQGFPDHMPLGEFRRRFQVLAPPLTKKLGLNS 656
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082211 1185 IVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd01386    657 EVADERKAVEELLEELDLEKSSYRIGLSQVFFR 689
MYSc cd00124
Myosin motor domain superfamily; Myosin motor domain. The catalytic (head) domain has ATPase ...
468-1217 5.14e-143

Myosin motor domain superfamily; Myosin motor domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276950 [Multi-domain]  Cd Length: 633  Bit Score: 461.68  E-value: 5.14e-143
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd00124      2 AILHNLRERYARDLIYTYVGDILVAVNPFKWLPLYSEEVMEKYRGKGRsADLPPHVFAVADAAYRAMLRDGQNQSILISG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFS-----VEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 621
Cdd:cd00124     82 ESGAGKTETTKLVLKYLAALSGSGSSKSSSsassiEQQILQSNPILEAFGNAKTVRNDNSSRFGKFIELQFDPTGRLVGA 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  622 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAEN---NVFGIVPLSKPEEKQKAAQQFSKLQAAMK 698
Cdd:cd00124    162 SIETYLLEKSRVVSQAPGERNFHIFYQLLAGLSDGAREELKLELLLSYyylNDYLNSSGCDRIDGVDDAEEFQELLDALD 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  699 VLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRK-QFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTlqrstsf 777
Cdd:cd00124    242 VLGFSDEEQDSIFRILAAILHLGNIEFEEDEEDEDSSaEVADDESLKAAAKLLGVDAEDLEEALTTRTIKVGG------- 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  778 rqgpeESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSS--QHSLCSMMIVDTPGFQNPEWggsargASFEE 855
Cdd:cd00124    315 -----ETITKPLTVEQAEDARDALAKALYSRLFDWLVNRINAALSPTdaAESTSFIGILDIFGFENFEV------NSFEQ 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  856 LCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDLEPVAD------DSV-AAVDQASHLvrslahadeargllw 925
Cdd:cd00124    384 LCINYANEKLQQFFNQHVFKLEQEEYEEEGIDWSfidFPDNQDCLDliegkpLGIlSLLDEECLF--------------- 448
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  926 lleeealvPGATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFllGHSHGTNWVEYNVAGWLNYTKQNpatqnaprl 1005
Cdd:cd00124    449 --------PKGTDATFLEKLYSAHGSHPRFFSK-----KRKAKLEF--GIKHYAGDVTYDADGFLEKNKDT--------- 504
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1006 lqdsqkkiisnlflgragsatvLSGSIAGLeggsqlaLRRATSMRKtfttgmaavkkkslciqiklQVDALIDTIKRSKM 1085
Cdd:cd00124    505 ----------------------LPPDLVDL-------LRSGSQFRS--------------------QLDALMDTLNSTQP 535
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1086 HFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1165
Cdd:cd00124    536 HFVRCIKPNDEKKPG-----------------------------LFDPELVLEQLRCAGVLEAVRIRRAGYPVRLPFDEF 586
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1166 RRRFDVLAPHLTKKHGRNYIVVDEKRAveellESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd00124    587 LKRYRILAPGATEKASDSKKAAVLALL-----LLLKLDSSGYQLGKTKVFLR 633
MYSc smart00242
Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical ...
448-1229 7.74e-114

Myosin. Large ATPases; ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.


Pssm-ID: 214580 [Multi-domain]  Cd Length: 677  Bit Score: 378.81  E-value: 7.74e-114
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   448 NAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQ 527
Cdd:smart00242    1 NPPKFEGVEDLVLLTYLNEPAVLHNLKKRYLKDLIYTYIGLVLVAVNPYKQLPIYTDEVIKKYRGKSRGELPPHVFAIAD 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   528 TAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFS 605
Cdd:smart00242   81 NAYRNMLNDKENQSIIISGESGAGKTENTKKIMQYLASVSG-SNTEVGSVED-QILESnpILEAFGNAKTLRNNNSSRFG 158
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   606 QILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFgivpLSKPEEKQK 685
Cdd:smart00242  159 KFIEIHFDAKGKIIGAKIETYLLEKSRVVSQAKGERNYHIFYQLLAGASEELKKELGLKS-PEDYRY----LNQGGCLTV 233
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   686 A----AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAI 761
Cdd:smart00242  234 DgiddAEEFKETLNAMRVLGFSEEEQESIFKILAAILHLGNIEFEEGRNDNAASTVKDKEELSNAAELLGVDPEELEKAL 313
                           330       340       350       360       370       380       390       400
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   762 fkhqlkggtLQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ- 840
Cdd:smart00242  314 ---------TKRKIKTGGEVITKPL---NVEQALDARDALAKALYSRLFDWLVKRINQSLSFKDGSTYFIGVLDIYGFEi 381
                           410       420       430       440       450       460       470       480
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   841 ---NpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVRS- 913
Cdd:smart00242  382 fevN----------SFEQLCINYANEKLQQFFNQHVFKLEQEEYEREGIDwtfIDFFDNQDCID-----------LIEKk 440
                           490       500       510       520       530       540       550       560
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   914 ----LAHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNV 986
Cdd:smart00242  441 ppgiLSLLDEE----------CRFPKGTDQTFLEKLNQHH--------KKHPHFSKPKKKgrtEFIIKHYAGD--VTYDV 500
                           570       580       590       600       610       620       630       640
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   987 AGWLnytKQN--PATQNAPRLLQDSQKKIISNLFlgragsatvlsGSIAGleggsqlalrRATSMRKTFTTGMaavkkks 1064
Cdd:smart00242  501 TGFL---EKNkdTLSDDLIELLQSSKNPLIASLF-----------PSGVS----------NAGSKKRFQTVGS------- 549
                           650       660       670       680       690       700       710       720
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1065 lciQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSR 1144
Cdd:smart00242  550 ---QFKEQLNELMDTLNSTNPHFIRCIKPNEEKKPG-----------------------------DFDSSLVLHQLRYLG 597
                           730       740       750       760       770       780       790       800
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1145 LLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFRAGTLARL 1224
Cdd:smart00242  598 VLENIRIRRAGFPYRLPFDEFLQRYRVLLPD-TWPPWGG----DAKKACEALLQSLGLDEDEYQLGKTKVFLRPGQLAEL 672

                    ....*
gi 1907082211  1225 EEQRD 1229
Cdd:smart00242  673 EELRE 677
COG5022 COG5022
Myosin heavy chain [General function prediction only];
429-1943 9.27e-109

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 383.66  E-value: 9.27e-109
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  429 KLDHDGAILDVDEDDIEKANAPSCDRLEDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMH 508
Cdd:COG5022     42 KEDGESVSVKKKVLGNDRIKLPKFDGVDDLTELSYLNEPAVLHNLEKRYNNGQIYTYSGLVLIAVNPYRDLGIYTDDIIQ 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  509 MFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--L 586
Cdd:COG5022    122 SYSGKNRLELEPHVFAIAEEAYRNLLSEKENQTIIISGESGAGKTENAKRIMQYLASVTSSSTVEISSIEK-QILATnpI 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  587 LEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL--- 663
Cdd:COG5022    201 LEAFGNAKTVRNDNSSRFGKYIKIEFDENGEICGAKIETYLLEKSRVVHQNKNERNYHIFYQLLAGDPEELKKLLLLqnp 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  664 ---NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLG--AAGATKEaaeaGRKQFA 738
Cdd:COG5022    281 kdyIYLSQGGCDKIDGIDDAKE-------FKITLDALKTIGIDEEEQDQIFKILAAILHIGniEFKEDRN----GAAIFS 349
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  739 RHEWAQKAAYLLGCSLEELSSAIFKHQLKGG--TLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISL 816
Cdd:COG5022    350 DNSVLDKACYLLGIDPSLFVKWLVKRQIKTGgeWIVVPLNLEQ--------------ALAIRDSLAKALYSNLFDWIVDR 415
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  817 VNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDL 893
Cdd:COG5022    416 INKSLDHSAAASNFIGVLDIYGFEIFEKN------SFEQLCINYTNEKLQQFFNQHMFKLEQEEYVKEGIEwsfIDYFDN 489
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  894 EPVADDSvaavdQASHLVRSLAHADEArgllwlleeeALVPGATEDALLDRLFSYYgPQEGDKKGQSPLLRSSKprhFLL 973
Cdd:COG5022    490 QPCIDLI-----EKKNPLGILSLLDEE----------CVMPHATDESFTSKLAQRL-NKNSNPKFKKSRFRDNK---FVV 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  974 GHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvlsgsiaglEGGSQLALRRATSMrktf 1053
Cdd:COG5022    551 KHYAGD--VEYDVEGFLDKNK-DPLNDDLLELLKASTNEFVSTLFDDEE-------------NIESKGRFPTLGSR---- 610
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1054 ttgmaavkkkslciqIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDV 1133
Cdd:COG5022    611 ---------------FKESLNSLMSTLNSTQPHYIRCIKPNEEKSPW-----------------------------TFDN 646
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1134 SLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHlTKKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSR 1213
Cdd:COG5022    647 QMVLSQLRCCGVLETIRISRAGFPSRWTFDEFVQRYRILSPS-KSWTGEYTWKEDTKNAVKSILEELVIDSSKYQIGNTK 725
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1214 VFFRAGTLARLEEQRDEQTSRHLTLFQAACRGYLARQHF-KKRKIQDlAIRCVQKNIKKNKGVKDWPWWKLFTTVRPLIQ 1292
Cdd:COG5022    726 VFFKAGVLAALEDMRDAKLDNIATRIQRAIRGRYLRRRYlQALKRIK-KIQVIQHGFRLRRLVDYELKWRLFIKLQPLLS 804
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1293 VQLSEEQIRNKDEEIQQLRSKLEKV----EKERNELRLSSDRLETRISEltSELTDERNTGESASQLLDAETAERLRTEK 1368
Cdd:COG5022    805 LLGSRKEYRSYLACIIKLQKTIKREkklrETEEVEFSLKAEVLIQKFGR--SLKAKKRFSLLKKETIYLQSAQRVELAER 882
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1369 EMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDF----TKKRLQQELEDKMEVE 1444
Cdd:COG5022    883 QLQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLeegpSIEYVKLPELNKLHEV 962
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRrfdsELSQAHEETQREKLQ 1524
Cdd:COG5022    963 ESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV----EVAELQSASKIISSE 1038
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDMLLAEAFSLKQQME-EKDLdiagftqKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAK---VKDQEE 1600
Cdd:COG5022   1039 STELSILKPLQKLKGLLLLENNQlQARY-------KALKLRRENSLL---DDKQLYQLESTENLLKTINVKdleVTNRNL 1108
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1601 ELDEQAGSIQMLEQAKLRLEMEMERM--RQTHSKEMESRDEEVEEARQSCQKKLKQMEvqlEEEYEDKQKALREKRELES 1678
Cdd:COG5022   1109 VKPANVLQFIVAQMIKLNLLQEISKFlsQLVNTLEPVFQKLSVLQLELDGLFWEANLE---ALPSPPPFAALSEKRLYQS 1185
                         1290      1300      1310      1320      1330      1340      1350      1360
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1679 KL----STLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLknqleESEFTCAAAVKark 1751
Cdd:COG5022   1186 ALydekSKLSSSEVNdlkNELIALFSKIFSGWPRGDKLKKLISEGWVPTEYSTSLKGFNNL-----NKKFDTPASMS--- 1257
                         1370      1380      1390      1400      1410      1420      1430      1440
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1752 ameveMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLeeDQEDMNEL-MKKHKAAVAQASRDmaqmnDLQAQIEES 1830
Cdd:COG5022   1258 -----NEKLLSLLNSIDNLLSSYKLEEEVLPATINSLLQYI--NVGLFNALrTKASSLRWKSATEV-----NYNSEELDD 1325
                         1450      1460      1470      1480      1490      1500      1510      1520
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1831 NKEKQELQEklqaLQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKT-QVKRLENlasrlketmeklteeRDQRAAA 1909
Cdd:COG5022   1326 WCREFEISD----VDEELEELIQAVKVLQLLKDDLNKLDELLDACYSLNPaEIQNLKS---------------RYDPADK 1386
                         1530      1540      1550
                   ....*....|....*....|....*....|....*
gi 1907082211 1910 ENR-EKEQNKRLQRQLRDTKEEMSELARKEAEASR 1943
Cdd:COG5022   1387 ENNlPKEILKKIEALLIKQELQLSLEGKDETEVHL 1421
Myosin_head pfam00063
Myosin head (motor domain);
456-1217 1.69e-108

Myosin head (motor domain);


Pssm-ID: 395017 [Multi-domain]  Cd Length: 674  Bit Score: 363.14  E-value: 1.69e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  456 EDLASLVYLNESSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLM 535
Cdd:pfam00063    2 EDMVELSYLNEPSVLHNLKKRYKSDLIYTYSGLVLVAVNPYKQLPIYSEDMIKAYRGKRRGELPPHIFAIADEAYRSMLQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  536 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGT-SGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDF 612
Cdd:pfam00063   82 DKENQSILISGESGAGKTENTKKIMQYLASVSGSgSAGNVGRLEE-QILQSnpILEAFGNAKTVRNNNSSRFGKYIEIQF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  613 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL---------NHLAENNVFGIvplskpeek 683
Cdd:pfam00063  161 DAKGDIVGGKIETYLLEKSRVVYQAEGERNYHIFYQLLAGASAQLKKELRLtnpkdyhylSQSGCYTIDGI--------- 231
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  684 qKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKqFARHEWAQKAAYLLGCSLEELSSAIFK 763
Cdd:pfam00063  232 -DDSEEFKITDKAMDILGFSDEEQMGIFRIVAAILHLGNIEFKKERNDEQAV-PDDTENLQKAASLLGIDSTELEKALCK 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  764 HQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNP 842
Cdd:pfam00063  310 RRIKTGRETVSKPQ------------NVEQANYARDALAKAIYSRLFDWLVDRINKSLDVKTIEKASFIgVLDIYGFEIF 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  843 EWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVRS-----L 914
Cdd:pfam00063  378 EKN------SFEQLCINYVNEKLQQFFNHHMFKLEQEEYVREGIEwtfIDFGDNQPCID-----------LIEKkplgiL 440
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  915 AHADEArgllwlleeeALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLN 991
Cdd:pfam00063  441 SLLDEE----------CLFPKATDQTFLDKLYSTF--------SKHPHFQKPRLQgetHFIIKHYAGD--VEYNVEGFLE 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  992 YTKqNPATQNAPRLLQDSQKKIISNLFLGRAgsatvLSGSIAGLEGGSQLALRRATSMRKtfTTGMaavkkkslciQIKL 1071
Cdd:pfam00063  501 KNK-DPLNDDLVSLLKSSSDPLLAELFPDYE-----TAESAAANESGKSTPKRTKKKRFI--TVGS----------QFKE 562
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1072 QVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRM 1151
Cdd:pfam00063  563 SLGELMKTLNSTNPHYIRCIKPNEKKRAG-----------------------------VFDNSLVLHQLRCNGVLEGIRI 613
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1152 YRQGYPDHMVFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:pfam00063  614 RRAGFPNRITFQEFVQRYRILAPKTWPK-----WKGDAKKGCEAILQSLNLDKEEYQFGKTKIFFR 674
MYSc_class_II cd01377
class II myosins, motor domain; Myosin motor domain in class II myosins. Class II myosins, ...
468-1217 1.02e-98

class II myosins, motor domain; Myosin motor domain in class II myosins. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. Thus, myosin II has two heads. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276951 [Multi-domain]  Cd Length: 662  Bit Score: 334.43  E-value: 1.02e-98
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPaVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01377      2 SVLHNLRERYYSDLIYTYSGLFCVAVNPyKRLP-IYTEEVIDKYKGKRREEMPPHIFAIADNAYRNMLQDRENQSILITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW-----QALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 619
Cdd:cd01377     81 ESGAGKTENTKKVIQYLASVAASSKKKKESGKKKgtledQILQAnpILEAFGNAKTVRNNNSSRFGKFIRIHFGSTGKIA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  620 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFgIVPLSKPEEKQKAAQQFSKLQAAMKV 699
Cdd:cd01377    161 GADIETYLLEKSRVVRQAKGERNYHIFYQLLSGADPELKEKLLLTGDPSYYFF-LSQGELTIDGVDDAEEFKLTDEAFDI 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  700 LAISPEEQKTCWLILASIYHLGAAGATKEaaeaGRKQFAR---HEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQRS 774
Cdd:cd01377    240 LGFSEEEKMSIFKIVAAILHLGNIKFKQR----RREEQAEldgTEEADKAAHLLGVNSSDLLKALLKPRIKVGRewVTKG 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 TSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----Npewggsarg 850
Cdd:cd01377    316 QNKEQ--------------VVFSVGALAKALYERLFLWLVKRINKTLDTKSKRQYFIGVLDIAGFEifefN--------- 372
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  851 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF----DDLEPVADdsvaavdqashLVRS-----LAHADEar 921
Cdd:cd01377    373 -SFEQLCINYTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFidfgLDLQPTID-----------LIEKpnmgiLSILDE-- 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  922 gllwlleeEALVPGATEDALLDRLFSYygpQEGDKKGQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLnyTK-QNPATQ 1000
Cdd:cd01377    439 --------ECVFPKATDKTFVEKLYSN---HLGKSKNFKKPKPKKSEAHFILKHYAGD--VEYNIDGWL--EKnKDPLNE 503
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1001 NAPRLLQDSQKKIISNLFlgragsatvlsgSIAGLEGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDALIDTI 1080
Cdd:cd01377    504 NVVALLKKSSDPLVASLF------------KDYEESGGGGGKKKKKGGSFRT----VSQLHKEQL--------NKLMTTL 559
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1081 KRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHM 1160
Cdd:cd01377    560 RSTHPHFVRCIIPNEEKKPGK-----------------------------IDAPLVLHQLRCNGVLEGIRICRKGFPNRI 610
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1161 VFSEFRRRFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd01377    611 IFAEFKQRYSILAPNAIPK-----GFDDGKAACEKILKALQLDPELYRIGNTKVFFK 662
MYSc_Myh10 cd14920
class II myosin heavy chain 10, motor domain; Myosin motor domain of non-muscle myosin heavy ...
467-1217 2.86e-90

class II myosin heavy chain 10, motor domain; Myosin motor domain of non-muscle myosin heavy chain 10 (also called NMMHCB). Mutations in this gene have been associated with May-Hegglin anomaly and developmental defects in brain and heart. Multiple transcript variants encoding different isoforms have been found for this gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276952 [Multi-domain]  Cd Length: 673  Bit Score: 310.02  E-value: 2.86e-90
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14920      1 ASVLHNLKDRYYSGLIYTYSGLFCVVINPYKNLPIYSENIIEMYRGKKRHEMPPHIYAISESAYRCMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIA-----GTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 619
Cdd:cd14920     81 ESGAGKTENTKKVIQYLAHVAsshkgRKDHNIPGELER-QLLQAnpILESFGNAKTVKNDNSSRFGKFIRINFDVTGYIV 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  620 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF--GIVPLSkpeeKQKAAQQFSKLQAAM 697
Cdd:cd14920    160 GANIETYLLEKSRAVRQAKDERTFHIFYQLLSGAGEHLKSDLLLEGFNNYRFLsnGYIPIP----GQQDKDNFQETMEAM 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  698 KVLAISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQRST 775
Cdd:cd14920    236 HIMGFSHEEILSMLKVVSSVLQFGNISFKKE-RNTDQASMPENTVAQKLCHLLGMNVMEFTRAILTPRIKVGRdyVQKAQ 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 sfrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFE 854
Cdd:cd14920    315 --------------TKEQADFAVEALAKATYERLFRWLVHRINKALDRTKRQGASFIgILDIAGFEIFELN------SFE 374
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  855 ELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFD----DLEPVADdsvaAVDQASHLVRSLAHADEargllwlleeE 930
Cdd:cd14920    375 QLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIdfglDLQPCID----LIERPANPPGVLALLDE----------E 440
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  931 ALVPGATEDALLDRLfsyygpqEGDKKGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQ 1007
Cdd:cd14920    441 CWFPKATDKTFVEKL-------VQEQGSHSKFQKPRQLKdkaDFCIIHYAGK--VDYKADEWL-MKNMDPLNDNVATLLH 510
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1008 DSQKKIISNLFlgRAGSATVlsgSIAGLEGGSQLALRRATSMRKTFTTGMAAVKKKSLciqiklqvDALIDTIKRSKMHF 1087
Cdd:cd14920    511 QSSDRFVAELW--KDVDRIV---GLDQVTGMTETAFGSAYKTKKGMFRTVGQLYKESL--------TKLMATLRNTNPNF 577
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1088 VHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRR 1167
Cdd:cd14920    578 VRCIIPNHEKRAG-----------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFRQ 628
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1168 RFDVLAPHLTKKhgrnyIVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14920    629 RYEILTPNAIPK-----GFMDGKQACERMIRALELDPNLYRIGQSKIFFR 673
MYSc_Myh11 cd14921
class II myosin heavy chain 11, motor domain; Myosin motor domain of smooth muscle myosin ...
467-1217 1.35e-84

class II myosin heavy chain 11, motor domain; Myosin motor domain of smooth muscle myosin heavy chain 11 (also called SMMHC, SMHC). The gene product is a subunit of a hexameric protein that consists of two heavy chain subunits and two pairs of non-identical light chain subunits. It functions as a major contractile protein, converting chemical energy into mechanical energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat NUDE1 is transcribed from the reverse strand of this gene, and its 3' end overlaps with that of the latter. Inversion of the MYH11 locus is one of the most frequent chromosomal aberrations found in acute myeloid leukemia. Alternative splicing generates isoforms that are differentially expressed, with ratios changing during muscle cell maturation. Mutations in MYH11 have been described in individuals with thoracic aortic aneurysms leading to acute aortic dissections with patent ductus arteriosus. MYH11 mutations are also thought to contribute to human colorectal cancer and are also associated with Peutz-Jeghers syndrome. The mutations found in human intestinal neoplasia result in unregulated proteins with constitutive motor activity, similar to the mutant myh11 zebrafish. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276885 [Multi-domain]  Cd Length: 673  Bit Score: 293.46  E-value: 1.35e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14921      1 ASVLHNLRERYFSGLIYTYSGLFCVVVNPYKHLPIYSEKIVDMYKGKKRHEMPPHIYAIADTAYRSMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIA----GTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 620
Cdd:cd14921     81 ESGAGKTENTKKVIQYLAVVAsshkGKKDTSITGELEKQLLQAnpILEAFGNAKTVKNDNSSRFGKFIRINFDVTGYIVG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  621 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNV--FGIVPLSkpeeKQKAAQQFSKLQAAMK 698
Cdd:cd14921    161 ANIETYLLEKSRAIRQARDERTFHIFYYLIAGAKEKMRSDLLLEGFNNYTFlsNGFVPIP----AAQDDEMFQETLEAMS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  699 VLAISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrsTSFR 778
Cdd:cd14921    237 IMGFSEEEQLSILKVVSSVLQLGNIVFKKE-RNTDQASMPDNTAAQKVCHLMGINVTDFTRSILTPRIKVG-----RDVV 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  779 QGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFEELC 857
Cdd:cd14921    311 QKAQ-------TKEQADFAIEALAKATYERLFRWILTRVNKALDKTHRQGASFLgILDIAGFEIFEVN------SFEQLC 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  858 HNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALVPGAT 937
Cdd:cd14921    378 INYTNEKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIELIERPNNPPGVLALLDE----------ECWFPKAT 447
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  938 EDALLDRLFSyygPQEGDKKGQSPLLRSSKPRHFLLghsHGTNWVEYNVAGWLNyTKQNPATQNAPRLLQDSQKKIISNL 1017
Cdd:cd14921    448 DKSFVEKLCT---EQGNHPKFQKPKQLKDKTEFSII---HYAGKVDYNASAWLT-KNMDPLNDNVTSLLNASSDKFVADL 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1018 FLGragsatvlSGSIAGLEGGSQLALRRATSMRKTfTTGMAavkkKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEG 1097
Cdd:cd14921    521 WKD--------VDRIVGLDQMAKMTESSLPSASKT-KKGMF----RTVGQLYKEQLGKLMTTLRNTTPNFVRCIIPNHEK 587
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1098 WPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLT 1177
Cdd:cd14921    588 RSG-----------------------------KLDAFLVLEQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILAANAI 638
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082211 1178 KKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14921    639 PKG-----FMDGKQACILMIKALELDPNLYRIGQSKIFFR 673
MYSc_Myh2_insects_mollusks cd14911
class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle ...
467-1217 7.48e-84

class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle myosin heavy chain 2 (also called MYH2A, MYHSA2, MyHC-IIa, MYHas8, MyHC-2A) in insects and mollusks. This gene encodes a member of the class II or conventional myosin heavy chains, and functions in skeletal muscle contraction. Mutations in this gene results in inclusion body myopathy-3 and familial congenital myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276876 [Multi-domain]  Cd Length: 674  Bit Score: 291.11  E-value: 7.48e-84
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14911      1 ASVLHNIKDRYYSGLIYTYSGLFCVVVNPYKKLPIYTEKIMERYKGIKRHEVPPHVFAITDSAYRNMLGDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIA-----GTSGTKVFSV-------EKWQAL---STLLEAFGNSPTIMNGSATRFSQILSLD 611
Cdd:cd14911     81 ESGAGKTENTKKVIQFLAYVAaskpkGSGAVPHPAVnpavligELEQQLlqaNPILEAFGNAKTVKNDNSSRFGKFIRIN 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  612 FDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVF---GIVPLSKPEEkqkaAQ 688
Cdd:cd14911    161 FDASGFISGANIETYLLEKSRAIRQAKDERTFHIFYQLLAGATPEQREKFILDD-VKSYAFlsnGSLPVPGVDD----YA 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  689 QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAgATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKG 768
Cdd:cd14911    236 EFQATVKSMNIMGMTSEDFNSIFRIVSAVLLFGSM-KFRQERNNDQATLPDNTVAQKIAHLLGLSVTDMTRAFLTPRIKV 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  769 GtlqrstsfRQGPEESGLGEGTKLSalecLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgs 847
Cdd:cd14911    315 G--------RDFVTKAQTKEQVEFA----VEAIAKACYERMFKWLVNRINRSLDRTKRQGASFIgILDMAGFEIFELN-- 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  848 argaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFD----DLEPVADdsvaavdqashLVrslahaDEARGL 923
Cdd:cd14911    381 ----SFEQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWKFIdfglDLQPTID-----------LI------DKPGGI 439
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  924 LWLLEEEALVPGATEDALLDRLFSYYgpqegdkkGQSPLLRSSKPR---HFLLGHSHGTnwVEYNVAGWLnYTKQNPATQ 1000
Cdd:cd14911    440 MALLDEECWFPKATDKTFVDKLVSAH--------SMHPKFMKTDFRgvaDFAIVHYAGR--VDYSAAKWL-MKNMDPLNE 508
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1001 NAPRLLQDSQKKIISNLFlgraGSATVLSGSIAGLeGGSQLALRRATSMRKTFTTgmaavkkkslciQIKLQVDALIDTI 1080
Cdd:cd14911    509 NIVSLLQGSQDPFVVNIW----KDAEIVGMAQQAL-TDTQFGARTRKGMFRTVSH------------LYKEQLAKLMDTL 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1081 KRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHM 1160
Cdd:cd14911    572 RNTNPNFVRCIIPNHEKRAG-----------------------------KIDAPLVLDQLRCNGVLEGIRICRQGFPNRI 622
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1161 VFSEFRRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14911    623 PFQEFRQRYELLTPNVIPKG-----FMDGKKACEKMIQALELDSNLYRVGQSKIFFR 674
MYSc_Myh18 cd14932
class II myosin heavy chain 18, motor domain; Myosin motor domain of muscle myosin heavy chain ...
467-1217 4.32e-82

class II myosin heavy chain 18, motor domain; Myosin motor domain of muscle myosin heavy chain 18. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276895 [Multi-domain]  Cd Length: 676  Bit Score: 286.15  E-value: 4.32e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14932      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKYLPIYSEEIVNMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTK------VFS---VEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQA 615
Cdd:cd14932     81 ESGAGKTENTKKVIQYLAYVASSFKTKkdqssiALShgeLEK-QLLQAnpILEAFGNAKTVKNDNSSRFGKFIRINFDVN 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  616 GQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQA 695
Cdd:cd14932    160 GYIVGANIETYLLEKSRAIRQAKDERAFHIFYYLLTGAGDKLRSELCLEDYSKYRFLSNGNVTIPGQQDK--ELFAETME 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrsT 775
Cdd:cd14932    238 AFRIMSIPEEEQTGLLKVVSAVLQLGNMSFKKE-RNSDQASMPDDTAAQKVCHLLGMNVTDFTRAILSPRIKVG-----R 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFRQGPEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFE 854
Cdd:cd14932    312 DYVQKAQ-------TQEQAEFAVEALAKASYERMFRWLVMRINKALDKTKRQGASFIgILDIAGFEIFELN------SFE 378
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  855 ELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALVP 934
Cdd:cd14932    379 QLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWSFIDFGLDLQPCIELIEKPNGPPGILALLDE----------ECWFP 448
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  935 GATEDALLDRLFSYYGpqeGDKKGQSPlLRSSKPRHFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQDSQKKII 1014
Cdd:cd14932    449 KATDKSFVEKVVQEQG---NNPKFQKP-KKLKDDADFCIIHYAGK--VDYKANEWL-MKNMDPLNENVATLLNQSTDKFV 521
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1015 SNLFlgRAGSATVLSGSIAGLEGGSQLALRRATSMRKTFTTgmaavkkkslciQIKLQVDALIDTIKRSKMHFVHCFLPV 1094
Cdd:cd14932    522 SELW--KDVDRIVGLDKVAGMGESLHGAFKTRKGMFRTVGQ------------LYKEQLMNLMTTLRNTNPNFVRCIIPN 587
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1095 AEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd14932    588 HEKKAG-----------------------------KLAHHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFRQRYEILTP 638
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1907082211 1175 HLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14932    639 NAIPKG-----FMDGKQACVLMVKALELDPNLYRIGQSKVFFR 676
MYSc_Myh9 cd14919
class II myosin heavy chain 9, motor domain; Myosin motor domain of non-muscle myosin heavy ...
467-1217 2.14e-80

class II myosin heavy chain 9, motor domain; Myosin motor domain of non-muscle myosin heavy chain 9 (also called NMMHCA, NMHC-II-A, MHA, FTNS, EPSTS, and DFNA17). Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. The encoded protein is a myosin IIA heavy chain that contains an IQ domain and a myosin head-like domain which is involved in several important functions, including cytokinesis, cell motility and maintenance of cell shape. Defects in this gene have been associated with non-syndromic sensorineural deafness autosomal dominant type 17, Epstein syndrome, Alport syndrome with macrothrombocytopenia, Sebastian syndrome, Fechtner syndrome and macrothrombocytopenia with progressive sensorineural deafness. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276883 [Multi-domain]  Cd Length: 670  Bit Score: 281.21  E-value: 2.14e-80
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14919      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQAL---STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASI 623
Cdd:cd14919     81 ESGAGKTENTKKVIQYLAHVASSHKSKKDQGELERQLlqaNPILEAFGNAKTVKNDNSSRFGKFIRINFDVNGYIVGANI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  624 QTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQAAMKVLAIS 703
Cdd:cd14919    161 ETYLLEKSRAIRQAKEERTFHIFYYLLSGAGEHLKTDLLLEPYNKYRFLSNGHVTIPGQQDK--DMFQETMEAMRIMGIP 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  704 PEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrsTSFRQGPEe 783
Cdd:cd14919    239 EEEQMGLLRVISGVLQLGNIVFKKE-RNTDQASMPDNTAAQKVSHLLGINVTDFTRGILTPRIKVG-----RDYVQKAQ- 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  784 sglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFEELCHNYAQ 862
Cdd:cd14919    312 ------TKEQADFAIEALAKATYERMFRWLVLRINKALDKTKRQGASFIgILDIAGFEIFDLN------SFEQLCINYTN 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  863 DRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALVPGATEDALL 942
Cdd:cd14919    380 EKLQQLFNHTMFILEQEEYQREGIEWNFIDFGLDLQPCIDLIEKPAGPPGILALLDE----------ECWFPKATDKSFV 449
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  943 DRLFSYYGPQegdKKGQSPLLRSSKPrHFLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQDSQKKIISNLFlgRA 1022
Cdd:cd14919    450 EKVVQEQGTH---PKFQKPKQLKDKA-DFCIIHYAGK--VDYKADEWL-MKNMDPLNDNIATLLHQSSDKFVSELW--KD 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1023 GSATVLSGSIAGLeggSQLALRRATSMRKTFTTGMAAVKKKslciqiklQVDALIDTIKRSKMHFVHCFLPVAEGWPGep 1102
Cdd:cd14919    521 VDRIIGLDQVAGM---SETALPGAFKTRKGMFRTVGQLYKE--------QLAKLMATLRNTNPNFVRCIIPNHEKKAG-- 587
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1103 rsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHgr 1182
Cdd:cd14919    588 ---------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRVVFQEFRQRYEILTPNSIPKG-- 638
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1907082211 1183 nyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14919    639 ---FMDGKQACVLMIKALELDSNLYRIGQSKVFFR 670
MYSc_Myh15_mammals cd14929
class II myosin heavy chain 15, motor domain; Myosin motor domain of sarcomeric myosin heavy ...
467-1217 2.91e-77

class II myosin heavy chain 15, motor domain; Myosin motor domain of sarcomeric myosin heavy chain 15 in mammals (also called KIAA1000) . MYH15 is a slow-twitch myosin. Myh15 is a ventricular myosin heavy chain. Myh15 is absent in embryonic and fetal muscles and is found in orbital layer of extraocular muscles at birth. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276892 [Multi-domain]  Cd Length: 662  Bit Score: 271.46  E-value: 2.91e-77
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14929      1 ASVLHTLRRRYDHWMIYTYSGLFCVTINPYKWLPVYQKEVMAAYKGKRRSEAPPHIFAVANNAFQDMLHNRENQSILFTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKvfsvEKWQAL-------STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 619
Cdd:cd14929     81 ESGAGKTVNTKHIIQYFATIAAMIESK----KKLGALedqimqaNPVLEAFGNAKTLRNDNSSRFGKFIRMHFGARGMLS 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  620 SASIQTMLLEKLRVARRPASEATFNVFYYLLAcGDATLRTELHLN------HLAENNVFGIvplskpeEKQKAAQQFSKL 693
Cdd:cd14929    157 SADIDIYLLEKSRVIFQQPGERNYHIFYQILS-GKKELRDLLLVSanpsdfHFCSCGAVAV-------ESLDDAEELLAT 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  694 QAAMKVLAISPEEQKTCWLILASIYHLG----AAGATKEAAEAGRKqfarhEWAQKAAYLLGCSLEELSSAIFKHQLKGG 769
Cdd:cd14929    229 EQAMDILGFLPDEKYGCYKLTGAIMHFGnmkfKQKPREEQLEADGT-----ENADKAAFLMGINSSELVKGLIHPRIKVG 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  770 T--LQRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgs 847
Cdd:cd14929    304 NeyVTRSQNIEQVTYAVG--------------ALSKSIYERMFKWLVARINRVLDAKLSRQFFIGILDITGFEILDYN-- 367
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  848 argaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELafddlepvaddsvAAVDQASHLVRSLAHADEARGLLWLL 927
Cdd:cd14929    368 ----SLEQLCINFTNEKLQQFFNQHMFVLEQEEYRKEGIDW-------------VSIDFGLDLQACIDLIEKPMGIFSIL 430
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  928 EEEALVPGATEDALLDRLFSYYgpqegdkKGQSPLLRSSKPR------HFLLGHSHGTnwVEYNVAGWLNYTKqNPATQN 1001
Cdd:cd14929    431 EEECMFPKATDLTFKTKLFDNH-------FGKSVHFQKPKPDkkkfeaHFELVHYAGV--VPYNISGWLEKNK-DLLNET 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1002 APRLLQDSQKKIISNLFlgragSATVLSGSiaGLEGGSQlALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIK 1081
Cdd:cd14929    501 VVAVFQKSSNRLLASLF-----ENYISTDS--AIQFGEK-KRKKGASFQT-----VASLHKENL--------NKLMTNLK 559
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1082 RSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMV 1161
Cdd:cd14929    560 STAPHFVRCINPNVNKIPG-----------------------------VLDPYLVLQQLRCNGVLEGIRICREGFPNRLL 610
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1162 FSEFRRRFDVLAPHLTKKHGrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14929    611 YADFKQRYCILNPRTFPKSK----FVSSRKAAEELLGSLEIDHTQYRFGITKVFFK 662
MYSc_Myh19 cd15896
class II myosin heavy chain19, motor domain; Myosin motor domain of muscle myosin heavy chain ...
467-1217 2.36e-76

class II myosin heavy chain19, motor domain; Myosin motor domain of muscle myosin heavy chain 19. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276899 [Multi-domain]  Cd Length: 675  Bit Score: 269.24  E-value: 2.36e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd15896      1 ASVLHNLKERYYSGLIYTYSGLFCVVINPYKNLPIYSEEIVEMYKGKKRHEMPPHIYAITDTAYRSMMQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd15896     81 ESGAGKTENTKKVIQYLAHVASSHKTKKDQNSLALSHGELekqllqanpiLEAFGNAKTVKNDNSSRFGKFIRINFDVNG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKaaQQFSKLQAA 696
Cdd:cd15896    161 YIVGANIETYLLEKSRAIRQAKEERTFHIFYYLLTGAGDKLRSELLLENYNNYRFLSNGNVTIPGQQDK--DLFTETMEA 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQRS 774
Cdd:cd15896    239 FRIMGIPEDEQIGMLKVVASVLQLGNMSFKKE-RHTDQASMPDNTAAQKVCHLMGMNVTDFTRAILSPRIKVGRdyVQKA 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 TSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSF 853
Cdd:cd15896    318 QTQEQ--------------AEFAVEALAKATYERMFRWLVMRINKALDKTKRQGASFIgILDIAGFEIFELN------SF 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALV 933
Cdd:cd15896    378 EQLCINYTNEKLQQLFNHTMFILEQEEYQREGIEWSFIDFGLDLQPCIDLIEKPASPPGILALLDE----------ECWF 447
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  934 PGATEDALLDRLFSyygpqegdKKGQSPllRSSKPRH------FLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQ 1007
Cdd:cd15896    448 PKATDKSFVEKVLQ--------EQGTHP--KFFKPKKlkdeadFCIIHYAGK--VDYKADEWL-MKNMDPLNDNVATLLN 514
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1008 DSQKKIISNLFLGragsatvlSGSIAGLEGGSQLA-LRRATSMRKTFTTGMAAVKKKslciqiklQVDALIDTIKRSKMH 1086
Cdd:cd15896    515 QSTDKFVSELWKD--------VDRIVGLDKVSGMSeMPGAFKTRKGMFRTVGQLYKE--------QLSKLMATLRNTNPN 578
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1087 FVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFR 1166
Cdd:cd15896    579 FVRCIIPNHEKKAG-----------------------------KLDPHLVLDQLRCNGVLEGIRICRQGFPNRIVFQEFR 629
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1167 RRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd15896    630 QRYEILTPNAIPKG-----FMDGKQACVLMIKSLELDPNLYRIGQSKVFFR 675
MYSc_Myh7b cd14927
class II myosin heavy chain 7b, motor domain; Myosin motor domain of cardiac muscle, beta ...
467-1217 4.01e-72

class II myosin heavy chain 7b, motor domain; Myosin motor domain of cardiac muscle, beta myosin heavy chain 7b (also called KIAA1512, dJ756N5.1, MYH14, MHC14). MYH7B is a slow-twitch myosin. Mutations in this gene result in one form of autosomal dominant hearing impairment. Multiple transcript variants encoding different isoforms have been found for this gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276953 [Multi-domain]  Cd Length: 676  Bit Score: 256.80  E-value: 4.01e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14927      1 ASVLHNLRRRYSRWMIYTYSGLFCVTVNPYKWLPVYTAPVVAAYKGKRRSEAPPHIYAIADNAYNDMLRNRENQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIA--GTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQ 614
Cdd:cd14927     81 ESGAGKTVNTKRVIQYFAIVAalGDGPGKKAQFLATKTGGTLedqiieanpaMEAFGNAKTLRNDNSSRFGKFIRIHFGP 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  615 AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLN------HLAENNVFGIVPLSKPEEkqkaaq 688
Cdd:cd14927    161 TGKLASADIDIYLLEKSRVIFQQPGERSYHIYYQILSGKKPELQDMLLVSmnpydyHFCSQGVTTVDNMDDGEE------ 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  689 qFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATK----EAAEAGRKqfarhEWAQKAAYLLGCSLEELSSAIFKH 764
Cdd:cd14927    235 -LMATDHAMDILGFSPDEKYGCYKIVGAIMHFGNMKFKQkqreEQAEADGT-----ESADKAAYLMGVSSADLLKGLLHP 308
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  765 QLKGGtlQRSTSFRQGPEESGLGEGtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEW 844
Cdd:cd14927    309 RVKVG--NEYVTKGQSVEQVVYAVG----------ALAKATYDRMFKWLVSRINQTLDTKLPRQFFIGVLDIAGFEIFEF 376
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  845 GgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQASHLVRSLAHADEargll 924
Cdd:cd14927    377 N------SFEQLCINFTNEKLQQFFNHHMFILEQEEYKREGIEWVFIDF---GLDLQACIDLIEKPLGILSILEE----- 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  925 wlleeEALVPGATEDALLDRLFSYYgpqegdkKGQSPLLRSSKP-------RHFLLGHSHGTnwVEYNVAGWLNYTKqNP 997
Cdd:cd14927    443 -----ECMFPKASDASFKAKLYDNH-------LGKSPNFQKPRPdkkrkyeAHFEVVHYAGV--VPYNIVGWLDKNK-DP 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  998 ATQNAPRLLQDSQKKIISNLFLGRAGSATVlsgsiAGLEGGSQLALRRATSmrktFTTgMAAVKKKSLciqiklqvDALI 1077
Cdd:cd14927    508 LNETVVAIFQKSQNKLLATLYENYVGSDST-----EDPKSGVKEKRKKAAS----FQT-VSQLHKENL--------NKLM 569
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1078 DTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYP 1157
Cdd:cd14927    570 TNLRATQPHFVRCIIPNETKTPG-----------------------------VMDPFLVLHQLRCNGVLEGIRICRKGFP 620
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1158 DHMVFSEFRRRFDVLAPHLTKKHGrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14927    621 NRILYADFKQRYRILNPSAIPDDK----FVDSRKATEKLLGSLDIDHTQYQFGHTKVFFK 676
MYSc_Myh1_insects_crustaceans cd14909
class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle ...
467-1217 5.30e-71

class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle myosin heavy chain 1 (also called MYHSA1, MYHa, MyHC-2X/D, MGC133384) in insects and crustaceans. Myh1 is a type I skeletal muscle myosin that in Humans is encoded by the MYH1 gene. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276874  Cd Length: 666  Bit Score: 253.22  E-value: 5.30e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14909      1 ASVLHNLRQRYYAKLIYTYSGLFCVAINPYKRYPVYTNRCAKMYRGKRRNEVPPHIFAISDGAYVDMLTNHVNQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW----QALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 620
Cdd:cd14909     81 ESGAGKTENTKKVIAYFATVGASKKTDEAAKSKGsledQVVQTnpVLEAFGNAKTVRNDNSSRFGKFIRIHFGPTGKLAG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  621 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlaeNNVFGIVPLSKPE---EKQKAAQQFSKLQAAM 697
Cdd:cd14909    161 ADIETYLLEKARVISQQSLERSYHIFYQIMSGSVPGVKEMCLLS----DNIYDYYIVSQGKvtvPNVDDGEEFSLTDQAF 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  698 KVLAISPEEQKTCWLILASIYHLGaagaTKEAAEAGRKQFARH---EWAQKAAYLLGCSLEELSSAIFKHQLKGGTlQRS 774
Cdd:cd14909    237 DILGFTKQEKEDVYRITAAVMHMG----GMKFKQRGREEQAEQdgeEEGGRVSKLFGCDTAELYKNLLKPRIKVGN-EFV 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 TSFRQgpeesglgegtKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGGsargasFE 854
Cdd:cd14909    312 TQGRN-----------VQQVTNSIGALCKGVFDRLFKWLVKKCNETLDTQQKRQHFIGVLDIAGFEIFEYNG------FE 374
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  855 ELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQASHLVRSLAHADEargllwlleeEALVP 934
Cdd:cd14909    375 QLCINFTNEKLQQFFNHHMFVLEQEEYKREGIDWAFIDF---GMDLLACIDLIEKPMGILSILEE----------ESMFP 441
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  935 GATEDALLDRLFSyygpqegDKKGQSPLLRSSKP-------RHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQ 1007
Cdd:cd14909    442 KATDQTFSEKLTN-------THLGKSAPFQKPKPpkpgqqaAHFAIAHYAGC--VSYNITGWLEKNK-DPLNDTVVDQFK 511
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1008 DSQKKIISNLFLGRAGSatvlSGSIAGLEGGsqlalrratsmRKTFTTGMAAVKKkslciQIKLQVDALIDTIKRSKMHF 1087
Cdd:cd14909    512 KSQNKLLIEIFADHAGQ----SGGGEQAKGG-----------RGKKGGGFATVSS-----AYKEQLNSLMTTLRSTQPHF 571
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1088 VHCFLPvaegwpgeprsassrrvsssSELDLPpgdpceaGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRR 1167
Cdd:cd14909    572 VRCIIP--------------------NEMKQP-------GV--VDAHLVMHQLTCNGVLEGIRICRKGFPNRMMYPDFKM 622
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1168 RFDVLAPHLTKKHgrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14909    623 RYKILNPAGIQGE------EDPKKAAEIILESIALDPDQYRLGHTKVFFR 666
MYSc_Myh14_mammals cd14930
class II myosin heavy chain 14 motor domain; Myosin motor domain of non-muscle myosin heavy ...
467-1217 5.77e-71

class II myosin heavy chain 14 motor domain; Myosin motor domain of non-muscle myosin heavy chain 14 (also called FLJ13881, KIAA2034, MHC16, MYH17). Its members include mammals, chickens, and turtles. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Some of the data used for this classification were produced by the CyMoBase team at the Max-Planck-Institute for Biophysical Chemistry. The sequence names are composed of the species abbreviation followed by the protein abbreviation and optional protein classifier and variant designations.


Pssm-ID: 276893 [Multi-domain]  Cd Length: 670  Bit Score: 253.09  E-value: 5.77e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14930      1 ASVLHNLRERYYSGLIYTYSGLFCVVINPYKQLPIYTEAIVEMYRGKKRHEVPPHVYAVTEGAYRSMLQDREDQSILCTG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGT-SGTKVFSV-----EKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 620
Cdd:cd14930     81 ESGAGKTENTKKVIQYLAHVASSpKGRKEPGVpgeleRQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVAGYIVG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  621 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQkaaQQFSKLQAAMKVL 700
Cdd:cd14930    161 ANIETYLLEKSRAIRQAKDECSFHIFYQLLGGAGEQLKADLLLEPCSHYRFLTNGPSSSPGQER---ELFQETLESLRVL 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  701 AISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrsTSFRQG 780
Cdd:cd14930    238 GFSHEEITSMLRMVSAVLQFGNIVLKRE-RNTDQATMPDNTAAQKLCRLLGLGVTDFSRALLTPRIKVG-----RDYVQK 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  781 PEesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQNPEWGgsargaSFEELCHN 859
Cdd:cd14930    312 AQ-------TKEQADFALEALAKATYERLFRWLVLRLNRALDRSPRQGASFLgILDIAGFEIFQLN------SFEQLCIN 378
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  860 YAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEPVADDSVAAVDQASHLVRSLAHADEargllwlleeEALVPGATED 939
Cdd:cd14930    379 YTNEKLQQLFNHTMFVLEQEEYQREGIPWTFLDFGLDLQPCIDLIERPANPPGLLALLDE----------ECWFPKATDK 448
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  940 ALLDRLfsyygpqeGDKKGQSPllRSSKPRH------FLLGHSHGTnwVEYNVAGWLnYTKQNPATQNAPRLLQDSQKKI 1013
Cdd:cd14930    449 SFVEKV--------AQEQGGHP--KFQRPRHlrdqadFSVLHYAGK--VDYKANEWL-MKNMDPLNDNVAALLHQSTDRL 515
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1014 ISNLFLGRAGsatvlsgsIAGLEGGSQL--ALRRATSMRKTFTTgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCF 1091
Cdd:cd14930    516 TAEIWKDVEG--------IVGLEQVSSLgdGPPGGRPRRGMFRT-VGQLYKESL--------SRLMATLSNTNPSFVRCI 578
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1092 LPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDV 1171
Cdd:cd14930    579 VPNHEKRAG-----------------------------KLEPRLVLDQLRCNGVLEGIRICRQGFPNRILFQEFRQRYEI 629
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1172 LAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14930    630 LTPNAIPKG-----FMDGKQACEKMIQALELDPNLYRVGQSKIFFR 670
MYSc_Myo8 cd01383
class VIII myosin, motor domain; These plant-specific type VIII myosins has been associated ...
468-1174 2.68e-69

class VIII myosin, motor domain; These plant-specific type VIII myosins has been associated with endocytosis, cytokinesis, cell-to-cell coupling and gating at plasmodesmata. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. It also contains IQ domains Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276834  Cd Length: 647  Bit Score: 247.61  E-value: 2.68e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd01383      2 SVLHNLEYRYSQDIIYTKAGPVLIAVNPFKDVPLYGNEFITAYR--QKLLDSPHVYAVADTAYREMMRDEINQSIIISGE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAGTSGtkvfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd01383     80 SGAGKTETAKIAMQYLAALGGGSS----GIEN-EILQTnpILEAFGNAKTLRNDNSSRFGKLIDIHFDAAGKICGAKIQT 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKpeekqkaAQQFSKLQAAMKV 699
Cdd:cd01383    155 YLLEKSRVVQLANGERSYHIFYQLCAGASPALREKLNLksaseyKYLNQSNCLTIDGVDD-------AKKFHELKEALDT 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  700 LAISPEEQKTCWLILASIYHLG---------AAGATKEAAEAgrkqfarhewAQKAAYLLGCSLEELSSAIFKHQLKGGT 770
Cdd:cd01383    228 VGISKEDQEHIFQMLAAVLWLGnisfqvidnENHVEVVADEA----------VSTAASLLGCNANDLMLALSTRKIQAGG 297
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  771 LQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQH-SLCSMMIVDTPGFQnpewggSAR 849
Cdd:cd01383    298 DKIVKKL------------TLQQAIDARDALAKAIYASLFDWLVEQINKSLEVGKRrTGRSISILDIYGFE------SFQ 359
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  850 GASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLEpvadDSVAAVDqashLVRS-----LAHADEargll 924
Cdd:cd01383    360 KNSFEQLCINYANERLQQHFNRHLFKLEQEEYELDGIDWTKVDFE----DNQECLD----LIEKkplglISLLDE----- 426
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  925 wlleeEALVPGATEDALLDRLFSYYGPQEGDKKGQSpllrsskpRHFLLGHSHGTnwVEYNVAGWLNytkqnpatQNAPR 1004
Cdd:cd01383    427 -----ESNFPKATDLTFANKLKQHLKSNSCFKGERG--------GAFTIRHYAGE--VTYDTSGFLE--------KNRDL 483
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1005 LLQDsqkkiISNLFLGRAGSATVLSGSIAGLEGGSQLALRRATSMRKTfttgmaavkKKSLCIQIKLQVDALIDTIKRSK 1084
Cdd:cd01383    484 LHSD-----LIQLLSSCSCQLPQLFASKMLDASRKALPLTKASGSDSQ---------KQSVATKFKGQLFKLMQRLENTT 549
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1085 MHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSE 1164
Cdd:cd01383    550 PHFIRCIKPNNKQLPGV-----------------------------FDQDLVLQQLRCCGVLEVVRISRSGYPTRMTHQE 600
                          730
                   ....*....|
gi 1907082211 1165 FRRRFDVLAP 1174
Cdd:cd01383    601 FARRYGFLLP 610
MYSc_Myh3 cd14913
class II myosin heavy chain 3, motor domain; Myosin motor domain of fetal skeletal muscle ...
468-1217 1.39e-68

class II myosin heavy chain 3, motor domain; Myosin motor domain of fetal skeletal muscle myosin heavy chain 3 (MYHC-EMB, MYHSE1, HEMHC, SMHCE) in tetrapods including mammals, lizards, and frogs. This gene is a member of the MYH family and encodes a protein with an IQ domain and a myosin head-like domain. Mutations in this gene have been associated with two congenital contracture (arthrogryposis) syndromes, Freeman-Sheldon syndrome and Sheldon-Hall syndrome. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276878 [Multi-domain]  Cd Length: 668  Bit Score: 246.12  E-value: 1.39e-68
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14913      2 AVLYNLKDRYTSWMIYTYSGLFCVTVNPYKWLPVYNPEVVEGYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAGT------SGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 619
Cdd:cd14913     82 SGAGKTVNTKRVIQYFATIAATgdlakkKDSKMKGTLEDQIISAnpLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLA 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  620 SASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA---A 696
Cdd:cd14913    162 SADIETYLLEKSRVTFQLKAERSYHIFYQILS----NKKPELIELLLITTNPYDYPFISQGEILVASIDDAEELLAtdsA 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISPEEQKTCWLILASIYHLGaagaTKEAAEAGRKQFAR---HEWAQKAAYLLGCSLEELSSAIFKHQLKGGTlqr 773
Cdd:cd14913    238 IDILGFTPEEKSGLYKLTGAVMHYG----NMKFKQKQREEQAEpdgTEVADKTAYLMGLNSSDLLKALCFPRVKVGN--- 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  774 stsfrqgpEESGLGEgTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSF 853
Cdd:cd14913    311 --------EYVTKGQ-TVDQVHHAVNALSKSVYEKLFLWMVTRINQQLDTKLPRQHFIGVLDIAGFEIFEYN------SL 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEALV 933
Cdd:cd14913    376 EQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPMGIFSILEEECMF 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  934 PGATEDALLDRLFSYYGPQEGDKkgQSPLLRSSKPR-HFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKK 1012
Cdd:cd14913    443 PKATDTSFKNKLYDQHLGKSNNF--QKPKVVKGRAEaHFSLIHYAGT--VDYSVSGWLEKNK-DPLNETVVGLYQKSSNR 517
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1013 IISNLFLGRAGS-ATVLSGSIAGLEGGSqlalrratsmrktFTTgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCF 1091
Cdd:cd14913    518 LLAHLYATFATAdADSGKKKVAKKKGSS-------------FQT-VSALFRENL--------NKLMSNLRTTHPHFVRCI 575
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1092 LPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDV 1171
Cdd:cd14913    576 IPNETKTPGA-----------------------------MEHSLVLHQLRCNGVLEGIRICRKGFPNRILYGDFKQRYRV 626
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1172 LAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14913    627 LNASAIPE-GQ---FIDSKKACEKLLASIDIDHTQYKFGHTKVFFK 668
MYSc_Myo22 cd14883
class XXII myosin, motor domain; These myosins possess an extended neck with multiple IQ ...
467-1217 4.07e-66

class XXII myosin, motor domain; These myosins possess an extended neck with multiple IQ motifs such as found in class V, VIII, XI, and XIII myosins. These myosins are defined by two tandem MyTH4 and FERM domains. The apicomplexan, but not diatom myosins contain 4-6 WD40 repeats near the end of the C-terminal tail which suggests a possible function of these myosins in signal transduction and transcriptional regulation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276849 [Multi-domain]  Cd Length: 661  Bit Score: 238.76  E-value: 4.07e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGpSLLVlstrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 538
Cdd:cd14883      1 EGINTNLKVRYKKDLIYTYTG-SILV-------AVnpykelpiYTQDIVKQYFGKRMGALPPHIFALAEAAYTNMQEDGK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  539 DQSIVLLGSSGSGKTTSFQHLVQYLAtiAGTSGTkvFSVEKwQAL--STLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd14883     73 NQSVIISGESGAGKTETTKLILQYLC--AVTNNH--SWVEQ-QILeaNTILEAFGNAKTVRNDNSSRFGKFIEVCFDASG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACG--DATLRTELHL---NHLAENNVFGIVPLSKPEEKQKaaqqFS 691
Cdd:cd14883    148 HIKGAIIQDYLLEQSRITFQAPGERNYHVFYQLLAGAkhSKELKEKLKLgepEDYHYLNQSGCIRIDNINDKKD----FD 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  692 KLQAAMKVLAISPEEQKTCWLILASIYHLGaaGATKEAAEAGRKQFAR--HEWAQKAAYLLGCSLEELSSA-IFKHQLKG 768
Cdd:cd14883    224 HLRLAMNVLGIPEEMQEGIFSVLSAILHLG--NLTFEDIDGETGALTVedKEILKIVAKLLGVDPDKLKKAlTIRQINVR 301
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  769 GTLqrsTSFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsa 848
Cdd:cd14883    302 GNV---TEIPLKVQE----------ARDNRDAMAKALYSRTFAWLVNHINSCTNPGQKNSRFIGVLDIFGFENFKVN--- 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  849 rgaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDLEPVADdsvaAVDQASHLVRSLAhADEARgllw 925
Cdd:cd14883    366 ---SFEQLCINYTNEKLHKFFNHYVFKLEQEEYEKEGINWShivFTDNQECLD----LIEKPPLGILKLL-DEECR---- 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  926 lleeealVPGATEDALLDRLFSYYGPQEGDKKgqsPLLRSSKPRhFLLGHSHGTnwVEYNVAGWLNytkQNPATQ--NAP 1003
Cdd:cd14883    434 -------FPKGTDLTYLEKLHAAHEKHPYYEK---PDRRRWKTE-FGVKHYAGE--VTYTVQGFLD---KNKDTQqdDLF 497
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1004 RLLQDSQKKIISNLFLGRAgsatvlsgsiagleggsQLALRRATSMRKTFTTGMAAVKKK-SLCIQIKLQVDALIDTIKR 1082
Cdd:cd14883    498 DLMSRSKNKFVKELFTYPD-----------------LLALTGLSISLGGDTTSRGTSKGKpTVGDTFKHQLQSLVDVLSA 560
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1083 SKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVF 1162
Cdd:cd14883    561 TQPWYVRCIKPNSLKEPN-----------------------------VFDDELVLAQLRYAGMLEIIRIRKEGFPIHLTF 611
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1163 SEFRRRFDVLAPHLTKKHGRnyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14883    612 KEFVDRYLCLDPRARSADHK-----ETCGAVRALMGLGGLPEDEWQVGKTKVFLR 661
MYSc_Myh7 cd14917
class II myosin heavy chain 7, motor domain; Myosin motor domain of beta (or slow) type I ...
468-1217 5.00e-66

class II myosin heavy chain 7, motor domain; Myosin motor domain of beta (or slow) type I cardiac muscle myosin heavy chain 7 (also called CMH1, MPD1, and CMD1S). Muscle myosin is a hexameric protein containing 2 heavy chain subunits, 2 alkali light chain subunits, and 2 regulatory light chain subunits. It is expressed predominantly in normal human ventrical and in skeletal muscle tissues rich in slow-twitch type I muscle fibers. Changes in the relative abundance of this protein and the alpha (or fast) heavy subunit of cardiac myosin correlate with the contractile velocity of cardiac muscle. Its expression is also altered during thyroid hormone depletion and hemodynamic overloading. Mutations in this gene are associated with familial hypertrophic cardiomyopathy, myosin storage myopathy, dilated cardiomyopathy, and Laing early-onset distal myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276881 [Multi-domain]  Cd Length: 668  Bit Score: 238.46  E-value: 5.00e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14917      2 AVLYNLKERYASWMIYTYSGLFCVTVNPYKWLPVYNAEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAG---------TSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQV 618
Cdd:cd14917     82 SGAGKTVNTKRVIQYFAVIAAigdrskkdqTPGKGTLEDQIIQA-NPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKL 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  619 ASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA--- 695
Cdd:cd14917    161 ASADIETYLLEKSRVIFQLKAERDYHIFYQILSNKKPELLDML----LITNNPYDYAFISQGETTVASIDDAEELMAtdn 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEwAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQR 773
Cdd:cd14917    237 AFDVLGFTSEEKNSMYKLTGAIMHFGNMKFKQKQREEQAEPDGTEE-ADKSAYLMGLNSADLLKGLCHPRVKVGNeyVTK 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  774 STSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSF 853
Cdd:cd14917    316 GQNVQQVIYATG--------------ALAKAVYEKMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFN------SF 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQAshlvrslahaDEARGLLWLLEEEALV 933
Cdd:cd14917    376 EQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTFIDF---GMDLQACIDLI----------EKPMGIMSILEEECMF 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  934 PGATEDALLDRLFSYYGPQEGDKkgQSPLLRSSKPR-HFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKK 1012
Cdd:cd14917    443 PKATDMTFKAKLFDNHLGKSNNF--QKPRNIKGKPEaHFSLIHYAGT--VDYNIIGWLQKNK-DPLNETVVGLYQKSSLK 517
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1013 IISNLFLGRAGSATVLSGSIAGLEGGSqlalrratsmrkTFTTgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCFL 1092
Cdd:cd14917    518 LLSNLFANYAGADAPIEKGKGKAKKGS------------SFQT-VSALHRENL--------NKLMTNLRSTHPHFVRCII 576
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1093 PVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1172
Cdd:cd14917    577 PNETKSPG-----------------------------VMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRIL 627
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1173 APHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14917    628 NPAAIPE-GQ---FIDSRKGAEKLLSSLDIDHNQYKFGHTKVFFK 668
MYSc_Myh16 cd14934
class II myosin heavy chain 16, motor domain; Myosin motor domain of myosin heavy chain 16 ...
467-1217 5.15e-66

class II myosin heavy chain 16, motor domain; Myosin motor domain of myosin heavy chain 16 pseudogene (also called MHC20, MYH16, and myh5), encoding a sarcomeric myosin heavy chain expressed in nonhuman primate masticatory muscles, is inactivated in humans. This cd contains Myh16 in mammals. MYH16 has intermediate fibres between that of slow type 1 and fast 2B fibres, but exert more force than any other fibre type examined. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Some of the data used for this classification were produced by the CyMoBase team at the Max-Planck-Institute for Biophysical Chemistry. The sequence names are composed of the species abbreviation followed by the protein abbreviation and optional protein classifier and variant designations.


Pssm-ID: 276896 [Multi-domain]  Cd Length: 659  Bit Score: 238.39  E-value: 5.15e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14934      1 ASVLDNLRQRYTNMRIYTYSGLFCVTVNPYKWLPIYGARVANMYKGKKRTEMPPHLFSISDNAYHDMLMDRENQSMLITG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTS-----GTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 621
Cdd:cd14934     81 ESGAGKTENTKKVIQYFANIGGTGkqssdGKGSLEDQIIQA-NPVLEAFGNAKTTRNNNSSRFGKFIRIHFGTTGKLAGA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  622 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKPEEKQKAaqqfsklQA 695
Cdd:cd14934    160 DIESYLLEKSRVISQQAAERGYHIFYQILSNKKPELIESLLLvpnpkeYHWVSQGVTVVDNMDDGEELQIT-------DV 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQR 773
Cdd:cd14934    233 AFDVLGFSAEEKIGVYKLTGGIMHFGNMKFKQKPREE-QAEVDTTEVADKVAHLMGLNSGELQKGITRPRVKVGNefVQK 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  774 STSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSF 853
Cdd:cd14934    312 GQNMEQCNNSIG--------------ALGKAVYDKMFKWLVVRINKTLDTKMQRQFFIGVLDIAGFEIFEFN------SF 371
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQashlvrslahADEARGLLWLLEEEALV 933
Cdd:cd14934    372 EQLCINFTNEKLQQFFNHHMFVLEQEEYKREGIEWVFIDF---GLDLQACIDL----------LEKPMGIFSILEEQCVF 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  934 PGATE--------DALLDRLFSYYGPQEGDKKGQSPllrsskprHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRL 1005
Cdd:cd14934    439 PKATDatfkaalyDNHLGKSSNFLKPKGGKGKGPEA--------HFELVHYAGT--VGYNITGWLEKNK-DPLNETVVGL 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1006 LQDSQkKIISNLFLGRAGSAtvlsgsiagleGGSQLALRRATSMrktfttgmaavkkkSLCIQIKLQVDALIDTIKRSKM 1085
Cdd:cd14934    508 FQKSS-LGLLALLFKEEEAP-----------AGSKKQKRGSSFM--------------TVSNFYREQLNKLMTTLHSTAP 561
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1086 HFVHCFLpvaegwpgeprsassrrvsssseldlpPGDPCEAGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEF 1165
Cdd:cd14934    562 HFVRCIV---------------------------PNEFKQSGV--VDAHLIMHQLACNGVLEGIRICRKGFPNRLQYPEF 612
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1166 RRRFDVLAPHLTKKHgrnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14934    613 KQRYQVLNPNVIPQG-----FVDNKKASELLLGSIDLDVNEYKIGHTKVFFR 659
MYSc_Myo11 cd01384
class XI myosin, motor domain; These plant-specific type XI myosin are involved in organelle ...
469-1179 5.49e-65

class XI myosin, motor domain; These plant-specific type XI myosin are involved in organelle transport. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.


Pssm-ID: 276835  Cd Length: 647  Bit Score: 234.88  E-value: 5.49e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  469 VLHTLRQRYGASLLHTYAGPSLL-VLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd01384      3 VLHNLKVRYELDEIYTYTGNILIaVNPFKRLPHLYDAHMMEQYKGAPLGELSPHVFAVADAAYRAMINEGKSQSILVSGE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd01384     83 SGAGKTETTKMLMQYLAYMGGRAVTEGRSVEQ-QVLESnpLLEAFGNAKTVRNNNSSRFGKFVEIQFDDAGRISGAAIRT 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKPEEkqkaaqqFSKLQAAMKV 699
Cdd:cd01384    162 YLLERSRVVQVSDPERNYHCFYQLCAGAPPEDREKYKLkdpkqfHYLNQSKCFELDGVDDAEE-------YRATRRAMDV 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  700 LAISPEEQKTCWLILASIYHLG--AAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQlkggtlqrstsf 777
Cdd:cd01384    235 VGISEEEQDAIFRVVAAILHLGniEFSKGEEDDSSVPKDEKSEFHLKAAAELLMCDEKALEDALCKRV------------ 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  778 RQGPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQnpewggSARGASFEELC 857
Cdd:cd01384    303 IVTPDGIITKPLDPDAATLSRDALAKTIYSRLFDWLVDKINRSIGQDPNSKRLIGVLDIYGFE------SFKTNSFEQFC 376
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  858 HNYAQDRLQRLFHERTFLQELERYKEDNIELAF----------DDLEPVADDSVAAVDQASHLVRSlahadeargllwll 927
Cdd:cd01384    377 INLANEKLQQHFNQHVFKMEQEEYTKEEIDWSYiefvdnqdvlDLIEKKPGGIIALLDEACMFPRS-------------- 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  928 eeealvpgaTEDALLDRLFSYYgpqeGDKKgqspllRSSKPR----HFLLGHSHGTnwVEYNVAGWLNYTKQN--PATQN 1001
Cdd:cd01384    443 ---------THETFAQKLYQTL----KDHK------RFSKPKlsrtDFTIDHYAGD--VTYQTDLFLDKNKDYvvAEHQA 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1002 aprLLQDSQKKIISNLFlgragsatvlsgsiagleggSQLALRRATSMRKtFTtgmaavkkkSLCIQIKLQVDALIDTIK 1081
Cdd:cd01384    502 ---LLNASKCPFVAGLF--------------------PPLPREGTSSSSK-FS---------SIGSRFKQQLQELMETLN 548
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1082 RSKMHFVHCFLPVAEgwpgeprsassrrvsssseldLPPGDPCEAGLLQldvsllraQLRGSRLLDAMRMYRQGYPDHMV 1161
Cdd:cd01384    549 TTEPHYIRCIKPNNL---------------------LKPGIFENANVLQ--------QLRCGGVLEAVRISCAGYPTRKP 599
                          730
                   ....*....|....*...
gi 1907082211 1162 FSEFRRRFDVLAPHLTKK 1179
Cdd:cd01384    600 FEEFLDRFGLLAPEVLKG 617
MYSc_Myh13 cd14923
class II myosin heavy chain 13, motor domain; Myosin motor domain of skeletal muscle myosin ...
468-1217 5.22e-64

class II myosin heavy chain 13, motor domain; Myosin motor domain of skeletal muscle myosin heavy chain 13 (also called MyHC-eo) in mammals, chicken, and green anole. Myh13 is a myosin whose expression is restricted primarily to the extrinsic eye muscles which are specialized for function in eye movement. Class II myosins, also called conventional myosins, are the myosin type responsible for producing muscle contraction in muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276887 [Multi-domain]  Cd Length: 671  Bit Score: 232.65  E-value: 5.22e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14923      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRDNQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAgTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 617
Cdd:cd14923     82 SGAGKTVNTKRVIQYFATIA-VTGDKKKEQQPGKMQGTLedqiiqanplLEAFGNAKTVRNDNSSRFGKFIRIHFGATGK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  618 VASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 695
Cdd:cd14923    161 LASADIETYLLEKSRVTFQLSSERSYHIFYQIMSNKKPELIDLL----LISTNPFDFPFVSQGEVTVASIDDSEELLAtd 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELssaifkhqLKGGTLQRs 774
Cdd:cd14923    237 nAIDILGFSSEEKVGIYKLTGAVMHYGNMKFKQKQREE-QAEPDGTEVADKAGYLMGLNSAEM--------LKGLCCPR- 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 tsFRQGPEESGLGEGTKlSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFE 854
Cdd:cd14923    307 --VKVGNEYVTKGQNVQ-QVTNSVGALAKAVYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN------SLE 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  855 ELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEALVP 934
Cdd:cd14923    378 QLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIFSILEEECMFP 444
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  935 GATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKII 1014
Cdd:cd14923    445 KATDTSFKNKLYDQHLGKSNNFQKPKPA-KGKAEAHFSLVHYAGT--VDYNIAGWLDKNK-DPLNETVVGLYQKSSLKLL 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1015 SNLFLGRAGSAtvlsgsiAGLEGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCFLPV 1094
Cdd:cd14923    521 SFLFSNYAGAE-------AGDSGGSKKGGKKKGSSFQT----VSAVFRENL--------NKLMTNLRSTHPHFVRCLIPN 581
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1095 AEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd14923    582 ETKTPG-----------------------------VMDHYLVMHQLRCNGVLEGIRICRKGFPSRILYADFKQRYRILNA 632
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1907082211 1175 HLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14923    633 SAIPE-GQ---FIDSKNASEKLLNSIDVDREQYRFGHTKVFFK 671
MYSc_Myh1_mammals cd14910
class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle ...
468-1217 6.21e-64

class II myosin heavy chain 1, motor domain; Myosin motor domain of type IIx skeletal muscle myosin heavy chain 1 (also called MYHSA1, MYHa, MyHC-2X/D, MGC133384) in mammals. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276875 [Multi-domain]  Cd Length: 671  Bit Score: 232.31  E-value: 6.21e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14910      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNAEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 617
Cdd:cd14910     82 SGAGKTVNTKRVIQYFATIAVTGEKKKEEATSGKMQGTLedqiisanplLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGK 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  618 VASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 695
Cdd:cd14910    162 LASADIETYLLEKSRVTFQLKAERSYHIFYQIMSNKKPDLIEML----LITTNPYDYAFVSQGEITVPSIDDQEELMAtd 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQ 772
Cdd:cd14910    238 sAIEILGFTSDERVSIYKLTGAVMHYGNMKFKQKQREE-QAEPDGTEVADKAAYLQNLNSADLLKALCYPRVKVGNeyVT 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  773 RSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaS 852
Cdd:cd14910    317 KGQTVQQVYNAVG--------------ALAKAVYDKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN------S 376
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  853 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEAL 932
Cdd:cd14910    377 LEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIFSILEEECM 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  933 VPGATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKK 1012
Cdd:cd14910    444 FPKATDTSFKNKLYEQHLGKSNNFQKPKPA-KGKVEAHFSLIHYAGT--VDYNIAGWLDKNK-DPLNETVVGLYQKSSMK 519
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1013 IISNLFLGrAGSATVLSGsiagleGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCFL 1092
Cdd:cd14910    520 TLALLFSG-AAAAEAEEG------GGKKGGKKKGSSFQT-----VSALFRENL--------NKLMTNLRSTHPHFVRCII 579
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1093 PVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1172
Cdd:cd14910    580 PNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQRYKVL 630
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1173 APHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14910    631 NASAIPE-GQ---FIDSKKASEKLLGSIDIDHTQYKFGHTKVFFK 671
MYSc_Myh4 cd14915
class II myosin heavy chain 4, motor domain; Myosin motor domain of skeletal muscle myosin ...
468-1217 2.29e-63

class II myosin heavy chain 4, motor domain; Myosin motor domain of skeletal muscle myosin heavy chain 4 (also called MYH2B, MyHC-2B, MyHC-IIb). Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276879 [Multi-domain]  Cd Length: 671  Bit Score: 230.77  E-value: 2.29e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14915      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIA---------GTSGTKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd14915     82 SGAGKTVNTKRVIQYFATIAvtgekkkeeAASGKMQGTLED-QIISAnpLLEAFGNAKTVRNDNSSRFGKFIRIHFGATG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA- 695
Cdd:cd14915    161 KLASADIETYLLEKSRVTFQLKAERSYHIFYQIMS----NKKPELIEMLLITTNPYDFAFVSQGEITVPSIDDQEELMAt 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 --AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--L 771
Cdd:cd14915    237 dsAVDILGFSADEKVAIYKLTGAVMHYGNMKFKQKQREE-QAEPDGTEVADKAAYLTSLNSADLLKALCYPRVKVGNeyV 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  772 QRSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarga 851
Cdd:cd14915    316 TKGQTVQQVYNSVG--------------ALAKAIYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN------ 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  852 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEA 931
Cdd:cd14915    376 SLEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEF-------------IDFGMDLAACIELIEKPMGIFSILEEEC 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  932 LVPGATEDALLDRLFSYYGPQEGDKKGQSPLlRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQK 1011
Cdd:cd14915    443 MFPKATDTSFKNKLYEQHLGKSNNFQKPKPA-KGKAEAHFSLVHYAGT--VDYNIAGWLDKNK-DPLNETVVGLYQKSGM 518
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1012 KIISNLFLGraGSATVLSGSiagleGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCF 1091
Cdd:cd14915    519 KTLAFLFSG--GQTAEAEGG-----GGKKGGKKKGSSFQT-----VSALFRENL--------NKLMTNLRSTHPHFVRCL 578
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1092 LPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDV 1171
Cdd:cd14915    579 IPNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQRYKV 629
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1172 LAPHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14915    630 LNASAIPE-GQ---FIDSKKASEKLLGSIDIDHTQYKFGHTKVFFK 671
MYSc_Myh8 cd14918
class II myosin heavy chain 8, motor domain; Myosin motor domain of perinatal skeletal muscle ...
469-1217 3.88e-63

class II myosin heavy chain 8, motor domain; Myosin motor domain of perinatal skeletal muscle myosin heavy chain 8 (also called MyHC-peri, MyHC-pn). Myosin is a hexameric protein composed of a pair of myosin heavy chains (MYH) and two pairs of nonidentical light chains. A mutation in this gene results in trismus-pseudocamptodactyly syndrome. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276882 [Multi-domain]  Cd Length: 668  Bit Score: 230.01  E-value: 3.88e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  469 VLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSS 548
Cdd:cd14918      3 VLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVAAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGES 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  549 GSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQ--------ALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 620
Cdd:cd14918     83 GAGKTVNTKRVIQYFATIAVTGEKKKEESGKMQgtledqiiSANPLLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGKLAS 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  621 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnhLAENNVFGIVPLSKPEEKQKAAQQFSKLQA---AM 697
Cdd:cd14918    163 ADIETYLLEKSRVTFQLKAERSYHIFYQITSNKKPDLIEML----LITTNPYDYAFVSQGEITVPSIDDQEELMAtdsAI 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  698 KVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTlqrstsf 777
Cdd:cd14918    239 DILGFTPEEKVSIYKLTGAVMHYGNMKFKQKQREE-QAEPDGTEVADKAAYLQSLNSADLLKALCYPRVKVGN------- 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  778 rqgpEESGLGEgTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELC 857
Cdd:cd14918    311 ----EYVTKGQ-TVQQVYNAVGALAKAVYEKMFLWMVTRINQQLDTKQPRQYFIGVLDIAGFEIFDFN------SLEQLC 379
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  858 HNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEALVPGAT 937
Cdd:cd14918    380 INFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPLGIFSILEEECMFPKAT 446
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  938 EDALLDRLFSYYGPQEGDKKgQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNL 1017
Cdd:cd14918    447 DTSFKNKLYDQHLGKSANFQ-KPKVVKGKAEAHFSLIHYAGT--VDYNITGWLDKNK-DPLNDTVVGLYQKSAMKTLASL 522
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1018 FLGRAGSATVLSGSIAGLEGGSqlalrratsmrkTFTTGMAAVKKkslciqiklQVDALIDTIKRSKMHFVHCFLPVAEG 1097
Cdd:cd14918    523 FSTYASAEADSGAKKGAKKKGS------------SFQTVSALFRE---------NLNKLMTNLRSTHPHFVRCIIPNETK 581
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1098 WPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLT 1177
Cdd:cd14918    582 TPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYGDFKQRYKVLNASAI 632
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1907082211 1178 KKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14918    633 PE-GQ---FIDSKKASEKLLASIDIDHTQYKFGHTKVFFK 668
MYSc_Myo29 cd14890
class XXIX myosin, motor domain; Class XXIX myosins are comprised of Stramenopiles and have ...
467-1174 2.12e-62

class XXIX myosin, motor domain; Class XXIX myosins are comprised of Stramenopiles and have very long tail domains consisting of three IQ motifs, short coiled-coil regions, up to 18 CBS domains, a PB1 domain, and a carboxy-terminal transmembrane domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276855 [Multi-domain]  Cd Length: 662  Bit Score: 227.74  E-value: 2.12e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMS----RQDQS 541
Cdd:cd14890      1 ASLLHTLRLRYERDEIYTYVGPILISINPyKSIPDLYSEERMLLYHGTTAGELPPHVFAIADHAYTQLIQSgvldPSNQS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  542 IVLLGSSGSGKTTSFQHLVQYLATIagTSGTKVFSVEKWQALST------------------LLEAFGNSPTIMNGSATR 603
Cdd:cd14890     81 IIISGESGAGKTEATKIIMQYLARI--TSGFAQGASGEGEAASEaieqtlgsledrvlssnpLLESFGNAKTLRNDNSSR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  604 FSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLnhlaENNVFGIVPLSK--PE 681
Cdd:cd14890    159 FGKFIEIQFDHHGKIVGAEISNFLLEKTRIVTQNDGERNYHIFYQLLAGADEALRERLKL----QTPVEYFYLRGEcsSI 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  682 EKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAI 761
Cdd:cd14890    235 PSCDDAKAFAETIRCLSTIGISEENQDAVFGLLAAVLHLGNVDFESENDTTVLEDATTLQSLKLAAELLGVNEDALEKAL 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  762 FKHQL--KGGTLQRstsfrqgPEESGLGEgTKLSALeclegmASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGF 839
Cdd:cd14890    315 LTRQLfvGGKTIVQ-------PQNVEQAR-DKRDAL------AKALYSSLFLWLVSELNRTISSPDDKWGFIGVLDIYGF 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  840 QNPEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADDSVAAVDQASHLVRSLah 916
Cdd:cd14890    381 EKFEWNT------FEQLCINYANEKLQRHFNQHMFEVEQVEYSNEGIDwqyITFNDNQACLELIEGKVNGKPGIFITL-- 452
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  917 aDEARGLLWLLEEEALVpgatedALLDRLF--SYYGPQEGDKKGQSPLLRSSK---PRHFLLGHSHGTnwVEYNVAGwln 991
Cdd:cd14890    453 -DDCWRFKGEEANKKFV------SQLHASFgrKSGSGGTRRGSSQHPHFVHPKfdaDKQFGIKHYAGD--VIYDASG--- 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  992 YTKQNPATqnaprlLQDSQKKIIsnlflgragsatvlsgsiagleggsqlalrrATSMRktfttgmaAVKKKSLCIQIKL 1071
Cdd:cd14890    521 FNEKNNET------LNAEMKELI-------------------------------KQSRR--------SIREVSVGAQFRT 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1072 QVDALIDTIKRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagLLQLDVSLlraQLRGSRLLDAMRM 1151
Cdd:cd14890    556 QLQELMAKISLTNPRYVRCIKPNETKAPGK--------------------------FDGLDCLR---QLKYSGMMEAIQI 606
                          730       740
                   ....*....|....*....|....*.
gi 1907082211 1152 YRQGYP---DHMVFseFrRRFDVLAP 1174
Cdd:cd14890    607 RQQGFAlreEHDSF--F-YDFQVLLP 629
MYSc_Myh2_mammals cd14912
class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle ...
468-1217 2.89e-62

class II myosin heavy chain 2, motor domain; Myosin motor domain of type IIa skeletal muscle myosin heavy chain 2 (also called MYH2A, MYHSA2, MyHC-IIa, MYHas8, MyHC-2A) in mammals. Mutations in this gene results in inclusion body myopathy-3 and familial congenital myopathy. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276877 [Multi-domain]  Cd Length: 673  Bit Score: 227.69  E-value: 2.89e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14912      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNPEVVTAYRGKKRQEAPPHIFSISDNAYQFMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL----------LEAFGNSPTIMNGSATRFSQILSLDFDQAGQ 617
Cdd:cd14912     82 SGAGKTVNTKRVIQYFATIAVTGEKKKEEITSGKMQGTLedqiisanplLEAFGNAKTVRNDNSSRFGKFIRIHFGTTGK 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  618 VASASIQTMLLEKLRVARRPASEATFNVFYYLlacgDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA-- 695
Cdd:cd14912    162 LASADIETYLLEKSRVTFQLKAERSYHIFYQI----TSNKKPELIEMLLITTNPYDYPFVSQGEISVASIDDQEELMAtd 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 -AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQ 772
Cdd:cd14912    238 sAIDILGFTNEEKVSIYKLTGAVMHYGNLKFKQKQREE-QAEPDGTEVADKAAYLQSLNSADLLKALCYPRVKVGNeyVT 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  773 RSTSFRQGPEESGlgegtklsalecleGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaS 852
Cdd:cd14912    317 KGQTVEQVTNAVG--------------ALAKAVYEKMFLWMVARINQQLDTKQPRQYFIGVLDIAGFEIFDFN------S 376
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  853 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFddlepvaddsvaaVDQASHLVRSLAHADEARGLLWLLEEEAL 932
Cdd:cd14912    377 LEQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWTF-------------IDFGMDLAACIELIEKPMGIFSILEEECM 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  933 VPGATEDALLDRLFSYYGPQEGDKKgQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKK 1012
Cdd:cd14912    444 FPKATDTSFKNKLYEQHLGKSANFQ-KPKVVKGKAEAHFSLIHYAGV--VDYNITGWLDKNK-DPLNETVVGLYQKSAMK 519
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1013 IISNLFLGragsATVLSGSIAGleGGSQLALRRATSMRKTfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCFL 1092
Cdd:cd14912    520 TLAYLFSG----AQTAEGASAG--GGAKKGGKKKGSSFQT----VSALFRENL--------NKLMTNLRSTHPHFVRCII 581
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1093 PVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1172
Cdd:cd14912    582 PNETKTPG-----------------------------AMEHELVLHQLRCNGVLEGIRICRKGFPSRILYADFKQRYKVL 632
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1173 APHLTKKhGRnyiVVDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14912    633 NASAIPE-GQ---FIDSKKASEKLLASIDIDHTQYKFGHTKVFFK 673
MYSc_Myh6 cd14916
class II myosin heavy chain 6, motor domain; Myosin motor domain of alpha (or fast) cardiac ...
468-1174 2.86e-61

class II myosin heavy chain 6, motor domain; Myosin motor domain of alpha (or fast) cardiac muscle myosin heavy chain 6. Cardiac muscle myosin is a hexamer consisting of two heavy chain subunits, two light chain subunits, and two regulatory subunits. This gene encodes the alpha heavy chain subunit of cardiac myosin. Mutations in this gene cause familial hypertrophic cardiomyopathy and atrial septal defect. Class II myosins, also called conventional myosins, are the myosin type responsible for producing actomyosin contraction in metazoan muscle and non-muscle cells. Myosin II contains two heavy chains made up of the head (N-terminal) and tail (C-terminal) domains with a coiled-coil morphology that holds the two heavy chains together. The intermediate neck domain is the region creating the angle between the head and tail. It also contains 4 light chains which bind the heavy chains in the "neck" region between the head and tail. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. Class-II myosins are regulated by phosphorylation of the myosin light chain or by binding of Ca2+. A cyclical interaction between myosin and actin provides the driving force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276880 [Multi-domain]  Cd Length: 670  Bit Score: 224.55  E-value: 2.86e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd14916      2 AVLYNLKERYAAWMIYTYSGLFCVTVNPYKWLPVYNAEVVAAYRGKKRSEAPPHIFSISDNAYQYMLTDRENQSILITGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLATIA--GTSGTKVFSVEKWQAL-------STLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQV 618
Cdd:cd14916     82 SGAGKTVNTKRVIQYFASIAaiGDRSKKENPNANKGTLedqiiqaNPALEAFGNAKTVRNDNSSRFGKFIRIHFGATGKL 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  619 ASASIQTMLLEKLRVARRPASEATFNVFYYLLAcgdaTLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQA--- 695
Cdd:cd14916    162 ASADIETYLLEKSRVIFQLKAERNYHIFYQILS----NKKPELLDMLLVTNNPYDYAFVSQGEVSVASIDDSEELLAtds 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGT--LQR 773
Cdd:cd14916    238 AFDVLGFTAEEKAGVYKLTGAIMHYGNMKFKQKQREE-QAEPDGTEDADKSAYLMGLNSADLLKGLCHPRVKVGNeyVTK 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  774 STSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSF 853
Cdd:cd14916    317 GQSVQQ--------------VYYSIGALAKSVYEKMFNWMVTRINATLETKQPRQYFIGVLDIAGFEIFDFN------SF 376
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLepvADDSVAAVDQAshlvrslahaDEARGLLWLLEEEALV 933
Cdd:cd14916    377 EQLCINFTNEKLQQFFNHHMFVLEQEEYKKEGIEWEFIDF---GMDLQACIDLI----------EKPMGIMSILEEECMF 443
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  934 PGATEDALLDRLF-SYYGPQEGDKKGQSplLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKK 1012
Cdd:cd14916    444 PKASDMTFKAKLYdNHLGKSNNFQKPRN--VKGKQEAHFSLVHYAGT--VDYNILGWLEKNK-DPLNETVVGLYQKSSLK 518
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1013 IISNLFLGRAGSATvlsgsiaGLEGGSQLALRRATSMRKtfttgMAAVKKKSLciqiklqvDALIDTIKRSKMHFVHCFL 1092
Cdd:cd14916    519 LMATLFSTYASADT-------GDSGKGKGGKKKGSSFQT-----VSALHRENL--------NKLMTNLKTTHPHFVRCII 578
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1093 PVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1172
Cdd:cd14916    579 PNERKAPG-----------------------------VMDNPLVMHQLRCNGVLEGIRICRKGFPNRILYGDFRQRYRIL 629

                   ..
gi 1907082211 1173 AP 1174
Cdd:cd14916    630 NP 631
MYSc_Myo1 cd01378
class I myosin, motor domain; Myosin I generates movement at the leading edge in cell motility, ...
467-1093 5.68e-60

class I myosin, motor domain; Myosin I generates movement at the leading edge in cell motility, and class I myosins have been implicated in phagocytosis and vesicle transport. Myosin I, an unconventional myosin, does not form dimers. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. There are 5 myosin subclasses with subclasses c/h, d/g, and a/b have an IQ domain and a TH1 domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276829  Cd Length: 652  Bit Score: 220.11  E-value: 5.68e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPslLVLST---RGAPaVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIV 543
Cdd:cd01378      1 EAINENLKKRFENDEIYTYIGH--VLISVnpfKDLG-IYTDEVLESYRGKNRYEVPPHVFALADSAYRNMKSEKENQCVI 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  544 LLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVeKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASA 621
Cdd:cd01378     78 ISGESGAGKTEASKRIMQYIAAVSGGSESEVERV-KDMLLASnpLLEAFGNAKTLRNDNSSRFGKYMEIQFDFKGEPVGG 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  622 SIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL------NHLAENNVFGIVPLSKpeekqkaAQQFSKLQA 695
Cdd:cd01378    157 HITNYLLEKSRVVGQIKGERNFHIFYQLLKGASQEYLQELGLqrpeqyYYYSKSGCFDVDGIDD-------AADFKEVLN 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLG----AAGATKEAAEAGRKQfarhewAQKAAYLLGCSLEELSSAIFKHQLKGGTL 771
Cdd:cd01378    230 AMKVIGFTEEEQDSIFRILAAILHLGniqfAEDEEGNAAISDTSV------LDFVAYLLGVDPDQLEKALTHRTIETGGG 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  772 QRST-SFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGF----QNpewg 845
Cdd:cd01378    304 GRSVyEVPLNVEQ----------AAYARDALAKAIYSRLFDWIVERINKSLAAKSGGKKKVIgVLDIYGFeifeKN---- 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  846 gsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA----FDDlEPVADdsvaavdqashLVRS-----LAH 916
Cdd:cd01378    370 ------SFEQFCINYVNEKLQQIFIELTLKAEQEEYVREGIEWTpikyFNN-KIICD-----------LIEEkppgiFAI 431
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  917 ADEArgllwlleeEALVPGATEDALLDRLFSYYGPQEGDKKGQSPllRSSKPRHFLLGHSHGTnwVEYNVAGwlnYTKQN 996
Cdd:cd01378    432 LDDA---------CLTAGDATDQTFLQKLNQLFSNHPHFECPSGH--FELRRGEFRIKHYAGD--VTYNVEG---FLDKN 495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  997 PAT--QNAPRLLQDSQKKIISNLFlgragsatvlsgsiagLEGGSQLALRRATsmrktfTTGMaavkkkslciQIKLQVD 1074
Cdd:cd01378    496 KDLlfKDLKELMQSSSNPFLRSLF----------------PEGVDLDSKKRPP------TAGT----------KFKNSAN 543
                          650
                   ....*....|....*....
gi 1907082211 1075 ALIDTIKRSKMHFVHCFLP 1093
Cdd:cd01378    544 ALVETLMKKQPSYIRCIKP 562
MYSc_Myo42 cd14903
class XLII myosin, motor domain; The class XLII myosins are comprised of Stramenopiles. Not ...
467-1174 8.44e-59

class XLII myosin, motor domain; The class XLII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276868 [Multi-domain]  Cd Length: 658  Bit Score: 216.95  E-value: 8.44e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 545
Cdd:cd14903      1 AAILYNVKKRFLRKLPYTYTGDICIAVNPyQWLPELYTEEQHSKYLNKPKEELPPHVYATSVAAYNHMKRSGRNQSILVS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATIAGtsGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd14903     81 GESGAGKTETTKILMNHLATIAG--GLNDSTIKKIIEVNPLLESFGNAKTVRNDNSSRFGKFTQLQFDKNGTLVGAKCRT 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAennVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 705
Cdd:cd14903    159 YLLEKTRVISHERPERNYHIFYQLLASPDVEERLFLDSANEC---AYTGANKTIKIEGMSDRKHFARTKEALSLIGVSEE 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  706 EQKTCWLILASIYHLGAAGATKEAAEAGRKQFAR-HEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRSTSFRqgpees 784
Cdd:cd14903    236 KQEVLFEVLAGILHLGQLQIQSKPNDDEKSAIAPgDQGAVYATKLLGLSPEALEKALCSRTMRAAGDVYTVPLK------ 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  785 glgegtKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsargaSFEELCHNYAQDR 864
Cdd:cd14903    310 ------KDQAEDCRDALAKAIYSNVFDWLVATINASLGNDAKMANHIGVLDIFGFEHFKHN------SFEQFCINYANEK 377
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  865 LQRLFHERTFLQELERYKEDNIELAFDDLepvADDS-VAAVDQASHLVRSLAHADEARgllwlleeealvPGATEDALLD 943
Cdd:cd14903    378 LQQKFTQDVFKTVQIEYEEEGIRWAHIDF---ADNQdVLAVIEDRLGIISLLNDEVMR------------PKGNEESFVS 442
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  944 RLFSYYgpqeGDKKGQSPLLRSSKPRhFLLGHSHGTnwVEYNVAGWLNYTKQNpatqnaprLLQD-------SQKKIISN 1016
Cdd:cd14903    443 KLSSIH----KDEQDVIEFPRTSRTQ-FTIKHYAGP--VTYESLGFLEKHKDA--------LLPDlsdlmrgSSKPFLRM 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1017 LFLGRAGSATVLSGSIAGLEGgsqlalRRATSMRKTFTTGMaavkkkslciQIKLQVDALIDTIKRSKMHFVHCFLPVAE 1096
Cdd:cd14903    508 LFKEKVESPAAASTSLARGAR------RRRGGALTTTTVGT----------QFKDSLNELMTTIRSTNVHYVRCIKPNSI 571
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1097 GWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd14903    572 KSPTE-----------------------------LDHLMVVSQLRCAGVIEAIRISRAAYPNRLLHEEFLDKFWLFLP 620
MYSc_Myo7 cd01381
class VII myosin, motor domain; These monomeric myosins have been associated with functions in ...
467-1174 2.94e-58

class VII myosin, motor domain; These monomeric myosins have been associated with functions in sensory systems such as vision and hearing. Mammalian myosin VII has a tail with 2 MyTH4 domains, 2 FERM domains, and a SH3 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276832  Cd Length: 648  Bit Score: 214.81  E-value: 2.94e-58
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGpSLLVlstrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 538
Cdd:cd01381      1 AGILRNLLIRYREKLIYTYTG-SILV-------AVnpyqilpiYTAEQIRLYRNKKIGELPPHIFAIADNAYTNMKRNKR 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  539 DQSIVLLGSSGSGKTTSFQHLVQYLATIAGtsgtKVFSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd01381     73 DQCVVISGESGAGKTESTKLILQYLAAISG----QHSWIEQ-QILEAnpILEAFGNAKTIRNDNSSRFGKYIDIHFNKNG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNH------LAENNVFGIvplskpeEKQKAAQQF 690
Cdd:cd01381    148 VIEGAKIEQYLLEKSRIVSQAPDERNYHIFYCMLAGLSAEEKKKLELGDasdyyyLTQGNCLTC-------EGRDDAAEF 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  691 SKLQAAMKVLAISPEEQKTCWLILASIYHLGAAG------ATKEAAEagrkqFARHEWAQKAAYLLGCSLEELSSAIFKH 764
Cdd:cd01381    221 ADIRSAMKVLMFTDEEIWDIFKLLAAILHLGNIKfeatvvDNLDASE-----VRDPPNLERAAKLLEVPKQDLVDALTTR 295
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  765 QL--KGGTLQRSTSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRAL---KSSQHSLCSMMIVDTPGF 839
Cdd:cd01381    296 TIftRGETVVSPLSAEQ--------------ALDVRDAFVKGIYGRLFIWIVNKINSAIykpRGTDSSRTSIGVLDIFGF 361
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  840 QNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsVAAVDQAShlvrSLAH 916
Cdd:cd01381    362 ENFEVN------SFEQLCINFANENLQQFFVRHIFKLEQEEYDKEGINwqhIEFVDNQDVLD--LIALKPMN----IMSL 429
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  917 ADEargllwlleeEALVPGATEDALLDRLFSYYGPQEGDKKGQSPLLRSSKPRHFLlghshGTnwVEYNVAGWLNytKQN 996
Cdd:cd01381    430 IDE----------ESKFPKGTDQTMLEKLHSTHGNNKNYLKPKSDLNTSFGINHFA-----GV--VFYDTRGFLE--KNR 490
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  997 PA-TQNAPRLLQDSQKKIISNLFlgragsatvlsgsiagleggsQLALRRATSMRKtfttgmaavKKKSLCIQIKLQVDA 1075
Cdd:cd01381    491 DTfSADLLQLVQSSKNKFLKQLF---------------------NEDISMGSETRK---------KSPTLSSQFRKSLDQ 540
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1076 LIDTIKRSKMHFVHCFLPVAEGWPGEprsassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQG 1155
Cdd:cd01381    541 LMKTLSACQPFFVRCIKPNEYKKPML-----------------------------FDRELCVRQLRYSGMMETIRIRKAG 591
                          730
                   ....*....|....*....
gi 1907082211 1156 YPDHMVFSEFRRRFDVLAP 1174
Cdd:cd01381    592 YPIRHTFEEFVERYRVLVP 610
MYSc_Myo10 cd14873
class X myosin, motor domain; Myosin X is an unconventional myosin motor that functions as a ...
468-1177 2.44e-57

class X myosin, motor domain; Myosin X is an unconventional myosin motor that functions as a monomer. In mammalian cells, the motor is found to localize to filopodia. Myosin X walks towards the barbed ends of filaments and is thought to walk on bundles of actin, rather than single filaments, a unique behavior. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to the head domain are a variable number of IQ domains, 2 PH domains, a MyTH4 domain, and a FERM domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276840 [Multi-domain]  Cd Length: 651  Bit Score: 212.35  E-value: 2.44e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGpSLL--VLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 545
Cdd:cd14873      2 SIMYNLFQRYKRNQIYTYIG-SILasVNPYQPIAGLYEPATMEQYSRRHLGELPPHIFAIANECYRCLWKRHDNQCILIS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATIAGTS-----GTKVFSVEKWQALST-LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVA 619
Cdd:cd14873     81 GESGAGKTESTKLILKFLSVISQQSlelslKEKTSCVEQAILESSpIMEAFGNAKTVYNNNSSRFGKFVQLNICQKGNIQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  620 SASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhLAEN----NVFGIVPLSKPEEKqkaaQQFSKLQA 695
Cdd:cd14873    161 GGRIVDYLLEKNRVVRQNPGERNYHIFYALLAGLEHEEREEFYLS-TPENyhylNQSGCVEDKTISDQ----ESFREVIT 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAgatkEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIfkhqlkggtLQRST 775
Cdd:cd14873    236 AMEVMQFSKEEVREVSRLLAGILHLGNI----EFITAGGAQVSFKTALGRSAELLGLDPTQLTDAL---------TQRSM 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFRqGPEESglgegTKLS---ALECLEGMASGLYSELFTLLISLVNRALKSSQHsLCSMMIVDTPGFQNPEWGgsargaS 852
Cdd:cd14873    303 FLR-GEEIL-----TPLNvqqAVDSRDSLAMALYARCFEWVIKKINSRIKGKED-FKSIGILDIFGFENFEVN------H 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  853 FEELCHNYAQDRLQRLFHERTF-LQELERYKEDnieLAFDDLEPVadDSVAAVDQASHLVRSLAHADEargllwlleeEA 931
Cdd:cd14873    370 FEQFNINYANEKLQEYFNKHIFsLEQLEYSREG---LVWEDIDWI--DNGECLDLIEKKLGLLALINE----------ES 434
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  932 LVPGATEDALLDRLFSyygpqeGDKKGQSPLlrssKPR--HFLLGHSHGTNWVEYNVAGWLnytKQNPAT--QNAPRLLQ 1007
Cdd:cd14873    435 HFPQATDSTLLEKLHS------QHANNHFYV----KPRvaVNNFGVKHYAGEVQYDVRGIL---EKNRDTfrDDLLNLLR 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1008 DSQKKIISNLFlgragsatvlsgsiagleggsqlalRRATSMRKTFTTGMAAVKKK-SLCIQIKLQVDALIDTIKRSKMH 1086
Cdd:cd14873    502 ESRFDFIYDLF-------------------------EHVSSRNNQDTLKCGSKHRRpTVSSQFKDSLHSLMATLSSSNPF 556
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1087 FVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFR 1166
Cdd:cd14873    557 FVRCIKPNMQKMPD-----------------------------QFDQAVVLNQLRYSGMLETVRIRKAGYAVRRPFQDFY 607
                          730
                   ....*....|.
gi 1907082211 1167 RRFDVLAPHLT 1177
Cdd:cd14873    608 KRYKVLMRNLA 618
MYSc_Myo5 cd01380
class V myosin, motor domain; Myo5, also called heavy chain 12, myoxin, are dimeric myosins ...
468-892 3.36e-57

class V myosin, motor domain; Myo5, also called heavy chain 12, myoxin, are dimeric myosins that transport a variety of intracellular cargo processively along actin filaments, such as melanosomes, synaptic vesicles, vacuoles, and mRNA. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. It also contains a IQ domain and a globular DIL domain. Myosin V is a class of actin-based motor proteins involved in cytoplasmic vesicle transport and anchorage, spindle-pole alignment and mRNA translocation. The protein encoded by this gene is abundant in melanocytes and nerve cells. Mutations in this gene cause Griscelli syndrome type-1 (GS1), Griscelli syndrome type-3 (GS3) and neuroectodermal melanolysosomal disease, or Elejalde disease. Multiple alternatively spliced transcript variants encoding different isoforms have been reported, but the full-length nature of some variants has not been determined. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. Note that the Dictyostelium myoVs are not contained in this child group. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276831 [Multi-domain]  Cd Length: 629  Bit Score: 211.24  E-value: 3.36e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYG-ASLLHTYAGpslLVLStrgapAV--------YSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQ 538
Cdd:cd01380      2 AVLHNLKVRFCqRNAIYTYCG---IVLV-----AInpyedlpiYGEDIIQAYSGQNMGELDPHIFAIAEEAYRQMARDEK 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  539 DQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd01380     74 NQSIIVSGESGAGKTVSAKYAMRYFATVGGSSSGET-QVEE-KVLASnpIMEAFGNAKTTRNDNSSRFGKYIEILFDKNY 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFGIVPLSKPEEKQKAAQQFSKLQAA 696
Cdd:cd01380    152 RIIGANMRTYLLEKSRVVFQAEEERNYHIFYQLCAAASLPELKELHLGS-AEDFFYTNQGGSPVIDGVDDAAEFEETRKA 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISPEEQKTCWLILASIYHLGAAgATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLK--GGTLQRS 774
Cdd:cd01380    231 LTLLGISEEEQMEIFRILAAILHLGNV-EIKATRNDSASISPDDEHLQIACELLGIDESQLAKWLCKRKIVtrSEVIVKP 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 TSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQ-HSLCSMM-IVDTPGFQNPEWGgsargaS 852
Cdd:cd01380    310 LTLQQ--------------AIVARDALAKHIYAQLFDWIVDRINKALASPVkEKQHSFIgVLDIYGFETFEVN------S 369
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1907082211  853 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDD 892
Cdd:cd01380    370 FEQFCINYANEKLQQQFNQHVFKLEQEEYVKEEIEWSFID 409
MYSc_Myo36 cd14897
class XXXVI myosin, motor domain; This class of molluscan myosins contains a motor domain ...
468-1019 1.74e-56

class XXXVI myosin, motor domain; This class of molluscan myosins contains a motor domain followed by a GlcAT-I (Beta1,3-glucuronyltransferase I) domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276862 [Multi-domain]  Cd Length: 635  Bit Score: 209.55  E-value: 1.74e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCR-REDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14897      2 TIVQTLKSRYNKDKFYTYIGDILVAVNPCKPLPIFDKKHHEEYSNLSvRSQRPPHLFWIADQAYRRLLETGRNQCILVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTkvFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd14897     82 ESGAGKTESTKYMIKHLMKLSPSDDS--DLLDKIVQINPLLEAFGNASTVMNDNSSRFGKFIELHFTENGQLLGAKIDDY 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNH----------LAENNVFGIVplskpEEKQKAAQQFSKLQAA 696
Cdd:cd14897    160 LLEKSRVVHRGNGEKNFHIFYALFAGMSRDRLLYYFLEDpdchrilrddNRNRPVFNDS-----EELEYYRQMFHDLTNI 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISPEEQKTCWLILASIYHLgAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIF--KHQLKGGTLQRS 774
Cdd:cd14897    235 MKLIGFSEEDISVIFTILAAILHL-TNIVFIPDEDTDGVTVADEYPLHAVAKLLGIDEVELTEALIsnVNTIRGERIQSW 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 TSFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQ-----HSLCSMMIVDTPGFQNpewggsAR 849
Cdd:cd14897    314 KSLRQ--------------ANDSRDALAKDLYSRLFGWIVGQINRNLWPDKdfqimTRGPSIGILDMSGFEN------FK 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  850 GASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAfdDLEPVADDSV------------AAVDQASHLvrslaha 917
Cdd:cd14897    374 INSFDQLCINLSNERLQQYFNDYVFPRERSEYEIEGIEWR--DIEYHDNDDVlelffkkplgilPLLDEESTF------- 444
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  918 deargllwlleeealvPGATEDALLDRLFSYYGPqegdkkgqSPLLRSSKPRHFLLGHSHGTNWVEYNVAGWLNYTKQNp 997
Cdd:cd14897    445 ----------------PQSTDSSLVQKLNKYCGE--------SPRYVASPGNRVAFGIRHYAEQVTYDADGFLEKNRDN- 499
                          570       580
                   ....*....|....*....|..
gi 1907082211  998 ATQNAPRLLQDSQKKIISNLFL 1019
Cdd:cd14897    500 LSSDIVGCLLNSNNEFISDLFT 521
MYSc_Myo3 cd01379
class III myosin, motor domain; Myosin III has been shown to play a role in the vision process ...
468-898 2.56e-56

class III myosin, motor domain; Myosin III has been shown to play a role in the vision process in insects and in hearing in mammals. Myosin III, an unconventional myosin, does not form dimers. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. They are characterized by an N-terminal protein kinase domain and several IQ domains. Some members also contain WW, SH2, PH, and Y-phosphatase domains. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276830 [Multi-domain]  Cd Length: 633  Bit Score: 208.67  E-value: 2.56e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd01379      2 TIVSQLQKRYSRDQIYTYIGDILIAVNPFQNLGIYTEEHSRLYRGAKRSDNPPHIFAVADAAYQAMIHQKKNQCIVISGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLaTIAGTSGTKVFSvEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTML 627
Cdd:cd01379     82 SGAGKTESANLLVQQL-TVLGKANNRTLE-EKILQVNPLMEAFGNARTVINDNSSRFGKYLEMKFTSTGAVTGARISEYL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  628 LEKLRVARRPASEATFNVFYYLLA-------CGDATLRTELHLNHLAEnnvFGIVPLSKpEEKQKAAQQFSKLQAAMKVL 700
Cdd:cd01379    160 LEKSRVVHQAIGERNFHIFYYIYAglaedkkLAKYKLPENKPPRYLQN---DGLTVQDI-VNNSGNREKFEEIEQCFKVI 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  701 AISPEEQKTCWLILASIYHLGA---AGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQL--KGGTLQRST 775
Cdd:cd01379    236 GFTKEEVDSVYSILAAILHIGDiefTEVESNHQTDKSSRISNPEALNNVAKLLGIEADELQEALTSHSVvtRGETIIRNN 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFRQgpeesglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHS---LCSMMIVDTPGFQNpewggsARGAS 852
Cdd:cd01379    316 TVEE--------------ATDARDAMAKALYGRLFSWIVNRINSLLKPDRSAsdePLSIGILDIFGFEN------FQKNS 375
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211  853 FEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL---AFDDLEPVAD 898
Cdd:cd01379    376 FEQLCINIANEQIQYYFNQHIFAWEQQEYLNEGIDVdliEYEDNRPLLD 424
MYSc_Myo15 cd01387
class XV mammal-like myosin, motor domain; The class XV myosins are monomeric. In vertebrates, ...
467-1217 2.76e-54

class XV mammal-like myosin, motor domain; The class XV myosins are monomeric. In vertebrates, myosin XV appears to be expressed in sensory tissue and play a role in hearing. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to the head domain are 2 MyTH4 domain, a FERM domain, and a SH3 domain. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276838 [Multi-domain]  Cd Length: 657  Bit Score: 203.45  E-value: 2.76e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01387      1 TTVLWNLKTRYERNLIYTYIGSILVSVNPYKMFDIYGLEQVQQYSGRALGELPPHLFAIANLAFAKMLDAKQNQCVVISG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQaGQVASASIQTM 626
Cdd:cd01387     81 ESGSGKTEATKLIMQYLAAVNQRRNNLV--TEQILEATPLLEAFGNAKTVRNDNSSRFGKYLEVFFEG-GVIVGAITSQY 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTE---------LHLNHLAENNVFGivplskpeekQKAAQQFSKLQAAM 697
Cdd:cd01387    158 LLEKSRIVTQAKNERNYHVFYELLAGLPAQLRQKyglqeaekyFYLNQGGNCEIAG----------KSDADDFRRLLAAM 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  698 KVLAISPEEQKTCWLILASIYHLG-------AAGATKEAAEAGRKqfARHEWaqkAAYLLGCSLEELSSAIfkhqlkggt 770
Cdd:cd01387    228 QVLGFSSEEQDSIFRILASVLHLGnvyfhkrQLRHGQEGVSVGSD--AEIQW---VAHLLQISPEGLQKAL--------- 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  771 LQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarg 850
Cdd:cd01387    294 TFKVTETRRERIFTPL---TIDQALDARDAIAKALYALLFSWLVTRVNAIVYSGTQDTLSIAILDIFGFEDLSEN----- 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  851 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVAddsvaavdqasHLVrslahADEARGLLWLL 927
Cdd:cd01387    366 -SFEQLCINYANENLQYYFNKHVFKLEQEEYIREQIDwteIAFADNQPVI-----------NLI-----SKKPVGILHIL 428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  928 EEEALVPGATEDALLDRLFSYYGPQEgdkkgqspllRSSKPR----HFLLGHSHGTNWveYNVAGWLNYTKqNPATQNAP 1003
Cdd:cd01387    429 DDECNFPQATDHSFLEKCHYHHALNE----------LYSKPRmplpEFTIKHYAGQVW--YQVHGFLDKNR-DQLRQDVL 495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1004 RLLQDSQKKIISNLFlgragsatvlsgsiagleggSQLALRRATSMRKTFTTGMAAVKKKSLCIQIKLQ--VDALIDTIK 1081
Cdd:cd01387    496 ELLVSSRTRVVAHLF--------------------SSHRAQTDKAPPRLGKGRFVTMKPRTPTVAARFQdsLLQLLEKME 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1082 RSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQLDVSLlrAQLRGSRLLDAMRMYRQGYPDHMV 1161
Cdd:cd01387    556 RCNPWFVRC---------------------------LKPNHKKEPMLFDMDVVM--AQLRYSGMLETIRIRKEGYPVRLP 606
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1162 FSEFRRRFDvlapHLTKKHGRNYIVVDEKRAVeELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd01387    607 FQVFIDRYR----CLVALKLPRPAPGDMCVSL-LSRLCTVTPKDMYRLGATKVFLR 657
MYSc_Myo28 cd14889
class XXVIII myosin, motor domain; These myosins are found in fish, chicken, and mollusks. The ...
469-1172 4.51e-54

class XXVIII myosin, motor domain; These myosins are found in fish, chicken, and mollusks. The tail regions of these class-XXVIII myosins consist of an IQ motif, a short coiled-coil region, and an SH2 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276854  Cd Length: 659  Bit Score: 202.83  E-value: 4.51e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  469 VLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAML----MSRQDQSIVL 544
Cdd:cd14889      3 LLEVLKVRFMQSNIYTYVGDILVAINPFKYLHIYEKEVSQKYKCEKKSSLPPHIFAVADRAYQSMLgrlaRGPKNQCIVI 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  545 LGSSGSGKTTSFQHLVQYLATIAgtSGTKVFSVEKWQaLSTLLEAFGNSPTIMNGSATRFSQILSLDFdQAGQVASASIQ 624
Cdd:cd14889     83 SGESGAGKTESTKLLLRQIMELC--RGNSQLEQQILQ-VNPLLEAFGNAQTVMNDNSSRFGKYIQLRF-RNGHVKGAKIN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  625 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL----NHLAENNVFGivplsKPEEKQKAAQQFSKLQAAMKVL 700
Cdd:cd14889    159 EYLLEKSRVVHQDGGEENFHIFYYMFAGISAEDRENYGLldpgKYRYLNNGAG-----CKREVQYWKKKYDEVCNAMDMV 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  701 AISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSsaifkhqlkgGTLQRSTSFRQG 780
Cdd:cd14889    234 GFTEQEEVDMFTILAGILSLGNITFEMDDDEALKVENDSNGWLKAAAGQFGVSEEDLL----------KTLTCTVTFTRG 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  781 peESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHS---LCSMMIVDTPGFQNPEWGgsargaSFEELC 857
Cdd:cd14889    304 --EQIQRHHTKQQAEDARDSIAKVAYGRVFGWIVSKINQLLAPKDDSsveLREIGILDIFGFENFAVN------RFEQAC 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  858 HNYAQDRLQRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD-------DSVAAVDQASHLvrslahadeargllwll 927
Cdd:cd14889    376 INLANEQLQYFFNHHIFLMEQKEYKKEGIdwkEITYKDNKPILDlflnkpiGILSLLDEQSHF----------------- 438
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  928 eeealvPGATEDALLDRLFSYYGPQEGDKKGQspllrsSKPRHFLLGHSHGTnwVEYNVAGWLNYTKQN-PATQNAprLL 1006
Cdd:cd14889    439 ------PQATDESFVDKLNIHFKGNSYYGKSR------SKSPKFTVNHYAGK--VTYNASGFLEKNRDTiPASIRT--LF 502
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1007 QDSQKKIISNLFLGRAGSATVLSGSIAGLEGGSQlalrratsmrktfttGMAAVKKKSLCIQIKLQVDALIDTIKRSKMH 1086
Cdd:cd14889    503 INSATPLLSVLFTATRSRTGTLMPRAKLPQAGSD---------------NFNSTRKQSVGAQFKHSLGVLMEKMFAASPH 567
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1087 FVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFR 1166
Cdd:cd14889    568 FVRCIKPNHVKVPG-----------------------------QLDSKYIQDQLRYNGLLETIRIRREGFSWRPSFAEFA 618

                   ....*.
gi 1907082211 1167 RRFDVL 1172
Cdd:cd14889    619 ERYKIL 624
MYSc_Myo17 cd14879
class XVII myosin, motor domain; This fungal myosin which is also known as chitin synthase ...
464-896 1.02e-53

class XVII myosin, motor domain; This fungal myosin which is also known as chitin synthase uses its motor domain to tether its vesicular cargo to peripheral actin. It works in opposition to dynein, contributing to the retention of Mcs1 vesicles at the site of cell growth and increasing vesicle fusion necessary for polarized growth. Class 17 myosins consist of a N-terminal myosin motor domain with Cyt-b5, chitin synthase 2, and a DEK_C domains at it C-terminus. The chitin synthase region contains several transmembrane domains by which myosin 17 is thought to bind secretory vesicles. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276845 [Multi-domain]  Cd Length: 647  Bit Score: 201.24  E-value: 1.02e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  464 LNESSVLHTLRQRYGASLLHTYAGPS-LLVLSTRGAPAVYSEKVMHMFK-------GCRREDMAPHIYAVAQTAYRAMLM 535
Cdd:cd14879      1 PSDDAITSHLASRFRSDLPYTRLGSSaLVAVNPYKYLSSNSDASLGEYGseyydttSGSKEPLPPHAYDLAARAYLRMRR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  536 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTS--GTKVfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFD 613
Cdd:cd14879     81 RSEDQAVVFLGETGSGKSESRRLLLRQLLRLSSHSkkGTKL--SSQISAAEFVLDSFGNAKTLTNPNASRFGRYTELQFN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  614 QAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF---GIVPL-SKPEEKQkaAQQ 689
Cdd:cd14879    159 ERGRLIGAKVLDYRLERSRVASVPTGERNFHVFYYLLAGASPEERQHLGLDDPSDYALLasyGCHPLpLGPGSDD--AEG 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  690 FSKLQAAMKVLAISPEEQKTCWLILASIYHLgaagatkeaaeaGRKQFA-----RHEWA--------QKAAYLLGCSLEE 756
Cdd:cd14879    237 FQELKTALKTLGFKRKHVAQICQLLAAILHL------------GNLEFTydhegGEESAvvkntdvlDIVAAFLGVSPED 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  757 LSSAI-FKHQLKGGtlQRSTSFrqgpeesgLG-EGTKLSALEclegMASGLYSELFTLLISLVNRALKSSQHSLCSMM-I 833
Cdd:cd14879    305 LETSLtYKTKLVRK--ELCTVF--------LDpEGAAAQRDE----LARTLYSLLFAWVVETINQKLCAPEDDFATFIsL 370
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211  834 VDTPGFQNPewgGSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA----FDDLEPV 896
Cdd:cd14879    371 LDFPGFQNR---SSTGGNSLDQFCVNFANERLHNYVLRSFFERKAEELEAEGVSVPatsyFDNSDCV 434
MYSc_Myo35 cd14896
class XXXV myosin, motor domain; This class of metazoan myosins contains 2 IQ motifs, 2 MyTH4 ...
467-1173 1.97e-53

class XXXV myosin, motor domain; This class of metazoan myosins contains 2 IQ motifs, 2 MyTH4 domains, a single FERM domain, and an SH3 domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276861 [Multi-domain]  Cd Length: 644  Bit Score: 200.39  E-value: 1.97e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14896      1 SSVLLCLKKRFHLGRIYTFGGPILLSLNPHRSLPLFSEEVLASYHPRKALNTTPHIFAIAASAYRLSQSTGQDQCILLSG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIagtsGTKVFSVEKWQALSTL--LEAFGNSPTIMNGSATRFSQILSLDFdQAGQVASASIQ 624
Cdd:cd14896     81 HSGSGKTEAAKKIVQFLSSL----YQDQTEDRLRQPEDVLpiLESFGHAKTILNANASRFGQVLRLHL-QHGVIVGASVS 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  625 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAEN----NVFGIVPLSKPEEkqkaAQQFSKLQAAMKVL 700
Cdd:cd14896    156 HYLLETSRVVFQAQAERSFHVFYELLAGLDPEEREQLSLQG-PETyyylNQGGACRLQGKED----AQDFEGLLKALQGL 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  701 AISPEEQKTCWLILASIYHLGA---AGATKEAAEAGrkqfARHEWA--QKAAYLLGCSLEELSSAIFKHQLKGGTLQRST 775
Cdd:cd14896    231 GLCAEELTAIWAVLAAILQLGNicfSSSERESQEVA----AVSSWAeiHTAARLLQVPPERLEGAVTHRVTETPYGRVSR 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFrqgPEEsglgegtklSALECLEGMASGLYSELFTLLISLVNRAL--KSSQHSLCSMMIVDTPGFQnpewggSARGASF 853
Cdd:cd14896    307 PL---PVE---------GAIDARDALAKTLYSRLFTWLLKRINAWLapPGEAESDATIGVVDAYGFE------ALRVNGL 368
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  854 EELCHNYAQDRLQRLFHERTFLQELERYKEDniELAFDDLEPVADDSVA--AVDQASHLVRSLahadeargllwllEEEA 931
Cdd:cd14896    369 EQLCINLASERLQLFSSQTLLAQEEEECQRE--LLPWVPIPQPPRESCLdlLVDQPHSLLSIL-------------DDQT 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  932 LVPGATEDALLDRLFSYYGPQEGDKKGQSPLlrsskPRhFLLGHSHGTnwVEYNVAGWLNYTKQ--NPATQNaprLLQDS 1009
Cdd:cd14896    434 WLSQATDHTFLQKCHYHHGDHPSYAKPQLPL-----PV-FTVRHYAGT--VTYQVHKFLNRNRDqlDPAVVE---MLAQS 502
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1010 QKKIISNLFlgragsatvlsgsiagLEGGSQLALRRatsmrktfttgmaavKKKSLCIQIKLQVDALIDTIKRSKMHFVH 1089
Cdd:cd14896    503 QLQLVGSLF----------------QEAEPQYGLGQ---------------GKPTLASRFQQSLGDLTARLGRSHVYFIH 551
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1090 CFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRF 1169
Cdd:cd14896    552 CLNPNPGKLPG-----------------------------LFDVGHVTEQLRQAGILEAIGTRSEGFPVRVPFQAFLARF 602

                   ....
gi 1907082211 1170 DVLA 1173
Cdd:cd14896    603 GALG 606
MYSc_Myo43 cd14904
class XLIII myosin, motor domain; The class XLIII myosins are comprised of Stramenopiles. Not ...
468-1174 1.07e-52

class XLIII myosin, motor domain; The class XLIII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276869  Cd Length: 653  Bit Score: 198.63  E-value: 1.07e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14904      2 SILFNLKKRFAASKPYTYTNDIVIALNPyKWIDNLYGDHLHEQYLKKPRDKLQPHVYATSTAAYKHMLTNEMNQSILVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGtsGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd14904     82 ESGAGKTETTKIVMNHLASVAG--GRKDKTIAKVIDVNPLLESFGNAKTTRNDNSSRFGKFTQLQFDGRGKLIGAKCETY 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEE 706
Cdd:cd14904    160 LLEKSRVVSIAEGERNYHIFYQLLAGLSSEERKEFGLDPNCQYQYLGDSLAQMQIPGLDDAKLFASTQKSLSLIGLDNDA 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  707 QKTCWLILASIYHLGAAGATKEAAEAGRKQfaRHEWAQKAAYLLGCSLEELSSAIfkhqlkggtLQRSTSFRQgpeESGL 786
Cdd:cd14904    240 QRTLFKILSGVLHLGEVMFDKSDENGSRIS--NGSQLSQVAKMLGLPTTRIEEAL---------CNRSVVTRN---ESVT 305
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  787 GEGTKLSALECLEGMASGLYSELFTLLISLVNRALkSSQHSLCSMMI--VDTPGFQNPEWGGsargasFEELCHNYAQDR 864
Cdd:cd14904    306 VPLAPVEAEENRDALAKAIYSKLFDWMVVKINAAI-STDDDRIKGQIgvLDIFGFEDFAHNG------FEQFCINYANEK 378
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  865 LQRLFHERTFLQELERYKEDNieLAFDDLEPVADDSVAAVDQASHLVRSLAHaDEARGllwlleeealvPGATEDALLDR 944
Cdd:cd14904    379 LQQKFTTDVFKTVEEEYIREG--LQWDHIEYQDNQGIVEVIDGKMGIIALMN-DHLRQ-----------PRGTEEALVNK 444
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  945 LFSYYGPQEGDKKGQSPllrSSKPRHFLLGHSHGTnwVEYNVAGWLNytKQNPATQN-APRLLQDSQKKIISNLFlgraG 1023
Cdd:cd14904    445 IRTNHQTKKDNESIDFP---KVKRTQFIINHYAGP--VTYETVGFME--KHRDTLQNdLLDLVLLSSLDLLTELF----G 513
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1024 SATVLSGSIAGLEGGSQLAlrratsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGEpr 1103
Cdd:cd14904    514 SSEAPSETKEGKSGKGTKA-------------------PKSLGSQFKTSLSQLMDNIKTTNTHYVRCIKPNANKSPTE-- 572
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1104 sassrrvsssseldlppgdpceagllqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd14904    573 ---------------------------FDKRMVVEQLRSAGVIEAIRITRSGYPSRLTPKELATRYAIMFP 616
MYSc_Myo9 cd01385
class IX myosin, motor domain; Myosin IX is a processive single-headed motor, which might play ...
468-1174 7.16e-52

class IX myosin, motor domain; Myosin IX is a processive single-headed motor, which might play a role in signalling. It has a N-terminal RA domain, an IQ domain, a C1_1 domain, and a RhoGAP domain. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276836 [Multi-domain]  Cd Length: 690  Bit Score: 196.83  E-value: 7.16e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGS 547
Cdd:cd01385      2 TLLENLRARFKHGKIYTYVGSILIAVNPFKFLPIYNPKYVKMYQNRRLGKLPPHIFAIADVAYHAMLRKKKNQCIVISGE 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLaTIAGTSGTKVfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd01385     82 SGSGKTESTNFLLHHL-TALSQKGYGS-GVEQ-TILGAgpVLEAFGNAKTAHNNNSSRFGKFIQVNYRENGMVRGAVVEK 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHlAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 705
Cdd:cd01385    159 YLLEKSRIVSQEKNERNYHVFYYLLAGASEEERKELHLKQ-PEDYHYLNQSDCYTLEGEDEKYEFERLKQAMEMVGFLPE 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  706 EQKTCWLILASIYHLGaagatkeAAEAGRKQFARHEWAQKA--------AYLLGCSLEELSSAIF--KHQLKGGTLQRST 775
Cdd:cd01385    238 TQRQIFSVLSAVLHLG-------NIEYKKKAYHRDESVTVGnpevldiiSELLRVKEETLLEALTtkKTVTVGETLILPY 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFrqgPEesglgegtklsALECLEGMASGLYSELFTLLISLVNRAL---KSSQHSLC-SMMIVDTPGFQNPEWGgsarga 851
Cdd:cd01385    311 KL---PE-----------AIATRDAMAKCLYSALFDWIVLRINHALlnkKDLEEAKGlSIGVLDIFGFEDFGNN------ 370
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  852 SFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIElaFDDLEPVadDSVAAVdqasHLVRSLAHadearGLLWLLEEEA 931
Cdd:cd01385    371 SFEQFCINYANEHLQYYFNQHIFKLEQEEYKKEGIS--WHNIEYT--DNTGCL----QLISKKPT-----GLLCLLDEES 437
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  932 LVPGATEDALLDRLFS-------YYGPQEgdkkgqspllrssKPRHFLLGHSHGTnwVEYNVAGW----LNYTKQNPATq 1000
Cdd:cd01385    438 NFPGATNQTLLAKFKQqhkdnkyYEKPQV-------------MEPAFIIAHYAGK--VKYQIKDFreknLDLMRPDIVA- 501
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1001 naprLLQDSQKKIISNL-----------------FLGRAGSATVLSGSIAGLEGGSQLALRRATSMRktfttGMAAVKKK 1063
Cdd:cd01385    502 ----VLRSSSSAFVRELigidpvavfrwavlrafFRAMAAFREAGRRRAQRTAGHSLTLHDRTTKSL-----LHLHKKKK 572
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1064 SLCIQIKLQV--DALIDTIKRSKMHFVHCFLPVAEGWPgeprsassrrvsssseldlppgdpceaglLQLDVSLLRAQLR 1141
Cdd:cd01385    573 PPSVSAQFQTslSKLMETLGQAEPFFIRCIKSNAEKKP-----------------------------LRFDDELVLRQLR 623
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082211 1142 GSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd01385    624 YTGMLETVRIRRSGYSVRYTFQEFITQFQVLLP 656
MYSc_Myo31 cd14892
class XXXI myosin, motor domain; Class XXXI myosins have a very long neck region consisting of ...
470-1173 7.30e-52

class XXXI myosin, motor domain; Class XXXI myosins have a very long neck region consisting of 17 IQ motifs and 2 tandem ANK repeats that are separated by a PH domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276857 [Multi-domain]  Cd Length: 656  Bit Score: 196.13  E-value: 7.30e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  470 LHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSekVMHMFKGCRREDMA----PHIYAVAQTAYRAMLMSR----QDQ 540
Cdd:cd14892      4 LDVLRRRYERDAIYTFTADILISINPyKSIPLLYD--VPGFDSQRKEEATAssppPHVFSIAERAYRAMKGVGkgqgTPQ 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  541 SIVLLGSSGSGKTTSFQHLVQYLATI----AGTSGTKVF-----SVEKWQALS-TLLEAFGNSPTIMNGSATRFSQILSL 610
Cdd:cd14892     82 SIVVSGESGAGKTEASKYIMKYLATAsklaKGASTSKGAanaheSIEECVLLSnLILEAFGNAKTIRNDNSSRFGKYIQI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  611 DFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAEnnvFGIVPLSKPEEKQKA--AQ 688
Cdd:cd14892    162 HYNSDGRIAGASTDHFLLEKSRLVGPDANERNYHIFYQLLAGLDANENAALELTPAES---FLFLNQGNCVEVDGVddAT 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  689 QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRK-QFARHEWAQKAAYLLGCSLEELSSAIFKHQLK 767
Cdd:cd14892    239 EFKQLRDAMEQLGFDAEFQRPIFEVLAAVLHLGNVRFEENADDEDVFaQSADGVNVAKAAGLLGVDAAELMFKLVTQTTS 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  768 GGtlqRSTSFRQ--GPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRAlkSSQHSLCSMMIVDTP-------- 837
Cdd:cd14892    319 TA---RGSVLEIklTARE----------AKNALDALCKYLYGELFDWLISRINAC--HKQQTSGVTGGAASPtfspfigi 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  838 ----GFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaaVDQASHL 910
Cdd:cd14892    384 ldifGFE------IMPTNSFEQLCINFTNEMLQQQFNKHVFVLEQEVYASEGIDvsaIEFQDNQDCLD-----LIQKKPL 452
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  911 vrslahadeargllwlleeeALVPGATEDALLDR-------LFSYYgpQEGDKKGQSpllrSSKPR----HFLLGHSHGT 979
Cdd:cd14892    453 --------------------GLLPLLEEQMLLKRkttdkqlLTIYH--QTHLDKHPH----YAKPRfecdEFVLRHYAGD 506
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  980 nwVEYNVAGWLnyTKQNPATQNAPRLLQDSQKKiisnlflgragsatvlsgsiagleggsqlalrratsmrktFTTgmaa 1059
Cdd:cd14892    507 --VTYDVHGFL--AKNNDNLHDDLRDLLRSSSK----------------------------------------FRT---- 538
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1060 vkkkslciqiklQVDALIDTIKRSKMHFVHCFLPvaegwpgeprsassrrvsssSELDLPPGDPCEagllqldvsLLRAQ 1139
Cdd:cd14892    539 ------------QLAELMEVLWSTTPSYIKCIKP--------------------NNLKFPGGFSCE---------LVRDQ 577
                          730       740       750
                   ....*....|....*....|....*....|....
gi 1907082211 1140 LRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1173
Cdd:cd14892    578 LIYSGVLEVVRIRREGFPIRRQFEEFYEKFWPLA 611
MYSc_Myo4 cd14872
class IV myosin, motor domain; These myosins all possess a WW domain either N-terminal or ...
468-1217 7.56e-52

class IV myosin, motor domain; These myosins all possess a WW domain either N-terminal or C-terminal to their motor domain and a tail with a MyTH4 domain followed by a SH3 domain in some instances. The monomeric Acanthamoebas were the first identified members of this group and have been joined by Stramenopiles. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276839  Cd Length: 644  Bit Score: 195.76  E-value: 7.56e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVM--HMFKGCRRedMAPHIYAVAQTAYRAMLMSRQDQSIVLL 545
Cdd:cd14872      2 MIVHNLRKRFKNDQIYTNVGTILISVNPFKRLPLYTPTVMdqYMHKGPKE--MPPHTYNIADDAYRAMIVDAMNQSILIS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATIAGTSGtkvfSVEKwQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASI 623
Cdd:cd14872     80 GESGAGKTEATKQCLSFFAEVAGSTN----GVEQ-RVLLAnpILEAFGNAKTLRNNNSSRFGKWVEIHFDNRGRICGAST 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  624 QTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlaenNVFGIVPLSKPEEKQKA--AQQFSKLQAAMKVLA 701
Cdd:cd14872    155 ENYLLEKSRVVYQIKGERNFHIFYQLLASPDPASRGGWGSS-----AAYGYLSLSGCIEVEGVddVADFEEVVLAMEQLG 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  702 ISPEEQKTCWLILASIYHLG----AAGATKEAAEAgrKQFARHEWAQKAAYLLGCSLEELSSAIFKH--QLKGgtlQRST 775
Cdd:cd14872    230 FDDADINNVMSLIAAILKLGniefASGGGKSLVSG--STVANRDVLKEVATLLGVDAATLEEALTSRlmEIKG---CDPT 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVNRALKSSQHSL-CSMMIVDTPGFQNPEWGgsargaSFE 854
Cdd:cd14872    305 RIPLTPAQ----------ATDACDALAKAAYSRLFDWLVKKINESMRPQKGAKtTFIGVLDIFGFEIFEKN------SFE 368
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  855 ELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVADdsvaavdqashLVrslahadEAR--GLLWLLEE 929
Cdd:cd14872    369 QLCINFTNEKLQQHFNQYTFKLEEALYQSEGVKfehIDFIDNQPVLD-----------LI-------EKKqpGLMLALDD 430
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  930 EALVPGATEDALLDRLfsyyGPQEGDKKGQSPLLRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDS 1009
Cdd:cd14872    431 QVKIPKGSDATFMIAA----NQTHAAKSTFVYAEVRTSRTEFIVKHYAGD--VTYDITGFLEKNK-DTLQKDLYVLLSSS 503
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1010 QKKIISNLFlgragsatvlsgsiagleggSQLALRRATSmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVH 1089
Cdd:cd14872    504 KNKLIAVLF--------------------PPSEGDQKTS-------------KVTLGGQFRKQLSALMTALNATEPHYIR 550
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1090 CflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQLDVSLlrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRF 1169
Cdd:cd14872    551 C---------------------------VKPNQEKRARLFDGFMSL--EQLRYAGVFEAVKIRKTGYPFRYSHERFLKRY 601
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211 1170 DVLAPHLTKKHGRnyivvDEKRAVEELLESLDLEKSSCCLGLSRVFFR 1217
Cdd:cd14872    602 RFLVKTIAKRVGP-----DDRQRCDLLLKSLKQDFSKVQVGKTRVLYR 644
PDZ_MYO18-like cd06747
PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein ...
220-308 1.31e-51

PDZ domain of MYO18A protein, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MYO18 protein and related domains. MYO18 (also known as myosin XVIIIA, KIAA0216, MysPDZ), a member of the myosin superfamily, is involved in regulating cell protrusion and migration, and Golgi trafficking and morphology, and is required for myoblast adhesion and muscle integrity. The MYO18A/MRCK/LRAP35a complex regulates actomyosin retrograde flow in cell protrusion and migration; the PtdIns(4)P/GOLPH3/MYO18A/F-actin complex is a hub for signals that regulate Golgi trafficking function. The MYO18A PDZ domain binds p190Rho-guanine nucleotide exchange factor (p190RhoGEF), Golgin45, and leucine repeat adaptor protein 1 (Lurap1, also known as Lrap35a). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MYO18-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta- strand F.


Pssm-ID: 467229 [Multi-domain]  Cd Length: 90  Bit Score: 176.73  E-value: 1.31e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMLDRAPE-GQAYRRVVHFAEPGAGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQ 298
Cdd:cd06747      1 EITLKRQPTGDFGFSLRRGTIVERGPDdGQELKRTVHFAEPGAGTKNLATGLLPGDRLIEVNGVNVENASRDEIIEMIRK 80
                           90
                   ....*....|
gi 1907082211  299 SGDSVRLKVQ 308
Cdd:cd06747     81 SGDTVTLKVQ 90
MYSc_Myo41 cd14902
class XLI myosin, motor domain; The class XLI myosins are comprised of Stramenopiles. Not much ...
468-1179 1.47e-51

class XLI myosin, motor domain; The class XLI myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276867 [Multi-domain]  Cd Length: 716  Bit Score: 196.27  E-value: 1.47e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFK--------GCRREDMAPHIYAVAQTAYRAMLMS-R 537
Cdd:cd14902      2 ALLQALSERFEHDQIYTSIGDILVALNPlKPLPDLYSESQLNAYKasmtstspVSQLSELPPHVFAIGGKAFGGLLKPeR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  538 QDQSIVLLGSSGSGKTTSFQHLVQYLATI-----AGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSL 610
Cdd:cd14902     82 RNQSILVSGESGSGKTESTKFLMQFLTSVgrdqsSTEQEGSDAVEIGKRILQTnpILESFGNAQTIRNDNSSRFGKFIKI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  611 DFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAE---NNVFGIVPLSKPEEKQKAA 687
Cdd:cd14902    162 QFGANNEIVGAQIVSYLLEKVRLLHQSPEERSFHIFYELLEGADKTLLDLLGLQKGGKyelLNSYGPSFARKRAVADKYA 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  688 QQFSKLQAAMKVLAISPEEQKTCWLILASIYHLG--AAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQ 765
Cdd:cd14902    242 QLYVETVRAFEDTGVGELERLDIFKILAALLHLGnvNFTAENGQEDATAVTAASRFHLAKCAELMGVDVDKLETLLSSRE 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  766 LKGGtlQRSTSFRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVN---------RALKSSQHSLCSMMIVDT 836
Cdd:cd14902    322 IKAG--VEVMVLKLTPEQ----------AKEICGSLAKAIYGRLFTWLVRRLSdeinyfdsaVSISDEDEELATIGILDI 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  837 PGFQNPEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEPVaddsVAAVDqashlvrs 913
Cdd:cd14902    390 FGFESLNRNG------FEQLCINYANERLQAQFNEFVFVKEQQIYIAEGIDwknISYPSNAAC----LALFD-------- 451
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  914 lahaDEARGLLWLLEEEALVPGATEDALLDRLFSYYGPQEgdkkgqspllrsskprHFLLGHSHGTnwVEYNVAGWLNyT 993
Cdd:cd14902    452 ----DKSNGLFSLLDQECLMPKGSNQALSTKFYRYHGGLG----------------QFVVHHFAGR--VCYNVEQFVE-K 508
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  994 KQNPATQNAPRLLQDSQKKIISNLFL-GRAGSATVLSGSiagleggsqlALRRATSMrktfttgmaaVKKKSLCIQIKLQ 1072
Cdd:cd14902    509 NTDALPADASDILSSSSNEVVVAIGAdENRDSPGADNGA----------AGRRRYSM----------LRAPSVSAQFKSQ 568
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1073 VDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMY 1152
Cdd:cd14902    569 LDRLIVQIGRTEAHYVRCLKPNEVKKPG-----------------------------IFDRERMVEQMRSVGVLEAVRIA 619
                          730       740
                   ....*....|....*....|....*..
gi 1907082211 1153 RQGYPDHMVFSEFRRRFDVLAPHLTKK 1179
Cdd:cd14902    620 RHGYSVRLAHASFIELFSGFKCFLSTR 646
MYSc_Myo27 cd14888
class XXVII myosin, motor domain; Not much is known about this myosin class. The catalytic ...
468-1174 1.90e-51

class XXVII myosin, motor domain; Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276853 [Multi-domain]  Cd Length: 667  Bit Score: 194.91  E-value: 1.90e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLL-VLSTRGAPAVYSEKVM--HMFKGcrrEDMAPHIYAVAQTAYRAMLMSRQDQSIVL 544
Cdd:cd14888      2 SILHSLNLRFDIDEIYTFTGPILIaVNPFKTIPGLYSDEMLlkFIQPS---ISKSPHVFSTASSAYQGMCNNKKSQTILI 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  545 LGSSGSGKTTSFQHLVQYLATiAGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQ-------- 614
Cdd:cd14888     79 SGESGAGKTESTKYVMKFLAC-AGSEDIKKRSLVEAQVLESnpLLEAFGNARTLRNDNSSRFGKFIELQFSKlkskrmsg 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  615 -AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIV--PLSKPEEKQKAAQQFS 691
Cdd:cd14888    158 dRGRLCGAKIQTYLLEKVRVCDQQEGERNYHIFYQLCAAAREAKNTGLSYEENDEKLAKGADakPISIDMSSFEPHLKFR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  692 KLQ--------------------AAMKVLAISPEEQKTCWLILASIYHLGAAG-ATKEAAEAGRKQFA-RHEWAQKAAYL 749
Cdd:cd14888    238 YLTksschelpdvddleefestlYAMQTVGISPEEQNQIFSIVAAILYLGNILfENNEACSEGAVVSAsCTDDLEKVASL 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  750 LGCSLEELSSAifkhqLKGGTLQRSTSFRQGPEESGLGEGTKlsaleclEGMASGLYSELFTLLISLVNRAL-KSSQHSL 828
Cdd:cd14888    318 LGVDAEDLLNA-----LCYRTIKTAHEFYTKPLRVDEAEDVR-------DALARALYSCLFDKVVERTNESIgYSKDNSL 385
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  829 CSMMIVDTPGFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIElaFDDLE-PVADDSVAAVDQA 907
Cdd:cd14888    386 LFCGVLDIFGFE------CFQLNSFEQLCINFTNERLQQFFNNFVFKCEEKLYIEEGIS--WNPLDfPDNQDCVDLLQEK 457
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  908 SHLVrsLAHADEargllwlleeEALVPGATEDALLDRLFSYYGpqeGDKKGQSPllrSSKPRHFLLGHSHGTnwVEYNVA 987
Cdd:cd14888    458 PLGI--FCMLDE----------ECFVPGGKDQGLCNKLCQKHK---GHKRFDVV---KTDPNSFVIVHFAGP--VKYCSD 517
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  988 GWLNYTKqNPATQNAPRLLQDSQKKIISNLFlgragsatvlsgsiagleggsqlalrrATSMRKTFTTGMAAVKKKSLCI 1067
Cdd:cd14888    518 GFLEKNK-DQLSVDAQEVIKNSKNPFISNLF---------------------------SAYLRRGTDGNTKKKKFVTVSS 569
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1068 QIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPgeprsassrrvsssSELDlppgdpceagllQLDVSllrAQLRGSRLLD 1147
Cdd:cd14888    570 EFRNQLDVLMETIDKTEPHFIRCIKPNSQNVP--------------DLFD------------RISVN---EQLKYGGVLQ 620
                          730       740
                   ....*....|....*....|....*..
gi 1907082211 1148 AMRMYRQGYPDHMVFSEFRRRFDVLAP 1174
Cdd:cd14888    621 AVQVSRAGYPVRLSHAEFYNDYRILLN 647
MYSc_Myo40 cd14901
class XL myosin, motor domain; The class XL myosins are comprised of Stramenopiles. Not much ...
468-1215 2.47e-51

class XL myosin, motor domain; The class XL myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276866 [Multi-domain]  Cd Length: 655  Bit Score: 194.24  E-value: 2.47e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSE--KVMHMFKGCRRED----MAPHIYAVAQTAYRAMLMSR---- 537
Cdd:cd14901      2 SILHVLRRRFAHGLIYTSTGAILVAINPFRRLPLYDDetKEAYYEHGERRAAgerkLPPHVYAVADKAFRAMLFASrgqk 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  538 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW----QALST--LLEAFGNSPTIMNGSATRFSQILSLD 611
Cdd:cd14901     82 CDQSILVSGESGAGKTETTKIIMNYLASVSSATTHGQNATEREnvrdRVLESnpILEAFGNARTNRNNNSSRFGKFIRLG 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  612 FDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSKPEEKQKAAQQFS 691
Cdd:cd14901    162 FASSGSLLGASISTYLLERVRLVSQAKGERNYHIFYELLRGASSDELHALGLTHVEEYKYLNSSQCYDRRDGVDDSVQYA 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  692 KLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTL 771
Cdd:cd14901    242 KTRHAMTTIGMSPDEQISVLQLVAAVLHLGNLCFVKKDGEGGTFSMSSLANVRAACDLLGLDMDVLEKTLCTREIRAGGE 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  772 QRSTSFRqgPEESGLGEgtklsaleclEGMASGLYSELFTLLISLVNRALK--SSQHSLCSMMIVDTPGF----QNpewg 845
Cdd:cd14901    322 YITMPLS--VEQALLTR----------DVVAKTLYAQLFDWLVDRINESIAysESTGASRFIGIVDIFGFeifaTN---- 385
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  846 gsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAFDDLePVADDSVAAVDQASHLVRSLahADEargllw 925
Cdd:cd14901    386 ------SLEQLCINFANEKLQQLFGKFVFEMEQDEYVAEAIPWTFVEY-PNNDACVAMFEARPTGLFSL--LDE------ 450
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  926 lleeEALVPGATEDALLDrlfSYYgpqegDKKGQSPLLRSSK----PRHFLLGHSHGTnwVEYNVAGWLNYTKQNPATqN 1001
Cdd:cd14901    451 ----QCLLPRGNDEKLAN---KYY-----DLLAKHASFSVSKlqqgKRQFVIHHYAGA--VCYATDGFCDKNKDHVHS-E 515
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1002 APRLLQDSqkkiiSNLFLgragSATVLSgsiagleggsqlalrratsmrktfttgmaavkkkslciQIKLQVDALIDTIK 1081
Cdd:cd14901    516 ALALLRTS-----SNAFL----SSTVVA--------------------------------------KFKVQLSSLLEVLN 548
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1082 RSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDpcEAGLLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMV 1161
Cdd:cd14901    549 ATEPHFIRC---------------------------IKPND--VLSPSEFDAKRVLEQLRCSGVLEAVKISRSGYPVRFP 599
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1162 FSEFRRRFDVLAPHLTKKHGRNYIVVDEKRAVEELLESLDLEKSSCCLGLSRVF 1215
Cdd:cd14901    600 HDAFVHTYSCLAPDGASDTWKVNELAERLMSQLQHSELNIEHLPPFQVGKTKVF 653
MYSc_Myo46 cd14907
class XLVI myosin, motor domain; The class XLVI myosins are comprised of Alveolata. Not much ...
467-1172 1.12e-48

class XLVI myosin, motor domain; The class XLVI myosins are comprised of Alveolata. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276872 [Multi-domain]  Cd Length: 669  Bit Score: 186.78  E-value: 1.12e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRRE--------DMAPHIYAVAQTAYRAMLMSR 537
Cdd:cd14907      1 AELLINLKKRYQQDKIFTYVGPTLIVMNPyKQIDNLFSEEVMQMYKEQIIQngeyfdikKEPPHIYAIAALAFKQLFENN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  538 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAG----------------TSGTKVFSVEKwQALST--LLEAFGNSPTIMNG 599
Cdd:cd14907     81 KKQAIVISGESGAGKTENAKYAMKFLTQLSQqeqnseevltltssirATSKSTKSIEQ-KILSCnpILEAFGNAKTVRND 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  600 SATRFSQILSLDFD-QAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHL-NHLAENNVFgivPL 677
Cdd:cd14907    160 NSSRFGKYVSILVDkKKRKILGARIQNYLLEKSRVTQQGQGERNYHIFYHLLYGADQQLLQQLGLkNQLSGDRYD---YL 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  678 SKPE----EKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARH-EWAQKAAYLLGC 752
Cdd:cd14907    237 KKSNcyevDTINDEKLFKEVQQSFQTLGFTEEEQDSIWRILAAILLLGNLQFDDSTLDDNSPCCVKNkETLQIIAKLLGI 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  753 SLEELSSAIFKHQLKGGTlQRSTSfrqgpeesglgegtKLSALEC---LEGMASGLYSELFTLLISLVNRAL-------- 821
Cdd:cd14907    317 DEEELKEALTTKIRKVGN-QVITS--------------PLSKKECinnRDSLSKELYDRLFNWLVERLNDTImpkdekdq 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  822 KSSQHSLCSMMIVDTPGFQNPEwggsarGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE-----LAFDDLEPV 896
Cdd:cd14907    382 QLFQNKYLSIGLLDIFGFEVFQ------NNSFEQLCINYTNEKLQQLYISYVFKAEEQEFKEEGLEdylnqLSYTDNQDV 455
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  897 AD--DS-----VAAVDQASHLvrslahadeargllwlleeealvPGATEDALLDRLFSYYGpqeGDKKGQSPllRSSKPR 969
Cdd:cd14907    456 IDllDKppigiFNLLDDSCKL-----------------------ATGTDEKLLNKIKKQHK---NNSKLIFP--NKINKD 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  970 HFLLGHSHGTnwVEYNVAGWL--NYTKQNPATQNaprLLQDSQKKIISNLFLGRAGSATVLSGSIAGleggsqlalrrat 1047
Cdd:cd14907    508 TFTIRHTAKE--VEYNIEGFRekNKDEISQSIIN---CIQNSKNRIISSIFSGEDGSQQQNQSKQKK------------- 569
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1048 smrktfttgmAAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGwpgeprsassrrvsssseldlppgdpcEAG 1127
Cdd:cd14907    570 ----------SQKKDKFLGSKFRNQMKQLMNELMQCDVHFIRCIKPNEEK---------------------------KAD 612
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1128 LLQLDVSLLraQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVL 1172
Cdd:cd14907    613 LFIQGYVLN--QIRYLGVLESIRVRKQGYPYRKSYEDFYKQYSLL 655
MYSc_Myo34 cd14895
class XXXIV myosin, motor domain; Class XXXIV myosins are composed of an IQ motif, a short ...
468-1173 1.04e-45

class XXXIV myosin, motor domain; Class XXXIV myosins are composed of an IQ motif, a short coiled-coil region, 5 tandem ANK repeats, and a carboxy-terminal FYVE domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276860 [Multi-domain]  Cd Length: 704  Bit Score: 178.22  E-value: 1.04e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSekvMHMFkgcrREDM------APHIYAVAQTAYRAMLM----- 535
Cdd:cd14895      2 AFVDYLAQRYGVDQVYCRSGAVLIAVNPfKHIPGLYD---LHKY----REEMpgwtalPPHVFSIAEGAYRSLRRrlhep 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  536 --SRQDQSIVLLGSSGSGKTTSFQHLVQYLA-----TIAGTSGTKVFSVEKWQALST--LLEAFGNSPTIMNGSATRFSQ 606
Cdd:cd14895     75 gaSKKNQTILVSGESGAGKTETTKFIMNYLAesskhTTATSSSKRRRAISGSELLSAnpILESFGNARTLRNDNSSRFGK 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  607 ILSLDF-----DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNvFGIVPLSKPE 681
Cdd:cd14895    155 FVRMFFeghelDTSLRMIGTSVETYLLEKVRVVHQNDGERNFHVFYELLAGAADDMKLELQLELLSAQE-FQYISGGQCY 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  682 EKQKAAQ---QFSKLQAAMKVLAISPEEQKTCWLILASIYHLG-----------------AAGATKEAAEAGRKQFARHE 741
Cdd:cd14895    234 QRNDGVRddkQFQLVLQSMKVLGFTDVEQAAIWKILSALLHLGnvlfvassedegeedngAASAPCRLASASPSSLTVQQ 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  742 WAQKAAYLLGCSLEELSSAIFKHQLKGGtlqrstsfrqgpEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVNRAL 821
Cdd:cd14895    314 HLDIVSKLFAVDQDELVSALTTRKISVG------------GETFHANLSLAQCGDARDAMARSLYAFLFQFLVSKVNSAS 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  822 KSSQHSLCS-----------MMIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 890
Cdd:cd14895    382 PQRQFALNPnkaankdttpcIAVLDIFGFEEFEVN------QFEQFCINYANEKLQYQFIQDILLTEQQAHIEEGIKWNA 455
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  891 DDLEpvaDDSVA--AVDQASHLVRSLAhaDEargllwlleeEALVPGATEDALLDRLFSYYGpqegDKKGQSPLLRSSKP 968
Cdd:cd14895    456 VDYE---DNSVCleMLEQRPSGIFSLL--DE----------ECVVPKGSDAGFARKLYQRLQ----EHSNFSASRTDQAD 516
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  969 RHFLLGHSHGTnwVEYNVAGWLNYTKQNPaTQNAPRLLQDSQKKIISNLFlgragsaTVLSGSIAGLEGGSQLALRRATS 1048
Cdd:cd14895    517 VAFQIHHYAGA--VRYQAEGFCEKNKDQP-NAELFSVLGKTSDAHLRELF-------EFFKASESAELSLGQPKLRRRSS 586
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1049 MRKTFTTGMaavkkkslciQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGl 1128
Cdd:cd14895    587 VLSSVGIGS----------QFKQQLASLLDVVQQTQTHYIRC---------------------------IKPNDESASD- 628
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1129 lQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1173
Cdd:cd14895    629 -QFDMAKVSSQLRYGGVLKAVEIMRQSYPVRMKHADFVKQYRLLV 672
MYSc_Myo39 cd14900
class XXXIX myosin, motor domain; The class XXXIX myosins are found in Stramenopiles. Not much ...
468-1188 1.89e-44

class XXXIX myosin, motor domain; The class XXXIX myosins are found in Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276865  Cd Length: 627  Bit Score: 173.18  E-value: 1.89e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF-------------KGcrREDMAPHIYAVAQTAYRAM 533
Cdd:cd14900      2 TILSALETRFYAQKIYTNTGAILLAVNPfQKLPGLYSSDTMAKYllsfearssstrnKG--SDPMPPHIYQVAGEAYKAM 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  534 LMSR----QDQSIVLLGSSGSGKTTSFQHLVQYLA---------TIAGTSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGS 600
Cdd:cd14900     80 MLGLngvmSDQSILVSGESGSGKTESTKFLMEYLAqagdnnlaaSVSMGKSTSGIAAKVLQT-NILLESFGNARTLRNDN 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  601 ATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLAcGDAtlrtelhlnhlaennvfgivplskp 680
Cdd:cd14900    159 SSRFGKFIKLHFTSGGRLTGASIQTYLLEKVRLVSQSKGERNYHIFYEMAI-GAS------------------------- 212
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  681 eEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGaaGATKEAAEAGrkqfarHEWAQKAAYLLGCSLEELSSA 760
Cdd:cd14900    213 -EAARKRDMYRRVMDAMDIIGFTPHERAGIFDLLAALLHIG--NLTFEHDENS------DRLGQLKSDLAPSSIWSRDAA 283
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  761 IFKHQLKGGTLQRSTS---FRQGPEESGLgegtKLSALEC---LEGMASGLYSELFTLLISLVNRALK-----SSQHSLC 829
Cdd:cd14900    284 ATLLSVDATKLEKALSvrrIRAGTDFVSM----KLSAAQAnnaRDALAKALYGRLFDWLVGKMNAFLKmddssKSHGGLH 359
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  830 SMMIVDTPGF----QNpewggsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE---LAFDDLEpvadDSVA 902
Cdd:cd14900    360 FIGILDIFGFevfpKN----------SFEQLCINFANETLQQQFNDYVFKAEQREYESQGVDwkyVEFCDNQ----DCVN 425
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  903 AVDQASHLVRSLahADEargllwlleeEALVPGATEDALLDRLFSyygpqegdKKGQSPLLRSSKPRH----FLLGHSHG 978
Cdd:cd14900    426 LISQRPTGILSL--IDE----------ECVMPKGSDTTLASKLYR--------ACGSHPRFSASRIQRarglFTIVHYAG 485
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  979 TnwVEYNVAGWLNytkqnpatQNAPRLLQDsqkkiISNLFLGragsatvlsgsiagleggsqlalrratsmrktfttgma 1058
Cdd:cd14900    486 H--VEYSTDGFLE--------KNKDVLHQE-----AVDLFVY-------------------------------------- 512
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1059 avkkkslCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAGLLQLDVSLlrA 1138
Cdd:cd14900    513 -------GLQFKEQLTTLLETLQQTNPHYVRC---------------------------LKPNDLCKAGIYERERVL--N 556
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1139 QLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA----PHLTKKHGrnYIVVD 1188
Cdd:cd14900    557 QLRCNGVMEAVRVARAGFPIRLLHDEFVARYFSLAraknRLLAKKQG--TSLPD 608
MYSc_Myo30 cd14891
class XXX myosin, motor domain; Myosins of class XXX are composed of an amino-terminal ...
507-887 3.13e-44

class XXX myosin, motor domain; Myosins of class XXX are composed of an amino-terminal SH3-like domain, two IQ motifs, a coiled-coil region and a PX domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276856  Cd Length: 645  Bit Score: 172.92  E-value: 3.13e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  507 MHMFKGCRREDMAPHIYAVAQTAYRAMLMSR---QDQSIVLLGSSGSGKTTSFQHLVQYLAT-----IAGTSGTKVFSVE 578
Cdd:cd14891     40 KSDYINTPLDPCPPHPYAIAEMAYQQMCLGSgrmQNQSIVISGESGAGKTETSKIILRFLTTravggKKASGQDIEQSSK 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  579 KWQALST-----------LLEAFGNSPTIMNGSATRFSQILSLDF-DQAGQVASASIQTMLLEKLRVARRPASEATFNVF 646
Cdd:cd14891    120 KRKLSVTslderlmdtnpILESFGNAKTLRNHNSSRFGKFMKLQFtKDKFKLAGAFIETYLLEKSRLVAQPPGERNFHIF 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  647 YYLLACGDATLRTELHlnhlaennvfgivpLSKPEEKQKAAQ-------------QFSKLQAAMKVLAISPEEQKTCWLI 713
Cdd:cd14891    200 YQLLAGASAELLKELL--------------LLSPEDFIYLNQsgcvsddniddaaNFDNVVSALDTVGIDEDLQLQIWRI 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  714 LASIYHLG---------AAGATKEAAEAGRKQFArhewaqKAAYLLGCSLEELSSAIfkhqlkggtLQRSTSFRqgpees 784
Cdd:cd14891    266 LAGLLHLGniefdeedtSEGEAEIASESDKEALA------TAAELLGVDEEALEKVI---------TQREIVTR------ 324
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  785 GLGEGTKLSALECL---EGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEwggsaRGASFEELCHNYA 861
Cdd:cd14891    325 GETFTIKRNAREAVysrDAIAKSIYERLFLWIVQQINTSLGHDPDPLPYIGVLDIFGFESFE-----TKNDFEQLLINYA 399
                          410       420
                   ....*....|....*....|....*.
gi 1907082211  862 QDRLQRLFHERTFLQELERYKEDNIE 887
Cdd:cd14891    400 NEALQATFNQQVFIAEQELYKSEGID 425
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1268-1962 5.22e-44

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 176.13  E-value: 5.22e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1268 NIKKNKGVKDWPWWKLFTTVRPLIQVQLSEEQIRNKDE-EIQQLRSKLEKVEKERNELrlssdrlETRISELTSELtder 1346
Cdd:pfam01576  374 NLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESERQRAEL-------AEKLSKLQSEL---- 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1347 ntgESASQLLDAETAERLRTEKEMKELQTQ-YDALKKQMEVMEMEVMEARLIRA--AEINGEVDDDDAGGEWRLKYERAV 1423
Cdd:pfam01576  443 ---ESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQleDERNSLQEQLEEEEEAKRNVERQL 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1424 REVDF----TKKRLQQELEdKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE 1499
Cdd:pfam01576  520 STLQAqlsdMKKKLEEDAG-TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSN 598
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1500 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSqeSKDEA 1579
Cdd:pfam01576  599 LEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVS--SKDDV 676
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 --SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEV 1657
Cdd:pfam01576  677 gkNVHELERSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGEEKRRQLVKQVRELEA 756
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1658 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrKDLKRTKALLADAQIMLDHLKNN-----APSKREIAQL 1732
Cdd:pfam01576  757 ELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAV-KQLKKLQAQMKDLQRELEEARASrdeilAQSKESEKKL 835
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KN------QLEESeftCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1806
Cdd:pfam01576  836 KNleaellQLQED---LAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRL 912
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 KAAVAQ---------ASRDMAQMND-LQAQIEESNKekqELQEKLQALQSQVEFLEqsmvdKSLVSRQEAKIRELETRLE 1876
Cdd:pfam01576  913 RKSTLQveqlttelaAERSTSQKSEsARQQLERQNK---ELKAKLQEMEGTVKSKF-----KSSIAALEAKIAQLEEQLE 984
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1877 FEKTQ----VKRLENLASRLKETMEKLTEER---DQRAAAENREKEQNKRLQRQLRDTKEEMSelarkEAEASRKKheLE 1949
Cdd:pfam01576  985 QESRErqaaNKLVRRTEKKLKEVLLQVEDERrhaDQYKDQAEKGNSRMKQLKRQLEEAEEEAS-----RANAARRK--LQ 1057
                          730
                   ....*....|...
gi 1907082211 1950 MDLESLEAANQSL 1962
Cdd:pfam01576 1058 RELDDATESNESM 1070
MYSc_Myo6 cd01382
class VI myosin, motor domain; Myosin VI is a monomeric myosin, which moves towards the ...
468-882 1.08e-43

class VI myosin, motor domain; Myosin VI is a monomeric myosin, which moves towards the minus-end of actin filaments, in contrast to most other myosins which moves towards the plus-end of actin filaments. It is thought that myosin VI, unlike plus-end directed myosins, does not use a pure lever arm mechanism, but instead steps with a mechanism analogous to the kinesin neck-linker uncoupling model. It has been implicated in a myriad of functions including: the transport of cytoplasmic organelles, maintenance of normal Golgi morphology, endocytosis, secretion, cell migration, border cell migration during development, and in cancer metastasis playing roles in deafness and retinal development among others. While how this is accomplished is largely unknown there are several interacting proteins that have been identified such as disabled homolog 2 (DAB2), GIPC1, synapse-associated protein 97 (SAP97; also known as DLG1) and optineurin, which have been found to target myosin VI to different cellular compartments. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the minus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276833  Cd Length: 649  Bit Score: 171.28  E-value: 1.08e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd01382      2 TLLNNIRVRYSKDKIYTYVANILIAVNPyFDIPKLYSSETIKSYQGKSLGTLPPHVFAIADKAYRDMKVLKQSQSIIVSG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIAGTSGTKVFS--VEKwqalSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQ 624
Cdd:cd01382     82 ESGAGKTESTKYILRYLTESWGSGAGPIEQriLEA----NPLLEAFGNAKTVRNNNSSRFGKFVEIHFNEKSSVVGGFVS 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  625 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENnvfgivplskpeekqkaAQQFSKLQAAMKVLAISP 704
Cdd:cd01382    158 HYLLEKSRICVQSKEERNYHIFYRLCAGAPEDLREKLLKDPLLDD-----------------VGDFIRMDKAMKKIGLSD 220
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  705 EEQKTCWLILASIYHLG--AAGATKEAAEAGRKQFARHEWA-QKAAYLLGCSLEELSSAI---FKHQLKGGTlqRSTSFR 778
Cdd:cd01382    221 EEKLDIFRVVAAVLHLGniEFEENGSDSGGGCNVKPKSEQSlEYAAELLGLDQDELRVSLttrVMQTTRGGA--KGTVIK 298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  779 QG--PEESGlgegtklSALECLegmASGLYSELFTLLISLVNRAL--KSSQHSLCsmmIVDTPGFQ----Npewggsarg 850
Cdd:cd01382    299 VPlkVEEAN-------NARDAL---AKAIYSKLFDHIVNRINQCIpfETSSYFIG---VLDIAGFEyfevN--------- 356
                          410       420       430
                   ....*....|....*....|....*....|..
gi 1907082211  851 aSFEELCHNYAQDRLQRLFHERTFLQELERYK 882
Cdd:cd01382    357 -SFEQFCINYCNEKLQQFFNERILKEEQELYE 387
MYSc_Myo45 cd14906
class XLV myosin, motor domain; The class XLVI myosins are comprised of slime molds ...
468-1185 5.49e-40

class XLV myosin, motor domain; The class XLVI myosins are comprised of slime molds Dictyostelium and Polysphondylium. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276871 [Multi-domain]  Cd Length: 715  Bit Score: 160.92  E-value: 5.49e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 545
Cdd:cd14906      2 IILNNLGKRYKSDSIYTYIGNVLISINPyKDISSIYSNLILNEYKDINQnKSPIPHIYAVALRAYQSMVSEKKNQSIIIS 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATiagTSGTKVF----------SVEKWQALST-LLEAFGNSPTIMNGSATRFSQILSLDFDQ 614
Cdd:cd14906     82 GESGSGKTEASKTILQYLIN---TSSSNQQqnnnnnnnnnSIEKDILTSNpILEAFGNSRTTKNHNSSRFGKFLKIEFRS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  615 A-GQVASASIQTMLLEKLRVARRP-ASEATFNVFYYLLACGDATLRTELHLNH--------LAENNVFGIV------PLS 678
Cdd:cd14906    159 SdGKIDGASIETYLLEKSRISHRPdNINLSYHIFYYLVYGASKDERSKWGLNNdpskyrylDARDDVISSFksqssnKNS 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  679 KPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEA--AEAGRKQFARHEWAQKAAYLLGCSLEE 756
Cdd:cd14906    239 NHNNKTESIESFQLLKQSMESMSINKEQCDAIFLSLAAILHLGNIEFEEDSdfSKYAYQKDKVTASLESVSKLLGYIESV 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  757 LSSAIFKHQLKGGTlqRSTSFRQgPEESGLGEGTKlsaleclEGMASGLYSELFTLLISLVNR-----------ALKSSQ 825
Cdd:cd14906    319 FKQALLNRNLKAGG--RGSVYCR-PMEVAQSEQTR-------DALSKSLYVRLFKYIVEKINRkfnqntqsndlAGGSNK 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  826 HSLCSMMIVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELA---FDDlepvADDSVA 902
Cdd:cd14906    389 KNNLFIGVLDIFGFEN------LSSNSLEQLLINFTNEKLQQQFNLNVFENEQKEYLSEGIPWSnsnFID----NKECIE 458
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  903 AVDQASHLVRSLAHaDEArgllwlleeeaLVPGATEDALLDRLFSYYgpqegdKKGQSPLLRSSKPRHFLLGHSHGTnwV 982
Cdd:cd14906    459 LIEKKSDGILSLLD-DEC-----------IMPKGSEQSLLEKYNKQY------HNTNQYYQRTLAKGTLGIKHFAGD--V 518
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  983 EYNVAGWLNYTKQNPATqNAPRLLQDSQKKIISNLFLGRAGSATvlsgsiagleggsqlalrrATSMRKTFTTGMAAvkk 1062
Cdd:cd14906    519 TYQTDGWLEKNRDSLYS-DVEDLLLASSNFLKKSLFQQQITSTT-------------------NTTKKQTQSNTVSG--- 575
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1063 kslciQIKLQVDALIDTIKRSKMHFVHCFLPvaegwpgeprsassrrvssssELDLPPGDpceagllqLDVSLLRAQLRG 1142
Cdd:cd14906    576 -----QFLEQLNQLIQTINSTSVHYIRCIKP---------------------NQTMDCNN--------FNNVHVLSQLRN 621
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1907082211 1143 SRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHGRNYI 1185
Cdd:cd14906    622 VGVLNTIKVRKMGYSYRRDFNQFFSRYKCIVDMYNRKNNNNPK 664
MYSc_Myo16 cd14878
class XVI myosin, motor domain; These XVI type myosins are also known as Neuronal ...
467-1173 4.42e-39

class XVI myosin, motor domain; These XVI type myosins are also known as Neuronal tyrosine-phosphorylated phosphoinositide-3-kinase adapter 3/NYAP3. Myo16 is thought to play a regulatory role in cell cycle progression and has been recently implicated in Schizophrenia. Class XVI myosins are characterized by an N-terminal ankyrin repeat domain and some with chitin synthase domains that arose independently from the ones in the class XVII fungal myosins. They bind protein phosphatase 1 catalytic subunits 1alpha/PPP1CA and 1gamma/PPP1CC. Human Myo16 interacts with ACOT9, ARHGAP26 and PIK3R2 and with components of the WAVE1 complex, CYFIP1 and NCKAP1. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276844 [Multi-domain]  Cd Length: 656  Bit Score: 157.28  E-value: 4.42e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMF---KGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIV 543
Cdd:cd14878      1 SSLLYEIQKRFGNNQIYTFIGDILLLVNPYKELPIYSTMVSQLYlssSGQLCSSLPPHLFSCAERAFHQLFQERRPQCFI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  544 LLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSveKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDF-DQAGQVASAS 622
Cdd:cd14878     81 LSGERGSGKTEASKQIMKHLTCRASSSRTTFDS--RFKHVNCILEAFGHAKTTLNDLSSCFIKYFELQFcERKKHLTGAR 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  623 IQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgiVPLSKPEEKQKAA-----QQFSKLQAAM 697
Cdd:cd14878    159 IYTYMLEKSRLVSQPPGQSNFLIFYLLMDGLSAEEKYGLHLNNLCAHRY---LNQTMREDVSTAErslnrEKLAVLKQAL 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  698 KVLAISPEEQKTCWLILASIYHLGAAGATKeAAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFK--HQLKGGTLQRST 775
Cdd:cd14878    236 NVVGFSSLEVENLFVILSAILHLGDIRFTA-LTEADSAFVSDLQLLEQVAGMLQVSTDELASALTTdiQYFKGDMIIRRH 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  776 SFRQGPEESGLgegtklsaleclegMASGLYSELFTLLISLVNRALKsSQHSLCSMM-----IVDTPGFQNpewggsARG 850
Cdd:cd14878    315 TIQIAEFYRDL--------------LAKSLYSRLFSFLVNTVNCCLQ-SQDEQKSMQtldigILDIFGFEE------FQK 373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  851 ASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE-------------LAFDDLEPVADDSVaaVDQASHLVRSLAHA 917
Cdd:cd14878    374 NEFEQLCVNMTNEKMHHYINEVLFLQEQTECVQEGVTmetayspgnqtgvLDFFFQKPSGFLSL--LDEESQMIWSVEPN 451
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  918 DEARgllwlleeealVPGATEDALLDRLFSYYGPQEGD--KKGQSPLlrsskprhFLLGHSHGTnwVEYNVAGWLNYTKq 995
Cdd:cd14878    452 LPKK-----------LQSLLESSNTNAVYSPMKDGNGNvaLKDQGTA--------FTVMHYAGR--VMYEIVGAIEKNK- 509
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  996 NPATQNAPRLLQDSQKKIISNLFLGRAgsATVlsgsiaglegGSQLalrratsmrktfttgmaavkKKSLC-IQIKLQvd 1074
Cdd:cd14878    510 DSLSQNLLFVMKTSENVVINHLFQSKL--VTI----------ASQL--------------------RKSLAdIIGKLQ-- 555
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1075 alidtikRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQ 1154
Cdd:cd14878    556 -------KCTPHFIHCIKPNNSKLPD-----------------------------TFDNFYVSAQLQYIGVLEMVKIFRY 599
                          730
                   ....*....|....*....
gi 1907082211 1155 GYPDHMVFSEFRRRFDVLA 1173
Cdd:cd14878    600 GYPVRLSFSDFLSRYKPLA 618
MYSc_Myo26 cd14887
class XXVI myosin, motor domain; These MyTH-FERM myosins are thought to be related to the ...
470-1217 4.05e-37

class XXVI myosin, motor domain; These MyTH-FERM myosins are thought to be related to the other myosins that have a MyTH4 domain such as class III, VII, IX, X , XV, XVI, XVII, XX, XXII, XXV, and XXXIV. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276852  Cd Length: 725  Bit Score: 152.11  E-value: 4.05e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  470 LHTLRQRYGA--------SLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQS 541
Cdd:cd14887      4 LENLYQRYNKayinkenrNCIYTYTGTLLIAVNPYRFFNLYDRQWISRFDTEANSRLVPHPFGLAEFAYCRLVRDRRSQS 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  542 IVLLGSSGSGKTTSFQHLVQYLATI------AGTSGTKvfsvEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQA 615
Cdd:cd14887     84 ILISGESGAGKTETSKHVLTYLAAVsdrrhgADSQGLE----ARLLQSGPVLEAFGNAHTVLNANSSRFGKMLLLHFTGR 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  616 GQVASASIQTMLLEKLRVARRPASEATFNVFYYLlaCGDATLRTELHlnhlaennvfgivplSKPEEKQKAAQQFSKLQA 695
Cdd:cd14887    160 GKLTRASVATYLLANERVVRIPSDEFSFHIFYAL--CNAAVAAATQK---------------SSAGEGDPESTDLRRITA 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFARH-------EWAQKAAYLLgcSLEELSSAIFKHQLKG 768
Cdd:cd14887    223 AMKTVGIGGGEQADIFKLLAAILHLGNVEFTTDQEPETSKKRKLTsvsvgceETAADRSHSS--EVKCLSSGLKVTEASR 300
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  769 GTLQRSTSFRQGPEESGLGEGTKLS-----------------ALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSM 831
Cdd:cd14887    301 KHLKTVARLLGLPPGVEGEEMLRLAlvsrsvretrsffdldgAAAARDAACKNLYSRAFDAVVARINAGLQRSAKPSESD 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  832 MIVDTP--------------GFQNPEWGGSARgasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL-----AFDD 892
Cdd:cd14887    381 SDEDTPsttgtqtigildlfGFEDLRNHSKNR---LEQLCINYANERLHCFLLEQLILNEHMLYTQEGVFQnqdcsAFPF 457
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  893 LEPVADDSVAAVDQASHLvrSLAHADEARGLLWLLEEEALVPGATEDALLdrlfsyygpqegdkkgQSPLLRSSKPRHFL 972
Cdd:cd14887    458 SFPLASTLTSSPSSTSPF--SPTPSFRSSSAFATSPSLPSSLSSLSSSLS----------------SSPPVWEGRDNSDL 519
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  973 LGHShgtnwVEYNVAGWLNYTKQNPAtqnaprlLQDSQKKIISNLFLGRA---------GSATVLSGSIAGLEGGSQLAL 1043
Cdd:cd14887    520 FYEK-----LNKNIINSAKYKNITPA-------LSRENLEFTVSHFACDVtydardfcrANREATSDELERLFLACSTYT 587
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1044 RRATSMRKTFTTGMAAvKKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGwpgeprsassrrvsssseldlppgdp 1123
Cdd:cd14887    588 RLVGSKKNSGVRAISS-RRSTLSAQFASQLQQVLKALQETSCHFIRCVKPNRVQ-------------------------- 640
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1124 cEAGLlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKKHgrnyivVDEKRAVEELLESLDLE 1203
Cdd:cd14887    641 -EAGI--FEDAYVHRQLRCSGMSDLLRVMADGFPCRLPYVELWRRYETKLPMALREA------LTPKMFCKIVLMFLEIN 711
                          810
                   ....*....|....
gi 1907082211 1204 KSSCCLGLSRVFFR 1217
Cdd:cd14887    712 SNSYTFGKTKIFFR 725
MYSc_Myo47 cd14908
class XLVII myosin, motor domain; The class XLVII myosins are comprised of Stramenopiles. Not ...
468-890 8.11e-37

class XLVII myosin, motor domain; The class XLVII myosins are comprised of Stramenopiles. Not much is known about this myosin class. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276873 [Multi-domain]  Cd Length: 682  Bit Score: 150.44  E-value: 8.11e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFK--GCRRE-------DMAPHIYAVAQTAYRAMLM-SR 537
Cdd:cd14908      2 AILHSLSRRFFRGIIYTWTGPVLIAVNPFQRLPLYGKEILESYRqeGLLRSqgiespqALGPHVFAIADRSYRQMMSeIR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  538 QDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALSTL---------LEAFGNSPTIMNGSATRFSQIL 608
Cdd:cd14908     82 ASQSILISGESGAGKTESTKIVMLYLTTLGNGEEGAPNEGEELGKLSIMdrvlqsnpiLEAFGNARTLRNDNSSRFGKFI 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  609 SLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNvfgivpLSKPEEKQKAAQ 688
Cdd:cd14908    162 ELGFNRAGNLLGAKVQTYLLEKVRLPFHASGERNYHIFYQLLRGGDEEEHEKYEFHDGITGG------LQLPNEFHYTGQ 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  689 -------------QFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAG--ATKEAAEAGRKQFARHEWAQKAAYLLGCS 753
Cdd:cd14908    236 ggapdlreftdedGLVYTLKAMRTMGWEESSIDTILDIIAGLLHLGQLEfeSKEEDGAAEIAEEGNEKCLARVAKLLGVD 315
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  754 LEELSSAIFKHQLKGGTLQRSTSFrqgpeesglgegTKLSALECLEGMASGLYSELFTLLISLVNRALK--SSQHSLCSM 831
Cdd:cd14908    316 VDKLLRALTSKIIVVRGKEITTKL------------TPHKAYDARDALAKTIYGALFLWVVATVNSSINweNDKDIRSSV 383
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211  832 MIVDTPGFQNPEWGgsargaSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 890
Cdd:cd14908    384 GVLDIFGFECFAHN------SFEQLCINFTNEALQQQFNQFIFKLEQKEYEKESIEWAF 436
MYSc_Myo19 cd14880
class XIX myosin, motor domain; Monomeric myosin-XIX (Myo19) functions as an actin-based motor ...
467-890 1.45e-36

class XIX myosin, motor domain; Monomeric myosin-XIX (Myo19) functions as an actin-based motor for mitochondrial movement in vertebrate cells. It contains a variable number of IQ domains. Human myo19 contains a motor domain, three IQ motifs, and a short tail. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276846 [Multi-domain]  Cd Length: 658  Bit Score: 149.62  E-value: 1.45e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQ--DQSI 542
Cdd:cd14880      1 ETVLRCLQARYTADTFYTNAGCTLVALNPfKPVPQLYSPELMREYHAAPQpQKLKPHIFTVGEQTYRNVKSLIEpvNQSI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  543 VLLGSSGSGKTTSFQHLVQYLATIAG--TSGTKVFSVEKWQAL----STLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd14880     81 VVSGESGAGKTWTSRCLMKFYAVVAAspTSWESHKIAERIEQRilnsNPVMEAFGNACTLRNNNSSRFGKFIQLQLNRAQ 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  617 QVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELhlnHLAENNVFGIVPLSkpeEKQKAAQQFSKLQAA 696
Cdd:cd14880    161 QMTGAAVQTYLLEKTRVACQAPSERNFHIFYQICKGASADERLQW---HLPEGAAFSWLPNP---ERNLEEDCFEVTREA 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAGRKQFA--RHEWAQKAAYLLGCSLEELSSAIFKHQLKGGTLQRs 774
Cdd:cd14880    235 MLHLGIDTPTQNNIFKVLAGLLHLGNIQFADSEDEAQPCQPMddTKESVRTSALLLKLPEDHLLETLQIRTIRAGKQQQ- 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 tSFRQgpeesglgegtKLSALEC---LEGMASGLYSELFTLLISLVNRALKSSQHSLCSMM-IVDTPGFQN-PEwggsar 849
Cdd:cd14880    314 -VFKK-----------PCSRAECdtrRDCLAKLIYARLFDWLVSVINSSICADTDSWTTFIgLLDVYGFESfPE------ 375
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|.
gi 1907082211  850 gASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELAF 890
Cdd:cd14880    376 -NSLEQLCINYANEKLQQHFVAHYLRAQQEEYAVEGLEWSF 415
MYSc_Myo37 cd14898
class XXXVII myosin, motor domain; The class XXXVIII myosins are comprised of fungi. Not much ...
468-1176 1.56e-36

class XXXVII myosin, motor domain; The class XXXVIII myosins are comprised of fungi. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276863  Cd Length: 578  Bit Score: 148.12  E-value: 1.56e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGpsLLVLSTRGAPAVYSEKVMHMFKGCRREdMAPHIYAVAQTAYRAMLMSrQDQSIVLLGS 547
Cdd:cd14898      2 ATLEILEKRYASGKIYTKSG--LVFLALNPYETIYGAGAMKAYLKNYSH-VEPHVYDVAEASVQDLLVH-GNQTIVISGE 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  548 SGSGKTTSFQHLVQYLatIAGTSGTKvfSVEK-WQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDqaGQVASASIQTM 626
Cdd:cd14898     78 SGSGKTENAKLVIKYL--VERTASTT--SIEKlITAANLILEAFGNAKTQLNDNSSRFGKRIKLKFD--GKITGAKFETY 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVFGIVPLSkpeekqkaaQQFSKLQAAMKVLAI-SPE 705
Cdd:cd14898    152 LLEKSRVTHHEKGERNFHIFYQFCASKRLNIKNDFIDTSSTAGNKESIVQLS---------EKYKMTCSAMKSLGIaNFK 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  706 EQKTCwliLASIYHLGAAGATKEaaeaGRKQFARHEWAQKAAYLLGCSLEELSSAIFKH--QLKGGTLQRSTSFRQgpee 783
Cdd:cd14898    223 SIEDC---LLGILYLGSIQFVND----GILKLQRNESFTEFCKLHNIQEEDFEESLVKFsiQVKGETIEVFNTLKQ---- 291
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  784 sglgegtklsALECLEGMASGLYSELFTLLISLVNRALK-SSQHSLCsmmIVDTPGFQNPEWGGsargasFEELCHNYAQ 862
Cdd:cd14898    292 ----------ARTIRNSMARLLYSNVFNYITASINNCLEgSGERSIS---VLDIFGFEIFESNG------LDQLCINWTN 352
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  863 DRLQRLFHERTFLQELERYKEDNIElaFDDLEPVADDSVaavdqashlvrsLAHADEARGLLWLLEEEALVPGATEDALL 942
Cdd:cd14898    353 EKIQNDFIKKMFRAKQGMYKEEGIE--WPDVEFFDNNQC------------IRDFEKPCGLMDLISEESFNAWGNVKNLL 418
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  943 DRLFSYYGPQEGDKKGQSPLLrsskprhfllghSHGTNWVEYNVAGWLNYTKQNpatqnaprllqdSQKKIISNLFLGRA 1022
Cdd:cd14898    419 VKIKKYLNGFINTKARDKIKV------------SHYAGDVEYDLRDFLDKNREK------------GQLLIFKNLLINDE 474
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1023 GSatvlsgsiagleggsqlalrratsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPgep 1102
Cdd:cd14898    475 GS-------------------------------------KEDLVKYFKDSMNKLLNSINETQAKYIKCIRPNEECRP--- 514
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1103 rsassrrvsssseldlppgdpceaglLQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHL 1176
Cdd:cd14898    515 --------------------------WCFDRDLVSKQLAECGILETIRLSKQCFPQEIPKDRFEERYRILGITL 562
MYSc_Myo38 cd14899
class XXXVIII myosin; The class XXXVIII myosins are comprised of Stramenopiles. Not much is ...
467-1169 1.73e-35

class XXXVIII myosin; The class XXXVIII myosins are comprised of Stramenopiles. Not much is known about this myosin class. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276864 [Multi-domain]  Cd Length: 717  Bit Score: 146.78  E-value: 1.73e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF----------KGCRREDMAPHIYAVAQTAYRAMLM 535
Cdd:cd14899      1 ASILNALRLRYERHAIYTHIGDILISINPfQDLPQLYGDEILRGYaydhnsqfgdRVTSTDPREPHLFAVARAAYIDIVQ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  536 SRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKW-------------QALST--LLEAFGNSPTIMNGS 600
Cdd:cd14899     81 NGRSQSILISGESGAGKTEATKIIMTYFAVHCGTGNNNLTNSESIsppaspsrttieeQVLQSnpILEAFGNARTVRNDN 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  601 ATRFSQILSLDF-DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLA----CGDATLRTELHLNHLAEN-NVFGI 674
Cdd:cd14899    161 SSRFGKFIELRFrDERRRLAGARIRTYLLEKIRVIKQAPHERNFHIFYELLSadnnCVSKEQKQVLALSGGPQSfRLLNQ 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  675 VPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAA-----------GATKEAAEAGRKQFARHEWA 743
Cdd:cd14899    241 SLCSKRRDGVKDGVQFRATKRAMQQLGMSEGEIGGVLEIVAAVLHMGNVdfeqiphkgddTVFADEARVMSSTTGAFDHF 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  744 QKAAYLLGCSLEELSSAIFKHQLKGGtlqrSTSFRQGPEESGLGEGTKLSALEClegmasglYSELFTLLISLVNRALK- 822
Cdd:cd14899    321 TKAAELLGVSTEALDHALTKRWLHAS----NETLVVGVDVAHARNTRNALTMEC--------YRLLFEWLVARVNNKLQr 388
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  823 --------------SSQHSLCSMMIVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIEL 888
Cdd:cd14899    389 qasapwgadesdvdDEEDATDFIGLLDIFGFED------MAENSFEQLCINYANEALQHQFNQYIFEEEQRLYRDEGIRW 462
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  889 AFDDLEpvadDSVAAVDQASHlvrslahadEARGLLWLLEEEALVPGATEDALLDRlfsYYgpQEGDKKGQSPLLRSS-- 966
Cdd:cd14899    463 SFVDFP----NNRACLELFEH---------RPIGIFSLTDQECVFPQGTDRALVAK---YY--LEFEKKNSHPHFRSApl 524
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  967 --KPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLflgRAGSATVLSGSIAGLEGGSQLALR 1044
Cdd:cd14899    525 iqRTTQFVVAHYAGC--VTYTIDGFLAKNK-DSFCESAAQLLAGSSNPLIQAL---AAGSNDEDANGDSELDGFGGRTRR 598
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1045 RATSmrktfttgmaAVKKKSLCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPC 1124
Cdd:cd14899    599 RAKS----------AIAAVSVGTQFKIQLNELLSTVRATTPRYVRC---------------------------IKPNDSH 641
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1125 EAGLLQldVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRF 1169
Cdd:cd14899    642 VGSLFQ--STRVVEQLRSGGVLEAVRVARAGFPVRLTHKQFLGRY 684
PTZ00014 PTZ00014
myosin-A; Provisional
402-1273 2.20e-35

myosin-A; Provisional


Pssm-ID: 240229 [Multi-domain]  Cd Length: 821  Bit Score: 147.48  E-value: 2.20e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  402 LVHRDgfSLASQLKSEELSLPEGKarvkldhdgaILDVDEDDIEKANAP-SCDRLEDLASLVYLNESSVLHTLRQRYGAS 480
Cdd:PTZ00014    56 LVLPG--STGEKLTLKQIDPPTNS----------TFEVKPEHAFNANSQiDPMTYGDIGLLPHTNIPCVLDFLKHRYLKN 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  481 LLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRR-EDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHL 559
Cdd:PTZ00014   124 QIYTTADPLLVAINPFKDLGNTTNDWIRRYRDAKDsDKLPPHVFTTARRALENLHGVKKSQTIIVSGESGAGKTEATKQI 203
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  560 VQYLAtiAGTSGTKVFSVEK--WQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTMLLEKLRVARRP 637
Cdd:PTZ00014   204 MRYFA--SSKSGNMDLKIQNaiMAA-NPVLEAFGNAKTIRNNNSSRFGRFMQLQLGEEGGIRYGSIVAFLLEKSRVVTQE 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  638 ASEATFNVFYYLLACGDATLRTELHLNHLAENNVfgIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASI 717
Cdd:PTZ00014   281 DDERSYHIFYQLLKGANDEMKEKYKLKSLEEYKY--INPKCLDVPGIDDVKDFEEVMESFDSMGLSESQIEDIFSILSGV 358
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  718 YHLG-------AAGATKEAAEAGRKQFarhEWAQKAAYLLGCSLEELS-SAIFKHQLKGGTLQRStsfRQGPEESglgEG 789
Cdd:PTZ00014   359 LLLGnveiegkEEGGLTDAAAISDESL---EVFNEACELLFLDYESLKkELTVKVTYAGNQKIEG---PWSKDES---EM 429
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  790 TKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQ----NpewggsargaSFEELCHNYAQDRL 865
Cdd:PTZ00014   430 LKDS-------LSKAVYEKLFLWIIRNLNATIEPPGGFKVFIGMLDIFGFEvfknN----------SLEQLFINITNEML 492
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  866 QRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV--DQashlvrSLAhadeargllwlleeealvP 934
Cdd:PTZ00014   493 QKNFVDIVFERESKLYKDEGIsteELEYTSNESVIDllcgkgKSVLSIleDQ------CLA------------------P 548
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  935 GATEDALLDRLFSYYGPQEGDKKGqspllRSSKPRHFLLGHSHGTnwVEYNVAGWLNYTKqNPATQNAPRLLQDSQKKII 1014
Cdd:PTZ00014   549 GGTDEKFVSSCNTNLKNNPKYKPA-----KVDSNKNFVIKHTIGD--IQYCASGFLFKNK-DVLRPELVEVVKASPNPLV 620
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1015 SNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavkkkslciqiklQVDALIDTIKRSKMHFVHCFLP 1093
Cdd:PTZ00014   621 RDLFEG----VEVEKGKLAkGQLIGSQF-------LN---------------------QLDSLMSLINSTEPHFIRCIKP 668
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1094 vaegwpgeprsassrrvssssELDLPPGDPCEAGLLqldvsllrAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLA 1173
Cdd:PTZ00014   669 ---------------------NENKKPLDWNSSKVL--------IQLHSLSILEALQLRQLGFSYRRTFAEFLSQFKYLD 719
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1174 PHLTKKHGrnyivVDEKRAVEELLESLDLEKSSCCLGLSRVFFR---AGTLARLEEQRDEQTSRHLTLFQAACRGYLARQ 1250
Cdd:PTZ00014   720 LAVSNDSS-----LDPKEKAEKLLERSGLPKDSYAIGKTMVFLKkdaAKELTQIQREKLAAWEPLVSVLEALILKIKKKR 794
                          890       900
                   ....*....|....*....|...
gi 1907082211 1251 HFKKRkIQDLAIrcVQKNIKKNK 1273
Cdd:PTZ00014   795 KVRKN-IKSLVR--IQAHLRRHL 814
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1307-2025 1.55e-32

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 139.05  E-value: 1.55e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1307 IQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLL----DAETAERLR----TEKEMKELQT 1375
Cdd:TIGR02169  165 VAEFDRKKEKALEELEEVEENIERLDLIIDEKRQQlerLRREREKAERYQALLkekrEYEGYELLKekeaLERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALKKQMEVMEMEVM------EARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTK-KRLQQELEDKMEVEQQSR 1448
Cdd:TIGR02169  245 QLASLEEELEKLTEEISelekrlEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASlERSIAEKERELEDAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1528
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINEL 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1529 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDE-------ASLAKVKKQLRDLEAKVKDQEEE 1601
Cdd:TIGR02169  405 KRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEwkleqlaADLSKYEQELYDLKEEYDRVEKE 484
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSIQMLEqAKLRLEMEMERMRQTHSKEMESRDEEV---------------------------------EEARQSC 1648
Cdd:TIGR02169  485 LSKLQRELAEAE-AQARASEERVRGGRAVEEVLKASIQGVhgtvaqlgsvgeryataievaagnrlnnvvvedDAVAKEA 563
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQME----------------------------------VQLEEEYEDKQK----------ALREKREL--ESKLST 1682
Cdd:TIGR02169  564 IELLKRRKagratflplnkmrderrdlsilsedgvigfavdlVEFDPKYEPAFKyvfgdtlvveDIEAARRLmgKYRMVT 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1683 L-------------------SDQVNQR-DFESEKRLRKDLKRTKALLADAQIMLDHLKNNA--------PSKREIAQLKN 1734
Cdd:TIGR02169  644 LegelfeksgamtggsraprGGILFSRsEPAELQRLRERLEGLKRELSSLQSELRRIENRLdelsqelsDASRKIGEIEK 723
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1735 QLEESEFTcAAAVKAR--------KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI-----QNRLEEDQEDMNE 1801
Cdd:TIGR02169  724 EIEQLEQE-EEKLKERleeleedlSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSK 802
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1802 LMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLE----F 1877
Cdd:TIGR02169  803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKE------IENLNGKKEELEEELEeleaA 876
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1878 EKTQVKRLENLAS---RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHElEMDLES 1954
Cdd:TIGR02169  877 LRDLESRLGDLKKerdELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLED 955
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1955 LEAANQSLQADLklafKRIGDLQAAIEDEMEsdENEDLINSLQDmvtkyqkKKNKLEGDSDvdsELEDRVD 2025
Cdd:TIGR02169  956 VQAELQRVEEEI----RALEPVNMLAIQEYE--EVLKRLDELKE-------KRAKLEEERK---AILERIE 1010
MYSc_Myo21 cd14882
class XXI myosin, motor domain; The myosins here are comprised of insects. Leishmania class ...
468-906 8.33e-32

class XXI myosin, motor domain; The myosins here are comprised of insects. Leishmania class XXI myosins do not group with them. Myo21, unlike other myosin proteins, contains UBA-like protein domains and has no structural or functional relationship with the myosins present in other organisms possessing cilia or flagella. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. They have diverse tails with IQ, WW, PX, and Tub domains. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276848  Cd Length: 642  Bit Score: 134.48  E-value: 8.33e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKvMHMFKGCR-REDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14882      2 NILEELRHRYLMGESYTFIGDILLSLNPNEIKQEYPQE-FHAKYRCKsRSDNAPHIFSVADSAYQDMLHHEEPQHIILSG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLATIA-GTSGTkvfsVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd14882     81 ESYSGKTTNARLLIKHLCYLGdGNRGA----TGRVESSIKAILALVNAGTPLNADSTRCILQYQLTFGSTGKMSGAIFWM 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRteLHLNHLAENNVFGIVPLSKPEEKQKA----------AQQFSKLQA 695
Cdd:cd14882    157 YQLEKLRVSTTDGNQSNFHIFYYFYDFIEAQNR--LKEYNLKAGRNYRYLRIPPEVPPSKLkyrrddpegnVERYKEFEE 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  696 AMKVLAISPEEQKTCWLILASIYHLGAAGATKEAAEAgrkQFARHEWAQKAAYLLGCSLEELSSAIFKHQL-KGGTLQRS 774
Cdd:cd14882    235 ILKDLDFNEEQLETVRKVLAAILNLGEIRFRQNGGYA---ELENTEIASRVAELLRLDEKKFMWALTNYCLiKGGSAERR 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  775 tsfRQGPEEsglgegtklsALECLEGMASGLYSELFTLLISLVN------RALKSSQHSLcsmMIVDTPGFQNPEWGGsa 848
Cdd:cd14882    312 ---KHTTEE----------ARDARDVLASTLYSRLVDWIINRINmkmsfpRAVFGDKYSI---SIHDMFGFECFHRNR-- 373
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082211  849 rgasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIelafddlePVAD----DSVAAVDQ 906
Cdd:cd14882    374 ----LEQLMVNTLNEQMQYHYNQRIFISEMLEMEEEDI--------PTINlrfyDNKTAVDQ 423
MYSc_Myo32 cd14893
class XXXII myosin, motor domain; Class XXXII myosins do not contain any IQ motifs, but ...
467-1175 1.05e-31

class XXXII myosin, motor domain; Class XXXII myosins do not contain any IQ motifs, but possess tandem MyTH4 and FERM domains. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276858  Cd Length: 741  Bit Score: 135.10  E-value: 1.05e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKGCRRE----------DMAPHIYAVAQTAYRAMLMS 536
Cdd:cd14893      1 NVALYTLRARYRMEQVYTWVDRVLVGVNPVTPLPIYTPDHMQAYNKSREQtplyekdtvnDAPPHVFALAQNALRCMQDA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  537 RQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQALS----------TLLEAFGNSPTIMNGSATRFSQ 606
Cdd:cd14893     81 GEDQAVILLGGMGAGKSEAAKLIVQYLCEIGDETEPRPDSEGASGVLHpigqqilhafTILEAFGNAATRQNRNSSRFAK 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  607 ILSLDFDQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLAC--GDATLRTELHLNHLAENnvFGIVPLSKPEEKQ 684
Cdd:cd14893    161 MISVEFSKHGHVIGGGFTTHYFEKSRVIDCRSHERNFHVFYQVLAGvqHDPTLRDSLEMNKCVNE--FVMLKQADPLATN 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  685 KA--AQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAG-----ATKEAAEAGRKQFARHEWA----QKAAYLLGCS 753
Cdd:cd14893    239 FAldARDYRDLMSSFSALRIRKNQRVEIVRIVAALLHLGNVDfvpdpEGGKSVGGANSTTVSDAQScalkDPAQILLAAK 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  754 LEELSSAIFKHQLKggtLQRSTSFRQGPEESGLGEGTKLSALECLEGMASGLYSELFTLLISLVN--------RALKS-- 823
Cdd:cd14893    319 LLEVEPVVLDNYFR---TRQFFSKDGNKTVSSLKVVTVHQARKARDTFVRSLYESLFNFLVETLNgilggifdRYEKSni 395
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  824 ---SQhslcSMMIVDTPGFQNPEwggsARGASFEELCHNYAQDRLQRLFHERTfLQELERYKEDNIELAFDDLepvADDS 900
Cdd:cd14893    396 vinSQ----GVHVLDMVGFENLT----PSQNSFDQLCFNYWSEKVHHFYVQNT-LAINFSFLEDESQQVENRL---TVNS 463
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  901 VAAVDQASHLVRSLAHaDEARGLLWLLEEEALVPGATEDALLDRLFS-------YYGPQEGDKKGQSPLLRSSKPR-HFL 972
Cdd:cd14893    464 NVDITSEQEKCLQLFE-DKPFGIFDLLTENCKVRLPNDEDFVNKLFSgneavggLSRPNMGADTTNEYLAPSKDWRlLFI 542
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  973 LGHSHGTnwVEYNVAGwlnYTKQNPATQNA--PRLLQDSQkkiisNLFLGRAGSATVLSGSIAglEGGSQLALRRATS-- 1048
Cdd:cd14893    543 VQHHCGK--VTYNGKG---LSSKNMLSISStcAAIMQSSK-----NAVLHAVGAAQMAAASSE--KAAKQTEERGSTSsk 610
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1049 MRKTFTTGMAAVK-KKSLCIQIKLQVDALIDTIKRSKMHFVHCflpvaegwpgeprsassrrvsssseldLPPGDPCEAG 1127
Cdd:cd14893    611 FRKSASSARESKNiTDSAATDVYNQADALLHALNHTGKNFLVC---------------------------IKPNETLEEG 663
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211 1128 LlqLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPH 1175
Cdd:cd14893    664 V--FDSAYVMKQIRMNHLVELMQASRSIFTVHLTYGHFFRRYKNVCGH 709
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1305-2012 1.12e-31

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 136.07  E-value: 1.12e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEI-QQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDErntgESASQLLDAEtaeRLRTEKEMKELQTQYDALKKQ 1383
Cdd:pfam01576   74 EEIlHELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEE----EAARQKLQLE---KVTTEAKIKKLEEDILLLEDQ 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1384 MEVMEMEVMEARLiRAAEINGEVDDDDAG----GEWRLKYERAVREVDftkKRLQQELEDKMEVEQqSRRQLERRLGDLQ 1459
Cdd:pfam01576  147 NSKLSKERKLLEE-RISEFTSNLAEEEEKakslSKLKNKHEAMISDLE---ERLKKEEKGRQELEK-AKRKLEGESTDLQ 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1460 ADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEA 1539
Cdd:pfam01576  222 EQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEEL 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1540 FSLKQQMEEkdldiagfTQKVVSLEAELQdissqeSKDEASLAKVKK-----------QLRDLEAKVKDQEEELDEQags 1608
Cdd:pfam01576  302 EALKTELED--------TLDTTAAQQELR------SKREQEVTELKKaleeetrsheaQLQEMRQKHTQALEELTEQ--- 364
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1609 iqmLEQAKlRLEMEMERMRQTHSKEMESRDEEV---EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS- 1684
Cdd:pfam01576  365 ---LEQAK-RNKANLEKAKQALESENAELQAELrtlQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQs 440
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1685 --DQVNQRDFESEK---RLRKDLKRTKALLADAQIML-------------------------DHLKNNAPSKREI----- 1729
Cdd:pfam01576  441 elESVSSLLNEAEGkniKLSKDVSSLESQLQDTQELLqeetrqklnlstrlrqledernslqEQLEEEEEAKRNVerqls 520
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1730 ------AQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQRE----------KNEIQNRLE 1793
Cdd:pfam01576  521 tlqaqlSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQElddllvdldhQRQLVSNLE 600
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1794 EDQEDMNELMKKHKAAVAQAS--RDMAQMN---------DLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM--VDKSL 1860
Cdd:pfam01576  601 KKQKKFDQMLAEEKAISARYAeeRDRAEAEareketralSLARALEEALEAKEELERTNKQLRAEMEDLVSSKddVGKNV 680
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1861 VSRQEAKiRELETRLEFEKTQVKRL-------ENLASRLKETME--KLTEERDQRAAAENREkEQNKRLQRQLRDTKEEM 1931
Cdd:pfam01576  681 HELERSK-RALEQQVEEMKTQLEELedelqatEDAKLRLEVNMQalKAQFERDLQARDEQGE-EKRRQLVKQVRELEAEL 758
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1932 SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMEsdENEDLINSLQDMVTKYQKKKNKLE 2011
Cdd:pfam01576  759 EDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQR--ELEEARASRDEILAQSKESEKKLK 836

                   .
gi 1907082211 2012 G 2012
Cdd:pfam01576  837 N 837
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1291-2013 4.69e-31

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 134.41  E-value: 4.69e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldaetAERLRTEKEM 1370
Cdd:TIGR02168  218 LKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELR--------------LEVSELEEEI 283
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKkqmevmemevmearliraAEINgevddddaggewrlkyeRAVREVDFTKKRLQQeLEDKMEVEQQSRRQ 1450
Cdd:TIGR02168  284 EELQKELYALA------------------NEIS-----------------RLEQQKQILRERLAN-LERQLEELEAQLEE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1451 LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQR 1530
Cdd:TIGR02168  328 LESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1531 EKDMLLAEAFSLKQQMEEKDLDIAgftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1610
Cdd:TIGR02168  408 RLERLEDRRERLQQEIEELLKKLE---------EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALD 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1611 M----LEQAKLRLEMeMERMRQTHSKEMESRDEEVEEARQ------------SCQKKLKQ-MEVQLEE-----EYEDKQK 1668
Cdd:TIGR02168  479 AaereLAQLQARLDS-LERLQENLEGFSEGVKALLKNQSGlsgilgvlseliSVDEGYEAaIEAALGGrlqavVVENLNA 557
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1669 ALRE----KRELESKLSTLS-DQVNQRDFESEKRLRKDLKRTKALLAD--------AQIMLDHLKNN---APSKREIAQL 1732
Cdd:TIGR02168  558 AKKAiaflKQNELGRVTFLPlDSIKGTEIQGNDREILKNIEGFLGVAKdlvkfdpkLRKALSYLLGGvlvVDDLDNALEL 637
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KNQLEESE----------FTCAAAVKARK-------AMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED 1795
Cdd:TIGR02168  638 AKKLRPGYrivtldgdlvRPGGVITGGSAktnssilERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL 717
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1796 QEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV-DKSLVSRQEAKIRELETR 1874
Cdd:TIGR02168  718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAeAEAEIEELEAQIEQLKEE 797
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1875 LEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLES 1954
Cdd:TIGR02168  798 LKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEA 877
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1955 LEAANQSLQADLKLAFKRIGDLQAAIED-EMESDENEDLINSLQDMVTKYQKKKNKLEGD 2013
Cdd:TIGR02168  878 LLNERASLEEALALLRSELEELSEELRElESKRSELRRELEELREKLAQLELRLEGLEVR 937
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1431-2011 7.04e-31

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 133.52  E-value: 7.04e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1431 KRLQQELED-KMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDS 1509
Cdd:COG1196    216 RELKEELKElEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLA 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR 1589
Cdd:COG1196    296 ELARLEQDIARLEERRRELEERLE-------ELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALL 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1590 DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMEsRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKA 1669
Cdd:COG1196    369 EAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLE-RLERLEEELEELEEALAELEEEEEEEEEALEEA 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1670 LREKRELESKLSTLSDQVnqrdfeseKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESeftcAAAVKA 1749
Cdd:COG1196    448 AEEEAELEEEEEALLELL--------AELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEG----VKAALL 515
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1750 RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQReknEIQNRLEEDQEDMNELMKKHKAAVA-------QASRDMAQMND 1822
Cdd:COG1196    516 LAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQ---NIVVEDDEVAAAAIEYLKAAKAGRAtflpldkIRARAALAAAL 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1823 LQAQIEESNKEKQELQEKLQALQSQveFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEE 1902
Cdd:COG1196    593 ARGAIGAAVDLVASDLREADARYYV--LGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRREL 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1903 RDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIED 1982
Cdd:COG1196    671 LAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEE 750
                          570       580       590
                   ....*....|....*....|....*....|
gi 1907082211 1983 EMESDENEDL-INSLQDMVTKYQKKKNKLE 2011
Cdd:COG1196    751 EALEELPEPPdLEELERELERLEREIEALG 780
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1316-1959 7.34e-30

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 130.06  E-value: 7.34e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1316 KVEKERNELRLSS--DRLEtRISELTSELtdERNTGESASQlldAETAERLRtekemkELQTQYDALKKQMEVMEMEVME 1393
Cdd:COG1196    171 KERKEEAERKLEAteENLE-RLEDILGEL--ERQLEPLERQ---AEKAERYR------ELKEELKELEAELLLLKLRELE 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1394 ARLIRAAEINGEVDDDDAggewRLKYERAVREVDFTKKRLQ-QELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQL 1472
Cdd:COG1196    239 AELEELEAELEELEAELE----ELEAELAELEAELEELRLElEELELELEEAQAEEYELLAELARLEQDIARLEERRREL 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1473 KKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLD 1552
Cdd:COG1196    315 EERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1553 IAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSK 1632
Cdd:COG1196    395 AAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAAL 474
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1633 EMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLrkdlkrtkALLADA 1712
Cdd:COG1196    475 LEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAAL--------EAALAA 546
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1713 QIMLDHLKNNAPSKREIAQLK-NQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNR 1791
Cdd:COG1196    547 ALQNIVVEDDEVAAAAIEYLKaAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRT 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1792 LEEDQEDM-NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvdkslvSRQEAKIRE 1870
Cdd:COG1196    627 LVAARLEAaLRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELA------------ERLAEEELE 694
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1871 LETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEM 1950
Cdd:COG1196    695 LEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLER 774

                   ....*....
gi 1907082211 1951 DLESLEAAN 1959
Cdd:COG1196    775 EIEALGPVN 783
MYSc_Myo25 cd14886
class XXV myosin, motor domain; These myosins are MyTH-FERM myosins that play a role in cell ...
469-887 7.48e-30

class XXV myosin, motor domain; These myosins are MyTH-FERM myosins that play a role in cell adhesion and filopodia formation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276851  Cd Length: 650  Bit Score: 128.47  E-value: 7.48e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  469 VLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKGCRRE-----DMAPHIYAVAQTAYRAMLMSRQDQSI 542
Cdd:cd14886      3 VIDILRDRFAKDKIYTYAGKLLVALNPfKQIRNLYGTEVIGRYRQADTSrgfpsDLPPHSYAVAQSALNGLISDGISQSC 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  543 VLLGSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVekwqALST--LLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVAS 620
Cdd:cd14886     83 IVSGESGAGKTETAKQLMNFFAYGHSTSSTDVQSL----ILGSnpLLESFGNAKTLRNNNSSRFGKFIKLLVGPDGGLKG 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  621 ASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENNVF-GIVPLSKPE-EKQKaaqQFSKLQAAMK 698
Cdd:cd14886    159 GKITSYMLELSRIEFQSTNERNYHIFYQCIKGLSPEEKKSLGFKSLESYNFLnASKCYDAPGiDDQK---EFAPVRSQLE 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  699 VLaISPEEQKTCWLILASIYHLG--------AAGATKEAAEAGRKQFarhewaQKAAYLLGCSLEELSSAIfkhqlkggt 770
Cdd:cd14886    236 KL-FSKNEIDSFYKCISGILLAGniefseegDMGVINAAKISNDEDF------GKMCELLGIESSKAAQAI--------- 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  771 LQRSTSFRQGPEESGLgegTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQNPEWGgsarg 850
Cdd:cd14886    300 ITKVVVINNETIISPV---TQAQAEVNIRAVAKDLYGALFELCVDTLNEIIQFDADARPWIGILDIYGFEFFERN----- 371
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1907082211  851 aSFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE 887
Cdd:cd14886    372 -TYEQLLINYANERLQQYFINQVFKSEIQEYEIEGID 407
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1291-2022 1.05e-29

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 129.91  E-value: 1.05e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSE--EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldaetAERLRTEK 1368
Cdd:pfam01576  227 LQAQIAElrAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESER--------------AARNKAEK 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1369 EMKELQTQYDALKkqmevmemevmearliraAEINGEVDDDDAGGEWRLKYERAVREVdftKKRLQQELEDKMEVEQQSR 1448
Cdd:pfam01576  293 QRRDLGEELEALK------------------TELEDTLDTTAAQQELRSKREQEVTEL---KKALEEETRSHEAQLQEMR 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKL 1528
Cdd:pfam01576  352 QKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAEL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1529 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGS 1608
Cdd:pfam01576  432 AEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEA 511
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1609 IQMLEQAKLRLEMEMERMRQtHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS-DQV 1687
Cdd:pfam01576  512 KRNVERQLSTLQAQLSDMKK-KLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLvDLD 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1688 NQRDFES--EKRLRK---DLKRTKALLADAQIMLDHLKNNAPSKREIA-QLKNQLEESEFTCAAAVKARKAMEVEMEDLH 1761
Cdd:pfam01576  591 HQRQLVSnlEKKQKKfdqMLAEEKAISARYAEERDRAEAEAREKETRAlSLARALEEALEAKEELERTNKQLRAEMEDLV 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1762 LQIDDIAK-------AKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDmaqmndLQAQiEESNKEK 1834
Cdd:pfam01576  671 SSKDDVGKnvhelerSKRALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERD------LQAR-DEQGEEK 743
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1835 QelqeklQALQSQVefleqsmvdkslvsrqeakiRELETRLEFEKTQvkRLENLASRLKETMEKLTEERDQRAAAENREK 1914
Cdd:pfam01576  744 R------RQLVKQV--------------------RELEAELEDERKQ--RAQAVAAKKKLELDLKELEAQIDAANKGREE 795
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1915 --EQNKRLQRQLRDTKEEMSE--LARKEAEASRKkhELEMDLESLEAANQSLQADLKLAFKrigdlqAAIEDEMESDENE 1990
Cdd:pfam01576  796 avKQLKKLQAQMKDLQRELEEarASRDEILAQSK--ESEKKLKNLEAELLQLQEDLAASER------ARRQAQQERDELA 867
                          730       740       750
                   ....*....|....*....|....*....|...
gi 1907082211 1991 DLINSLQDMVTKYQKKKNKLEGD-SDVDSELED 2022
Cdd:pfam01576  868 DEIASGASGKSALQDEKRRLEARiAQLEEELEE 900
MYSc_Myo13 cd14875
class XIII myosin, motor domain; These myosins have an N-terminal motor domain, a light-chain ...
520-1179 4.52e-29

class XIII myosin, motor domain; These myosins have an N-terminal motor domain, a light-chain binding domain, and a C-terminal GPA/Q-rich domain. There is little known about the function of this myosin class. Two of the earliest members identified in this class are green alga Acetabularia cliftonii, Aclmyo1 and Aclmyo2. They are striking with their short tail of Aclmyo1 of 18 residues and the maximum of 7 IQ motifs in Aclmyo2. It is thought that these myosins are involved in organelle transport and tip growth. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276842 [Multi-domain]  Cd Length: 664  Bit Score: 126.08  E-value: 4.52e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  520 PHIYAVAQTAYRAMLM-SRQDQSIVLLGSSGSGKTTSFQHLVQYLA---------TIAGTSGTKVFSVEKWQalSTLLEA 589
Cdd:cd14875     56 PHIWQVAHKAFNAIFVqGLGNQSVVISGESGSGKTENAKMLIAYLGqlsymhssnTSQRSIADKIDENLKWS--NPVMES 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  590 FGNSPTIMNGSATRFSQILSLDFDQA-GQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDAT-------LRTEL 661
Cdd:cd14875    134 FGNARTVRNDNSSRFGKYIKLYFDPTsGVMVGGQTVTYLLEKSRIIMQSPGERNYHIFYEMLAGLSPEekkelggLKTAQ 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  662 HLNHLAENNVF---GI--VPLSKPEEkqkaaqqFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEaaeagrkQ 736
Cdd:cd14875    214 DYKCLNGGNTFvrrGVdgKTLDDAHE-------FQNVRHALSMIGVELETQNSIFRVLASILHLMEVEFESD-------Q 279
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  737 FARHEWAQKAAYLLGCSLEELSSAifkhQLKGGTLQRS-----TSFRQGPEESGLgegtklsalecLEGMASGLYSELFT 811
Cdd:cd14875    280 NDKAQIADETPFLTACRLLQLDPA----KLRECFLVKSktslvTILANKTEAEGF-----------RNAFCKAIYVGLFD 344
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  812 LLISLVNRALKSSQH-SLCSMM-IVDTPGFQNpewggsARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIE-- 887
Cdd:cd14875    345 RLVEFVNASITPQGDcSGCKYIgLLDIFGFEN------FTRNSFEQLCINYANESLQNHYNKYTFINDEEECRREGIQip 418
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  888 -LAFDDlepvADDSVAAVDQASHLVRSLahADEargllwlleeEALVPGATEDALLDRLFSYYGpqegdkkGQSPLL--- 963
Cdd:cd14875    419 kIEFPD----NSECVNMFDQKRTGIFSM--LDE----------ECNFKGGTTERFTTNLWDQWA-------NKSPYFvlp 475
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  964 RSSKPRHFllGHSHGTNWVEYNVAGWLNytKQNPA-TQNAPRLLQDSQKKIISNLFLGRAGSAtvlsgsiagleggsqla 1042
Cdd:cd14875    476 KSTIPNQF--GVNHYAAFVNYNTDEWLE--KNTDAlKEDMYECVSNSTDEFIRTLLSTEKGLA----------------- 534
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1043 lRRatsmrktfttgmaavkKKSLCIQIKLQVDALIDTIKRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgd 1122
Cdd:cd14875    535 -RR----------------KQTVAIRFQRQLTDLRTELESTETQFIRCIKPNMEASPS---------------------- 575
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1123 pceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDVLAPHLTKK 1179
Cdd:cd14875    576 -------FLDNLLVGSQLESAGVLQTIALKRQGYPVRRPIEQFCRYFYLIMPRSTAS 625
MYSc_Myo23 cd14884
class XXIII myosin, motor domain; These myosins are predicted to have a neck region with 1-2 ...
468-906 4.29e-28

class XXIII myosin, motor domain; These myosins are predicted to have a neck region with 1-2 IQ motifs and a single MyTH4 domain in its C-terminal tail. The lack of a FERM domain here is odd since MyTH4 domains are usually found alongside FERM domains where they bind to microtubules. At any rate these Class XXIII myosins are still proposed to function in the apicomplexan microtubule cytoskeleton. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276850 [Multi-domain]  Cd Length: 685  Bit Score: 123.09  E-value: 4.29e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMF-------KGCRREDMAPHIYAVAQTAYRAMLMSRQD 539
Cdd:cd14884      2 NVLQNLKNRYLKNKIYTFHASLLLALNPyKPLKELYDQDVMNVYlhkksnsAASAAPFPKAHIYDIANMAYKNMRGKLKR 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  540 QSIVLLGSSGSGKTTSFQHLVQYLATIAGTSgTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQ----- 614
Cdd:cd14884     82 QTIVVSGHSGSGKTENCKFLFKYFHYIQTDS-QMTERIDKLIYINNILESMSNATTIKNNNSSRCGRINLLIFEEventq 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  615 ----AGQVASASIQTMLLEKLRVARRPASEATFNVFYYLL-ACGDATLRT-----ELHLNHLAENNVFGIVPLSK----- 679
Cdd:cd14884    161 knmfNGCFRNIKIKILLLEINRCIAHNFGERNFHVFYQVLrGLSDEDLARrnlvrNCGVYGLLNPDESHQKRSVKgtlrl 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  680 --------PEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAgATKEAAEagrkqfarhewaqkaayLLG 751
Cdd:cd14884    241 gsdsldpsEEEKAKDEKNFVALLHGLHYIKYDERQINEFFDIIAGILHLGNR-AYKAAAE-----------------CLQ 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  752 CSLEELssaifKHQLKGGTLQ-RSTSFRQgpeesglgEGTKLSALECLEGMASGLYSELFTLLISLVNRALKSSQHSLCS 830
Cdd:cd14884    303 IEEEDL-----ENVIKYKNIRvSHEVIRT--------ERRKENATSTRDTLIKFIYKKLFNKIIEDINRNVLKCKEKDES 369
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  831 M------------MIVDTPGFQnpewggSARGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIeLAFDDLEPVAD 898
Cdd:cd14884    370 DnediysineaiiSILDIYGFE------ELSGNDFDQLCINLANEKLNNYYINNEIEKEKRIYARENI-ICCSDVAPSYS 442

                   ....*...
gi 1907082211  899 DSVAAVDQ 906
Cdd:cd14884    443 DTLIFIAK 450
MYSc_Myo12 cd14874
class XXXIII myosin, motor domain; Little is known about the XXXIII class of myosins. They ...
467-1174 3.19e-27

class XXXIII myosin, motor domain; Little is known about the XXXIII class of myosins. They are found predominately in nematodes. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276841 [Multi-domain]  Cd Length: 628  Bit Score: 119.98  E-value: 3.19e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVmhmFKGCrredmapHIYAVAQTAYRAML-MSRQDQSIVLL 545
Cdd:cd14874      1 AGIAQNLHERFKKGQTYTKASNVLVFVNDFNKLSIQDQLV---IKKC-------HISGVAENALDRIKsMSSNAESIVFG 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATIAGTSGTKVFSvekwQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQT 625
Cdd:cd14874     71 GESGSGKSYNAFQVFKYLTSQPKSKVTTKHS----SAIESVFKSFGCAKTLKNDEATRFGCSIDLLYKRNVLTGLNLKYT 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  626 MLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLaeNNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISPE 705
Cdd:cd14874    147 VPLEVPRVISQKPGERNFNVFYEVYHGLNDEMKAKFGIKGL--QKFFYINQGNSTENIQSDVNHFKHLEDALHVLGFSDD 224
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  706 EQKTCWLILASIYHLG--------AAGATKEAAEAGrkQFARHEWaqkAAYLLGCSLEELSSAIFKHQLKGGTLQRStsf 777
Cdd:cd14874    225 HCISIYKIISTILHIGniyfrtkrNPNVEQDVVEIG--NMSEVKW---VAFLLEVDFDQLVNFLLPKSEDGTTIDLN--- 296
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  778 rqgpeesglgegtklSALECLEGMASGLYSELFTLLISLVNRALKSSQHSlCSMMIVDTPGFQNPEWGGsargasFEELC 857
Cdd:cd14874    297 ---------------AALDNRDSFAMLIYEELFKWVLNRIGLHLKCPLHT-GVISILDHYGFEKYNNNG------VEEFL 354
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  858 HNYAQDRLQRLFHERTFLQELERYKEDNIELafddlepvaDDSVAAVDQASHLVRSLAHADEarGLLWLLEEEALVPGAT 937
Cdd:cd14874    355 INSVNERIENLFVKHSFHDQLVDYAKDGISV---------DYKVPNSIENGKTVELLFKKPY--GLLPLLTDECKFPKGS 423
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  938 EDALLDRL------FSYYGPQegdkkgqspllRSSKPRHFLLGHSHGTNWveYNVAGWLNYTKQNpATQNAPRLLQDSQK 1011
Cdd:cd14874    424 HESYLEHCnlnhtdRSSYGKA-----------RNKERLEFGVRHCIGTTW--YNVTDFFSRNKRI-ISLSAVQLLRSSKN 489
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1012 KIISNLFLGRAGSATVLSGSIAgleggsQLALRRATSmrktfttgmaavkkkslciqiklqvdaLIDTIKRSKMHFVHCF 1091
Cdd:cd14874    490 PIIGLLFESYSSNTSDMIVSQA------QFILRGAQE---------------------------IADKINGSHAHFVRCI 536
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1092 LPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHMVFSEFRRRFDV 1171
Cdd:cd14874    537 KSNNERQPK-----------------------------KFDIPLVNRQIKNLLLAELLSFRIKGYPVKISKTTFARQYRC 587

                   ...
gi 1907082211 1172 LAP 1174
Cdd:cd14874    588 LLP 590
MYSc_Myo14 cd14876
class XIV myosin, motor domain; These myosins localize to plasma membranes of the ...
469-1093 3.81e-27

class XIV myosin, motor domain; These myosins localize to plasma membranes of the intracellular parasites and may be involved in the cell invasion process. Their known functions include: transporting phagosomes to the nucleus and perturbing the developmentally regulated elimination of the macronucleus during conjugation. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. C-terminal to their motor domain these myosins have a MyTH4-FERM protein domain combination. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276843  Cd Length: 649  Bit Score: 120.09  E-value: 3.81e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  469 VLHTLRQRYGASLLHTYAGPSLLVLST---------------RGAPAVysekvmhmfkgcrrEDMAPHIYAVAQTAYRAM 533
Cdd:cd14876      3 VLDFLKHRYLKNQIYTTADPLLVAINPfkdlgnatdewirkyRDAPDL--------------TKLPPHVFYTARRALENL 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  534 LMSRQDQSIVLLGSSGSGKTTSFQHLVQYLAtiAGTSGTKVFSVEKW-QALSTLLEAFGNSPTIMNGSATRFSQILSLDF 612
Cdd:cd14876     69 HGVNKSQTIIVSGESGAGKTEATKQIMRYFA--SAKSGNMDLRIQTAiMAANPVLEAFGNAKTIRNNNSSRFGRFMQLDV 146
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  613 DQAGQVASASIQTMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNHLAENN--------VFGIVPLskpeekq 684
Cdd:cd14876    147 ASEGGIRYGSVVAFLLEKSRIVTQDDNERSYHIFYQLLKGADSEMKSKYHLLGLKEYKflnpkcldVPGIDDV------- 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  685 kaaQQFSKLQAAMKVLAISPEEQKTCWLILASIYHLGAAGATKEaAEAGRKQFARHEWAQKAAYLLGCSLEELSSAIFKH 764
Cdd:cd14876    220 ---ADFEEVLESLKSMGLTEEQIDTVFSIVSGVLLLGNVKITGK-TEQGVDDAAAISNESLEVFKEACSLLFLDPEALKR 295
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  765 QL------KGGtlQRSTSFRQGPEesglGEGTKLSaleclegMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPG 838
Cdd:cd14876    296 ELtvkvtkAGG--QEIEGRWTKDD----AEMLKLS-------LAKAMYDKLFLWIIRNLNSTIEPPGGFKNFMGMLDIFG 362
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  839 F---QNpewggsargASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI---ELAFDDLEPVAD------DSVAAV-- 904
Cdd:cd14876    363 FevfKN---------NSLEQLFINITNEMLQKNFIDIVFERESKLYKDEGIptaELEYTSNAEVIDvlcgkgKSVLSIle 433
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  905 DQAshlvrsLAhadeargllwlleeealvPGATEDALLDRLFSyygpQEGDKKGQSPLLRSSKpRHFLLGHSHGTnwVEY 984
Cdd:cd14876    434 DQC------LA------------------PGGSDEKFVSACVS----KLKSNGKFKPAKVDSN-INFIVVHTIGD--IQY 482
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  985 NVAGWLNYTKqNPATQNAPRLLQDSQKKIISNLFLGragsATVLSGSIA-GLEGGSQLalrratsMRktfttgmaavkkk 1063
Cdd:cd14876    483 NAEGFLFKNK-DVLRAELVEVVQASTNPVVKALFEG----VVVEKGKIAkGSLIGSQF-------LK------------- 537
                          650       660       670
                   ....*....|....*....|....*....|
gi 1907082211 1064 slciqiklQVDALIDTIKRSKMHFVHCFLP 1093
Cdd:cd14876    538 --------QLESLMGLINSTEPHFIRCIKP 559
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1297-1959 5.50e-27

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 120.93  E-value: 5.50e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNELR-----LSSD--RLETRISELTSELTDERNTGESASQLLDAETAERLRTEKE 1369
Cdd:TIGR02168  259 TAELQELEEKLEELRLEVSELEEEIEELQkelyaLANEisRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1370 MKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQeLEDKMEVEQQSRR 1449
Cdd:TIGR02168  339 LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIER-LEARLERLEDRRE 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1450 QLERRLGDLQADSDESQRALQQLKkkCQRLTAELQDTKLHLEGQQVR----NHELEKKQRRFDSELSQAHEETQR----E 1521
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAELKELQAE--LEELEEELEELQEELERLEEAleelREELEEAEQALDAAERELAQLQARldslE 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1522 KLQR--EKLQREKDMLLAEAF-------SLKQQME-----EKDLDIA--GFTQKVV--SLEAELQDISSQESKDE---AS 1580
Cdd:TIGR02168  496 RLQEnlEGFSEGVKALLKNQSglsgilgVLSELISvdegyEAAIEAAlgGRLQAVVveNLNAAKKAIAFLKQNELgrvTF 575
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1581 LAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERM--------------RQTHSKEMESR--------- 1637
Cdd:TIGR02168  576 LPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLlggvlvvddldnalELAKKLRPGYRivtldgdlv 655
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1638 ---------DEEVEEARQSCQKKLKQMEVQ---LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL---RKDL 1702
Cdd:TIGR02168  656 rpggvitggSAKTNSSILERRREIEELEEKieeLEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIsalRKDL 735
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1703 KRTKALLADAQIMLDHL-KNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1781
Cdd:TIGR02168  736 ARLEAEVEQLEERIAQLsKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLL 815
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1782 QREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSL 1860
Cdd:TIGR02168  816 NEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALlRSE 895
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1861 VSRQEAKIRELETRlefektqVKRLENLasrLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARK--- 1937
Cdd:TIGR02168  896 LEELSEELRELESK-------RSELRRE---LEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKied 965
                          730       740
                   ....*....|....*....|...
gi 1907082211 1938 -EAEASRKKHELEMDLESLEAAN 1959
Cdd:TIGR02168  966 dEEEARRRLKRLENKIKELGPVN 988
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1295-1999 7.71e-27

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 120.56  E-value: 7.71e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1295 LSEEqiRNKDEEIQQLRSKLEKVE-----KERNELRLSSDRLETRISELTSELT------DERN-TGESASQLLDAETAE 1362
Cdd:TIGR02169  203 LRRE--REKAERYQALLKEKREYEgyellKEKEALERQKEAIERQLASLEEELEklteeiSELEkRLEEIEQLLEELNKK 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1363 -RLRTEKEMKELQTQYDALKKQMEVMEMEVMEA--RLIRAAE--INGEVDDDDAGGEWRlKYERAVREVDFTKKRLQQEL 1437
Cdd:TIGR02169  281 iKDLGEEEQLRVKEKIGELEAEIASLERSIAEKerELEDAEErlAKLEAEIDKLLAEIE-ELEREIEEERKRRDKLTEEY 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKmeveQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhLEGQQVRnheLEKKQRRFDSELSQAHEE 1517
Cdd:TIGR02169  360 AEL----KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINE----LKRELDR---LQEELQRLSEELADLNAA 428
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1518 TQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKV-- 1595
Cdd:TIGR02169  429 IAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVrg 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1596 -KDQEEELD------------------------EQAGS-----------------IQMLEQAKL-RLE-MEMERMRQTHS 1631
Cdd:TIGR02169  509 gRAVEEVLKasiqgvhgtvaqlgsvgeryataiEVAAGnrlnnvvveddavakeaIELLKRRKAgRATfLPLNKMRDERR 588
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1632 -KEMESRD------------------------------EEVEEAR----------------------------------- 1645
Cdd:TIGR02169  589 dLSILSEDgvigfavdlvefdpkyepafkyvfgdtlvvEDIEAARrlmgkyrmvtlegelfeksgamtggsraprggilf 668
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1646 --------QSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEkRLRKDLKRTKALLADAQIMLD 1717
Cdd:TIGR02169  669 srsepaelQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIE-QLEQEEEKLKERLEELEEDLS 747
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1718 HLKNN-APSKREIAQLKNQLEESEftcAAAVKARKAME-VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED 1795
Cdd:TIGR02169  748 SLEQEiENVKSELKELEARIEELE---EDLHKLEEALNdLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1796 QEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETR 1874
Cdd:TIGR02169  825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDlKKERDELEAQLRELERK 904
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1875 LEFEKTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRDTKEEMSELARKEAEASRKKhELEMDLES 1954
Cdd:TIGR02169  905 IEELEAQIEKKRKRLSELKAKLEAL--------------EEELSEIEDPKGEDEEIPEEELSLEDVQAELQ-RVEEEIRA 969
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1955 LEAANQSLQADLKLAFKRIGDLQAAIED-EMESDENEDLINSLQDM 1999
Cdd:TIGR02169  970 LEPVNMLAIQEYEEVLKRLDELKEKRAKlEEERKAILERIEEYEKK 1015
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1434-2010 2.62e-26

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 117.81  E-value: 2.62e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1434 QQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQR--RFDSEL 1511
Cdd:TIGR04523   35 EKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLN---------DKLKKNKDKinKLNSDL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1512 SQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKdeaslakVKKQLRDL 1591
Cdd:TIGR04523  106 SKINSEIKNDKEQKNKLEVELN-------KLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYND-------LKKQKEEL 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1592 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM---ERMRQTHsKEMESRDEEVEEARQSCQKKLKQmevqLEEEYEDKQK 1668
Cdd:TIGR04523  172 ENELNLLEKEKLNIQKNIDKIKNKLLKLELLLsnlKKKIQKN-KSLESQISELKKQNNQLKDNIEK----KQQEINEKTT 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1669 ALREKrelESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESeftCAAAVK 1748
Cdd:TIGR04523  247 EISNT---QTQLNQLKDE-QNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKS------EISDLNNQKEQD---WNKELK 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1749 AR-KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKN-------EIQNRLEEDQEDMnELMKKHKAAVAQASRDM-AQ 1819
Cdd:TIGR04523  314 SElKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTnsesensEKQRELEEKQNEI-EKLKKENQSYKQEIKNLeSQ 392
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1820 MNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQ--------------EAKIRELETRLEFEKTQVKR 1884
Cdd:TIGR04523  393 INDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERlKETIIKNnseikdltnqdsvkELIIKNLDNTRESLETQLKV 472
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1885 LENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQA 1964
Cdd:TIGR04523  473 LSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDF 552
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1965 DLKLAF--KRIGDLQAAIEdEMEsDENEDLINS---LQDMVTKYQKKKNKL 2010
Cdd:TIGR04523  553 ELKKENleKEIDEKNKEIE-ELK-QTQKSLKKKqeeKQELIDQKEKEKKDL 601
PTZ00121 PTZ00121
MAEBL; Provisional
1295-1990 7.10e-26

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 117.93  E-value: 7.10e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1295 LSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL--ETRISELTSELTDERNTgESASQLLDAETAERLRTEKEMKE 1372
Cdd:PTZ00121  1099 KAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKaeDARKAEEARKAEDAKRV-EIARKAEDARKAEEARKAEDAKK 1177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1373 LQTQYDALKKQMEVMEMEVMEARLIRAAEingevddddaggewRLKYERAVREVdftkkrlqQELEDKMEVEQqSRRQLE 1452
Cdd:PTZ00121  1178 AEAARKAEEVRKAEELRKAEDARKAEAAR--------------KAEEERKAEEA--------RKAEDAKKAEA-VKKAEE 1234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1453 RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKK--QRRFDSELSQAHEETQREKLQREKLQR 1530
Cdd:PTZ00121  1235 AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKaeEKKKADEAKKAEEKKKADEAKKKAEEA 1314
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1531 EKdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-AKVKDQEEELDEQAGSI 1609
Cdd:PTZ00121  1315 KK----ADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKkEEAKKKADAAKKKAEEK 1390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1610 QMLEQAKLRLEMEMERMRQTHSKEMESRdeEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1689
Cdd:PTZ00121  1391 KKADEAKKKAEEDKKKADELKKAAAAKK--KADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEE 1468
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1690 RDFESEKRLRKDLKR----TKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCA-AAVKARKAMEVEMEDLHLQI 1764
Cdd:PTZ00121  1469 AKKADEAKKKAEEAKkadeAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAkKAEEAKKADEAKKAEEKKKA 1548
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1765 DDIAKA---KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1841
Cdd:PTZ00121  1549 DELKKAeelKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKA 1628
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1842 QALQSQVEFLEQSMVDKslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQ 1921
Cdd:PTZ00121  1629 EEEKKKVEQLKKKEAEE---KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1922 R-------------QLRDTKEE----MSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEM 1984
Cdd:PTZ00121  1706 ElkkkeaeekkkaeELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEEL 1785

                   ....*.
gi 1907082211 1985 ESDENE 1990
Cdd:PTZ00121  1786 DEEDEK 1791
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1292-1934 7.90e-25

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 113.19  E-value: 7.90e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMK 1371
Cdd:TIGR04523  111 EIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNID 190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1372 ELQTQYDALKKQMEVMEMEVMEARLIRAaEINgevddddaggewrlKYERAVREVDFTKKRLQQELEDKMEVEQQSRRQL 1451
Cdd:TIGR04523  191 KIKNKLLKLELLLSNLKKKIQKNKSLES-QIS--------------ELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQL 255
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1452 E----------RRLGDLQADSDESQRALQQLKKKCQRLTAELQDtkLHLEGQQVRNHEL-------EKKQRRFDSELSQA 1514
Cdd:TIGR04523  256 NqlkdeqnkikKQLSEKQKELEQNNKKIKELEKQLNQLKSEISD--LNNQKEQDWNKELkselknqEKKLEEIQNQISQN 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1515 HEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldiagftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEAK 1594
Cdd:TIGR04523  334 NKIISQLNEQISQLKKELTNSESENSEKQRELEEK--------------QNEIEKLKKENQSYKQEIKNLESQINDLESK 399
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1595 VKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHS------KEMESRD-------EEVEEARQSCQKKLKQMEVQ--- 1658
Cdd:TIGR04523  400 IQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIknnseiKDLTNQDsvkeliiKNLDNTRESLETQLKVLSRSink 479
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1659 LEEEYEDKQKALREKrelESKLSTLSDQVNQrdfeSEKRLrKDLKRTKALLADAQIMLDHLKNNapSKREIAQLKNQLEE 1738
Cdd:TIGR04523  480 IKQNLEQKQKELKSK---EKELKKLNEEKKE----LEEKV-KDLTKKISSLKEKIEKLESEKKE--KESKISDLEDELNK 549
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1739 SEFTcaaavkarkamevemedlhLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMA 1818
Cdd:TIGR04523  550 DDFE-------------------LKKENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEK 610
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1819 QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM------VDKSLVSRQEA--KIRELETRL-EFEKTQVKRLENLA 1889
Cdd:TIGR04523  611 KISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVkqiketIKEIRNKWPEIikKIKESKTKIdDIIELMKDWLKELS 690
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1890 SRLKETMEKLTEERDqraaaENREKEQNKRLQRQLRDTKEEMSEL 1934
Cdd:TIGR04523  691 LHYKKYITRMIRIKD-----LPKLEEKYKEIEKELKKLDEFSKEL 730
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1292-1937 1.46e-24

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 113.23  E-value: 1.46e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLD----AETAERL-RT 1366
Cdd:TIGR02168  366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEeaelKELQAELeEL 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1367 EKEMKELQTQYDALKKQMEVMEMEVMEARLIR------AAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRL---QQEL 1437
Cdd:TIGR02168  446 EEELEELQEELERLEEALEELREELEEAEQALdaaereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLsgiLGVL 525
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLG-DLQA----DSDESQRALQQLKK--------------KCQRLTAELQDTKLHLEGQQVRNH 1498
Cdd:TIGR02168  526 SELISVDEGYEAAIEAALGgRLQAvvveNLNAAKKAIAFLKQnelgrvtflpldsiKGTEIQGNDREILKNIEGFLGVAK 605
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1499 ELEKKQRRFDSELS------------QAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAE 1566
Cdd:TIGR02168  606 DLVKFDPKLRKALSyllggvlvvddlDNALELAKKLRPGYRIVTLDGDLVRPGGVITGGSAKTNSSILERRREIEELEEK 685
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1567 LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERmrqtHSKEMESRDEEVEEArq 1646
Cdd:TIGR02168  686 IEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQ----LEERIAQLSKELTEL-- 759
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1647 scQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSK 1726
Cdd:TIGR02168  760 --EAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR-EALDELRAELTLLNEEAANLRERLESLERRIAAT 836
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1727 -REIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKK 1805
Cdd:TIGR02168  837 eRRLEDLEEQIEELS-------EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESK 909
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1806 HKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQsQVEFleqsMVDKSLVSRQEAKIRELETRLEFEKTQVKRL 1885
Cdd:TIGR02168  910 RSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEY-SLTL----EEAEALENKIEDDEEEARRRLKRLENKIKEL 984
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1886 E--NLAS-----RLKETMEKLTEERDQRAAAENrekeqnkrlqrQLRDTKEEMSELARK 1937
Cdd:TIGR02168  985 GpvNLAAieeyeELKERYDFLTAQKEDLTEAKE-----------TLEEAIEEIDREARE 1032
PTZ00121 PTZ00121
MAEBL; Provisional
1217-2006 9.02e-24

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 111.00  E-value: 9.02e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1217 RAGTLARLEEQRDEQTSRHLTLF---QAACRGYLARQHFKKRKIQDlAIRCVQ----KNIKKNKGVKDWPWWKLFTTVRP 1289
Cdd:PTZ00121  1135 KAEDARKAEEARKAEDAKRVEIArkaEDARKAEEARKAEDAKKAEA-ARKAEEvrkaEELRKAEDARKAEAARKAEEERK 1213
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1290 LIQVQLSEEQIR----NKDEEIQQLRSKLEKVEKERNE---LRLSSDRLETRISELTSELTDERNTGESASQLLDAETAE 1362
Cdd:PTZ00121  1214 AEEARKAEDAKKaeavKKAEEAKKDAEEAKKAEEERNNeeiRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1363 RLRTEKEMKELqtqyDALKKQMEVMEMEVMEARliRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKME 1442
Cdd:PTZ00121  1294 EAKKAEEKKKA----DEAKKKAEEAKKADEAKK--KAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAE 1367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1443 VEQQSRRQLERRLGDLQADSDESQRAlQQLKKKCQrltaelQDTKlhlEGQQVRNHELEKKQRrfdSELSQAHEETQREK 1522
Cdd:PTZ00121  1368 AAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAE------EDKK---KADELKKAAAAKKKA---DEAKKKAEEKKKAD 1434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1523 LQREKLQREKdmllaEAFSLKQQMEEKDldiagftqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEEL 1602
Cdd:PTZ00121  1435 EAKKKAEEAK-----KADEAKKKAEEAK-------------KAEEAKKKAEEAKKADEAKKKAEEAKKAD-EAKKKAEEA 1495
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1603 DEQAGSIQMLEQAKLRLEmEMERMRQTHSKEMESRDEE---VEEARQSCQKKlKQMEVQLEEEY---EDKQKALREKREL 1676
Cdd:PTZ00121  1496 KKKADEAKKAAEAKKKAD-EAKKAEEAKKADEAKKAEEakkADEAKKAEEKK-KADELKKAEELkkaEEKKKAEEAKKAE 1573
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1677 ESKLSTL-----SDQVNQRDFESEKRLRKDLKRTKALLA----DAQIMLDHLKNNAPSKREIAQLKNQLEES-------- 1739
Cdd:PTZ00121  1574 EDKNMALrkaeeAKKAEEARIEEVMKLYEEEKKMKAEEAkkaeEAKIKAEELKKAEEEKKKVEQLKKKEAEEkkkaeelk 1653
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1740 ----EFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTalEEQLSRLQREK---NEIQNRLEEDQEDMNELMKKHKAAVAQ 1812
Cdd:PTZ00121  1654 kaeeENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA--AEALKKEAEEAkkaEELKKKEAEEKKKAEELKKAEEENKIK 1731
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1813 ASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLAS-- 1890
Cdd:PTZ00121  1732 AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANii 1811
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1891 --------RLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSL 1962
Cdd:PTZ00121  1812 eggkegnlVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIE 1891
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....
gi 1907082211 1963 QADLKlafkrigDLQAAIEDEMESDENEDLINSLQDmVTKYQKK 2006
Cdd:PTZ00121  1892 KIDKD-------DIEREIPNNNMAGKNNDIIDDKLD-KDEYIKR 1927
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1413-1981 9.71e-24

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 110.26  E-value: 9.71e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1413 GEWRLKYERAVREVDFTKKRLQQELE-----DKMEVEQQSRRQ-LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDT 1486
Cdd:pfam01576   29 KELEKKHQQLCEEKNALQEQLQAETElcaeaEEMRARLAARKQeLEEILHELESRLEEEEERSQQLQNEKKKMQQHIQDL 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1487 KLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMllaeafslkqqmeekdldiagftqkvvsLEAE 1566
Cdd:pfam01576  109 EEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSKLSKERKL----------------------------LEER 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1567 LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSkEMESRDEEVEEARQ 1646
Cdd:pfam01576  161 ISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIA-ELQAQIAELRAQLA 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1647 SCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLsdqvnQRDFESEKRLRKDLKRTK-----ALLADAQIMLDHLKN 1721
Cdd:pfam01576  240 KKEEELQAALARLEEETAQKNNALKKIRELEAQISEL-----QEDLESERAARNKAEKQRrdlgeELEALKTELEDTLDT 314
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1722 NAP-----SKR--EIAQLKNQLEESEFTCAAAVKA-RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLE 1793
Cdd:pfam01576  315 TAAqqelrSKReqEVTELKKALEEETRSHEAQLQEmRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELR 394
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1794 EDQEDMNELMKKHKAAvaqasrdMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLeqsmvdKSLVSRQEAKIRELEt 1873
Cdd:pfam01576  395 TLQQAKQDSEHKRKKL-------EGQLQELQARLSESERQRAELAEKLSKLQSELESV------SSLLNEAEGKNIKLS- 460
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1874 rlefektqvKRLENLASRLKETMEKLTEERDQRAAAENR----EKEQNKrLQRQLRDTKE-------EMSELARKEAEAS 1942
Cdd:pfam01576  461 ---------KDVSSLESQLQDTQELLQEETRQKLNLSTRlrqlEDERNS-LQEQLEEEEEakrnverQLSTLQAQLSDMK 530
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 1907082211 1943 RKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIE 1981
Cdd:pfam01576  531 KKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYD 569
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1221-1906 3.72e-23

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 108.10  E-value: 3.72e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1221 LARLEEQRdEQTSRHLTLfQAACRGYLARQHFKKRKIQDLAIRCVQKNIKKNKgvkdwpwwklfttvrplIQVQLSEEQI 1300
Cdd:COG1196    202 LEPLERQA-EKAERYREL-KEELKELEAELLLLKLRELEAELEELEAELEELE-----------------AELEELEAEL 262
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1301 RNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDAL 1380
Cdd:COG1196    263 AELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL 342
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1381 KKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREvdftkkrlQQELEDKMEVEQQSRRQLERRLGDLQA 1460
Cdd:COG1196    343 EEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEE--------LLEALRAAAELAAQLEELEEAEEALLE 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1461 DSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAF 1540
Cdd:COG1196    415 RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLL 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1541 SLKQQMEEKDLDIAGftqkvvSLEAELQDISSQESKDEASLAKVKKQLRD-----LEAKVKDQEEELDEQAGS-IQMLEQ 1614
Cdd:COG1196    495 LLLEAEADYEGFLEG------VKAALLLAGLRGLAGAVAVLIGVEAAYEAaleaaLAAALQNIVVEDDEVAAAaIEYLKA 568
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1615 AKL-RLE-MEMERMRQTHSKEMESRDEEVEEARqscqkklkqmevqLEEEYEDKQKALREKRELESKLSTLSDQvnqrdf 1692
Cdd:COG1196    569 AKAgRATfLPLDKIRARAALAAALARGAIGAAV-------------DLVASDLREADARYYVLGDTLLGRTLVA------ 629
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1693 eseKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaaVKARKAMEVEMEDLHLQIDDIAKAKT 1772
Cdd:COG1196    630 ---ARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAAL------LEAEAELEELAERLAEEELELEEALL 700
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1773 ALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLE 1852
Cdd:COG1196    701 AEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEALG 780
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1853 QsmVDksLVSRQEAKirELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQR 1906
Cdd:COG1196    781 P--VN--LLAIEEYE--ELEERYDFLSEQREDLEEARETLEEAIEEIDRETRER 828
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1397-2011 5.92e-23

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 107.46  E-value: 5.92e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1397 IRAAEINGEVDDDDA---------GGEwrlKYERA-------VREVDFTKKRLQQELEDKMEVEQQSRRQlERRLGDLQA 1460
Cdd:PRK03918   132 IRQGEIDAILESDESrekvvrqilGLD---DYENAyknlgevIKEIKRRIERLEKFIKRTENIEELIKEK-EKELEEVLR 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1461 DSDESQRALQQLKKKCQRLTAELQD---TKLHLEGQQVRNHELEKKQRRFDSELSQAhEETQREKLQREKLQREKDMLLA 1537
Cdd:PRK03918   208 EINEISSELPELREELEKLEKEVKEleeLKEEIEELEKELESLEGSKRKLEEKIREL-EERIEELKKEIEELEEKVKELK 286
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1538 EAfslkQQMEEKDLDIAGFTQKVVSleaELQDISSQESKDEASLAKVKKQLRDLE---AKVKDQEEELDEQAGSIQMLEQ 1614
Cdd:PRK03918   287 EL----KEKAEEYIKLSEFYEEYLD---ELREIEKRLSRLEEEINGIEERIKELEekeERLEELKKKLKELEKRLEELEE 359
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1615 AKLRLEMEMERMRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQrdFES 1694
Cdd:PRK03918   360 RHELYEEAKAKKEELERLKKRLTGLTPEKL----EKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEE--LKK 433
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1695 EKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcAAAVKARKAMEVEMEDLHLQidDIAKAKTAL 1774
Cdd:PRK03918   434 AKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLR---KELRELEKVLKKESELIKLK--ELAEQLKEL 508
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1775 EEQLSRLQREKNEiqnRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS 1854
Cdd:PRK03918   509 EEKLKKYNLEELE---KKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFE 585
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1855 MVDKSlvsrqEAKIRELET---RLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEM 1931
Cdd:PRK03918   586 SVEEL-----EERLKELEPfynEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE 660
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1932 SElarkeaEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIED----EMESDENEDLINSLQDMVTKYQKKK 2007
Cdd:PRK03918   661 YE------ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEErekaKKELEKLEKALERVEELREKVKKYK 734

                   ....
gi 1907082211 2008 NKLE 2011
Cdd:PRK03918   735 ALLK 738
MYSc_Myo24A cd14937
class XXIV A myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a ...
467-886 6.77e-23

class XXIV A myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a coiled-coil region in their C-terminal tail. The function of the class XXIV myosins remain elusive. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276897  Cd Length: 637  Bit Score: 106.25  E-value: 6.77e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  467 SSVLHTLRQRYGASLLHTYAGPSLLVLStrgaPAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14937      1 AEVLNMLALRYKKNYIYTIAEPMLISIN----PYQVIDVDINEYKNKNTNELPPHVYSYAKDAMTDFINTKTNQSIIISG 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLatIAGTSGTKVFSVEKWQAlSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQTM 626
Cdd:cd14937     77 ESGSGKTEASKLVIKYY--LSGVKEDNEISNTLWDS-NFILEAFGNAKTLKNNNSSRYGKYIKIELDEYQNIVSSSIEIF 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  627 LLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLNhlAENNVFGIVPLSKPEEKQKAAQQFSKLQAAMKVLAISpEE 706
Cdd:cd14937    154 LLENIRVVSQEEEERGYHIFYQIFNGMSQELKNKYKIR--SENEYKYIVNKNVVIPEIDDAKDFGNLMISFDKMNMH-DM 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  707 QKTCWLILASIYHLG-------AAGATKEAAEAGRKQFarhEWAQKAAYLLGCSLEELSSAIfkhqlkggTLQRSTSFRQ 779
Cdd:cd14937    231 KDDLFLTLSGLLLLGnveyqeiEKGGKTNCSELDKNNL---ELVNEISNLLGINYENLKDCL--------VFTEKTIANQ 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  780 GPEesglgegTKLSALECL---EGMASGLYSELFTLLISLVNRALKSSQHSLCSMMIVDTPGFQnpewggSARGASFEEL 856
Cdd:cd14937    300 KIE-------IPLSVEESVsicKSISKDLYNKIFSYITKRINNFLNNNKELNNYIGILDIFGFE------IFSKNSLEQL 366
                          410       420       430
                   ....*....|....*....|....*....|
gi 1907082211  857 CHNYAQDRLQRLFHERTFLQELERYKEDNI 886
Cdd:cd14937    367 LINIANEEIHSIYLYIVYEKETELYKAEDI 396
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1418-1940 1.02e-22

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 106.69  E-value: 1.02e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1418 KYERAVREVDFTKKRLQqELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRN 1497
Cdd:PRK03918   225 KLEKEVKELEELKEEIE-ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE 303
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1498 HELEKKQR--RFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFtQKVVSLEAELQDISSQES 1575
Cdd:PRK03918   304 EYLDELREieKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELY-EEAKAKKEELERLKKRLT 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1576 kdEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSK----EMESRDEEVEEARQSCQKK 1651
Cdd:PRK03918   383 --GLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcGRELTEEHRKELLEEYTAE 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1652 LKQMEVQLEEEYEDKQKALREKRELESKLstlsdqvnqrdfESEKRLRKDLKrtkalladaqiMLDHLKN--NAPSKREI 1729
Cdd:PRK03918   461 LKRIEKELKEIEEKERKLRKELRELEKVL------------KKESELIKLKE-----------LAEQLKEleEKLKKYNL 517
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1730 AQLKNQLEESEFTCAAAVKARKAMEVEMEDLHlQIDDIAKAKTALEEQLSRLQREKNEIQNRL--------EEDQEDMNE 1801
Cdd:PRK03918   518 EELEKKAEEYEKLKEKLIKLKGEIKSLKKELE-KLEELKKKLAELEKKLDELEEELAELLKELeelgfesvEELEERLKE 596
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1802 LMKKH------KAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLvSRQEAKIRELETRL 1875
Cdd:PRK03918   597 LEPFYneylelKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY-EELREEYLELSREL 675
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1876 EFEKTQVKRLENLASRLKETMEKLTEERDQRAAAE------NREKEQNKRLQRQLRDTKEEMSELARKEAE 1940
Cdd:PRK03918   676 AGLRAELEELEKRREEIKKTLEKLKEELEEREKAKkeleklEKALERVEELREKVKKYKALLKERALSKVG 746
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1298-1956 1.28e-22

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 105.87  E-value: 1.28e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESasqlLDAET---AERLRTEKEMK-EL 1373
Cdd:TIGR04523   47 NELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINK----LNSDLskiNSEIKNDKEQKnKL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1374 QTQYDALKKQMEVMEMEVmearliraAEINGEV-DDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ------Q 1446
Cdd:TIGR04523  123 EVELNKLEKQKKENKKNI--------DKFLTEIkKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKnidkikN 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1447 SRRQLERRLGDLQAdsdesqralqqLKKKCQRLTAELQDTKlhlEGQQVRNHELEKKQRRFdSELSQAHEETQrEKLQRE 1526
Cdd:TIGR04523  195 KLLKLELLLSNLKK-----------KIQKNKSLESQISELK---KQNNQLKDNIEKKQQEI-NEKTTEISNTQ-TQLNQL 258
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1527 KLQREKDmllaeafslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQES-----KDEASLAKVKKQLRDLEAKVKDQEEE 1601
Cdd:TIGR04523  259 KDEQNKI---------KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQDWNKELKSELKNQEKKLEEIQNQ 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSIQMLEQAKLRLEMEmermRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLS 1681
Cdd:TIGR04523  330 ISQNNKIISQLNEQISQLKKE----LTNSESENSEKQRELEEK----QNEIEKLKKENQSYKQEIKNLESQINDLESKIQ 401
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1682 TLSDQVNQRD-----FESEKR-LRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTcaaavkarkamev 1755
Cdd:TIGR04523  402 NQEKLNQQKDeqikkLQQEKElLEKEIERLKETIIKNNSEIKDLTN------QDSVKELIIKNLDNT------------- 462
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1756 emedlhlqiddiakaKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASrdmaqmnDLQAQIEESNKEKQ 1835
Cdd:TIGR04523  463 ---------------RESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKK-------ELEEKVKDLTKKIS 520
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1836 ELQEKLQALQSQVEFLEQSMVDKslvsrqEAKIRELETRLEFE--KTQVKRLENLASRLKETMEKLTEERDQ-RAAAENR 1912
Cdd:TIGR04523  521 SLKEKIEKLESEKKEKESKISDL------EDELNKDDFELKKEnlEKEIDEKNKEIEELKQTQKSLKKKQEEkQELIDQK 594
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1913 EKEQNK-------------RLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLE 1956
Cdd:TIGR04523  595 EKEKKDlikeieekekkisSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIK 651
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1564-2022 6.43e-22

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 104.49  E-value: 6.43e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1564 EAELQDISSQESKDEASLAKVKKQLRDLEAKVkdqeEELDEQAGSIQMLEQAKLRLEMEMERMRQTHS----------KE 1633
Cdd:pfam01576    4 EEEMQAKEEELQKVKERQQKAESELKELEKKH----QQLCEEKNALQEQLQAETELCAEAEEMRARLAarkqeleeilHE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1634 MESRDEEVEEARQSCQ---KKLKQ----MEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTK 1706
Cdd:pfam01576   80 LESRLEEEEERSQQLQnekKKMQQhiqdLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLE-DQNSKLSKERKLLE 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1707 ALLADAQI----------MLDHLKNNAPSKreIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1776
Cdd:pfam01576  159 ERISEFTSnlaeeeekakSLSKLKNKHEAM--ISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQAQIAELRA 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1777 QLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVE-FLEQSM 1855
Cdd:pfam01576  237 QLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEdTLDTTA 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1856 VDKSLVSRQEAKIRELETRLEFE-KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSEL 1934
Cdd:pfam01576  317 AQQELRSKREQEVTELKKALEEEtRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTL 396
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1935 ARKEAEASRKKHELEMDLESLEA-ANQSLQADLKLAfKRIGDLQAAIEdemesdenedlinSLQDMVTKYQKKKNKLEGD 2013
Cdd:pfam01576  397 QQAKQDSEHKRKKLEGQLQELQArLSESERQRAELA-EKLSKLQSELE-------------SVSSLLNEAEGKNIKLSKD 462
                          490
                   ....*....|
gi 1907082211 2014 -SDVDSELED 2022
Cdd:pfam01576  463 vSSLESQLQD 472
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1296-2002 1.84e-21

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 102.41  E-value: 1.84e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKL----EKVEKERNELRLSSDRLETR---ISELTSELTDERNTGESASQLLDAETAERLRTEK 1368
Cdd:TIGR04523   73 SNNKIKILEQQIKDLNDKLkknkDKINKLNSDLSKINSEIKNDkeqKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEK 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1369 EMKELQTQYDALKKQmevmemevmearliraaeingevddddaggewrlkyeraVREVDFTKKRLQQELEDKMEVEQQSR 1448
Cdd:TIGR04523  153 ELEKLNNKYNDLKKQ---------------------------------------KEELENELNLLEKEKLNIQKNIDKIK 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERrlgdlqadsdeSQRALQQLKKKCQRLTaELQDTKLHLEGQQ-VRNHELEKKQRRFdSELSQAHEETQrEKLQREK 1527
Cdd:TIGR04523  194 NKLLK-----------LELLLSNLKKKIQKNK-SLESQISELKKQNnQLKDNIEKKQQEI-NEKTTEISNTQ-TQLNQLK 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1528 LQREKDmllaeafslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQES-----KDEASLAKVKKQLRDLEAKVKDQEEEL 1602
Cdd:TIGR04523  260 DEQNKI---------KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISdlnnqKEQDWNKELKSELKNQEKKLEEIQNQI 330
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1603 DEQAGSIQMLEQAKLRLEMEmermRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEEYEDKQKALREKRELESKLST 1682
Cdd:TIGR04523  331 SQNNKIISQLNEQISQLKKE----LTNSESENSEKQRELEEK----QNEIEKLKKENQSYKQEIKNLESQINDLESKIQN 402
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1683 LSDQVNQRD-----FESEKR-LRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTcaaavkarkameve 1756
Cdd:TIGR04523  403 QEKLNQQKDeqikkLQQEKElLEKEIERLKETIIKNNSEIKDLTN------QDSVKELIIKNLDNT-------------- 462
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1757 medlhlqiddiakaKTALEEQLSRLQREKNEIQNRLEEDQEdmnELMKKHKaavaqasrdmaqmndlqaQIEESNKEKQE 1836
Cdd:TIGR04523  463 --------------RESLETQLKVLSRSINKIKQNLEQKQK---ELKSKEK------------------ELKKLNEEKKE 507
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1837 LQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETRLEFEKTQVKRlenlaSRLKETMEKLTEERDQRAAAENREKE 1915
Cdd:TIGR04523  508 LEEKVKDLTKKISSLKEKIEKlESEKKEKESKISDLEDELNKDDFELKK-----ENLEKEIDEKNKEIEELKQTQKSLKK 582
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1916 QNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQ---SLQADLKLAFKRIGDLQAAIEDEMESDENE-- 1990
Cdd:TIGR04523  583 KQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEklsSIIKNIKSKKNKLKQEVKQIKETIKEIRNKwp 662
                          730
                   ....*....|..
gi 1907082211 1991 DLINSLQDMVTK 2002
Cdd:TIGR04523  663 EIIKKIKESKTK 674
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1290-1930 2.02e-21

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 102.74  E-value: 2.02e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1290 LIQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLE-----TRISELTSELTDERNTGESASQLLDAETAERl 1364
Cdd:TIGR00618  245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINrarkaAPLAAHIKAVTQIEQQAQRIHTELQSKMRSR- 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1365 rtekeMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGgeWRLKYERAVREVDFTKKrLQQELEDKMEVE 1444
Cdd:TIGR00618  324 -----AKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATS--IREISCQQHTLTQHIHT-LQQQKTTLTQKL 395
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLE--RRLGDLQADSDESQRALQQlkkkcqrltaelqdTKLHLEGQQVRNHELEKKQRRFDSElsQAHEETQREK 1522
Cdd:TIGR00618  396 QSLCKELDilQREQATIDTRTSAFRDLQG--------------QLAHAKKQQELQQRYAELCAAAITC--TAQCEKLEKI 459
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1523 LQREKLQ--REKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISS---QESKDEASLAKVKKQLRDLEAKVKD 1597
Cdd:TIGR00618  460 HLQESAQslKEREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIhpnPARQDIDNPGPLTRRMQRGEQTYAQ 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1598 QEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD---EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKR 1674
Cdd:TIGR00618  540 LETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNrskEDIPNLQNITVRLQDLTEKLSEAEDMLACEQHALLR 619
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1675 ELESKLSTLSDQVNQRDFESEKRLRK-DLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAM 1753
Cdd:TIGR00618  620 KLQPEQDLQDVRLHLQQCSQELALKLtALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLTYWKEML 699
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1754 EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKE 1833
Cdd:TIGR00618  700 AQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQTGAE 779
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1834 KQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQvKRLENLASRLKETMEKLTEERDQRAAAENRE 1913
Cdd:TIGR00618  780 LSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLV-QEEEQFLSRLEEKSATLGEITHQLLKYEECS 858
                          650
                   ....*....|....*..
gi 1907082211 1914 KEQNKRLQRQLRDTKEE 1930
Cdd:TIGR00618  859 KQLAQLTQEQAKIIQLS 875
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1288-2044 2.53e-21

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 102.36  E-value: 2.53e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1288 RPLIQVQLSEEQIRNKDEEIQqlRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTE 1367
Cdd:pfam02463  176 KKLIEETENLAELIIDLEELK--LQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1368 KEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQQS 1447
Cdd:pfam02463  254 ESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKE 333
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1448 RRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREK 1527
Cdd:pfam02463  334 KEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLEL 413
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1528 LQREKDMLLAEAFSL--KQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1605
Cdd:pfam02463  414 ARQLEDLLKEEKKEEleILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQ 493
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1606 AG-SIQMLEQ------AKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELES 1678
Cdd:pfam02463  494 KLeERSQKESkarsglKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTEL 573
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1679 KLStlSDQVNQRDFESEKRLRKDLK------RTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaavKARKA 1752
Cdd:pfam02463  574 PLG--ARKLRLLIPKLKLPLKSIAVleidpiLNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKE-------SAKAK 644
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1753 MEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNK 1832
Cdd:pfam02463  645 ESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLA 724
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1833 EKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENR 1912
Cdd:pfam02463  725 DRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELR 804
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1913 E--KEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENE 1990
Cdd:pfam02463  805 AleEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKL 884
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1991 DLINS---------LQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDD 2044
Cdd:pfam02463  885 KDELEskeekekeeKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADE 947
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1297-2007 1.28e-20

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 99.80  E-value: 1.28e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSE---LTDERNTGESASQLLD---AETAERL-RTEKE 1369
Cdd:pfam05483   98 EAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQEnkdLIKENNATRHLCNLLKetcARSAEKTkKYEYE 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1370 MKELQTQYDALKKQMEvmemevmeaRLIRAAEiNGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKmevEQQSRR 1449
Cdd:pfam05483  178 REETRQVYMDLNNNIE---------KMILAFE-ELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDK---EKQVSL 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1450 QL------ERRLGDLQADSDESQRALQQLKKKcqrltAELQDTKLHlegqqvrnhELEKKQRRFDSELSQAHEETQREKL 1523
Cdd:pfam05483  245 LLiqitekENKMKDLTFLLEESRDKANQLEEK-----TKLQDENLK---------ELIEKKDHLTKELEDIKMSLQRSMS 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1524 QREKLQREKDMLLAEAFSLKQ----QMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQE 1599
Cdd:pfam05483  311 TQKALEEDLQIATKTICQLTEekeaQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKS 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1600 EELDEQAG-----SIQMLEQAKLRLEMEM----ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKAL 1670
Cdd:pfam05483  391 SELEEMTKfknnkEVELEELKKILAEDEKlldeKKQFEKIAEELKGKEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYL 470
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1671 REKRELESKLstlsdqvnqrdfESEKrlrkdLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaaaVKAR 1750
Cdd:pfam05483  471 KEVEDLKTEL------------EKEK-----LKNIELTAHCDKLLLENKELTQEASDMTLELKKHQED--------IINC 525
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1751 KAMEvemEDLHLQIDDIAKAKTALEEQLSRLQRE----KNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQ 1826
Cdd:pfam05483  526 KKQE---ERMLKQIENLEEKEMNLRDELESVREEfiqkGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQ 602
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1827 IEESNKEKQELQEKLQALQsqveflEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKE----TMEKLTEE 1902
Cdd:pfam05483  603 IENKNKNIEELHQENKALK------KKGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEIEdkkiSEEKLLEE 676
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1903 RDQRAA----AENREKEQNKRLQRQLRD---------------TKEEMSELA---RKEAEASRKKHELEMDLESLEAANQ 1960
Cdd:pfam05483  677 VEKAKAiadeAVKLQKEIDKRCQHKIAEmvalmekhkhqydkiIEERDSELGlykNKEQEQSSAKAALEIELSNIKAELL 756
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082211 1961 SLQADLklafkrigdlqaaiedEMESDENEDLINSLQDMVTKYQKKK 2007
Cdd:pfam05483  757 SLKKQL----------------EIEKEEKEKLKMEAKENTAILKDKK 787
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1297-2024 2.61e-20

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 99.04  E-value: 2.61e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKL----EKVEKERNELRLSSDRLETRISELTSE---LTDERNTgESASQ----------LLDAE 1359
Cdd:pfam15921   77 ERVLEEYSHQVKDLQRRLnesnELHEKQKFYLRQSVIDLQTKLQEMQMErdaMADIRRR-ESQSQedlrnqlqntVHELE 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1360 TAERLRtEKEMKELQTQYDALKKQMEVMEMEVMEARLIRaaeingeVDDDDAGGEWRLKYERA----VREVDFTKKRLQQ 1435
Cdd:pfam15921  156 AAKCLK-EDMLEDSNTQIEQLRKMMLSHEGVLQEIRSIL-------VDFEEASGKKIYEHDSMstmhFRSLGSAISKILR 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1436 ELEDKMEVEQQSRRQLERRLGDLQADS-DESQRALQQLKKKCQRLTAE--LQDTKLHLEGQQVRNhELEKKQRRFDSELS 1512
Cdd:pfam15921  228 ELDTEISYLKGRIFPVEDQLEALKSESqNKIELLLQQHQDRIEQLISEheVEITGLTEKASSARS-QANSIQSQLEIIQE 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1513 QAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE 1592
Cdd:pfam15921  307 QARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDK---IEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLL 383
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1593 AKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthsKEMESRDEEVeearQSCQKKLKQMEVQLEEEYEDKQKALRE 1672
Cdd:pfam15921  384 ADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLR----RELDDRNMEV----QRLEALLKAMKSECQGQMERQMAAIQG 455
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1673 KRELESKLSTLSDQVnqrdfESEKRLrkdLKRTKALLADAQIMLDHlknnapSKREIAQLKNQLEESE----FTCAAAVK 1748
Cdd:pfam15921  456 KNESLEKVSSLTAQL-----ESTKEM---LRKVVEELTAKKMTLES------SERTVSDLTASLQEKEraieATNAEITK 521
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1749 ARKAMEVEMEDL-HLQIDDIAKAKTALEEQLSRLQ-REKNEIQNRLEEDQEDMNELmkkhkaaVAQASRDMAQMNDLQAQ 1826
Cdd:pfam15921  522 LRSRVDLKLQELqHLKNEGDHLRNVQTECEALKLQmAEKDKVIEILRQQIENMTQL-------VGQHGRTAGAMQVEKAQ 594
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1827 IE-ESNKEKQELQEKlqalqsqvefleqsmvdKSLVSRQEAKIRELETR---LEFEKTqvkRLENLASRLKETMEKLTEE 1902
Cdd:pfam15921  595 LEkEINDRRLELQEF-----------------KILKDKKDAKIRELEARvsdLELEKV---KLVNAGSERLRAVKDIKQE 654
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1903 RDQ-------RAAAENREKEQNKRLQRQLRDTKEEMSELARK-EAEASRKKHELEM---DLESLEAAN-------QSLQA 1964
Cdd:pfam15921  655 RDQllnevktSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKlKMQLKSAQSELEQtrnTLKSMEGSDghamkvaMGMQK 734
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1965 DLKLAFKRIGDLQAAI---EDEMESDENEDLI-----NSLQDMVTKYQKKKNKLEGDSDVDSELEDRV 2024
Cdd:pfam15921  735 QITAKRGQIDALQSKIqflEEAMTNANKEKHFlkeekNKLSQELSTVATEKNKMAGELEVLRSQERRL 802
MYSc_Myo44 cd14905
class XLIV myosin, motor domain; There is little known about the function of the myosin XLIV ...
468-913 5.65e-20

class XLIV myosin, motor domain; There is little known about the function of the myosin XLIV class. Members here include cellular slime mold Polysphondylium and soil-living amoeba Dictyostelium. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276870  Cd Length: 673  Bit Score: 97.09  E-value: 5.65e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST-RGAPAVYSEKVMHMFKgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLG 546
Cdd:cd14905      2 TLINIIQARYKKEIIYTYIGPILVSVNPlRYLPFLHSQELVRNYN--QRRGLPPHLFALAAKAISDMQDFRRDQLIFIGG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  547 SSGSGKTTSFQHLVQYLAT--IAGTSGTKVFSVEKwqalSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAGQVASASIQ 624
Cdd:cd14905     80 ESGSGKSENTKIIIQYLLTtdLSRSKYLRDYILES----GIILESFGHASTDSNHNSSRWGKYFEMFYSLYGEIQGAKLY 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  625 TMLLEKLRVARRPASEATFNVFYYLLAcgdatlrtelhlnhlaennvfGIVplskpeEKQKAAQQFSKLQaamkvlaisp 704
Cdd:cd14905    156 SYFLDENRVTYQNKGERNFHIFYQFLK---------------------GIT------DEEKAAYQLGDIN---------- 198
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  705 eeqktcwlilaSIYHLGAAGATKEAAEAGRKQFARhewaQKAAYLLGCSLEELSSAIFK-------------HQLKGGTL 771
Cdd:cd14905    199 -----------SYHYLNQGGSISVESIDDNRVFDR----LKMSFVFFDFPSEKIDLIFKtlsfiiilgnvtfFQKNGKTE 263
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  772 QRSTSFRQGPEESGLGEGTKL-------------SALECLEGMASGLYSELFTLLISLVNRALKSSQHSLcSMMIVDTPG 838
Cdd:cd14905    264 VKDRTLIESLSHNITFDSTKLenilisdrsmpvnEAVENRDSLARSLYSALFHWIIDFLNSKLKPTQYSH-TLGILDLFG 342
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  839 FQNPEWGGsargasFEELCHNYAQDRLQRLFHERTFLQELERYKEDNI----ELAFDDLE---PVADDSVAAVDQASHLV 911
Cdd:cd14905    343 QESSQLNG------YEQFSINFLEERLQQIYLQTVLKQEQREYQTERIpwmtPISFKDNEesvEMMEKIINLLDQESKNI 416

                   ..
gi 1907082211  912 RS 913
Cdd:cd14905    417 NS 418
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1552-1942 1.10e-19

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 97.06  E-value: 1.10e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1552 DIAG---FTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEELDEQAGSIQMLEQAKLRLEMEmermrq 1628
Cdd:TIGR02169  161 EIAGvaeFDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLR-REREKAERYQALLKEKREYEGYELLKEKE------ 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1629 thskEMESRDEEVEEARQSCQKKLKQMEVQLEEeyedkqkalREKR--ELESKLSTLSDQVNQRDFESEKRLRKDLKRTK 1706
Cdd:TIGR02169  234 ----ALERQKEAIERQLASLEEELEKLTEEISE---------LEKRleEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1707 ALLADAQimldhlknnapskREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKN 1786
Cdd:TIGR02169  301 AEIASLE-------------RSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELE 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1787 EIQNRLEEDQEDMNELMKKHKAAVaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKslvsrqEA 1866
Cdd:TIGR02169  368 DLRAELEEVDKEFAETRDELKDYR-------EKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGI------EA 434
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1867 KIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRaaaenreKEQNKRLQRQLRDTKeemSELARKEAEAS 1942
Cdd:TIGR02169  435 KINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDL-------KEEYDRVEKELSKLQ---RELAEAEAQAR 500
Motor_domain cd01363
Myosin and Kinesin motor domain; Myosin and Kinesin motor domain. These ATPases belong to the ...
498-616 1.35e-19

Myosin and Kinesin motor domain; Myosin and Kinesin motor domain. These ATPases belong to the P-loop NTPase family and provide the driving force in myosin and kinesin mediated processes. Some of the names do not match with what is given in the sequence list. This is because they are based on the current nomenclature by Kollmar/Sebe-Pedros.


Pssm-ID: 276814 [Multi-domain]  Cd Length: 170  Bit Score: 88.17  E-value: 1.35e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  498 APAVYSEKVMHMFKGCRREDMAPHIYAVAQTAYRAMLMSRQDQSIVLLGSSGSGKTTSFQHLVQYLATIAGTSGTK---- 573
Cdd:cd01363     11 LPIYRDSKIIVFYRGFRRSESQPHVFAIADPAYQSMLDGYNNQSIFAYGESGAGKTETMKGVIPYLASVAFNGINKgete 90
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211  574 ---------VFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQAG 616
Cdd:cd01363     91 gwvylteitVTLEDQILQANPILEAFGNAKTTRNENSSRFGKFIEILLDIAG 142
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1362-2011 2.67e-19

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 95.81  E-value: 2.67e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1362 ERLRTEKEM---KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERA---VREVDFTKKRLQQ 1435
Cdd:pfam02463  154 RRLEIEEEAagsRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEkleLEEEYLLYLDYLK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1436 ELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAH 1515
Cdd:pfam02463  234 LNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDE 313
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1516 EETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLaKVKKQLRDLEAKV 1595
Cdd:pfam02463  314 EKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKK-KLESERLSSAAKL 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1596 KDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQthsKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRE 1675
Cdd:pfam02463  393 KEEELELKSEEEKEAQLLLELARQLEDLLKEEK---KEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1676 LESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDhlKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAM-- 1753
Cdd:pfam02463  470 SEDLLKETQLVKLQEQ-LELLLSRQKLEERSQKESKARSGLK--VLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAis 546
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1754 ---EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDM----NELMKKHKAAVAQASRDMAQMNDLQAQ 1826
Cdd:pfam02463  547 tavIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAvleiDPILNLAQLDKATLEADEDDKRAKVVE 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1827 IEESNKEKQELQEKLQALQSQVefLEQSMVDKSLVSRQEAKIRELETRLEfekTQVKRLENLASRLKETMEKLTEERDQR 1906
Cdd:pfam02463  627 GILKDTELTKLKESAKAKESGL--RKGVSLEEGLAEKSEVKASLSELTKE---LLEIQELQEKAESELAKEEILRRQLEI 701
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1907 AAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRigdLQAAIEDEMES 1986
Cdd:pfam02463  702 KKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKS---ELSLKEKELAE 778
                          650       660
                   ....*....|....*....|....*
gi 1907082211 1987 DENEDLINSLQDMVTKYQKKKNKLE 2011
Cdd:pfam02463  779 EREKTEKLKVEEEKEEKLKAQEEEL 803
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1422-2003 3.31e-19

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 95.11  E-value: 3.31e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1422 AVREVDFTKKRLQQELEDkmEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELE 1501
Cdd:PRK02224   177 GVERVLSDQRGSLDQLKA--QIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1502 kkqrrfdsELSQAHEETQREKLQREklqREKDMLLAEAFSLKQQMEEkdldiagftqkvvsLEAELQDISSQESKDEASL 1581
Cdd:PRK02224   255 --------TLEAEIEDLRETIAETE---REREELAEEVRDLRERLEE--------------LEEERDDLLAEAGLDDADA 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 AKVKKQLRDLEAKVKDQEEELDEQAGSIQML-EQAK------LRLEMEMERMRqTHSKEMESRDEEVEEARQSCQKKLKQ 1654
Cdd:PRK02224   310 EAVEARREELEDRDEELRDRLEECRVAAQAHnEEAEslredaDDLEERAEELR-EEAAELESELEEAREAVEDRREEIEE 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1655 MEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqRDFESEKRLRKDLKRTKALLADAQIMLDHLKnnAPskrEIAQlkn 1734
Cdd:PRK02224   389 LEEEIEELRERFGDAPVDLGNAEDFLEELREERD-ELREREAELEATLRTARERVEEAEALLEAGK--CP---ECGQ--- 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1735 QLEESEFTCAAAVK--ARKAMEVEMEDLHLQIDDIAKAKTALEEqLSRLQREKNeiqnRLEEDQEDMNELMKKHKAAVaq 1812
Cdd:PRK02224   460 PVEGSPHVETIEEDreRVEELEAELEDLEEEVEEVEERLERAED-LVEAEDRIE----RLEERREDLEELIAERRETI-- 532
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1813 asrdmaqmNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKslvsRQEAKirELETRLEFEKTQVKRLENLASRL 1892
Cdd:PRK02224   533 --------EEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEA----REEVA--ELNSKLAELKERIESLERIRTLL 598
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1893 ------KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHE-----LEMDLESLEAANQS 1961
Cdd:PRK02224   599 aaiadaEDEIERLREKREALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEeyleqVEEKLDELREERDD 678
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|....
gi 1907082211 1962 LQadlklafKRIGDLQAAIEdEMES--DENEDLINSLQDMVTKY 2003
Cdd:PRK02224   679 LQ-------AEIGAVENELE-ELEElrERREALENRVEALEALY 714
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1429-2011 2.91e-18

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 92.34  E-value: 2.91e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1429 TKKRLQQELEDKMEVEQQSRRQLeRRLGDLQADSDESQRALQQLKKKCQRLTA-----ELQDTKLHLEGQQVRNHELEKK 1503
Cdd:TIGR00618  223 VLEKELKHLREALQQTQQSHAYL-TQKREAQEEQLKKQQLLKQLRARIEELRAqeavlEETQERINRARKAAPLAAHIKA 301
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1504 QRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQ--MEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASL 1581
Cdd:TIGR00618  302 VTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQrrLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHI 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 AKVKKQLRDLEAKVK---DQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthsKEMESRDEEVEEARQSCQKKLkQMEVQ 1658
Cdd:TIGR00618  382 HTLQQQKTTLTQKLQslcKELDILQREQATIDTRTSAFRDLQGQLAHAK----KQQELQQRYAELCAAAITCTA-QCEKL 456
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1659 LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL-------RKDLKRTKALLADAQIMLDHLKNNAPSKR---E 1728
Cdd:TIGR00618  457 EKIHLQESAQSLKEREQQLQTKEQIHLQETRKKAVVLARLlelqeepCPLCGSCIHPNPARQDIDNPGPLTRRMQRgeqT 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1729 IAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEdmnELMKKHKA 1808
Cdd:TIGR00618  537 YAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSE---AEDMLACE 613
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQ---------EAKIRELETRLEFEK 1879
Cdd:TIGR00618  614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVlpkellasrQLALQKMQSEKEQLT 693
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 TQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEA--------SRKKHELEMD 1951
Cdd:TIGR00618  694 YWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVlkarteahFNNNEEVTAA 773
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1952 L------ESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQdMVTKYQKKKNKLE 2011
Cdd:TIGR00618  774 LqtgaelSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCET-LVQEEEQFLSRLE 838
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1564-1989 3.71e-18

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 92.04  E-value: 3.71e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1564 EAELQDISSQESKD--EASLAKVKKQLRDLEAKVKDQEE--ELDEQagsiqmLEQAKLRLememermrqtHSKEMESRDE 1639
Cdd:TIGR02168  176 ETERKLERTRENLDrlEDILNELERQLKSLERQAEKAERykELKAE------LRELELAL----------LVLRLEELRE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1640 EVEEARQScqkkLKQMEVQLEEeyedkqkALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDHL 1719
Cdd:TIGR02168  240 ELEELQEE----LKEAEEELEE-------LTAELQELEEKLEELRLEVSELE-EEIEELQKELYALANEISRLEQQKQIL 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1720 KNnapskrEIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDM 1799
Cdd:TIGR02168  308 RE------RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQL 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1800 NELmkkhKAAVAQASRdmaQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMvdkslvsrQEAKIRELETRLEfek 1879
Cdd:TIGR02168  382 ETL----RSKVAQLEL---QIASLNNEIERLEARLERLEDRRERLQQEIEELLKKL--------EEAELKELQAELE--- 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 tqvkrlenlasRLKETMEKLTEERDQRAAAEnrekeqnKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAAN 1959
Cdd:TIGR02168  444 -----------ELEEELEELQEELERLEEAL-------EELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFS 505
                          410       420       430
                   ....*....|....*....|....*....|
gi 1907082211 1960 QSLQAdLKLAFKRIGDLQAAIEDEMESDEN 1989
Cdd:TIGR02168  506 EGVKA-LLKNQSGLSGILGVLSELISVDEG 534
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1729-1991 5.57e-18

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 91.15  E-value: 5.57e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1729 IAQLKNQLEESEftcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKA 1808
Cdd:COG1196    195 LGELERQLEPLE---RQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSL-VSRQEAKIRELETRLEFEKTQVKRLEN 1887
Cdd:COG1196    272 LRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEeLAELEEELEELEEELEELEEELEEAEE 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1888 LASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLK 1967
Cdd:COG1196    352 ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALA 431
                          250       260
                   ....*....|....*....|....
gi 1907082211 1968 LAFKRIGDLQAAIEDEMESDENED 1991
Cdd:COG1196    432 ELEEEEEEEEEALEEAAEEEAELE 455
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1402-1801 6.01e-18

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 91.27  E-value: 6.01e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1402 INGEVDDDDAGgewRLKYERAVREVDFTKKRLQQ---ELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQR 1478
Cdd:TIGR02168  661 ITGGSAKTNSS---ILERRREIEELEEKIEELEEkiaELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1479 LTAELQdtklhlegqqvrnhELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQ 1558
Cdd:TIGR02168  738 LEAEVE--------------QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALRE 803
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1559 KVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD 1638
Cdd:TIGR02168  804 ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERA 883
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 eEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesekRLRKDLKRTK-ALLADAQIMLD 1717
Cdd:TIGR02168  884 -SLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLE--------GLEVRIDNLQeRLSEEYSLTLE 954
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1718 HLKNN--------APSKREIAQLKNQLEE---------SEFtcaAAVKARKamevemEDLHLQIDDIAKAKTALEEQLSR 1780
Cdd:TIGR02168  955 EAEALenkieddeEEARRRLKRLENKIKElgpvnlaaiEEY---EELKERY------DFLTAQKEDLTEAKETLEEAIEE 1025
                          410       420
                   ....*....|....*....|.
gi 1907082211 1781 LQReknEIQNRLEEDQEDMNE 1801
Cdd:TIGR02168 1026 IDR---EARERFKDTFDQVNE 1043
PDZ_canonical cd00136
canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs ...
220-308 1.39e-17

canonical PDZ domain; Canonical PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain. PDZ domains usually bind to short specific peptide sequences located at the C-terminal end of their partner proteins known as PDZ binding motifs. These domains can also interact with internal peptide motifs and certain lipids, and can take part in a head-to-tail oligomerization with other PDZ domains. The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. The canonical PDZ domain contains six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467153 [Multi-domain]  Cd Length: 81  Bit Score: 79.51  E-value: 1.39e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMLDRAPegqayrrVVHFAEPGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:cd00136      1 TVTLEKDPGGGLGFSIRGGKDGGGGI-------FVSRVEPG-GPAARDGRLRVGDRILEVNGVSLEGLTHEEAVELLKSA 72

                   ....*....
gi 1907082211  300 GDSVRLKVQ 308
Cdd:cd00136     73 GGEVTLTVR 81
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1253-1842 1.58e-17

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 89.69  E-value: 1.58e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1253 KKRKIQDLAIRCVQKNIKKNKgvkdwpwwKLFTTVRPLIQVQLS-EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRL 1331
Cdd:TIGR04523  194 NKLLKLELLLSNLKKKIQKNK--------SLESQISELKKQNNQlKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKI 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1332 ETRISELTSELtderntgESASQLLDaetaerlRTEKEMKELQTQYDALKKQmevmemevmearliRAAEINGEVDDDDA 1411
Cdd:TIGR04523  266 KKQLSEKQKEL-------EQNNKKIK-------ELEKQLNQLKSEISDLNNQ--------------KEQDWNKELKSELK 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1412 GGEWRL--------KYERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKkcqrLTAEL 1483
Cdd:TIGR04523  318 NQEKKLeeiqnqisQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKN----LESQI 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1484 QDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDmllaeafSLKQQMEEKDLDIAGFTQKVVSL 1563
Cdd:TIGR04523  394 NDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIK-------DLTNQDSVKELIIKNLDNTRESL 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1564 EAELQDISSQESKDEASLAKVKKQLrdleakvKDQEEELDeqagsiqMLEQAKLRLEMEMermrqthsKEMESRdeevee 1643
Cdd:TIGR04523  467 ETQLKVLSRSINKIKQNLEQKQKEL-------KSKEKELK-------KLNEEKKELEEKV--------KDLTKK------ 518
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1644 arqscQKKLKQMEVQLEeeyedkqkalREKRELESKLSTLSDQVNQRDFESEKRLRKD-----------LKRTKALLADA 1712
Cdd:TIGR04523  519 -----ISSLKEKIEKLE----------SEKKEKESKISDLEDELNKDDFELKKENLEKeideknkeieeLKQTQKSLKKK 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1713 QIMLDHLKNNapSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNR- 1791
Cdd:TIGR04523  584 QEEKQELIDQ--KEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKw 661
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1792 ---------LEEDQEDMNELMKKHK----AAVAQASRDMAQMNDLQaQIEESNKEKQELQEKLQ 1842
Cdd:TIGR04523  662 peiikkikeSKTKIDDIIELMKDWLkelsLHYKKYITRMIRIKDLP-KLEEKYKEIEKELKKLD 724
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1288-2017 2.87e-17

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 89.34  E-value: 2.87e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1288 RPLIQVQLSEEQIRNKDEEIQ--QLRSKLEKVEKE-------RNELRLSSDRLETR---ISELTSELTDERNTGESASQL 1355
Cdd:TIGR00606  351 RLQLQADRHQEHIRARDSLIQslATRLELDGFERGpfserqiKNFHTLVIERQEDEaktAAQLCADLQSKERLKQEQADE 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1356 LDAETAERLRT--------EKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVR--- 1424
Cdd:TIGR00606  431 IRDEKKGLGRTielkkeilEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQnek 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1425 -EVDFTKKRLQQELE-----------------DKMEVEQQSRRQLERRLGDLQA------DSDESQRALQQLKKKCQRLT 1480
Cdd:TIGR00606  511 aDLDRKLRKLDQEMEqlnhhtttrtqmemltkDKMDKDEQIRKIKSRHSDELTSllgyfpNKKQLEDWLHSKSKEINQTR 590
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1481 AELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHE-----------ETQREKLQR--EKLQREKDMLLAEAFSLKQQME 1547
Cdd:TIGR00606  591 DRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDklfdvcgsqdeESDLERLKEeiEKSSKQRAMLAGATAVYSQFIT 670
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1548 EKDLDIAG---FTQKVVSLEAELQDISsqeSKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEME 1624
Cdd:TIGR00606  671 QLTDENQSccpVCQRVFQTEAELQEFI---SDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIP 747
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1625 RMRQTHSK------EMESRDEEVEEARQSCQKKLKQMEV---------QLEEEYEDKQKALREK-RELESKLSTLS-DQV 1687
Cdd:TIGR00606  748 ELRNKLQKvnrdiqRLKNDIEEQETLLGTIMPEEESAKVcltdvtimeRFQMELKDVERKIAQQaAKLQGSDLDRTvQQV 827
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1688 NQRDFESEKRLRK---DLKRTKALLADAQIMLDHLKNNApskreiaqlkNQLEESEFTCAAAVKARKAME-------VEM 1757
Cdd:TIGR00606  828 NQEKQEKQHELDTvvsKIELNRKLIQDQQEQIQHLKSKT----------NELKSEKLQIGTNLQRRQQFEeqlvelsTEV 897
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEED----QEDMNELMKKHKAAVA---------------QASRDMA 1818
Cdd:TIGR00606  898 QSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSnkkaQDKVNDIKEKVKNIHGymkdienkiqdgkddYLKQKET 977
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1819 QMNDLQAQIEESNKEKQELQEKLQALQSQVE---FLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLasRLKET 1895
Cdd:TIGR00606  978 ELNTVNAQLEECEKHQEKINEDMRLMRQDIDtqkIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVL--QMKQE 1055
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1896 MEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAsrKKHELEMDLESLEAANQSL--------QADLK 1967
Cdd:TIGR00606 1056 HQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEE--KYREMMIVMRTTELVNKDLdiyyktldQAIMK 1133
                          810       820       830       840       850       860
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1968 LAFKRIGDLQAAIED--------------EMESDENEDLinSLQDMVTKYQKKKNKLEGDSDVD 2017
Cdd:TIGR00606 1134 FHSMKMEEINKIIRDlwrstyrgqdieyiEIRSDADENV--SASDKRRNYNYRVVMLKGDTALD 1195
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1298-1785 1.37e-16

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 86.36  E-value: 1.37e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELrlssDRLETRISELTSELTDERNTGESASQLLDAET--AERLRTEKEMKELQT 1375
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEEL----EELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPE 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALKKQMEVMEmevmearliraaeingevddddaggEWRLKYERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRL 1455
Cdd:COG4717    147 RLEELEERLEELR-------------------------ELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEEL 201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1456 GDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQvrnhelekkqrrfdselsqaheetqreklQREKLQREKDML 1535
Cdd:COG4717    202 EELQQRLAELEEELEEAQEELEELEEELEQLENELEAAA-----------------------------LEERLKEARLLL 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1536 LAEA-----FSLKQQMEEKDLDIAGFTQKVVSLeaeLQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1610
Cdd:COG4717    253 LIAAallalLGLGGSLLSLILTIAGVLFLVLGL---LALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALG 329
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1611 MLEQAKL-RLEMEMERMRQTHSKEMESRDEE----VEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSD 1685
Cdd:COG4717    330 LPPDLSPeELLELLDRIEELQELLREAEELEeelqLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEE 409
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1686 QVNQR--------DFESEKRLRKDLKRTKALLADAQIMLDHLknnapsKREIAQLKNQLE--ESEFTCAAAVKARKAMEV 1755
Cdd:COG4717    410 QLEELlgeleellEALDEEELEEELEELEEELEELEEELEEL------REELAELEAELEqlEEDGELAELLQELEELKA 483
                          490       500       510
                   ....*....|....*....|....*....|
gi 1907082211 1756 EMEDLHLQIDDIAKAKTALEEQLSRLQREK 1785
Cdd:COG4717    484 ELRELAEEWAALKLALELLEEAREEYREER 513
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1297-1778 1.50e-16

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 86.77  E-value: 1.50e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISEL---TSELTDERNTGESAsqlldaetaeRLRTEKEMKEL 1373
Cdd:pfam01576  656 ERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEMktqLEELEDELQATEDA----------KLRLEVNMQAL 725
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1374 QTQYDAlkkqmevmemevmearliraaEINGEvddDDAGGEWRLKYERAVREvdftkkrLQQELEDKMEVEQQ---SRRQ 1450
Cdd:pfam01576  726 KAQFER---------------------DLQAR---DEQGEEKRRQLVKQVRE-------LEAELEDERKQRAQavaAKKK 774
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1451 LERRLGDLQADSDESQR----ALQQLKK---KCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKL 1523
Cdd:pfam01576  775 LELDLKELEAQIDAANKgreeAVKQLKKlqaQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQEDLAASER 854
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1524 QREKLQREKDmllaeafslkqqmeekdldiagftqkvvsleaELQDISSQESKDEASLAKVKkqlRDLEAKVKDQEEELD 1603
Cdd:pfam01576  855 ARRQAQQERD--------------------------------ELADEIASGASGKSALQDEK---RRLEARIAQLEEELE 899
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1604 EQAGSIQMLEQAKLRLEMEMERMRQTHSKEmESRDEEVEEARQSCQKKLKQMEVQL-EEEYEDKQKALREKRELESKLST 1682
Cdd:pfam01576  900 EEQSNTELLNDRLRKSTLQVEQLTTELAAE-RSTSQKSESARQQLERQNKELKAKLqEMEGTVKSKFKSSIAALEAKIAQ 978
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1683 LSDQVNQrdfESEKRLR--KDLKRTKALLADAQIMLDHLKNNAPSKREIA--------QLKNQLEESEFTCAAAVKARKA 1752
Cdd:pfam01576  979 LEEQLEQ---ESRERQAanKLVRRTEKKLKEVLLQVEDERRHADQYKDQAekgnsrmkQLKRQLEEAEEEASRANAARRK 1055
                          490       500
                   ....*....|....*....|....*.
gi 1907082211 1753 MEVEMEDLHLQIDDIAKAKTALEEQL 1778
Cdd:pfam01576 1056 LQRELDDATESNESMNREVSTLKSKL 1081
MYSc_Myo24B cd14938
class XXIV B myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a ...
468-650 1.82e-16

class XXIV B myosin, motor domain; These myosins have a 1-2 IQ motifs in their neck and a coiled-coil region in their C-terminal tail. The functions of these myosins remain elusive. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276898 [Multi-domain]  Cd Length: 713  Bit Score: 86.04  E-value: 1.82e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLSTRGAPAVYSEKVMHMFKgCRR--EDMAPHIYAVAQTAYRAMLMSRQDQSIVLL 545
Cdd:cd14938      2 SVLYHLKERFKNNKFYTKMGPLLIFINPKINNNINNEETIEKYK-CIDciEDLSLNEYHVVHNALKNLNELKRNQSIIIS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  546 GSSGSGKTTSFQHLVQYLATIAGTSGTKVFSVEKWQAL---------------------STLLEAFGNSPTIMNGSATRF 604
Cdd:cd14938     81 GESGSGKSEIAKNIINFIAYQVKGSRRLPTNLNDQEEDnihneentdyqfnmsemlkhvNVVMEAFGNAKTVKNNNSSRF 160
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211  605 SQILSLDFDQAgQVASASIQTMLLEKLRVARRPASEATFNVFYYLL 650
Cdd:cd14938    161 SKFCTIHIENE-EIKSFHIKKFLLDKERLINRKANENSFNIFYYII 205
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1479-1979 1.82e-16

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 85.97  E-value: 1.82e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1479 LTAELQDTKLHLEGQQVRNHELEKKQRRfdsELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQ 1558
Cdd:COG4717     47 LLERLEKEADELFKPQGRKPELNLKELK---ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEK 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1559 KVvsleaELQDISSQESKDEASLAKVKKQLRDLEAkvkdQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRD 1638
Cdd:COG4717    124 LL-----QLLPLYQELEALEAELAELPERLEELEE----RLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQvnqrdfesekrlrKDLKRTKALLADAQIMLdh 1718
Cdd:COG4717    195 QDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALE-------------ERLKEARLLLLIAAALL-- 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1719 lknnapskrEIAQLKNQLEESEFTCAAAVKARKAMevemedLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQED 1798
Cdd:COG4717    260 ---------ALLGLGGSLLSLILTIAGVLFLVLGL------LALLFLLLAREKASLGKEAEELQALPALEELEEEELEEL 324
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1799 MNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKlQALQSQVEFLEQSMVDKslvsrqEAKIRELETRLEFE 1878
Cdd:COG4717    325 LAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLE-ELEQEIAALLAEAGVED------EEELRAALEQAEEY 397
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1879 KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQnkRLQRQLRDTKEEMSELARKEAEASRKKHELEMD--LESLE 1956
Cdd:COG4717    398 QELKEELEELEEQLEELLGELEELLEALDEEELEEELE--ELEEELEELEEELEELREELAELEAELEQLEEDgeLAELL 475
                          490       500
                   ....*....|....*....|...
gi 1907082211 1957 AANQSLQADLKLAFKRIGDLQAA 1979
Cdd:COG4717    476 QELEELKAELRELAEEWAALKLA 498
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1291-1807 2.31e-16

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 85.89  E-value: 2.31e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLET---------RISELTSELTDERNTGE-------SASQ 1354
Cdd:PRK03918   245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKElkekaeeyiKLSEFYEEYLDELREIEkrlsrleEEIN 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1355 LLDAETAERLRTEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEvddddaggewRLKYERAVREVDFTKKRLQ 1434
Cdd:PRK03918   325 GIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELE----------RLKKRLTGLTPEKLEKELE 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1435 QELEDKMEVEQQsRRQLERRLGDLQADSDESQRALQQLKK---KCQRLTAELQDtklhlegqqvrnHELEKKQRRFDSEL 1511
Cdd:PRK03918   395 ELEKAKEEIEEE-ISKITARIGELKKEIKELKKAIEELKKakgKCPVCGRELTE------------EHRKELLEEYTAEL 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1512 SQAHEETQREKLQREKLQREKDMLlaEAFSLKQQMEEKDLDIAgftQKVVSLEAELQDISSQE-SKDEASLAKVKKQLRD 1590
Cdd:PRK03918   462 KRIEKELKEIEEKERKLRKELREL--EKVLKKESELIKLKELA---EQLKELEEKLKKYNLEElEKKAEEYEKLKEKLIK 536
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LEAKVKDQEEELDEQagsiqmleqaklrlememermrqthsKEMESRDEEVEEARQSCQKKLKQMEVQLEEE----YEDK 1666
Cdd:PRK03918   537 LKGEIKSLKKELEKL--------------------------EELKKKLAELEKKLDELEEELAELLKELEELgfesVEEL 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1667 QKALREKRELESKLSTLSDQVnqRDFESEKRLRKDLKRTkalLADAQIMLDHLKNNAPSKR-EIAQLKNQLEESEFTCAA 1745
Cdd:PRK03918   591 EERLKELEPFYNEYLELKDAE--KELEREEKELKKLEEE---LDKAFEELAETEKRLEELRkELEELEKKYSEEEYEELR 665
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1746 AVKARKAMEV-----EMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNrLEEDQEDMNELMKKHK 1807
Cdd:PRK03918   666 EEYLELSRELaglraELEELEKRREEIKKTLEKLKEELEEREKAKKELEK-LEKALERVEELREKVK 731
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1570-2022 3.30e-16

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 85.07  E-value: 3.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1570 ISSQESKD---EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEmemERMRQTHSKEMESRD-------- 1638
Cdd:TIGR04523   28 ANKQDTEEkqlEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILE---QQIKDLNDKLKKNKDkinklnsd 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 -----EEVEEARQSCQKK---LKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesekrlrkDLKRTKALLA 1710
Cdd:TIGR04523  105 lskinSEIKNDKEQKNKLeveLNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYN------------DLKKQKEELE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1711 DAQIMLDHLKNNApsKREIAQLKNQLEESEFTCAAAVKARKamevEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQN 1790
Cdd:TIGR04523  173 NELNLLEKEKLNI--QKNIDKIKNKLLKLELLLSNLKKKIQ----KNKSLESQISELKKQNNQLKDNIEKKQQEINEKTT 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1791 RLEEDQEDMNELMKKHKaavaqasRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD------KSLVSRQ 1864
Cdd:TIGR04523  247 EISNTQTQLNQLKDEQN-------KIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQdwnkelKSELKNQ 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1865 EAKIRELETRLEFEKTQVKRLENLASRLKETMEKLT---EERDQRAAAENRE----KEQNKRLQRQLRDTKEEMSELARK 1937
Cdd:TIGR04523  320 EKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSEsenSEKQRELEEKQNEieklKKENQSYKQEIKNLESQINDLESK 399
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1938 EAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESD-ENEDLINSLQDMVTKYQKKKNKLEGD-SD 2015
Cdd:TIGR04523  400 IQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDsVKELIIKNLDNTRESLETQLKVLSRSiNK 479

                   ....*..
gi 1907082211 2016 VDSELED 2022
Cdd:TIGR04523  480 IKQNLEQ 486
PDZ_NHERF-like cd06768
PDZ domains of the Na+/H+ exchange regulatory cofactor (NHERF) family (NHERF1-4), and related ...
222-307 3.77e-16

PDZ domains of the Na+/H+ exchange regulatory cofactor (NHERF) family (NHERF1-4), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of the Na+/H+ exchange regulatory cofactor (NHERF) family of multi-PDZ-domain-containing scaffolding proteins (NHERF1-4), and related domains. The NHERF family includes NHERF1 (also known as EBP50), NHERF2 (also known as E3KARP; TKA-1; SIP-1), NHERF3 (also known as CAP70; CLAMP; Napi-Cap-1; PDZD1) and NHERF4 (also known as IKEPP; PDZK2; Napi-Cap-2). NHERF1 and NHERF2 have tandem PDZ domains (PDZ1-2); NHERF3 and NHERF4 have four PDZ domains (PDZ1-4). NHERFs are involved in the regulation of multiple receptors or transporters, such as type II sodium-phosphate cotransporter (Npt2a), purinergic P2Y1 receptor P2Y1R, the beta2-adrenergic receptor (beta2-AR), parathyroid hormone receptor type 1 (PTHR), the lysophosphatidic acid receptors (LPARs), sodium-hydrogen exchanger 3 (NHE3), and cystic fibrosis transmembrane conductance regulator (CFTR). NHERF-PDZ1 domain interaction partners include Npt2a, purinergic P2Y1 receptor, beta2-AR, CFTR, PTHR, NH3, G-protein-coupled receptor kinase 6 (GRK6A), platelet-derived growth factor receptor (PDGFR), B1 subunit of the H+ATPase, cholesterol, receptor for activated C-kinase RACK1, aquaporin 9, among others. The NHERF PDZ2 domain interacts with fewer proteins: NHERF1 PDZ2 binds Npt2a, PTHR, beta-catenin, aquaporin 9, and RACK1; NHERF2 PDZ2 binds LPA2, P2Y1R, and NHE3, cGMP-dependent protein kinase type II (cGKII). NHERF4 PDZ1 and PDZ4 bind the epithelial Ca(2+) channels TRPV5 and TRPV6. NHERF2/NHERF3 heterodimerization is mediated by PDZ domains of NHERF2 and the C-terminal PDZ domain recognition motif of NHERF3. NHERF4 regulates several transporters mediating influx of xenobiotics and nutrients in the small intestine. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This NHERF-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467249 [Multi-domain]  Cd Length: 80  Bit Score: 75.17  E-value: 3.77e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  222 ELQRRPTGdFGFSLRRttmlDRAPEGQaYRRVVhfaEPGaGTKDLAlGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGD 301
Cdd:cd06768      4 HLVKGPEG-YGFNLHA----EKGRPGH-FIREV---DPG-SPAERA-GLKDGDRLVEVNGENVEGESHEQVVEKIKASGN 72

                   ....*.
gi 1907082211  302 SVRLKV 307
Cdd:cd06768     73 QVTLLV 78
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1671-2027 1.11e-15

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 83.96  E-value: 1.11e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1671 REKRELESKLSTLS--DQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcaAAVK 1748
Cdd:TIGR02169  153 VERRKIIDEIAGVAefDRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGY--ELLK 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1749 ARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM-AQMNDLQAQI 1827
Cdd:TIGR02169  231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELeAEIASLERSI 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1828 EESNKEKQELQEKLQ-------ALQSQVEFLEQSMVDKSL--------VSRQEAKIRELETRLEFEKTQVKRLENLASRL 1892
Cdd:TIGR02169  311 AEKERELEDAEERLAkleaeidKLLAEIEELEREIEEERKrrdklteeYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1893 KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKR 1972
Cdd:TIGR02169  391 REKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQE 470
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1973 IGDLQA---AIEDEMESDENEdlinsLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGV 2027
Cdd:TIGR02169  471 LYDLKEeydRVEKELSKLQRE-----LAEAEAQARASEERVRGGRAVEEVLKASIQGV 523
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1291-1637 1.27e-15

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 83.57  E-value: 1.27e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLET-------RISELTSELTDERNTGESASQLLDAETAER 1363
Cdd:TIGR02168  698 KALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAeveqleeRIAQLSKELTELEAEIEELEERLEEAEEEL 777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1364 LRTEKEMKELQTQYDALKKQMEVMEMevmearliRAAEINGEVDD-DDAGGEWRLKYERAVREVDFTKKRLqQELEDKME 1442
Cdd:TIGR02168  778 AEAEAEIEELEAQIEQLKEELKALRE--------ALDELRAELTLlNEEAANLRERLESLERRIAATERRL-EDLEEQIE 848
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1443 VEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREK 1522
Cdd:TIGR02168  849 ELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE 928
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1523 LQREKLQREKDMLLaEAFSLKQQMeekdldiagftqkvvsleaELQDISSQESKDEASLAKVKKQLRDLEAKVK------ 1596
Cdd:TIGR02168  929 LRLEGLEVRIDNLQ-ERLSEEYSL-------------------TLEEAEALENKIEDDEEEARRRLKRLENKIKelgpvn 988
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211 1597 --------DQEEELDEQAGSIQMLEQAKLRLEMEMERMrqthSKEMESR 1637
Cdd:TIGR02168  989 laaieeyeELKERYDFLTAQKEDLTEAKETLEEAIEEI----DREARER 1033
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1294-1945 2.43e-15

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 82.71  E-value: 2.43e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1294 QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDaetaERLRTEKEMKEL 1373
Cdd:pfam02463  380 KLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQ----GKLTEEKEELEK 455
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1374 QTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELE--DKMEVEQQSRRQL 1451
Cdd:pfam02463  456 QELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVggRIISAHGRLGDLG 535
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1452 ERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQrrfDSELSQAHEETQREKLQREKLQRE 1531
Cdd:pfam02463  536 VAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLK---LPLKSIAVLEIDPILNLAQLDKAT 612
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1532 KDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQE------SKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1605
Cdd:pfam02463  613 LEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEeglaekSEVKASLSELTKELLEIQELQEKAESELAKE 692
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1606 AGSIQMLEQAKLRLEMEmERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSD 1685
Cdd:pfam02463  693 EILRRQLEIKKKEQREK-EELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSL 771
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1686 QVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQID 1765
Cdd:pfam02463  772 KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKL 851
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1766 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVaqasrdMAQMNDLQAQIEESNKEKQELQEKLQALQ 1845
Cdd:pfam02463  852 AEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEK------EEKKELEEESQKLNLLEEKENEIEERIKE 925
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1846 SQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLEN---LASRLKETMEKLTEERDQRAAAENREKEQN----K 1918
Cdd:pfam02463  926 EAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLakeELGKVNLMAIEEFEEKEERYNKDELEKERLeeekK 1005
                          650       660
                   ....*....|....*....|....*..
gi 1907082211 1919 RLQRQLRDTKEEMSELARKEAEASRKK 1945
Cdd:pfam02463 1006 KLIRAIIEETCQRLKEFLELFVSINKG 1032
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1689-2022 2.65e-15

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 82.80  E-value: 2.65e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1689 QRDFESEKRLRK---DLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaaavkarkamevemedlhLQID 1765
Cdd:TIGR02168  172 ERRKETERKLERtreNLDRLEDILNELERQLKSLERQAEKAERYKELKAELRE-----------------------LELA 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1766 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELmkkhkaavaqasrdmaqmndlQAQIEESNKEKQELQEKLQALQ 1845
Cdd:TIGR02168  229 LLVLRLEELREELEELQEELKEAEEELEELTAELQEL---------------------EEKLEELRLEVSELEEEIEELQ 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1846 SqvEFLEQSmvdkslvsrqeAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLR 1925
Cdd:TIGR02168  288 K--ELYALA-----------NEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1926 DTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQK 2005
Cdd:TIGR02168  355 SLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE 434
                          330
                   ....*....|....*..
gi 1907082211 2006 KKNKLEGDSDVDSELED 2022
Cdd:TIGR02168  435 LKELQAELEELEEELEE 451
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1432-1937 2.89e-15

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 82.66  E-value: 2.89e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1432 RLQQELEDKMEVEQQSRRQLERR--LGDLQADSDESQRALQQLKkkcqrlTAELQDTKLHLEGQQVRNHELEKKQRRFDS 1509
Cdd:COG4913    229 ALVEHFDDLERAHEALEDAREQIelLEPIRELAERYAAARERLA------ELEYLRAALRLWFAQRRLELLEAELEELRA 302
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDMLLAEAFS--------LKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASL 1581
Cdd:COG4913    303 ELARLEAELERLEARLDALREELDELEAQIRGnggdrleqLEREIERLERELEERERRRARLEALLAALGLPLPASAEEF 382
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 AKVKKQLR------------------DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKE---------- 1633
Cdd:COG4913    383 AALRAEAAallealeeelealeealaEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALAEAlgldeaelpf 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1634 ----MESRDEE-------------------VEEARQSC------QKKLKQmEVQLEEEYEDKQKALREK-------RELE 1677
Cdd:COG4913    463 vgelIEVRPEEerwrgaiervlggfaltllVPPEHYAAalrwvnRLHLRG-RLVYERVRTGLPDPERPRldpdslaGKLD 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1678 SKLSTLSDQVNQ---RDF-----ESEKRLRKDlkrTKALLADAQImldhlKNNApSKREIaQLKNQLEESEFTCAAAVKA 1749
Cdd:COG4913    542 FKPHPFRAWLEAelgRRFdyvcvDSPEELRRH---PRAITRAGQV-----KGNG-TRHEK-DDRRRIRSRYVLGFDNRAK 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1750 RKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNelmkkhkaaVAQASRDMAQMNDLQAQIEE 1829
Cdd:COG4913    612 LAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEID---------VASAEREIAELEAELERLDA 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1830 SNKEKQELQEKLQALQSQVEFLEQsmvdksLVSRQEAKIRELETRLEFEKTQVK----RLENLASRLKETMEKLTEERDQ 1905
Cdd:COG4913    683 SSDDLAALEEQLEELEAELEELEE------ELDELKGEIGRLEKELEQAEEELDelqdRLEAAEDLARLELRALLEERFA 756
                          570       580       590
                   ....*....|....*....|....*....|..
gi 1907082211 1906 RAAAENREKEQNKRLQRQLRDTKEEMSELARK 1937
Cdd:COG4913    757 AALGDAVERELRENLEERIDALRARLNRAEEE 788
MYSc_Myo20 cd14881
class XX myosin, motor domain; These class 20 myosins are primarily insect myosins with such ...
468-1190 2.89e-15

class XX myosin, motor domain; These class 20 myosins are primarily insect myosins with such members as Drosophila, Daphnia, and mosquitoes. These myosins contain a single IQ motif in the neck region. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276847 [Multi-domain]  Cd Length: 633  Bit Score: 82.08  E-value: 2.89e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  468 SVLHTLRQRYGASLLHTYAGPSLLVLST---RGAPAVYSEKvmhmfkgcRREDMAPHIYAVAQTAYRAMLMSRQDQSIVL 544
Cdd:cd14881      2 AVMKCLQARFYAKEFFTNVGPILLSVNPyrdVGNPLTLTST--------RSSPLAPQLLKVVQEAVRQQSETGYPQAIIL 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  545 LGSSGSGKTTSFQHLVQYLATIAGtSGTKVFSVEKWQALSTLLEAFGNSPTIMNGSATRFSQILSLDFDQaGQVASASIQ 624
Cdd:cd14881     74 SGTSGSGKTYASMLLLRQLFDVAG-GGPETDAFKHLAAAFTVLRSLGSAKTATNSESSRIGHFIEVQVTD-GALYRTKIH 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  625 TMLLEKLRVARRPASEATFNVFYYLLACGDATLRTELHLN--------HLAENNVFgivplskpEEKQKAAQQFSKLQAA 696
Cdd:cd14881    152 CYFLDQTRVIRPLPGEKNYHIFYQMLAGLSQEERVKLHLDgyspanlrYLSHGDTR--------QNEAEDAARFQAWKAC 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  697 MKVLAISpeeqktcWL----ILASIYHLG----AAGATKEAAEAGRKQFarhewaQKAAYLLGCSleelSSAIFKhqlkg 768
Cdd:cd14881    224 LGILGIP-------FLdvvrVLAAVLLLGnvqfIDGGGLEVDVKGETEL------KSVAALLGVS----GAALFR----- 281
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  769 GTLQRSTSFRQGPEESGLGEGTKLSALECLegmASGLYSELFTLLISLVNrALKSSQHSLC------SMMIVDTPGFQNP 842
Cdd:cd14881    282 GLTTRTHNARGQLVKSVCDANMSNMTRDAL---AKALYCRTVATIVRRAN-SLKRLGSTLGthatdgFIGILDMFGFEDP 357
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  843 ewggsaRGASFEELCHNYAQDRLQRLFHERTFLQELERYKEDNIELafdDLEPVADDSVAAVDqashLVRSLahadeARG 922
Cdd:cd14881    358 ------KPSQLEHLCINLCAETMQHFYNTHIFKSSIESCRDEGIQC---EVEVDYVDNVPCID----LISSL-----RTG 419
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  923 LLWLLEEEALVPGATEdalldrlfSYYGPQEGDKKGQSPLLRSSK--PRHFLLGHSHGTnwVEYNVAGWLnytkqnpatq 1000
Cdd:cd14881    420 LLSMLDVECSPRGTAE--------SYVAKIKVQHRQNPRLFEAKPqdDRMFGIRHFAGR--VVYDASDFL---------- 479
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1001 naprllqDSQKKIISNlflgragsatvlsgsiagleggSQLALRRATSMRKTFTTGMAavkkkslciQIKLQVDALIDTI 1080
Cdd:cd14881    480 -------DTNRDVVPD----------------------DLVAVFYKQNCNFGFATHTQ---------DFHTRLDNLLRTL 521
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1081 KRSKMHFVHCFLPVAEGWPGeprsassrrvsssseldlppgdpceagllQLDVSLLRAQLRGSRLLDAMRMYRQGYPDHM 1160
Cdd:cd14881    522 VHARPHFVRCIRSNTTETPN-----------------------------HFDRGTVVRQIRSLQVLETVNLMAGGYPHRM 572
                          730       740       750
                   ....*....|....*....|....*....|
gi 1907082211 1161 VFSEFRRRFDVLAPHLTKKHGRNYIVVDEK 1190
Cdd:cd14881    573 RFKAFNARYRLLAPFRLLRRVEEKALEDCA 602
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1301-1906 8.80e-15

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 80.85  E-value: 8.80e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1301 RNKDEEIQQLRSKLEkvEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDaETAERLrteKEMKELQTQYDAL 1380
Cdd:PRK02224   183 SDQRGSLDQLKAQIE--EKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRD-EADEVL---EEHEERREELETL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1381 KkqmevmemevmearliraAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELED---KMEVEQQSRRQLERRLGD 1457
Cdd:PRK02224   257 E------------------AEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDllaEAGLDDADAEAVEARREE 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1458 LQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDmlla 1537
Cdd:PRK02224   319 LEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIE---- 394
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1538 eafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD-------QEEELDEQAGSIQ 1610
Cdd:PRK02224   395 ---ELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAgkcpecgQPVEGSPHVETIE 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1611 MLEQAKLRLEMEMERMRQTHskemESRDEEVEEArqscqKKLKQMEVQLEEeyedkqkaLREKRELESKLSTlsdqvNQR 1690
Cdd:PRK02224   472 EDRERVEELEAELEDLEEEV----EEVEERLERA-----EDLVEAEDRIER--------LEERREDLEELIA-----ERR 529
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1691 DFESEKRLRKDLKRTKAlladaqimlDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVE----MEDLHLQIDD 1766
Cdd:PRK02224   530 ETIEEKRERAEELRERA---------AELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKEriesLERIRTLLAA 600
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1767 IAKAKTALE---EQLSRLQREKNEIQNRLEEDQEDMNELMKKH-KAAVAQASRDMAQMNDLQAQIEEsnkEKQELQEKLQ 1842
Cdd:PRK02224   601 IADAEDEIErlrEKREALAELNDERRERLAEKRERKRELEAEFdEARIEEAREDKERAEEYLEQVEE---KLDELREERD 677
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1843 ALQSQVefleqSMVDKSLvsrqeAKIRELETRLEFEKTQVKRLENL---ASRLKETMEKLTEERDQR 1906
Cdd:PRK02224   678 DLQAEI-----GAVENEL-----EELEELRERREALENRVEALEALydeAEELESMYGDLRAELRQR 734
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1726-1942 2.41e-14

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 77.50  E-value: 2.41e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1726 KREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKK 1805
Cdd:COG4942     33 QQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRA 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1806 -------HKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvdkslvsRQEAKIRELETRLEFE 1878
Cdd:COG4942    113 lyrlgrqPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRA---------ELEAERAELEALLAEL 183
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1879 KTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAS 1942
Cdd:COG4942    184 EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAG 247
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1610-2011 2.97e-14

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 78.66  E-value: 2.97e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1610 QMLEQaklRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQVNQ 1689
Cdd:COG4717     45 AMLLE---RLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEE-EYAELQEELEELEEELEELEAELEELREELEK 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1690 -RDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS----KREIAQLKNQLEESEFTCAAAVK-ARKAMEVEMEDLHLQ 1763
Cdd:COG4717    121 lEKLLQLLPLYQELEALEAELAELPERLEELEERLEElrelEEELEELEAELAELQEELEELLEqLSLATEEELQDLAEE 200
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1764 IDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHK-----------AAVA--------------------- 1811
Cdd:COG4717    201 LEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERlkearlllliaAALLallglggsllsliltiagvlf 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1812 ---------------QASRDMAQMNDLQAQIEESNKEKQELQEKLQALQ----SQVEFLEQSMVDKSLVSRQEAKIRELE 1872
Cdd:COG4717    281 lvlgllallflllarEKASLGKEAEELQALPALEELEEEELEELLAALGlppdLSPEELLELLDRIEELQELLREAEELE 360
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1873 TRLEFEKTQvKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEAsrkkhELEMDL 1952
Cdd:COG4717    361 EELQLEELE-QEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDEE-----ELEEEL 434
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1953 ESLEAANQSLQADLKLAFKRIGDLQAAIEdEMESDEnedlinSLQDMVTKYQKKKNKLE 2011
Cdd:COG4717    435 EELEEELEELEEELEELREELAELEAELE-QLEEDG------ELAELLQELEELKAELR 486
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1298-1934 3.74e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 78.80  E-value: 3.74e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELRLssDRLETRISELTSELTDERntgesasQLLDAETAERLRTEKEMKELQTQY 1377
Cdd:COG4913    255 EPIRELAERYAAARERLAELEYLRAALRL--WFAQRRLELLEAELEELR-------AELARLEAELERLEARLDALREEL 325
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1378 DALKKQMevmemevmearliraaeingevddDDAGGEwRLkyERAVREVdftkKRLQQELEDKmeveQQSRRQLERRLGD 1457
Cdd:COG4913    326 DELEAQI------------------------RGNGGD-RL--EQLEREI----ERLERELEER----ERRRARLEALLAA 370
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1458 LQADSDESQRALQQLKKKCQRLTAELQDtklHLEGQQVRNHELEKKQRRFDSELSQAHEE----TQREKLQREKLQREKD 1533
Cdd:COG4913    371 LGLPLPASAEEFAALRAEAAALLEALEE---ELEALEEALAEAEAALRDLRRELRELEAEiaslERRKSNIPARLLALRD 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1534 MLlAEAFSLKqqmeEKDLDIAGftqkvvsleaELQDISSQESK------------------DEASLAKVKKQL--RDLEA 1593
Cdd:COG4913    448 AL-AEALGLD----EAELPFVG----------ELIEVRPEEERwrgaiervlggfaltllvPPEHYAAALRWVnrLHLRG 512
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1594 KVKDQEEELDEQAGSIQMLEQAKL--RLEMEMERMRQTHSKEMESRD-----EEVEEARQ-------SCQKKL------K 1653
Cdd:COG4913    513 RLVYERVRTGLPDPERPRLDPDSLagKLDFKPHPFRAWLEAELGRRFdyvcvDSPEELRRhpraitrAGQVKGngtrheK 592
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1654 QMEVQLEEEY------EDKQKALREKR-ELESKLSTLSDQVNQRdfeseKRLRKDLKRTKALLADAQIMLDHLKNNAPSK 1726
Cdd:COG4913    593 DDRRRIRSRYvlgfdnRAKLAALEAELaELEEELAEAEERLEAL-----EAELDALQERREALQRLAEYSWDEIDVASAE 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1727 REIAQLKNQ---LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE----DQEDM 1799
Cdd:COG4913    668 REIAELEAElerLDASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAaedlARLEL 747
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1800 NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQveFLEQSMVDKSLVSRQEAKIRELETRLEfek 1879
Cdd:COG4913    748 RALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA--FNREWPAETADLDADLESLPEYLALLD--- 822
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1880 tqvkRLENlaSRLKETMEKLteeRDQRAAAENREKEQ-NKRLQRQLRDTKEEMSEL 1934
Cdd:COG4913    823 ----RLEE--DGLPEYEERF---KELLNENSIEFVADlLSKLRRAIREIKERIDPL 869
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1409-2021 5.19e-14

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 78.94  E-value: 5.19e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1409 DDAGGEWRLKYERAVREVDFTKKRLQQeledkmeveqqsrrQLERRLGDLQADSDESQRALQQLKKKCQRLTaELQDTKL 1488
Cdd:TIGR01612 1099 DDFGKEENIKYADEINKIKDDIKNLDQ--------------KIDHHIKALEEIKKKSENYIDEIKAQINDLE-DVADKAI 1163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1489 HLEGQQvrnhELEKKQRRFDSELSQA---HEETQreKLQREKLQREKDmllaeafslKQQMEE-KDLDIAgFTQKVVSLE 1564
Cdd:TIGR01612 1164 SNDDPE----EIEKKIENIVTKIDKKkniYDEIK--KLLNEIAEIEKD---------KTSLEEvKGINLS-YGKNLGKLF 1227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1565 aeLQDISSQESKDEASLAKVKKQLRDLEaKVKDQEEELDEQAGsIQMLEQAklrlemEMERMRQTHSKEM------ESRD 1638
Cdd:TIGR01612 1228 --LEKIDEEKKKSEHMIKAMEAYIEDLD-EIKEKSPEIENEMG-IEMDIKA------EMETFNISHDDDKdhhiisKKHD 1297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQscqKKLKQMEVQLEE-EYEDKQKAL--------REKRELESKLSTLSDQVNQRDFESEKRLRKDLKR-TKAL 1708
Cdd:TIGR01612 1298 ENISDIRE---KSLKIIEDFSEEsDINDIKKELqknlldaqKHNSDINLYLNEIANIYNILKLNKIKKIIDEVKEyTKEI 1374
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1709 LADAQIMLDHLKNnapSKREIAQLKNQLeeSEFTCAAAVKArkamEVEMEDLHLQIDDIAKAKTALEEQLSRLQ------ 1782
Cdd:TIGR01612 1375 EENNKNIKDELDK---SEKLIKKIKDDI--NLEECKSKIES----TLDDKDIDECIKKIKELKNHILSEESNIDtyfkna 1445
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1783 REKNE----IQNRLEEDQEDMNELMKKHKAavaQASRDMA-QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvd 1857
Cdd:TIGR01612 1446 DENNEnvllLFKNIEMADNKSQHILKIKKD---NATNDHDfNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQY--- 1519
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1858 KSLVSRQEAKIRELETRLEFEKTQvKRLENLASRLKETMEKLTEErdqraaAENREKEQNKRLQRQLRDTKEemselARK 1937
Cdd:TIGR01612 1520 KKDVTELLNKYSALAIKNKFAKTK-KDSEIIIKEIKDAHKKFILE------AEKSEQKIKEIKKEKFRIEDD-----AAK 1587
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1938 EAEASRKKHELEMDLESLEaaNQSLQ-ADLKlafKRIGDLQAAIED-------------EMESDENEDLINSLQDMVTKY 2003
Cdd:TIGR01612 1588 NDKSNKAAIDIQLSLENFE--NKFLKiSDIK---KKINDCLKETESiekkissfsidsqDTELKENGDNLNSLQEFLESL 1662
                          650       660
                   ....*....|....*....|..
gi 1907082211 2004 QKKKNKLEGDS----DVDSELE 2021
Cdd:TIGR01612 1663 KDQKKNIEDKKkeldELDSEIE 1684
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1470-2003 6.57e-14

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 78.09  E-value: 6.57e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1470 QQLKKKCQRLTAELQDTKLHLEGQQVRNHElekKQRRFDSELSQAHEETQREKLQREKLQREkDMLLAEAFSLKQQMEEK 1549
Cdd:TIGR00618  190 KSLHGKAELLTLRSQLLTLCTPCMPDTYHE---RKQVLEKELKHLREALQQTQQSHAYLTQK-REAQEEQLKKQQLLKQL 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1550 DLDIAGFT--QKVVSLEAELQDISSQESK---DEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEME 1624
Cdd:TIGR00618  266 RARIEELRaqEAVLEETQERINRARKAAPlaaHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRR 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1625 RMRQTHSKEMESRDE-EVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFE--SEKRLRKD 1701
Cdd:TIGR00618  346 LLQTLHSQEIHIRDAhEVATSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRtsAFRDLQGQ 425
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1702 LKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcaaAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1781
Cdd:TIGR00618  426 LAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ---SLKEREQQLQTKEQIHLQETRKKAVVLARLLELQEE 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1782 QREKNE-----------------IQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQAL 1844
Cdd:TIGR00618  503 PCPLCGscihpnparqdidnpgpLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRS 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1845 QSQVEFLEQSMVD------------KSLVSRQEAKIRELETRLE------FEKTQVKRLENLASRLKETMEKLTEERDQR 1906
Cdd:TIGR00618  583 KEDIPNLQNITVRlqdlteklseaeDMLACEQHALLRKLQPEQDlqdvrlHLQQCSQELALKLTALHALQLTLTQERVRE 662
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1907 AAAENREKEQNKRLQRQLRDTKEE-------------------MSELARKEAEASRKKHELEMDLESLEaanQSLQADLK 1967
Cdd:TIGR00618  663 HALSIRVLPKELLASRQLALQKMQsekeqltywkemlaqcqtlLRELETHIEEYDREFNEIENASSSLG---SDLAARED 739
                          570       580       590
                   ....*....|....*....|....*....|....*.
gi 1907082211 1968 LAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKY 2003
Cdd:TIGR00618  740 ALNQSLKELMHQARTVLKARTEAHFNNNEEVTAALQ 775
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
1627-2056 8.45e-14

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 77.37  E-value: 8.45e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1627 RQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTK 1706
Cdd:TIGR04523   31 QDTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINK-LNSDLSKIN 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1707 AlladaQIMLD-HLKNNapSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREK 1785
Cdd:TIGR04523  110 S-----EIKNDkEQKNK--LEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1786 NEIQN----------RLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSm 1855
Cdd:TIGR04523  183 LNIQKnidkiknkllKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDE- 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1856 vDKSLVSRQEAKIRELETrlefEKTQVKRLENLASRLKETMEKLteerdqraaaeNREKEQN--KRLQRQLRDTKEEMSE 1933
Cdd:TIGR04523  262 -QNKIKKQLSEKQKELEQ----NNKKIKELEKQLNQLKSEISDL-----------NNQKEQDwnKELKSELKNQEKKLEE 325
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1934 LarkeaeasrkkhelemdlesleaanqslqadlklafkrigdlqaaiedEMESDENEDLINSLQDMVTKYQKKKNKLEGD 2013
Cdd:TIGR04523  326 I------------------------------------------------QNQISQNNKIISQLNEQISQLKKELTNSESE 357
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1907082211 2014 -SDVDSELEDRVDGVKSWLSKNKGPSKAPSddgSLKSSRTALNS 2056
Cdd:TIGR04523  358 nSEKQRELEEKQNEIEKLKKENQSYKQEIK---NLESQINDLES 398
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1447-2012 8.64e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 77.65  E-value: 8.64e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1447 SRRQLERRLGDLQADSDESQRALQqlkkkcQRLTAelqdtKLHLEGQQ--------------------VRNHELEKKQ-- 1504
Cdd:COG4913    156 DIRALKARLKKQGVEFFDSFSAYL------ARLRR-----RLGIGSEKalrllhktqsfkpigdlddfVREYMLEEPDtf 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 RRFDS------ELSQAHEETQREKLQREKLQREKDmlLAEAF-SLKQQMEEKDLDIAGFTQKVVSLEAELQdissqeskd 1577
Cdd:COG4913    225 EAADAlvehfdDLERAHEALEDAREQIELLEPIRE--LAERYaAARERLAELEYLRAALRLWFAQRRLELL--------- 293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1578 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKL--------RLEMEMERMRQTHskemesrdEEVEEARQSCQ 1649
Cdd:COG4913    294 EAELEELRAELARLEAELERLEARLDALREELDELEAQIRgnggdrleQLEREIERLEREL--------EERERRRARLE 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1650 KKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQImLDHLKNNAPskREI 1729
Cdd:COG4913    366 ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIAS-LERRKSNIP--ARL 442
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1730 AQLKNQLEEseftcAAAVKARKA------MEVEMED--------------------------------------LHLQID 1765
Cdd:COG4913    443 LALRDALAE-----ALGLDEAELpfvgelIEVRPEEerwrgaiervlggfaltllvppehyaaalrwvnrlhlrGRLVYE 517
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1766 DI------AKAKTALEEQLSR-LQREKNEIQNRLE------------EDQEDmnelMKKHKAAVAQA-----SRDMAQMN 1821
Cdd:COG4913    518 RVrtglpdPERPRLDPDSLAGkLDFKPHPFRAWLEaelgrrfdyvcvDSPEE----LRRHPRAITRAgqvkgNGTRHEKD 593
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1822 DL-------------QAQIEESNKEKQELQEKLQALQSQVEFLEQSMvdKSLVSRQEAKIRELETRleFEKTQVKRLENL 1888
Cdd:COG4913    594 DRrrirsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAEL--DALQERREALQRLAEYS--WDEIDVASAERE 669
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1889 ASRLKETMEKLTEERDQRAAAENREKEQNKR---LQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEA-ANQSLQA 1964
Cdd:COG4913    670 IAELEAELERLDASSDDLAALEEQLEELEAEleeLEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDlARLELRA 749
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211 1965 DLKLAFkrigdlQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEG 2012
Cdd:COG4913    750 LLEERF------AAALGDAVERELRENLEERIDALRARLNRAEEELER 791
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1417-1969 2.26e-13

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 76.01  E-value: 2.26e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1417 LKYERAVREVDFTKKRLqqeLEDKMEVEQQSRRQLERRLGDLQaDSDESQRALQQL----KKKCQRLTAELQDTKLHLEG 1492
Cdd:pfam10174   41 LKKERALRKEEAARISV---LKEQYRVTQEENQHLQLTIQALQ-DELRAQRDLNQLlqqdFTTSPVDGEDKFSTPELTEE 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1493 QQVRNHELEKKQRRFDSELSQAHEE------TQREKL--QREKLQREKDMLLAEAFSLKQQMEEKDLD--IAGFTQKVVS 1562
Cdd:pfam10174  117 NFRRLQSEHERQAKELFLLRKTLEEmelrieTQKQTLgaRDESIKKLLEMLQSKGLPKKSGEEDWERTrrIAEAEMQLGH 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1563 LEAELQDISSQESKDEASLaKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVE 1642
Cdd:pfam10174  197 LEVLLDQKEKENIHLREEL-HRRNQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLEDEVQMLKTNGLLHTEDREEEIK 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1643 --EARQSCQKKLKQMEVQLEEEYEDKQKalrEKRELESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLK 1720
Cdd:pfam10174  276 qmEVYKSHSKFMKNKIDQLKQELSKKES---ELLALQTKLETLTNQ-NSDCKQHIEVLKESLTAKEQRAAILQTEVDALR 351
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1721 NNAPSKREIAQLKNQ----LEESEFTCAAA---------VKARKAMEVE--MEDLHLQIDDIAKAKTALEEQLSRLQREK 1785
Cdd:pfam10174  352 LRLEEKESFLNKKTKqlqdLTEEKSTLAGEirdlkdmldVKERKINVLQkkIENLQEQLRDKDKQLAGLKERVKSLQTDS 431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1786 NEIQNRL---EE---DQEDMNELMKKHKAAVAQASRDmaqmndlqaQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-- 1857
Cdd:pfam10174  432 SNTDTALttlEEalsEKERIIERLKEQREREDRERLE---------ELESLKKENKDLKEKVSALQPELTEKESSLIDlk 502
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1858 ---KSLVSR---QEAKIRELETRLEFEKTQVKRLENlasRLKETMEKlteERDQRAAAENREKEQNkrLQRqlrdtkeem 1931
Cdd:pfam10174  503 ehaSSLASSglkKDSKLKSLEIAVEQKKEECSKLEN---QLKKAHNA---EEAVRTNPEINDRIRL--LEQ--------- 565
                          570       580       590
                   ....*....|....*....|....*....|....*....
gi 1907082211 1932 sELARKEAEASRKKHELEMDLESL-EAANQSLQADLKLA 1969
Cdd:pfam10174  566 -EVARYKEESGKAQAEVERLLGILrEVENEKNDKDKKIA 603
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1511-1713 2.49e-13

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 74.42  E-value: 2.49e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1511 LSQAHEETQREKlQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRD 1590
Cdd:COG4942     16 AAQADAAAEAEA-ELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAE 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LEAKVKDQEEELDEQAGSIQML-EQAKLRLEM------EMERMRQTHSKEMESRDEEVEEARQScQKKLKQMEVQLEEEY 1663
Cdd:COG4942     95 LRAELEAQKEELAELLRALYRLgRQPPLALLLspedflDAVRRLQYLKYLAPARREQAEELRAD-LAELAALRAELEAER 173
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1664 EDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQ 1713
Cdd:COG4942    174 AELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAE 223
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1768-1972 2.82e-13

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 74.03  E-value: 2.82e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1768 AKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1847
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1848 VEFLEQSMVD--------------KSLVSRQEAK-----IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAA 1908
Cdd:COG4942     99 LEAQKEELAEllralyrlgrqpplALLLSPEDFLdavrrLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEA 178
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1909 AENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKR 1972
Cdd:COG4942    179 LLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1420-1958 5.72e-13

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 74.87  E-value: 5.72e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1420 ERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRL----GDLQADSDESQRALQQLKkkcQRLTAELQDTKLHLEGQQV 1495
Cdd:pfam12128  275 ASRQEERQETSAELNQLLRTLDDQWKEKRDELNGELsaadAAVAKDRSELEALEDQHG---AFLDADIETAAADQEQLPS 351
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1496 RNHELEKKQRRFDSeLSQAHEETQREKLQREKL--QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQ 1573
Cdd:pfam12128  352 WQSELENLEERLKA-LTGKHQDVTAKYNRRRSKikEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEA 430
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1574 eSKDEASLAKVKKQLRDLEAKVK-----DQEEELDEQAGSIQMLEQAKLRLEM---EMERMrQTHSKEMESRDEEVEEAR 1645
Cdd:pfam12128  431 -GKLEFNEEEYRLKSRLGELKLRlnqatATPELLLQLENFDERIERAREEQEAanaEVERL-QSELRQARKRRDQASEAL 508
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1646 QSCQKKLKQMEVQLEE-------------EYEDKQKALREK---RELESKLSTLSDQVNQRDFESEKR------LRKDLK 1703
Cdd:pfam12128  509 RQASRRLEERQSALDElelqlfpqagtllHFLRKEAPDWEQsigKVISPELLHRTDLDPEVWDGSVGGelnlygVKLDLK 588
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1704 RTkallaDAQIMLDHlknNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDiakAKTALEEQLSRLQR 1783
Cdd:pfam12128  589 RI-----DVPEWAAS---EEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETF---ARTALKNARLDLRR 657
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1784 EKNEIQNrleeDQEDMNELMKKHKAAVAQASRDMAQ-----MNDLQAQIEESNKEKQELQ-EKLQALQSQVEFLEQSM-- 1855
Cdd:pfam12128  658 LFDEKQS----EKDKKNKALAERKDSANERLNSLEAqlkqlDKKHQAWLEEQKEQKREARtEKQAYWQVVEGALDAQLal 733
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1856 VDKSLVSRQEAKIRELETRLEFEKTQVKRL---ENLASRLKETMEKLTE-----ERDQRAAAENRE------KEQNKRLQ 1921
Cdd:pfam12128  734 LKAAIAARRSGAKAELKALETWYKRDLASLgvdPDVIAKLKREIRTLERkieriAVRRQEVLRYFDwyqetwLQRRPRLA 813
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|.
gi 1907082211 1922 RQLRDTKEEMSE----LARKEAEASRKKHELEMDLESLEAA 1958
Cdd:pfam12128  814 TQLSNIERAISElqqqLARLIADTKLRRAKLEMERKASEKQ 854
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1620-1960 7.43e-13

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 74.39  E-value: 7.43e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1620 EMEMERMRQTH---SKEMESRD--EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTlsDQVNQRDFES 1694
Cdd:pfam17380  295 KMEQERLRQEKeekAREVERRRklEEAEKARQAEMDRQAAIYAEQERMAMERERELERIRQEERKREL--ERIRQEEIAM 372
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1695 EKRLRKDLKRtkalladaqIMLDHLKNNAPSKREI-AQLKNQLEESEFTcaaavKARKAMEVEMEDLHLQIDDiakaktA 1773
Cdd:pfam17380  373 EISRMRELER---------LQMERQQKNERVRQELeAARKVKILEEERQ-----RKIQQQKVEMEQIRAEQEE------A 432
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1774 LEEQLSRLQREKNEIQNRLEEDqedmnELMKKHkaavaqasrdmaQMNDLQAQIEESNKEKQELqEKLQALQSQVEFLEQ 1853
Cdd:pfam17380  433 RQREVRRLEEERAREMERVRLE-----EQERQQ------------QVERLRQQEEERKRKKLEL-EKEKRDRKRAEEQRR 494
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1854 SMVDKSLVSRQEAKIRELETRLEFEKtqvkrlenlasRLKETMEKLTEERDQRAAAENREKEQN----KRLQRQLRDTKE 1929
Cdd:pfam17380  495 KILEKELEERKQAMIEEERKRKLLEK-----------EMEERQKAIYEEERRREAEEERRKQQEmeerRRIQEQMRKATE 563
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1907082211 1930 EMSEL--ARKEAEASRKKHELEMDLESLEAANQ 1960
Cdd:pfam17380  564 ERSRLeaMEREREMMRQIVESEKARAEYEATTP 596
PRK01156 PRK01156
chromosome segregation protein; Provisional
1562-2036 1.22e-12

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 73.78  E-value: 1.22e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1562 SLEAELQDISSQESKDEAS---LAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMermrqthsKEMESRD 1638
Cdd:PRK01156   177 MLRAEISNIDYLEEKLKSSnleLENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSAL--------NELSSLE 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVeearqscqkklKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRkDLKRTKALLADAQIMLDH 1718
Cdd:PRK01156   249 DMK-----------NRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYIN-DYFKYKNDIENKKQILSN 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1719 LKNNAPSKREIAQLKNQLEE--SEFTcaaavkarkAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ 1796
Cdd:PRK01156   317 IDAEINKYHAIIKKLSVLQKdyNDYI---------KKKSRYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIE 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1797 EDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM-----------------VDKS 1859
Cdd:PRK01156   388 RMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMemlngqsvcpvcgttlgEEKS 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1860 L---------VSRQEAKIRELEtrlefekTQVKRLENLASRLKETMEKLTEERDQRAAAENRekeQNKRLQRQLRDTKEE 1930
Cdd:PRK01156   468 NhiinhynekKSRLEEKIREIE-------IEVKDIDEKIVDLKKRKEYLESEEINKSINEYN---KIESARADLEDIKIK 537
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1931 MSELARKEAEASRKKHELE-MDLESLEA------------------ANQSLQADLKLAFKRIGDLQAAIEDEMESDE--N 1989
Cdd:PRK01156   538 INELKDKHDKYEEIKNRYKsLKLEDLDSkrtswlnalavislidieTNRSRSNEIKKQLNDLESRLQEIEIGFPDDKsyI 617
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082211 1990 EDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKG 2036
Cdd:PRK01156   618 DKSIREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDS 664
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1292-1872 2.40e-12

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 72.84  E-value: 2.40e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGEsasqlLDAETAERLRTEKE-M 1370
Cdd:pfam15921  469 QLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVD-----LKLQELQHLKNEGDhL 543
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKKQMEVMEMEVMEAR--LIRAAEINGEvDDDDAGGewrLKYERAVREVDFTKKRLQ-QELEDKMEVEQQS 1447
Cdd:pfam15921  544 RNVQTECEALKLQMAEKDKVIEILRqqIENMTQLVGQ-HGRTAGA---MQVEKAQLEKEINDRRLElQEFKILKDKKDAK 619
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1448 RRQLERRLGDLQADS-------DESQRALQQLKKKCQRLTAELQDTKLHLEG---------QQVRN--HELEKKQRRFDS 1509
Cdd:pfam15921  620 IRELEARVSDLELEKvklvnagSERLRAVKDIKQERDQLLNEVKTSRNELNSlsedyevlkRNFRNksEEMETTTNKLKM 699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLE---------------------AELQ 1568
Cdd:pfam15921  700 QLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEeamtnankekhflkeeknklsQELS 779
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1569 DISSQESKDEASLAKVKKQLRDLEAKVKDQEEELD-------EQAGSIQMLEQAKLRLEM-------EMERMRQTHSKEM 1634
Cdd:pfam15921  780 TVATEKNKMAGELEVLRSQERRLKEKVANMEVALDkaslqfaECQDIIQRQEQESVRLKLqhtldvkELQGPGYTSNSSM 859
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1635 ESR-DEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALRE--KRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLAD 1711
Cdd:pfam15921  860 KPRlLQPASFTRTHSNVPSSQSTASFLSHHSRKTNALKEdpTRDLKQLLQELRSVINEEPTVQLSKAEDKGRAPSLGALD 939
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1712 AQIMLDHLKNNAPSK---REIAQLKNQLEESEFTCAAAVKARKAMevEMEDLHLQIDDIAKAKTALEEQLSR---LQREK 1785
Cdd:pfam15921  940 DRVRDCIIESSLRSDichSSSNSLQTEGSKSSETCSREPVLLHAG--ELEDPSSCFTFPSTASPSVKNSASRsfhSSPKK 1017
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1786 NEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQmnDLQAQ-IEESNKEKQELQEKLQALQSQVEFLE---QSMvdKSLV 1861
Cdd:pfam15921 1018 SPVHSLLTSSAEGSIGSSSQYRSAKTIHSPDSVK--DSQSLpIETTGKTCRKLQNRLESLQTLVEDLQlknQAM--SSMI 1093
                          650
                   ....*....|.
gi 1907082211 1862 SRQEAKIRELE 1872
Cdd:pfam15921 1094 RNQEKRIQKVK 1104
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1755-1916 3.79e-12

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 68.41  E-value: 3.79e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1755 VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ---EDMNELMKKHKAAVAQASRDMAQMNDLQAQIeESN 1831
Cdd:COG1579     10 LDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKtelEDLEKEIKRLELEIEEVEARIKKYEEQLGNV-RNN 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1832 KEkqelqekLQALQSQVEFLEQsmvdksLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1911
Cdd:COG1579     89 KE-------YEALQKEIESLKR------RISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELE 155

                   ....*
gi 1907082211 1912 REKEQ 1916
Cdd:COG1579    156 AELEE 160
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1289-1935 5.78e-12

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 71.79  E-value: 5.78e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1289 PLIQVQLSeeqirNKDEEIQQLRSKLEKVEKERNELRLSSD-RLETRISELTSELTDERntgesasqlldaETAERLRTE 1367
Cdd:pfam12128  350 PSWQSELE-----NLEERLKALTGKHQDVTAKYNRRRSKIKeQNNRDIAGIKDKLAKIR------------EARDRQLAV 412
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1368 KEmKELQTQYDALKKQMEVMEMEVMEARL---IRAAEINGEVDDDDAGGEWRLKYERAVREVDftkkRLQQELEDKMEve 1444
Cdd:pfam12128  413 AE-DDLQALESELREQLEAGKLEFNEEEYrlkSRLGELKLRLNQATATPELLLQLENFDERIE----RAREEQEAANA-- 485
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLgdLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQrrfdselSQAHEETQREKLQ 1524
Cdd:pfam12128  486 EVERLQSELRQ--ARKRRDQASEALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKE-------APDWEQSIGKVIS 556
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDMLLAEAFSLKQQ-------MEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD 1597
Cdd:pfam12128  557 PELLHRTDLDPEVWDGSVGGElnlygvkLDLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEK 636
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1598 QEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKR-EL 1676
Cdd:pfam12128  637 ASREETFARTALKNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEEQKEQKREARtEK 716
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1677 ESKL----STLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHL----KNNAPSKREIAQLKNQLEESEFTCAAAVK 1748
Cdd:pfam12128  717 QAYWqvveGALDAQLALLKAAIAARRSGAKAELKALETWYKRDLASLgvdpDVIAKLKREIRTLERKIERIAVRRQEVLR 796
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1749 ARKAME----VEMEDLHLQIDDIAKAKTALEEQLSRLQ------REKNEIQNRLEEDQED-MNELMKKHKAAVAQASRDM 1817
Cdd:pfam12128  797 YFDWYQetwlQRRPRLATQLSNIERAISELQQQLARLIadtklrRAKLEMERKASEKQQVrLSENLRGLRCEMSKLATLK 876
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1818 AQMNDLQAQ--IEESNKEKQELQEKLQ----ALQSQVEFLEQSMVDKS---------LVSRQEAKIRELETRLEFEKTQV 1882
Cdd:pfam12128  877 EDANSEQAQgsIGERLAQLEDLKLKRDylseSVKKYVEHFKNVIADHSgsglaetweSLREEDHYQNDKGIRLLDYRKLV 956
                          650       660       670       680       690       700
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1883 KRLENLASRLKETMEKLTEER------------DQRAAAENREKEQNKRLQRQLRDTK--EEMSELA 1935
Cdd:pfam12128  957 PYLEQWFDVRVPQSIMVLREQvsilgvdltefyDVLADFDRRIASFSRELQREVGEEAffEGVSESA 1023
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1298-1793 5.86e-12

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 71.61  E-value: 5.86e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntGESASQLLDAETAERLRTEKEMKELQTQy 1377
Cdd:PRK02224   251 EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLL--AEAGLDDADAEAVEARREELEDRDEELR- 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1378 DALKKQMevmemevmearlIRAAEINGEVddddaggewrlkyERAVREVDftkkrlqqELEDKMEVEQQSRRQLERRLGD 1457
Cdd:PRK02224   328 DRLEECR------------VAAQAHNEEA-------------ESLREDAD--------DLEERAEELREEAAELESELEE 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1458 LQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEkkqrrfdSELSQAHEetqREKLQREKLQREKDMlLA 1537
Cdd:PRK02224   375 AREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELR-------EERDELRE---REAELEATLRTARER-VE 443
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1538 EAFSLK---------QQMEEKDL--DIAGFTQKVVSLEAELQDISSQESKDEA------SLAKVKKQLRDLEAKVK---- 1596
Cdd:PRK02224   444 EAEALLeagkcpecgQPVEGSPHveTIEEDRERVEELEAELEDLEEEVEEVEErleraeDLVEAEDRIERLEERREdlee 523
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1597 ---DQEEELDEQAGSIQMLEQAKLRLEMEMERMRQThSKEMESRDEEVEEARQSCQKKLKQMEVQLE------------E 1661
Cdd:PRK02224   524 liaERRETIEEKRERAEELRERAAELEAEAEEKREA-AAEAEEEAEEAREEVAELNSKLAELKERIEslerirtllaaiA 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1662 EYEDKQKALREKRElesKLSTLSDQvnQRDFESEKRLRKDlkrtkalladaqimldhlknnapskreiaQLKNQLEESEF 1741
Cdd:PRK02224   603 DAEDEIERLREKRE---ALAELNDE--RRERLAEKRERKR-----------------------------ELEAEFDEARI 648
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1742 TCAAAVKARKA-----MEVEMEDLHLQIDDIAKAKTALE---EQLSRLQREKNEIQNRLE 1793
Cdd:PRK02224   649 EEAREDKERAEeyleqVEEKLDELREERDDLQAEIGAVEnelEELEELRERREALENRVE 708
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1771-1979 6.65e-12

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 69.86  E-value: 6.65e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1771 KTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL--QALQSQV 1848
Cdd:COG3883     18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELgeRARALYR 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1849 EFLEQSMVDKSLVSRQeakIRELETRLEFektqvkrLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTK 1928
Cdd:COG3883     98 SGGSVSYLDVLLGSES---FSDFLDRLSA-------LSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELE 167
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1929 EEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAA 1979
Cdd:COG3883    168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAA 218
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1298-1914 6.76e-12

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 71.62  E-value: 6.76e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEE-IQQLRSKLEKVEKERNELRLSSD--RLETRISELTSELTDERNTGESASQLLDaETAErlrTEKEmkelQ 1374
Cdd:TIGR01612 1135 EEIKKKSENyIDEIKAQINDLEDVADKAISNDDpeEIEKKIENIVTKIDKKKNIYDEIKKLLN-EIAE---IEKD----K 1206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1375 TQYDALKKQMEVMEMEVMEARLiraaeinGEVDDDDAGGEWRLK-YERAVREVDFTKKRlQQELEDKMEVEQQSRRQLEr 1453
Cdd:TIGR01612 1207 TSLEEVKGINLSYGKNLGKLFL-------EKIDEEKKKSEHMIKaMEAYIEDLDEIKEK-SPEIENEMGIEMDIKAEME- 1277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1454 rLGDLQADSDESQRALQQLKKKcqrltaELQDtklhlegqqVRNHELEKKQRRF-DSELSQAHEETQREKLQREKLQREK 1532
Cdd:TIGR01612 1278 -TFNISHDDDKDHHIISKKHDE------NISD---------IREKSLKIIEDFSeESDINDIKKELQKNLLDAQKHNSDI 1341
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1533 DMLLAEAFSLKQQMEEKDL-----DIAGFTQKVvslEAELQDISSQESKDEASLAKVKKQLRDLEAKVKdQEEELDEQ-- 1605
Cdd:TIGR01612 1342 NLYLNEIANIYNILKLNKIkkiidEVKEYTKEI---EENNKNIKDELDKSEKLIKKIKDDINLEECKSK-IESTLDDKdi 1417
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1606 AGSIQMLEQAKLRLEMEmERMRQTHSKEMESRDEEV-------EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELES 1678
Cdd:TIGR01612 1418 DECIKKIKELKNHILSE-ESNIDTYFKNADENNENVlllfkniEMADNKSQHILKIKKDNATNDHDFNINELKEHIDKSK 1496
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1679 KLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDhLKNN-APSKREIAQLKNQLEE--SEFTCAAAVKARKAMEV 1755
Cdd:TIGR01612 1497 GCKDEADK-NAKAIEKNKELFEQYKKDVTELLNKYSALA-IKNKfAKTKKDSEIIIKEIKDahKKFILEAEKSEQKIKEI 1574
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1756 EMEDLHLQiDDIAKA----KTALEEQLSrlqrekneIQNrLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESn 1831
Cdd:TIGR01612 1575 KKEKFRIE-DDAAKNdksnKAAIDIQLS--------LEN-FENKFLKISDIKKKINDCLKETESIEKKISSFSIDSQDT- 1643
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1832 kekqELQEKLQALQSQVEFLEqSMVDkslvsrQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1911
Cdd:TIGR01612 1644 ----ELKENGDNLNSLQEFLE-SLKD------QKKNIEDKKKELDELDSEIEKIEIDVDQHKKNYEIGIIEKIKEIAIAN 1712

                   ...
gi 1907082211 1912 REK 1914
Cdd:TIGR01612 1713 KEE 1715
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1445-1669 8.27e-12

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 69.41  E-value: 8.27e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAheETQREKLQ 1524
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAEL--EKEIAELR 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDM--LLAEAFSLKQQMEEKDL-DIAGFTQKVVSLEAeLQDISSQeskDEASLAKVKKQLRDLEAKVKDQEEE 1601
Cdd:COG4942     97 AELEAQKEELaeLLRALYRLGRQPPLALLlSPEDFLDAVRRLQY-LKYLAPA---RREQAEELRADLAELAALRAELEAE 172
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1602 LDEQAGSIQMLEQAKLRLEMEMERMRQT---HSKEMESRDEEVEEARQScQKKLKQMEVQLEEEYEDKQKA 1669
Cdd:COG4942    173 RAELEALLAELEEERAALEALKAERQKLlarLEKELAELAAELAELQQE-AEELEALIARLEAEAAAAAER 242
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1441-2028 8.89e-12

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 71.02  E-value: 8.89e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1441 MEVEQQSRRQLERRLGDLQADsDESQRALQQLKKkcQRLTAELQDTKlHLEGQQVRNHELEKKQRRFDS------ELSQA 1514
Cdd:pfam12128  188 MHSKEGKFRDVKSMIVAILED-DGVVPPKSRLNR--QQVEHWIRDIQ-AIAGIMKIRPEFTKLQQEFNTlesaelRLSHL 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1515 HEETQREKLQREKLQREKDMLLAEafslkqqmeekdldiagFTQKVVSLEAELQdissqESKDEASLakvkkQLRDLEAK 1594
Cdd:pfam12128  264 HFGYKSDETLIASRQEERQETSAE-----------------LNQLLRTLDDQWK-----EKRDELNG-----ELSAADAA 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1595 VKDQEEEL---DEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEE-------YE 1664
Cdd:pfam12128  317 VAKDRSELealEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDVTAKYNRRRSKIKEQnnrdiagIK 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 DKQKALREKRE------------LESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDhlknnapskreiaqL 1732
Cdd:pfam12128  397 DKLAKIREARDrqlavaeddlqaLESELREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPE--------------L 462
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KNQLEESEFTCAAAVKARKAMEVEMEDLHlqiDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQ 1812
Cdd:pfam12128  463 LLQLENFDERIERAREEQEAANAEVERLQ---SELRQARKRRDQASEALRQASRRLEERQSALDELELQLFPQAGTLLHF 539
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1813 ASRDMAQMNDLQAQIEESNK-EKQELQEKLQALQSQVEF--------LEQSMVDKSLVSRQEAKIR--ELETRLEFEKTQ 1881
Cdd:pfam12128  540 LRKEAPDWEQSIGKVISPELlHRTDLDPEVWDGSVGGELnlygvkldLKRIDVPEWAASEEELRERldKAEEALQSAREK 619
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1882 VKRLENLASRLKETMEKLT-EERDQRAAAENREKEQnKRL---QRQLRDTKEEMSELARKEAEASRKK--HELEMDLESL 1955
Cdd:pfam12128  620 QAAAEEQLVQANGELEKASrEETFARTALKNARLDL-RRLfdeKQSEKDKKNKALAERKDSANERLNSleAQLKQLDKKH 698
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1956 EAANQSLQADLKLAfkRIGDLQAAIEDEMESDENEDLINslQDMVTKYQKKKNKLEG-DSDVDSELEDR-VDGVK 2028
Cdd:pfam12128  699 QAWLEEQKEQKREA--RTEKQAYWQVVEGALDAQLALLK--AAIAARRSGAKAELKAlETWYKRDLASLgVDPDV 769
46 PHA02562
endonuclease subunit; Provisional
1674-1937 9.63e-12

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 70.43  E-value: 9.63e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1674 RELESKLSTLSDQVN--------QRDFESEKRlrkdlKRTKALLADAQIMLDHLKNNAPS-KREIAQLKNQLEEseftca 1744
Cdd:PHA02562   177 RELNQQIQTLDMKIDhiqqqiktYNKNIEEQR-----KKNGENIARKQNKYDELVEEAKTiKAEIEELTDELLN------ 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1745 aavkarkaMEVEMEDLHLQIDDIAKAKTALEEQLSRLQR-----EKNEIQNRLEEDQEDMNELMKKHKAavaqasrdmaQ 1819
Cdd:PHA02562   246 --------LVMDIEDPSAALNKLNTAAAKIKSKIEQFQKvikmyEKGGVCPTCTQQISEGPDRITKIKD----------K 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1820 MNDLQAQIEESNKEKQELQEKLqalqsqVEFLEQSMvdkslvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKL 1899
Cdd:PHA02562   308 LKELQHSLEKLDTAIDELEEIM------DEFNEQSK-----------KLLELKNKISTNKQSLITLVDKAKKVKAAIEEL 370
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1907082211 1900 TEERDQRaaaenreKEQNKRLQRQLRDTKEEMSELARK 1937
Cdd:PHA02562   371 QAEFVDN-------AEELAKLQDELDKIVKTKSELVKE 401
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1305-1622 1.04e-11

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 70.87  E-value: 1.04e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTD-ERNTGESASQLLDAEtaERLRTEKE-MKELQTQYDAL-- 1380
Cdd:TIGR02169  674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDaSRKIGEIEKEIEQLE--QEEEKLKErLEELEEDLSSLeq 751
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1381 KKQMEVMEMEVMEARL---------IRAAEinGEVDDDDAGGEWRlKYERAVREVDFTKKRLQ----------QELEDKM 1441
Cdd:TIGR02169  752 EIENVKSELKELEARIeeleedlhkLEEAL--NDLEARLSHSRIP-EIQAELSKLEEEVSRIEarlreieqklNRLTLEK 828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1442 EVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQRE 1521
Cdd:TIGR02169  829 EYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEEL 908
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1522 KLQREKLQREKDMLLAEAFSLKQQMEE-------------KDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQL 1588
Cdd:TIGR02169  909 EAQIEKKRKRLSELKAKLEALEEELSEiedpkgedeeipeEELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRL 988
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1907082211 1589 RDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEME 1622
Cdd:TIGR02169  989 DELKEKRAKLEEERKAILERIEEYEKKKREVFME 1022
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1559-1965 1.05e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 70.54  E-value: 1.05e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1559 KVVSLEAELQDISSQESKDEASLakvKKQLRDLEAKVKDQEEELDEQAgsiqmleQAKLRLEMEMERMRQTHSKEMESRD 1638
Cdd:pfam05557    3 ELIESKARLSQLQNEKKQMELEH---KRARIELEKKASALKRQLDRES-------DRNQELQKRIRLLEKREAEAEEALR 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQsCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFEsekrlrkdLKRTKALLADAQIMLDH 1718
Cdd:pfam05557   73 EQAELNRL-KKKYLEALNKKLNEKESQLADAREVISCLKNELSELRRQIQRAELE--------LQSTNSELEELQERLDL 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1719 LKNNApskREIAQLKNQLEESEFTCAAAVKARKAMEVEMEdlhLQIDDIAKAKTALEEQLS--------RLQREKNEIQN 1790
Cdd:pfam05557  144 LKAKA---SEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQ---SQEQDSEIVKNSKSELARipelekelERLREHNKHLN 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1791 RLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ----------ALQSQVEFLEQSmvDKSL 1860
Cdd:pfam05557  218 ENIENKLLLKEEVEDLKRKLEREEKYREEAATLELEKEKLEQELQSWVKLAQdtglnlrspeDLSRRIEQLQQR--EIVL 295
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1861 VSRQ-----------------EAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERD-QRAAAENREKEQN-KRLQ 1921
Cdd:pfam05557  296 KEENssltssarqlekarrelEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDgYRAILESYDKELTmSNYS 375
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1922 RQLRDTKEEMSELARK-EAEASRKKHELEMDLESLEAANQSLQAD 1965
Cdd:pfam05557  376 PQLLERIEEAEDMTQKmQAHNEEMEAQLSVAEEELGGYKQQAQTL 420
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1295-1843 1.14e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 70.15  E-value: 1.14e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1295 LSEEQIRNKDEEIQQLRSKLEKVEKERNElrlsSDRLETRISELTSELTDErnTGESASQLLDAETAERlRTEKEMKELQ 1374
Cdd:pfam05557   66 EAEEALREQAELNRLKKKYLEALNKKLNE----KESQLADAREVISCLKNE--LSELRRQIQRAELELQ-STNSELEELQ 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1375 TQYDALKKqmevmemevmearliRAAEINGEVDDDDAGGEWRLKYERAVREVDFtkkRLQQELEDKMEVEQQSRRQLerR 1454
Cdd:pfam05557  139 ERLDLLKA---------------KASEAEQLRQNLEKQQSSLAEAEQRIKELEF---EIQSQEQDSEIVKNSKSELA--R 198
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1455 LGDLQadsdesqRALQQLKKKCQRLTaELQDTKLHLEGQQvrnHELEKKQRRfdselsqahEETQREKLQreKLQREKDM 1534
Cdd:pfam05557  199 IPELE-------KELERLREHNKHLN-ENIENKLLLKEEV---EDLKRKLER---------EEKYREEAA--TLELEKEK 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1535 LLAEAFSLKQQMEEKDLDIA---GFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK-------DQEEELDE 1604
Cdd:pfam05557  257 LEQELQSWVKLAQDTGLNLRspeDLSRRIEQLQQREIVLKEENSSLTSSARQLEKARRELEQELAqylkkieDLNKKLKR 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1605 QAGSIQMLEQAKLRLEMEMERMRQ------------THSKEMESRDEEVEEARQSCQKKLKQMEVQLEeeyedkqKALRE 1672
Cdd:pfam05557  337 HKALVRRLQRRVLLLTKERDGYRAilesydkeltmsNYSPQLLERIEEAEDMTQKMQAHNEEMEAQLS-------VAEEE 409
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1673 KRELESKLSTLSDQVnqrdfesekrlrkDLKRTKALLADAqimldhlknnAPSKREIAQLKNQLEESEFTCAAAVKARKA 1752
Cdd:pfam05557  410 LGGYKQQAQTLEREL-------------QALRQQESLADP----------SYSKEEVDSLRRKLETLELERQRLREQKNE 466
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1753 MEVEMED--------------LHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRdma 1818
Cdd:pfam05557  467 LEMELERrclqgdydpkktkvLHLSMNPAAEAYQQRKNQLEKLQAEIERLKRLLKKLEDDLEQVLRLPETTSTMNFK--- 543
                          570       580
                   ....*....|....*....|....*
gi 1907082211 1819 QMNDLQAQIEESNKEKQELQEKLQA 1843
Cdd:pfam05557  544 EVLDLRKELESAELKNQRLKEVFQA 568
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1490-1876 1.20e-11

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 69.92  E-value: 1.20e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1490 LEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREkdmllaeafsLKQQMEEKDLDIAGFTQKVVSLEAELQD 1569
Cdd:pfam07888   36 LEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRE----------LESRVAELKEELRQSREKHEELEEKYKE 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1570 ISsqeskdeASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQScQ 1649
Cdd:pfam07888  106 LS-------ASSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQL-Q 177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1650 KKLKQMEVQLeeeyedkqkalrekRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQimlDHLKNNAPSKREI 1729
Cdd:pfam07888  178 AKLQQTEEEL--------------RSLSKEFQELRNSLAQRD-TQVLQLQDTITTLTQKLTTAH---RKEAENEALLEEL 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1730 AQLKNQLEESEFTCAA-------AVKARKAMEVEMEDLHLQIDD----IAKAKTALEEQLSRLQREKNEIQNRLEEDQED 1798
Cdd:pfam07888  240 RSLQERLNASERKVEGlgeelssMAAQRDRTQAELHQARLQAAQltlqLADASLALREGRARWAQERETLQQSAEADKDR 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1799 M----NELMKKHKAAVAQASR------DMAQMNDL-QAQIEESNKEKQELQEKLQALQSQVEFLeqsmvdksLVSRQE-- 1865
Cdd:pfam07888  320 IeklsAELQRLEERLQEERMEreklevELGREKDCnRVQLSESRRELQELKASLRVAQKEKEQL--------QAEKQEll 391
                          410
                   ....*....|.
gi 1907082211 1866 AKIRELETRLE 1876
Cdd:pfam07888  392 EYIRQLEQRLE 402
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1305-1522 1.40e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 69.02  E-value: 1.40e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQm 1384
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE- 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1385 eVMEMEVMEARLIRAAEINGEVD--------DDDAGGEWRLKYERAVREVDFTK----KRLQQELEDKMEVEQQSRRQLE 1452
Cdd:COG4942     99 -LEAQKEELAELLRALYRLGRQPplalllspEDFLDAVRRLQYLKYLAPARREQaeelRADLAELAALRAELEAERAELE 177
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1453 RRLGDLQadsdESQRALQQLKKKCQRLTAELQDTKlhlEGQQVRNHELEKKQRRFDSELSQAHEETQREK 1522
Cdd:COG4942    178 ALLAELE----EERAALEALKAERQKLLARLEKEL---AELAAELAELQQEAEELEALIARLEAEAAAAA 240
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1294-1735 1.46e-11

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 70.53  E-value: 1.46e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1294 QLSEEQiRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETR-------ISELTSELTDERNTGESASQLLDAETAE-RLR 1365
Cdd:pfam15921  367 QFSQES-GNLDDQLQKLLADLHKREKELSLEKEQNKRLWDRdtgnsitIDHLRRELDDRNMEVQRLEALLKAMKSEcQGQ 445
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1366 TEKEMKELQTQYDALKKQMEVMEMEVMEARLIRAAeingeVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ 1445
Cdd:pfam15921  446 MERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKV-----VEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEIT 520
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1446 QSRRQLERRLGDLQADSDESQRaLQQLKKKCQRLTAEL--QDTKLHLEGQQVRNHelekkqrrfdSELSQAHEET----Q 1519
Cdd:pfam15921  521 KLRSRVDLKLQELQHLKNEGDH-LRNVQTECEALKLQMaeKDKVIEILRQQIENM----------TQLVGQHGRTagamQ 589
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1520 REKLQREKLQREKDMLLAEAFSLKqqmEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQE 1599
Cdd:pfam15921  590 VEKAQLEKEINDRRLELQEFKILK---DKKDAKIRELEARVSDLELEKVKLVNAGSERLRAVKDIKQERDQLLNEVKTSR 666
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1600 EELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQS-------------------------------C 1648
Cdd:pfam15921  667 NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTlksmegsdghamkvamgmqkqitakrgqidaL 746
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTKALLADAQIMLDHLKNNAPSKRE 1728
Cdd:pfam15921  747 QSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEV-LRSQERRLKEKVANMEVALDKASLQFAECQD 825

                   ....*..
gi 1907082211 1729 IAQLKNQ 1735
Cdd:pfam15921  826 IIQRQEQ 832
PDZ_SHANK1_3-like cd06746
PDZ domain of SH3 and multiple ankyrin repeat domains protein 1 (SHANK1), SHANK2, SHANK3, and ...
223-307 1.47e-11

PDZ domain of SH3 and multiple ankyrin repeat domains protein 1 (SHANK1), SHANK2, SHANK3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SHANK1, SHANK2, SHANK3, and related domains. SHANK family proteins, SHANK1 (also known as somatostatin receptor-interacting protein, SSTR-interacting protein, SSTRIP), SHANK2 (also known as cortactin-binding protein 1, proline-rich synapse-associated protein 1), and SHANK3 (proline-rich synapse-associated protein 2) are synaptic scaffolding proteins which are highly enriched in the post-synaptic densities of excitatory synapses. They have been implicated in synaptic transmission, synapse formation, synaptic plasticity, and cytoskeletal remodeling, and are regulators of Cav1 calcium current and CREB target expression. Many protein ligands have been identified for the Shank PDZ domain, such as GKAP (also known as SAPAP), betaPIX (a guanine nucleotide exchange factor used by Rho GTPase family members Rac1 and Cdc42), alpha-latrotoxin, neuroligin, group I metabotropic glutamate receptors (mGluRs), and L-type calcium channels. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SHANK-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta- strand F.


Pssm-ID: 467228 [Multi-domain]  Cd Length: 101  Bit Score: 63.00  E-value: 1.47e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  223 LQRRPTGdFGFSLRRT---------TMLDRAPEGQAYRRVvhfaEPGaGTKDLAlGLVPGDRLVEINGQNVENKSRDEIV 293
Cdd:cd06746     11 LQKGDKG-FGFVLRGAkavgpilefTPTPAFPALQYLESV----DPG-GVADKA-GLKKGDFLLEINGEDVVKASHEQVV 83
                           90
                   ....*....|....
gi 1907082211  294 EMIRQSGDSVRLKV 307
Cdd:cd06746     84 NLIRQSGNTLVLKV 97
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1636-2003 2.00e-11

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 69.82  E-value: 2.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1636 SRDEEVEEARqscqkklkqmEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQrdfesekrLRKDLKRTKALLADAQIM 1715
Cdd:pfam01576    1 TRQEEEMQAK----------EEELQKVKERQQKAESELKELEKKHQQLCEEKNA--------LQEQLQAETELCAEAEEM 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1716 LDHLKNNAPSKREIAQ-LKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTAL-------------------- 1774
Cdd:pfam01576   63 RARLAARKQELEEILHeLESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAARQKLqlekvtteakikkleedill 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1775 -EEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQE--------------LQE 1839
Cdd:pfam01576  143 lEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQElekakrklegestdLQE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1840 KLQALQSQVEFLEQSMvdkslvSRQEAKIRELETRLEFEKTQ----VKRLENLASRLKETMEKLTEERDQRAAAEnreke 1915
Cdd:pfam01576  223 QIAELQAQIAELRAQL------AKKEEELQAALARLEEETAQknnaLKKIRELEAQISELQEDLESERAARNKAE----- 291
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1916 qnkrlqRQLRDTKEEMSELarkeaeasrkKHELEMDLESlEAANQSLQADLKlafKRIGDLQAAIEDEMESDENEdlins 1995
Cdd:pfam01576  292 ------KQRRDLGEELEAL----------KTELEDTLDT-TAAQQELRSKRE---QEVTELKKALEEETRSHEAQ----- 346

                   ....*...
gi 1907082211 1996 LQDMVTKY 2003
Cdd:pfam01576  347 LQEMRQKH 354
PDZ smart00228
Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF ...
220-309 2.24e-11

Domain present in PSD-95, Dlg, and ZO-1/2; Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.


Pssm-ID: 214570 [Multi-domain]  Cd Length: 85  Bit Score: 62.01  E-value: 2.24e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211   220 ELELQRRPTGdFGFSLRRttmlDRAPEGQAYrrVVHFAEPGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:smart00228    4 LVELEKGGGG-LGFSLVG----GKDEGGGVV--VSSVVPGSPAAKA---GLRVGDVILEVNGTSVEGLTHLEAVDLLKKA 73
                            90
                    ....*....|
gi 1907082211   300 GDSVRLKVQP 309
Cdd:smart00228   74 GGKVTLTVLR 83
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1279-1710 2.28e-11

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 69.71  E-value: 2.28e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1279 PWWKLFTTVRPLiQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgESASQLLDA 1358
Cdd:PRK03918   359 ERHELYEEAKAK-KEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK---KAIEELKKA 434
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1359 E----TAERLRTEKEMKELQTQYDA-LKKQMEVMEMEVMEARLIRAAEIngEVDDDDAGGEWRLKYERAVREVDFTKKRL 1433
Cdd:PRK03918   435 KgkcpVCGRELTEEHRKELLEEYTAeLKRIEKELKEIEEKERKLRKELR--ELEKVLKKESELIKLKELAEQLKELEEKL 512
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1434 QQELEDKMEVEQQSRRQLERRLGDLQADS---DESQRALQQLKKKCQRLTAELQDTKLHLEgqqvrnhELEKKQRRFDSE 1510
Cdd:PRK03918   513 KKYNLEELEKKAEEYEKLKEKLIKLKGEIkslKKELEKLEELKKKLAELEKKLDELEEELA-------ELLKELEELGFE 585
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1511 LSQAHEETQREklqREKLQREkdmlLAEAFSLKQQMEEKDldiagftQKVVSLEAELQDISSQESKDEASLAKVKKQLRD 1590
Cdd:PRK03918   586 SVEELEERLKE---LEPFYNE----YLELKDAEKELEREE-------KELKKLEEELDKAFEELAETEKRLEELRKELEE 651
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LEAKVKDQEEEldeqagsiqMLEQAKLRLEMEMERMRqthskemeSRDEEVEEARQSCQKKLKqmevQLEEEYEDKQKAL 1670
Cdd:PRK03918   652 LEKKYSEEEYE---------ELREEYLELSRELAGLR--------AELEELEKRREEIKKTLE----KLKEELEEREKAK 710
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1907082211 1671 REKRELESKLSTLsdqvnqrdfeseKRLRKDLKRTKALLA 1710
Cdd:PRK03918   711 KELEKLEKALERV------------EELREKVKKYKALLK 738
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1757-2011 3.00e-11

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 66.86  E-value: 3.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1757 MEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQE 1836
Cdd:COG1340      3 TDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVKELKEERDE 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1837 LQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFE-KTQVKRLEN---LASRLKETMEKLteerdQRAAAENR 1912
Cdd:COG1340     83 LNEKLNELREELDELRKELAELNKAGGSIDKLRKEIERLEWRqQTEVLSPEEekeLVEKIKELEKEL-----EKAKKALE 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1913 EKEQNKRLQRQLRDTKEEMSELARKEAEASRK--KHELEMDlESLEAANQsLQADLKLAFKRIGDLQAAIEDEMES-DEN 1989
Cdd:COG1340    158 KNEKLKELRAELKELRKEAEEIHKKIKELAEEaqELHEEMI-ELYKEADE-LRKEADELHKEIVEAQEKADELHEEiIEL 235
                          250       260
                   ....*....|....*....|..
gi 1907082211 1990 EDLINSLQDMVTKYQKKKNKLE 2011
Cdd:COG1340    236 QKELRELRKELKKLRKKQRALK 257
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
1417-2011 5.65e-11

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 68.53  E-value: 5.65e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1417 LKYERAVREVDFTKKRLQQ---ELEDKME-VEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQdtklhLEG 1492
Cdd:TIGR00606  265 MKLDNEIKALKSRKKQMEKdnsELELKMEkVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERR-----LLN 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1493 QQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISS 1572
Cdd:TIGR00606  340 QEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLCADLQS 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1573 QESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKL 1652
Cdd:TIGR00606  420 KERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILELDQELRKAERELSKAEKNSLTETL 499
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1653 KQMEVQLEEeyedkqkalrEKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMldhlKNNAPSKREIAQL 1732
Cdd:TIGR00606  500 KKEVKSLQN----------EKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIR----KIKSRHSDELTSL 565
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 ------KNQLEEsefTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRL------EEDQEDMN 1800
Cdd:TIGR00606  566 lgyfpnKKQLED---WLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcgsQDEESDLE 642
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1801 ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ-ALQSQVEFLEqsmvdksLVSRQEAKIRELETRLEFEK 1879
Cdd:TIGR00606  643 RLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQrVFQTEAELQE-------FISDLQSKLRLAPDKLKSTE 715
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 TQVKRLEnlasRLKETMEKLTEERDQRAAAENRE-KEQNKRLQRQLRDTKEEMSELARKEA--EASRKKHELEMDLESLE 1956
Cdd:TIGR00606  716 SELKKKE----KRRDEMLGLAPGRQSIIDLKEKEiPELRNKLQKVNRDIQRLKNDIEEQETllGTIMPEEESAKVCLTDV 791
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1957 AANQSLQADLKlafkrigDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLE 2011
Cdd:TIGR00606  792 TIMERFQMELK-------DVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELD 839
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1612-2019 6.74e-11

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 67.84  E-value: 6.74e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1612 LEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELesklstlsdqvNQRD 1691
Cdd:pfam05557   14 LQNEKKQMELEHKRARIELEKKASALKRQLDRESDRNQELQKRIR-LLEKREAEAEEALREQAEL-----------NRLK 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1692 FESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAK 1771
Cdd:pfam05557   82 KKYLEALNKKLNEKESQLADAREVISCLKN------ELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLR 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1772 TALEEQLSRLQREKNEIQNrLE---EDQEDMNELMKKHKAAVAQasrdMAQMNDLQAQIEESNKEKQELQEKLQALQSQV 1848
Cdd:pfam05557  156 QNLEKQQSSLAEAEQRIKE-LEfeiQSQEQDSEIVKNSKSELAR----IPELEKELERLREHNKHLNENIENKLLLKEEV 230
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1849 EFLEQSMvdkslvSRQE---AKIRELETRLEFEKTQVKRLENLASRLKETM---EKLTEERDQRAAAENREKEQNKRLQR 1922
Cdd:pfam05557  231 EDLKRKL------EREEkyrEEAATLELEKEKLEQELQSWVKLAQDTGLNLrspEDLSRRIEQLQQREIVLKEENSSLTS 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1923 QLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIE---DEME----SDENEDLINS 1995
Cdd:pfam05557  305 SARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRLQRRVLLLTKERDGYRAILEsydKELTmsnySPQLLERIEE 384
                          410       420
                   ....*....|....*....|....
gi 1907082211 1996 LQDMVTKYQKKKNKLEGDSDVDSE 2019
Cdd:pfam05557  385 AEDMTQKMQAHNEEMEAQLSVAEE 408
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1542-2011 9.66e-11

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 67.69  E-value: 9.66e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1542 LKQQMEEKDLDIAGFTqkvvslEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQ--EEELDEQAGSIQMLEQAKLRL 1619
Cdd:TIGR00618  158 LKAKSKEKKELLMNLF------PLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLctPCMPDTYHERKQVLEKELKHL 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1620 EMEMERMRQTHSKEmeSRDEEVEEARQSCQKKLKQMEVQLEE------EYEDKQKALREKRELEsKLSTLSDQVNQRDFE 1693
Cdd:TIGR00618  232 REALQQTQQSHAYL--TQKREAQEEQLKKQQLLKQLRARIEElraqeaVLEETQERINRARKAA-PLAAHIKAVTQIEQQ 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1694 SEkRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQL--EESEFTCAAAVKA--RKAMEVEMEDLHlQIDDIAK 1769
Cdd:TIGR00618  309 AQ-RIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLhsQEIHIRDAHEVATsiREISCQQHTLTQ-HIHTLQQ 386
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1770 AKTALEEQ-------LSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ 1842
Cdd:TIGR00618  387 QKTTLTQKlqslckeLDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQ 466
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1843 ALQSqvefLEQSMVDKSLVSRQEAKIRELE-TRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQnKRLQ 1921
Cdd:TIGR00618  467 SLKE----REQQLQTKEQIHLQETRKKAVVlARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTY-AQLE 541
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1922 RQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIedEMESDENEDLINSLQDMVT 2001
Cdd:TIGR00618  542 TSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLT--EKLSEAEDMLACEQHALLR 619
                          490
                   ....*....|
gi 1907082211 2002 KYQKKKNKLE 2011
Cdd:TIGR00618  620 KLQPEQDLQD 629
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1509-1897 1.19e-10

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 66.78  E-value: 1.19e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1509 SELSQAHEETQREKLQREKLQRE--KDmLLAEAFS-------LKQQMEEKDLDIAGFTQKVVS---LEAE--LQDISSQE 1574
Cdd:PRK04778   129 QELLESEEKNREEVEQLKDLYRElrKS-LLANRFSfgpaldeLEKQLENLEEEFSQFVELTESgdyVEAReiLDQLEEEL 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1575 SKDEASLAKVKKQLRDLEAKVKDQEEELdeQAGSIQMLEQ----AKLRLEMEMERMRQTHSK---EMESRD-EEVEEARQ 1646
Cdd:PRK04778   208 AALEQIMEEIPELLKELQTELPDQLQEL--KAGYRELVEEgyhlDHLDIEKEIQDLKEQIDEnlaLLEELDlDEAEEKNE 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1647 SCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEkRLR-------KDLKRTKALLADAQIM---- 1715
Cdd:PRK04778   286 EIQERIDQLYDILEREVKARKYVEKNSDTLPDFLEHAKEQNKELKEEID-RVKqsytlneSELESVRQLEKQLESLekqy 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1716 LDHLKNNAPSKREIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLE-- 1793
Cdd:PRK04778   365 DEITERIAEQEIAYSELQEELEEIL-------KQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEks 437
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1794 ------EDQEDMNELMKKH-KAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM----------- 1855
Cdd:PRK04778   438 nlpglpEDYLEMFFEVSDEiEALAEELEEKPINMEAVNRLLEEATEDVETLEEETEELVENATLTEQLIqyanryrsdne 517
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1856 -VDKSLVS--------RQEAKIRELETRLE-FEKTQVKRLENLASRLKETME 1897
Cdd:PRK04778   518 eVAEALNEaerlfreyDYKAALEIIATALEkVEPGVTKRIEDSYEKEKETIR 569
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1639-1864 1.42e-10

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 65.56  E-value: 1.42e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrLRKDLKRTKALLADAQIMLDH 1718
Cdd:COG4942     23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAA-LEAELAELEKEIAELRAELEA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1719 LKNN-APSKREIAQLKNQ------LEESEFtcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQ-- 1789
Cdd:COG4942    102 QKEElAELLRALYRLGRQpplallLSPEDF--LDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEal 179
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1790 -NRLEEDQEDMNELMKKHKAAVAQASRDMAQmndLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQ 1864
Cdd:COG4942    180 lAELEEERAALEALKAERQKLLARLEKELAE---LAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
46 PHA02562
endonuclease subunit; Provisional
1763-2029 1.67e-10

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 66.19  E-value: 1.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1763 QIDDIAKAKT-ALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAqasrdmaqmnDLQAQIEESNKEKQELQEKL 1841
Cdd:PHA02562   167 EMDKLNKDKIrELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIA----------RKQNKYDELVEEAKTIKAEI 236
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1842 QALQSQVEFLEQSMVDkslVSRQEAKIRELETRLefeKTQVKRLENLASRLKE------TMEKLTEERDQRAAAENREKE 1915
Cdd:PHA02562   237 EELTDELLNLVMDIED---PSAALNKLNTAAAKI---KSKIEQFQKVIKMYEKggvcptCTQQISEGPDRITKIKDKLKE 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1916 QNKRLqRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDemESDENEDLINS 1995
Cdd:PHA02562   311 LQHSL-EKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVD--NAEELAKLQDE 387
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1907082211 1996 LQDMVTKYQKKKNKLEGDSDVDSELEDrvDGVKS 2029
Cdd:PHA02562   388 LDKIVKTKSELVKEKYHRGIVTDLLKD--SGIKA 419
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1292-1570 2.00e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 66.63  E-value: 2.00e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAE-----TAERLRT 1366
Cdd:TIGR02169  724 EIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSripeiQAELSKL 803
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1367 EKEMKELQTQYDALKKQMEVMEMEVMEARliraAEINGEVDDDDaggEWRLKYERAVREVDFTKKRLQqELEDKMEVEQQ 1446
Cdd:TIGR02169  804 EEEVSRIEARLREIEQKLNRLTLEKEYLE----KEIQELQEQRI---DLKEQIKSIEKEIENLNGKKE-ELEEELEELEA 875
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1447 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQrEKLQRE 1526
Cdd:TIGR02169  876 ALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPE-EELSLE 954
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1527 KLQREKDMLLAEAFSLK-------QQMEEKDLDIAGFTQKVVSLEAELQDI 1570
Cdd:TIGR02169  955 DVQAELQRVEEEIRALEpvnmlaiQEYEEVLKRLDELKEKRAKLEEERKAI 1005
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1658-2045 2.90e-10

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 64.92  E-value: 2.90e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1658 QLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQimldhlknnapskREIAQLKNQLE 1737
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAR-------------EELEQLEEELE 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1738 ESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM 1817
Cdd:COG4372     70 QAR-------SELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQ 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1818 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETME 1897
Cdd:COG4372    143 SEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLE 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1898 KLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQ 1977
Cdd:COG4372    223 AKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLL 302
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1978 AAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDDG 2045
Cdd:COG4372    303 NLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
PRK01156 PRK01156
chromosome segregation protein; Provisional
1503-2055 4.01e-10

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 65.69  E-value: 4.01e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1503 KQRRFDSELSQAHEETQR---EKLQREKLQREKDMLLAEAFSLKQQMEekdlDIAGFTQKVVSLEAELQDISSQESKDEA 1579
Cdd:PRK01156   136 GQGEMDSLISGDPAQRKKildEILEINSLERNYDKLKDVIDMLRAEIS----NIDYLEEKLKSSNLELENIKKQIADDEK 211
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 SLAKVKKQLRDLEAKVKDQEEE---LDEQAGSIQMLEQAKLRLEME---------MERMRQTHSKEMESRDEEVEEARQS 1647
Cdd:PRK01156   212 SHSITLKEIERLSIEYNNAMDDynnLKSALNELSSLEDMKNRYESEiktaesdlsMELEKNNYYKELEERHMKIINDPVY 291
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1648 CQKKLKQMEVQLEEEYEDKQKALRE-KRELES------KLSTLsdQVNQRDFESEKRLRKDLKRTKALLADaqimlDHLK 1720
Cdd:PRK01156   292 KNRNYINDYFKYKNDIENKKQILSNiDAEINKyhaiikKLSVL--QKDYNDYIKKKSRYDDLNNQILELEG-----YEMD 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1721 NNApSKREIAQLKNQLEE----SEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQ 1796
Cdd:PRK01156   365 YNS-YLKSIESLKKKIEEysknIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELS 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1797 EDMNELMKKHKAAVAQASRDMAQMNDLqaqIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEA-KIRELET- 1873
Cdd:PRK01156   444 RNMEMLNGQSVCPVCGTTLGEEKSNHI---INHYNEKKSRLEEKIREIEIEVKDIDEKIVDlKKRKEYLESeEINKSINe 520
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1874 --RLEFEKTQVKRLENLASRLKETMEKLT-----------EERDQR----------------AAAENREKEQNKRLQRQL 1924
Cdd:PRK01156   521 ynKIESARADLEDIKIKINELKDKHDKYEeiknrykslklEDLDSKrtswlnalavislidiETNRSRSNEIKKQLNDLE 600
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1925 RDTKEEMSELARKEA--EASRKKHELEMD--------LESLEAANQSLQADLKLAFKRIGDLQAAIEDEME-----SDEN 1989
Cdd:PRK01156   601 SRLQEIEIGFPDDKSyiDKSIREIENEANnlnnkyneIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEitsriNDIE 680
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1990 EDLINS---LQDMVTKYQKKKNKLEGDSDVDSELEDRVDGVKSWLSKNKGPSKAPSDdgsLKSSRTALN 2055
Cdd:PRK01156   681 DNLKKSrkaLDDAKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGD---LKRLREAFD 746
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1758-1963 4.14e-10

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 65.42  E-value: 4.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQLsrlqrekNEIQNRLEEDQEDMNELMKKHKAAVAQAsrdmaQMNDLQAQIEESNKEKQEL 1837
Cdd:COG3206    164 QNLELRREEARKALEFLEEQL-------PELRKELEEAEAALEEFRQKNGLVDLSE-----EAKLLLQQLSELESQLAEA 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1838 QEKLQALQSQVEFLEQSM----------VDKSLVSRQEAKIRELETRLEFEKTQV-----------KRLENLASRLKETM 1896
Cdd:COG3206    232 RAELAEAEARLAALRAQLgsgpdalpelLQSPVIQQLRAQLAELEAELAELSARYtpnhpdvialrAQIAALRAQLQQEA 311
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1897 EKLTEE-RDQRAAAENREKEqnkrLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQ 1963
Cdd:COG3206    312 QRILASlEAELEALQAREAS----LQAQLAQLEARLAELPELEAELRRLEREVEVARELYESLLQRLE 375
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1425-1925 6.57e-10

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 64.76  E-value: 6.57e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1425 EVDFTKKRLQQELEDKMEVEQQSRRQLerrlgdlqadsDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQ 1504
Cdd:pfam05557    3 ELIESKARLSQLQNEKKQMELEHKRAR-----------IELEKKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEAL 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 RrfdsELSQAHEETQREKLQREKLQREKDMLLAEA----FSLKQQMEEKDLDIAGFTQKVVSLEAELQDIS--------- 1571
Cdd:pfam05557   72 R----EQAELNRLKKKYLEALNKKLNEKESQLADAreviSCLKNELSELRRQIQRAELELQSTNSELEELQerldllkak 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1572 -SQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRL----EMEME----RMRQTHSKEMESRDEEVE 1642
Cdd:pfam05557  148 aSEAEQLRQNLEKQQSSLAEAEQRIKELEFEIQSQEQDSEIVKNSKSELaripELEKElerlREHNKHLNENIENKLLLK 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1643 EARQSCQKKLKQME------VQLEEEYEDKQKAL---------------------REKRELESKLSTLSDQV-------- 1687
Cdd:pfam05557  228 EEVEDLKRKLEREEkyreeaATLELEKEKLEQELqswvklaqdtglnlrspedlsRRIEQLQQREIVLKEENssltssar 307
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1688 ----NQRDFESEKR--------LRKDLKRTKALLADAQimldhlKNNAPSKREIAQLKNQLE--ESEFTCAAAVKARKAM 1753
Cdd:pfam05557  308 qlekARRELEQELAqylkkiedLNKKLKRHKALVRRLQ------RRVLLLTKERDGYRAILEsyDKELTMSNYSPQLLER 381
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1754 EVEMEDLhlqIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNelMKKHKAAVAQASRDMAQMNDLQAQIEESNKE 1833
Cdd:pfam05557  382 IEEAEDM---TQKMQAHNEEMEAQLSVAEEELGGYKQQAQTLERELQ--ALRQQESLADPSYSKEEVDSLRRKLETLELE 456
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1834 KQELQEKLQALQSQVEFLE-QSMVDKS------LVSRQEAKIRELEtrlefeKTQVKRLENLASRLKETMEKLTEERDQR 1906
Cdd:pfam05557  457 RQRLREQKNELEMELERRClQGDYDPKktkvlhLSMNPAAEAYQQR------KNQLEKLQAEIERLKRLLKKLEDDLEQV 530
                          570
                   ....*....|....*....
gi 1907082211 1907 AAAENREKEQNKRLQRQLR 1925
Cdd:pfam05557  531 LRLPETTSTMNFKEVLDLR 549
PDZ7_MUPP1-PD6_PATJ-like cd06671
PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated ...
221-310 7.02e-10

PDZ domain 7 of multi-PDZ-domain protein 1 (MUPP1), PDZ domain 6 of PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 7 of MUPP1 and PDZ domain 6 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ7 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467159 [Multi-domain]  Cd Length: 96  Bit Score: 58.10  E-value: 7.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  221 LELQRRPTGDFGFSL---RrtTMLDRAPEGQAYRRVV--HFAE-PGAGTKDLalgLVPGDRLVEINGQNVENKSRDEIVE 294
Cdd:cd06671      5 VELWREPGKSLGISIvggR--VMGSRLSNGEEIRGIFikHVLEdSPAGRNGT---LKTGDRILEVNGVDLRNATHEEAVE 79
                           90
                   ....*....|....*.
gi 1907082211  295 MIRQSGDSVRLKVQPI 310
Cdd:cd06671     80 AIRNAGNPVVFLVQSL 95
PRK01156 PRK01156
chromosome segregation protein; Provisional
1297-1839 7.24e-10

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 64.54  E-value: 7.24e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRIS-------ELTSELTDERNTGESASQLL---DAETAERLRT 1366
Cdd:PRK01156   248 EDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINdpvyknrNYINDYFKYKNDIENKKQILsniDAEINKYHAI 327
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1367 EKEMKELQTQYDA-LKKQMevmemevmearliRAAEINGEVDDDDaggEWRLKYERAVREVDFTKKRLQQELEDKMEVEQ 1445
Cdd:PRK01156   328 IKKLSVLQKDYNDyIKKKS-------------RYDDLNNQILELE---GYEMDYNSYLKSIESLKKKIEEYSKNIERMSA 391
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1446 QSRRQLERRLGD---LQADSDESQRALQQLKKKCQRLTA----------ELQDTKLHLEGQQV----RNHELEKKQRR-- 1506
Cdd:PRK01156   392 FISEILKIQEIDpdaIKKELNEINVKLQDISSKVSSLNQriralrenldELSRNMEMLNGQSVcpvcGTTLGEEKSNHii 471
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1507 --FDSELSQAHEETQREKLQREKLQREKDMLLaeafSLKQQMEEKDLD-IAGFTQKVVSLEAELQDissqeskDEASLAK 1583
Cdd:PRK01156   472 nhYNEKKSRLEEKIREIEIEVKDIDEKIVDLK----KRKEYLESEEINkSINEYNKIESARADLED-------IKIKINE 540
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1584 VKKQlrdleakvKDQEEELDEQAGSIQmLEQAKLRLEMEMERMRQTHSKEME---SRDEEVEEARQSCQKKLKQMEVQLE 1660
Cdd:PRK01156   541 LKDK--------HDKYEEIKNRYKSLK-LEDLDSKRTSWLNALAVISLIDIEtnrSRSNEIKKQLNDLESRLQEIEIGFP 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1661 EEYEDKQKALrekRELESKLSTLSDQVNQrdFESEKRLRKDLKRTkalladaqimLDHLKNNAPSKREIAQLKNQLeese 1740
Cdd:PRK01156   612 DDKSYIDKSI---REIENEANNLNNKYNE--IQENKILIEKLRGK----------IDNYKKQIAEIDSIIPDLKEI---- 672
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1741 ftcaaAVKARKaMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMnELMKKHKAAVAQAS--RDMA 1818
Cdd:PRK01156   673 -----TSRIND-IEDNLKKSRKALDDAKANRARLESTIEILRTRINELSDRINDINETL-ESMKKIKKAIGDLKrlREAF 745
                          570       580
                   ....*....|....*....|....*..
gi 1907082211 1819 QMNDLQAQIEES------NKEKQELQE 1839
Cdd:PRK01156   746 DKSGVPAMIRKSasqamtSLTRKYLFE 772
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
1498-1856 8.71e-10

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 64.10  E-value: 8.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1498 HELEKKQRRFDSELSQ--AHEETQREKLQ--REKLQREKDMLLAEAFS-------LKQQMEEKDLDIAGFTQKVVS---L 1563
Cdd:pfam06160   96 DDIEEDIKQILEELDEllESEEKNREEVEelKDKYRELRKTLLANRFSygpaideLEKQLAEIEEEFSQFEELTESgdyL 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1564 EAE--LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELdeQAGSIQMLEQ----AKLRLEMEMERMRQTHSKEMES- 1636
Cdd:pfam06160  176 EARevLEKLEEETDALEELMEDIPPLYEELKTELPDQLEEL--KEGYREMEEEgyalEHLNVDKEIQQLEEQLEENLALl 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1637 ---RDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQ----------------------VNQRD 1691
Cdd:pfam06160  254 enlELDEAEEALEEIEERIDQLYDLLEKEVDAKKYVEKNLPEIEDYLEHAEEQnkelkeelervqqsytlnenelERVRG 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1692 FESE-KRLRKDLKRTKALLADAQI----MLDHLKnnapskreiaQLKNQLEEseftcaaavkarkaMEVEMEDLHLQIDD 1766
Cdd:pfam06160  334 LEKQlEELEKRYDEIVERLEEKEVayseLQEELE----------EILEQLEE--------------IEEEQEEFKESLQS 389
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1767 IAKAKTALEEQLSRLQREKNEIQNRLEEDQ-----EDMNELMKKHKAAVAQASRDMAQ----MNDLQAQIEESNKEKQEL 1837
Cdd:pfam06160  390 LRKDELEAREKLDEFKLELREIKRLVEKSNlpglpESYLDYFFDVSDEIEDLADELNEvplnMDEVNRLLDEAQDDVDTL 469
                          410
                   ....*....|....*....
gi 1907082211 1838 QEKLQALQSQVEFLEQSMV 1856
Cdd:pfam06160  470 YEKTEELIDNATLAEQLIQ 488
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1297-1760 8.97e-10

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 64.55  E-value: 8.97e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKD-EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDE-----------RNTGESASQLLDAETAERL 1364
Cdd:COG4913    329 EAQIRGNGgDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASaeefaalraeaAALLEALEEELEALEEALA 408
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1365 RTEKEMKELQTQYDALKKQMEV---------MEMEVMEARLIRAAEING----------EVDDDDAggEWRL-------- 1417
Cdd:COG4913    409 EAEAALRDLRRELRELEAEIASlerrksnipARLLALRDALAEALGLDEaelpfvgeliEVRPEEE--RWRGaiervlgg 486
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1418 ---------KYERAVRE----------VDFTKKRLQQELEDKMEVEQQS----------------RRQLERRLGDLQADS 1462
Cdd:COG4913    487 faltllvppEHYAAALRwvnrlhlrgrLVYERVRTGLPDPERPRLDPDSlagkldfkphpfrawlEAELGRRFDYVCVDS 566
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1463 desqraLQQLKKKCQRLTAELQ----------DTKLHLEGQQV-------RNHELEKKQRRFDSELSQAHEETQREKLQR 1525
Cdd:COG4913    567 ------PEELRRHPRAITRAGQvkgngtrhekDDRRRIRSRYVlgfdnraKLAALEAELAELEEELAEAEERLEALEAEL 640
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1526 EKLQRekdmlLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDIssqeSKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1605
Cdd:COG4913    641 DALQE-----RREALQRLAEYSWDEIDVASAEREIAELEAELERL----DASSDDLAALEEQLEELEAELEELEEELDEL 711
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1606 AGSIQMLEQAKLRLEMEMERMRQTHSkemESRDEEVEEARQSCQKKLKQmeVQLEEEYEDKQKALREKRE-LESKLSTLS 1684
Cdd:COG4913    712 KGEIGRLEKELEQAEEELDELQDRLE---AAEDLARLELRALLEERFAA--ALGDAVERELRENLEERIDaLRARLNRAE 786
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1685 DQVN------QRDFESEKRlrkDLKRTKALLADAQIMLDHLKNNA-PSKRE-IAQLKNQLEESEFT--CAAAVKARKAME 1754
Cdd:COG4913    787 EELEramrafNREWPAETA---DLDADLESLPEYLALLDRLEEDGlPEYEErFKELLNENSIEFVAdlLSKLRRAIREIK 863

                   ....*.
gi 1907082211 1755 VEMEDL 1760
Cdd:COG4913    864 ERIDPL 869
mukB PRK04863
chromosome partition protein MukB;
1421-1969 9.48e-10

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 64.59  E-value: 9.48e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1421 RAVREVDFTKKRLQQELEDKMEVEQQSRRQLERR--LGDLQADSDES----QRALQQLKKKCQRLTA------ELQDTKL 1488
Cdd:PRK04863   534 RAERLLAEFCKRLGKNLDDEDELEQLQEELEARLesLSESVSEARERrmalRQQLEQLQARIQRLAArapawlAAQDALA 613
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1489 HLEGQQvrNHELEKKQRRfdSELSQAHEETQRE-KLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKV--VSLeA 1565
Cdd:PRK04863   614 RLREQS--GEEFEDSQDV--TEYMQQLLERERElTVERDELAARKQALDEEIERLSQPGGSEDPRLNALAERFggVLL-S 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1566 EL-QDISSQESKD-EASLAKVKKQL--RDLEAkVKDQEEELDEQAGSIQMLEQAKLRL--------EMEMERMRQTHSKE 1633
Cdd:PRK04863   689 EIyDDVSLEDAPYfSALYGPARHAIvvPDLSD-AAEQLAGLEDCPEDLYLIEGDPDSFddsvfsveELEKAVVVKIADRQ 767
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1634 ME-SRDEEV----EEARQscqKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQR-------DFESE-KRLRK 1700
Cdd:PRK04863   768 WRySRFPEVplfgRAARE---KRIEQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHlavafeaDPEAElRQLNR 844
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1701 DLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESeftcaAAVKARKAMEVEMEDLHLQIDDIAKAK--------- 1771
Cdd:PRK04863   845 RRVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPR-----LNLLADETLADRVEEIREQLDEAEEAKrfvqqhgna 919
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1772 -TALEEQLSRLQREKNEI---QNRLEEDQEDMNELMKKHKA-----------AVAQASRDMAQMNDLQAQIEESNKEKQE 1836
Cdd:PRK04863   920 lAQLEPIVSVLQSDPEQFeqlKQDYQQAQQTQRDAKQQAFAltevvqrrahfSYEDAAEMLAKNSDLNEKLRQRLEQAEQ 999
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1837 LQEKL--QALQSQVEFLEQSMVDKSLVSRQEAKireLETRLEFEKtqvkRLENLASRLKETME-KLTEERDQRAAAENRE 1913
Cdd:PRK04863  1000 ERTRAreQLRQAQAQLAQYNQVLASLKSSYDAK---RQMLQELKQ----ELQDLGVPADSGAEeRARARRDELHARLSAN 1072
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1914 KEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEmdlESLEAANQSLQADLKLA 1969
Cdd:PRK04863  1073 RSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMR---EQVVNAKAGWCAVLRLV 1125
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1727-1908 1.15e-09

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 61.09  E-value: 1.15e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1727 REIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELmkkh 1806
Cdd:COG1579     17 SELDRLEHRLKELP-------AELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNV---- 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 kaavaqasRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEKtqvKRLE 1886
Cdd:COG1579     86 --------RNNKEYEALQKEIESLKRRISDLEDEILELMERIEELEEE------LAELEAELAELEAELEEKK---AELD 148
                          170       180
                   ....*....|....*....|..
gi 1907082211 1887 NLASRLKETMEKLTEERDQRAA 1908
Cdd:COG1579    149 EELAELEAELEELEAEREELAA 170
PRK01156 PRK01156
chromosome segregation protein; Provisional
1297-1898 1.22e-09

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 64.15  E-value: 1.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKernELRLSSDRLEtRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTq 1376
Cdd:PRK01156   189 EEKLKSSNLELENIKKQIADDEK---SHSITLKEIE-RLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEIKTAES- 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1377 yDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYeravrevDFTKKRlqqELEDKMEVEQQSRRQLERRLG 1456
Cdd:PRK01156   264 -DLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKN-------DIENKK---QILSNIDAEINKYHAIIKKLS 332
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1457 DLQADSDEsqraLQQLKKKCQRLTAELQDTKLHLEGQQ--VRNHELEKKQRRfdselsqahEETQREKLQREKLQREKDM 1534
Cdd:PRK01156   333 VLQKDYND----YIKKKSRYDDLNNQILELEGYEMDYNsyLKSIESLKKKIE---------EYSKNIERMSAFISEILKI 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1535 LLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR------DL-EAKVKDQEEELDEQAG 1607
Cdd:PRK01156   400 QEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVcpvcgtTLgEEKSNHIINHYNEKKS 479
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1608 SiqmLEQAKLRLEMEMERM--RQTHSKEMESR--DEEVEEARQScQKKLKQMEVQLeEEYEDKQKALREKrelESKLSTL 1683
Cdd:PRK01156   480 R---LEEKIREIEIEVKDIdeKIVDLKKRKEYleSEEINKSINE-YNKIESARADL-EDIKIKINELKDK---HDKYEEI 551
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1684 SDQVNQRDFESekrlrKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEeseftcaaavKARKAMEVEMEDLHLQ 1763
Cdd:PRK01156   552 KNRYKSLKLED-----LDSKRTSWLNALAVISLIDIETNRSRSNEIKKQLNDLE----------SRLQEIEIGFPDDKSY 616
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1764 IDDIAKaktALEEQLSRLQREKNEIQNrLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ- 1842
Cdd:PRK01156   617 IDKSIR---EIENEANNLNNKYNEIQE-NKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKSRKALDd 692
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1843 ALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLaSRLKETMEK 1898
Cdd:PRK01156   693 AKANRARLESTIEILRTRINELSDRINDINETLESMKKIKKAIGDL-KRLREAFDK 747
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1221-1637 1.41e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 63.63  E-value: 1.41e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1221 LARLEEQRDEQTSRHLTLFQAacRGYLARQHFKKRKIQDLAIRCVQKNIKKNKGVKDWPWWKLFTTVRP-LIQVQLSEEQ 1299
Cdd:COG4717     73 LKELEEELKEAEEKEEEYAEL--QEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAeLAELPERLEE 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1300 IRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELT-SELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYD 1378
Cdd:COG4717    151 LEERLEELRELEEELEELEAELAELQEELEELLEQLSLATeEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELE 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1379 ALKKQMEVMEMEVMEARLIRAAEINGEV------DDDDAGGEWRLK---------------YERAVREVDFTKKRLQQEL 1437
Cdd:COG4717    231 QLENELEAAALEERLKEARLLLLIAAALlallglGGSLLSLILTIAgvlflvlgllallflLLAREKASLGKEAEELQAL 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHE---LEKKQRRFDSELSQA 1514
Cdd:COG4717    311 PALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIaalLAEAGVEDEEELRAA 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1515 HEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAgfTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLE-- 1592
Cdd:COG4717    391 LEQAEEYQELKEELEELEEQLEELLGELEELLEALDEEEL--EEELEELEEELEELEEELEELREELAELEAELEQLEed 468
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1593 -------AKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESR 1637
Cdd:COG4717    469 gelaellQELEELKAELRELAEEWAALKLALELLEEAREEYREERLPPVLER 520
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1429-1848 1.84e-09

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 63.39  E-value: 1.84e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1429 TKKRLQQELEDKMEVEQQSRRQLerrlgdlqadSDESQR---ALQQLKKKCQRLT-AELQDTKLHLEGQQVRNhELEKKQ 1504
Cdd:pfam15964  301 TIERLTKERDDLMSALVSVRSSL----------AEAQQRessAYEQVKQAVQMTEeANFEKTKALIQCEQLKS-ELERQK 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 RRFDSEL-SQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAK 1583
Cdd:pfam15964  370 ERLEKELaSQQEKRAQEKEALRKEMKKEREELGATMLALSQNVAQLEAQVEKVTREKNSLVSQLEEAQKQLASQEMDVTK 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1584 VKKQLRDLEAKVKDQEEELdeqagsiqmleqaklrlEMEMERMRQTHSKEMESRDEEVEEARQscqkKLKQMEVQLEEEY 1663
Cdd:pfam15964  450 VCGEMRYQLNQTKMKKDEA-----------------EKEHREYRTKTGRQLEIKDQEIEKLGL----ELSESKQRLEQAQ 508
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1664 EDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQLEESEFTC 1743
Cdd:pfam15964  509 QDAARAREECLKLTELLGESEHQLHLTRLEKESIQQSFSNEAKAQALQAQQREQELT------QKMQQMEAQHDKTVNEQ 582
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1744 AAAVKARKAMEVEMED----LHLQIDDIAKAKTALEEQLSR----LQREKNEIQNRLEEDQED------MNELMK----- 1804
Cdd:pfam15964  583 YSLLTSQNTFIAKLKEecctLAKKLEEITQKSRSEVEQLSQekeyLQDRLEKLQKRNEELEEQcvqhgrMHERMKqrlrq 662
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1805 --KHKAAVAQasrdmaQMNDLQAQIEESNKEKQELQEKLQALQSQV 1848
Cdd:pfam15964  663 ldKHCQATAQ------QLVQLLSKQNQLFKERQNLTEEVQSLRSQV 702
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
1433-1847 2.11e-09

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 62.74  E-value: 2.11e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1433 LQQELEDKMEVEQQSrrqLERRLGDLQADSDESQRALQQLKkkcQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELS 1512
Cdd:pfam05667  216 LAAAQEWEEEWNSQG---LASRLTPEEYRKRKRTKLLKRIA---EQLRSAALAGTEATSGASRSAQDLAELLSSFSGSST 289
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1513 QAHEETQREKLQR-EKLQREKDMLLAEAFSLKQQMEEKDLDIAGfTQKVVSLEAELQDISSqeskdeaSLAKVKKQLRDL 1591
Cdd:pfam05667  290 TDTGLTKGSRFTHtEKLQFTNEAPAATSSPPTKVETEEELQQQR-EEELEELQEQLEDLES-------SIQELEKEIKKL 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1592 EAKVKDQEEELDEQagsiqmlEQAKLRLEMEMERMRQTHsKEMESRDEEVEEARQSCQKKLKQMeVQLEEEYEDKQKAL- 1670
Cdd:pfam05667  362 ESSIKQVEEELEEL-------KEQNEELEKQYKVKKKTL-DLLPDAEENIAKLQALVDASAQRL-VELAGQWEKHRVPLi 432
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1671 REKRELESKLSTlsdqvnqRDFESEKRLR--KDLK-RTKALLADAQimldhlknnapSKRE-IAQLKNQLEE-------S 1739
Cdd:pfam05667  433 EEYRALKEAKSN-------KEDESQRKLEeiKELReKIKEVAEEAK-----------QKEElYKQLVAEYERlpkdvsrS 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1740 EFTcaaavkaRKAMEVeMEDLHLQIDDIAKaktALEEQLSrLQREKNEIQNRLEEDQEDMNELM---KKHKAAVAQASRD 1816
Cdd:pfam05667  495 AYT-------RRILEI-VKNIKKQKEEITK---ILSDTKS-LQKEINSLTGKLDRTFTVTDELVfkdAKKDESVRKAYKY 562
                          410       420       430
                   ....*....|....*....|....*....|....*...
gi 1907082211 1817 MAQMNDLQAQ----IEESNK---EKQELQEKLQALQSQ 1847
Cdd:pfam05667  563 LAALHENCEQliqtVEETGTimrEIRDLEEQIETESGK 600
PDZ5_MAGI-1_3-like cd06735
PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
222-309 3.36e-09

PDZ domain 5 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5, and belongs to this MAGI1,2,3-like family. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ5 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467217 [Multi-domain]  Cd Length: 84  Bit Score: 55.66  E-value: 3.36e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  222 ELQRRPTGdFGFSLRrttmldrapEGQAYRR----VVHFAEPGAGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMI 296
Cdd:cd06735      5 ELERGPKG-FGFSIR---------GGREYNNmplyVLRLAEDGPAQRD---GrLRVGDQILEINGESTQGMTHAQAIELI 71
                           90
                   ....*....|...
gi 1907082211  297 RQSGDSVRLKVQP 309
Cdd:cd06735     72 RSGGSVVRLLLRR 84
PDZ_FRMPD1_3_4-like cd06769
PDZ domain of FERM and PDZ domain-containing protein 1 (FRMPD1), FRMPD3, FRMPD4, and related ...
220-307 4.04e-09

PDZ domain of FERM and PDZ domain-containing protein 1 (FRMPD1), FRMPD3, FRMPD4, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of FRMPD1, FRMPD3, FRMPD4, and related domains. FRMPD1 (also known as FERM domain-containing protein 2, FRMD2), inhibits the malignant phenotype of lung cancer by activating the Hippo pathway via interaction with WWC3; the FRMPD1 PDZ domain binds WWC3. FRMPD3 is a target gene of the neuron-specific transcription factor NPAS4 that is involved in synaptic plasticity. FRMPD4 (also known as PDZ domain-containing protein 10, PDZD10, PDZK10, PSD-95-interacting regulator of spine morphogenesis, and Preso) regulates dendritic spine morphogenesis, and mGluR1/5 signaling; the FRMPD4 PDZ domain binds PAK-interacting exchange factor-beta (betaPix). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This FRMPD1,3,4-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467250 [Multi-domain]  Cd Length: 75  Bit Score: 54.94  E-value: 4.04e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMLdrapegqayrrVVHFAEPGaGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMIRQ 298
Cdd:cd06769      1 TVEIQRDAVLGFGFVAGSERPV-----------VVRSVTPG-GPSE---GkLLPGDQILKINNEPVEDLPRERVIDLIRE 65

                   ....*....
gi 1907082211  299 SGDSVRLKV 307
Cdd:cd06769     66 CKDSIVLTV 74
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1324-1853 4.62e-09

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 62.28  E-value: 4.62e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1324 LRLSSDR------LETRI-----SELTSELTD---ERNTG-ESASQLLDAETAERLRTEKEMKELQTQYDaLKKQMEVME 1388
Cdd:COG3096    187 LRSSSDRskfyrlIEASLyggisSAITRSLRDyllPENSGvRKAFQDMEAALRENRMTLEAIRVTQSDRD-LFKHLITEA 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1389 MEVMEARLIRAAEingevddddaggEWRLKYERAV--REVDFTKKRLQQELEDKM-----EVEQQSRRQ--LERrlgDLQ 1459
Cdd:COG3096    266 TNYVAADYMRHAN------------ERRELSERALelRRELFGARRQLAEEQYRLvemarELEELSAREsdLEQ---DYQ 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1460 ADSDESQRALQ--QLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDML-- 1535
Cdd:COG3096    331 AASDHLNLVQTalRQQEKIERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQqt 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1536 -----------LAEAfslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQeskdeaslakvkkqLRDLEAKVKDQEEELDE 1604
Cdd:COG3096    411 raiqyqqavqaLEKA---RALCGLPDLTPENAEDYLAAFRAKEQQATEE--------------VLELEQKLSVADAARRQ 473
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1605 QAGSIQMLEqaklRLEMEMERmrqthSKEMESRDEEVEEARQscQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLS 1684
Cdd:COG3096    474 FEKAYELVC----KIAGEVER-----SQAWQTARELLRRYRS--QQALAQRLQQLRAQLAELEQRLRQQQNAERLLEEFC 542
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1685 dqvnqrdfeseKRLRKDLkrtkallaDAQIMLDHLKnnapskreiAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQI 1764
Cdd:COG3096    543 -----------QRIGQQL--------DAAEELEELL---------AELEAQLEELEEQAAEAVEQRSELRQQLEQLRARI 594
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1765 D---DIAKAKTALEEQLSRLQREKNEiqnrleedqedmnelmkkhkaAVAQASRDMAQMNDLQAQIEESNKEKQELQEKL 1841
Cdd:COG3096    595 KelaARAPAWLAAQDALERLREQSGE---------------------ALADSQEVTAAMQQLLEREREATVERDELAARK 653
                          570
                   ....*....|..
gi 1907082211 1842 QALQSQVEFLEQ 1853
Cdd:COG3096    654 QALESQIERLSQ 665
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1304-1925 5.00e-09

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 61.76  E-value: 5.00e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1304 DEEIQQLRSKLEKVEKERNELRLSSDRLETRIS--ELTSELTDE-----------RNTGESASQLLDAETAERLRTEKEM 1370
Cdd:pfam10174  115 EENFRRLQSEHERQAKELFLLRKTLEEMELRIEtqKQTLGARDEsikkllemlqsKGLPKKSGEEDWERTRRIAEAEMQL 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYD-------ALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGgewrlKYERAVREVDFTKKRLQQELEDKMEV 1443
Cdd:pfam10174  195 GHLEVLLDqkekeniHLREELHRRNQLQPDPAKTKALQTVIEMKDTKIS-----SLERNIRDLEDEVQMLKTNGLLHTED 269
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1444 EQQSRRQLE----------RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEgqqVRNHELEKKQRR---FDSE 1510
Cdd:pfam10174  270 REEEIKQMEvykshskfmkNKIDQLKQELSKKESELLALQTKLETLTNQNSDCKQHIE---VLKESLTAKEQRaaiLQTE 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1511 ---LSQAHEETQR------EKLQRekLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDissqesKDeasl 1581
Cdd:pfam10174  347 vdaLRLRLEEKESflnkktKQLQD--LTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIENLQEQLRD------KD---- 414
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 akvkKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQsCQKKLKQMEVQLEE 1661
Cdd:pfam10174  415 ----KQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELESLKK-ENKDLKEKVSALQP 489
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1662 EYEDKQKALREKRELESKLST----LSDQVNQRDFESEKRlRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQL- 1736
Cdd:pfam10174  490 ELTEKESSLIDLKEHASSLASsglkKDSKLKSLEIAVEQK-KEECSKLENQLKKAHNAEEAVRTNPEINDRIRLLEQEVa 568
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1737 ---EESEfTCAAAVKARKAM--EVEMEDlhlqiDDIAKAKTALEEQLSRLQREkneiQNRLEEDQEDMNELMKKHKAAVA 1811
Cdd:pfam10174  569 rykEESG-KAQAEVERLLGIlrEVENEK-----NDKDKKIAELESLTLRQMKE----QNKKVANIKHGQQEMKKKGAQLL 638
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1812 QASR---DMAQMNDLQAQIEES----NKEKQELQEKLQALQSQVEFLEQSmvDKSLVSRQEAKIRELETRLEF------- 1877
Cdd:pfam10174  639 EEARrreDNLADNSQQLQLEELmgalEKTRQELDATKARLSSTQQSLAEK--DGHLTNLRAERRKQLEEILEMkqealla 716
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1878 ---EK-TQVKRLENLASRLKETMEKLteerdqraAAENREKEqnkRLQRQLR 1925
Cdd:pfam10174  717 aisEKdANIALLELSSSKKKKTQEEV--------MALKREKD---RLVHQLK 757
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1414-1755 5.52e-09

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 61.68  E-value: 5.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1414 EWRLKYE-RAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRlgdlqadsdeSQRALQQLKKKCQRltaELQDTKLHLEG 1492
Cdd:pfam17380  255 EYTVRYNgQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEKM----------EQERLRQEKEEKAR---EVERRRKLEEA 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1493 QQVRNHELEKKQRRFDSELSQAHE-ETQREKLQREKLQREKDMLLAEAFSLK-QQMEEKDLDIAGFTQKVVSLEAELQDI 1570
Cdd:pfam17380  322 EKARQAEMDRQAAIYAEQERMAMErERELERIRQEERKRELERIRQEEIAMEiSRMRELERLQMERQQKNERVRQELEAA 401
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1571 SSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQagsIQMLEQAKLRlemEMERMRQTHSKEMES----RDEEVEEARQ 1646
Cdd:pfam17380  402 RKVKILEEERQRKIQQQKVEMEQIRAEQEEARQRE---VRRLEEERAR---EMERVRLEEQERQQQverlRQQEEERKRK 475
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1647 SCQKKLKQMEVQLEEEY----------EDKQKALREKR-------ELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALL 1709
Cdd:pfam17380  476 KLELEKEKRDRKRAEEQrrkilekeleERKQAMIEEERkrkllekEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQ 555
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1710 ADAQIMLDHLKNNAPSKREiAQLKNQLEESEftcaaavKARKAMEV 1755
Cdd:pfam17380  556 EQMRKATEERSRLEAMERE-REMMRQIVESE-------KARAEYEA 593
PDZ4_MAGI-1_3-like cd06734
PDZ domain 4 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
270-309 5.55e-09

PDZ domain 4 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as beta-strands A, -B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467216 [Multi-domain]  Cd Length: 84  Bit Score: 54.93  E-value: 5.55e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907082211  270 LVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd06734     45 LKVGDRILAVNGISILNLSHGDIVNLIKDSGLSVTLTIVP 84
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1290-1693 5.64e-09

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 61.06  E-value: 5.64e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1290 LIQVQLsEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgESASQLLDAETaerlRTEKE 1369
Cdd:pfam07888   31 LLQNRL-EECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELK---EELRQSREKHE----ELEEK 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1370 MKELQTQYDALKKQMevmemevmearliraaeingevdddDAGGEWRLKYERAVREVDFTKKRLQQ---ELEDKMEVEQQ 1446
Cdd:pfam07888  103 YKELSASSEELSEEK-------------------------DALLAQRAAHEARIRELEEDIKTLTQrvlERETELERMKE 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1447 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETqreklqre 1526
Cdd:pfam07888  158 RAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKE-------- 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1527 klqrekdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQA 1606
Cdd:pfam07888  230 ----------AENEALLEELRSLQERLNASERKVEGLGEELSSMAAQRDRTQAELHQARLQAAQLTLQLADASLALREGR 299
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1607 GSIQMlEQAKLRLEMEMERMR-QTHSKEMESRDEEVEEARQSCQKklkqMEVQLEEEYEDKQKALRE-KRELESKLSTLs 1684
Cdd:pfam07888  300 ARWAQ-ERETLQQSAEADKDRiEKLSAELQRLEERLQEERMEREK----LEVELGREKDCNRVQLSEsRRELQELKASL- 373

                   ....*....
gi 1907082211 1685 dQVNQRDFE 1693
Cdd:pfam07888  374 -RVAQKEKE 381
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1784-2022 6.41e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 61.85  E-value: 6.41e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1784 EKNEIQNRLEEDQEDMNELMKKHKAAVaQASRdmaQMNDLQaQIEESNKEKQELQEKLQALQSQVEFLEqsmvdkslVSR 1863
Cdd:COG4913    219 EEPDTFEAADALVEHFDDLERAHEALE-DARE---QIELLE-PIRELAERYAAARERLAELEYLRAALR--------LWF 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1864 QEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERD----QRAAAENREKEQnkrLQRQLRDTKEEMSELARKEA 1939
Cdd:COG4913    286 AQRRLELLEAELEELRAELARLEAELERLEARLDALREELDeleaQIRGNGGDRLEQ---LEREIERLERELEERERRRA 362
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1940 EASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMesDENEDLINSLQDMVTKYQKKKNKLE-GDSDVDS 2018
Cdd:COG4913    363 RLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEAL--AEAEAALRDLRRELRELEAEIASLErRKSNIPA 440

                   ....
gi 1907082211 2019 ELED 2022
Cdd:COG4913    441 RLLA 444
Filament pfam00038
Intermediate filament protein;
1730-1994 7.15e-09

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 59.93  E-value: 7.15e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1730 AQLKNQLEESEFTCAAAVKARKAM-EVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEdqedmnELMKKHKA 1808
Cdd:pfam00038   28 KLLETKISELRQKKGAEPSRLYSLyEKEIEDLRRQLDTLTVERARLQLELDNLRLAAEDFRQKYED------ELNLRTSA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 AvaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslvsrQEAKIRELETRLEFEKTQVK----R 1884
Cdd:pfam00038  102 E--------NDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKN---------HEEEVRELQAQVSDTQVNVEmdaaR 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1885 LENLASRLKETMEKLTEE-RDQRAAAENREKEQNKRLQRQ-------LRDTKEEMSELarkeaeaSRKKHELEMDLESLE 1956
Cdd:pfam00038  165 KLDLTSALAEIRAQYEEIaAKNREEAEEWYQSKLEELQQAaarngdaLRSAKEEITEL-------RRTIQSLEIELQSLK 237
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1957 AANQSLQA-----------DLKLAFKRIGDLQAA---IEDEMES--DENEDLIN 1994
Cdd:pfam00038  238 KQKASLERqlaeteeryelQLADYQELISELEAElqeTRQEMARqlREYQELLN 291
PDZ pfam00595
PDZ domain; PDZ domains are found in diverse signaling proteins.
220-308 9.68e-09

PDZ domain; PDZ domains are found in diverse signaling proteins.


Pssm-ID: 395476 [Multi-domain]  Cd Length: 81  Bit Score: 54.21  E-value: 9.68e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRttMLDRAPEGQAYRRVVHFaepGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:pfam00595    1 QVTLEKDGRGGLGFSLKG--GSDQGDPGIFVSEVLPG---GAAEAG---GLKVGDRILSINGQDVENMTHEEAVLALKGS 72

                   ....*....
gi 1907082211  300 GDSVRLKVQ 308
Cdd:pfam00595   73 GGKVTLTIL 81
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1723-1918 1.22e-08

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 59.84  E-value: 1.22e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1723 APSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNEL 1802
Cdd:COG3883     12 AFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGER 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1803 ---MKKHKAAVAQ-----------------------ASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV 1856
Cdd:COG3883     92 araLYRSGGSVSYldvllgsesfsdfldrlsalskiADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKA 171
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1857 D-KSLVSRQEAKIRELETRlefEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNK 1918
Cdd:COG3883    172 ElEAQQAEQEALLAQLSAE---EAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAA 231
PRK11281 PRK11281
mechanosensitive channel MscK;
1673-1938 1.65e-08

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 60.31  E-value: 1.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1673 KRELESKLSTLSDQvnqRDFESEKrlrkdlkrtKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEseftcaAAVKARKA 1752
Cdd:PRK11281    38 EADVQAQLDALNKQ---KLLEAED---------KLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQ------APAKLRQA 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1753 MEvEMEDlhLQIDDIAKAKTALEEQ-LSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESN 1831
Cdd:PRK11281   100 QA-ELEA--LKDDNDEETRETLSTLsLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIR 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1832 K---------------EKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELET----RLEfekTQVKRLENLAS-- 1890
Cdd:PRK11281   177 NllkggkvggkalrpsQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTariqRLE---HQLQLLQEAINsk 253
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1891 RLKETMEKLTEERDQRAAAEN-------REKEQNKRLQRQLRDTKEEMSELARKE 1938
Cdd:PRK11281   254 RLTLSEKTVQEAQSQDEAARIqanplvaQELEINLQLSQRLLKATEKLNTLTQQN 308
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1588-1938 2.04e-08

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 58.78  E-value: 2.04e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1588 LRDLEAKV------KDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEE-VEEARQSCQKKLKQMEVQLE 1660
Cdd:pfam13868    8 LRELNSKLlaakcnKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEErKEERKRYRQELEEQIEEREQ 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1661 EEYEDKQKALREKRELesklstLSDQVNQRDFESEKRLRKDLKRTKAlladaqimldhlknnapsKREIAQLKNQLEese 1740
Cdd:pfam13868   88 KRQEEYEEKLQEREQM------DEIVERIQEEDQAEAEEKLEKQRQL------------------REEIDEFNEEQA--- 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1741 ftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQ----REKNEIQNRLEEDQEDMNELMKKHKAAVAQASRD 1816
Cdd:pfam13868  141 -------EWKELEKEEEREEDERILEYLKEKAEREEEREAEReeieEEKEREIARLRAQQEKAQDEKAERDELRAKLYQE 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1817 MAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV-----DKSLVSRQEAKIRELEtRLEFEKTQVKRLENLasR 1891
Cdd:pfam13868  214 EQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAeeaerEEEEFERMLRKQAEDE-EIEQEEAEKRRMKRL--E 290
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082211 1892 LKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKE 1938
Cdd:pfam13868  291 HRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRERIEEERQK 337
mukB PRK04863
chromosome partition protein MukB;
1296-1969 2.57e-08

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 59.97  E-value: 2.57e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKLEKVEKERNEL----RLSSDRL------------ETRISELTSELTDERNTGESASQLLDAE 1359
Cdd:PRK04863   298 SRRQLAAEQYRLVEMARELAELNEAESDLeqdyQAASDHLnlvqtalrqqekIERYQADLEELEERLEEQNEVVEEADEQ 377
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1360 TAER----LRTEKEMKELQTQ-------YDALkkQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAvREVDF 1428
Cdd:PRK04863   378 QEENearaEAAEEEVDELKSQladyqqaLDVQ--QTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQA-KEQEA 454
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1429 TKKRLQqeLEDKMEVEQQSRRQLERRLGDLQ--ADSDESQRALQQLKKKCQRL-TAELQDTKLhlegQQVRNHElekkqr 1505
Cdd:PRK04863   455 TEELLS--LEQKLSVAQAAHSQFEQAYQLVRkiAGEVSRSEAWDVARELLRRLrEQRHLAEQL----QQLRMRL------ 522
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1506 rfdSELSQAHEETQR-EKLQREKLQREKDMLLAEAFsLKQQMEEkdldiagftqkvvsLEAELQDISSQeskdeaslakv 1584
Cdd:PRK04863   523 ---SELEQRLRQQQRaERLLAEFCKRLGKNLDDEDE-LEQLQEE--------------LEARLESLSES----------- 573
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1585 KKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRdEEVEEARQSCQKKLKQMEVQlEEEYE 1664
Cdd:PRK04863   574 VSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDS-QDVTEYMQQLLERERELTVE-RDELA 651
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 dkqkalREKRELESKLSTLSdqvnQRDFESEKRLRKDLKRTKA-LLAD--AQIMLDhlknNAP----------------- 1724
Cdd:PRK04863   652 ------ARKQALDEEIERLS----QPGGSEDPRLNALAERFGGvLLSEiyDDVSLE----DAPyfsalygparhaivvpd 717
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1725 ---SKREIAQLKNQLEE------------------SEFTCAAAVK-----------------ARKAMEVEMEDLHLQIDD 1766
Cdd:PRK04863   718 lsdAAEQLAGLEDCPEDlyliegdpdsfddsvfsvEELEKAVVVKiadrqwrysrfpevplfGRAAREKRIEQLRAEREE 797
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1767 IAkaktaleEQLSRLQREKNEIQnRLeedQEDMNELMKKHkAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQS 1846
Cdd:PRK04863   798 LA-------ERYATLSFDVQKLQ-RL---HQAFSRFIGSH-LAVAFEADPEAELRQLNRRRVELERALADHESQEQQQRS 865
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QVEFLEQSMV-------------DKSLVSRQEAkIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEnRE 1913
Cdd:PRK04863   866 QLEQAKEGLSalnrllprlnllaDETLADRVEE-IREQLDEAEEAKRFVQQHGNALAQLEPIVSVLQSDPEQFEQLK-QD 943
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1914 KEQNKRLQRQLRDTKEEMSEL-ARKEAEASrkkHELEMDLESLEAANQSLQADLKLA 1969
Cdd:PRK04863   944 YQQAQQTQRDAKQQAFALTEVvQRRAHFSY---EDAAEMLAKNSDLNEKLRQRLEQA 997
PDZ_SNX27-like cd23070
PDZ domain of sorting nexin-27 (SNX27), and related domains; PDZ (PSD-95 (Postsynaptic density ...
231-307 2.63e-08

PDZ domain of sorting nexin-27 (SNX27), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SNX27, and related domains. SNX27 is involved in retrograde transport from endosome to plasma membrane. The PDZ domain of SNX27 links cargo identification to retromer-mediated transport. SNX27 binds to the retromer complex (vacuolar protein sorting 26(VPS26)-VPS29-VPS35), via its PDZ domain binding to VPS26. The SNX27 PDZ domain also binds to cargo including the G-protein-coupled receptors (GPCRs): beta2-adrenergic receptor (beta2AR), beta1AR, parathyroid hormone receptor (PTHR), alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptors (AMPARs), NMDA receptors, 5-hydroxytryptamine 4a receptors, frizzled receptors, and somatostatin receptor subtype 5 (SSTR5). Additional binding partners of the SNX27 PDZ domain include G protein-gated inwardly rectifying potassium (Kir3) channels, angiotensin-converting enzyme 2 (ACE2), and PTEN (phosphatase and tensin homolog deleted on chromosome 10); PTEN binding to SNX27 prevents SNX27's association with the retromer complex. SNX27 has been reported to be a host factor needed for efficient entry of an engineered SARS-CoV-2 variant, the spike protein of which contains a deletion at the S1/S2 subunit cleavage site; the PDZ domain of SNX27 binds angiotensin-converting enzyme 2 (ACE2), and may be involved in recycling ACE2 to the plasma membrane, thereby promoting viral entry. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SNX27-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467283 [Multi-domain]  Cd Length: 93  Bit Score: 53.57  E-value: 2.63e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  231 FGFSLRRTTMLD---RAPEGQAYRRVVHFA---EPGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVR 304
Cdd:cd23070     12 FGFNVRGQVSEGgqlRSINGELYAPLQHVSavlEGGAADKA---GVRKGDRILEVNGVNVEGATHKQVVDLIKSGGDELT 88

                   ...
gi 1907082211  305 LKV 307
Cdd:cd23070     89 LTV 91
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1726-1942 2.91e-08

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 59.26  E-value: 2.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1726 KREIAQLKNQLEESEftcaAAVKARKAmEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEdqedMNELMKK 1805
Cdd:COG3206    181 EEQLPELRKELEEAE----AALEEFRQ-KNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAA----LRAQLGS 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1806 HKAAVAQASRDmAQMNDLQAQIEESNKEKQELQEKL-------QALQSQVEFLEQSmvdksLVSRQEAKIRELETRLEFE 1878
Cdd:COG3206    252 GPDALPELLQS-PVIQQLRAQLAELEAELAELSARYtpnhpdvIALRAQIAALRAQ-----LQQEAQRILASLEAELEAL 325
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1879 KTQVKRLENLASRLKETMEKLTEERDQRAaaenrekeqnkRLQRQLRDTKEEMSELARKEAEAS 1942
Cdd:COG3206    326 QAREASLQAQLAQLEARLAELPELEAELR-----------RLEREVEVARELYESLLQRLEEAR 378
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1671-2025 2.96e-08

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 59.28  E-value: 2.96e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1671 REKRELESKLSTLSDQVNQRDfesEKRLRKDLKRTKALLADAQIMLDHLKnnapSKREIAqlKNQLEESEFTCAAAVKAR 1750
Cdd:PRK02224   180 RVLSDQRGSLDQLKAQIEEKE---EKDLHERLNGLESELAELDEEIERYE----EQREQA--RETRDEADEVLEEHEERR 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1751 KamevEMEDLHLQIDDIAKAKTALEeqlsrlqREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEES 1830
Cdd:PRK02224   251 E----ELETLEAEIEDLRETIAETE-------REREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1831 NKEKQELQEKL-QALQSQVEFLEQsmvdkslVSRQEAKIRELETRLEFEKTQVKRLEnlaSRLKETMEKLTEERDQRAAA 1909
Cdd:PRK02224   320 EDRDEELRDRLeECRVAAQAHNEE-------AESLREDADDLEERAEELREEAAELE---SELEEAREAVEDRREEIEEL 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1910 ENREKEQNKRLQR---QLRDTKEEMSELARKEAEASRKKHELEMDLESLE---AANQSLQADLKLA-----FKRIGDLQA 1978
Cdd:PRK02224   390 EEEIEELRERFGDapvDLGNAEDFLEELREERDELREREAELEATLRTARervEEAEALLEAGKCPecgqpVEGSPHVET 469
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082211 1979 AIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEDRVD 2025
Cdd:PRK02224   470 IEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEE 516
FAM184 pfam15665
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
1463-1677 3.86e-08

Family with sequence similarity 184, A and B; The function of FAM184 is not known.


Pssm-ID: 464788 [Multi-domain]  Cd Length: 211  Bit Score: 56.21  E-value: 3.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1463 DESQRALQQLKK----KCQRLTAELQDTKLHLEGQQVRNHELEKKqrrfdselSQAHEETQREKlqreklQREKDMLLAE 1538
Cdd:pfam15665   10 DEHEAEIQALKEaheeEIQQILAETREKILQYKSKIGEELDLKRR--------IQTLEESLEQH------ERMKRQALTE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1539 AFSLKQQMEEKDL-DIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQL-RDLEAKVKDQEEELDEQAGSIQMLEQAK 1616
Cdd:pfam15665   76 FEQYKRRVEERELkAEAEHRQRVVELSREVEEAKRAFEEKLESFEQLQAQFeQEKRKALEELRAKHRQEIQELLTTQRAQ 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1617 LRLEM-EMERMRQTHSKEMESRDEEVEEarqscqkkLKQMEVQLEEEYEDK---QKALREkRELE 1677
Cdd:pfam15665  156 SASSLaEQEKLEELHKAELESLRKEVED--------LRKEKKKLAEEYEQKlskAQAFYE-RELE 211
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1540-1924 4.16e-08

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 58.69  E-value: 4.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1540 FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQ-------EEELDEQAGSIQML 1612
Cdd:PRK04778   101 RKAKHEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANrfsfgpaLDELEKQLENLEEE 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1613 -------------EQAK---LRLEMEMERMRQThskeMesrdEEVEEARQSCQKKLKQmevQLEEEYEDKQKALREKREL 1676
Cdd:PRK04778   181 fsqfveltesgdyVEAReilDQLEEELAALEQI----M----EEIPELLKELQTELPD---QLQELKAGYRELVEEGYHL 249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1677 ESKlstlsdqvnqrDFESE-KRLRKDLKRTKALLADAQimLDHLK-NNAPSKREIAQLKNQLEeseftcaAAVKARKAME 1754
Cdd:PRK04778   250 DHL-----------DIEKEiQDLKEQIDENLALLEELD--LDEAEeKNEEIQERIDQLYDILE-------REVKARKYVE 309
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1755 VEMEDLHLQIDDIAKAKTALEEQLSRLQ-----------------REKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDM 1817
Cdd:PRK04778   310 KNSDTLPDFLEHAKEQNKELKEEIDRVKqsytlneselesvrqleKQLESLEKQYDEITERIAEQEIAYSELQEELEEIL 389
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1818 AQMNDLQAQIEESN-------KEKQELQEKLQALQSQVE----FLEQS---------MVDKSLVSRQeakIRELETRLE- 1876
Cdd:PRK04778   390 KQLEEIEKEQEKLSemlqglrKDELEAREKLERYRNKLHeikrYLEKSnlpglpedyLEMFFEVSDE---IEALAEELEe 466
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1877 --FEKTQVKRLENLASRLKETMEKLTEE-RDQRAAAE------NREKEQNKRLQRQL 1924
Cdd:PRK04778   467 kpINMEAVNRLLEEATEDVETLEEETEElVENATLTEqliqyaNRYRSDNEEVAEAL 523
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1667-1843 5.81e-08

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 56.09  E-value: 5.81e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1667 QKALREKRELESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQIMLDHLKNnapskrEIAQLKNQLEESEFTCAAA 1746
Cdd:COG1579     13 QELDSELDRLEHRLKELPAELAELE-DELAALEARLEAAKTELEDLEKEIKRLEL------EIEEVEARIKKYEEQLGNV 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1747 VKAR--KAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAqasrdmaqmnDLQ 1824
Cdd:COG1579     86 RNNKeyEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELA----------ELE 155
                          170
                   ....*....|....*....
gi 1907082211 1825 AQIEESNKEKQELQEKLQA 1843
Cdd:COG1579    156 AELEELEAEREELAAKIPP 174
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
1503-1931 6.07e-08

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 57.94  E-value: 6.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1503 KQRRFDSELSQAheETQREKLQ----REKLQREKDMLLA-EAFSLKQQMEEKDLDIAgfTQKVVSLEAELQDIssQESKD 1577
Cdd:pfam06160    4 LRKKIYKEIDEL--EERKNELMnlpvQEELSKVKKLNLTgETQEKFEEWRKKWDDIV--TKSLPDIEELLFEA--EELND 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1578 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQaklrlememermrqthsKEMESRdEEVEEAR---QSCQKKLKQ 1654
Cdd:pfam06160   78 KYRFKKAKKALDEIEELLDDIEEDIKQILEELDELLE-----------------SEEKNR-EEVEELKdkyRELRKTLLA 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1655 MEVQLEEEYEDKQKALrekRELESKLSTLSDQVNQRDFESEK----RLRKDLKRTKALLADAQIMLDHLKNNAPskREIA 1730
Cdd:pfam06160  140 NRFSYGPAIDELEKQL---AEIEEEFSQFEELTESGDYLEARevleKLEEETDALEELMEDIPPLYEELKTELP--DQLE 214
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1731 QLKN---QLEESEFtcaaaVKARKAMEVEMEDLHlqiDDIAKAKTALEE-QLSRLQREKNEIQNRLEEDQEDM------- 1799
Cdd:pfam06160  215 ELKEgyrEMEEEGY-----ALEHLNVDKEIQQLE---EQLEENLALLENlELDEAEEALEEIEERIDQLYDLLekevdak 286
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1800 NELMKKHKAAVAQASRDMAQMNDLQAQIEESNK----------EKQELQEKLQALQSQVEFLEQSMVDKSLV-SRQEAKI 1868
Cdd:pfam06160  287 KYVEKNLPEIEDYLEHAEEQNKELKEELERVQQsytlneneleRVRGLEKQLEELEKRYDEIVERLEEKEVAySELQEEL 366
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1869 RELETRLE-FEKTQVKrlenlasrLKETMEKLTEErdqraaaENREKEQNKRLQRQLRDTKEEM 1931
Cdd:pfam06160  367 EEILEQLEeIEEEQEE--------FKESLQSLRKD-------ELEAREKLDEFKLELREIKRLV 415
cpPDZ_CPP-like cd06782
circularly permuted PDZ domain of C-terminal processing peptidase (CPP), a serine protease, ...
269-309 7.34e-08

circularly permuted PDZ domain of C-terminal processing peptidase (CPP), a serine protease, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of CPP (also known as tail-specific protease, PRC protein, Protease Re, and Photosystem II D1 protein processing peptidase), and related domains. CPP belongs to the peptidase S41A family. It cleaves a C-terminal 11 residue peptide from the precursor form of penicillin-binding protein 3, and may have a role in protecting bacterium from thermal and osmotic stresses. In the plant chloroplast, the enzyme removes the C-terminal extension of the D1 polypeptide of photosystem II. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains and as well as those with circular permutations and domain swapping of beta-strands. The canonical PDZ domain contains six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F. This CPP-like PDZ domain is a circularly permuted PDZ domain which places beta-strand A on the C-terminus. Another permutation exists in the PDZ superfamily which places both beta-strands A and B on the C-terminus.


Pssm-ID: 467623 [Multi-domain]  Cd Length: 88  Bit Score: 52.10  E-value: 7.34e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIR-QSGDSVRLKVQP 309
Cdd:cd06782     31 GIKPGDVIVAVDGESVRGMSLDEVVKLLRgPKGTKVKLTIRR 72
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1298-2028 7.78e-08

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 58.30  E-value: 7.78e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKER------NELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAeRLRTEKEMK 1371
Cdd:PTZ00440  1190 EEIESYKKDIDQVKKNMSKERNDHlttfeyNAYYDKATASYENIEELTTEAKGLKGEANRSTNVDELKEI-KLQVFSYLQ 1268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1372 ELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEwrlKYER-AVREVDFTKkRLQQELEDKMEVEQQSRRQ 1450
Cdd:PTZ00440  1269 QVIKENNKMENALHEIKNMYEFLISIDSEKILKEILNSTKKAE---EFSNdAKKELEKTD-NLIKQVEAKIEQAKEHKNK 1344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1451 LERRLGDLQADSDesqraLQQLKKKCQRLTAELQDTKLHLEgqqvrnhELEKKQRRFDSELSQAH------------EET 1518
Cdd:PTZ00440  1345 IYGSLEDKQIDDE-----IKKIEQIKEEISNKRKEINKYLS-------NIKSNKEKCDLHVRNASrgkdkidflnkhEAI 1412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1519 QREKLQREKLQREKDML------LAEAFSLKQQMEEKDLDIAGFTQKVVSLeaeLQD--ISSQESKDEaslaKVKKQLRD 1590
Cdd:PTZ00440  1413 EPSNSKEVNIIKITDNInkckqySNEAMETENKADENNDSIIKYEKEITNI---LNNssILGKKTKLE----KKKKEATN 1485
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LEAKVKDQEEELDEQAGSIQmleqaklrlememERMRQTHSKEMESRDEEV------EEARQSCQKKLKQMEVQLEEEYE 1664
Cdd:PTZ00440  1486 IMDDINGEHSIIKTKLTKSS-------------EKLNQLNEQPNIKREGDVlnndksTIAYETIQYNLGRVKHNLLNILN 1552
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 DKQKA---LREKRELESKLSTLSDQVNQRDFES-EKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEES- 1739
Cdd:PTZ00440  1553 IKDEIetiLNKAQDLMRDISKISKIVENKNLENlNDKEADYVKYLDNILKEKQLMEAEYKKLNEIYSDVDNIEKELKKHk 1632
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1740 --------EFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTAL-------EEQLSRLQREKNEIQNRLEEDQEDMNELMK 1804
Cdd:PTZ00440  1633 knyeigllEKVIEINKNIKLYMDSTKESLNSLVNNFSSLFNNFylnkyniNENLEKYKKKLNEIYNEFMESYNIIQEKMK 1712
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1805 KhkaavaqASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsMVDKSLVSRQEAKIRELETRLEFEKTQVKR 1884
Cdd:PTZ00440  1713 E-------VSNDDVDYNEAKTLREEAQKEEVNLNNKEEEAKKYLNDIKK-QESFRFILYMKEKLDELSKMCKQQYNIVDE 1784
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1885 LENLASRLKETMEKLTEErDQRAAAENREKEQNKRLQRQLRDT-KEE-------------------MSELARKE--AEAS 1942
Cdd:PTZ00440  1785 GYNYIKKKIEYIKTLNDE-NNLSDSLNQAEDKNKEVANLTHYTnKNEaknllghvvksanfigikiMTGLQPTEltPDAS 1863
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1943 RKK-------HELEMDLESLEAANQSLQAD---------------LKLAF---KRIGDLQAAIEDEMESDENEDLINSLQ 1997
Cdd:PTZ00440  1864 LETapeltfeSENNSDLELDHLSSNKNELDvykniqdayksslqiLKYSDdidKKQRDCNKLVEDGNEIYLKSTAINELK 1943
                          810       820       830
                   ....*....|....*....|....*....|.
gi 1907082211 1998 DMVTKYQKKKNKLEGDSDVDSELEDRVDGVK 2028
Cdd:PTZ00440  1944 NMINSVKNKESAISNKIDNVSNKLSELNKIT 1974
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1649-2023 1.16e-07

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 57.27  E-value: 1.16e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQMEVQ-LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLR--KDLKRTKALLadaQIMLDHLKNNAPS 1725
Cdd:COG5185    165 FGKLTQELNQnLKKLEIFGLTLGLLKGISELKKAEPSGTVNSIKESETGNLGseSTLLEKAKEI---INIEEALKGFQDP 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1726 KREIAQLKNQLEESEFtcaaAVKARKamEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE-----DQEDMN 1800
Cdd:COG5185    242 ESELEDLAQTSDKLEK----LVEQNT--DLRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSidikkATESLE 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1801 ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqSMVDKSLVSRQEAKIRELETRLEFEKT 1880
Cdd:COG5185    316 EQLAAAEAEQELEESKRETETGIQNLTAEIEQGQESLTENLEAIKEEIE----NIVGEVELSKSSEELDSFKDTIESTKE 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1881 ----QVKRLENLASRLKETMEKLTEERDQraaaenrekeQNKRLQRQLRDTKEEMSELARK--EAEASRKKHELEMDLES 1954
Cdd:COG5185    392 sldeIPQNQRGYAQEILATLEDTLKAADR----------QIEELQRQIEQATSSNEEVSKLlnELISELNKVMREADEES 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1955 LE-------AANQSLQADLKLAFKRIGDLQAAI-----EDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELED 2022
Cdd:COG5185    462 QSrleeaydEINRSVRSKKEDLNEELTQIESRVstlkaTLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILAL 541

                   .
gi 1907082211 2023 R 2023
Cdd:COG5185    542 E 542
PDZ_tamalin_CYTIP-like cd06713
PDZ domain of tamalin, cytohesin-1-interacting protein (CYTIP), and related domains; PDZ ...
223-307 1.22e-07

PDZ domain of tamalin, cytohesin-1-interacting protein (CYTIP), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of tamalin, cytohesin-1-interacting protein, and related domains. Tamalin (trafficking regulator and scaffold protein tamalin, also known as general receptor for phosphoinositides 1-associated scaffold protein, GRASP) functions to link receptors, including group 1 metabotropic glutamate receptors (mGluRs), to neuronal proteins. The tamalin PDZ domain binds the C-terminal domains of group I mGluRs; it also binds potassium/sodium hyperpolarization-activated cyclic nucleotide-gated channel 2 (HCN2), neurotrophin-3 (NT3) TrkCT1-truncated receptor, SAP90/PSD-95-associated protein, and tamalin itself. CYTIP (cytohesin-1-interacting protein, also known as Pleckstrin homology Sec7 and coiled-coil domain-binding protein) sequesters cytohesin-1 in the cytoplasm, limiting its interaction with beta2 integrins; cytohesin-1 binds the CYTIP coiled coil domain. The CYTIP PDZ domain can bind the C-terminal peptide of protocadherin alpha-1 (PCDHA1), indicating a possible interaction between the two. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This tamalin-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467197 [Multi-domain]  Cd Length: 91  Bit Score: 51.47  E-value: 1.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  223 LQRRPTGDFGF-------------SLRRTTMLDRapegqayrrvVHFAEPGagtkDLAlGLVPGDRLVEINGQNVENKSR 289
Cdd:cd06713      8 LEKQDNETFGFeiqtyglhhknsnEVEMCTYVCR----------VHEDSPA----YLA-GLTAGDVILSVNGVSVEGASH 72
                           90
                   ....*....|....*...
gi 1907082211  290 DEIVEMIRQSGDSVRLKV 307
Cdd:cd06713     73 QEIVELIRSSGNTLRLET 90
PDZ_SYNJ2BP-like cd06709
PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 ...
220-308 1.41e-07

PDZ domain of synaptojanin-2-binding protein (SYNJ2BP), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SYNJ2BP, and related domains. SYNJ2BP (also known as mitochondrial outer membrane protein 25, OMP25) regulates endocytosis of activin type 2 receptor kinases through the Ral/RALBP1-dependent pathway and may be involved in suppression of activin-induced signal transduction. Binding partners of the SYNJ2BP PDZ domain include activin type II receptors (ActR-II), and SYNJ2. SYNJ2BP interacts with the PDZ binding motif of the Notch Delta-like ligand 1 (DLL1) and DLL4, promoting Delta-Notch signaling, and inhibiting sprouting angiogenesis. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SYNJ2BP-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467193 [Multi-domain]  Cd Length: 86  Bit Score: 51.14  E-value: 1.41e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGdFGFSLRRTTMLDRAPEGQA-YrrVVHFAEPGAGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMIR 297
Cdd:cd06709      2 EITLKRGPSG-LGFNIVGGTDQPYIPNDSGiY--VAKIKEDGAAAID---GrLQEGDKILEINGQSLENLTHQDAVELFR 75
                           90
                   ....*....|.
gi 1907082211  298 QSGDSVRLKVQ 308
Cdd:cd06709     76 NAGEDVKLKVQ 86
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1297-1794 1.55e-07

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 57.14  E-value: 1.55e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNE-------LRLSSDRLETRISELTSELTderntgesasqlldaetaerlRTEKE 1369
Cdd:pfam10174  246 ERNIRDLEDEVQMLKTNGLLHTEDREEeikqmevYKSHSKFMKNKIDQLKQELS---------------------KKESE 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1370 MKELQTQYDALK------KQMEVMEMEVMEARLIRAAEINGEVDdddaggEWRLKYERAVREVDFTKKRLQQELEDK--- 1440
Cdd:pfam10174  305 LLALQTKLETLTnqnsdcKQHIEVLKESLTAKEQRAAILQTEVD------ALRLRLEEKESFLNKKTKQLQDLTEEKstl 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1441 ----------MEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHL--------EGQQV-----RN 1497
Cdd:pfam10174  379 ageirdlkdmLDVKERKINVLQKKIENLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALttleealsEKERIierlkEQ 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1498 HELEKKQRRFDSELSQAHEETQREK---LQREKLQREKDM--LLAEAFSLKQQMEEKD-----LDIAGFTQK--VVSLEA 1565
Cdd:pfam10174  459 REREDRERLEELESLKKENKDLKEKvsaLQPELTEKESSLidLKEHASSLASSGLKKDsklksLEIAVEQKKeeCSKLEN 538
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1566 ELQDISSQESKDEASlAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAkLRlEMEMERmrqtHSKEMESRDEEVEEAR 1645
Cdd:pfam10174  539 QLKKAHNAEEAVRTN-PEINDRIRLLEQEVARYKEESGKAQAEVERLLGI-LR-EVENEK----NDKDKKIAELESLTLR 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1646 QSCQKKLKQMEVQLEEEyEDKQKALREKRELESKLSTLSDQVNQRDFE----SEKRLRKDLKRTKALLADAQIMLD---- 1717
Cdd:pfam10174  612 QMKEQNKKVANIKHGQQ-EMKKKGAQLLEEARRREDNLADNSQQLQLEelmgALEKTRQELDATKARLSSTQQSLAekdg 690
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1718 HLKNNAPSKREiaQLKNQLEESEFTCAAAVKARKAmevemeDLHLQIDDIAKAKTALEEQLSrLQREKNEIQNRLEE 1794
Cdd:pfam10174  691 HLTNLRAERRK--QLEEILEMKQEALLAAISEKDA------NIALLELSSSKKKKTQEEVMA-LKREKDRLVHQLKQ 758
46 PHA02562
endonuclease subunit; Provisional
1565-1748 2.25e-07

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 56.18  E-value: 2.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1565 AELQDISSQESKDEASLakvKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMER------MRQTH------SK 1632
Cdd:PHA02562   216 ARKQNKYDELVEEAKTI---KAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQfqkvikMYEKGgvcptcTQ 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1633 EMESRDEEVEEARQS---CQKKLKQMEVQLEEEYE---DKQKALREKRELESKLSTLsDQVNQRDFESEKRLRKDLKRTK 1706
Cdd:PHA02562   293 QISEGPDRITKIKDKlkeLQHSLEKLDTAIDELEEimdEFNEQSKKLLELKNKISTN-KQSLITLVDKAKKVKAAIEELQ 371
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1907082211 1707 ALLADaqimldhlknnapSKREIAQLKNQLEESEFTCAAAVK 1748
Cdd:PHA02562   372 AEFVD-------------NAEELAKLQDELDKIVKTKSELVK 400
mukB PRK04863
chromosome partition protein MukB;
1578-1940 2.35e-07

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 56.50  E-value: 2.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1578 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQ----AKLRLEMEMERMRQThskemesrdeeveEARQSCQKKLK 1653
Cdd:PRK04863   292 RRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQdyqaASDHLNLVQTALRQQ-------------EKIERYQADLE 358
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1654 QMEVQLEEEYEDKQKALREKRELESKLSTLSDQVnqrdfesekrlrkdlKRTKALLADAQIMLDHLKnnapsKREIA--Q 1731
Cdd:PRK04863   359 ELEERLEEQNEVVEEADEQQEENEARAEAAEEEV---------------DELKSQLADYQQALDVQQ-----TRAIQyqQ 418
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1732 LKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQreknEIQNRLEEdqedmneLMKKHKAAVA 1811
Cdd:PRK04863   419 AVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQ----AAHSQFEQ-------AYQLVRKIAG 487
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1812 QASRDMAQmNDLQAQIEESNKEKQELQeKLQALQSQVEFLEQsmvDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASR 1891
Cdd:PRK04863   488 EVSRSEAW-DVARELLRRLREQRHLAE-QLQQLRMRLSELEQ---RLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211 1892 LKETMEKLTEErdQRAAAENREkeqnkRLQRQLRDTKEEMSELARKEAE 1940
Cdd:PRK04863   563 LEARLESLSES--VSEARERRM-----ALRQQLEQLQARIQRLAARAPA 604
PDZ2_L-delphilin-like cd06744
PDZ domain 2 of delphilin (L-delphilin isoform), and related domains; PDZ (PSD-95 ...
224-308 3.23e-07

PDZ domain 2 of delphilin (L-delphilin isoform), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of delphilin (also known as glutamate receptor, ionotropic, delta 2-interacting protein 1, L-delphilin). Delphilin, a postsynaptic protein which it is selectively expressed at cerebellar Purkinje cells, links the glutamate receptor delta 2 subunit (GluRdelta2) with the actin cytoskeleton and various signaling molecules. Two alternatively spliced isoforms of delphilin have been characterized: L-delphilin has two PDZ domains, PDZ1 and PDZ2, and S-delphilin has a single PDZ domain (PDZ2). These two isoforms are differently palmitoylated and may be involved in controlling GluRdelta2 signaling in Purkinje cells. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This delphilin-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467226 [Multi-domain]  Cd Length: 75  Bit Score: 49.58  E-value: 3.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  224 QRRPTGDFGFSLRrttmlDRAPegqAYRRVVhfaEPGaGTKDLAlGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSV 303
Cdd:cd06744      4 VYRGNGSFGFTLR-----GHAP---VYIESV---DPG-SAAERA-GLKPGDRILFLNGLDVRNCSHDKVVSLLQGSGSMP 70

                   ....*
gi 1907082211  304 RLKVQ 308
Cdd:cd06744     71 TLVVE 75
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1436-1811 3.80e-07

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 54.91  E-value: 3.80e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1436 ELEDKMEVEQQSRRQLERRLGDLQADSDESQRA----LQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSEL 1511
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKalfeLDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1512 SQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEkdldiagFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDL 1591
Cdd:COG4372     83 EELNEQLQAAQAELAQAQEELESLQEEAEELQEELEE-------LQKERQDLEQQRKQLEAQIAELQSEIAEREEELKEL 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1592 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEearqscqKKLKQMEVQLEEEYEDKQKALR 1671
Cdd:COG4372    156 EEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAE-------KLIESLPRELAEELLEAKDSLE 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1672 EKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARK 1751
Cdd:COG4372    229 AKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALS 308
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1752 AMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA 1811
Cdd:COG4372    309 LIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVA 368
PDZ4_GRIP1-2-like cd06686
PDZ domain 4 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
220-308 4.50e-07

PDZ domain 4 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467174 [Multi-domain]  Cd Length: 99  Bit Score: 50.04  E-value: 4.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLR----RTTMLDRAPegqayrrVVHFAEPGAGTKDLALgLVPGDRLVEINGQNVENKSRDEIVEM 295
Cdd:cd06686      9 EVILRGDPLKGFGIQLQggvfATETLSSPP-------LISFIEPDSPAERCGV-LQVGDRVLSINGIPTEDRTLEEANQL 80
                           90
                   ....*....|...
gi 1907082211  296 IRQSGDSVRLKVQ 308
Cdd:cd06686     81 LRDSASKVTLEIE 93
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1727-2011 4.85e-07

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 55.51  E-value: 4.85e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1727 REIAQlKNQLEESEFTCAAAVKARKAMEVEMEDLHLQiddiakaktaLEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1806
Cdd:pfam17380  310 REVER-RRKLEEAEKARQAEMDRQAAIYAEQERMAME----------RERELERIRQEERKRELERIRQEEIAMEISRMR 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 KAAVAQASRdmaQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLE 1886
Cdd:pfam17380  379 ELERLQMER---QQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEE 455
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1887 NLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKE-EMSELARKEAEASRKKHELEMDLESLEAANQSLQAd 1965
Cdd:pfam17380  456 QERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEkELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER- 534
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1966 lklafKRIGDLQAAIEDEMESDENedlinsLQDMVTKYQKKKNKLE 2011
Cdd:pfam17380  535 -----RREAEEERRKQQEMEERRR------IQEQMRKATEERSRLE 569
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1292-1985 4.86e-07

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 55.73  E-value: 4.86e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNEL------------RLSSDRLETRISELTSELTDERNTGESASQllDAE 1359
Cdd:COG3096    376 QLAEAEARLEAAEEEVDSLKSQLADYQQALDVQqtraiqyqqavqALEKARALCGLPDLTPENAEDYLAAFRAKE--QQA 453
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1360 TAERLRTEKEM---KELQTQYDAlkkqmevmemevmEARLIRAaeINGEVDDDDAggewrlkYERAvREVdftkkrLQQE 1436
Cdd:COG3096    454 TEEVLELEQKLsvaDAARRQFEK-------------AYELVCK--IAGEVERSQA-------WQTA-REL------LRRY 504
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1437 LEDKMEVEQQSrrQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhlegqqvrNHELEKKQRRFDSELSQAHE 1516
Cdd:COG3096    505 RSQQALAQRLQ--QLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDA-----------AEELEELLAELEAQLEELEE 571
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1517 ETQREKLQREKLQREKDMLLAEAFSLKQQmeekdldiAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK 1596
Cdd:COG3096    572 QAAEAVEQRSELRQQLEQLRARIKELAAR--------APAWLAAQDALERLREQSGEALADSQEVTAAMQQLLEREREAT 643
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1597 DQEEEldeqagsiqmLEQAKLRLEMEMERMRQTHSkemesrdeeVEEARqscqkkLKQME-----VQLEEEYEDkqKALR 1671
Cdd:COG3096    644 VERDE----------LAARKQALESQIERLSQPGG---------AEDPR------LLALAerlggVLLSEIYDD--VTLE 696
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1672 EKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKRE-----IAQLKN-QLEESEFTcAA 1745
Cdd:COG3096    697 DAPYFSALYGPARHAIVVPDLSAVKEQLAGLEDCPEDLYLIEGDPDSFDDSVFDAEEledavVVKLSDrQWRYSRFP-EV 775
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1746 AVKARKAMEVEMEDLHLQIDDIAK--AKTA--------LEEQLSR--------------------LQREKNEIQNRLEED 1795
Cdd:COG3096    776 PLFGRAAREKRLEELRAERDELAEqyAKASfdvqklqrLHQAFSQfvgghlavafapdpeaelaaLRQRRSELERELAQH 855
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1796 QEDmnelMKKHKAAVAQASRDMAQMNDLQAQIEESNKEkqELQEKLQALQSQVEFLEQsmvDKSLVSRQEAKIRELETRL 1875
Cdd:COG3096    856 RAQ----EQQLRQQLDQLKEQLQLLNKLLPQANLLADE--TLADRLEELREELDAAQE---AQAFIQQHGKALAQLEPLV 926
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1876 ----------EFEKTQVKRLENLASRLKETMEKLTEERDQRAA----------AENRekEQNKRLQRQLRDTKEEMSElA 1935
Cdd:COG3096    927 avlqsdpeqfEQLQADYLQAKEQQRRLKQQIFALSEVVQRRPHfsyedavgllGENS--DLNEKLRARLEQAEEARRE-A 1003
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1936 RKEAEASRKKHE----LEMDLES-LEAANQSLQAdlklAFKRIGDLQAAIEDEME 1985
Cdd:COG3096   1004 REQLRQAQAQYSqynqVLASLKSsRDAKQQTLQE----LEQELEELGVQADAEAE 1054
PLN03229 PLN03229
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
1617-1967 5.54e-07

acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional


Pssm-ID: 178768 [Multi-domain]  Cd Length: 762  Bit Score: 55.25  E-value: 5.54e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1617 LRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQmevQLEEEYEDKQKALrekrELESKLSTLSDqvnqrdfESEK 1696
Cdd:PLN03229   432 RELEGEVEKLKEQILKAKESSSKPSELALNEMIEKLKK---EIDLEYTEAVIAM----GLQERLENLRE-------EFSK 497
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1697 RLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEeseftcaaavkarkamevemedlhlQIDDIAKAKTALEE 1776
Cdd:PLN03229   498 ANSQDQLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLD-------------------------MLNEFSRAKALSEK 552
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1777 QlSRLQREKNEIQNRLEE--DQEDMNELMKKHKAAVAQ--ASRDMAQMNDLQAQIEESNKEKQ-ELQEKLQALQSQVEFL 1851
Cdd:PLN03229   553 K-SKAEKLKAEINKKFKEvmDRPEIKEKMEALKAEVASsgASSGDELDDDLKEKVEKMKKEIElELAGVLKSMGLEVIGV 631
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1852 EQsmvdKSLVSRQEAKIRELETRLEFEKTQV-KRLENL--ASRLKETMEKLTEERDQRAAAEN-REKEQNKRLQRQLRDT 1927
Cdd:PLN03229   632 TK----KNKDTAEQTPPPNLQEKIESLNEEInKKIERVirSSDLKSKIELLKLEVAKASKTPDvTEKEKIEALEQQIKQK 707
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1907082211 1928 -KEEMSELARKEaeasrKKHELEMDLESLEAANQSLQADLK 1967
Cdd:PLN03229   708 iAEALNSSELKE-----KFEELEAELAAARETAAESNGSLK 743
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1634-1847 5.90e-07

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 55.02  E-value: 5.90e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1634 MESRDEEVEEARQSCQKKLKQMEVQLEEeyedKQKALREKRElESKLSTLSDQVNQRDfESEKRLRKDLKRTKALLADAQ 1713
Cdd:COG3206    166 LELRREEARKALEFLEEQLPELRKELEE----AEAALEEFRQ-KNGLVDLSEEAKLLL-QQLSELESQLAEARAELAEAE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1714 IMLDHLKNNAPSKRE----------IAQLKNQLEESEFTCAAAVK-------ARKAMEVEMEDLHLQIDD-IAKAKTALE 1775
Cdd:COG3206    240 ARLAALRAQLGSGPDalpellqspvIQQLRAQLAELEAELAELSArytpnhpDVIALRAQIAALRAQLQQeAQRILASLE 319
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1776 EQLSRLQREKNEIQNRLEEdqedmnelmkkHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1847
Cdd:COG3206    320 AELEALQAREASLQAQLAQ-----------LEARLAELPELEAELRRLEREVEVARELYESLLQRLEEARLA 380
PDZ2-PTPN13_FRMPD2-like cd06792
PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and ...
273-308 6.43e-07

PDZ domain 2 of tyrosine kinase PTPN13, FERM and PDZ domain-containing protein 2 (FRMPD2), and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of human PTPN13, and related domains. PTPN13, also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1), negatively regulates FAS-mediated apoptosis and NGFR-mediated pro-apoptotic signaling, and may also regulate phosphoinositide 3-kinase (PI3K) signaling. It contains 5 PDZ domains; interaction partners of its second PDZ domain (PDZ2) include the Fas receptor (TNFRSF6) and thyroid receptor-interacting protein 6 (TRIP6). The second PDZ (PDZ2) domain, but not PDZ1 or PDZ3, of FRMPD2 binds to GluN2A and GluN2B, two subunits of N-methyl-d-aspartic acid (NMDA) receptors. Other binding partners of the FRMPDZ2 PDZ2 domain include NOD2, and catenin family members, delta catenin (CTNND2), armadillo repeat gene deleted in velo-cardio-facial syndrome (ARVCF) and p0071 (also known as plakophilin 4; PKP4). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467254 [Multi-domain]  Cd Length: 87  Bit Score: 49.13  E-value: 6.43e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907082211  273 GDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06792     51 GDRLLEVNGVSLEGVTHKQAVECLKNAGQVVTLVLE 86
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1414-2036 6.78e-07

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 55.44  E-value: 6.78e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1414 EWRLK------YERAVREvDFTK----KRLQQELEDKMEVEQQSRRQLERRLGDL------------------------- 1458
Cdd:TIGR01612  531 DQNIKaklykeIEAGLKE-SYELaknwKKLIHEIKKELEEENEDSIHLEKEIKDLfdkyleiddeiiyinklklelkeki 609
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1459 --QADSDESQRALQQLKKKCQRLTA---ELQDTKLHLEGQQVRNHEleKKQRRFDSELSQAHEEtQREKLQRE--KLQRE 1531
Cdd:TIGR01612  610 knISDKNEYIKKAIDLKKIIENNNAyidELAKISPYQVPEHLKNKD--KIYSTIKSELSKIYED-DIDALYNElsSIVKE 686
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1532 KDMLLAEAfslKQQMEEKDLDIAGFTQKVVSLEAE-----LQDISSQESKDEASLAKVKKQL-----RDLEAKVKDQEEE 1601
Cdd:TIGR01612  687 NAIDNTED---KAKLDDLKSKIDKEYDKIQNMETAtvelhLSNIENKKNELLDIIVEIKKHIhgeinKDLNKILEDFKNK 763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSIQMLEQAKLRLEM---EMERMRQTHSKEMESRDEEVEEARQSCQKK---LKQMEVQLEEEYEDKQKALREKRE 1675
Cdd:TIGR01612  764 EKELSNKINDYAKEKDELNKyksKISEIKNHYNDQINIDNIKDEDAKQNYDKSkeyIKTISIKEDEIFKIINEMKFMKDD 843
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1676 LESKLSTLSDQVN--QRDFESEKRLRKDL-KRTKALLADAQIMlDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKA 1752
Cdd:TIGR01612  844 FLNKVDKFINFENncKEKIDSEHEQFAELtNKIKAEISDDKLN-DYEKKFNDSKSLINEINKSIEEEYQNINTLKKVDEY 922
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1753 MEVeMEDLHLQIDDIAKAKTALEEQLSR---LQREKNEIQ----NRLEEDQED-MNELMKKHKAAV-----AQASRDMAQ 1819
Cdd:TIGR01612  923 IKI-CENTKESIEKFHNKQNILKEILNKnidTIKESNLIEksykDKFDNTLIDkINELDKAFKDASlndyeAKNNELIKY 1001
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1820 MNDLQA------------QIEESNKEKQELQEKLQALQSQVEFLEQSmVDKSLVSRQEAKIRELETRLEFEKTQV-KRLE 1886
Cdd:TIGR01612 1002 FNDLKAnlgknkenmlyhQFDEKEKATNDIEQKIEDANKNIPNIEIA-IHTSIYNIIDEIEKEIGKNIELLNKEIlEEAE 1080
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1887 NLASRLKETMEKLT---------EERDQRAAAENREKEQNKRLQRQLRDTKEEMSELarkeaeasRKKHELEMDleslEA 1957
Cdd:TIGR01612 1081 INITNFNEIKEKLKhynfddfgkEENIKYADEINKIKDDIKNLDQKIDHHIKALEEI--------KKKSENYID----EI 1148
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1958 ANQslqadlklafkrIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSDVDSELEdRVDGVKSWLSKNKG 2036
Cdd:TIGR01612 1149 KAQ------------INDLEDVADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIA-EIEKDKTSLEEVKG 1214
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1445-1778 7.13e-07

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 54.14  E-value: 7.13e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1445 QQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQ 1524
Cdd:COG4372     30 SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEE 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLR-----DLEAKVKDQE 1599
Cdd:COG4372    110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQalseaEAEQALDELL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1600 EELDEQAGSIQMLEQAKLRLEMEMERMRQTHS--KEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELE 1677
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLeaKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILV 269
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1678 SKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEM 1757
Cdd:COG4372    270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVG 349
                          330       340
                   ....*....|....*....|.
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQL 1778
Cdd:COG4372    350 LLDNDVLELLSKGAEAGVADG 370
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1558-1843 1.04e-06

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 52.99  E-value: 1.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1558 QKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQmleqaKLRLEMemermrQTHSKEMESR 1637
Cdd:COG1340      1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVK-----ELREEA------QELREKRDEL 69
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1638 DEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQVNQ---------RDFESEKRLRKDLKRTKAL 1708
Cdd:COG1340     70 NEKVKELKEERDELNEKLN-ELREELDELRKELAELNKAGGSIDKLRKEIERlewrqqtevLSPEEEKELVEKIKELEKE 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1709 LADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI 1788
Cdd:COG1340    149 LEKAKKALEKNEKLKELRAELKELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEADELHKEIVEAQEKADEL 228
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1789 QNRLEEDQEDMNELMKKHKAAvaqasRDMAQMNDLQAQIEESNKEKQELQEKLQA 1843
Cdd:COG1340    229 HEEIIELQKELRELRKELKKL-----RKKQRALKREKEKEELEEKAEEIFEKLKK 278
PDZ2_DLG5-like cd06765
PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
270-308 1.04e-06

PDZ domain 2 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PSZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467246 [Multi-domain]  Cd Length: 77  Bit Score: 48.50  E-value: 1.04e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907082211  270 LVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06765     35 LTVGDRIIAINGIALDNKSLSECEALLRSCRDSLSLSLM 73
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1510-1700 1.05e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 52.23  E-value: 1.05e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQlR 1589
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN-K 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1590 DLEAKVKdqeeELDEQAGSIQMLEQAKLRLEMEMERMRqthskemesrdEEVEEArqscQKKLKQMEVQLEE---EYEDK 1666
Cdd:COG1579     90 EYEALQK----EIESLKRRISDLEDEILELMERIEELE-----------EELAEL----EAELAELEAELEEkkaELDEE 150
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1907082211 1667 QKALREKRE-LESKLSTLSDQVNQRDFESEKRLRK 1700
Cdd:COG1579    151 LAELEAELEeLEAEREELAAKIPPELLALYERIRK 185
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1298-1723 1.13e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 54.28  E-value: 1.13e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESAsqlldaeTAERLRTEKEMKELQTQY 1377
Cdd:PRK02224   349 EDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDA-------PVDLGNAEDFLEELREER 421
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1378 DALKKQMEVMEMEVMEAR-LIRAAEingevddddaggewRLKYERAVREVDftkkrlqQELEDKMEVEQQSRRqlERRLG 1456
Cdd:PRK02224   422 DELREREAELEATLRTAReRVEEAE--------------ALLEAGKCPECG-------QPVEGSPHVETIEED--RERVE 478
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1457 DLQADSDESQRALQQLKKKCQRLTaELQDTKLHLEgqqvrnhELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLL 1536
Cdd:PRK02224   479 ELEAELEDLEEEVEEVEERLERAE-DLVEAEDRIE-------RLEERREDLEELIAERRETIEEKRERAEELRERAAELE 550
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1537 AEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESkdeaSLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAK 1616
Cdd:PRK02224   551 AEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIE----SLERIRTLLAAIADAEDEIERLREKREALAELNDERR 626
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1617 LRLEMEMERMRQTHSKEMESRdeeVEEARQScqkklKQMEVQLEEEYEDKQKALREKR-ELESKLSTLsdqvnQRDFESE 1695
Cdd:PRK02224   627 ERLAEKRERKRELEAEFDEAR---IEEARED-----KERAEEYLEQVEEKLDELREERdDLQAEIGAV-----ENELEEL 693
                          410       420
                   ....*....|....*....|....*...
gi 1907082211 1696 KRLRKDLKRtkalLADAQIMLDHLKNNA 1723
Cdd:PRK02224   694 EELRERREA----LENRVEALEALYDEA 717
mukB PRK04863
chromosome partition protein MukB;
1438-1860 1.47e-06

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 54.19  E-value: 1.47e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKkkcqRLTAelqdTKLHLEGQQVRNHELEKKQRR---FDSELSQA 1514
Cdd:PRK04863   785 EKRIEQLRAEREELAERYATLSFDVQKLQRLHQAFS----RFIG----SHLAVAFEADPEAELRQLNRRrveLERALADH 856
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1515 HEETQREKLQREKLQREKDML--LAEAFSLkqqmeekdLDIAGFTQKVVSLEAELQdissQESKDEASLAKVKKQLrdle 1592
Cdd:PRK04863   857 ESQEQQQRSQLEQAKEGLSALnrLLPRLNL--------LADETLADRVEEIREQLD----EAEEAKRFVQQHGNAL---- 920
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1593 akvkdqeEELDEQAGSIQMLEQaklrlemEMERMRQthskemesrdeEVEEARQScQKKLKQ--------MEVQLEEEYE 1664
Cdd:PRK04863   921 -------AQLEPIVSVLQSDPE-------QFEQLKQ-----------DYQQAQQT-QRDAKQqafaltevVQRRAHFSYE 974
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 DKQKALREKRELESKLSTLSDQVNQ-RDfesekRLRKDLKRTKALLADAQIMLDHLKNNAPSKREI-AQLKNQLEesEFT 1742
Cdd:PRK04863   975 DAAEMLAKNSDLNEKLRQRLEQAEQeRT-----RAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMlQELKQELQ--DLG 1047
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1743 CAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA--QASRDMAQM 1820
Cdd:PRK04863  1048 VPADSGAEERARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAgwCAVLRLVKD 1127
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|
gi 1907082211 1821 NDLqaqieesnkEKQELQEKLQALQSQVeflEQSMVDKSL 1860
Cdd:PRK04863  1128 NGV---------ERRLHRRELAYLSADE---LRSMSDKAL 1155
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1449-1605 1.54e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 51.85  E-value: 1.54e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLhlegqQVRNHELEKKQrrfdselSQAHEETQREKLQREKL 1528
Cdd:COG1579     20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEK-----EIKRLELEIEE-------VEARIKKYEEQLGNVRN 87
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1529 QREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQ 1605
Cdd:COG1579     88 NKEYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1304-1605 1.71e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 53.77  E-value: 1.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1304 DEEIQQLRSKLEKVEKERNELRLSSDRLEtrisELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQ 1383
Cdd:COG4913    660 EIDVASAEREIAELEAELERLDASSDDLA----ALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDR 735
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1384 MEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMeveqqsrRQLERRLGDLQADSD 1463
Cdd:COG4913    736 LEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAM-------RAFNREWPAETADLD 808
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1464 ESQRALQQLKKKCQRLTAElqdtklHLEGQQVRNHELEKKQ-----RRFDSELSQAHEETqreklqREKLQREKDMLLAE 1538
Cdd:COG4913    809 ADLESLPEYLALLDRLEED------GLPEYEERFKELLNENsiefvADLLSKLRRAIREI------KERIDPLNDSLKRI 876
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1539 AFS----LKQQMEE-KDLDIAGFTQkvvsleaELQDISSQESKDEASLAKVK-KQLRDLEAKVKDQEEELDEQ 1605
Cdd:COG4913    877 PFGpgryLRLEARPrPDPEVREFRQ-------ELRAVTSGASLFDEELSEARfAALKRLIERLRSEEEESDRR 942
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
1693-1992 1.95e-06

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 52.51  E-value: 1.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1693 ESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMevemedlhlqiddiAKAKT 1772
Cdd:pfam15905   60 ELKKKSQKNLKESKDQKELEKEIRALVQERGEQDKRLQALEEELEKVEAKLNAAVREKTSL--------------SASVA 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1773 ALEEQLSRLQREKNEIQNRLEED--QEDMN----ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQS 1846
Cdd:pfam15905  126 SLEKQLLELTRVNELLKAKFSEDgtQKKMSslsmELMKLRNKLEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEE 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QVEFLEQSMVDkslvsrqeakireletrlefEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRD 1926
Cdd:pfam15905  206 KLVSTEKEKIE--------------------EKSETEKLLEYITELSCVSEQVEKYKLDIAQLEELLKEKNDEIESLKQS 265
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1927 TKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKlafKRIGDLQAAIedEMESDENEDL 1992
Cdd:pfam15905  266 LEEKEQELSKQIKDLNEKCKLLESEKEELLREYEEKEQTLN---AELEELKEKL--TLEEQEHQKL 326
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1478-1843 2.58e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 52.21  E-value: 2.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1478 RLTAELQDTKLHLEGQQVRNHELEKKQrrfDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFT 1557
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAAL---SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1558 QKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMErMRQTHSKEMESR 1637
Cdd:COG4372     80 EELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIA-EREEELKELEEQ 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1638 DEEVEEARQSCQKKLKQMEVQleEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLD 1717
Cdd:COG4372    159 LESLQEELAALEQELQALSEA--EAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1718 HLKNNAPSKREIAQLKNQLEESEFTcAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQE 1797
Cdd:COG4372    237 ALLDALELEEDKEELLEEVILKEIE-ELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALED 315
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1798 DM----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQA 1843
Cdd:COG4372    316 ALlaalLELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEA 365
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1297-1966 2.61e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 53.03  E-value: 2.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1297 EEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQTQ 1376
Cdd:COG3096    514 LQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAAEELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRAR 593
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1377 YDALKKQMEVMEMEVmeARLIRAAEINGE-VDDDDAGGEWRLKYERAVREVDFTKKRLQQEledKMEVEQQSRRQLerrl 1455
Cdd:COG3096    594 IKELAARAPAWLAAQ--DALERLREQSGEaLADSQEVTAAMQQLLEREREATVERDELAAR---KQALESQIERLS---- 664
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1456 gdlQADSDESQRaLQQLkkkCQRLTAEL-----QDTKLH-------LEGQQ-----VRNHELEKKQ-------------- 1504
Cdd:COG3096    665 ---QPGGAEDPR-LLAL---AERLGGVLlseiyDDVTLEdapyfsaLYGPArhaivVPDLSAVKEQlagledcpedlyli 737
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 ----RRFDSELSQAHEETQREKLQREKLQ----------------REK--DMLLAEAFSLKQQMEEKDLDIagftQKVVS 1562
Cdd:COG3096    738 egdpDSFDDSVFDAEELEDAVVVKLSDRQwrysrfpevplfgraaREKrlEELRAERDELAEQYAKASFDV----QKLQR 813
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1563 LEAELQDISSQ------ESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAgsiQMLEQAKLRLEMEMERMRQTHSKEMES 1636
Cdd:COG3096    814 LHQAFSQFVGGhlavafAPDPEAELAALRQRRSELERELAQHRAQEQQLR---QQLDQLKEQLQLLNKLLPQANLLADET 890
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1637 RDEEVEEARqscqkklkqmevqleEEYEDKQKALREKRELESKLSTLSDQVN--QRDFESEKRLRKDLKRTKALLADAQI 1714
Cdd:COG3096    891 LADRLEELR---------------EELDAAQEAQAFIQQHGKALAQLEPLVAvlQSDPEQFEQLQADYLQAKEQQRRLKQ 955
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1715 MLDHLKNnapskreiaqlknqleeseftcaaaVKARKAmevemedlHLQIDDIAKaktaleeqlsrlqrekneiqnRLEE 1794
Cdd:COG3096    956 QIFALSE-------------------------VVQRRP--------HFSYEDAVG---------------------LLGE 981
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1795 DQeDMNELMKkHKAAVAQASRDMA--QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM------VDKSLVSRQEA 1866
Cdd:COG3096    982 NS-DLNEKLR-ARLEQAEEARREAreQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQELeelgvqADAEAEERARI 1059
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1867 KIRELETRL--------EFEKTQVKR---LENLASRLKETMEKLTEERDQRAAAenreKEQNKRLQRQLRDTKEE----M 1931
Cdd:COG3096   1060 RRDELHEELsqnrsrrsQLEKQLTRCeaeMDSLQKRLRKAERDYKQEREQVVQA----KAGWCAVLRLARDNDVErrlhR 1135
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1907082211 1932 SELARKEAEASRkkhelEMDLESLEAANQSlQADL 1966
Cdd:COG3096   1136 RELAYLSADELR-----SMSDKALGALRLA-VADN 1164
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1519-1709 3.25e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 52.71  E-value: 3.25e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1519 QREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQK--VVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVK 1596
Cdd:COG3206    164 QNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRQKngLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLA 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1597 DQEEELDE---------QAGSIQMLEQAKLRLEMEMERMRQTHSKE---MESRDEEVEEARQSCQKKLKQMEVQLEEEYE 1664
Cdd:COG3206    244 ALRAQLGSgpdalpellQSPVIQQLRAQLAELEAELAELSARYTPNhpdVIALRAQIAALRAQLQQEAQRILASLEAELE 323
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1665 DKQKALREKRELESKLSTLSDQVNQRDFESEkRLRKDLKRTKALL 1709
Cdd:COG3206    324 ALQAREASLQAQLAQLEARLAELPELEAELR-RLEREVEVARELY 367
WEMBL pfam05701
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ...
1448-1960 3.46e-06

Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".


Pssm-ID: 461718 [Multi-domain]  Cd Length: 562  Bit Score: 52.34  E-value: 3.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1448 RRQLERRLGDLQADSDESQRALQ-------QLKKKCQRLTAELQDTKLHLEGQQVrnhelEKKQRRFDSELSQ------- 1513
Cdd:pfam05701   37 RKLVELELEKVQEEIPEYKKQSEaaeaakaQVLEELESTKRLIEELKLNLERAQT-----EEAQAKQDSELAKlrveeme 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1514 ---AHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEkdldiagfTQKVVSLEAELQDISSQESKDEASLAK-VKKQLR 1589
Cdd:pfam05701  112 qgiADEASVAAKAQLEVAKARHAAAVAELKSVKEELES--------LRKEYASLVSERDIAIKRAEEAVSASKeIEKTVE 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1590 DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKlKQMEVQLEEEYEdkqKA 1669
Cdd:pfam05701  184 ELTIELIATKESLESAHAAHLEAEEHRIGAALAREQDKLNWEKELKQAEEELQRLNQQLLSA-KDLKSKLETASA---LL 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1670 LREKREL----ESKLSTLSDQvNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIA-QLKNQLEEsEFTCA 1744
Cdd:pfam05701  260 LDLKAELaaymESKLKEEADG-EGNEKKTSTSIQAALASAKKELEEVKANIEKAKDEVNCLRVAAaSLRSELEK-EKAEL 337
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1745 AAVKARKAMEvemedlhlqiddiAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKhkaaVAQASRDMAQMNDL- 1823
Cdd:pfam05701  338 ASLRQREGMA-------------SIAVSSLEAELNRTKSEIALVQAKEKEAREKMVELPKQ----LQQAAQEAEEAKSLa 400
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1824 QAQIEESNKEKQElQEKLQALQSQVEfleqsmvdkslvSRQEAKIRELETRLEFEKT------------QVKRLENLASR 1891
Cdd:pfam05701  401 QAAREELRKAKEE-AEQAKAAASTVE------------SRLEAVLKEIEAAKASEKLalaaikalqeseSSAESTNQEDS 467
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1892 LKETMEKLTEERDQRAAAENREKEQNKRLQR---QLRDTKEEMSELARKEAEASRkkhELEMDLESLEAANQ 1960
Cdd:pfam05701  468 PRGVTLSLEEYYELSKRAHEAEELANKRVAEavsQIEEAKESELRSLEKLEEVNR---EMEERKEALKIALE 536
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1358-1606 3.47e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.07  E-value: 3.47e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1358 AETAERLRTEKEMKELQTQYDALKKQMEVMEMevmearliRAAEINGEVDDDDaggewrlkyeravREVDFTKKRLqQEL 1437
Cdd:COG4942     17 AQADAAAEAEAELEQLQQEIAELEKELAALKK--------EEKALLKQLAALE-------------RRIAALARRI-RAL 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLtAELQDTKLHLEGQQVRnhELEKKQRRFDSELSQAHEE 1517
Cdd:COG4942     75 EQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRL-GRQPPLALLLSPEDFL--DAVRRLQYLKYLAPARREQ 151
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1518 TQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKD 1597
Cdd:COG4942    152 AEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIAR 231

                   ....*....
gi 1907082211 1598 QEEELDEQA 1606
Cdd:COG4942    232 LEAEAAAAA 240
mukB PRK04863
chromosome partition protein MukB;
1296-1912 3.53e-06

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 52.65  E-value: 3.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQT 1375
Cdd:PRK04863   514 QLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQA 593
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALKKQMEVMEMEVmeARLIRAAEINGEVDDDDAGGEwrlkyeravrevDFTKKRLQQELEDKMEVEQ--QSRRQLER 1453
Cdd:PRK04863   594 RIQRLAARAPAWLAAQ--DALARLREQSGEEFEDSQDVT------------EYMQQLLERERELTVERDElaARKQALDE 659
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1454 RLGDLQADSDESQRALQQLkkkCQRLTAEL-----QDTKLH-------LEGQQ-----VRNHELEKKQ------------ 1504
Cdd:PRK04863   660 EIERLSQPGGSEDPRLNAL---AERFGGVLlseiyDDVSLEdapyfsaLYGPArhaivVPDLSDAAEQlagledcpedly 736
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 ------RRFDSELSQAhEETQREKLQRE-----------------KLQREK--DMLLAEAFSLKQQMEEKDLDIagftQK 1559
Cdd:PRK04863   737 liegdpDSFDDSVFSV-EELEKAVVVKIadrqwrysrfpevplfgRAAREKriEQLRAEREELAERYATLSFDV----QK 811
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1560 VVSLEAELQDISSQ------ESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAgsiQMLEQAKLRLEMEMERMRQTHSKE 1633
Cdd:PRK04863   812 LQRLHQAFSRFIGShlavafEADPEAELRQLNRRRVELERALADHESQEQQQR---SQLEQAKEGLSALNRLLPRLNLLA 888
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1634 MESRDEEVEEARQscqkklkqmevQLEEEYEDK------QKALrekRELESKLSTLsdQVNQRDFEsekRLRKDLKRTKA 1707
Cdd:PRK04863   889 DETLADRVEEIRE-----------QLDEAEEAKrfvqqhGNAL---AQLEPIVSVL--QSDPEQFE---QLKQDYQQAQQ 949
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1708 LLADAQIMLDHLKNnapskreiaqlknqleeseftcaaaVKARKAmevemedlHLQIDDIAK---AKTALEEQLSRlqre 1784
Cdd:PRK04863   950 TQRDAKQQAFALTE-------------------------VVQRRA--------HFSYEDAAEmlaKNSDLNEKLRQ---- 992
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1785 kneiqnRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEF--LEQSMVDKS--- 1859
Cdd:PRK04863   993 ------RLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSgaEERARARRDelh 1066
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1860 -LVSRQEAKIRELETRLEFEKTqvkRLENLASRLKETMEKLTEERDQRAAAENR 1912
Cdd:PRK04863  1067 aRLSANRSRRNQLEKQLTFCEA---EMDNLTKKLRKLERDYHEMREQVVNAKAG 1117
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1639-1947 3.84e-06

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 52.27  E-value: 3.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQSCQKKLKQMEvQLEEEYED--------KQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRtkallA 1710
Cdd:COG5185    232 EEALKGFQDPESELEDLA-QTSDKLEKlveqntdlRLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKS-----I 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1711 DAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaAAVKARKA-MEVEMEDLHLQIDDIAKAKTAL--EEQLSRLQREKNE 1787
Cdd:COG5185    306 DIKKATESLEEQLAAAEAEQELEESKRETE----TGIQNLTAeIEQGQESLTENLEAIKEEIENIvgEVELSKSSEELDS 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1788 IQNRLE----EDQEDMNELMKKHKAAVAQASRDMA----QMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVDKS 1859
Cdd:COG5185    382 FKDTIEstkeSLDEIPQNQRGYAQEILATLEDTLKaadrQIEELQRQIEQATSSNEEVSKLLNELISELNKVMREADEES 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1860 LVSRQEAK---IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELAR 1936
Cdd:COG5185    462 QSRLEEAYdeiNRSVRSKKEDLNEELTQIESRVSTLKATLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHILAL 541
                          330
                   ....*....|.
gi 1907082211 1937 KEAEASRKKHE 1947
Cdd:COG5185    542 ENLIPASELIQ 552
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1639-1957 3.88e-06

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 51.99  E-value: 3.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1639 EEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDF----------ESEKRLRKDLKRTKAL 1708
Cdd:pfam15070   18 ENLKEEGAVWQQKMQQLSEQVRTLREEKERSVSQVQELETSLAELKNQAAVPPAeeeqppagpsEEEQRLQEEAEQLQKE 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1709 LADAQIML-DHLKNNAPSKREIAQLKNQLEESEFTC---AAAVKARKAMeveMEDLHLQIDDIAKAKT---ALEEQLSRL 1781
Cdd:pfam15070   98 LEALAGQLqAQVQDNEQLSRLNQEQEQRLLELERAAerwGEQAEDRKQI---LEDMQSDRATISRALSqnrELKEQLAEL 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1782 Q-------REKNEIQNRLEEDQEDMNELMKKhkaavaqasrdMAQMN----DLQAQIEESNKEKQELQEK----LQALQS 1846
Cdd:pfam15070  175 QngfvkltNENMELTSALQSEQHVKKELAKK-----------LGQLQeelgELKETLELKSQEAQSLQEQrdqyLAHLQQ 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVK-RLENLASRLKETMEKLTeerdqrAAAenrekEQNKRLQRQLR 1925
Cdd:pfam15070  244 YVAAYQQLASEKEELHKQYLLQTQLMDRLQHEEVQGKvAAEMARQELQETQERLE------ALT-----QQNQQLQAQLS 312
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907082211 1926 DTKEEMS----ELARKEAEASRKKHELEMDLESLEA 1957
Cdd:pfam15070  313 LLANPGEgdglESEEEEEEAPRPSLSIPEDFESREA 348
MutS2 COG1193
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
1575-1690 3.92e-06

dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];


Pssm-ID: 440806 [Multi-domain]  Cd Length: 784  Bit Score: 52.45  E-value: 3.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1575 SKDEASLAKVkkqLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRqthskemESRDEEVEEARQSCQKKLKQ 1654
Cdd:COG1193    510 GEESIDVEKL---IEELERERRELEEEREEAERLREELEKLREELEEKLEELE-------EEKEEILEKAREEAEEILRE 579
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1907082211 1655 MEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQR 1690
Cdd:COG1193    580 ARKEAEElirELREAQAEEEELKEARKKLEELKQELEEK 618
PRK12704 PRK12704
phosphodiesterase; Provisional
1583-1738 4.38e-06

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 52.09  E-value: 4.38e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1583 KVKKQLRDLEAKVKDQEEELDEQAGSIQmlEQAKLRLEMEMERMRQTHSKEMESRDEEVEEArqscQKKLKQMEVQLEEE 1662
Cdd:PRK12704    28 IAEAKIKEAEEEAKRILEEAKKEAEAIK--KEALLEAKEEIHKLRNEFEKELRERRNELQKL----EKRLLQKEENLDRK 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1663 YEDKQKALREKRELESKLSTLSDQVNQRDFESEK---RLRKDLKRTKALLAD--AQIMLDHLKNNApsKREIAQLKNQLE 1737
Cdd:PRK12704   102 LELLEKREEELEKKEKELEQKQQELEKKEEELEElieEQLQELERISGLTAEeaKEILLEKVEEEA--RHEAAVLIKEIE 179

                   .
gi 1907082211 1738 E 1738
Cdd:PRK12704   180 E 180
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1578-1938 4.48e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 52.26  E-value: 4.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1578 EASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQ----AKLRLEMEMERMRQThsKEMESRDEEVEEArqscQKKLK 1653
Cdd:COG3096    291 RRELFGARRQLAEEQYRLVEMARELEELSARESDLEQdyqaASDHLNLVQTALRQQ--EKIERYQEDLEEL----TERLE 364
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1654 QMEVQLEEEYEDKQKAlREKRElesklstlsdqvnqrdfesekRLRKDLKRTKALLADAQIMLDHLKNNAPSKReiaQLK 1733
Cdd:COG3096    365 EQEEVVEEAAEQLAEA-EARLE---------------------AAEEEVDSLKSQLADYQQALDVQQTRAIQYQ---QAV 419
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1734 NQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEedqedmneLMKKHKAAV-AQ 1812
Cdd:COG3096    420 QALEKARALCGLPDLTPENAEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYE--------LVCKIAGEVeRS 491
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1813 ASRDMAQmndlqaQIEESNKEKQELQEKLQALQSQVEFLEQsmvdksLVSRQEAKIRELEtrlEFEKTQVKRLENlASRL 1892
Cdd:COG3096    492 QAWQTAR------ELLRRYRSQQALAQRLQQLRAQLAELEQ------RLRQQQNAERLLE---EFCQRIGQQLDA-AEEL 555
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1893 KETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKE 1938
Cdd:COG3096    556 EELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARA 601
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1732-1966 5.71e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 51.43  E-value: 5.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1732 LKNQLEESEFTCAAAVKARKAMEVEMEDlhlQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA 1811
Cdd:pfam07888   32 LQNRLEECLQERAELLQAQEAANRQREK---EKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSA 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1812 QASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSM------VDKSLVSR--QEAKIRELETRLEFEKTQVK 1883
Cdd:pfam07888  109 SSEELSEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELermkerAKKAGAQRkeEEAERKQLQAKLQQTEEELR 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1884 RL--------------ENLASRLKETMEKLTeerdQRAAAENREKEQNKRLQRQLRDTKEEmselarkeAEASRKKHE-L 1948
Cdd:pfam07888  189 SLskefqelrnslaqrDTQVLQLQDTITTLT----QKLTTAHRKEAENEALLEELRSLQER--------LNASERKVEgL 256
                          250
                   ....*....|....*...
gi 1907082211 1949 EMDLESLEAANQSLQADL 1966
Cdd:pfam07888  257 GEELSSMAAQRDRTQAEL 274
PDZ_Par6-like cd06718
PDZ domain of partitioning defective 6 (Par6), Drosophila Rho GTPase-activating protein 100F ...
222-297 5.79e-06

PDZ domain of partitioning defective 6 (Par6), Drosophila Rho GTPase-activating protein 100F (RhoGAP100F), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Par6 (also known as PAR6 or Par-6), RhoGAP100F, and related domains. Par6 is part of a conserved machinery that directs metazoan cell polarity, a process necessary for the function of diverse cell types. Par6 forms a cell polarity-regulatory complex with atypical protein kinase C (aPKC) and Par3. Par6 can also directly associate with PALS1 (proteins associated with Lin7, also known as Stardust) providing a link between the Par3/aPKC/Par6 complex and the PALS1-PATJ (protein-associated TJ) complex. Binding partners of the Par6-PDZ domain include Par3, PALS1/Stardust; leucine-rich repeat-containing protein netrin-G ligand-2 (NGL-2), human crumbs (CRB3) involve in the morphogenesis of the tight junctions in mammalian epithelial cells, and PAR-6 co-operates with the Par6 semi-CRIB domain to bind CDC42. CDC42 regulates the Par6 PDZ domain through an allosteric CRIB-PDZ transition. Drosophila RhoGAP100F, also known as synapse defective protein 1 homolog (syd-1 homolog), is a GTPase activator for the Rho-type GTPases by converting them to an inactive GDP-bound form. The RhoGAP100F-PDZ domain binds the neurexin C terminus to control synapse formation at the Drosophila neuromuscular junction. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par6-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467202 [Multi-domain]  Cd Length: 84  Bit Score: 46.41  E-value: 5.79e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211  222 ELQRRPTGDFGFSLRRTTMLDRAPeGQAYRRVVhfaePGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIR 297
Cdd:cd06718      4 ELIKPPGKPLGFYIRDGNGVERVP-GIFISRLV----LG-SLADSTGLLAVGDEILEVNGVEVTGKSLDDVTDMMV 73
PDZ4_PTPN13-like cd06696
PDZ domain 4 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related ...
220-307 6.08e-06

PDZ domain 4 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)] and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467182 [Multi-domain]  Cd Length: 85  Bit Score: 46.53  E-value: 6.08e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRTTMldraPEGQAYRRVVHfaEPgagtkdlALG---LVPGDRLVEINGQNVENKSRDEIVEMI 296
Cdd:cd06696      5 EVTLTKSEKGSLGFTVTKGKD----DNGCYIHDIVQ--DP-------AKSdgrLRPGDRLIMVNGVDVTNMSHTEAVSLL 71
                           90
                   ....*....|.
gi 1907082211  297 RQSGDSVRLKV 307
Cdd:cd06696     72 RAAPKEVTLVL 82
PDZ3_Dlg1-2-4-like cd06795
PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
237-312 6.48e-06

PDZ domain 3 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Drosophila Dlg1, human Dlg1, 2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197; SAP-97), Dlg2 (also known as channel-associated protein of synapse-110; postsynaptic density protein 93, PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95; synapse-associated protein 90, SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling, regulating surface expression of NMDA receptors in dorsal horn neurons of the spinal cord; it interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. The Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development; postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467257 [Multi-domain]  Cd Length: 91  Bit Score: 46.58  E-value: 6.48e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  237 RTTMLDRAPEGQAYRRV---------VHFAEPGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKV 307
Cdd:cd06795      3 RKIVLHKGSTGLGFNIVggedgegifISFILAG-GPADLSGELRRGDQILSVNGVDLRNATHEQAAAALKNAGQTVTIIA 81

                   ....*
gi 1907082211  308 QPIPE 312
Cdd:cd06795     82 QYKPE 86
PDZ_MPP-like cd06726
PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 ...
273-309 6.94e-06

PDZ domain of membrane palmitoylated proteins (MPPs), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP1-7 (also known as MAGUK p55 subfamily members 1-7), and related domains. MPPs comprise a subfamily of a larger group of multidomain proteins, namely, membrane-associated guanylate kinases (MAGUKs). MPPs form diverse protein complexes at the cell membranes, which are involved in a wide range of cellular processes, including establishing proper cell structure, polarity and cell adhesion. MPPs have only one PDZ domain. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467208 [Multi-domain]  Cd Length: 80  Bit Score: 46.10  E-value: 6.94e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907082211  273 GDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd06726     44 GDEILEINGIPVSGKTVDELQKLLSSLSGSVTFKLIP 80
MYSc_Myo33 cd14894
class myosin, motor domain; Class XXXIII myosins have variable numbers of IQ domain and 2 ...
586-865 7.11e-06

class myosin, motor domain; Class XXXIII myosins have variable numbers of IQ domain and 2 tandem ANK repeats that are separated by a PH domain. The myosin classes XXX to XXXIV contain members from Phytophthora species and Hyaloperonospora parasitica. The catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle. CyMoBase classifications were used to confirm and identify the myosins in this hierarchy.


Pssm-ID: 276859 [Multi-domain]  Cd Length: 871  Bit Score: 51.67  E-value: 7.11e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  586 LLEAFGNSPTIMNGSATRFSQILSLDF-----DQAGQVASASIQTMLLEKLRVARRPASEA------TFNVFYYLLACGD 654
Cdd:cd14894    255 VLEAFGHATTSMNLNSSRFGKMTTLQVafglhPWEFQICGCHISPFLLEKSRVTSERGRESgdqnelNFHILYAMVAGVN 334
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  655 A-----TLRTELHLNHL----------AENNVFGIVplSKPEEKQKAAQQFSKLQAAMKVLAISPEEQKTCWLILASIYH 719
Cdd:cd14894    335 AfpfmrLLAKELHLDGIdcsaltylgrSDHKLAGFV--SKEDTWKKDVERWQQVIDGLDELNVSPDEQKTIFKVLSAVLW 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  720 LGaaGATKEAAEAGRKQFARHEWAQKAAYLLgCSLEELSSAifkHQLKGGTLQRSTSFRQGPE--ESGLGEGtklSALEC 797
Cdd:cd14894    413 LG--NIELDYREVSGKLVMSSTGALNAPQKV-VELLELGSV---EKLERMLMTKSVSLQSTSEtfEVTLEKG---QVNHV 483
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  798 LEGMASGLYSELFTLLISLVNRALKSS-------QHSLCS----------MMIVDTPGFQNpewggsARGASFEELCHNY 860
Cdd:cd14894    484 RDTLARLLYQLAFNYVVFVMNEATKMSalstdgnKHQMDSnasapeavslLKIVDVFGFED------LTHNSLDQLCINY 557

                   ....*
gi 1907082211  861 AQDRL 865
Cdd:cd14894    558 LSEKL 562
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1434-1933 7.24e-06

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 51.45  E-value: 7.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1434 QQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhleGQQVRNHELEKKQRrfDSELSQ 1513
Cdd:pfam15964  219 RLELEKLKLLYEAKTEVLESQVKSLRKDLAESQKTCEDLKERLKHKESLVAAST----SSRVGGLCLKCAQH--EAVLAQ 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1514 AHEETQREKLQRekLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSleaelqdisSQESKDEASLAKVKK--QLRDL 1591
Cdd:pfam15964  293 THTNVHMQTIER--LTKERDDLMSALVSVRSSLAEAQQRESSAYEQVKQ---------AVQMTEEANFEKTKAliQCEQL 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1592 EAKVKDQEEELDEQAGSiqmlEQAKLRLEMEMERmrqthsKEMESRDEEVEEARQSCQKKLKQMEVQLEeeyedkqKALR 1671
Cdd:pfam15964  362 KSELERQKERLEKELAS----QQEKRAQEKEALR------KEMKKEREELGATMLALSQNVAQLEAQVE-------KVTR 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1672 EKRELESKLSTLSDQVNQRDFESEK---RLRKDLKRTKALLADAQimLDHLKNNAPSKR-------EIAQLKNQLEESef 1741
Cdd:pfam15964  425 EKNSLVSQLEEAQKQLASQEMDVTKvcgEMRYQLNQTKMKKDEAE--KEHREYRTKTGRqleikdqEIEKLGLELSES-- 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1742 tcaaavkaRKAMEVEMEDLHLQIDDIAKaktaLEEQLSRLQREKNeiqnrleedqedmneLMKKHKAAVAQASRDMAQMN 1821
Cdd:pfam15964  501 --------KQRLEQAQQDAARAREECLK----LTELLGESEHQLH---------------LTRLEKESIQQSFSNEAKAQ 553
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1822 DLQAQIEEsnkekQELQEKLQALQSQvefleqsmvdkslvsrQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTE 1901
Cdd:pfam15964  554 ALQAQQRE-----QELTQKMQQMEAQ----------------HDKTVNEQYSLLTSQNTFIAKLKEECCTLAKKLEEITQ 612
                          490       500       510
                   ....*....|....*....|....*....|..
gi 1907082211 1902 ERDQRAAAENREKEQNKRLQRQLRDTKEEMSE 1933
Cdd:pfam15964  613 KSRSEVEQLSQEKEYLQDRLEKLQKRNEELEE 644
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1513-1739 7.33e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 50.98  E-value: 7.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1513 QAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldiagfTQKVVSLEAELQDIssqeskdEASLAKVKKQLRDLE 1592
Cdd:COG3883     13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEEL-------NEEYNELQAELEAL-------QAEIDKLQAEIAEAE 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1593 AKVKDQEEELDEQAGSIQMLEQAKLRLEM---------------EMERMRQTHSKEMesrdEEVEEARQSCQKKLKQMEV 1657
Cdd:COG3883     79 AEIEERREELGERARALYRSGGSVSYLDVllgsesfsdfldrlsALSKIADADADLL----EELKADKAELEAKKAELEA 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1658 QLEEEYEDKQKALREKRELESKLSTLSDQVNQrdfesekrLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLE 1737
Cdd:COG3883    155 KLAELEALKAELEAAKAELEAQQAEQEALLAQ--------LSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAA 226

                   ..
gi 1907082211 1738 ES 1739
Cdd:COG3883    227 AA 228
PDZ1_PTPN13_FRMPD2-like cd06694
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ ...
258-305 7.34e-06

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13),FERM and PDZ domain-containing protein 2 (FRMPD2), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], FRMPD2 (also known as PDZ domain-containing protein 4; PDZ domain-containing protein 5C), and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. FRMPD2 is localized in the basolateral membranes of polarized epithelial cells and is associated with tight junction formation and immune response; it contains 3 PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467180 [Multi-domain]  Cd Length: 92  Bit Score: 46.62  E-value: 7.34e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211  258 EPGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRL 305
Cdd:cd06694     38 IPG-GPADKDGRIKPGDRIIAINGQSLEGKTHHAAVEIIQNAPDKVEL 84
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
1649-2020 7.42e-06

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 51.98  E-value: 7.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQMEVQLEEEYEDKQKALREKRelesklsTLSDQVNQrdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKRE 1728
Cdd:TIGR01612  458 KSKLKALEKRFFEIFEEEWGSYDIKK-------DIDENSKQ-----DNTVKLILMRMKDFKDIIDFMELYKPDEVPSKNI 525
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1729 IAqlknqleeseFTCAAAVKARKAMEVEmedlhlqiddiAKAKTALEeQLSRLQREKNEIQNRLEEDQEDMNELMKkhka 1808
Cdd:TIGR01612  526 IG----------FDIDQNIKAKLYKEIE-----------AGLKESYE-LAKNWKKLIHEIKKELEEENEDSIHLEK---- 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 avaqasrdmaQMNDLQAQ----IEES---NKEKQELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQ 1881
Cdd:TIGR01612  580 ----------EIKDLFDKyleiDDEIiyiNKLKLELKEKIKNISDKNEYIKKAIDLKKIIENNNAYIDELAKISPYQVPE 649
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1882 -VKRLENLASRLKETMEKLTEerDQRAAAENrekEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELE-MDLESLEAAN 1959
Cdd:TIGR01612  650 hLKNKDKIYSTIKSELSKIYE--DDIDALYN---ELSSIVKENAIDNTEDKAKLDDLKSKIDKEYDKIQnMETATVELHL 724
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1960 QSLQADLKLAFKRIGDLQAAIEDEMesdeNEDLINSLQDMVTKYQKKKNKLEGDSDVDSEL 2020
Cdd:TIGR01612  725 SNIENKKNELLDIIVEIKKHIHGEI----NKDLNKILEDFKNKEKELSNKINDYAKEKDEL 781
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1291-1610 7.46e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 50.67  E-value: 7.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEM 1370
Cdd:COG4372     24 ILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEEL 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKKQMevmemevmearliraaeingevddddaggewrlkyeravrevdftkKRLQQELEDKmeveQQSRRQ 1450
Cdd:COG4372    104 ESLQEEAEELQEEL----------------------------------------------EELQKERQDL----EQQRKQ 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1451 LERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKlhlegQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQR 1530
Cdd:COG4372    134 LEAQIAELQSEIAEREEELKELEEQLESLQEELAALE-----QELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKL 208
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1531 EKDMLLAEAfslKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQ 1610
Cdd:COG4372    209 IESLPRELA---EELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALEL 285
Filament pfam00038
Intermediate filament protein;
1302-1627 8.34e-06

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 50.30  E-value: 8.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1302 NKDEEIQQLRSKL----EKVEkernELRLSSDRLETRISELTSELTDERN--------TGESASQLLDAETAERLRTEKE 1369
Cdd:pfam00038    1 NEKEQLQELNDRLasyiDKVR----FLEQQNKLLETKISELRQKKGAEPSrlyslyekEIEDLRRQLDTLTVERARLQLE 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1370 MKELQTQYDALKKQMEVMEMEVMEARliraAEINGEVDDDDAGGEWRLKYERAVR----EVDFTKKRLQQELedkmeveq 1445
Cdd:pfam00038   77 LDNLRLAAEDFRQKYEDELNLRTSAE----NDLVGLRKDLDEATLARVDLEAKIEslkeELAFLKKNHEEEV-------- 144
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1446 qsrRQLERRLGDlqadsdeSQRALQQLKKKCQRLTAELQDTKLHLEGQQVRN-HELEKKQRRFDSELSQAhEETQREKLQ 1524
Cdd:pfam00038  145 ---RELQAQVSD-------TQVNVEMDAARKLDLTSALAEIRAQYEEIAAKNrEEAEEWYQSKLEELQQA-AARNGDALR 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKlqrekdmllAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDIssqESKDEASLAKVKKQLRDLEAKVKDQEEELDE 1604
Cdd:pfam00038  214 SAK---------EEITELRRTIQSLEIELQSLKKQKASLERQLAET---EERYELQLADYQELISELEAELQETRQEMAR 281
                          330       340
                   ....*....|....*....|...
gi 1907082211 1605 QAGSIQMLEQAKLRLEMEMERMR 1627
Cdd:pfam00038  282 QLREYQELLNVKLALDIEIATYR 304
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1438-1991 8.40e-06

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 51.34  E-value: 8.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLE----RRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNH---------ELEKKQ 1504
Cdd:PRK10246   231 EEKQLLTAQQQQQQSlnwlTRLDELQQEASRRQQALQQALAAEEKAQPQLAALSLAQPARQLRPHweriqeqsaALAHTR 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 RRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDlDIAGFTQKVVSLEAELqdisSQESKDEASLAKV 1584
Cdd:PRK10246   311 QQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQSLNTWLAEHD-RFRQWNNELAGWRAQF----SQQTSDREQLRQW 385
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1585 KKQLRDLEAKVKDQEE-----ELDEQAGSI-QMLEQAKLRlememERMRQTHSK--EMESRDEEVEEARQSCQKKLKQME 1656
Cdd:PRK10246   386 QQQLTHAEQKLNALPAitltlTADEVAAALaQHAEQRPLR-----QRLVALHGQivPQQKRLAQLQVAIQNVTQEQTQRN 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1657 VQLEEeyedKQKALREKRELESKLSTLSDQvnqrdfesEKRLrKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKnqL 1736
Cdd:PRK10246   461 AALNE----MRQRYKEKTQQLADVKTICEQ--------EARI-KDLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALE--P 525
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1737 EESEFTCAAAVKARKAMEVEMEDLHLQIDdiakaktALEEQLsrlQREKNEIQNRLEEDQEdmneLMKKHKAAVAQASRD 1816
Cdd:PRK10246   526 GVNQSRLDALEKEVKKLGEEGAALRGQLD-------ALTKQL---QRDESEAQSLRQEEQA----LTQQWQAVCASLNIT 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1817 MAQMNDLQAQIEESNKEKQELQE--KLQALQSQVEfleqsmvdkslvsRQEAKIRELETRLEFEKTQVK-RLENLASRLK 1893
Cdd:PRK10246   592 LQPQDDIQPWLDAQEEHERQLRLlsQRHELQGQIA-------------AHNQQIIQYQQQIEQRQQQLLtALAGYALTLP 658
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1894 ETMEKLT--EERDQRAAAENREKEQNKRLQRQ------LRDTKEEMSELARKEAEAS----RKKHElemDLESLEAANQS 1961
Cdd:PRK10246   659 QEDEEASwlATRQQEAQSWQQRQNELTALQNRiqqltpLLETLPQSDDLPHSEETVAldnwRQVHE---QCLSLHSQLQT 735
                          570       580       590
                   ....*....|....*....|....*....|
gi 1907082211 1962 LQADLKLAFKRIGDLQAAIEDEMESDENED 1991
Cdd:PRK10246   736 LQQQDVLEAQRLQKAQAQFDTALQASVFDD 765
PDZ4_Scribble-like cd06701
PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
220-307 8.88e-06

PDZ domain 4 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467185 [Multi-domain]  Cd Length: 98  Bit Score: 46.45  E-value: 8.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLR------RTTMLDRAPEGQAYRRVVHfaePGAGTKDlalG-LVPGDRLVEINGQNVENKSRDEI 292
Cdd:cd06701      6 ELTIVKEPGEKLGISIRggakghAGNPLDPTDEGIFISKINP---DGAAARD---GrLKVGQRILEVNGQSLLGATHQEA 79
                           90
                   ....*....|....*
gi 1907082211  293 VEMIRQSGDSVRLKV 307
Cdd:cd06701     80 VRILRSVGDTLTLLV 94
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1788-2011 1.02e-05

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 50.66  E-value: 1.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1788 IQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKE----KQELQEKLQALQSQVEFLEQSMvdKSLVSR 1863
Cdd:pfam07888   32 LQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRElesrVAELKEELRQSREKHEELEEKY--KELSAS 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1864 QEAKIRELETRLEFEKTQVKRLEnlasRLKETMEKLTEERDQRAAAENREKEQNKRLQRQlrdtkeemselaRKEAEASR 1943
Cdd:pfam07888  110 SEELSEEKDALLAQRAAHEARIR----ELEEDIKTLTQRVLERETELERMKERAKKAGAQ------------RKEEEAER 173
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1944 KkhELEMDLESLEAANQSLQADLKLAFKRIGdlqaaiEDEMESDENEDLINSLQDMVTKYQKKKNKLE 2011
Cdd:pfam07888  174 K--QLQAKLQQTEEELRSLSKEFQELRNSLA------QRDTQVLQLQDTITTLTQKLTTAHRKEAENE 233
PLN02939 PLN02939
transferase, transferring glycosyl groups
1503-1910 1.09e-05

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 51.06  E-value: 1.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1503 KQRRFDSElsqaheETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLD-IAGFTQKVVSLEAELQDISSQesKDEASL 1581
Cdd:PLN02939    38 RRRGFSSQ------QKKKRGKNIAPKQRSSNSKLQSNTDENGQLENTSLRtVMELPQKSTSSDDDHNRASMQ--RDEAIA 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 AKVKKQLRdleaKVKDQEEELDEQAGS-IQMLEQAklrlEMEMERMRQTHSKEMESRDEEVEEaRQSCQKKLKQMEVQLE 1660
Cdd:PLN02939   110 AIDNEQQT----NSKDGEQLSDFQLEDlVGMIQNA----EKNILLLNQARLQALEDLEKILTE-KEALQGKINILEMRLS 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1661 EEYEDKQKALREKRELEsklsTLSDQVNQRDFESEKRLRKDLKRTKALLADaqimLDHLK-NNAPSKREIAQLKNQL--- 1736
Cdd:PLN02939   181 ETDARIKLAAQEKIHVE----ILEEQLEKLRNELLIRGATEGLCVHSLSKE----LDVLKeENMLLKDDIQFLKAELiev 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1737 EESEFTCAAAVKARKAMEVEMEDLHLQIddiakakTALEEQLSRLQREKNEIqnrLEEDQEDMNELMKKHKAAVAQASRD 1816
Cdd:PLN02939   253 AETEERVFKLEKERSLLDASLRELESKF-------IVAQEDVSKLSPLQYDC---WWEKVENLQDLLDRATNQVEKAALV 322
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1817 MAQMNDLQAQI---EESNKEKQELQEKLQalqsQVEFLEQSMvdKSLVSRQEAKIRELETrlefektQVKRLENLASRLK 1893
Cdd:PLN02939   323 LDQNQDLRDKVdklEASLKEANVSKFSSY----KVELLQQKL--KLLEERLQASDHEIHS-------YIQLYQESIKEFQ 389
                          410
                   ....*....|....*..
gi 1907082211 1894 ETMEKLTEERDQRAAAE 1910
Cdd:PLN02939   390 DTLSKLKEESKKRSLEH 406
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1420-1677 1.10e-05

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 49.92  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1420 ERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRlgdlqadsdESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRnHE 1499
Cdd:pfam13868   51 EERERALEEEEEKEEERKEERKRYRQELEEQIEER---------EQKRQEEYEEKLQEREQMDEIVERIQEEDQAEA-EE 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1500 LEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKdldiagftqkvvsLEAELQDIssqeskdEA 1579
Cdd:pfam13868  121 KLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEE-------------REAEREEI-------EE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 SLAKVKKQLRDLEAKVKDQEEELDEQagsiqmleQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQK----KLKQM 1655
Cdd:pfam13868  181 EKEREIARLRAQQEKAQDEKAERDEL--------RAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEqielKERRL 252
                          250       260
                   ....*....|....*....|..
gi 1907082211 1656 EVQLEEEYEDKQKALREKRELE 1677
Cdd:pfam13868  253 AEEAEREEEEFERMLRKQAEDE 274
HMMR_N pfam15905
Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of ...
1463-1840 1.19e-05

Hyaluronan mediated motility receptor N-terminal; HMMR_N is the N-terminal region of eukaryotic hyaluronan-mediated motility receptor proteins. The protein is functionally associated with BRCA1 and thus predicted to be a common, low-penetrance breast cancer candidate.


Pssm-ID: 464932 [Multi-domain]  Cd Length: 329  Bit Score: 49.81  E-value: 1.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1463 DESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHE-ETQREKLQREKLQREKDMllaeafs 1541
Cdd:pfam15905   24 EKSQRFRKQKAAESQPNLNNSKDASTPATARKVKSLELKKKSQKNLKESKDQKElEKEIRALVQERGEQDKRL------- 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1542 lkQQMEEKdldiagftqkVVSLEAELQDISSQESKDEASLAKVKKQLRDLeAKVKDqeeeldeqagsiqmLEQAKLRLEM 1621
Cdd:pfam15905   97 --QALEEE----------LEKVEAKLNAAVREKTSLSASVASLEKQLLEL-TRVNE--------------LLKAKFSEDG 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1622 EMERMRQTHSKEMESRDEeVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRkd 1701
Cdd:pfam15905  150 TQKKMSSLSMELMKLRNK-LEAKMKEVMAKQEGMEGKLQVTQKNLEHSKGKVAQLEEKLVSTEKEKIEEKSETEKLLE-- 226
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1702 lkrtkalladaqimldhlknnapskrEIAQLKNQLEESEftcaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRL 1781
Cdd:pfam15905  227 --------------------------YITELSCVSEQVE-------KYKLDIAQLEELLKEKNDEIESLKQSLEEKEQEL 273
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1782 QREKNEIQNRLEEDQEDMNELMKKHKAavaQASRDMAQMNDLQAQIEESNKEKQELQEK 1840
Cdd:pfam15905  274 SKQIKDLNEKCKLLESEKEELLREYEE---KEQTLNAELEELKEKLTLEEQEHQKLQQK 329
mukB PRK04863
chromosome partition protein MukB;
1531-1853 1.44e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 50.73  E-value: 1.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1531 EKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEAS-------LAKVKKQLR-------------D 1590
Cdd:PRK04863   280 ERRVHLEEALELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDyqaasdhLNLVQTALRqqekieryqadleE 359
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMR-------------QTHSKEMESRDEEVEEARQSCQK---KLKQ 1654
Cdd:PRK04863   360 LEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKsqladyqqaldvqQTRAIQYQQAVQALERAKQLCGLpdlTADN 439
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1655 MEVQLEEeYEDKQKALREK-RELESKLSTLSDQVNQRD------------------FESEKRLRKDLKRTKALLADAQIM 1715
Cdd:PRK04863   440 AEDWLEE-FQAKEQEATEElLSLEQKLSVAQAAHSQFEqayqlvrkiagevsrseaWDVARELLRRLREQRHLAEQLQQL 518
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1716 LDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEE- 1794
Cdd:PRK04863   519 RMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRl 598
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1795 ---------DQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQ 1853
Cdd:PRK04863   599 aarapawlaAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEIERLSQ 666
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1284-1659 1.73e-05

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 50.29  E-value: 1.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1284 FTTVRPLIQV-QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRlssdrletriSELTSELTDERNTGESASQLLDAETAE 1362
Cdd:pfam15964  349 FEKTKALIQCeQLKSELERQKERLEKELASQQEKRAQEKEALR----------KEMKKEREELGATMLALSQNVAQLEAQ 418
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1363 RLRTEKEMKELQTQYDALKKQMEVMemevmearliraaeingEVDDDDAGGEWR-------LKYERAVREVDFTKKRLQQ 1435
Cdd:pfam15964  419 VEKVTREKNSLVSQLEEAQKQLASQ-----------------EMDVTKVCGEMRyqlnqtkMKKDEAEKEHREYRTKTGR 481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1436 ELEDKmeveQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDT--KLHLEGQqvrnhELEKKQRRFDSEL-S 1512
Cdd:pfam15964  482 QLEIK----DQEIEKLGLELSESKQRLEQAQQDAARAREECLKLTELLGESehQLHLTRL-----EKESIQQSFSNEAkA 552
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1513 QAHEETQREK----------LQREKLQREKDMLLAEAFSLkqqmeekdldIAGFTQKVVSLEAELQDISSQESKDEASLA 1582
Cdd:pfam15964  553 QALQAQQREQeltqkmqqmeAQHDKTVNEQYSLLTSQNTF----------IAKLKEECCTLAKKLEEITQKSRSEVEQLS 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1583 KVKKQLRDLEAKVKDQEEELDEQAgsIQ---MLEQAKLRLEmEMERMRQTHSK---EMESRDEEVEEARQSCQKKLKQME 1656
Cdd:pfam15964  623 QEKEYLQDRLEKLQKRNEELEEQC--VQhgrMHERMKQRLR-QLDKHCQATAQqlvQLLSKQNQLFKERQNLTEEVQSLR 699

                   ...
gi 1907082211 1657 VQL 1659
Cdd:pfam15964  700 SQV 702
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1696-1982 1.76e-05

Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 49.53  E-value: 1.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1696 KRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVE-----MEDLHLQIDDIAKA 1770
Cdd:pfam13868    9 RELNSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEerkryRQELEEQIEEREQK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1771 KtalEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQE-KLQALQSQVE 1849
Cdd:pfam13868   89 R---QEEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDeRILEYLKEKA 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1850 FLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKR-----LQRQL 1924
Cdd:pfam13868  166 EREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQelqqaREEQI 245
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1925 RDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRigDLQAAIED 1982
Cdd:pfam13868  246 ELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRR--ELEKQIEE 301
mukB PRK04863
chromosome partition protein MukB;
1637-1978 1.76e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 50.34  E-value: 1.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1637 RDEEVEEArQSCQKKLKQMEVQLEEEyedkQKALREkreLESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIML 1716
Cdd:PRK04863   281 RRVHLEEA-LELRRELYTSRRQLAAE----QYRLVE---MARELAELN--------EAESDLEQDYQAASDHLNLVQTAL 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1717 DHLKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKA----KTALEEQLSRLQREKNEIQnRL 1792
Cdd:PRK04863   345 RQQEKIERYQADLEELEERLEEQNEVVEEADEQQEENEARAEAAEEEVDELKSQladyQQALDVQQTRAIQYQQAVQ-AL 423
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1793 EEDQEdMNELMKkhkAAVAQASRDMAQmndLQAQIEESNKEKQELQEKL---QALQSQVEF---LEQSMVDKslVSRQEA 1866
Cdd:PRK04863   424 ERAKQ-LCGLPD---LTADNAEDWLEE---FQAKEQEATEELLSLEQKLsvaQAAHSQFEQayqLVRKIAGE--VSRSEA 494
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1867 K--IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDqraaAENREKEQNKRLQRQLRDtkEEMSELARKEAEASRK 1944
Cdd:PRK04863   495 WdvARELLRRLREQRHLAEQLQQLRMRLSELEQRLRQQQR----AERLLAEFCKRLGKNLDD--EDELEQLQEELEARLE 568
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1907082211 1945 khELEMDLESLEAANQSLQADLKLAFKRIGDLQA 1978
Cdd:PRK04863   569 --SLSESVSEARERRMALRQQLEQLQARIQRLAA 600
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
1789-1982 1.88e-05

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 48.45  E-value: 1.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1789 QNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvDKSLVSRQEAKI 1868
Cdd:pfam12795    8 AKLDEAAKKKLLQDLQQALSLLDKIDASKQRAAAYQKALDDAPAELRELRQELAALQAKAE-------AAPKEILASLSL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1869 RELETRLEFEKTQvkrLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQR------QLRDTKEEMSELARKEAEAS 1942
Cdd:pfam12795   81 EELEQRLLQTSAQ---LQELQNQLAQLNSQLIELQTRPERAQQQLSEARQRLQQirnrlnGPAPPGEPLSEAQRWALQAE 157
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1943 RKKHELEMDLESLEAANQSLQADL-----KLAFKRIGDLQAAIED 1982
Cdd:pfam12795  158 LAALKAQIDMLEQELLSNNNRQDLlkarrDLLTLRIQRLEQQLQA 202
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1457-1952 1.89e-05

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 50.07  E-value: 1.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1457 DLQADSDESQralqQLKKKCQRL---TAELQDTKLHLegqQVRNHELEKKQRRFDSELSQA--------HEETQREKLQR 1525
Cdd:pfam05622    1 DLSEAQEEKD----ELAQRCHELdqqVSLLQEEKNSL---QQENKKLQERLDQLESGDDSGtpggkkylLLQKQLEQLQE 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1526 EKLQREKDmllAEAFSLKQQMEEKD-LDIAGFTQKVVSLEAElqdisSQESKDE-----ASLAKVKKqlrdLEAKV---K 1596
Cdd:pfam05622   74 ENFRLETA---RDDYRIKCEELEKEvLELQHRNEELTSLAEE-----AQALKDEmdilrESSDKVKK----LEATVetyK 141
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1597 DQEEELDEQAGSIQMLEqaklrlEMEMERMRQTHSKEMESRDEEVEEAR-QSCQKKLKQMEVQLEEEYEDKQKALREKRE 1675
Cdd:pfam05622  142 KKLEDLGDLRRQVKLLE------ERNAEYMQRTLQLEEELKKANALRGQlETYKRQVQELHGKLSEESKKADKLEFEYKK 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1676 LESKLSTLsdqvnQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS--------------------KREIAQLknQ 1735
Cdd:pfam05622  216 LEEKLEAL-----QKEKERLIIERDTLRETNEELRCAQLQQAELSQADALlspssdpgdnlaaeimpaeiREKLIRL--Q 288
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1736 LEESEFTCaaavKARKAMEVEMEDLHLQIDDIAKAKTALEEQLsRLQREK-NEIQNRLEEDQEDMNELMKKHKAAVAqas 1814
Cdd:pfam05622  289 HENKMLRL----GQEGSYRERLTELQQLLEDANRRKNELETQN-RLANQRiLELQQQVEELQKALQEQGSKAEDSSL--- 360
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1815 rdmaqmndLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvdkSLVSRQEAKIRELETRLefektQVKRLENLA--SRL 1892
Cdd:pfam05622  361 --------LKQKLEEHLEKLHEAQSELQKKKEQIEELEP-----KQDSNLAQKIDELQEAL-----RKKDEDMKAmeERY 422
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1893 KETMEKLTEE-RDQRAAAENREKEQNKRLQRQLRDtKEEMSELARKEAEASRKKHELEMDL 1952
Cdd:pfam05622  423 KKYVEKAKSViKTLDPKQNPASPPEIQALKNQLLE-KDKKIEHLERDFEKSKLQREQEEKL 482
ATG16 pfam08614
Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for ...
1556-1623 1.94e-05

Autophagy protein 16 (ATG16); Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.


Pssm-ID: 462536 [Multi-domain]  Cd Length: 176  Bit Score: 47.23  E-value: 1.94e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1556 FTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQML--EQAKLRLEMEM 1623
Cdd:pfam08614   76 LAQRLVDLNEELQELEKKLREDERRLAALEAERAQLEEKLKDREEELREKRKLNQDLqdELVALQLQLNM 145
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
1649-1964 1.95e-05

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 50.14  E-value: 1.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1649 QKKLKQMEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNAPS 1725
Cdd:pfam09731  124 QEKEKALEEVLKEaisKAESATAVAKEAKDDAIQAVKAHTDSLKEASDTAEISREKATDSALQKAEALAEKLKEVINLAK 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1726 KREIAQLKNQLEESEFTCAAAVKARKAME--VEMEDLHLQIDDIAKAKTALEEQLSRLQREK--NEIQNRLEEDQEDMNE 1801
Cdd:pfam09731  204 QSEEEAAPPLLDAAPETPPKLPEHLDNVEekVEKAQSLAKLVDQYKELVASERIVFQQELVSifPDIIPVLKEDNLLSND 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1802 lmkKHKAAVAQASRDMAQmndLQAQIEEsnKEKQELQEKLQALQSQVEFLEQSmvDKSLVSRQEAKIRELET--RLEFEK 1879
Cdd:pfam09731  284 ---DLNSLIAHAHREIDQ---LSKKLAE--LKKREEKHIERALEKQKEELDKL--AEELSARLEEVRAADEAqlRLEFER 353
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 TQVKRLENLASRLKETMEKLTEERDQRaaAENREKEQNKRLQRQ-LRDTKEEMSE----LARKEAEASRKKHELEMDLES 1954
Cdd:pfam09731  354 EREEIRESYEEKLRTELERQAEAHEEH--LKDVLVEQEIELQREfLQDIKEKVEEeragRLLKLNELLANLKGLEKATSS 431
                          330
                   ....*....|.
gi 1907082211 1955 -LEAANQSLQA 1964
Cdd:pfam09731  432 hSEVEDENRKA 442
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1809-1990 2.26e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 48.38  E-value: 2.26e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 AVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEqsmvdkslvsrqeAKIRELETRLEFEKTQVKRLENL 1888
Cdd:COG1579      1 AMPEDLRALLDLQELDSELDRLEHRLKELPAELAELEDELAALE-------------ARLEAAKTELEDLEKEIKRLELE 67
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1889 ASRLKETMEKLTEErdQRAAAENREKEQnkrLQRQLRDTKEEMSELARKEAEasrkkheLEMDLESLEAANQSLQADLKL 1968
Cdd:COG1579     68 IEEVEARIKKYEEQ--LGNVRNNKEYEA---LQKEIESLKRRISDLEDEILE-------LMERIEELEEELAELEAELAE 135
                          170       180
                   ....*....|....*....|..
gi 1907082211 1969 AFKRIGDLQAAIEDEMESDENE 1990
Cdd:COG1579    136 LEAELEEKKAELDEELAELEAE 157
CtpA COG0793
C-terminal processing protease CtpA/Prc, contains a PDZ domain [Posttranslational modification, ...
269-309 2.65e-05

C-terminal processing protease CtpA/Prc, contains a PDZ domain [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440556 [Multi-domain]  Cd Length: 341  Bit Score: 48.71  E-value: 2.65e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIR-QSGDSVRLKVQP 309
Cdd:COG0793     88 GIKPGDIILAIDGKSVAGLTLDDAVKLLRgKAGTKVTLTIKR 129
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
1728-1997 2.65e-05

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 49.30  E-value: 2.65e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1728 EIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHK 1807
Cdd:pfam19220   91 RLAKLEAALREAEAAKEELRIELRDKTAQAEALERQLAAETEQNRALEEENKALREEAQAAEKALQRAEGELATARERLA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1808 AAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMVD-KSLVSRQEAKIRELETRLEFEKT-QVKRL 1885
Cdd:pfam19220  171 LLEQENRRLQALSEEQAAELAELTRRLAELETQLDATRARLRALEGQLAAeQAERERAEAQLEEAVEAHRAERAsLRMKL 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1886 ENLASRLKETMEKLTEERDQ--RAAAENREKEQN--------KRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESL 1955
Cdd:pfam19220  251 EALTARAAATEQLLAEARNQlrDRDEAIRAAERRlkeasierDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEML 330
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1956 EAANQSLQADLKLAFKRIGDLQAAIE---DEMESDE------NEDLINSLQ 1997
Cdd:pfam19220  331 TKALAAKDAALERAEERIASLSDRIAeltKRFEVERaaleqaNRRLKEELQ 381
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1438-1647 2.77e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 49.06  E-value: 2.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEE 1517
Cdd:COG3883     15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1518 TQREKLQREKLqrekDMLL-AEAFS--------LKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQL 1588
Cdd:COG3883     95 LYRSGGSVSYL----DVLLgSESFSdfldrlsaLSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAK 170
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1589 RDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQS 1647
Cdd:COG3883    171 AELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAA 229
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1818-2028 2.89e-05

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 47.72  E-value: 2.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1818 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETrlEFEKTQvKRLENLASRLKEtME 1897
Cdd:pfam00261    1 KKMQQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAE------VAALNRRIQLLEE--ELERTE-ERLAEALEKLEE-AE 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1898 KLTEERDQ-RAAAENREKEQNKR---LQRQLRDTKEEMSELARKEAEASRKKHELEMDLE-------SLEAANQSLQADL 1966
Cdd:pfam00261   71 KAADESERgRKVLENRALKDEEKmeiLEAQLKEAKEIAEEADRKYEEVARKLVVVEGDLEraeeraeLAESKIVELEEEL 150
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1967 KLAFKRIGDLQAAIEDEMES-DENEDLINSLQD----MVTKYQ---KKKNKLEGDSDvdsELEDRVDGVK 2028
Cdd:pfam00261  151 KVVGNNLKSLEASEEKASEReDKYEEQIRFLTEklkeAETRAEfaeRSVQKLEKEVD---RLEDELEAEK 217
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1294-1665 3.34e-05

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 49.19  E-value: 3.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1294 QLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETrisELTSELTDERNTGESASQlldaETAERLRTEKEMKEL 1373
Cdd:COG5185    235 LKGFQDPESELEDLAQTSDKLEKLVEQNTDLRLEKLGENA---ESSKRLNENANNLIKQFE----NTKEKIAEYTKSIDI 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1374 QTQYDALKKQmevmemevmearlIRAAEINGEvddddaggewrlkYERAVREVDFTKKRLQQELEdkmeveqQSRRQLER 1453
Cdd:COG5185    308 KKATESLEEQ-------------LAAAEAEQE-------------LEESKRETETGIQNLTAEIE-------QGQESLTE 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1454 RLGDLQADSDESQRALQQlkkkcQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKL-QREKLQREK 1532
Cdd:COG5185    355 NLEAIKEEIENIVGEVEL-----SKSSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKAADrQIEELQRQI 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1533 DmllaeafSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESKDEASLAK--VKKQLRDLEAKVKDQEEELDEQAGSiq 1610
Cdd:COG5185    430 E-------QATSSNEEVSKLLNELISELNKVMREADEESQSRLEEAYDEINrsVRSKKEDLNEELTQIESRVSTLKAT-- 500
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1611 mLEQAKLRLEMEMERMRQTHSKEMESRDEEvEEARQSCQKKLKQMEVQLEEEYED 1665
Cdd:COG5185    501 -LEKLRAKLERQLEGVRSKLDQVAESLKDF-MRARGYAHILALENLIPASELIQA 553
Rab5-bind pfam09311
Rabaptin-like protein; Members of this family are predominantly found in Rabaptin and allow ...
1693-1966 3.37e-05

Rabaptin-like protein; Members of this family are predominantly found in Rabaptin and allow for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes.


Pssm-ID: 462752 [Multi-domain]  Cd Length: 307  Bit Score: 48.43  E-value: 3.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1693 ESEKRLRKDLKRTKALLADAQimlDHLKNNAPSKREIAQLKNQL-EESEFTCAAAVKARKAMEVEMEDLHLQIDdiaKAK 1771
Cdd:pfam09311   16 EQEAETRDQVKKLQEMLRQAN---DQLEKTMKDKKELEDKMNQLsEETSNQVSTLAKRNQKSETLLDELQQAFS---QAK 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1772 TALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVaqaSRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFL 1851
Cdd:pfam09311   90 RNFQDQLAVLMDSREQVSDELVRLQKDNESLQGKHSLHV---SLQQAEKFDMPDTVQELQELVLKYREELIEVRTAADHM 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1852 EQSMVDKSLVSRQEAKiRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEM 1931
Cdd:pfam09311  167 EEKLKAEILFLKEQIQ-AEQCLKENLEETLQAEIENCKEEIASISSLKVELERIKAEKEQLENGLTEKI-RQLEDLQTTK 244
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1907082211 1932 SELARKEAEASRKKHELEMDLESLEAANQSLQADL 1966
Cdd:pfam09311  245 GSLETQLKKETNEKAAVEQLVFEEKNKAQRLQTEL 279
DUF2046 pfam09755
Uncharacterized conserved protein H4 (DUF2046); This is the conserved N-terminal 350 residues ...
1602-1954 3.41e-05

Uncharacterized conserved protein H4 (DUF2046); This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.


Pssm-ID: 401633 [Multi-domain]  Cd Length: 304  Bit Score: 48.29  E-value: 3.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSIQMLEQAKLRLEMEMERMRqTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKREleskls 1681
Cdd:pfam09755   23 REQLQKRIESLQQENRVLKMELETYK-LRCKALQEENRALRQASVNIQAKAEQEEEFISNTLLKKIQALKKEKE------ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1682 TLSDQVNQRdfesEKRLRKDLKRTkalladaqimLDHLKnnapskREIAQLKNQLEeseftcaaavkarKAMEVEMEDLH 1761
Cdd:pfam09755   96 TLAMNYEQE----EEFLTNDLSRK----------LTQLR------QEKVELEQTLE-------------QEQEYQVNKLM 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1762 LQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDM-NELMKKhkaavaqasrdmaqMNDLQAqieesnkEKQELQEK 1840
Cdd:pfam09755  143 RKIEKLEAETLNKQTNLEQLRREKVELENTLEQEQEALvNRLWKR--------------MDKLEA-------EKRLLQEK 201
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1841 LQALQSQVEFLEQSMVDKSLVSRQEAKIRELetrlefeKTQVKRLENlasRLKETMEKLTEERDQRAAAENREKEQNKRL 1920
Cdd:pfam09755  202 LDQPVSAPPSPRDSTSEGDTAQNLTAHIQYL-------RKEVERLRR---QLATAQQEHTEKMAQYAQEERHIREENLRL 271
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1907082211 1921 QRQLRDTKEEMSELARK--EAEASrkkheLEMDLES 1954
Cdd:pfam09755  272 QRKLQLEMERREALCRHlsESESS-----LEMDEER 302
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1623-1969 3.56e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 49.36  E-value: 3.56e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1623 MERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQ----LEEEYEDKQKALREKRELESKLSTLSDQVNQRDFE----- 1693
Cdd:pfam07111   21 LERRLDTQRPTVTMWEQDVSGDGQGPGRRGRSLELEgsqaLSQQAELISRQLQELRRLEEEVRLLRETSLQQKMRleaqa 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1694 --------SEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTcAAAVKARKAMEVEMEDLHLQID 1765
Cdd:pfam07111  101 meldalavAEKAGQAEAEGLRAALAGAEMVRKNLEEGSQRELEEIQRLHQEQLSSLT-QAHEEALSSLTSKAEGLEKSLN 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1766 DIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMN---ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQ 1842
Cdd:pfam07111  180 SLETKRAGEAKQLAEAQKEAELLRKQLSKTQEELEaqvTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQEDRA 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1843 ALQSQVEFLE---QSMVdKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEerdQRAAAENREKEQNKR 1919
Cdd:pfam07111  260 DLQATVELLQvrvQSLT-HMLALQEEELTRKIQPSDSLEPEFPKKCRSLLNRWREKVFALMV---QLKAQDLEHRDSVKQ 335
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1920 LQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLA 1969
Cdd:pfam07111  336 LRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEVERMSAKGLQMELSRA 385
DUF4686 pfam15742
Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins ...
1595-1929 3.80e-05

Domain of unknown function (DUF4686); This family of proteins is found in eukaryotes. Proteins in this family are typically between 498 and 775 amino acids in length. There is a conserved DLK sequence motif.


Pssm-ID: 464838 [Multi-domain]  Cd Length: 384  Bit Score: 48.52  E-value: 3.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1595 VKDQEEELDEQAGSIQMLEQAKLRL---------EMEMER-----MRQTHSKEMESRDEEVEEARQSCQKKL--KQMEVQ 1658
Cdd:pfam15742    1 VSSGEKLKYQQQEEVQQLRQNLQRLqilctsaekELRYERgknldLKQHNSLLQEENIKIKAELKQAQQKLLdsTKMCSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1659 LEEEYEDKQKALREkRELESKLSTLSDQvnqrdfeSEKRLRKDLKRTKALLADAQIMldhlknnapskreIAQLKNQLEE 1738
Cdd:pfam15742   81 LTAEWKHCQQKIRE-LELEVLKQAQSIK-------SQNSLQEKLAQEKSRVADAEEK-------------ILELQQKLEH 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1739 SEFTCAAAVKA--RKAMEVEMEDLhlqIDDIAKAKTAL-EEQLSR--LQREKNEIQNRLEEDQEDMNELMKKHKAAVAQA 1813
Cdd:pfam15742  140 AHKVCLTDTCIleKKQLEERIKEA---SENEAKLKQQYqEEQQKRklLDQNVNELQQQVRSLQDKEAQLEMTNSQQQLRI 216
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1814 SRDMAQMNDLQAQI---EESNKEKQELQEKLQALQSQVEFL--EQSMVDKSL----------VSRQEAKIRELETRLEFE 1878
Cdd:pfam15742  217 QQQEAQLKQLENEKrksDEHLKSNQELSEKLSSLQQEKEALqeELQQVLKQLdvhvrkynekHHHHKAKLRRAKDRLVHE 296
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1879 KTQ----VKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKE 1929
Cdd:pfam15742  297 VEQrderIKQLENEIGILQQQSEKEKAFQKQVTAQNEILLLEKRKLLEQLTEQEE 351
PDZ2_Par3-like cd23058
PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 ...
260-307 3.99e-05

PDZ domain 2 of partitioning defective 3 (Par3), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of Par3 (or PAR3 or Par-3, also known as Atypical PKC isotype-specific-interacting protein, ASIP, Drosophila Bazooka) and related domains. Par3 is a scaffold protein involved in organizing cell polarity across animals. Par3 binds numerous molecules both for its recruitment to one pole of the cell and for downstream contributions to polarized cell function. It regulates cell polarity by targeting the Par complex proteins Par6 and atypical protein kinase C (aPKC) to specific cortical sites. Physical interactions between Par3 and the Par complex include Par3 PDZ domain 1 binding to the Par6 PDZ domain, Par3 PDZ domain 1 and PDZ domain 3 binding the Par6's PDZ-binding motif, and an interaction with an undefined region of aPKC that requires both Par3 PDZ2 and PDZ3. The PDZ domains of Par3 have also been implicated as potential phosphoinositide signaling integrators, since its second PDZ domain binds to phosphoinositides, and the third PDZ interacts with phosphoinositide phosphatase PTEN. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Par3 family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467271 [Multi-domain]  Cd Length: 93  Bit Score: 44.55  E-value: 3.99e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211  260 GAGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMIRQS--GDSVRLKV 307
Cdd:cd23058     43 GAAIQD---GrLKAGDRLLEVNGVDVTGKTQEEVVSLLRSTklGGTVSLVV 90
PDZ_6 pfam17820
PDZ domain; This entry represents the PDZ domain from a wide variety of proteins.
269-308 4.00e-05

PDZ domain; This entry represents the PDZ domain from a wide variety of proteins.


Pssm-ID: 436067 [Multi-domain]  Cd Length: 54  Bit Score: 43.29  E-value: 4.00e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1907082211  269 GLVPGDRLVEINGQNVENKsrDEIVEMIRQSGDS-VRLKVQ 308
Cdd:pfam17820   15 GLRVGDVILAVNGKPVRSL--EDVARLLQGSAGEsVTLTVR 53
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1548-1707 4.24e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 4.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1548 EKDLDIAGFTQKVVSLEAELQDIssqeskdEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEmemERMR 1627
Cdd:COG1579     14 ELDSELDRLEHRLKELPAELAEL-------EDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYE---EQLG 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1628 QTHS-KEMESRDEEVEEARQScQKKLKQMEVQLEEEYEDKQKALRE-KRELESKLSTLSDQVNQRDfESEKRLRKDLKRT 1705
Cdd:COG1579     84 NVRNnKEYEALQKEIESLKRR-ISDLEDEILELMERIEELEEELAElEAELAELEAELEEKKAELD-EELAELEAELEEL 161

                   ..
gi 1907082211 1706 KA 1707
Cdd:COG1579    162 EA 163
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1774-2012 4.51e-05

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 48.97  E-value: 4.51e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1774 LEEQLSRLQREKNeiQNRLEEDQEdMNELMKKHKAAVAQASRDMAQMNDLQAQIeeSNKEKQElQEKLQALQSQVEfleq 1853
Cdd:pfam05557    7 SKARLSQLQNEKK--QMELEHKRA-RIELEKKASALKRQLDRESDRNQELQKRI--RLLEKRE-AEAEEALREQAE---- 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1854 smvdkslVSRQEAKIRELETRLEFEKTQvkrLENLASRLKETMEKLTEERDQRAaaeNREKEQNKRLQRQLRDTKEEMSE 1933
Cdd:pfam05557   77 -------LNRLKKKYLEALNKKLNEKES---QLADAREVISCLKNELSELRRQI---QRAELELQSTNSELEELQERLDL 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1934 LARKEAEASRKKHELEMDLESLEAANQ---------SLQADLKLAFKRIGDLQAAIEdEMES------DENEDL------ 1992
Cdd:pfam05557  144 LKAKASEAEQLRQNLEKQQSSLAEAEQrikelefeiQSQEQDSEIVKNSKSELARIP-ELEKelerlrEHNKHLnenien 222
                          250       260
                   ....*....|....*....|
gi 1907082211 1993 INSLQDMVTKYQKKKNKLEG 2012
Cdd:pfam05557  223 KLLLKEEVEDLKRKLEREEK 242
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1291-1601 5.02e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 48.95  E-value: 5.02e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEM 1370
Cdd:pfam05483  457 IQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQI 536
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDK---MEVEQQS 1447
Cdd:pfam05483  537 ENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKnknIEELHQE 616
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1448 RRQLERRlgdlqadSDESQRALQQLKKKCQRLTAELQDTKLHLeGQQVRNHELEKKQRRFDSElsQAHEETQREKLQRE- 1526
Cdd:pfam05483  617 NKALKKK-------GSAENKQLNAYEIKVNKLELELASAKQKF-EEIIDNYQKEIEDKKISEE--KLLEEVEKAKAIADe 686
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1527 --KLQREKDML----LAEAFSLKQQ--------MEEKDLDIAGFTQK-------VVSLEAELQDIssqeskdEASLAKVK 1585
Cdd:pfam05483  687 avKLQKEIDKRcqhkIAEMVALMEKhkhqydkiIEERDSELGLYKNKeqeqssaKAALEIELSNI-------KAELLSLK 759
                          330
                   ....*....|....*....
gi 1907082211 1586 KQL---RDLEAKVKDQEEE 1601
Cdd:pfam05483  760 KQLeieKEEKEKLKMEAKE 778
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1438-1842 5.05e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 48.86  E-value: 5.05e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1438 EDKMEVEQQSRRQLERRLGDLQADSDESQ--------RALQQLKKKCQRLTAELQDTKlhlegqqvrNHELEKKQRrfdS 1509
Cdd:NF033838    54 ESQKEHAKEVESHLEKILSEIQKSLDKRKhtqnvalnKKLSDIKTEYLYELNVLKEKS---------EAELTSKTK---K 121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEETQREKLQREKLQREKDMLLAEA-FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESkdEASLAKVKKQL 1588
Cdd:NF033838   122 ELDAAFEQFKKDTLEPGKKVAEATKKVEEAeKKAKDQKEEDRRNYPTNTYKTLELEIAESDVEVKKA--ELELVKEEAKE 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1589 RDLEAKVKDQEEELDEQAGSIQMLEQAKlrlememermrqTHSKEMEsrdeevEEARQSCQKKLKQ-MEVQLEEEYEDKQ 1667
Cdd:NF033838   200 PRDEEKIKQAKAKVESKKAEATRLEKIK------------TDREKAE------EEAKRRADAKLKEaVEKNVATSEQDKP 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1668 KAlREKRELESKLSTlsdqvnqrdfesekrlrKDLKRTKALLADAQIMLDHLKNnaPSKREiaqlKNQLEESEFTCAAAV 1747
Cdd:NF033838   262 KR-RAKRGVLGEPAT-----------------PDKKENDAKSSDSSVGEETLPS--PSLKP----EKKVAEAEKKVEEAK 317
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1748 KARKAMEVE---------MEDLHLQiddIAKAKTALEE-QLSRLQREKNEIQNrleedQEDMNELMKKHKAAVAQASRdM 1817
Cdd:NF033838   318 KKAKDQKEEdrrnyptntYKTLELE---IAESDVKVKEaELELVKEEAKEPRN-----EEKIKQAKAKVESKKAEATR-L 388
                          410       420
                   ....*....|....*....|....*
gi 1907082211 1818 AQMNDLQAQIEESNKEKQELQEKLQ 1842
Cdd:NF033838   389 EKIKTDRKKAEEEAKRKAAEEDKVK 413
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1782-1961 6.15e-05

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 48.41  E-value: 6.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1782 QREKNEIQNRLEEDQEDMNELMKKHkaaVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVE----FLEQSMVD 1857
Cdd:pfam15709  353 KRREQEEQRRLQQEQLERAEKMREE---LELEQQRRFEEIRLRKQRLEEERQRQEEEERKQRLQLQAAqeraRQQQEEFR 429
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1858 KSLVSRQEAKIRELETRLEFEKTQVKRLE-NLASRLKETMEKLTEERDQraaaenrekeqnkrLQRQlrdtKEEMSELAR 1936
Cdd:pfam15709  430 RKLQELQRKKQQEEAERAEAEKQRQKELEmQLAEEQKRLMEMAEEERLE--------------YQRQ----KQEAEEKAR 491
                          170       180
                   ....*....|....*....|....*
gi 1907082211 1937 KEAEASRKKHELEMDLESLEAANQS 1961
Cdd:pfam15709  492 LEAEERRQKEEEAARLALEEAMKQA 516
PDZ_syntrophin-like cd06801
PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
270-308 7.09e-05

PDZ domain of syntrophins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of syntrophins (including alpha-1-syntrophin, beta-1-syntrophin, beta-2-syntrophin, gamma-1-syntrophin, and gamma-2-syntrophin), and related domains. Syntrophins play a role in recruiting various signaling molecules into signaling complexes and help provide appropriate spatiotemporal regulation of signaling pathways. They function in cytoskeletal organization and maintenance; as components of the dystrophin-glycoprotein complex (DGC), they help maintain structural integrity of skeletal muscle fibers. They link voltage-gated sodium channels to the actin cytoskeleton and the extracellular matrix, and control the localization and activity of the actin reorganizing proteins such as PI3K, PI(3,4)P2 and TAPP1. Through association with various cytoskeletal proteins within the cells, they are involved in processes such as regulation of focal adhesions, myogenesis, calcium homeostasis, and cell migration. They also have roles in synapse formation and in the organization of utrophin, acetylcholine receptor, and acetylcholinesterase at the neuromuscular synapse. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This syntrophin-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467262 [Multi-domain]  Cd Length: 83  Bit Score: 43.33  E-value: 7.09e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907082211  270 LVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06801     44 LFVGDAILSVNGENLEDATHDEAVQALKNAGDEVTLTVK 82
PDZ2_MAGI-1_3-like cd06732
PDZ domain 2 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
269-308 7.50e-05

PDZ domain 2 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as beta-strands A, -B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467214 [Multi-domain]  Cd Length: 82  Bit Score: 43.31  E-value: 7.50e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQS--GDSVRLKVQ 308
Cdd:cd06732     39 GLQEGDLIVEINGQNVQNLSHAQVVDVLKECpkGSEVTLLVQ 80
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1430-1628 8.40e-05

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 47.06  E-value: 8.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1430 KKRLQQELEDKMEVEQ-QSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDtklhLEGQqvrnHELEKKQRRfd 1508
Cdd:pfam09787   23 KEKLIASLKEGSGVEGlDSSTALTLELEELRQERDLLREEIQKLRGQIQQLRTELQE----LEAQ----QQEEAESSR-- 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1509 SELSQAHEETQREKLQREKLQrekdmllAEAFSLKQQM----EEKDLDIAGFTQKVVSLEAELQDISSQ-ESKDEASLAK 1583
Cdd:pfam09787   93 EQLQELEEQLATERSARREAE-------AELERLQEELryleEELRRSKATLQSRIKDREAEIEKLRNQlTSKSQSSSSQ 165
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1584 vkkqlRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQ 1628
Cdd:pfam09787  166 -----SELENRLHQLTETLIQKQTMLEALSTEKNSLVLQLERMEQ 205
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
1292-1661 1.02e-04

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 47.52  E-value: 1.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRIseltseLTDERNTGESASQLldaetaerlrtEKEMK 1371
Cdd:PRK04778   113 LLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSL------LANRFSFGPALDEL-----------EKQLE 175
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1372 ELQTQYDalkkqmevmemevmearliRAAEINGEVDdddaggewrlkYERAVREVDFTKKRLQQeLEDKME------VEQ 1445
Cdd:PRK04778   176 NLEEEFS-------------------QFVELTESGD-----------YVEAREILDQLEEELAA-LEQIMEeipellKEL 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1446 QSR------------RQLER---RLGDLQADSDesqraLQQLKKKCQRLTAELQDtkLHLEGQQVRNHELEKK------- 1503
Cdd:PRK04778   225 QTElpdqlqelkagyRELVEegyHLDHLDIEKE-----IQDLKEQIDENLALLEE--LDLDEAEEKNEEIQERidqlydi 297
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1504 -QRRFDSElSQAHEETQREKLQREKLQREKDMLLAEAFSLKQ--QMEEKDLDIA-GFTQKVVSLEAELQDISSQESKDEA 1579
Cdd:PRK04778   298 lEREVKAR-KYVEKNSDTLPDFLEHAKEQNKELKEEIDRVKQsyTLNESELESVrQLEKQLESLEKQYDEITERIAEQEI 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 SLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQ---THSKEMESRD-----EEVEEARQSCQKK 1651
Cdd:PRK04778   377 AYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNklhEIKRYLEKSNlpglpEDYLEMFFEVSDE 456
                          410
                   ....*....|
gi 1907082211 1652 LKQMEVQLEE 1661
Cdd:PRK04778   457 IEALAEELEE 466
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1304-1483 1.04e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.07  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1304 DEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQT--QYDALK 1381
Cdd:COG1579     16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNnkEYEALQ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1382 KQmevmemEVMEARLIRAAeingevddddaggewrlkyERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLGDLQAD 1461
Cdd:COG1579     96 KE------IESLKRRISDL-------------------EDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEE 150
                          170       180
                   ....*....|....*....|..
gi 1907082211 1462 SDESQRALQQLKKKCQRLTAEL 1483
Cdd:COG1579    151 LAELEAELEELEAEREELAAKI 172
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1861-1978 1.06e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 47.55  E-value: 1.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1861 VSRQEAKIRELEtrlefekTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRDTKEEMSELARKEAE 1940
Cdd:COG2433    408 LTEEEEEIRRLE-------EQVERLEAEVEELEAELEEK--------------DERIERLERELSEARSEERREIRKDRE 466
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1907082211 1941 ASRKKHE---LEMDLESLEAANQSLQADLKLaFKRIGDLQA 1978
Cdd:COG2433    467 ISRLDREierLERELEEERERIEELKRKLER-LKELWKLEH 506
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1868-2033 1.09e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 1.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1868 IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLqRQLRDTKEEMSELARKEAEASRKKHE 1947
Cdd:PRK03918   171 IKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREEL-EKLEKEVKELEELKEEIEELEKELES 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1948 LEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKYQKKKNKLEGDSdvdSELEDRVDGV 2027
Cdd:PRK03918   250 LEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL---SRLEEEINGI 326

                   ....*.
gi 1907082211 2028 KSWLSK 2033
Cdd:PRK03918   327 EERIKE 332
PRK12704 PRK12704
phosphodiesterase; Provisional
1463-1606 1.11e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 47.47  E-value: 1.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1463 DESQRALQQLKKKCQrLTA--ELQDTKLHLEGQ-QVRNHELEKKQRRFdselsQAHEETQREKLqrEKLQREKDMLLAEA 1539
Cdd:PRK12704    45 EEAKKEAEAIKKEAL-LEAkeEIHKLRNEFEKElRERRNELQKLEKRL-----LQKEENLDRKL--ELLEKREEELEKKE 116
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1540 FSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQeSKDEAS---LAKVKKQLR-DLEAKVKDQEEELDEQA 1606
Cdd:PRK12704   117 KELEQKQQELEKKEEELEELIEEQLQELERISGL-TAEEAKeilLEKVEEEARhEAAVLIKEIEEEAKEEA 186
PDZ3_DLG5-like cd06767
PDZ domain 3 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
269-308 1.14e-04

PDZ domain 3 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467248 [Multi-domain]  Cd Length: 82  Bit Score: 42.70  E-value: 1.14e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06767     42 GLEYGDQLLEVNGINLRNATEQQAALILRQCGDTITMLVQ 81
PDZ_MPP3-MPP4-MPP7-like cd06799
PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; ...
269-309 1.18e-04

PDZ domain of membrane palmitoylated proteins 3 (MPP3), MPP4, and MPP7, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP3, MPP4, and MPP7, and related domains. MPP3 (also known as MAGUK p55 subfamily member 3, erythrocyte membrane protein p55, or EMP55), MPP4 (also known as MAGUK p55 subfamily member 4 or Discs large homolog 6), and MPP7 (also known as MAGUK p55 subfamily member 7) are membrane-associated guanylate kinase (MAGUK)-like proteins. MPP3 is part of a cell adhesion protein complex including tumor suppressor CADM1 and actin-binding protein 4.1B. Participation in the Crumbs cell polarity complex has also been demonstrated for MPP7 in epithelial cells, and for MPP3 and MPP4 in the retina. MPP4 is needed for proper localization of plasma membrane calcium ATPases and maintenance of calcium homeostasis at the rod photoreceptor synaptic terminals. Binding partners of the MPP3 PDZ domain include nectin-3, serotonin 5-hydroxytryptamine, 5-HT(2C) receptor, and a cell adhesion protein, TSLC1 (tumor suppressor in lung cancer 1); fragments of MPP4 having the PDZ domain bind CRB (PDZ-SH3-GUK) and GABA transporter GAT1 (PDZ-SH3). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467260 [Multi-domain]  Cd Length: 81  Bit Score: 42.62  E-value: 1.18e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1907082211  269 GLV-PGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd06799     40 GLIhVGDELREVNGISVEGKDPEEVIQILANSQGPITFKLIP 81
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1772-1957 1.21e-04

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 46.67  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1772 TALEEQLSRLQREKN----EIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQ 1847
Cdd:pfam09787   43 TALTLELEELRQERDllreEIQKLRGQIQQLRTELQELEAQQQEEAESSREQLQELEEQLATERSARREAEAELERLQEE 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1848 VEFLEQSMVdKSLVSRQEaKIRELETRLEFEKTQVK--------------RLENLASRL--KETM-EKLTEERDQRAAAE 1910
Cdd:pfam09787  123 LRYLEEELR-RSKATLQS-RIKDREAEIEKLRNQLTsksqssssqselenRLHQLTETLiqKQTMlEALSTEKNSLVLQL 200
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211 1911 NREKEQNKRLQ-RQLRDTKEEMSELARKEAEASRKKHELEMDLESLEA 1957
Cdd:pfam09787  201 ERMEQQIKELQgEGSNGTSINMEGISDGEGTRLRNVPGLFSESDSDRA 248
Uso1_p115_C pfam04871
Uso1 / p115 like vesicle tethering protein, C terminal region; Also known as General vesicular ...
1866-1991 1.23e-04

Uso1 / p115 like vesicle tethering protein, C terminal region; Also known as General vesicular transport factor, Transcytosis associate protein (TAP) and Vesicle docking protein, this myosin-shaped molecule consists of an N-terminal globular head region, a coiled-coil tail which mediates dimerization, and a short C-terminal acidic region. p115 tethers COP1 vesicles to the Golgi by binding the coiled coil proteins giantin (on the vesicles) and GM130 (on the Golgi), via its C-terminal acidic region. It is required for intercisternal transport in the golgi stack. This family consists of the acidic C-terminus, which binds to the golgins giantin and GM130. p115 is thought to juxtapose two membranes by binding giantin with one acidic region, and GM130 with another.


Pssm-ID: 461461 [Multi-domain]  Cd Length: 121  Bit Score: 43.93  E-value: 1.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1866 AKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEnrekEQNKRLQRQLRDTKEEMSELARKEAEASRKK 1945
Cdd:pfam04871    1 AKKSELESEASSLKNENTELKAELQELSKQYNSLEQKESQAKELE----AEVKKLEEALKKLKAELSEEKQKEKEKQSEL 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907082211 1946 HELEMDLESLEAANQSLQADLKlafKRIGDLQAAIEDEMESDENED 1991
Cdd:pfam04871   77 DDLLLLLGDLEEKVEKYKARLK---ELGEEVLSDDEDDDEDDEEDD 119
CCDC73 pfam15818
Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil ...
1769-2020 1.27e-04

Coiled-coil domain-containing protein 73 family; CCDC73 is a family of eukaryotic coiled-coil containing proteins. The function is not known. The alternative name is sarcoma antigen NY-SAR-79.


Pssm-ID: 464893 [Multi-domain]  Cd Length: 1048  Bit Score: 47.63  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1769 KAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVA----QASRDMAQMND------LQAQIEEsnKEKQELQ 1838
Cdd:pfam15818   21 EAETQYEEQIGKIIVETQELKWQKETLQNQKETLAKQHKEAMAvfkkQLQMKMCALEEekgkyqLATEIKE--KEIEGLK 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1839 EKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEKTQVKRLEN----LASR---LKETMEKLtEERDQRAAAEN 1911
Cdd:pfam15818   99 ETLKALQVSKYSLQKK------VSEMEQKLQLHLLAKEDHHKQLNEIEKyyatITGQfglVKENHGKL-EQNVQEAIQLN 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1912 -REKEQNKRLQRQLRDTKEEM----SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFkrigDLQAAIEDEME- 1985
Cdd:pfam15818  172 kRLSALNKKQESEICSLKKELkkvtSDLIKSKVTCQYKMGEENINLTIKEQKFQELQERLNMEL----ELNKKINEEITh 247
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1907082211 1986 -SDENEDLINSLQDMVTKYQKkknKLEGDSDVDSEL 2020
Cdd:pfam15818  248 iQEEKQDIIISFQHMQQLLQQ---QTQANTEMEAEL 280
PRK11637 PRK11637
AmiB activator; Provisional
1726-1936 1.42e-04

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 47.00  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1726 KREIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIddiakakTALEEQLSRLQREKNEiQNRLEEDQEDMNELMKK 1805
Cdd:PRK11637    67 QQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQI-------DELNASIAKLEQQQAA-QERLLAAQLDAAFRQGE 138
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1806 HKAAVAQASRDMAQ-------------------MNDLQAQIEESNKEKQELQEKlQALQSQVEFLEQSMVDKSLVSRQEA 1866
Cdd:PRK11637   139 HTGLQLILSGEESQrgerilayfgylnqarqetIAELKQTREELAAQKAELEEK-QSQQKTLLYEQQAQQQKLEQARNER 217
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1867 K--IRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRD-------TKEEMSELAR 1936
Cdd:PRK11637   218 KktLTGLESSLQKDQQQLSELRANESRLRDSIARAEREAKARAEREAREAARVRDKQKQAKRkgstykpTESERSLMSR 296
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1769-2141 1.46e-04

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 47.35  E-value: 1.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1769 KAKTALEEQLSRLQ--REKNEIQNRLEEDQedmnelmkkhkaavAQASRDMAQMNDLQAQIEESNKEKQELQEKLqaLQS 1846
Cdd:PTZ00108  1031 AKKKDLVKELKKLGyvRFKDIIKKKSEKIT--------------AEEEEGAEEDDEADDEDDEEELGAAVSYDYL--LSM 1094
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QVEFLEQSMVDKsLVSRQEAKIRELEtrlEFEKTQVKR--LENLAsRLKETMEKlTEERDQRAAAENREKEQNKRL-QRQ 1923
Cdd:PTZ00108  1095 PIWSLTKEKVEK-LNAELEKKEKELE---KLKNTTPKDmwLEDLD-KFEEALEE-QEEVEEKEIAKEQRLKSKTKGkASK 1168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1924 LRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEDLINSLQDMVTKY 2003
Cdd:PTZ00108  1169 LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKN 1248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 2004 QKKKNKLEGDSDVDSELED-----RVDGVKSWLSKNKGPSKAPSDDGSLKSSRTalNSLPKEGKGPEERPSSVLSSLSYR 2078
Cdd:PTZ00108  1249 NSSKSSEDNDEFSSDDLSKegkpkNAPKRVSAVQYSPPPPSKRPDGESNGGSKP--SSPTKKKVKKRLEGSLAALKKKKK 1326
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 2079 KRLTlkdsiggTGDQDSLLTTLSERATSPERPPRKARggPREEPDQaqgrrcEELDDCSSIFS 2141
Cdd:PTZ00108  1327 SEKK-------TARKKKSKTRVKQASASQSSRLLRRP--RKKKSDS------SSEDDDDSEVD 1374
growth_prot_Scy NF041483
polarized growth protein Scy;
1493-1967 1.51e-04

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 47.51  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1493 QQVRNHELEKKQRRFDSE--LSQAHEETQR----EKLQREKLQREkdmLLAEAFSLKQQMEEK------------DLDIA 1554
Cdd:NF041483    76 QLLRNAQIQADQLRADAEreLRDARAQTQRilqeHAEHQARLQAE---LHTEAVQRRQQLDQElaerrqtveshvNENVA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1555 GFTQKVVSLEAE---LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEM-------- 1623
Cdd:NF041483   153 WAEQLRARTESQarrLLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESARAEAEAILRRARKDAERLLnaastqaq 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1624 ------ERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNqrdfesEKR 1697
Cdd:NF041483   233 eatdhaEQLRSSTAAESDQARRQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAKEAAAKQLASAESAN------EQR 306
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1698 LRKdlkrtkalladaqimldhlknnapSKREIAQLKNQ-LEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEE 1776
Cdd:NF041483   307 TRT------------------------AKEEIARLVGEaTKEAEALKAEAEQALADARAEAEKLVAEAAEKARTVAAEDT 362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1777 --QLSRLQREKNEIQNRLEEDQEDM-----NELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQ--------ELQEKL 1841
Cdd:NF041483   363 aaQLAKAARTAEEVLTKASEDAKATtraaaEEAERIRREAEAEADRLRGEAADQAEQLKGAAKDDTkeyraktvELQEEA 442
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1842 QALQSQVEFLEQSMVDKSLVSRQEAKireletrlefeKTQVKRLENLASRLKETMEKLTEERDQ--RAAAENREKEQNKR 1919
Cdd:NF041483   443 RRLRGEAEQLRAEAVAEGERIRGEAR-----------REAVQQIEEAARTAEELLTKAKADADElrSTATAESERVRTEA 511
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1920 LQR--QLRDTKEEMSELARKEAEasRKKHELEMDLESLEAANQSLQADLK 1967
Cdd:NF041483   512 IERatTLRRQAEETLERTRAEAE--RLRAEAEEQAEEVRAAAERAARELR 559
PDZ_rhophilin-like cd06712
PDZ domain of rhophilin-1, rhophilin-2, and related domains; PDZ (PSD-95 (Postsynaptic density ...
225-307 1.55e-04

PDZ domain of rhophilin-1, rhophilin-2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of rhophilin-1, rhophilin-2, and related domains. Rhophilin-1 (RHPN1, also known as GTP-Rho-binding protein 1) and rhophilin-2 (RHPN2, also known as GTP-Rho-binding protein 2) are Rho-GTP binding proteins involved in cytoskeletal dynamics. Rhophilin-2 inhibits RhoA's activity to induce F-actin stress fibers. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This rhophilin-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467196 [Multi-domain]  Cd Length: 78  Bit Score: 42.19  E-value: 1.55e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  225 RRPTGDFGFSLRrttmlDRAPegqayrrV-VHFAEPGaGTKDLAlGLVPGDRLVEINGQNVENKSRDEIVEMIRQSG-DS 302
Cdd:cd06712      7 TKEEGGFGFTLR-----GDSP-------VqVASVDPG-SCAAEA-GLKEGDYIVSVGGVDCKWSKHSEVVKLLKSAGeEG 72

                   ....*
gi 1907082211  303 VRLKV 307
Cdd:cd06712     73 LELQV 77
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1553-1797 1.58e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 46.75  E-value: 1.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1553 IAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQMLEQaklrlememermrqthsk 1632
Cdd:COG3883      4 LALAAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQA------------------ 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1633 EMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESK-LSTLSDQVNQRDF--ESEKRLRKDLKRTKALL 1709
Cdd:COG3883     66 EIDKLQAEIAEAEAEIEERREELGERARALYRSGGSVSYLDVLLGSEsFSDFLDRLSALSKiaDADADLLEELKADKAEL 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1710 ADAQIMLDHLKNnapskrEIAQLKNQLEEseftcaaavkARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQ 1789
Cdd:COG3883    146 EAKKAELEAKLA------ELEALKAELEA----------AKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAE 209

                   ....*...
gi 1907082211 1790 NRLEEDQE 1797
Cdd:COG3883    210 AAAAAAAA 217
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1745-1936 1.68e-04

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 45.20  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1745 AAVKARKAMEVEMEDLHlqiDDIAKAKTALEEQLSRLQRekneIQNRLEEDQEDMNELMKKHKAAVAQASRDMA------ 1818
Cdd:COG1842     20 KAEDPEKMLDQAIRDME---EDLVEARQALAQVIANQKR----LERQLEELEAEAEKWEEKARLALEKGREDLArealer 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1819 ------QMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqsmvdkslvsRQEAKIRELETRLEFEKTQVKRLENLAS-R 1891
Cdd:COG1842     93 kaeleaQAEALEAQLAQLEEQVEKLKEALRQLESKLE-------------ELKAKKDTLKARAKAAKAQEKVNEALSGiD 159
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1892 LKETMEKLT--EERDQRAAAEN---REKEQNKRLQRQLRDTKEEMS---ELAR 1936
Cdd:COG1842    160 SDDATSALErmEEKIEEMEARAeaaAELAAGDSLDDELAELEADSEvedELAA 212
MscS_porin pfam12795
Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part ...
1688-1959 1.70e-04

Mechanosensitive ion channel porin domain; The small mechanosensitive channel, MscS, is a part of the turgor-driven solute efflux system that protects bacteria from lysis in the event of osmotic shock. The MscS protein alone is sufficient to form a functional mechanosensitive channel gated directly by tension in the lipid bilayer. The MscS proteins are heptamers of three transmembrane subunits with seven converging M3 domains, and this MscS_porin is towards the N-terminal of the molecules. The high concentration of negative charges at the extracellular entrance of the pore helps select the cations for efflux.


Pssm-ID: 432790 [Multi-domain]  Cd Length: 238  Bit Score: 45.76  E-value: 1.70e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1688 NQRDFESEKRLRKDLKRTKALLADAQImldhlknnapSKREIAQLKNQLEEseftcaAAVKARKAMEvemEDLHLQIDDI 1767
Cdd:pfam12795    8 AKLDEAAKKKLLQDLQQALSLLDKIDA----------SKQRAAAYQKALDD------APAELRELRQ---ELAALQAKAE 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1768 AKAKTALEEQ-LSRLQREKNEIQNRLEEDQEDMNELMKkhkaavaqasrdmaQMNDLQAQIEESNKEKQELQEKLQALQS 1846
Cdd:pfam12795   69 AAPKEILASLsLEELEQRLLQTSAQLQELQNQLAQLNS--------------QLIELQTRPERAQQQLSEARQRLQQIRN 134
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QvefLEQSMVDKSLVSrqEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRaaaenreKEQNKRLQRQLRD 1926
Cdd:pfam12795  135 R---LNGPAPPGEPLS--EAQRWALQAELAALKAQIDMLEQELLSNNNRQDLLKARRDLL-------TLRIQRLEQQLQA 202
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1907082211 1927 TKEEMSELARKEAEASRKkhELEMDLESLEAAN 1959
Cdd:pfam12795  203 LQELLNEKRLQEAEQAVA--QTEQLAEEAAGDH 233
PDZ_SYNPO2-like cd10820
PDZ domain of synaptopodin 2 (SYNPO2), synaptopodin 2-like protein (SYNPO2L), and related ...
269-308 1.73e-04

PDZ domain of synaptopodin 2 (SYNPO2), synaptopodin 2-like protein (SYNPO2L), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of SYNPO2, SYNPO2L, and related domains. SYNPO2 (also known as genethonin-2, myopodin) is a cytoskeleton adaptor protein. It participates in chaperone-assisted selective autophagy (CASA), a mechanism for the disposal of misfolded and damaged proteins and provides a link between the CASA chaperone complex and a membrane-tethering and fusion machinery that generates autophagosome membranes. The SYNPO2 PPxY motif binds CASA cochaperone BCL2-associated athanogene 3 (BAG3) and the SYNPO2 PDZ domain binds vacuolar protein sorting 18 homolog (VPS18). There are three isoforms of SYNPO2, which possess an amino-terminal PDZ domain (SYNPO2a, b, c); the short isoform SYNPO2d, lacks the PDZ domain. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This SYNPO2-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467264 [Multi-domain]  Cd Length: 78  Bit Score: 42.30  E-value: 1.73e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd10820     39 GLCEGDELLSINGKPCADLSHSEAMDLIDSSGDTLQLLIK 78
PDZ1_L-delphilin-like cd06743
PDZ domain 1 of delphilin (L-delphilin isoform), and related domains; PDZ (PSD-95 ...
269-299 2.25e-04

PDZ domain 1 of delphilin (L-delphilin isoform), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of delphilin (also known as glutamate receptor, ionotropic, delta 2-interacting protein 1, L-delphilin). Delphilin, a postsynaptic protein which is selectively expressed at cerebellar Purkinje cells, links the glutamate receptor delta 2 subunit (GluRdelta2) with the actin cytoskeleton and various signaling molecules. Two alternatively spliced isoforms of delphilin have been characterized: L-delphilin has two PDZ domains, PDZ1 and PDZ2, and S-delphilin has a single PDZ domain (PDZ2). These two isoforms are differently palmitoylated and may be involved in controlling GluRdelta2 signaling in Purkinje cells. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This delphilin-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467225 [Multi-domain]  Cd Length: 76  Bit Score: 41.88  E-value: 2.25e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:cd06743     36 GLQPGDQILELDGQDVSSLSCEAIIALARRC 66
PRK12705 PRK12705
hypothetical protein; Provisional
1585-1765 2.41e-04

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 46.24  E-value: 2.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1585 KKQLRDLEAKVKDQEEELDEQAgsiqMLEQAKLRLEMEMERMRqtHSKEMESRDEEVEEARQscQKKLKQMEVQLEEEYE 1664
Cdd:PRK12705    27 KRQRLAKEAERILQEAQKEAEE----KLEAALLEAKELLLRER--NQQRQEARREREELQRE--EERLVQKEEQLDARAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 DKQKALREKRELESKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEEsEFTCA 1744
Cdd:PRK12705    99 KLDNLENQLEEREKALSARELELEEL----EKQLDNELYRVAGLTPEQARKLLLKLLDAELEEEKAQRVKKIEE-EADLE 173
                          170       180
                   ....*....|....*....|.
gi 1907082211 1745 AAVKARKAMEVEMEDLHLQID 1765
Cdd:PRK12705   174 AERKAQNILAQAMQRIASETA 194
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1484-2005 2.42e-04

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 46.75  E-value: 2.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1484 QDTKLHLEGQQVRNHELEKKQRRFDSELSQAHEETQreKLQREKLQREK-DMLLAEAFSLKQQMEEKDLDIagFTQKVVS 1562
Cdd:PTZ00440   617 EKNDLQEKVKYILNKFYKGDLQELLDELSHFLDDHK--YLYHEAKSKEDlQTLLNTSKNEYEKLEFMKSDN--IDNIIKN 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1563 LEAELQDISSQesKDEAslakVKKQLRDLEAKVKDQEEELDEQAGSIQM-LEQAKlrlemEMERMRQTHSKEMESRDEEV 1641
Cdd:PTZ00440   693 LKKELQNLLSL--KENI----IKKQLNNIEQDISNSLNQYTIKYNDLKSsIEEYK-----EEEEKLEVYKHQIINRKNEF 761
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1642 EEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTLSDQV--NQRDFESEKRLRKDLK--------RTKALL-- 1709
Cdd:PTZ00440   762 ILHLYENDKDLPDGKNTYEEFLQYKDTILNKENKISNDINILKENKknNQDLLNSYNILIQKLEahtekndeELKQLLqk 841
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1710 ---ADAQIMLDHLKNNAPSKREIAQ-LKNQLEESEFTCAAAVKARKAMEVEMEDLHLqIDDIAKAKTALEEQLS------ 1779
Cdd:PTZ00440   842 fptEDENLNLKELEKEFNENNQIVDnIIKDIENMNKNINIIKTLNIAINRSNSNKQL-VEHLLNNKIDLKNKLEqhmkii 920
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1780 ----------------RLQREKNEIQNRLeeDQEDMNEL-MKKHKA-AVAQASRDMAQMND--LQAQIEESNKEKQELQE 1839
Cdd:PTZ00440   921 ntdniiqkneklnllnNLNKEKEKIEKQL--SDTKINNLkMQIEKTlEYYDKSKENINGNDgtHLEKLDKEKDEWEHFKS 998
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1840 KLQALQSQVEFLEQSMVDksLVSRQEAKIRELETRLEFEKTQ-----VKRLENLASRLKETMEKLTeerdqraAAENREK 1914
Cdd:PTZ00440   999 EIDKLNVNYNILNKKIDD--LIKKQHDDIIELIDKLIKEKGKeieekVDQYISLLEKMKTKLSSFH-------FNIDIKK 1069
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1915 EQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAF----KRIGDLQAAIE---DEMESD 1987
Cdd:PTZ00440  1070 YKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNKSHEHVVNADKEKNKQTEHYnkkkKSLEKIYKQMEktlKELENM 1149
                          570
                   ....*....|....*....
gi 1907082211 1988 ENEDL-INSLQDMVTKYQK 2005
Cdd:PTZ00440  1150 NLEDItLNEVNEIEIEYER 1168
PRK11281 PRK11281
mechanosensitive channel MscK;
1292-1513 2.61e-04

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 46.44  E-value: 2.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1292 QVQLSEEQIRNKDEEIQQLRSKLEKVEKERNElRLSSDRLETRISELTSELTDERNT-GESASQLLDAETAERlRTEKEM 1370
Cdd:PRK11281    88 QLAQAPAKLRQAQAELEALKDDNDEETRETLS-TLSLRQLESRLAQTLDQLQNAQNDlAEYNSQLVSLQTQPE-RAQAAL 165
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDDDAggewRLKYERAVREVDFTKKRLQQELEDKMEVEQQsrrQ 1450
Cdd:PRK11281   166 YANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQALLNA----QNDLQRKSLEGNTQLQDLLQKQRDYLTARIQ---R 238
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1451 LERRLGDLQADSdeSQRALQQLKKKCQRltAELQDtklhlEGQQVRNHELEKKQRRFDSELSQ 1513
Cdd:PRK11281   239 LEHQLQLLQEAI--NSKRLTLSEKTVQE--AQSQD-----EAARIQANPLVAQELEINLQLSQ 292
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
1303-1608 2.62e-04

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 46.49  E-value: 2.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1303 KDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAE---TAERLRTEKEMKELQTQYDA 1379
Cdd:COG5185    266 RLEKLGENAESSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQLAAAEaeqELEESKRETETGIQNLTAEI 345
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1380 LKKQMEVMEmevmearliRAAEINGEVDDDDAGGEWRLKYER---AVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLG 1456
Cdd:COG5185    346 EQGQESLTE---------NLEAIKEEIENIVGEVELSKSSEEldsFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLK 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1457 DLQADSDESQRALQQLKKKcqrlTAELQDTKLHLEgqqvrnHELEKKQRRFDSELSQAHEETQREKLQR-----EKLQRE 1531
Cdd:COG5185    417 AADRQIEELQRQIEQATSS----NEEVSKLLNELI------SELNKVMREADEESQSRLEEAYDEINRSvrskkEDLNEE 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1532 KDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSLE---AELQDISSQESKDEASLAKVKKQlRDLEAKvKDQEEELDEQAGS 1608
Cdd:COG5185    487 LTQIESRVSTLKATLEKLRAKLERQLEGVRSKLdqvAESLKDFMRARGYAHILALENLI-PASELI-QASNAKTDGQAAN 564
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1774-1963 2.72e-04

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 46.21  E-value: 2.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1774 LEEQLSRLQREKNEIQNRLEEDQEDMNELmkKHKAAVAQASRDM------AQMNDLQAQIEESNKEKQELQEKLQAlqsQ 1847
Cdd:pfam15070   34 LSEQVRTLREEKERSVSQVQELETSLAEL--KNQAAVPPAEEEQppagpsEEEQRLQEEAEQLQKELEALAGQLQA---Q 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1848 VEFLEQSmvdKSLVSRQEAKIRELETRLEFEKTQVKRLEnlasRLKETME--KLTeerDQRAAAENRE-KEQNKRLQRQ- 1923
Cdd:pfam15070  109 VQDNEQL---SRLNQEQEQRLLELERAAERWGEQAEDRK----QILEDMQsdRAT---ISRALSQNRElKEQLAELQNGf 178
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1924 LRDTKEEM-------------SELARKEAEASRKKHEL--EMDLESLEAanQSLQ 1963
Cdd:pfam15070  179 VKLTNENMeltsalqseqhvkKELAKKLGQLQEELGELkeTLELKSQEA--QSLQ 231
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
1521-1672 2.91e-04

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 44.51  E-value: 2.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1521 EKLQREKLQREKDMLLAEAFSLKQQMEEKdldIAGFTQKVVSLEAELQdissQESKDEASLAKVKKQLRDLEAKVKDQEE 1600
Cdd:pfam13851   34 EIAELKKKEERNEKLMSEIQQENKRLTEP---LQKAQEEVEELRKQLE----NYEKDKQSLKNLKARLKVLEKELKDLKW 106
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1601 ELDeqagsiqMLEQAKLRLEMEMErmrqthskEMESRDEE-VEEARQSCQKK---LKQMEVQLEEEYEDKQKALRE 1672
Cdd:pfam13851  107 EHE-------VLEQRFEKVERERD--------ELYDKFEAaIQDVQQKTGLKnllLEKKLQALGETLEKKEAQLNE 167
PRK12704 PRK12704
phosphodiesterase; Provisional
1733-1907 2.96e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 45.92  E-value: 2.96e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KNQLEESEFTCAAAVK-ARKAMEVEMEDLHLQI-DDIAKAKTALEEQLsrlqREKNEIQNRLEEDQEDMNELMKKHKAAV 1810
Cdd:PRK12704    30 EAKIKEAEEEAKRILEeAKKEAEAIKKEALLEAkEEIHKLRNEFEKEL----RERRNELQKLEKRLLQKEENLDRKLELL 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1811 AQASRDMAQMNDLQAQIEESNKEKQELQEKLQalQSQVEFLEQSMvdkSLvSRQEAK---IRELETRLEFEKtqvkrlen 1887
Cdd:PRK12704   106 EKREEELEKKEKELEQKQQELEKKEEELEELI--EEQLQELERIS---GL-TAEEAKeilLEKVEEEARHEA-------- 171
                          170       180
                   ....*....|....*....|
gi 1907082211 1888 lASRLKETMEKLTEERDQRA 1907
Cdd:PRK12704   172 -AVLIKEIEEEAKEEADKKA 190
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
1673-1847 3.13e-04

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 45.39  E-value: 3.13e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1673 KRELESKLSTLSdqvnqrdfESEKRLRKDLKRTKALLADAQIMLDHLKnnapskREIAQLKNQleeseftcaaavkarka 1752
Cdd:smart00787  146 KEGLDENLEGLK--------EDYKLLMKELELLNSIKPKLRDRKDALE------EELRQLKQL----------------- 194
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1753 mEVEMEDLHLQIDDIAKAKtaleeqLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEE--- 1829
Cdd:smart00787  195 -EDELEDCDPTELDRAKEK------LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQcrg 267
                           170
                    ....*....|....*....
gi 1907082211  1830 -SNKEKQELQEKLQALQSQ 1847
Cdd:smart00787  268 fTFKEIEKLKEQLKLLQSL 286
PDZ5_MUPP1-like cd06669
PDZ domain 5 of multi-PDZ-domain protein 1 (MUPP1), PATJ (protein-associated tight junction) ...
269-311 3.20e-04

PDZ domain 5 of multi-PDZ-domain protein 1 (MUPP1), PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ5 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467157 [Multi-domain]  Cd Length: 98  Bit Score: 41.83  E-value: 3.20e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQSG-DSVRLKV-QPIP 311
Cdd:cd06669     54 RLLPGDRLVFVNDVSLENASLDEAVQALKSAPpGTVRIGVaKPLP 98
PDZ_MPP1-like cd10830
PDZ domain of membrane palmitoylated protein1 (MPP1), and related domains; PDZ (PSD-95 ...
273-309 3.35e-04

PDZ domain of membrane palmitoylated protein1 (MPP1), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MPP1, and related domains. MPP1 (also known as MAGUK p55 subfamily member 1, erythrocyte membrane protein p55, EMP55) is a membrane-associated guanylate kinase (MAGUK)-like protein which forms a complex with protein 4.1 and glycophorin C (GPC) at the cytoplasmic face of the plasma membrane; this complex is essential for cytoskeleton-membrane linkage in erythrocytes and many non-erythroid cells, and participates in the determination of membrane stability and cell shape. MPP1, by interacting with various scaffold proteins and cytoskeletal proteins in the postsynaptic density, also plays an important role in organizing synaptic and non-synaptic structures. MPP1 is also a component of the Crumbs protein complex in the mammalian retina and may link the Usher protein network and the Crumbs protein complex. The MPP1 PDZ domain binding partners include GPC, ABCC4, and CADM1/Necl-2/SynCAM1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467266 [Multi-domain]  Cd Length: 81  Bit Score: 41.39  E-value: 3.35e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907082211  273 GDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd10830     45 GDEILEINGKSVTNHSVDQLQKMLKETKGMVSLKVIP 81
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
1356-1622 3.57e-04

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 45.84  E-value: 3.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1356 LDAETAERLRTEKEMKELQTQYDALKKQMEVMEMEVMEARliraaEINGEV-----------DDDDAGGEWRLKYERAVR 1424
Cdd:pfam05622  199 LSEESKKADKLEFEYKKLEEKLEALQKEKERLIIERDTLR-----ETNEELrcaqlqqaelsQADALLSPSSDPGDNLAA 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1425 EV---DFTKKRLQQELEDKMEVEQQSrRQLERRLGDLQADSDESQRALQ----QLKKKCQR---LTAELQDTKLHLEGQQ 1494
Cdd:pfam05622  274 EImpaEIREKLIRLQHENKMLRLGQE-GSYRERLTELQQLLEDANRRKNeletQNRLANQRileLQQQVEELQKALQEQG 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1495 VRNHELEKKQRRFDS---ELSQAHEETQREKLQREKLQREKDMLLAE-AFSLKQQMEEKDLDIAGFTQK----------- 1559
Cdd:pfam05622  353 SKAEDSSLLKQKLEEhleKLHEAQSELQKKKEQIEELEPKQDSNLAQkIDELQEALRKKDEDMKAMEERykkyvekaksv 432
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1560 VVSLEAELQDISSQEskdeasLAKVKKQLRDLEAKVKDQEEEldeqagsiqmLEQAKLRLEME 1622
Cdd:pfam05622  433 IKTLDPKQNPASPPE------IQALKNQLLEKDKKIEHLERD----------FEKSKLQREQE 479
CENP-F_leu_zip pfam10473
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ...
1769-1933 3.63e-04

Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.


Pssm-ID: 463102 [Multi-domain]  Cd Length: 140  Bit Score: 43.05  E-value: 3.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1769 KAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQV 1848
Cdd:pfam10473    3 KKQLHVLEKLKESERKADSLKDKVENLERELEMSEENQELAILEAENSKAEVETLKAEIEEMAQNLRDLELDLVTLRSEK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1849 EFLEQSmvdkslvsrqeakireletrLEFEKTQVKRLENLASRLKETMEKLTEERDQraaaenrEKEQNKRLQRQLRDTK 1928
Cdd:pfam10473   83 ENLTKE--------------------LQKKQERVSELESLNSSLENLLEEKEQEKVQ-------MKEESKTAVEMLQTQL 135

                   ....*
gi 1907082211 1929 EEMSE 1933
Cdd:pfam10473  136 KELNE 140
PDZ4_DLG5-like cd06766
PDZ domain 4 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density ...
237-308 3.88e-04

PDZ domain 4 of Discs Large 5 (Dlg5) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 4 of Drosophila and mammalian Dlg5, and related domains. Dlg5 is a scaffold protein with multiple conserved functions that are independent of each other in regulating growth, cell polarity, and cell adhesion. It has a coiled-coil domain, 4 PDZ domains and a MAGUK domain (an SH3 domain next to a non-catalytically active guanylate kinase domain). Deregulation of Dlg5 has been implicated in the malignancy of several cancer types. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg5-like family PDZ4 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467247 [Multi-domain]  Cd Length: 81  Bit Score: 41.22  E-value: 3.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  237 RTTMLDRAPE--------GQAYRRVVHFAEPGAGTKDLAlGLVPGDRLVEINGQNVENKSRDEI-VEMIRQSgDSVRLKV 307
Cdd:cd06766      3 RLVFLKKSQVelgiqlcgGNLHGIFVEDVEDDSPAKGPD-GLVPGDLILEYNSVDMRNKTAEEAyLEMLKPA-ETVTLKV 80

                   .
gi 1907082211  308 Q 308
Cdd:cd06766     81 Q 81
Caldesmon pfam02029
Caldesmon;
1624-1949 3.94e-04

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 45.63  E-value: 3.94e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1624 ERMRQTHSKEMESR-DEEVEEARQSCQKKLKQMEVQLEEEYEDKQKalrekrelesklstLSDQVNQRDFESEKRLRKDL 1702
Cdd:pfam02029   18 ERRRQKEEEEPSGQvTESVEPNEHNSYEEDSELKPSGQGGLDEEEA--------------FLDRTAKREERRQKRLQEAL 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1703 KRTKALLadaqimldhlKNNAPSKREIAQLKNQLEESEFTCAAAVKARKAMEVEMEdlhlqiddiakaktalEEQLSRlq 1782
Cdd:pfam02029   84 ERQKEFD----------PTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYK----------------EEETEI-- 135
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1783 REKNEIQNRLEEDQEDMNELMKKHKAAVAQASR-DMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS------M 1855
Cdd:pfam02029  136 REKEYQENKWSTEVRQAEEEGEEEEDKSEEAEEvPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKsqngeeE 215
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1856 VDKSLVS--RQEAKIRELETRLEFEKTQV---KRLENLASRLKET----MEKLtEERDQRAAAE----NREKEQNKRL-- 1920
Cdd:pfam02029  216 VTKLKVTtkRRQGGLSQSQEREEEAEVFLeaeQKLEELRRRRQEKeseeFEKL-RQKQQEAELEleelKKKREERRKLle 294
                          330       340
                   ....*....|....*....|....*....
gi 1907082211 1921 QRQLRDTKEEMSELARKEAEASRKKHELE 1949
Cdd:pfam02029  295 EEEQRRKQEEAERKLREEEEKRRMKEEIE 323
secA PRK12903
preprotein translocase subunit SecA; Reviewed
1815-1991 4.16e-04

preprotein translocase subunit SecA; Reviewed


Pssm-ID: 237258 [Multi-domain]  Cd Length: 925  Bit Score: 45.82  E-value: 4.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1815 RDMAQMNDLQAQIEESNKEKQELQEKLqALQSQVEFLEQSMVDKSLvsrqeaKIRELETRLEFEKTQVKRLENLASRL-K 1893
Cdd:PRK12903   741 VQYSQKNPYQVYTEEGTKKFNILLQEI-AYDVIVSLFNNPNAEKIL------IITEILSDGINNSDINDRPQELIDQIiE 813
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1894 ETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFK-R 1972
Cdd:PRK12903   814 SEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQNINENKNEFVEFKNDPKKLNKLIIAKdV 893
                          170
                   ....*....|....*....
gi 1907082211 1973 IGDLQAAIEDEMESDENED 1991
Cdd:PRK12903   894 LIKLVISSDEIKQDEKTTK 912
PDZ_PDLIM-like cd06753
PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density ...
269-308 4.20e-04

PDZ domain of PDZ-LIM family proteins, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of PDZ-LIM family proteins including PDLIM1-7, and related domains. PDZ-LIM family proteins (also known as Zasp PDZ domain proteins) are involved in the rearrangement of the actin cytoskeleton; they mediate association with the cytoskeleton through alpha-actinin as well as with other proteins involved in signal transduction pathways. Members of this family include PDLIM1 (also known as C-terminal LIM domain protein 1, elfin, LIM domain protein CLP-36), PDLIM2 (also known as PDZ-LIM protein mystique), PDLIM3 (also known as actinin-associated LIM protein, alpha-actinin-2-associated LIM protein, ALP), PDLIM4 (also known as LIM protein RIL, Reversion-induced LIM protein), PDLIM5 (also known as enigma homolog, ENH, enigma-like PDZ and LIM domains protein), PDLIM6 (also known as LIM domain-binding protein 3, ZASP, Cypher, Oracle), and PDLIM7 (also known as PDZ and LIM domain protein 7, LIM mineralization protein, LMP; protein enigma). PDLIM1 has been shown to negatively regulate NF-kappaB-mediated signaling in the cytoplasm. PDLIM7 negatively regulates p53 through binding murine double minute 2 (MDM2). The PDZ domains of PDZ-LIM family proteins PDLIM1, 2, 3, 5, 6, 7 have been shown to bind actin. Other PDZ-LIM family PDZ domain binding partners include thyroid receptor interacting protein-6 (PDLIM4-PDZ), the LIM domain of PDLIM4 (PDLIM4-PDZ), tropomyosin (PDLIM7-PDZ), myotilin and calsarcin 1 (PDLIM6-PDZ), and proteins from the myotilin and FATZ (calsarcin/myozenin) families (PDLIM1, 3, 4, 6 PDZ domains). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PDLIM-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467235 [Multi-domain]  Cd Length: 79  Bit Score: 40.98  E-value: 4.20e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06753     39 NLRPGDVILAINGESTEGMTHLEAQNKIKAATGSLSLTLE 78
PDZ2_Dlg1-2-4-like cd06724
PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg) ...
258-308 4.23e-04

PDZ domain 2 of human discs large homolog 1 (Dlg1), Dlg2, and Dlg4, Drosophila disc large (Dlg), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Dlg1, human Dlg1,2, and 4 and related domains. Dlg1 (also known as synapse-associated protein Dlg197 or SAP-97), Dlg2 (also known as channel-associated protein of synapse-110, postsynaptic density protein 93, or PSD-93), Dlg4 (also known as postsynaptic density protein 95, PSD-95, synapse-associated protein 90, or SAP-90) each have 3 PDZ domains and belong to the membrane-associated guanylate kinase family. Dlg1 regulates antigen receptor signaling and cell polarity in lymphocytes, B-cell proliferation and antibody production, and TGFalpha bioavailability; its PDZ3 domain binds pro-TGFalpha, and its PDZ2 domain binds the TACE metalloprotease responsible for cleaving pro-TGFalpha to a soluble form. Dlg2 is involved in N-methyl-D-aspartate (NMDA) receptor signaling. It regulates surface expression of NMDA receptors in dorsal horn neurons of the spinal cord, and it also interacts with NMDA receptor subunits and with Shaker-type K+ channel subunits to cluster into a channel complex. Dlg4 PDZ1 domain binds NMDA receptors, and its PDZ2 domain binds neuronal nitric oxide synthase (nNOS), forming a complex in neurons. The Drosophila Scribble complex (Scribble, Dlg, and lethal giant larvae) plays a role in apico-basal cell polarity, and in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Postsynaptic targeting of Drosophila DLG requires interactions mediated by the first two PDZ domains. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Dlg-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467207 [Multi-domain]  Cd Length: 85  Bit Score: 41.10  E-value: 4.23e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211  258 EPGAGTKDLALGLvpGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06724     37 EGGAAQKDGRLQV--GDKLLAVNDVSLEEVTHEEAVAALKNTSDVVYLKVA 85
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
1799-2015 4.38e-04

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 45.05  E-value: 4.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1799 MNELMKKHKAAVAQASRDMAqMNDLQAQIEESNKEKQELQEKLQALQSQV-EFLEQSMVDKSlvsrqeaKIRELETRLEf 1877
Cdd:cd22656     96 ILELIDDLADATDDEELEEA-KKTIKALLDDLLKEAKKYQDKAAKVVDKLtDFENQTEKDQT-------ALETLEKALK- 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1878 ektqvkrlenlasrlketmEKLTEErdqraaAENREKEQNKRLQRQLrdtKEEMSELARKEAEASRKKHELEMDLESLEA 1957
Cdd:cd22656    167 -------------------DLLTDE------GGAIARKEIKDLQKEL---EKLNEEYAAKLKAKIDELKALIADDEAKLA 218
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1958 ANQSLQADLKLAFKRIGDLQAAIEDEMESDENedLINSLQDMVTKYQKKKNKLEGDSD 2015
Cdd:cd22656    219 AALRLIADLTAADTDLDNLLALIGPAIPALEK--LQGAWQAIATDLDSLKDLLEDDIS 274
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1828-1950 4.47e-04

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 42.21  E-value: 4.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1828 EESNKEKQELQEKLQALQSQVEFLEQSMvdkslvSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKL------TE 1901
Cdd:pfam20492    2 EEAEREKQELEERLKQYEEETKKAQEEL------EESEETAEELEEERRQAEEEAERLEQKRQEAEEEKERLeesaemEA 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211 1902 ERDQRAAAENREKEQnkRLQRQlrdtkeeMSELARKEAEASRKKHELEM 1950
Cdd:pfam20492   76 EEKEQLEAELAEAQE--EIARL-------EEEVERKEEEARRLQEELEE 115
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1291-1686 4.97e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 45.51  E-value: 4.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEeIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEM 1370
Cdd:pfam07111  318 LMVQLKAQDLEHRDS-VKQLRGQVAELQEQVTSQSQEQAILQRALQDKAAEVEVERMSAKGLQMELSRAQEARRRQQQQT 396
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 K--ELQTQYDALKKQMEVMEMEVMEARLIRAAEINGEVDDddaggewRLKYerAVREVDFTKKRLQQeledKMEVEQQSR 1448
Cdd:pfam07111  397 AsaEEQLKFVVNAMSSTQIWLETTMTRVEQAVARIPSLSN-------RLSY--AVRKVHTIKGLMAR----KVALAQLRQ 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQADSDESQRaLQQLKKKCQRLTAELQdTKLHLEGQQV-RNHELEKKQRRFDSELSQAHEetqreklqrEK 1527
Cdd:pfam07111  464 ESCPPPPPAPPVDADLSLE-LEQLREERNRLDAELQ-LSAHLIQQEVgRAREQGEAERQQLSEVAQQLE---------QE 532
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1528 LQREKDMLLaeafSLKQQMEEKDLDIAGFTQKVVSLEAEL---QDISSQESKDEasLAKVKKQLRDLEAKVKDQ-EEELD 1603
Cdd:pfam07111  533 LQRAQESLA----SVGQQLEVARQGQQESTEEAASLRQELtqqQEIYGQALQEK--VAEVETRLREQLSDTKRRlNEARR 606
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1604 EQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKALREKRELESKLSTL 1683
Cdd:pfam07111  607 EQAKAVVSLRQIQHRATQEKERNQELRRLQDEARKEEGQRLARRVQELERDKNLMLATLQQEGLLSRYKQQRLLAVLPSG 686

                   ...
gi 1907082211 1684 SDQ 1686
Cdd:pfam07111  687 LDK 689
DUF1633 pfam07794
Protein of unknown function (DUF1633); This family contains sequences derived from a group of ...
1585-1781 5.24e-04

Protein of unknown function (DUF1633); This family contains sequences derived from a group of hypothetical proteins expressed by Arabidopsis thaliana. These sequences are highly similar and the region concerned is about 100 residues long.


Pssm-ID: 116408 [Multi-domain]  Cd Length: 698  Bit Score: 45.26  E-value: 5.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1585 KKQLRDLEAKVK-DQEEELDEQAgsiQMLEQAKLRLEMEMermrqTHSKEMESRDEEveearqscqkKLKQMEVQLEEEY 1663
Cdd:pfam07794  457 ERAIREEDPHLGaDQDREVRFGA---EGIVPGIERLKIEL-----STSKDLEKGYAE----------KIGFMEMEFGGLE 518
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1664 EDKQKALREKRELESKLSTLSDQVnqRDFESEKR-LRKDLKRTKALLADAQI-MLDHLKNNAPSKREIAQLKNQLEESEF 1741
Cdd:pfam07794  519 ADKQMARNQIHRLEEKKDELSKKV--LDLTSIAQgAKKAVHDAKVELAAAYLkLLAGIKDKWVAKKEFTVLEGQAAEVES 596
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1907082211 1742 TCA---AAVKARKAMEVEMEDLHLQIDDIaKAKTALEE----QLSRL 1781
Cdd:pfam07794  597 NLAlidQITKAAIDLTLEKPRFQAEIDDL-EARCKLKEvsdfTLSKL 642
GOLGA2L5 pfam15070
Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein ...
1305-1799 5.25e-04

Putative golgin subfamily A member 2-like protein 5; The function of the GOLGA2L5 protein family remains unknown. This family of proteins is thought to be found in the Golgi apparatus of eukaryotes.


Pssm-ID: 464485 [Multi-domain]  Cd Length: 521  Bit Score: 45.05  E-value: 5.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEIQQLRSKLEKVEKERnelrlssDRLETRISELTSELTDERNTGESASQLLDAETA----ERLRTEKEMKELQTQYDAL 1380
Cdd:pfam15070   29 QKMQQLSEQVRTLREEK-------ERSVSQVQELETSLAELKNQAAVPPAEEEQPPAgpseEEQRLQEEAEQLQKELEAL 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1381 kkqmevmemevmearlirAAEINGEVDDDDagGEWRLKYERAVREVDftkkrLQQELEDKMEVEQQSRRQLErrlgDLQA 1460
Cdd:pfam15070  102 ------------------AGQLQAQVQDNE--QLSRLNQEQEQRLLE-----LERAAERWGEQAEDRKQILE----DMQS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1461 DSDESQRALQQLKKKCQRLtAELQD-------------TKLHLEgQQVRNhELEKKQRRFDSELSQAHEETQREKLQREK 1527
Cdd:pfam15070  153 DRATISRALSQNRELKEQL-AELQNgfvkltnenmeltSALQSE-QHVKK-ELAKKLGQLQEELGELKETLELKSQEAQS 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1528 LQREKDMLLAE----AFSLKQQMEEKDLdiagfTQKVVSLEAELQD-ISSQESKDEASLAKVKKQLRD----LEAKVKDQ 1598
Cdd:pfam15070  230 LQEQRDQYLAHlqqyVAAYQQLASEKEE-----LHKQYLLQTQLMDrLQHEEVQGKVAAEMARQELQEtqerLEALTQQN 304
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1599 EEEldeQAGSIQMLEQAK---LRLEMEMERMRQTH---SKEMESRDEEVE---EARQSCQKKLKQMEVQLEEEYedkqka 1669
Cdd:pfam15070  305 QQL---QAQLSLLANPGEgdgLESEEEEEEAPRPSlsiPEDFESREAMVAffnSALAQAEEERAELRRQLKEQK------ 375
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1670 lREKRELESKLSTLSDQvnQRDFESEKRLRKD---LKRTKALladaQIMLDHLKNNAPS-KREIAQLKNQLEESEFTCAa 1745
Cdd:pfam15070  376 -RRCRRLAQQAAPAQEE--PEHEAHAPGTGGDsvpVEVHQAL----QVAMEKLQSRFTElMQEKADLKERVEELEHRCI- 447
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1746 avkarkamevemeDLHLQIDDIAKAKTALEEQ---LSRLQREKNEIQNRLEEDQEDM 1799
Cdd:pfam15070  448 -------------QLSGETDTIGEYIALYQSQraiLKQRHREKEEYISRLAQDKEEM 491
PDZ1_PTPN13-like cd23072
PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related ...
259-309 6.25e-04

PDZ domain 1 of protein tyrosine phosphatase non-receptor type 13 (PTPN13), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of PTPN13 [also known as Fas-associated protein-tyrosine phosphatase 1 (FAP-1), protein-tyrosine phosphatase 1E (PTP-E1), and protein-tyrosine phosphatase (PTPL1)], and related domains. PTPN13 regulates negative apoptotic signaling and mediates phosphoinositide 3-kinase (PI3K) signaling. PTPN13 has five PDZ domains. Proteins known to interact with PTPN13 PDZ domains include: PLEKHA1 and PLEKHA2 via PTPN13-PDZ domain 1, Fas receptor and thyroid receptor-interacting protein 6 via PTPN13-PDZ domain 2, nerve growth factor receptor and protein kinase N2 via PTPN13-PDZ domain 3, PDZ and LIM domain 4 (PDLIM4) via PTPN13-PDZ domains 2 and 4, and brain calpain-2 via PTPN13-PDZ domains 3, 4 and 5. Calpain-2-mediated PTPN13 fragments may be involved in abnormal tau aggregation and increased risk for Alzheimer's disease. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13 family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467285 [Multi-domain]  Cd Length: 92  Bit Score: 40.94  E-value: 6.25e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1907082211  259 PGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKV-QP 309
Cdd:cd23072     39 PG-GPADLDGRLKPGDRLISVNDVSLEGLSHDAAVEILQNAPEDVTLVVsQP 89
PDZ2_MUPP1-like cd06667
PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
259-307 6.32e-04

PDZ domain 2 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467155 [Multi-domain]  Cd Length: 80  Bit Score: 40.73  E-value: 6.32e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211  259 PGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKV 307
Cdd:cd06667     31 PG-GVADRDGRLRSGDHILQIGDTNLRGMGSEQVAQVLRQCGSHVRLVV 78
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1448-1665 6.49e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.01  E-value: 6.49e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1448 RRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTK-----LHLEGQQvrnHELEKKQRRFDSELSQAHEETQREK 1522
Cdd:COG3206    163 EQNLELRREEARKALEFLEEQLPELRKELEEAEAALEEFRqknglVDLSEEA---KLLLQQLSELESQLAEARAELAEAE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1523 LQREKLQREKDMLLAEAFSLKQqmeekDLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDLEAKVKDQ---- 1598
Cdd:COG3206    240 ARLAALRAQLGSGPDALPELLQ-----SPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEaqri 314
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1599 ----EEELDEQAGSIQMLEQAKLRLEMEMERMRQThSKEMESRDEEVEEARQ---SCQKKLKQMEVQLEEEYED 1665
Cdd:COG3206    315 laslEAELEALQAREASLQAQLAQLEARLAELPEL-EAELRRLEREVEVARElyeSLLQRLEEARLAEALTVGN 387
RecN COG0497
DNA repair ATPase RecN [Replication, recombination and repair];
1446-1689 6.72e-04

DNA repair ATPase RecN [Replication, recombination and repair];


Pssm-ID: 440263 [Multi-domain]  Cd Length: 555  Bit Score: 45.07  E-value: 6.72e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1446 QSRRQLERRLGDLQADSDESQRALQQLKKKCQRLT-AELQdtklhlEGQQVrnhELEKKQRRfdseLSQAheetqrEKLq 1524
Cdd:COG0497    165 RAWRALKKELEELRADEAERARELDLLRFQLEELEaAALQ------PGEEE---ELEEERRR----LSNA------EKL- 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1525 REKLQREKDML----------LAEAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQESK-------DEASLAKVKK- 1586
Cdd:COG0497    225 REALQEALEALsggeggaldlLGQALRALERLAEYDPSLAELAERLESALIELEEAASELRRyldslefDPERLEEVEEr 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1587 --QLRDLEAKVKDQEEELdeqagsIQMLEQAKLRLEmemermrqthskEMESRDEEVEEarqscqkkLKQMEVQLEEEYE 1664
Cdd:COG0497    305 laLLRRLARKYGVTVEEL------LAYAEELRAELA------------ELENSDERLEE--------LEAELAEAEAELL 358
                          250       260
                   ....*....|....*....|....*
gi 1907082211 1665 DKQKALREKRELESKlsTLSDQVNQ 1689
Cdd:COG0497    359 EAAEKLSAARKKAAK--KLEKAVTA 381
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1918-2012 7.62e-04

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 41.79  E-value: 7.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1918 KRLQRQLRDTKEEM-SELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDENEdLINSL 1996
Cdd:pfam03938   18 KAAQAQLEKKFKKRqAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQE-LLQPI 96
                           90
                   ....*....|....*.
gi 1907082211 1997 QDMVTKYQKKKNKLEG 2012
Cdd:pfam03938   97 QDKINKAIKEVAKEKG 112
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1304-1484 8.03e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 44.62  E-value: 8.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1304 DEEIQQLRSKLEKVEKERNELRLSSDrletriseltseLTDERNTGESASQLLDAETAERLRTEKEMKELQTQYDALKKQ 1383
Cdd:COG3206    181 EEQLPELRKELEEAEAALEEFRQKNG------------LVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQ 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1384 MEVMEMEVMEAR--------LIRAAEINGEVDdddaggEWRLKY-------ERAVREVDFTKKRLQQELEDKMEVEQQSR 1448
Cdd:COG3206    249 LGSGPDALPELLqspviqqlRAQLAELEAELA------ELSARYtpnhpdvIALRAQIAALRAQLQQEAQRILASLEAEL 322
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1907082211 1449 RQLERRLGDLQADSDESQRALQQLKKKCQRLtAELQ 1484
Cdd:COG3206    323 EALQAREASLQAQLAQLEARLAELPELEAEL-RRLE 357
PDZ5_DrPTPN13-like cd23060
PDZ domain 5 of Danio rerio tyrosine-protein phosphatase non-receptor type 13 (Ptpn13) and ...
220-307 9.11e-04

PDZ domain 5 of Danio rerio tyrosine-protein phosphatase non-receptor type 13 (Ptpn13) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 5 of Danio rerio Ptpn13, and related domains. Protein-tyrosine phosphatases (PTPs) dephosphorylate phosphotyrosyl residues in proteins that are phosphorylated by protein tyrosine kinases (PTKs). Danio rerio Ptpn13 is a classical non-receptor-like PTP. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This PTPN13-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467273 [Multi-domain]  Cd Length: 80  Bit Score: 40.03  E-value: 9.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRRttmldrAPEGQAYRrvVHFAEPGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQS 299
Cdd:cd23060      1 QIELEKPANGGLGFSLVG------GEGGSGIF--VKSISPG-GVADRDGRLQVGDRLLQVNGESVIGLSHSKAVNILRKA 71

                   ....*...
gi 1907082211  300 GDSVRLKV 307
Cdd:cd23060     72 KGTVQLTV 79
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1296-1382 9.18e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 44.46  E-value: 9.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERntgesasqlldAETAERLRTEKEMKELQT 1375
Cdd:COG2433    404 EERELTEEEEEIRRLEEQVERLEAEVEELEAELEEKDERIERLERELSEAR-----------SEERREIRKDREISRLDR 472

                   ....*..
gi 1907082211 1376 QYDALKK 1382
Cdd:COG2433    473 EIERLER 479
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
1433-1594 9.50e-04

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 42.97  E-value: 9.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1433 LQQELEDKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQD---TKLHLEGQQVRNHELEKKQRrfds 1509
Cdd:pfam13851   27 LIKSLKEEIAELKKKEERNEKLMSEIQQENKRLTEPLQKAQEEVEELRKQLENyekDKQSLKNLKARLKVLEKELK---- 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1510 ELSQAHEEtqreKLQR-EKLQREKDMLLAE----AFSLKQQMEEKDLDIAgftQKVVSL-------EAELQDISSQESKD 1577
Cdd:pfam13851  103 DLKWEHEV----LEQRfEKVERERDELYDKfeaaIQDVQQKTGLKNLLLE---KKLQALgetlekkEAQLNEVLAAANLD 175
                          170
                   ....*....|....*...
gi 1907082211 1578 EASLAKVKKQLRD-LEAK 1594
Cdd:pfam13851  176 PDALQAVTEKLEDvLESK 193
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
1587-1901 9.69e-04

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 44.53  E-value: 9.69e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1587 QLRDLeaKVKDQEEELDEQAGSIQMLEQA--KLRLEMEMERMRQthSKEMESRDEEVEEARQSCQKKLKQmevqLEEEYE 1664
Cdd:PRK05771    32 HIEDL--KEELSNERLRKLRSLLTKLSEAldKLRSYLPKLNPLR--EEKKKVSVKSLEELIKDVEEELEK----IEKEIK 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1665 DKQKALR----EKRELESKLSTLSdqvNQRDFESEKRLRKDLKRTKALLADAQimldhlKNNAPSKREIAQLKNQLEESE 1740
Cdd:PRK05771   104 ELEEEISelenEIKELEQEIERLE---PWGNFDLDLSLLLGFKYVSVFVGTVP------EDKLEELKLESDVENVEYIST 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1741 F----TCAAAVKARKAMEV--EMEDLHLQIDDIAKAKTALEEqLSRLQREKNEIQNRLEEDQEDMNELMKKHKaavaqas 1814
Cdd:PRK05771   175 DkgyvYVVVVVLKELSDEVeeELKKLGFERLELEEEGTPSEL-IREIKEELEEIEKERESLLEELKELAKKYL------- 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1815 rdmaqmNDLQAQIEESNKEKQELQEKLQALQS----QVE-FLEQSMVD--KSLVSRQEAK---IRELETRLEFEKTQVKr 1884
Cdd:PRK05771   247 ------EELLALYEYLEIELERAEALSKFLKTdktfAIEgWVPEDRVKklKELIDKATGGsayVEFVEPDEEEEEVPTK- 319
                          330
                   ....*....|....*..
gi 1907082211 1885 LENlaSRLKETMEKLTE 1901
Cdd:PRK05771   320 LKN--PKFIKPFESLTE 334
GBP_C pfam02841
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ...
1433-1570 9.83e-04

Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.


Pssm-ID: 460721 [Multi-domain]  Cd Length: 297  Bit Score: 43.82  E-value: 9.83e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1433 LQQELEDKmEVEQQSRRQLERRLgdlqadsDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRnheLEKKQRRFDSELS 1512
Cdd:pfam02841  178 LQEFLQSK-EAVEEAILQTDQAL-------TAKEKAIEAERAKAEAAEAEQELLREKQKEEEQM---MEAQERSYQEHVK 246
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1513 QAHEETQREklqREKLQREKDMLLAeafslKQQMEEKDLDIAGFTQKVVSLEAELQDI 1570
Cdd:pfam02841  247 QLIEKMEAE---REQLLAEQERMLE-----HKLQEQEELLKEGFKTEAESLQKEIQDL 296
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
1701-1957 1.03e-03

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 44.51  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1701 DLKRTKALladaqIMLDHLKNNAPSKREiaQLKNQLE-ESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLS 1779
Cdd:pfam15964  348 NFEKTKAL-----IQCEQLKSELERQKE--RLEKELAsQQEKRAQEKEALRKEMKKEREELGATMLALSQNVAQLEAQVE 420
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1780 RLQREKNEIQNRLEEDQEDmnelMKKHKAAVAQASRDMA-QMNDLQAQIEESNKEKQELQEKlqaLQSQVEFLEQsmvdk 1858
Cdd:pfam15964  421 KVTREKNSLVSQLEEAQKQ----LASQEMDVTKVCGEMRyQLNQTKMKKDEAEKEHREYRTK---TGRQLEIKDQ----- 488
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1859 slvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDqraaaenrEKEQNKRLQRQLRDTKEE-MSELARK 1937
Cdd:pfam15964  489 --------EIEKLGLELSESKQRLEQAQQDAARAREECLKLTELLG--------ESEHQLHLTRLEKESIQQsFSNEAKA 552
                          250       260
                   ....*....|....*....|.
gi 1907082211 1938 EA-EASRKKHELEMDLESLEA 1957
Cdd:pfam15964  553 QAlQAQQREQELTQKMQQMEA 573
PDZ_RGS12-like cd06710
PDZ domain of regulator of G-protein signaling 12 (RGS12), and related domains; PDZ (PSD-95 ...
222-308 1.04e-03

PDZ domain of regulator of G-protein signaling 12 (RGS12), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of RGS12, and related domains. RGS12 downregulates GPCR signal transduction by increasing the GTPase activity of G-protein alpha subunits, thereby driving G-proteins into their inactive GDP-bound form. The RGS12 PDZ domain can bind selectively to C-terminal (A/S)-T-X-(L/V) motifs as found within both the CXCR2 IL-8 receptor, and the alternative 3' exon form of RGS12. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This RGS12-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467194 [Multi-domain]  Cd Length: 76  Bit Score: 39.92  E-value: 1.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  222 ELQRRPTGdFGFSLRrttmldrapeGQAYRRVVHFaepGAGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGD 301
Cdd:cd06710      4 EIARGRAG-YGFTIS----------GQAPCVLSCV---VRGSPADVAGLKAGDQILAVNGINVSKASHEDVVKLIGKCTG 69

                   ....*..
gi 1907082211  302 SVRLKVQ 308
Cdd:cd06710     70 VLRLVIA 76
PRK05431 PRK05431
seryl-tRNA synthetase; Provisional
1757-1848 1.07e-03

seryl-tRNA synthetase; Provisional


Pssm-ID: 235461 [Multi-domain]  Cd Length: 425  Bit Score: 43.90  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1757 MEDLHLQIDDIAKAKTALE--------EQLSRLQREKNEIQNRLEEDQEDMNEL---MKKHKAAVAQASRDMAQMNDLQA 1825
Cdd:PRK05431     1 MLDIKLIRENPEAVKEALAkrgfpldvDELLELDEERRELQTELEELQAERNALskeIGQAKRKGEDAEALIAEVKELKE 80
                           90       100
                   ....*....|....*....|...
gi 1907082211 1826 QIEESNKEKQELQEKLQALQSQV 1848
Cdd:PRK05431    81 EIKALEAELDELEAELEELLLRI 103
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1886-1995 1.13e-03

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 44.56  E-value: 1.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1886 ENLASRLKETMEKLTEERDQRAaaenREKEQNKRLQRQLRDTKEEMSELArKEAEAsrKKHELEMDLESLEAANQSLQAD 1965
Cdd:PRK11448   141 ENLLHALQQEVLTLKQQLELQA----REKAQSQALAEAQQQELVALEGLA-AELEE--KQQELEAQLEQLQEKAAETSQE 213
                           90       100       110
                   ....*....|....*....|....*....|
gi 1907082211 1966 LKLAFKRIGDlQAAIEDEMESDENEDLINS 1995
Cdd:PRK11448   214 RKQKRKEITD-QAAKRLELSEEETRILIDQ 242
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
1727-1970 1.21e-03

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 43.09  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1727 REIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKH 1806
Cdd:pfam00261    1 KKMQQIKEELDEAEERLKEAMKKLEEAEKRAEKAEAEVAALNRRIQLLEEELERTEERLAEALEKLEEAEKAADESERGR 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 KAAVAQASRDMAQMNDLQAQI-------EESNKEKQELQEKLQALQSQVEFLEQSmvdkslVSRQEAKIRELETRLEFEK 1879
Cdd:pfam00261   81 KVLENRALKDEEKMEILEAQLkeakeiaEEADRKYEEVARKLVVVEGDLERAEER------AELAESKIVELEEELKVVG 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 TQVKRLENLAsrlketmekltEERDQRaaaENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAAN 1959
Cdd:pfam00261  155 NNLKSLEASE-----------EKASER---EDKYEEQIRFLTEKLKEAETRAEFAERSVQKLEKEVDRLEDELEAEKEKY 220
                          250
                   ....*....|.
gi 1907082211 1960 QSLQADLKLAF 1970
Cdd:pfam00261  221 KAISEELDQTL 231
RseP COG0750
Membrane-associated protease RseP, regulator of RpoE activity [Posttranslational modification, ...
269-308 1.26e-03

Membrane-associated protease RseP, regulator of RpoE activity [Posttranslational modification, protein turnover, chaperones, Transcription];


Pssm-ID: 440513 [Multi-domain]  Cd Length: 349  Bit Score: 43.54  E-value: 1.26e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1907082211  269 GLVPGDRLVEINGQNVEnkSRDEIVEMIRQS-GDSVRLKVQ 308
Cdd:COG0750    145 GLQPGDRIVAINGQPVT--SWDDLVDIIRASpGKPLTLTVE 183
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1585-1672 1.29e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 41.41  E-value: 1.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1585 KKQLRDLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEVQLEEEYE 1664
Cdd:pfam03938   18 KAAQAQLEKKFKKRQAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKEQELQQLQQKAQQELQKKQQELLQPIQ 97

                   ....*....
gi 1907082211 1665 DK-QKALRE 1672
Cdd:pfam03938   98 DKiNKAIKE 106
PRK12704 PRK12704
phosphodiesterase; Provisional
1804-1951 1.36e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.00  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1804 KKHKAAVAQASRDMAQMndlqaqIEESNKEKQELQeKLQALQSQVEFLE-QSMVDKSLVSRqEAKIRELETRLEFEKTQV 1882
Cdd:PRK12704    27 KIAEAKIKEAEEEAKRI------LEEAKKEAEAIK-KEALLEAKEEIHKlRNEFEKELRER-RNELQKLEKRLLQKEENL 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1883 KRLENLASRLKETMEKLTEERDQR--------AAAENREKEQNKRLQRQLRDTKEE-----MSELARK-EAEASRKKHEL 1948
Cdd:PRK12704    99 DRKLELLEKREEELEKKEKELEQKqqelekkeEELEELIEEQLQELERISGLTAEEakeilLEKVEEEaRHEAAVLIKEI 178

                   ...
gi 1907082211 1949 EMD 1951
Cdd:PRK12704   179 EEE 181
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1622-1795 1.66e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 42.60  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1622 EMERMRQTHSKEMESRDEEVEEARQSCQKKLKQMEvQLEEEYEDKQKALREKRELESKLSTLSDQV-NQRDFES-EKRLr 1699
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELE-DLEKEIKRLELEIEEVEARIKKYEEQLGNVrNNKEYEAlQKEI- 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1700 KDLKRTKALLADaqimldhlknnapskrEIAQLKNQLEESEFTCAAAVKARKAMEVEMEDLHLQIDDIAKAktaLEEQLS 1779
Cdd:COG1579     99 ESLKRRISDLED----------------EILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAE---LEAELE 159
                          170
                   ....*....|....*.
gi 1907082211 1780 RLQREKNEIQNRLEED 1795
Cdd:COG1579    160 ELEAEREELAAKIPPE 175
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1864-1972 1.66e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.69  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1864 QEAKIRELETRLEFEKTQVKRLENLASRLKETMEklteERDQRAAaenREKEQNKRLQRQLRDTKEEMSELARKEAEASR 1943
Cdd:COG2433    404 EERELTEEEEEIRRLEEQVERLEAEVEELEAELE----EKDERIE---RLERELSEARSEERREIRKDREISRLDREIER 476
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907082211 1944 KKHELE---MDLESLEAANQSLQADLKLAFKR 1972
Cdd:COG2433    477 LERELEeerERIEELKRKLERLKELWKLEHSG 508
PDZ5_GRIP1-2-like cd06682
PDZ domain 5 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
270-308 1.78e-03

PDZ domain 5 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family domain PDZ5 is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467170 [Multi-domain]  Cd Length: 85  Bit Score: 39.64  E-value: 1.78e-03
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1907082211  270 LVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06682     46 LEPGDKLLAIDNIRLDNCSMEDAAQILQQAEDIVKLKIR 84
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1296-1514 1.79e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.28  E-value: 1.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDerntgesasqlldaetaerlrTEKEMKELQT 1375
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEA---------------------LQAEIDKLQA 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALKKQmeVMEMEVMEARLIRAAEINGEVDDDDA---GGE------WRLKYERAVREVDFTKKRLQQELEDKMEVEQQ 1446
Cdd:COG3883     73 EIAEAEAE--IEERREELGERARALYRSGGSVSYLDvllGSEsfsdflDRLSALSKIADADADLLEELKADKAELEAKKA 150
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1447 SRRQLERRLGDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSELSQA 1514
Cdd:COG3883    151 ELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAA 218
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1793-1943 1.87e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.69  E-value: 1.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1793 EEDQEDMNELMKKHKAAVAQASRDMAQ-MNDLQAQIEESNKEKQELQEKLQalqsqvefleqsmvdkslvsRQEAKIREL 1871
Cdd:COG2433    387 EKELPEEEPEAEREKEHEERELTEEEEeIRRLEEQVERLEAEVEELEAELE--------------------EKDERIERL 446
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907082211 1872 ETRLEFEKTQVKRlenlasrlketmekltEERDQRaAAENREKEqNKRLQRQLRDTKEEMSELARKEAEASR 1943
Cdd:COG2433    447 ERELSEARSEERR----------------EIRKDR-EISRLDRE-IERLERELEEERERIEELKRKLERLKE 500
PDZ3_MUPP1-like cd06791
PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) ...
272-307 1.94e-03

PDZ domain 3 of multi-PDZ-domain protein 1 (MUPP1) and PATJ (protein-associated tight junction) and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MUPP1 and PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467253 [Multi-domain]  Cd Length: 89  Bit Score: 39.52  E-value: 1.94e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1907082211  272 PGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKV 307
Cdd:cd06791     52 VNDQIIAVDGVNLQGFTNQEAVEVLRNTGQVVHLTL 87
AAA_13 pfam13166
AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA ...
1573-1937 1.96e-03

AAA domain; This family of domains contain a P-loop motif that is characteriztic of the AAA superfamily. Many of the proteins in this family are conjugative transfer proteins. This family includes the PrrC protein that is thought to be the active component of the anticodon nuclease.


Pssm-ID: 463796 [Multi-domain]  Cd Length: 712  Bit Score: 43.51  E-value: 1.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1573 QESKD-EASLAKVKKQLRDLEAKVKDQEEELdeqagsiqmlEQAKLRLEMEMERMRQTHSKEMESR-DEEVEEARQSCQK 1650
Cdd:pfam13166   89 EESIEiQEKIAKLKKEIKDHEEKLDAAEANL----------QKLDKEKEKLEADFLDECWKKIKRKkNSALSEALNGFKY 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1651 KLKQMEVQLEE--EYEDKQKALREKRELESKLSTLSDQ---------VNQRDFESEKRLRKDLKRTKALLADAQIMLDHL 1719
Cdd:pfam13166  159 EANFKSRLLREieKDNFNAGVLLSDEDRKAALATVFSDnkpeiapltFNVIDFDALEKAEILIQKVIGKSSAIEELIKNP 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1720 KNNAPSKREIAQLKNQLEESEFtC--------AAAVKAR--KAMEVEMEDLHLQIDDIAKAKTALEEQL----------S 1779
Cdd:pfam13166  239 DLADWVEQGLELHKAHLDTCPF-CgqplpaerKAALEAHfdDEFTEFQNRLQKLIEKVESAISSLLAQLpavsdlasllS 317
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1780 RLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASR---------DMAQMNDLQAQIEES----NKEKQELQEKLQALQS 1846
Cdd:pfam13166  318 AFELDVEDIESEAEVLNSQLDGLRRALEAKRKDPFKsieldsvdaKIESINDLVASINELiakhNEITDNFEEEKNKAKK 397
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1847 QVEfleqsmvdKSLVSRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLteerdqraaaenreKEQNKRLQRQLRD 1926
Cdd:pfam13166  398 KLR--------LHLVEEFKSEIDEYKDKYAGLEKAINSLEKEIKNLEAEIKKL--------------REEIKELEAQLRD 455
                          410
                   ....*....|....
gi 1907082211 1927 TK---EEMSELARK 1937
Cdd:pfam13166  456 HKpgaDEINKLLKA 469
PDZ2_GRIP1-2-like cd06681
PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related ...
220-308 2.01e-03

PDZ domain 2 of glutamate receptor-interacting protein 1 (GRIP1) and GRIP2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) binding proteins GRIP1 (ABP/GRIP2) and GRIP2, and related domains. GRIP1 and GRIP2 each have 7 PDZ domains. The interaction of GRIP1 and GRIP2 with GluA2/3 (AMPAR subunit) regulates AMPAR trafficking and synaptic targeting. GRIP1 has an essential role in regulating AMPAR trafficking during synaptic plasticity and learning and memory. GRIP1 and GRIP2 interact with a variety of other proteins associated with protein trafficking and internalization, for example GRIP1 also interacts with KIF5 (also known as kinesin 1), EphB receptors, scaffold protein liprin-alpha, and the rasGEF GRASP-1. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This GRIP family PDZ2 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467169 [Multi-domain]  Cd Length: 89  Bit Score: 39.52  E-value: 2.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPtGDFGFSLR----RTTMLDRA-------PEGQAYRRvvhfaepgaGTkdlalgLVPGDRLVEINGQNVENKS 288
Cdd:cd06681      4 EVTLEKEG-NSFGFVIRggahEDRNKSRPltvthvrPGGPADRE---------GT------IKPGDRLLSVDGISLHGAT 67
                           90       100
                   ....*....|....*....|
gi 1907082211  289 RDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd06681     68 HAEAMSILKQCGQEATLLIE 87
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1533-2010 2.02e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 43.67  E-value: 2.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1533 DMLLAEAFSLKQQMEEKDLDIagfTQKVVSLEAELQDISSQESKDEASL-----------AKVKKQLRDLEAKVKDQEEE 1601
Cdd:PTZ00440   601 EELINEALFNKEKFINEKNDL---QEKVKYILNKFYKGDLQELLDELSHflddhkylyheAKSKEDLQTLLNTSKNEYEK 677
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1602 LDEQAGSI--QMLEQAKLRLEmemermrqtHSKEMESR--DEEVEEARQSCQKKLKQMEvqleEEYEDKQKALREKRELE 1677
Cdd:PTZ00440   678 LEFMKSDNidNIIKNLKKELQ---------NLLSLKENiiKKQLNNIEQDISNSLNQYT----IKYNDLKSSIEEYKEEE 744
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1678 SKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNapsKREIAQLKNQLEESEFTCAAAVKARKAMEVEM 1757
Cdd:PTZ00440   745 EKLEVYKHQIINR----KNEFILHLYENDKDLPDGKNTYEEFLQY---KDTILNKENKISNDINILKENKKNNQDLLNSY 817
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQLsrLQREKNEIQN-RLEEDQEDMNELMKKhkaavaqasrdmaqMNDLQAQIEESNKekqe 1836
Cdd:PTZ00440   818 NILIQKLEAHTEKNDEELKQL--LQKFPTEDENlNLKELEKEFNENNQI--------------VDNIIKDIENMNK---- 877
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1837 lqeKLQALQSQVEFLEQSMVDKSLVSRQEAKIRELETRLEfektqvkrlenlaSRLKETMEKLTEERDQRAAAENREKEQ 1916
Cdd:PTZ00440   878 ---NINIIKTLNIAINRSNSNKQLVEHLLNNKIDLKNKLE-------------QHMKIINTDNIIQKNEKLNLLNNLNKE 941
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1917 NKRLQRQLRDTKeeMSELarkeaeasrkKHELEMDLESLEAANQSLQADLKLafkrigDLQAAIEDEMESDENEDLINSL 1996
Cdd:PTZ00440   942 KEKIEKQLSDTK--INNL----------KMQIEKTLEYYDKSKENINGNDGT------HLEKLDKEKDEWEHFKSEIDKL 1003
                          490
                   ....*....|....
gi 1907082211 1997 QdmvTKYQKKKNKL 2010
Cdd:PTZ00440  1004 N---VNYNILNKKI 1014
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1807-1949 2.03e-03

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 42.91  E-value: 2.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1807 KAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSmvdkslvsrQEAKIRELETRLEFEKTqvKRLE 1886
Cdd:TIGR02794   39 QAVLVDPGAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAA---------EQARQKELEQRAAAEKA--AKQA 107
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907082211 1887 NLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELE 1949
Cdd:TIGR02794  108 EQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEE 170
Prefoldin_2 pfam01920
Prefoldin subunit; This family includes prefoldin subunits that are not detected by pfam02996.
1818-1902 2.10e-03

Prefoldin subunit; This family includes prefoldin subunits that are not detected by pfam02996.


Pssm-ID: 396482 [Multi-domain]  Cd Length: 102  Bit Score: 39.90  E-value: 2.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1818 AQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQS--------MVDKSLVSR-QEAKIRELETRLEFEKTQVKRLENL 1888
Cdd:pfam01920    2 NKFQQLQQQLQLLAQQIKQLETQLKELELALEELELLdedtkvykLIGDVLVKQdKEEVKEQLEERKETLEKEIKTLEKQ 81
                           90
                   ....*....|....
gi 1907082211 1889 ASRLKETMEKLTEE 1902
Cdd:pfam01920   82 LEKLEKELEELKEE 95
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
1757-1924 2.14e-03

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 42.03  E-value: 2.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1757 MEDLHLQIDDIAKAKTALEEQ----LSRL----QRE----------KNEIQN----------RLEEDQEDMNELMKKHKA 1808
Cdd:pfam17078    5 IESLHDQIDALTKTNLQLTVQsqnlLSKLeiaqQKEskflenlaslKHENDNlssmlnrkerRLKDLEDQLSELKNSYEE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1809 AVaqasrdmAQMNDLQAQIEESNKEKQELQEKLQALQSQVEfleqSMVDKSLVSRQ--EAKIRELETRLE-FEKTQVKRL 1885
Cdd:pfam17078   85 LT-------ESNKQLKKRLENSSASETTLEAELERLQIQYD----ALVDSQNEYKDhyQQEINTLQESLEdLKLENEKQL 153
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1907082211 1886 ENLASRL----KETMEKLTEERDQRAAAENREKEQNKRLQRQL 1924
Cdd:pfam17078  154 ENYQQRIssndKDIDTKLDSYNNKFKNLDNIYVNKNNKLLTKL 196
PRK11281 PRK11281
mechanosensitive channel MscK;
1562-1863 2.16e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 43.36  E-value: 2.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1562 SLEAeLQDISSQESKDEASLAKVKKQLRDLeAKVKDQEEELDEqagsiqmLEQaklRLEMEMERMRQThSKEMES-RDEE 1640
Cdd:PRK11281    44 QLDA-LNKQKLLEAEDKLVQQDLEQTLALL-DKIDRQKEETEQ-------LKQ---QLAQAPAKLRQA-QAELEAlKDDN 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1641 VEEARQSCQK-KLKQMEVQLEE---EYEDKQKALREKRELESKLSTLSDQVNQRDFESEKRL---RKDLKRT----KALL 1709
Cdd:PRK11281   111 DEETRETLSTlSLRQLESRLAQtldQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLqqiRNLLKGGkvggKALR 190
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1710 ADAQIMLdhlknNApskrEIAQLKNQLEESEFTCAAAVKarkameveMEDLHLQIDDIAKAKTA-LEEQLSRLQREKNei 1788
Cdd:PRK11281   191 PSQRVLL-----QA----EQALLNAQNDLQRKSLEGNTQ--------LQDLLQKQRDYLTARIQrLEHQLQLLQEAIN-- 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1789 QNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQI--------EESNK-EKQELQEKLQ---ALQSQVEFLEQSMV 1856
Cdd:PRK11281   252 SKRLTLSEKTVQEAQSQDEAARIQANPLVAQELEINLQLsqrllkatEKLNTlTQQNLRVKNWldrLTQSERNIKEQISV 331

                   ....*....
gi 1907082211 1857 DK-SLV-SR 1863
Cdd:PRK11281   332 LKgSLLlSR 340
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1433-1672 2.23e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 42.05  E-value: 2.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1433 LQQELEDKMEVEQQSRRQLERRLGDLQADSDESQraLQQLKKKCQRLTAELQdtklhleGQQVRNHELEKKQRRFDSELS 1512
Cdd:cd00176      1 KLQQFLRDADELEAWLSEKEELLSSTDYGDDLES--VEALLKKHEALEAELA-------AHEERVEALNELGEQLIEEGH 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1513 QAHEETQReklQREKLQREKDMLLAEAFSLKQQMEEKdLDIAGFTQKVVSLEAELQDISSQESKDE--ASLAKVKKQLRD 1590
Cdd:cd00176     72 PDAEEIQE---RLEELNQRWEELRELAEERRQRLEEA-LDLQQFFRDADDLEQWLEEKEAALASEDlgKDLESVEELLKK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1591 LeakvKDQEEELDEQAGSIQMLEQAKLRLEmemermrqthskemESRDEEVEEARQSCQKKLKQMEVQLEEEYEDKQKAL 1670
Cdd:cd00176    148 H----KELEEELEAHEPRLKSLNELAEELL--------------EEGHPDADEEIEEKLEELNERWEELLELAEERQKKL 209

                   ..
gi 1907082211 1671 RE 1672
Cdd:cd00176    210 EE 211
DUF4200 pfam13863
Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil ...
1748-1853 2.35e-03

Domain of unknown function (DUF4200); This family is found in eukaryotes. It is a coiled-coil domain of unknwon function.


Pssm-ID: 464003 [Multi-domain]  Cd Length: 119  Bit Score: 40.24  E-value: 2.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1748 KARKAMEVEM------EDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMN 1821
Cdd:pfam13863    4 KKREMFLVQLaldakrEEIERLEELLKQREEELEKKEQELKEDLIKFDKFLKENDAKRRRALKKAEEETKLKKEKEKEIK 83
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1907082211 1822 DLQAQIEESNKEKQELQEKLQALQSQVEFLEQ 1853
Cdd:pfam13863   84 KLTAQIEELKSEISKLEEKLEEYKPYEDFLEK 115
PDZ1_harmonin cd06737
PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic ...
220-310 2.36e-03

PDZ domain 1 of harmonin isoforms a, b, and c, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of harmonin isoforms a, b, and c, and related domains. Harmonin (also known as Usher Type 1C, PDZ-73 and AIE-75) is a key organizer of the Usher (USH) protein interactome. USH syndrome is the leading cause of hereditary sensory deaf-blindness in humans; three clinically distinct types of USH have been identified, type 1 to 3. The gene encoding harmonin (USH1C) is the causative gene for the USH type 1C phenotype. There are at least 10 alternatively spliced isoforms of harmonin, which are divided into three subclasses (a, b, and c). All isoforms contain the first two PDZ domains and the first coiled-coil domain. The a and b isoforms all have a third PDZ domain. The different PDZ domains are responsible for interactions with all known Usher syndrome type 1 proteins, and most Usher syndrome type 2 proteins. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This harmonin family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467219 [Multi-domain]  Cd Length: 85  Bit Score: 39.16  E-value: 2.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  220 ELELQRRPTGDFGFSLRR---------TTMLDraPEGQAYRRvvhfaepgagtkdlalGLVPGDRLVEINGQNVENKSRD 290
Cdd:cd06737      4 LVRLDRRGPESLGFSVRGglehgcglfVSHVS--PGSQADNK----------------GLRVGDEIVRINGYSISQCTHE 65
                           90       100
                   ....*....|....*....|
gi 1907082211  291 EIVEMIRQSgDSVRLKVQPI 310
Cdd:cd06737     66 EVINLIKTK-KTVSLKVRHV 84
Rabaptin pfam03528
Rabaptin;
1655-2076 2.39e-03

Rabaptin;


Pssm-ID: 367545 [Multi-domain]  Cd Length: 486  Bit Score: 43.17  E-value: 2.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1655 MEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKrlrkDLKRTKALLADAQIMLDHLKNN-APSKREIAQLK 1733
Cdd:pfam03528    6 LQQRVAELEKENAEFYRLKQQLEAEFNQKRAKFKELYLAKEE----DLKRQNAVLQEAQVELDALQNQlALARAEMENIK 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1734 NQLEESEFTCAAAV-KARKAMEVEMEDLHLQIDDIAKA-----KTALEEQLSR-------LQREKNEIQNRLEEDQEDMN 1800
Cdd:pfam03528   82 AVATVSENTKQEAIdEVKSQWQEEVASLQAIMKETVREyevqfHRRLEQERAQwnqyresAEREIADLRRRLSEGQEEEN 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1801 ---------ELMKKHKAAVAQASRDMAQMNDLQAQIEESNKE-----KQELQEKLQALQSQVEFLEQSM----VDKSLVS 1862
Cdd:pfam03528  162 ledemkkaqEDAEKLRSVVMPMEKEIAALKAKLTEAEDKIKEleaskMKELNHYLEAEKSCRTDLEMYVavlnTQKSVLQ 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1863 RQEAKIR----ELETRLEFEKTQVKRL--------------ENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQL 1924
Cdd:pfam03528  242 EDAEKLRkelhEVCHLLEQERQQHNQLkhtwqkandqflesQRLLMRDMQRMESVLTSEQLRQVEEIKKKDQEEHKRART 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1925 RDTKEEMSE-----------LARKEAEASRKKHELEMDLESLEAANQSLQADLKL----AFKRIGDLqaaIEDEMESDEN 1989
Cdd:pfam03528  322 HKEKETLKSdrehtvsihavFSPAGVETSAPLSNVEEQINSAHGSVHSLDTDVVLgagdSFNKQEDP---FKEGLRRAQS 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1990 EDLINSLQDMVTKYQKKKNKLEGDSDVD-SELEDRV--DGV-KSWLSKNKGPSKAPSDDGSLKSSRTALNSLPKEgkgpE 2065
Cdd:pfam03528  399 TDSLGSSSSLQHKFLGHNQKAKSAGNLDeSDFGPLVgaDSVsENFDTSSLGSLKMPSGFMLTKDQEKAIKAMTPE----Q 474
                          490
                   ....*....|.
gi 1907082211 2066 ERPSSVLSSLS 2076
Cdd:pfam03528  475 EETASLLSSVT 485
PRK11281 PRK11281
mechanosensitive channel MscK;
1724-2021 2.50e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 43.36  E-value: 2.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1724 PSKREIaqlKNQLEeseftcaaAVKARKAMEVE-------MEDLHLQIDDIAKAK---TALEEQLSRLQREKNEIQNRLE 1793
Cdd:PRK11281    36 PTEADV---QAQLD--------ALNKQKLLEAEdklvqqdLEQTLALLDKIDRQKeetEQLKQQLAQAPAKLRQAQAELE 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1794 EdqedmnelMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQvefleqsmvdksLVSRQEAKIReLET 1873
Cdd:PRK11281   105 A--------LKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQ------------LVSLQTQPER-AQA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1874 RLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELArkeaeasRKKHELemdle 1953
Cdd:PRK11281   164 ALYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLL-------QKQRDY----- 231
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1954 sLEAANQSLQadlklafKRIGDLQAAIedemesdeNEDLINSLQDMVTKYQ--KKKNKLEGDSDVDSELE 2021
Cdd:PRK11281   232 -LTARIQRLE-------HQLQLLQEAI--------NSKRLTLSEKTVQEAQsqDEAARIQANPLVAQELE 285
DUF4795 pfam16043
Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. ...
1786-1911 2.64e-03

Domain of unknown function (DUF4795); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 285 and 978 amino acids in length.


Pssm-ID: 464990 [Multi-domain]  Cd Length: 181  Bit Score: 41.14  E-value: 2.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1786 NEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEK-------QELQEK--LQALQSQVEF------ 1850
Cdd:pfam16043   10 DQLQALILDLQEELEKLSETTSELSERLQQRQKHLEALYQQIEKLEKVKadkevveEELDEKadKEALASKVSRdqfdet 89
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1851 ---LEQSMVDK-SLVSRQEAKIR---ELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAEN 1911
Cdd:pfam16043   90 leeLNQMLQELlDKLEGQEDAWKkalETLSEELDTKLDRLELDPLKELLERRIKALQKLLQEGSEELD 157
CAGE1 pfam15066
Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in ...
1733-1957 2.80e-03

Cancer-associated gene protein 1 family; CAGE-1 is a family of proteins overexpressed in tumour tissues compared with surrounding tissues. CAGE-1 gene showed testis-specific expression among normal tissues and displayed wide expression in a variety of cancer cell lines and cancer tissues. CAGE-1 is predominantly expressed during post-meiotic stages. It localizes to the acrosomal matrix and acrosomal granule showing it to be a component of the acrosome of mammalian spermatids and spermatozoa.


Pssm-ID: 464481  Cd Length: 528  Bit Score: 42.90  E-value: 2.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1733 KNQLEESEFTCAAAVKARKAMEVEMEDLHLQ--IDDIAKAKTALEEQLS---RLQREKNEIQNRLEEDQEdmnelmkkhk 1807
Cdd:pfam15066  329 KQQMQIQDLQCSNLYLEKKVKELQMKITKQQvfVDIINKLKENVEELIEdkyNVILEKNDINKTLQNLQE---------- 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1808 aavaqasrdmaQMNDLQAQIEESNKEKQELQEKLQALQ-SQVEFLEQSMVDKSLVSRQEAKIRELETRLEFEKTQVKRLE 1886
Cdd:pfam15066  399 -----------ILANTQKHLQESRKEKETLQLELKKIKvNYVHLQERYITEMQQKNKSVSQCLEMDKTLSKKEEEVERLQ 467
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907082211 1887 nlasRLKETMEKLTEerdqrAAAENREKEQNKRLQRQLRDTKeemsELARKEAEASRKKHELEMDLESLEA 1957
Cdd:pfam15066  468 ----QLKGELEKATT-----SALDLLKREKETREQEFLSLQE----EFQKHEKENLEERQKLKSRLEKLVA 525
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1456-1600 3.12e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 42.49  E-value: 3.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1456 GDLQADSDESQRALQQLKKKCQRLTAELQDTKlhlEGQQVRNHELEKKQRRFDSELSQAhEETQREKLQREKLQREKDML 1535
Cdd:PRK09510    65 NRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQ---AAEQERLKQLEKERLAAQEQKKQA-EEAAKQAALKQKQAEEAAAK 140
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1536 LAEAFSLKQQMEEKDLDIAGftqKVVSLEAELQDISSQESKDEaslAKVKKQLrDLEAKVKDQEE 1600
Cdd:PRK09510   141 AAAAAKAKAEAEAKRAAAAA---KKAAAEAKKKAEAEAAKKAA---AEAKKKA-EAEAAAKAAAE 198
PDZ1_Scribble-like cd06704
PDZ domain 1 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 ...
226-308 3.41e-03

PDZ domain 1 of Drosophila Scribble, human Scribble homolog, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 1 of Drosophila Scribble (also known as LAP4), human Scribble homolog (also known as hScrib, LAP4, CriB1, ScrB1 and Vartul), and related domains. They belong to the LAP family, which describes proteins that contain either one or four PDZ domains and 16 LRRs (leucine-rich repeats) and function in controlling cell shape, size and subcellular protein localization. In Drosophila, the Scribble complex, comprising Scribble, discs large, and lethal giant larvae, plays a role in apico-basal cell polarity, in other forms of polarity, including regulation of the actin cytoskeleton, cell signaling and vesicular trafficking, and in tumor development. Mammalian Scribble is important in many aspects of cancer development. Scribble and its homologs can be downregulated or overexpressed in cancer; they have a role in cancer beyond their function in loss of cell polarity. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Scribble-like family PDZ1 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467188 [Multi-domain]  Cd Length: 87  Bit Score: 38.80  E-value: 3.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  226 RPTGDFGFSL---RRTTMLDRAPEGQAYRRVvhfAEPGAGTKDlalGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDS 302
Cdd:cd06704      7 RQTGGLGISIaggKGSTPYKGDDEGIFISRV---TEGGPAAKA---GVRVGDKLLEVNGVDLVDADHHEAVEALKNSGNT 80

                   ....*.
gi 1907082211  303 VRLKVQ 308
Cdd:cd06704     81 VTMVVL 86
PRK11637 PRK11637
AmiB activator; Provisional
1777-1966 3.54e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 42.37  E-value: 3.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1777 QLSRLQREKNEIQNRLEEDQEDMNEL---MKKHKAAVAQASRdmaQMNDLQAQIEESNKEKQELQEKLQALQSQvefleQ 1853
Cdd:PRK11637    48 QLKSIQQDIAAKEKSVRQQQQQRASLlaqLKKQEEAISQASR---KLRETQNTLNQLNKQIDELNASIAKLEQQ-----Q 119
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1854 SMVDKSLVSRQEAKIRE-----LETRLEFEKTQVK-RLENLASRL----KETMEKLTEERDQrAAAENREKEQNKRLQRQ 1923
Cdd:PRK11637   120 AAQERLLAAQLDAAFRQgehtgLQLILSGEESQRGeRILAYFGYLnqarQETIAELKQTREE-LAAQKAELEEKQSQQKT 198
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 1907082211 1924 -LRDTKEEMSELarKEAEASRKKhelemDLESLEAANQSLQADL 1966
Cdd:PRK11637   199 lLYEQQAQQQKL--EQARNERKK-----TLTGLESSLQKDQQQL 235
ALIX_LYPXL_bnd pfam13949
ALIX V-shaped domain binding to HIV; The binding of the LYPxL motif of late HIV p6Gag and EIAV ...
1758-2011 4.35e-03

ALIX V-shaped domain binding to HIV; The binding of the LYPxL motif of late HIV p6Gag and EIAV p9Gag to this domain is necessary for viral budding.This domain is generally central between an N-terminal Bro1 domain, pfam03097 and a C-terminal proline-rich domain. The retroviruses thus used this domain to hijack the ESCRT system of the cell.


Pssm-ID: 464053 [Multi-domain]  Cd Length: 294  Bit Score: 41.84  E-value: 4.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQLsrlqrekNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMA------QMNDLQAQIEESN 1831
Cdd:pfam13949   20 ERLEKSLDDLPKLKQRNREIL-------DEAEKLLDEEESEDEQLRAKYGTRWTRPPSSELtatlraEIRKYREILEQAS 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1832 KEKQELQEKLQALQSQVEFLEQSMVD--KSLVSRQEAKIREletrlefektqvkRLENLASRLKETMEKLTEERDQRAAA 1909
Cdd:pfam13949   93 ESDSQVRSKFREHEEDLELLSGPDEDleAFLPSSRRAKNSP-------------SVEEQVAKLRELLNKLNELKREREQL 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1910 EN--REKEQNKRLQRQLRDTKEEMSELARKEA---EASRKKHELEMDLESLEAANQSLQADLKLAFKRIgdLQAAIEDEM 1984
Cdd:pfam13949  160 LKdlKEKARNDDISPKLLLEKARLIAPNQEEQlfeEELEKYDPLQNRLEQNLHKQEELLKEITEANNEF--LQDKRVDSE 237
                          250       260
                   ....*....|....*....|....*..
gi 1907082211 1985 ESDENEDLINSLQDMVTKYQKKKNKLE 2011
Cdd:pfam13949  238 KQRQREEALQKLENAYDKYKELVSNLQ 264
PDZ_Lin-7-like cd06796
PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), ...
259-311 4.45e-03

PDZ domain of protein Lin-7 and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Lin-7 (also known as LIN-7 or LIN7), and related domains. Lin-7 targets and organize protein complexes to epithelial and synaptic plasma membranes. There are three mammalian Lin-7 homologs: Lin-7A (protein lin-7 homolog A, also known as mammalian lin-seven protein 1 (MALS-1), vertebrate lin-7 homolog 1 (Veli-1), tax interaction protein 33); Lin-7B (also known as MALS-2, Veli-2); and Lin-7C (also known as MALS-3, Veli-3). Lin-7 is involved in localization of the Let-23 growth factor receptor to the basolateral membrane of epithelial cells, in tight junction localization of insulin receptor substrate p53 (IRSp53), in retaining gamma-aminobutyric (GABA) transporter (BGT-1) at the basolateral surface of epithelial cells, and in regulating recruitment of neurotransmitter receptors to the postsynaptic density (PSD). The Lin7 PDZ domain binds Let-23, BGT and beta-catenin, and NMDA (N-methyl-D-aspartate) receptor NR2B. Lin-7 also binds to the PDZ binding motif located in the C-terminal tail of Rhotekin, an effector protein for small GTPase Rho. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Lin-7-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467258 [Multi-domain]  Cd Length: 86  Bit Score: 38.57  E-value: 4.45e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1907082211  259 PGaGTKDLALGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQPIP 311
Cdd:cd06796     35 PG-GVADRHGGLKRGDQLLSVNGVSVEGEHHEKAVELLKAAQGSVKLVVRYTP 86
PDZ_CASK-like cd10831
PDZ domain of peripheral plasma membrane protein CASK, Caenorhabditis Lin-2, and related ...
273-309 4.73e-03

PDZ domain of peripheral plasma membrane protein CASK, Caenorhabditis Lin-2, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of CASK, Caenorhabditis elegans Lin-2, and related domains. CASK and Lin-2 are membrane-associated guanylate kinase (MAGUK)-like proteins. CASK (also known as Calcium/calmodulin-dependent protein kinase, CAKI, and Camguk) has a role in synaptic transmembrane protein anchoring and ion channel trafficking. CASK may regulate transmembrane proteins that bind calcium, calmodulin, or nucleotides; it regulates the Drosophila ether a go-go (eag) potassium channel, and also regulates autophosphorylation of CaMKII. CASK binding partners include the transcription factor TBR1, and cell-surface proteins, including amyloid precursor protein, neurexins, and syndecans. Lin-2, as a component of the CLin-10-Lin-2-Lin-7 complex, plays a role in controlling the basolateral localization of the EGF receptor Let-23; this complex also associates with the neuron-specific motor protein KIF17 to transport vesicles containing N-methyl-D-aspartate (NMDA) receptor 2B along microtubules. CASK may also function in targeting or scaffolding of the protein parkin which is selectively truncated by a Parkinson's disease-causing mutation; the C-terminus of parkin functions as a class II PDZ-binding motif that binds CASK. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MPP6-MPP2-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467267 [Multi-domain]  Cd Length: 81  Bit Score: 38.23  E-value: 4.73e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907082211  273 GDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd10831     45 GDEIREINGISVANQTVEQLQKMLREARGSITFKIVP 81
PDZ_MAST1 cd23073
PDZ domain of microtubule-associated serine-threonine (MAST) protein kinase 1; PDZ (PSD-95 ...
225-314 4.76e-03

PDZ domain of microtubule-associated serine-threonine (MAST) protein kinase 1; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of MAST family kinase MAST1, and related domains. MAST1 belongs to the MAST family kinases, which include MAST1-4. These MAST proteins contain a DUF1908 domain, a serine/threonine kinase domain, a AGC-kinase C-terminal domain, and a PDZ domain; MAST family member MASTL is a shorter protein lacking the PDZ domain. MAST1 functions as a scaffold protein to link the dystrophin/utrophin network with microfilaments via syntrophin, and it has been identified as a main driver of cisplatin resistance in human cancers. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAST1 family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F


Pssm-ID: 467286 [Multi-domain]  Cd Length: 95  Bit Score: 38.47  E-value: 4.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  225 RRPTGDFGFSLRRTTMLDRAPEGQAYRRVVHFAEPGAGTKDLalGLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVR 304
Cdd:cd23073      8 QRSGKKYGFTLRAIRVYMGDSDVYSVHHIVWHVEEGGPAQEA--GLCAGDLITHVNGEPVHGMVHPEVVELILKSGNKVA 85
                           90
                   ....*....|
gi 1907082211  305 LKVQPIPELS 314
Cdd:cd23073     86 VTTTPFENTS 95
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1790-1996 4.76e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 41.72  E-value: 4.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1790 NRLEEDQEDMNELMKKHKAAVAQASRDMAQmndlqAQIEESNKEKQELQEKLQALQSQVEFLEQSMVdkslvSRQEAKIR 1869
Cdd:PRK09510    65 NRQQQQQKSAKRAEEQRKKKEQQQAEELQQ-----KQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ-----AALKQKQA 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1870 ELETRLEFEKTQVKRlENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEA-------- 1941
Cdd:PRK09510   135 EEAAAKAAAAAKAKA-EAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAeakkkaaa 213
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1942 -SRKKHELEMDLESLEAANQSLQADLKLAFKRIGDLQAAIEDEMESDeneDLINSL 1996
Cdd:PRK09510   214 eAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD---DLFGGL 266
PspA_IM30 pfam04012
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent ...
1817-2025 4.87e-03

PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.


Pssm-ID: 461130 [Multi-domain]  Cd Length: 215  Bit Score: 40.82  E-value: 4.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1817 MAQMNDLQAQIEESNKEkqeLQEKLQALQSQVEFLEQSMVdkslvsRQEAKIRELETRLEFEKTQVKRLENLAsrlKETM 1896
Cdd:pfam04012   10 RANIHEGLDKAEDPEKM---LEQAIRDMQSELVKARQALA------QTIARQKQLERRLEQQTEQAKKLEEKA---QAAL 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1897 EKLTEERDQRAAaenrekEQNKRLQRQLRDTKEEMSELARKEAEASRKKHELEMDLESLEAANQSLQADLKLAfkrigDL 1976
Cdd:pfam04012   78 TKGNEELAREAL------AEKKSLEKQAEALETQLAQQRSAVEQLRKQLAALETKIQQLKAKKNLLKARLKAA-----KA 146
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1907082211 1977 QAAIEDEMESDENEDLINSLQDMvtkyQKKKNKLEGDSDVDSELEDRVD 2025
Cdd:pfam04012  147 QEAVQTSLGSLSTSSATDSFERI----EEKIEEREARADAAAELASAVD 191
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
1291-2017 4.96e-03

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 42.51  E-value: 4.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1291 IQVQLSEEQIRNKDEEIQQLRSKLEKVEKERNelrlssDRLETRISEltseltderntgesasqlLDAETAERLRTEKEM 1370
Cdd:PTZ00440   945 IEKQLSDTKINNLKMQIEKTLEYYDKSKENIN------GNDGTHLEK------------------LDKEKDEWEHFKSEI 1000
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1371 KELQTQYDALKKQMEVMemevmearliraaeINGEVDD---------DDAGGEWRLKYERAVREVDFTKKRLQ-----QE 1436
Cdd:PTZ00440  1001 DKLNVNYNILNKKIDDL--------------IKKQHDDiielidkliKEKGKEIEEKVDQYISLLEKMKTKLSsfhfnID 1066
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1437 LE-DKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKcqrltaeLQDTKLHLEGQQVRNHELEKKQRRFDSELSQAH 1515
Cdd:PTZ00440  1067 IKkYKNPKIKEEIKLLEEKVEALLKKIDENKNKLIEIKNK-------SHEHVVNADKEKNKQTEHYNKKKKSLEKIYKQM 1139
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1516 EETQREKLQREklqrEKDMLLAEAFSLKQQMEEK--DLDIAGFTQKVVSLEAELQDISSQESKDEASLAKVKKQLRDL-- 1591
Cdd:PTZ00440  1140 EKTLKELENMN----LEDITLNEVNEIEIEYERIliDHIVEQINNEAKKSKTIMEEIESYKKDIDQVKKNMSKERNDHlt 1215
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1592 EAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQThsKEMESRDEEVEEARQscqkklkqmevQLEEEYEDKQKALR 1671
Cdd:PTZ00440  1216 TFEYNAYYDKATASYENIEELTTEAKGLKGEANRSTNV--DELKEIKLQVFSYLQ-----------QVIKENNKMENALH 1282
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1672 EKRELESKL-STLSDQV-------NQRDFESEKRLRKDLKRTKALLADAQIMLDHLKNNApSKREIAQLKNQLEESeftC 1743
Cdd:PTZ00440  1283 EIKNMYEFLiSIDSEKIlkeilnsTKKAEEFSNDAKKELEKTDNLIKQVEAKIEQAKEHK-NKIYGSLEDKQIDDE---I 1358
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1744 AAAVKARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEI---QNRLEEDQEDMNEL-MKKHKAAVAQASRDMAQ 1819
Cdd:PTZ00440  1359 KKIEQIKEEISNKRKEINKYLSNIKSNKEKCDLHVRNASRGKDKIdflNKHEAIEPSNSKEVnIIKITDNINKCKQYSNE 1438
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1820 MNDLQAQIEESNKEKQELQEKLQALQSQVEFLEqsMVDKSLVSRQEAkiRELETRLEFEKTQVK-RLENLASRLKETMEK 1898
Cdd:PTZ00440  1439 AMETENKADENNDSIIKYEKEITNILNNSSILG--KKTKLEKKKKEA--TNIMDDINGEHSIIKtKLTKSSEKLNQLNEQ 1514
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1899 LTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELarkeaeaSRKKHELEmdlESLEAANqslqaDLKLAFKRIGDLQA 1978
Cdd:PTZ00440  1515 PNIKREGDVLNNDKSTIAYETIQYNLGRVKHNLLNI-------LNIKDEIE---TILNKAQ-----DLMRDISKISKIVE 1579
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1907082211 1979 AIE-DEMESDENE--DLINSLQDMVTKYQKKKNKLEG-DSDVD 2017
Cdd:PTZ00440  1580 NKNlENLNDKEADyvKYLDNILKEKQLMEAEYKKLNEiYSDVD 1622
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1557-1654 4.98e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 39.49  E-value: 4.98e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1557 TQKVVSLEAELQDISSQeskDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSiqMLEQAKLRLEMEMERMRQTHSKEMES 1636
Cdd:smart00935    6 VQKILQESPAGKAAQKQ---LEKEFKKRQAELEKLEKELQKLKEKLQKDAAT--LSEAAREKKEKELQKKVQEFQRKQQK 80
                            90
                    ....*....|....*...
gi 1907082211  1637 RDEEVEEARQSCQKKLKQ 1654
Cdd:smart00935   81 LQQDLQKRQQEELQKILD 98
LXG pfam04740
LXG domain of WXG superfamily; This domain is present is the N-terminal region of a group of ...
1785-1880 5.25e-03

LXG domain of WXG superfamily; This domain is present is the N-terminal region of a group of polymorphic toxin proteins in bacteria. It is predicted to use Type VII secretion pathway to mediate export of bacterial toxins.


Pssm-ID: 428100 [Multi-domain]  Cd Length: 202  Bit Score: 40.69  E-value: 5.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1785 KNEIQNRLEEDQEDMNELMKKHKAAVAQASR----DMAQMNDLQAQIEESNKEKQELQEKLQAL-QSQVEFLEQSmvdKS 1859
Cdd:pfam04740  101 EHELENGLKKAKEKTEELTDEINSILASVSDivslPKLSDSEVQDSLQKAKKKVKDTIEKLYDFdQEQTSELSEL---EA 177
                           90       100
                   ....*....|....*....|.
gi 1907082211 1860 LVSRQEAKIRELETRLEFEKT 1880
Cdd:pfam04740  178 DLQALKTYVSELEEMTSNGKI 198
PRK11281 PRK11281
mechanosensitive channel MscK;
1351-1610 5.56e-03

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 42.21  E-value: 5.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1351 SASQLLDAETAERLRTEKEmkeLQTQYDALKKQMEVmemevmearliraaeingEVDDddaggewrlkyeravrevdftk 1430
Cdd:PRK11281    22 LSSAFARAASNGDLPTEAD---VQAQLDALNKQKLL------------------EAED---------------------- 58
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1431 KRLQQELE------DKMEVEQQSRRQLERRLGDLQADSDESQRALQQLKKKcqrltaELQDTKLHLEGQQVRnhELEKKQ 1504
Cdd:PRK11281    59 KLVQQDLEqtlallDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDD------NDEETRETLSTLSLR--QLESRL 130
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1505 RRFDSELSQAHEE------------TQREKLQRE---KLQR---------------------EKDMLLAEAFSLKQQMEE 1548
Cdd:PRK11281   131 AQTLDQLQNAQNDlaeynsqlvslqTQPERAQAAlyaNSQRlqqirnllkggkvggkalrpsQRVLLQAEQALLNAQNDL 210
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907082211 1549 KDLDIAGFTQkvvsleaeLQDIsSQESKDEASL--AKVKKQLRDLE----AKVKDQEEELDEQAGSIQ 1610
Cdd:PRK11281   211 QRKSLEGNTQ--------LQDL-LQKQRDYLTAriQRLEHQLQLLQeainSKRLTLSEKTVQEAQSQD 269
mS26_Tt cd23695
Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is ...
1414-1736 5.76e-03

Tetrahymena thermophila ribosomal protein mS26 and similar proteins; Ribosomal protein mS26 is a component of small subunit (SSU) in Tetrahymena thermophila mitochondrial ribosome (mitoribosome). The structure of the mitoribosome reveals an assembly of 94-ribosomal proteins and four-rRNAs with an additional protein mass of ~700 kDa on the small subunit; the large mitoribosomal subunit (LSU) lacks 5S rRNA.


Pssm-ID: 467909 [Multi-domain]  Cd Length: 496  Bit Score: 41.73  E-value: 5.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1414 EWRLK--YERAVREVDFTKKRLQQELEDKMEveqqsrrQLERRLGDLQADSDESqraLQQLKKKCQRLTAELQDtklhle 1491
Cdd:cd23695    193 LFKLLkeYQDAKAIIIEDFRESSEEGAEKLE-------KLEKAFATLLKNYKEE---LEEPEKQLEFMQKRLLD------ 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1492 gqqvrnheLEKKQRRFDSELSQAH--EETQREKLQREKLQREKdmllaeaFSLKQQMEEKDLDIAGFTQKVVSLEAELQD 1569
Cdd:cd23695    257 --------LYNLLRLWGQYITIVKmpDSVVRDIMNKTQARPEV-------AKLNSKQELEDAKNRKRDTEENEFDDDYES 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1570 ISSQESKDEaslakvkkqlrdleakvkdqEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQSCQ 1649
Cdd:cd23695    322 ADEGETSDE--------------------EDEIEEENFQLQKEKKKEEELNAEFNIAKNSLYKFSPQNDKNVVDDRDFYS 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1650 KklKQMEVQLEEE------------YEDKQKALREKRELESkLSTLSDQVNQRDFESEKRLRKDLKrtkalladAQIMLD 1717
Cdd:cd23695    382 G--VDLENVFPRAllnnlndftgldFQNVKEILNNEEKLKI-IQGEDDQNDQEDFNNPRKFQTSLI--------VQTYKQ 450
                          330       340
                   ....*....|....*....|.
gi 1907082211 1718 HLKN--NAPSKREIAQLKNQL 1736
Cdd:cd23695    451 KINNldAESLTRATQEKKNDI 471
OmpH pfam03938
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1755-1849 5.88e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 461098 [Multi-domain]  Cd Length: 140  Bit Score: 39.48  E-value: 5.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1755 VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQM-NDLQAQIEESNKE 1833
Cdd:pfam03938    5 VDMQKILEESPEGKAAQAQLEKKFKKRQAELEAKQKELQKLYEELQKDGALLEEEREEKEQELQKKeQELQQLQQKAQQE 84
                           90
                   ....*....|....*..
gi 1907082211 1834 -KQELQEKLQALQSQVE 1849
Cdd:pfam03938   85 lQKKQQELLQPIQDKIN 101
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1432-1623 5.92e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 41.86  E-value: 5.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1432 RLQQELEDKMEVEQQSRRQLE-RRLGDLQADSDESQRAlqQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRR---F 1507
Cdd:pfam15709  339 RAERAEMRRLEVERKRREQEEqRRLQQEQLERAEKMRE--ELELEQQRRFEEIRLRKQRLEEERQRQEEEERKQRLqlqA 416
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1508 DSELSQAHEETQREKLQreKLQREKDMLLAE-AFSLKQQMEEKDLDIAGfTQKVVSLEAELQDISSQESKDEAslakvkK 1586
Cdd:pfam15709  417 AQERARQQQEEFRRKLQ--ELQRKKQQEEAErAEAEKQRQKELEMQLAE-EQKRLMEMAEEERLEYQRQKQEA------E 487
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1907082211 1587 QLRDLEAKVKDQEEEldeqagsiqmlEQAKLRLEMEM 1623
Cdd:pfam15709  488 EKARLEAEERRQKEE-----------EAARLALEEAM 513
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1751-1937 5.95e-03

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 41.60  E-value: 5.95e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1751 KAMEVEMEDLHLQIDDIAKAktaLEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLqaQIEES 1830
Cdd:pfam04108   20 RSLLEELVVLLAKIAFLRRG---LSVQLANLEKVREGLEKVLNELKKDFKQLLKDLDAALERLEETLDKLRNT--PVEPA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1831 NKEKQELQEKLqalqsqvefleQSMVDKSLVSRQEAKIRELETRLefeKTQVKRLENLASRLKETMEKLTEERDQRAAAe 1910
Cdd:pfam04108   95 LPPGEEKQKTL-----------LDFIDEDSVEILRDALKELIDEL---QAAQESLDSDLKRFDDDLRDLQKELESLSSP- 159
                          170       180
                   ....*....|....*....|....*..
gi 1907082211 1911 NREKEQNKRLQRQLRDTKEEMSELARK 1937
Cdd:pfam04108  160 SESISLIPTLLKELESLEEEMASLLES 186
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1305-1602 6.00e-03

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 41.05  E-value: 6.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEIQQLRSKLEKVEKERNELRlssdrletrisELTSELTDERNtgESASQLldAETAERLRTEKEMK-ELQTQYDALKKQ 1383
Cdd:COG1340     15 EKIEELREEIEELKEKRDELN-----------EELKELAEKRD--ELNAQV--KELREEAQELREKRdELNEKVKELKEE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1384 mevmemevmeaRLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQeLEDKMEVEQQSR---RQLERRLGDLQA 1460
Cdd:COG1340     80 -----------RDELNEKLNELREELDELRKELAELNKAGGSIDKLRKEIER-LEWRQQTEVLSPeeeKELVEKIKELEK 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1461 DSDESQRALQQlKKKCQRLTAELQdtklhlegqqvrnhELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAF 1540
Cdd:COG1340    148 ELEKAKKALEK-NEKLKELRAELK--------------ELRKEAEEIHKKIKELAEEAQELHEEMIELYKEADELRKEAD 212
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1541 SLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQES--KDEASLAKVKKQLRDLEAKVKDQEEEL 1602
Cdd:COG1340    213 ELHKEIVEAQEKADELHEEIIELQKELRELRKELKklRKKQRALKREKEKEELEEKAEEIFEKL 276
GAS pfam13851
Growth-arrest specific micro-tubule binding; This family is the highly conserved central ...
1801-1976 6.28e-03

Growth-arrest specific micro-tubule binding; This family is the highly conserved central region of a number of metazoan proteins referred to as growth-arrest proteins. In mouse, Gas8 is predominantly a testicular protein, whose expression is developmentally regulated during puberty and spermatogenesis. In humans, it is absent in infertile males who lack the ability to generate gametes. The localization of Gas8 in the motility apparatus of post-meiotic gametocytes and mature spermatozoa, together with the detection of Gas8 also in cilia at the apical surfaces of epithelial cells lining the pulmonary bronchi and Fallopian tubes suggests that the Gas8 protein may have a role in the functioning of motile cellular appendages. Gas8 is a microtubule-binding protein localized to regions of dynein regulation in mammalian cells.


Pssm-ID: 464001 [Multi-domain]  Cd Length: 200  Bit Score: 40.27  E-value: 6.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1801 ELMKKHKAAVAQA------------------SRDMAQMNDLQAQIE----ESNKEKQELQEKLQALQSQVEfleqsmvdk 1858
Cdd:pfam13851    1 ELMKNHEKAFNEIknyynditrnnlelikslKEEIAELKKKEERNEklmsEIQQENKRLTEPLQKAQEEVE--------- 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1859 slvsrqeakirELETRLEFEKTQVKRLENLASRLKET-----------------MEKLTEERDQ------RAAAENREKE 1915
Cdd:pfam13851   72 -----------ELRKQLENYEKDKQSLKNLKARLKVLekelkdlkwehevleqrFEKVERERDElydkfeAAIQDVQQKT 140
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1916 QNKR--LQRQLrdtkEEMSE-LARKEAEASRKKHELEMDLESLEAANQSLQADLKLAFKRIGDL 1976
Cdd:pfam13851  141 GLKNllLEKKL----QALGEtLEKKEAQLNEVLAAANLDPDALQAVTEKLEDVLESKNQLIKDL 200
PDZ3_MAGI-1_3-like cd06733
PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, ...
223-300 6.30e-03

PDZ domain 3 of membrane-associated guanylate kinase inverted 1 (MAGI-1), MAGI-2, and MAGI-3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 3 of MAGI1, 2, 3 (MAGI is also known as Membrane-associated guanylate kinase, WW and PDZ domain-containing protein) and related domains. MAGI proteins have been implicated in the control of cell migration and invasion through altering the activity of phosphatase and tensin homolog (PTEN) and modulating Akt signaling. Four MAGI proteins have been identified (MAGI1-3 and MAGIX). MAGI1-3 have 6 PDZ domains and bind to the C-terminus of PTEN via their PDZ2 domain. MAGIX has a single PDZ domain that is related to MAGI1-3 PDZ domain 5. Other binding partners for MAGI1 include JAM4, C-terminal tail of high risk HPV-18 E6, megalin, TRAF6, Kir4.1 (basolateral K+ channel subunit), and cadherin 23; for MAGI2, include DASM1, dendrin, axin, beta- and delta-catenin, neuroligin, hyperpolarization-activated cation channels, beta1-adrenergic receptors, NMDA receptor, and TARPs; and for MAGI3 includes LPA2. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MAGI family PDZ3 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2); arranged as beta-strands A, -B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467215 [Multi-domain]  Cd Length: 85  Bit Score: 37.98  E-value: 6.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  223 LQRRPTGdFGFSLrrttmLDRAPEG-QAYrrVVHFAEPGAGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMIRQSG 300
Cdd:cd06733      6 LRRQETG-FGFRI-----LGGTEEGsQVS--IGAIVPGGAADLD---GrLRTGDELLSVDGVNVVGASHHKVVDLMGNAA 74
COG5644 COG5644
U3 small nucleolar RNA-associated protein 14 [Function unknown];
1469-1951 6.43e-03

U3 small nucleolar RNA-associated protein 14 [Function unknown];


Pssm-ID: 227931 [Multi-domain]  Cd Length: 869  Bit Score: 42.00  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1469 LQQLKKKCQRLTAELQDTKLHLEGQQVR----NHELEKKQ---RRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFS 1541
Cdd:COG5644      1 LPQPQGKHQRKGKKQLENKILHSYEEESagfdSEELEDNDeqgYSFGVNSEDDEEIDSDEAFDEEDEKRFADWSFNASKS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1542 LKQQMEEKDLDiagfTQKVVSLEAELQDISSQESKDEASLAKV-KKQLRDLEaKVKDQEEELDEQAGSIQMLEQAKLRLE 1620
Cdd:COG5644     81 GKSNKDHKNLN----NTKEISLNDSDDSVNSDKLENEGSVSSIdENELVDLD-TLLDNDQPEKNESGNNDHATDKENLLE 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1621 MEMERMRQTHSKEMESRDEEVEEarqscqkklkqmevqlEEEYEDKQKALREKRELESKLSTLSDQVNQR------DFES 1694
Cdd:COG5644    156 SDASSSNDSESEESDSESEIESS----------------DSDHDDENSDSKLDNLRNYIVSLKKDEADAEsvlssdDNDS 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1695 EKRLRKDLKRT-KALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEFTCAAAVKarkamevemedlhlQIDDIAKAKTA 1773
Cdd:COG5644    220 IEEIKYDPHETnKESGSSETIDITDLLDSIPMEQLKVSLKPLVSESSKLDAPLAK--------------SIQDRLERQAA 285
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1774 LEEQLSRLQREKNEIQNRLEEDQE--DMNELMKKHKAAVAQASRDMAQmNDLQAQIEE-------SNKEKQELQEKLQAL 1844
Cdd:COG5644    286 YEQTKNDLEKWKPIVADNRKSDQLifPMNETARPVPSNNGLASSFEPR-TESERKMHQalldaglENESALKKQEELALN 364
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1845 QSQVEFLEQSMVD----KSLVSRQEAKIRELETRLEFEKTQVKRLENLASR-LKETMEKLTEERDQRAAAENREKEQNKR 1919
Cdd:COG5644    365 KLSVEEVAERTRQlrfmRELMFREERKAKRVAKIKSKTYRKIRKNRKEKEMaLIPKSEDLENEKSEEARALERMTQRHKN 444
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|..
gi 1907082211 1920 LQRQLRD----------TKEEMSELARKEAEASRKKHELEMD 1951
Cdd:COG5644    445 TSSWTRKmlerashgegTREAVNEQIRKGDELMQRIHGKEIM 486
Nuf2_DHR10-like pfam18595
Nuf2, DHR10-like domain; This domain is found at the C-terminal region of Nuf2 proteins. This ...
1857-1957 6.97e-03

Nuf2, DHR10-like domain; This domain is found at the C-terminal region of Nuf2 proteins. This domain was identified as MazG related domain also designated as Designed helical repeat protein 10 (DHR10) that actually adopts a coiled-coil structure. Nuf2 is part of the Ndc80 complex, which binds to the spindle and is required for chromosome segregation and spindle checkpoint activity.


Pssm-ID: 465814 [Multi-domain]  Cd Length: 117  Bit Score: 38.72  E-value: 6.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1857 DKSLVSRQEAKIRELETRLE----FEK------TQVKRLENLASRLKETMEKLTEERDQRAAAENREKE---QNKRLQRQ 1923
Cdd:pfam18595    7 EKEELAELERKARELQAKIDalqvVEKdlrsciKLLEEIEAELAKLEEAKKKLKELRDALEEKEIELRElerREERLQRQ 86
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1907082211 1924 LRDTKEemsELARKEAEASRKKHELEMDLESLEA 1957
Cdd:pfam18595   87 LENAQE---KLERLREQAEEKREAAQARLEELRE 117
TMF_TATA_bd pfam12325
TATA element modulatory factor 1 TATA binding; This is the C-terminal conserved coiled coil ...
1880-1955 7.01e-03

TATA element modulatory factor 1 TATA binding; This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organization in animal and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family.


Pssm-ID: 432481 [Multi-domain]  Cd Length: 115  Bit Score: 38.68  E-value: 7.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1880 TQVKRLENLASRLKETMEKLTEERDQrAAAE----NREKEQNKRLQRQLRDTKEEMSELARKEA-------EASRKKHEL 1948
Cdd:pfam12325   19 STIRRLEGELASLKEELARLEAQRDE-ARQEivklMKENEELKELKKELEELEKELKELEQRYEttlellgEKSEEVEEL 97

                   ....*..
gi 1907082211 1949 EMDLESL 1955
Cdd:pfam12325   98 KADVEDL 104
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1296-1519 7.10e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 41.54  E-value: 7.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1296 SEEQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELtdeRNTGESASQLLDAETAERLRTekEMKELQT 1375
Cdd:COG3206    203 QKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL---GSGPDALPELLQSPVIQQLRA--QLAELEA 277
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1376 QYDALkkqmevmemevmearliraaeingevddddaggewRLKYERAVREVdftkKRLQQELEdkmEVEQQSRRQLERRL 1455
Cdd:COG3206    278 ELAEL-----------------------------------SARYTPNHPDV----IALRAQIA---ALRAQLQQEAQRIL 315
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907082211 1456 GDLQADSDESQRALQQLKKKCQRLTAELQDTKLHLEGQQVRNHELEKKQRRFDSeLSQAHEETQ 1519
Cdd:COG3206    316 ASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYES-LLQRLEEAR 378
PDZ2_APBA1_3-like cd06793
PDZ domain 2 of amyloid-beta A4 precursor protein-binding family A member 1 (APBA1), APBA2, ...
273-309 7.16e-03

PDZ domain 2 of amyloid-beta A4 precursor protein-binding family A member 1 (APBA1), APBA2, APBA3, and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 2 of APBA1, APBA2, APBA3, and related domains. The APBA/X11/Mint protein family includes three members: neuron specific APBA1 (also known as X11alpha and Mint1) and APBA2 (also known as X11beta and Mint2), and the ubiquitously expressed APBA3 (also known as X12gamma and Mint3). They are involved in regulating neuronal signaling, trafficking, and plasticity. They contain two PDZ domains (PDZ1 and PDZ2) which bind a variety of proteins: Arf GTPases (APBA1 and APBA2 PDZ2) and neurexin (APBA1 and APBA2 PDZ1 and 2) which are involved in vesicle docking and exocytosis; alpha1B subunit of N-type Ca2+ channel (APBA1 PDZ1) that is involved in ion channels; KIF17 (APBA1 PDZ1) that is involved in transport and traffic; and Alzheimer's disease related proteins, APP (APBA3 PDZ2), CCS (APBA1 PDZ2), NF-kappa-B/p65 (APBA2 PDZ2), presenilin-1 (APBA1 and APBA2 PDZ1 and PDZ2). PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This APBA1,3-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged as beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467255 [Multi-domain]  Cd Length: 78  Bit Score: 37.38  E-value: 7.16e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907082211  273 GDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQP 309
Cdd:cd06793     42 GHRIIEINGQSVVATPHEKIVQLLSNSVGEIHMKTMP 78
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1786-1967 7.51e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 41.04  E-value: 7.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1786 NEIQNRLEEDQEDMNELMKKHKAAVAQASRD----MAQMNDLQAQIEESNKEKQELQEKlqalQSQVEfleqsmvdkSLV 1861
Cdd:COG4372      2 DRLGEKVGKARLSLFGLRPKTGILIAALSEQlrkaLFELDKLQEELEQLREELEQAREE----LEQLE---------EEL 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1862 SRQEAKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEERDQRAAAENREKEQNKRLQRQLRDTKEEMSELARKEAEA 1941
Cdd:COG4372     69 EQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER 148
                          170       180
                   ....*....|....*....|....*.
gi 1907082211 1942 SRKKHELEMDLESLEAANQSLQADLK 1967
Cdd:COG4372    149 EEELKELEEQLESLQEELAALEQELQ 174
SynN cd00179
Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the ...
1778-1933 7.54e-03

Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane; they are a family of receptors for intracellular transport vesicles; each target membrane may be identified by a specific member of the syntaxin family; syntaxins contain a moderately well conserved amino-terminal domain, called Habc, whose structure is an antiparallel three-helix bundle; a linker of about 30 amino acids connects this to the carboxy-terminal region, designated H3 (t_SNARE), of the syntaxin cytoplasmic domain; the highly conserved H3 region forms a single, long alpha-helix when it is part of the core SNARE complex and anchors the protein on the cytoplasmic surface of cellular membranes; H3 is not included in defining this domain


Pssm-ID: 238105 [Multi-domain]  Cd Length: 151  Bit Score: 39.19  E-value: 7.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1778 LSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQsmvd 1857
Cdd:cd00179      1 LEEFFEEVEEIRGNIDKISEDVEELQKLHSQLLTAPDADPELKQELESLVQEIKKLAKEIKGKLKELEESNEQNEA---- 76
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1858 ksLVSRQEAKIReletrlefeKTQVkrlENLASRLKETMEKLTEERdQRAAAENREkeqnkRLQRQLRDTKEEMSE 1933
Cdd:cd00179     77 --LNGSSVDRIR---------KTQH---SGLSKKFVEVMTEFNKAQ-RKYRERYKE-----RIQRQLEITGGEATD 132
PDZ_ZASP52-like cd23068
PDZ domain of Drosophila melanogaster PDZ and LIM domain protein Zasp52 (also known as Zasp), ...
269-308 7.55e-03

PDZ domain of Drosophila melanogaster PDZ and LIM domain protein Zasp52 (also known as Zasp), and related domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain of Drosophila melanogaster Zasp52 and related domains. Drosophila melanogaster Zasp52 (also known as Z band alternatively spliced PDZ-motif protein or Zasp) colocalizes with integrins at myotendinous junctions and with alpha-actinin at Z-disks and is required for muscle attachment as well as Z-disk assembly and maintenance. The Zasp52 actin-binding site includes the extended PDZ domain and the ZM region. The Zasp52-PDZ domain is required for myofibril assembly. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This Zasp52-like family domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467281 [Multi-domain]  Cd Length: 82  Bit Score: 37.51  E-value: 7.55e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1907082211  269 GLVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKVQ 308
Cdd:cd23068     42 GLRRGDVILRINGTDTSNLTHKQAQDLIKRAGNDLQLTVQ 81
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
1582-1673 7.77e-03

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 41.02  E-value: 7.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1582 AKVKKQLRDLEAKVKDQEEELDEQagsiqMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEARQScqkKLKQMEVQLEE 1661
Cdd:cd16269    203 ERAKAEAAEQERKLLEEQQRELEQ-----KLEDQERSYEEHLRQLKEKMEEERENLLKEQERALES---KLKEQEALLEE 274
                           90
                   ....*....|..
gi 1907082211 1662 EYEDKQKALREK 1673
Cdd:cd16269    275 GFKEQAELLQEE 286
PspA_IM30 pfam04012
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent ...
1658-1870 7.82e-03

PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.


Pssm-ID: 461130 [Multi-domain]  Cd Length: 215  Bit Score: 40.43  E-value: 7.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1658 QLEEEYEDKQKALREK-RELESKLSTLSDQVNQ---RDFESEKRLRKDLKRTKALLADAQIMLDhlKNNAPSKREIAQLK 1733
Cdd:pfam04012   15 EGLDKAEDPEKMLEQAiRDMQSELVKARQALAQtiaRQKQLERRLEQQTEQAKKLEEKAQAALT--KGNEELAREALAEK 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1734 NQLEESEftcaaavkarKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEedQEDMNELMKKH--KAAVA 1811
Cdd:pfam04012   93 KSLEKQA----------EALETQLAQQRSAVEQLRKQLAALETKIQQLKAKKNLLKARLK--AAKAQEAVQTSlgSLSTS 160
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1907082211 1812 QASRDMAQMNDLQAQIEESNkekqELQEKLQALQSQVEFLEQSMVDKSLVSRQEAKIRE 1870
Cdd:pfam04012  161 SATDSFERIEEKIEEREARA----DAAAELASAVDLDAKLEQAGIQMEVSEDVLARLKA 215
HAUS-augmin3 pfam14932
HAUS augmin-like complex subunit 3; This domain is subunit three of the augmin complex found ...
1758-1851 7.99e-03

HAUS augmin-like complex subunit 3; This domain is subunit three of the augmin complex found from Drosophila to humans. The HAUS-augmin complex is made up of eight subunits. The augmin complex interacts with gamma-TuRC, and attenuation of this interaction severely impairs spindle MT generation. Furthermore, we provide evidence that human augmin plays critical and non-redundant roles in the kinetochore-MT attachment and also central spindle formation during anaphase in human cells.The HAUS complex is required for mitotic spindle assembly and for maintenance of centrosome integrity.


Pssm-ID: 464384 [Multi-domain]  Cd Length: 261  Bit Score: 40.76  E-value: 7.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1758 EDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQMNDLQAQIEESNKekqEL 1837
Cdd:pfam14932   70 EALEESLEEIREATEDLEAELQELQKTKQLKINRLNKLQAQASSLSQGLRALVAEEEEAAKQLEELQEELAALNA---KT 146
                           90
                   ....*....|....
gi 1907082211 1838 QEKLQALQSQVEFL 1851
Cdd:pfam14932  147 NNVLQSLQSEVKEL 160
CENP-F_leu_zip pfam10473
Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, ...
1501-1621 8.01e-03

Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance.


Pssm-ID: 463102 [Multi-domain]  Cd Length: 140  Bit Score: 39.20  E-value: 8.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1501 EKKQRRFDSELSQAHEETQREKLQREKLQREKDML--------------LAEAFSLKQQMEEkdldiagFTQKVVSLEAE 1566
Cdd:pfam10473    2 EKKQLHVLEKLKESERKADSLKDKVENLERELEMSeenqelaileaensKAEVETLKAEIEE-------MAQNLRDLELD 74
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907082211 1567 LQDISSQESKDEASLAKVKKQLRDLEAKVKDQEEELDE-QAGSIQMLEQAKLRLEM 1621
Cdd:pfam10473   75 LVTLRSEKENLTKELQKKQERVSELESLNSSLENLLEEkEQEKVQMKEESKTAVEM 130
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1487-1672 8.05e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 41.48  E-value: 8.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1487 KLHLEGQQVRNHELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQK-VVSLEA 1565
Cdd:pfam15709  337 RLRAERAEMRRLEVERKRREQEEQRRLQQEQLERAEKMREELELEQQRRFEEIRLRKQRLEEERQRQEEEERKqRLQLQA 416
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1566 ELQDISSQESKDEASLAKVKKQLRDLEAKvKDQEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEMESRDEEVEEAR 1645
Cdd:pfam15709  417 AQERARQQQEEFRRKLQELQRKKQQEEAE-RAEAEKQRQKELEMQLAEEQKRLMEMAEEERLEYQRQKQEAEEKARLEAE 495
                          170       180
                   ....*....|....*....|....*..
gi 1907082211 1646 QSCQKKLKQMEVQLEEEYEDKQKALRE 1672
Cdd:pfam15709  496 ERRQKEEEAARLALEEAMKQAQEQARQ 522
PDZ10_MUPP1-PDZ8_PATJ-like cd06673
PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated ...
258-307 8.20e-03

PDZ domain 10 of multi-PDZ-domain protein 1 (MUPP1), domain 8 of PATJ (protein-associated tight junction) and similar domains; PDZ (PSD-95 (Postsynaptic density protein 95), Dlg (Discs large protein), and ZO-1 (Zonula occludens-1)) domain 10 of MUPP1, PDZ domain 8 of PATJ, and related domains. MUPP1 and PATJ serve as scaffolding proteins linking different proteins and protein complexes involved in the organization of tight junctions and epithelial polarity. MUPP1 contains an L27 (Lin-2 and Lin-7 binding) domain and 13 PDZ domains. PATJ (also known as INAD-like) contains an L27 domain and ten PDZ domains. MUPP1 and PATJ share several binding partners, including junctional adhesion molecules (JAM), zonula occludens (ZO)-3, Pals1 (protein associated with Lin-7), Par (partitioning defective)-6 proteins, and nectins (adherence junction adhesion molecules). PATJ lacks 3 PDZ domains seen in MUPP1: PDZ6, 9, and 13; consequently, MUPP1 PDZ7 and 8 align with PATJ PDZ6 and 7; and MUPP1 PDZ domains 10-12 align with PATJ PDZ domains 8-10. PDZ domains usually bind in a sequence-specific manner to short peptide sequences located at the C-terminal end of their partner proteins (known as PDZ binding motifs). The PDZ superfamily includes canonical PDZ domains as well as those with circular permutations and domain swapping mediated by beta-strands. This MUPP1-like family PDZ10 domain is a canonical PDZ domain containing six beta-strands A-F and two alpha-helices (alpha-helix 1 and 2), arranged in the order: beta-strands A, B, C, alpha-helix 1, beta-strands D, E, alpha-helix 2 and beta-strand F.


Pssm-ID: 467161 [Multi-domain]  Cd Length: 86  Bit Score: 37.66  E-value: 8.20e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1907082211  258 EPGAGTKDlalG-LVPGDRLVEINGQNVENKSRDEIVEMIRQSGDSVRLKV 307
Cdd:cd06673     37 EDGAAAKD---GrLWAGDQILEVNGEDLRKATHDEAINVLRQTPQKVRLLV 84
CCDC154 pfam15450
Coiled-coil domain-containing protein 154; CCDC154 is an osteopetrosis-related protein that ...
1449-1934 8.40e-03

Coiled-coil domain-containing protein 154; CCDC154 is an osteopetrosis-related protein that suppresses cell proliferation by inducing G2/M arrest.


Pssm-ID: 464723 [Multi-domain]  Cd Length: 526  Bit Score: 41.36  E-value: 8.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1449 RQLERRLGDLQAD-------SDESQRALQQLKKKCQRLTA--ELQDTKLHLEGQQVRN--HELEKKQRRFDSELSQAHEE 1517
Cdd:pfam15450   16 KQLEQWVADLQAEvvslrghKERCEHATLSLLRELLQVRAhvQLQDSELKQLRQEVQQaaRAPEKEALEFPGPQNQNQMQ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1518 TQREKL--QREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKvVSLEAELQDISS---QESKDEASlAKVKKQLRDLE 1592
Cdd:pfam15450   96 ALDKRLveVREALTQIRRKQALQDSERKGAEQEANLRLTKLTGK-LKQEEQGREAACsalQKSQEEAS-QKVDHEVARMQ 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1593 AKVKDQEEE-----LDEQAGSIQMLEQAKLRLEmemERMRQTHSKEMESRDEEVEEARQSCQKklkqmevqLEEEYEDKQ 1667
Cdd:pfam15450  174 AQVTKLGEEmslrfLKREAKLCSFLQKSFLALE---KRMKASESTRLKAESSLREELEGRWQK--------LQELTEERL 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1668 KALREKRELESklSTLSDQVnqrdfeseKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIaqlKNQLEESEFTCAAAV 1747
Cdd:pfam15450  243 RALQGQREQEE--GHLLEQC--------RGLDAAVVQLTKFVRQNQVSLNRVLLAEQKARDA---KGQLEESQAGELASY 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1748 ------KARKAMEVEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQN-------RLEEDQEDMNELMKKHKAAVAQAS 1814
Cdd:pfam15450  310 vqenleAVQLAGELAQQETQGALELLQEKSQVLEGSVAELVRQVKDLSDhflalswRLDLQEQTLGLKLSEAKKEWEGAE 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1815 RDmaQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMV----DKSLVSRQEAKIRELET---RLE----FEKTQVK 1883
Cdd:pfam15450  390 RK--SLEDLAQWQKEVAAHLREVQEKVDSLPRQIEAVSDKCVlhksDSDLKISAEGKAREFEVeamRQElaalLSSVQLL 467
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1907082211 1884 RLENLASRLKETMEKLTEERDQRAAAENREKE----QNKRLQRQLRDTKEEMSEL 1934
Cdd:pfam15450  468 KEGNPGRKIAEIQGKLATFQNQIIKLENSIQDnktiQNLKFNTETKLRTEEIATL 522
ATG17_like pfam04108
Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ...
1675-1945 8.56e-03

Autophagy protein ATG17-like domain; This domain is found in the autophagy-related proteins ATG17 and ATG11, conserved across eukaryotes. ATG17 forms a complex with ATG29 and ATG31, critical for both PAS (preautophagosomal structure) formation and autophagy. Together with ATG13, it is required for ATG1 kinase activation. ATG11 is a scaffold protein required for the cytoplasm-to-vacuole targeting (Cvt) pathway during starvation and to recruit ATG proteins to the pre-autophagosome. It is also required for ATG1 kinase activation. In many eukaryotes, ATG11 (the orthologue in mammals is RB1-inducible coiled-coil protein 1 (RB1CC1) and in S. pombe is Taz1-interacting factor 1 (taf1)) is essential for bulk autophagy, except in S.cerevisiae. ATG17 and ATG11 are large similar proteins, both predicted to be almost entirely helical, containing conserved coiled-coil regions and lack obvious functional motifs.


Pssm-ID: 427715 [Multi-domain]  Cd Length: 360  Bit Score: 40.83  E-value: 8.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1675 ELESKLSTLSDQVNQRdfesEKRLRKDLKRTKALLADAQIMLDHLKNNAPSKREIAQLKNQLEESEftcaaavkarKAME 1754
Cdd:pfam04108  116 ILRDALKELIDELQAA----QESLDSDLKRFDDDLRDLQKELESLSSPSESISLIPTLLKELESLE----------EEMA 181
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1755 VEMEDLHLQIDDIAKAKTALEEqlsrlqrEKNEIQNRLEEDQEDMNELMKKHKAAVAQASRDMAQmndLQAQIEESNKEK 1834
Cdd:pfam04108  182 SLLESLTNHYDQCVTAVKLTEG-------GRAEMLEVLENDARELDDVVPELQDRLDEMENNYER---LQKLLEQKNSLI 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1835 QELQEKLQalqsqvefleqsmvdkslvsrqeaKIRELETRLEFEKTQVKRLENLASRLKETMEKLTEE-RDQRAAAEN-- 1911
Cdd:pfam04108  252 DELLSALQ------------------------LIAEIQSRLPEYLAALKEFEERWEEEKETIEDYLSElEDLREFYEGfp 307
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1907082211 1912 -----------REKEQNKRLQRQLRDTKEEMSELARKEAEAsRKK 1945
Cdd:pfam04108  308 saygsllleveRRREWAEKMKKILRKLAEELDRLQEEERKR-REK 351
PRK12704 PRK12704
phosphodiesterase; Provisional
1500-1662 8.64e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 41.30  E-value: 8.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1500 LEKKQRRFDSELSQAHEETQREKlqREKLQREKDmllaEAFSLKQQMEekdldiagftQKVVSLEAELQDISSQESKDEA 1579
Cdd:PRK12704    33 IKEAEEEAKRILEEAKKEAEAIK--KEALLEAKE----EIHKLRNEFE----------KELRERRNELQKLEKRLLQKEE 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 SLakvKKQLRDLEAKvkdqEEELDEQAGSIQMLEQAKLRLEMEMERMRQTHSKEME-----SRDE-------EVE-EARQ 1646
Cdd:PRK12704    97 NL---DRKLELLEKR----EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELErisglTAEEakeilleKVEeEARH 169
                          170
                   ....*....|....*.
gi 1907082211 1647 SCQKKLKQMEVQLEEE 1662
Cdd:PRK12704   170 EAAVLIKEIEEEAKEE 185
PRK11637 PRK11637
AmiB activator; Provisional
1808-1924 9.35e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 41.22  E-value: 9.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1808 AAVAQASRDMAQMNDLQAQIEESNKEKQELQEKLQALQSQVEFLEQSMvdkSLVSRQeakIRELETRLEfektqvkRLEN 1887
Cdd:PRK11637    37 AFSAHASDNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAI---SQASRK---LRETQNTLN-------QLNK 103
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1907082211 1888 LASRLKETMEKLteerdqraaaENREKEQNKRLQRQL 1924
Cdd:PRK11637   104 QIDELNASIAKL----------EQQQAAQERLLAAQL 130
46 PHA02562
endonuclease subunit; Provisional
1432-1612 9.45e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.15  E-value: 9.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1432 RLQQELEDKMEVEQQSRRQLER---RLGDLQADSDESQRALQQLKKKCQRLTAELQ----DTKLHLEG-------QQVrn 1497
Cdd:PHA02562   217 RKQNKYDELVEEAKTIKAEIEEltdELLNLVMDIEDPSAALNKLNTAAAKIKSKIEqfqkVIKMYEKGgvcptctQQI-- 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1498 HELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLA---EAFSLKQQMEEKDLDIAGFTQKVVSLEAELQDISSQE 1574
Cdd:PHA02562   295 SEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEqskKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEF 374
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1907082211 1575 SKDEASLAKVKKQLRDLEAKVKDQEEELDEQAGSIQML 1612
Cdd:PHA02562   375 VDNAEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLL 412
OmpH smart00935
Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH ...
1755-1848 9.70e-03

Outer membrane protein (OmpH-like); This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterized as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.


Pssm-ID: 214922 [Multi-domain]  Cd Length: 140  Bit Score: 38.72  E-value: 9.70e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211  1755 VEMEDLHLQIDDIAKAKTALEEQLSRLQREKNEIQNRLEEDQEDmnelMKKHKAAVAQASRDmAQMNDLQAQIEESNKEK 1834
Cdd:smart00935    4 VDVQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEK----LQKDAATLSEAARE-KKEKELQKKVQEFQRKQ 78
                            90
                    ....*....|....
gi 1907082211  1835 QELQEKLQALQSQV 1848
Cdd:smart00935   79 QKLQQDLQKRQQEE 92
Mitofilin pfam09731
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ...
1499-1745 9.74e-03

Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.


Pssm-ID: 430783 [Multi-domain]  Cd Length: 618  Bit Score: 41.28  E-value: 9.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1499 ELEKKQRRFDSELSQAHEETQREKLQREKLQREKDMLLAEAFS-----LKQQMEEKDLDIAGFTQKVVSLEAELQDISSQ 1573
Cdd:pfam09731  230 NVEEKVEKAQSLAKLVDQYKELVASERIVFQQELVSIFPDIIPvlkedNLLSNDDLNSLIAHAHREIDQLSKKLAELKKR 309
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1574 ESKDEA-SLAKVKKQLRDLEAKV-KDQEEELDEQAGSIQMLEQAKL-------RLEMEMERMRQTHSKEMESRDEEVEEA 1644
Cdd:pfam09731  310 EEKHIErALEKQKEELDKLAEELsARLEEVRAADEAQLRLEFEREReeiresyEEKLRTELERQAEAHEEHLKDVLVEQE 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1645 rqscqkklkqMEVQLEEEYEDKQKALREKRELESKLSTLSDQVNQ--RDFESEKRLRKDLKRTKAL------LADAQIML 1716
Cdd:pfam09731  390 ----------IELQREFLQDIKEKVEEERAGRLLKLNELLANLKGleKATSSHSEVEDENRKAQQLwlaveaLRSTLEDG 459
                          250       260
                   ....*....|....*....|....*....
gi 1907082211 1717 DHLKNNAPSKREIAQLKNQLEESEFTCAA 1745
Cdd:pfam09731  460 SADSRPRPLVRELKALKELASDDEVVKAA 488
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1305-1916 9.81e-03

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 41.32  E-value: 9.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1305 EEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTGESASQLLDAETAERLRTEKEMKELQ-TQYDALKKQ 1383
Cdd:PRK10246   191 EQHKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQQQQSLNWLTRLDELQQEASrRQQALQQAL 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1384 MEVMEMEVMEARLIRAAEINGEVDDDDAGGEWRLKYERAVREVDFTKKRLQQELEDKMEVEQQSRRQLERRLGDLQ---- 1459
Cdd:PRK10246   271 AAEEKAQPQLAALSLAQPARQLRPHWERIQEQSAALAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSAELQAQQQslnt 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1460 --ADSDESQ---------RAL--------QQLKKKCQRLTAELQ------DTKLHLEGQQVRNH-ELEKKQRRFDSELSQ 1513
Cdd:PRK10246   351 wlAEHDRFRqwnnelagwRAQfsqqtsdrEQLRQWQQQLTHAEQklnalpAITLTLTADEVAAAlAQHAEQRPLRQRLVA 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1514 AHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQKVVSL------EAELQDISSQESKDEA-------- 1579
Cdd:PRK10246   431 LHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVkticeqEARIKDLEAQRAQLQAgqpcplcg 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1580 -----------SLAKVKKQLR--DLEAKVKDQEEELDEQAGSIQMLEQAKLRLEMEMERMR---QTHSKEMESRDEEVEE 1643
Cdd:PRK10246   511 stshpaveayqALEPGVNQSRldALEKEVKKLGEEGAALRGQLDALTKQLQRDESEAQSLRqeeQALTQQWQAVCASLNI 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1644 ARQSCQKKLKQMEVQleEEYEDKQKALREKRELESKLSTLSDQVNQRDFESEKR---LRKDLKRTKALLADAQIMLDHLK 1720
Cdd:PRK10246   591 TLQPQDDIQPWLDAQ--EEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRqqqLLTALAGYALTLPQEDEEASWLA 668
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1721 NNAPSKREIAQLKNQ---LEESEFTCAAAVKARKAM-EVEMEDLHLQIDDIAKAKT---ALEEQLSRLQREKNEIQNRLE 1793
Cdd:PRK10246   669 TRQQEAQSWQQRQNEltaLQNRIQQLTPLLETLPQSdDLPHSEETVALDNWRQVHEqclSLHSQLQTLQQQDVLEAQRLQ 748
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1794 EDQEDMNELMKKH----KAAVAQASRDMAQMNDLQAQieesnkeKQELQEKLQALQSQVEFLEQsmvdkSLVSRQEAKIR 1869
Cdd:PRK10246   749 KAQAQFDTALQASvfddQQAFLAALLDEETLTQLEQL-------KQNLENQRQQAQTLVTQTAQ-----ALAQHQQHRPD 816
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*...
gi 1907082211 1870 ELETRLEFEKTQVkRLENLASRLKETMEKLTEERDQ-RAAAENREKEQ 1916
Cdd:PRK10246   817 GLDLTVTVEQIQQ-ELAQLAQQLRENTTRQGEIRQQlKQDADNRQQQQ 863
WEMBL pfam05701
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ...
1298-1661 9.93e-03

Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".


Pssm-ID: 461718 [Multi-domain]  Cd Length: 562  Bit Score: 41.17  E-value: 9.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1298 EQIRNKDEEIQQLRSKLEKVEKERNELRLSSDRLETRISELTSELTDERNTG-----ESASQLLDAETAERLRTEKEMKE 1372
Cdd:pfam05701   70 EELESTKRLIEELKLNLERAQTEEAQAKQDSELAKLRVEEMEQGIADEASVAakaqlEVAKARHAAAVAELKSVKEELES 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1373 LQTQYDALKKQMEVMEMEVMEArLIRAAEINGEVDD------------------DDAGGEWRLkyeRAVREVDFTKKRLQ 1434
Cdd:pfam05701  150 LRKEYASLVSERDIAIKRAEEA-VSASKEIEKTVEEltieliatkeslesahaaHLEAEEHRI---GAALAREQDKLNWE 225
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1435 QELEDKMEVEQQSRRQLeRRLGDLQADSDESQRALQQLKkkcqrltAEL---QDTKLHLEGQQVRNheLEKKQRRFDSEL 1511
Cdd:pfam05701  226 KELKQAEEELQRLNQQL-LSAKDLKSKLETASALLLDLK-------AELaayMESKLKEEADGEGN--EKKTSTSIQAAL 295
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907082211 1512 SQAHEETQREKLQREKLQREKDMLLAEAFSLKQQMEEKDLDIAGFTQK-------VVSLEAELQDISSQeskdeasLAKV 1584
Cdd:pfam05701  296 ASAKKELEEVKANIEKAKDEVNCLRVAAASLRSELEKEKAELASLRQRegmasiaVSSLEAELNRTKSE-------IALV 368
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907082211 1585 KKQLRDLEAKVKDQEEELDEQAgsiQMLEQAKLRLEMEMERMRQTHskemesrdEEVEEARQSCQkklkQMEVQLEE 1661
Cdd:pfam05701  369 QAKEKEAREKMVELPKQLQQAA---QEAEEAKSLAQAAREELRKAK--------EEAEQAKAAAS----TVESRLEA 430
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH