NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568991115|ref|XP_006520386|]
View 

angiopoietin-1 isoform X1 [Mus musculus]

Protein Classification

fibrinogen-related domain-containing protein( domain architecture ID 10053370)

fibrinogen-related domain-containing protein contains a C terminal globular domain similar to that of fibrinogen, and may be involved in one or more of a variety of binding interactions and functions including complement activation, signaling and regulation

PubMed:  1304888

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
183-398 4.23e-128

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


:

Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 367.34  E-value: 4.23e-128
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 183 KPFRDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 262
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 263 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 342
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 568991115 343 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
Mplasa_alph_rch super family cl37461
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
38-160 2.37e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


The actual alignment was detected with superfamily member TIGR04523:

Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 43.47  E-value: 2.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   38 SQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEILK--IHEKNSLLEHKILEMEGKhKEELDTLK 115
Cdd:TIGR04523 314 SELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSEKQreLEEKQNEIEKLKKENQSY-KQEIKNLE 390
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 568991115  116 EEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMDTV 160
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
 
Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
183-398 4.23e-128

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 367.34  E-value: 4.23e-128
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 183 KPFRDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 262
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 263 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 342
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 568991115 343 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
FBG smart00186
Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and ...
185-398 7.24e-126

Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.


Pssm-ID: 214548 [Multi-domain]  Cd Length: 212  Bit Score: 361.59  E-value: 7.24e-126
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   185 FRDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNEFI 264
Cdd:smart00186   2 PRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDWKDYKEGFGNLAGEFWLGNENI 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   265 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSLILHGADFSTKDADNDNCMCKCALM 344
Cdd:smart00186  82 HLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGYRLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEE 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 568991115   345 LTGGWWFDACGPSNLNGMFYtagQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:smart00186 162 YGGGWWYNNCHAANLNGRYY---PNNNYDNGINWATWKGSWYSLKFTEMKIRPL 212
Fibrinogen_C pfam00147
Fibrinogen beta and gamma chains, C-terminal globular domain;
186-398 4.96e-76

Fibrinogen beta and gamma chains, C-terminal globular domain;


Pssm-ID: 395095 [Multi-domain]  Cd Length: 221  Bit Score: 235.11  E-value: 4.96e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  186 RDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPS-GEYWLGNEFI 264
Cdd:pfam00147   3 RDCSDVYNKGAKTSGLYTIRPDGATKPFEVYCDMETDGGGWTVFQRRLDGSTNFKRNWKDYKAGFGNLSpGEFWLGNDKI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  265 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQ-----SSLILH-GADFSTKDADNDNCM 338
Cdd:pfam00147  83 HLLTKQGPYVLRIDLEDWNGETVFALYDSFKVTNENDKYRLHVENYIGDAGDAldtagRSMTYHnGMQFSTWDRDNDSPD 162
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  339 CKCALMLTGGWWFDACGPSNLNGMFYTaGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:pfam00147 163 GNCALSYGGGWWYNNCHAANLNGVYYY-GGTYSKQNGIIWATWKGRWYSMKKAEMKIRPL 221
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
188-229 6.07e-08

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 48.71  E-value: 6.07e-08
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*
gi 568991115 188 CADVYQAGFN-KSGIYTIYFNNMP--EPKKVFCNMDVNGGGWTVI 229
Cdd:NF040941   2 CWEILQAGPSaPSGVYWIDPDGMGglAPFQVYCDMTTDGGGWTLV 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
38-160 2.37e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 43.47  E-value: 2.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   38 SQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEILK--IHEKNSLLEHKILEMEGKhKEELDTLK 115
Cdd:TIGR04523 314 SELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSEKQreLEEKQNEIEKLKKENQSY-KQEIKNLE 390
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 568991115  116 EEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMDTV 160
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
38-156 5.21e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.23  E-value: 5.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  38 SQTAEQTRKLTDVETQVLNQTSRLEIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEE 117
Cdd:COG1196  209 AEKAERYRELKEELKELEAELLLLKLRELEAELE--ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 568991115 118 KENLQGLVSRqtfiIQELEKQLSRATNNNSILQKQQLEL 156
Cdd:COG1196  287 QAEEYELLAE----LARLEQDIARLEERRRELEERLEEL 321
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
78-146 6.85e-04

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.88  E-value: 6.85e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568991115   78 QLLQQTNEILKIHEKNSllehkilEMEGKHKEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNN 146
Cdd:pfam17078  21 QLTVQSQNLLSKLEIAQ-------QKESKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSELKNSY 82
PRK10935 PRK10935
nitrate/nitrite two-component system sensor histidine kinase NarQ;
24-157 3.22e-03

nitrate/nitrite two-component system sensor histidine kinase NarQ;


Pssm-ID: 236800 [Multi-domain]  Cd Length: 565  Bit Score: 39.45  E-value: 3.22e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  24 NHTATmlEIGT---SLLSQTAEQTRKLTDV--ETQVLNQTSRLeiqllensLSTYKLEKQLLQQTNEILKIHEKNSLLEH 98
Cdd:PRK10935 216 NQMSS--ELHKlyrSLEASVEEKTRKLTQAnrSLEVLYQCSQA--------LNASQIDVHCFRHILQIVRDHEGLDYLEL 285
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568991115  99 KILE------MEGKHKEELD----TLKEEKENL------QGLVSRQTFIIQELEKQLSRA-TNNNSILQKQQLELM 157
Cdd:PRK10935 286 EVGEnehwriSEGQPNPELPwqilPLTMEDTVLgylhwqASLPCPDEPLMNNVAQMLGRGlYFNQAQKQQQQLLLM 361
 
Name Accession Description Interval E-value
FReD cd00087
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is ...
183-398 4.23e-128

Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.


Pssm-ID: 238040 [Multi-domain]  Cd Length: 215  Bit Score: 367.34  E-value: 4.23e-128
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 183 KPFRDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNE 262
Cdd:cd00087    1 PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRSWKEYKDGFGNLDGEFWLGLE 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115 263 FIFAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSlILHGADFSTKDADNDNCMCKCA 342
Cdd:cd00087   81 KIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEGYRLTLGGYSGTAGDALS-YHNGMKFSTFDRDNDGASGNCA 159
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 568991115 343 LMLTGGWWFDACGPSNLNGMFYTAGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:cd00087  160 ESYSGGWWYNSCHASNLNGRYYSGGHRNEYDNGINWATWKGSTYSLKFTEMKIRPK 215
FBG smart00186
Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and ...
185-398 7.24e-126

Fibrinogen-related domains (FReDs); Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.


Pssm-ID: 214548 [Multi-domain]  Cd Length: 212  Bit Score: 361.59  E-value: 7.24e-126
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   185 FRDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPSGEYWLGNEFI 264
Cdd:smart00186   2 PRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDWKDYKEGFGNLAGEFWLGNENI 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   265 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQSSLILHGADFSTKDADNDNCMCKCALM 344
Cdd:smart00186  82 HLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGYRLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEE 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 568991115   345 LTGGWWFDACGPSNLNGMFYtagQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:smart00186 162 YGGGWWYNNCHAANLNGRYY---PNNNYDNGINWATWKGSWYSLKFTEMKIRPL 212
Fibrinogen_C pfam00147
Fibrinogen beta and gamma chains, C-terminal globular domain;
186-398 4.96e-76

Fibrinogen beta and gamma chains, C-terminal globular domain;


Pssm-ID: 395095 [Multi-domain]  Cd Length: 221  Bit Score: 235.11  E-value: 4.96e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  186 RDCADVYQAGFNKSGIYTIYFNNMPEPKKVFCNMDVNGGGWTVIQHREDGSLDFQRGWKEYKMGFGNPS-GEYWLGNEFI 264
Cdd:pfam00147   3 RDCSDVYNKGAKTSGLYTIRPDGATKPFEVYCDMETDGGGWTVFQRRLDGSTNFKRNWKDYKAGFGNLSpGEFWLGNDKI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  265 FAITSQRQYMLRIELMDWEGNRAYSQYDRFHIGNEKQNYRLYLKGHTGTAGKQ-----SSLILH-GADFSTKDADNDNCM 338
Cdd:pfam00147  83 HLLTKQGPYVLRIDLEDWNGETVFALYDSFKVTNENDKYRLHVENYIGDAGDAldtagRSMTYHnGMQFSTWDRDNDSPD 162
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  339 CKCALMLTGGWWFDACGPSNLNGMFYTaGQNHGKLNGIKWHYFKGPSYSLRSTTMMIRPL 398
Cdd:pfam00147 163 GNCALSYGGGWWYNNCHAANLNGVYYY-GGTYSKQNGIIWATWKGRWYSMKKAEMKIRPL 221
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
188-229 6.07e-08

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, fincolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 48.71  E-value: 6.07e-08
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*
gi 568991115 188 CADVYQAGFN-KSGIYTIYFNNMP--EPKKVFCNMDVNGGGWTVI 229
Cdd:NF040941   2 CWEILQAGPSaPSGVYWIDPDGMGglAPFQVYCDMTTDGGGWTLV 46
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
38-160 2.37e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 43.47  E-value: 2.37e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   38 SQTAEQTRKLTDVETQVLNQTSRleIQLLENSLSTYKLEKQLLQQTNEILK--IHEKNSLLEHKILEMEGKhKEELDTLK 115
Cdd:TIGR04523 314 SELKNQEKKLEEIQNQISQNNKI--ISQLNEQISQLKKELTNSESENSEKQreLEEKQNEIEKLKKENQSY-KQEIKNLE 390
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 568991115  116 EEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMDTV 160
Cdd:TIGR04523 391 SQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETI 435
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
38-156 5.21e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 42.23  E-value: 5.21e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  38 SQTAEQTRKLTDVETQVLNQTSRLEIQLLENSLStyKLEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEE 117
Cdd:COG1196  209 AEKAERYRELKEELKELEAELLLLKLRELEAELE--ELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 568991115 118 KENLQGLVSRqtfiIQELEKQLSRATNNNSILQKQQLEL 156
Cdd:COG1196  287 QAEEYELLAE----LARLEQDIARLEERRRELEERLEEL 321
SHE3 pfam17078
SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an ...
78-146 6.85e-04

SWI5-dependent HO expression protein 3; SWI5-dependent HO expression protein 3 (She3) is an RNA-binding protein that binds specific mRNAs, including the mRNA of Ash1, which is invalid in cell-fate determination. She3 acts as an adapter protein that docks the myosin motor Myo4p onto an Ash1-She2p ribonucleoprotein complex. She3 seems to bind to Myo4p and Shep2p via different domains.


Pssm-ID: 293683 [Multi-domain]  Cd Length: 228  Bit Score: 40.88  E-value: 6.85e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568991115   78 QLLQQTNEILKIHEKNSllehkilEMEGKHKEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNN 146
Cdd:pfam17078  21 QLTVQSQNLLSKLEIAQ-------QKESKFLENLASLKHENDNLSSMLNRKERRLKDLEDQLSELKNSY 82
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
8-163 1.28e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 40.65  E-value: 1.28e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115    8 ENMKSEMAQIQQN-AVQNHTATMLeiGTSLLSQTAEQTRKLTDVEtQVLNQTSRLEIQLLENSL------STYKLEKQLL 80
Cdd:pfam07888 233 EALLEELRSLQERlNASERKVEGL--GEELSSMAAQRDRTQAELH-QARLQAAQLTLQLADASLalregrARWAQERETL 309
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115   81 QQTNEilKIHEKNSLLEHKILEMEGKHKEE------LDT-LKEEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQ 153
Cdd:pfam07888 310 QQSAE--ADKDRIEKLSAELQRLEERLQEErmerekLEVeLGREKDCNRVQLSESRRELQELKASLRVAQKEKEQLQAEK 387
                         170
                  ....*....|
gi 568991115  154 LELMDTVHNL 163
Cdd:pfam07888 388 QELLEYIRQL 397
PRK10935 PRK10935
nitrate/nitrite two-component system sensor histidine kinase NarQ;
24-157 3.22e-03

nitrate/nitrite two-component system sensor histidine kinase NarQ;


Pssm-ID: 236800 [Multi-domain]  Cd Length: 565  Bit Score: 39.45  E-value: 3.22e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  24 NHTATmlEIGT---SLLSQTAEQTRKLTDV--ETQVLNQTSRLeiqllensLSTYKLEKQLLQQTNEILKIHEKNSLLEH 98
Cdd:PRK10935 216 NQMSS--ELHKlyrSLEASVEEKTRKLTQAnrSLEVLYQCSQA--------LNASQIDVHCFRHILQIVRDHEGLDYLEL 285
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568991115  99 KILE------MEGKHKEELD----TLKEEKENL------QGLVSRQTFIIQELEKQLSRA-TNNNSILQKQQLELM 157
Cdd:PRK10935 286 EVGEnehwriSEGQPNPELPwqilPLTMEDTVLgylhwqASLPCPDEPLMNNVAQMLGRGlYFNQAQKQQQQLLLM 361
DUF724 pfam05266
Protein of unknown function (DUF724); This family contains several uncharacterized proteins ...
74-119 4.00e-03

Protein of unknown function (DUF724); This family contains several uncharacterized proteins found in Arabidopsis thaliana and other plants. This region is often found associated with Agenet domains and may contain coiled-coil.


Pssm-ID: 428400 [Multi-domain]  Cd Length: 188  Bit Score: 38.02  E-value: 4.00e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 568991115   74 KLEKQLLQQTNEILKIHEKNSLLEHKILEMEgkhkEELDTLKEEKE 119
Cdd:pfam05266 113 KLEKKIAEEESEKRKLEEEIDELEKKILELE----RQLALAKEKKE 154
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
35-160 4.02e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 39.67  E-value: 4.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115    35 SLLSQTAEQTRKLTDVETQVLNQTSRL-----EIQLLENSLSTYKLEKQLLqqTNEILKIHEKNSLLEHKILEMEGKH-- 107
Cdd:TIGR02169  305 SLERSIAEKERELEDAEERLAKLEAEIdkllaEIEELEREIEEERKRRDKL--TEEYAELKEELEDLRAELEEVDKEFae 382
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568991115   108 --------KEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMDTV 160
Cdd:TIGR02169  383 trdelkdyREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
36-160 4.72e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 38.37  E-value: 4.72e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  36 LLSQTAEQTRKLTDVETQVlnQTSRLEIQLLENSLSTY-----KLEKQLLQQTN---------EILKIHEKNSLLEHKIL 101
Cdd:COG1579   36 LEDELAALEARLEAAKTEL--EDLEKEIKRLELEIEEVearikKYEEQLGNVRNnkeyealqkEIESLKRRISDLEDEIL 113
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 568991115 102 EMEGKHKEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMDTV 160
Cdd:COG1579  114 ELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAKI 172
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
7-158 4.90e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 39.28  E-value: 4.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115     7 VENMKSEMAQIQQNAVQNHTATMLEIgTSLLSQTAEQTRKLTDVETQVlnqtSRLEIQLLENSLSTYKLEKQLLQQTNEI 86
Cdd:TIGR02169  700 IENRLDELSQELSDASRKIGEIEKEI-EQLEQEEEKLKERLEELEEDL----SSLEQEIENVKSELKELEARIEELEEDL 774
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568991115    87 LKIHEKNSLLEHKILEmegkhkEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQLELMD 158
Cdd:TIGR02169  775 HKLEEALNDLEARLSH------SRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQE 840
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
75-164 8.01e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 38.21  E-value: 8.01e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568991115  75 LEKQLLQQTNEILKIHEKNSLLEHKILEMEGKHKEELDTLKEEKENLQGLVSRQTFIIQELEKQLSRATNNNSILQKQQL 154
Cdd:COG4942  144 LAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAE 223
                         90
                 ....*....|
gi 568991115 155 ELMDTVHNLI 164
Cdd:COG4942  224 ELEALIARLE 233
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH