NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1342569868|gb|AVC05444|]
View 

thioester-containing protein 1, partial [Anopheles gambiae]

Protein Classification

prenyltransferase/squalene oxidase repeat-containing protein( domain architecture ID 693)

prenyltransferase/squalene oxidase repeat-containing protein similar to Streptomyces anulatus copalyl diphosphate synthase and Aspergillus fumigatus squalene hopane cyclase afumA

CATH:  1.50.10.20
EC:  5.-.-.-
Gene Ontology:  GO:0016853
SCOP:  3001024

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ISOPREN_C2_like super family cl08267
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
1-234 1.82e-103

This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. Alpha (2)-M and PZP are known to bind to and, may modulate, the activity of placental protein-14 in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system.


The actual alignment was detected with superfamily member cd02897:

Pssm-ID: 415487  Cd Length: 292  Bit Score: 301.03  E-value: 1.82e-103
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   1 IDKATNLLRQGYQNQMRYRQTDGSFGVWEKS--GSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFD 78
Cdd:cd02897    44 ESKALGFLRTGYQRQLTYKHSDGSYSAFGESdkSGSTWLTAFVLKSFAQARPFI-YIDENVLQQALTWLSSHQKSNGCFR 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  79 ETGKVWHKDMQGGLRNGVALTSYVLTALLENDIAkvKHAVVIQNGMNYLSNQLAFINNPYDLSIATYAMMLNGHTMKKEA 158
Cdd:cd02897   123 EVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLP--SERPVVEKALSCLEAALDSISDPYTLALAAYALTLAGSEKRPEA 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 159 LDKLIDMSISDNNKKE-------------RYWGTTNQIETTAYALLSFVMA--EKYLDGIPVMNWLVNQRYVTGSFPRTQ 223
Cdd:cd02897   201 LKKLDELAISEDGTKHwsrpppseegpsyYWQAPSAEVEMTAYALLALLSAggEDLAEALPIVKWLAKQRNSLGGFSSTQ 280
                         250
                  ....*....|.
gi 1342569868 224 DTFVGLKALTK 234
Cdd:cd02897   281 DTVVALQALAK 291
 
Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
1-234 1.82e-103

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 301.03  E-value: 1.82e-103
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   1 IDKATNLLRQGYQNQMRYRQTDGSFGVWEKS--GSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFD 78
Cdd:cd02897    44 ESKALGFLRTGYQRQLTYKHSDGSYSAFGESdkSGSTWLTAFVLKSFAQARPFI-YIDENVLQQALTWLSSHQKSNGCFR 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  79 ETGKVWHKDMQGGLRNGVALTSYVLTALLENDIAkvKHAVVIQNGMNYLSNQLAFINNPYDLSIATYAMMLNGHTMKKEA 158
Cdd:cd02897   123 EVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLP--SERPVVEKALSCLEAALDSISDPYTLALAAYALTLAGSEKRPEA 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 159 LDKLIDMSISDNNKKE-------------RYWGTTNQIETTAYALLSFVMA--EKYLDGIPVMNWLVNQRYVTGSFPRTQ 223
Cdd:cd02897   201 LKKLDELAISEDGTKHwsrpppseegpsyYWQAPSAEVEMTAYALLALLSAggEDLAEALPIVKWLAKQRNSLGGFSSTQ 280
                         250
                  ....*....|.
gi 1342569868 224 DTFVGLKALTK 234
Cdd:cd02897   281 DTVVALQALAK 291
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
2-234 4.47e-61

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short highly conserved region of proteinase-binding alpha-macro-globulins contains the cysteine and a glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding, and mediates the covalent binding of the alpha-macro-globulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 194.05  E-value: 4.47e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGSFGVWEKSGSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFDETG 81
Cdd:pfam07678  59 SKAIDYLEQGYQRQLSYKHPDGSYSAFGHSPGSTWLTAFVLKVFAQARKFI-FIDPEEICQSLRWLLSQQKPDGSFREPG 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  82 KVWHKDMQGGLRNGVALTSYVLTALLE-NDIAKVKHAV--VIQNGMNYLSN-QLAFINNPYDLSIATYAMMLNGH-TMKK 156
Cdd:pfam07678 138 PLLHRAMKGGVDGEVSLTAYVTIALLEaLDINGLLQRVhpSIRKALTYLEQaQLAGLTSPYTLAILAYALALAGSpETRE 217
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 157 EALDKLIDMSISDNNKkeRYWG-----------------TTNQIETTAYALLSFVMAEKYLDGIPVMNWLVNQRYVTGSF 219
Cdd:pfam07678 218 ELLKSLDAMAREEGNS--RYWErdeksdpqgvpeyppqaPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGF 295
                         250
                  ....*....|....*
gi 1342569868 220 PRTQDTFVGLKALTK 234
Cdd:pfam07678 296 SSTQDTVVALQALAE 310
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
19-234 6.16e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 55.86  E-value: 6.16e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   19 RQT-DGSFGVWEKSG-SSVFLTAFVATSMQTASKYMNDIDAAMVEKALDWLASKQHSSGRFDetgkvWHKDMQGGLRngv 96
Cdd:COG2373   1190 MQNsDGGFGLWPGGSeSDPWLTAYATDFLLEAREAGYAVPDDALDRALDYLRNYLRNPWEIE-----YDDAYRLAVR--- 1261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   97 ALTSYVLTALLENDIAKvkhavviqngMNYLSNQLAFINNPYDLS--IATYAMMlnGhtmKKEALDKLID--MSISDNNK 172
Cdd:COG2373   1262 AYALYVLARAGKADLGD----------LRYLYDRRKDALSPLAKAqlAAALALL--G---DKARAEELLAaaLARLRETG 1326
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1342569868  173 KERYWGTT--NQIETTAYALLSFVMAEKYLDGIPVM-NWLVNQRYvTGSFPRTQDTFVGLKALTK 234
Cdd:COG2373   1327 ARDYWYGDygSPLRDQALALALLAELGPDAPLAPKLaRWLAKALK-SGRWLSTQETAWALLALAA 1390
squa_tetra_cyc TIGR04277
squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, ...
13-142 9.49e-06

squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, occurs a small number of eukaryotes, some of them anaerobic. The pathway can occur under anaerobic conditions, and the product is thought to replace sterols, letting organisms with this compound build membrane suitable for performing phagocytosis.


Pssm-ID: 212000 [Multi-domain]  Cd Length: 624  Bit Score: 46.16  E-value: 9.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  13 QNQMRY-RQTDGSFGVWEKSGSSVFLTAfvATSMQTASKYMN-DIDAAMVEKALDWLASKQHSSGRFDETGKVWH--KDM 88
Cdd:TIGR04277 460 QKMIKYfMDTQEKFGSWEARWGINYIMA--AGAVLPALAKMNyDLNEGWAKNAINWLLNKQNADGGFGECTLSYNdpEKW 537
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1342569868  89 QGGLRNGVALTSYVLTALLE--NDIAKVKHAvvIQNGMNYLSNQLAFINNP-YDLSI 142
Cdd:TIGR04277 538 NGIGKSTVTQTSWGLLALLAveDHNDQIKEA--ADKAAQYLLDQFKRDDGEfKDHST 592
 
Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
1-234 1.82e-103

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZ and alpha (2)-M to the CD91 receptor clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 301.03  E-value: 1.82e-103
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   1 IDKATNLLRQGYQNQMRYRQTDGSFGVWEKS--GSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFD 78
Cdd:cd02897    44 ESKALGFLRTGYQRQLTYKHSDGSYSAFGESdkSGSTWLTAFVLKSFAQARPFI-YIDENVLQQALTWLSSHQKSNGCFR 122
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  79 ETGKVWHKDMQGGLRNGVALTSYVLTALLENDIAkvKHAVVIQNGMNYLSNQLAFINNPYDLSIATYAMMLNGHTMKKEA 158
Cdd:cd02897   123 EVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLP--SERPVVEKALSCLEAALDSISDPYTLALAAYALTLAGSEKRPEA 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 159 LDKLIDMSISDNNKKE-------------RYWGTTNQIETTAYALLSFVMA--EKYLDGIPVMNWLVNQRYVTGSFPRTQ 223
Cdd:cd02897   201 LKKLDELAISEDGTKHwsrpppseegpsyYWQAPSAEVEMTAYALLALLSAggEDLAEALPIVKWLAKQRNSLGGFSSTQ 280
                         250
                  ....*....|.
gi 1342569868 224 DTFVGLKALTK 234
Cdd:cd02897   281 DTVVALQALAK 291
A2M_like cd02891
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ...
2-234 1.59e-68

Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M, is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZ bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement is an effector of both the acquired and innate immune systems The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239221 [Multi-domain]  Cd Length: 282  Bit Score: 211.86  E-value: 1.59e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGSFGVWEKSG-SSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFDET 80
Cdd:cd02891    45 EKALEYIRKGYQRLLTYQRSDGSFSAWGNSDsGSTWLTAYVVKFLSQARKYI-DVDENVLARALGWLVPQQKEDGSFREL 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  81 GKVWHKDMQGGLRNGVALTSYVLTALLEndiAKVKHAVVIQNGMNYLSNQLAFINNPYDLSIATYAMMLNGH-TMKKEAL 159
Cdd:cd02891   124 GPVIHREMKGGVDDSVSLTAYVLIALAE---AGKACDASIEKALAYLETQLDGLLDPYALAILAYALALAGDsTRADEAL 200
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 160 DKLIDMSISDNNKKE------RYWGTTNQIETTAYALLSFVMAEKYLDGIPVMNWLVNQRYVTGSFPRTQDTFVGLKALT 233
Cdd:cd02891   201 KKLLEAAREKGGTAHwslswpGDYGSSLRVEATAYALLALLKLGDLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALA 280

                  .
gi 1342569868 234 K 234
Cdd:cd02891   281 A 281
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
2-234 5.46e-62

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 195.96  E-value: 5.46e-62
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGSFGVWEKSGSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFDETG 81
Cdd:cd02896    48 DEALKYIRQGYQRQLSYRKPDGSYAAWKNRPSSTWLTAFVVKVFSLARKYI-PVDQNVICGSVNWLISNQKPDGSFQEPS 126
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  82 KVWHKDMQGGLRNG---VALTSYVLTALLE----NDIAKVKHAVVIQNGMNYLSNQLAFINNPYDLSIATYAMMLNGHTM 154
Cdd:cd02896   127 PVIHREMTGGVEGSegdVSLTAFVLIALQEarsiCPPEVQNLDQSIRKAISYLENQLPNLQRPYALAITAYALALADSPL 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 155 KKEALDKLIDMSISDNNKKERYWGTTNQ----------IETTAYALLSFVMAEKYLDGIPVMNWLVNQRYVTGSFPRTQD 224
Cdd:cd02896   207 SHAANRKLLSLAKRDGNGWYWWTIDSPYwpvpgpsaitVETTAYALLALLKLGDIEYANPIARWLTEQRNYGGGFGSTQD 286
                         250
                  ....*....|
gi 1342569868 225 TFVGLKALTK 234
Cdd:cd02896   287 TVVALQALAE 296
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
2-234 4.47e-61

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short highly conserved region of proteinase-binding alpha-macro-globulins contains the cysteine and a glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding, and mediates the covalent binding of the alpha-macro-globulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 194.05  E-value: 4.47e-61
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGSFGVWEKSGSSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFDETG 81
Cdd:pfam07678  59 SKAIDYLEQGYQRQLSYKHPDGSYSAFGHSPGSTWLTAFVLKVFAQARKFI-FIDPEEICQSLRWLLSQQKPDGSFREPG 137
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  82 KVWHKDMQGGLRNGVALTSYVLTALLE-NDIAKVKHAV--VIQNGMNYLSN-QLAFINNPYDLSIATYAMMLNGH-TMKK 156
Cdd:pfam07678 138 PLLHRAMKGGVDGEVSLTAYVTIALLEaLDINGLLQRVhpSIRKALTYLEQaQLAGLTSPYTLAILAYALALAGSpETRE 217
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 157 EALDKLIDMSISDNNKkeRYWG-----------------TTNQIETTAYALLSFVMAEKYLDGIPVMNWLVNQRYVTGSF 219
Cdd:pfam07678 218 ELLKSLDAMAREEGNS--RYWErdeksdpqgvpeyppqaPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGF 295
                         250
                  ....*....|....*
gi 1342569868 220 PRTQDTFVGLKALTK 234
Cdd:pfam07678 296 SSTQDTVVALQALAE 310
ISOPREN_C2_like cd00688
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
2-234 3.62e-41

This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two broadly specific proteinase inhibitors alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP) and, the C3 C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY), these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II) which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and involved in the immobilization and entrapment of proteases. PZP is a pregnancy associated protein. Alpha (2)-M and PZP are known to bind to and, may modulate, the activity of placental protein-14 in T-cell growth and cytokine production thereby protecting the allogeneic fetus from attack by the maternal immune system.


Pssm-ID: 238362 [Multi-domain]  Cd Length: 300  Bit Score: 142.30  E-value: 3.62e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGSFGVWEKSG-SSVFLTAFVATSMQTASKYMnDIDAAMVEKALDWLASKQHSSGRFDET 80
Cdd:cd00688    48 DKADENIEKGIQRLLSYQLSDGGFSGWGGNDyPSLWLTAYALKALLLAGDYI-AVDRIDLARALNWLLSLQNEDGGFRED 126
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  81 GKVWHKDMQGGlrNGVALTSYVLTALLENDIAKVKhaVVIQNGMNYLSNQLAF--------INNPYDLSIATYAMMLNGH 152
Cdd:cd00688   127 GPGNHRIGGDE--SDVRLTAYALIALALLGKLDPD--PLIEKALDYLLSCQNYdggfgpggESHGYGTACAAAALALLGD 202
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868 153 T---MKKEALDKLIDMSISDNNKKERYW-----GTTNQIETTAYALLSFVMAEKYLDGIPVMNWLVNQRYVTGSF----- 219
Cdd:cd00688   203 LdspDAKKALRWLLSRQRPDGGWGEGRDrtnklSDSCYTEWAAYALLALGKLGDLEDAEKLVKWLLSQQNEDGGFsskpg 282
                         250
                  ....*....|....*..
gi 1342569868 220 --PRTQDTFVGLKALTK 234
Cdd:cd00688   283 ksYDTQHTVFALLALSL 299
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
19-234 6.16e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 55.86  E-value: 6.16e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   19 RQT-DGSFGVWEKSG-SSVFLTAFVATSMQTASKYMNDIDAAMVEKALDWLASKQHSSGRFDetgkvWHKDMQGGLRngv 96
Cdd:COG2373   1190 MQNsDGGFGLWPGGSeSDPWLTAYATDFLLEAREAGYAVPDDALDRALDYLRNYLRNPWEIE-----YDDAYRLAVR--- 1261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   97 ALTSYVLTALLENDIAKvkhavviqngMNYLSNQLAFINNPYDLS--IATYAMMlnGhtmKKEALDKLID--MSISDNNK 172
Cdd:COG2373   1262 AYALYVLARAGKADLGD----------LRYLYDRRKDALSPLAKAqlAAALALL--G---DKARAEELLAaaLARLRETG 1326
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1342569868  173 KERYWGTT--NQIETTAYALLSFVMAEKYLDGIPVM-NWLVNQRYvTGSFPRTQDTFVGLKALTK 234
Cdd:COG2373   1327 ARDYWYGDygSPLRDQALALALLAELGPDAPLAPKLaRWLAKALK-SGRWLSTQETAWALLALAA 1390
squa_tetra_cyc TIGR04277
squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, ...
13-142 9.49e-06

squalene--tetrahymanol cyclase; This enzyme, also called squalene--tetrahymanol cyclase, occurs a small number of eukaryotes, some of them anaerobic. The pathway can occur under anaerobic conditions, and the product is thought to replace sterols, letting organisms with this compound build membrane suitable for performing phagocytosis.


Pssm-ID: 212000 [Multi-domain]  Cd Length: 624  Bit Score: 46.16  E-value: 9.49e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  13 QNQMRY-RQTDGSFGVWEKSGSSVFLTAfvATSMQTASKYMN-DIDAAMVEKALDWLASKQHSSGRFDETGKVWH--KDM 88
Cdd:TIGR04277 460 QKMIKYfMDTQEKFGSWEARWGINYIMA--AGAVLPALAKMNyDLNEGWAKNAINWLLNKQNADGGFGECTLSYNdpEKW 537
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1342569868  89 QGGLRNGVALTSYVLTALLE--NDIAKVKHAvvIQNGMNYLSNQLAFINNP-YDLSI 142
Cdd:TIGR04277 538 NGIGKSTVTQTSWGLLALLAveDHNDQIKEA--ADKAAQYLLDQFKRDDGEfKDHST 592
hopene_cyclase TIGR01507
squalene-hopene cyclase; SHC is an essential prokaryotic gene in hopanoid (triterpenoid) ...
2-130 7.46e-05

squalene-hopene cyclase; SHC is an essential prokaryotic gene in hopanoid (triterpenoid) biosynthesis. Squalene hopene cyclase, an integral membrane protein, directly cyclizes squalene into hopanoid products. [Fatty acid and phospholipid metabolism, Other]


Pssm-ID: 273661 [Multi-domain]  Cd Length: 635  Bit Score: 43.34  E-value: 7.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868   2 DKATNLLRQGYQNQMRYRQTDGS-FGVWEKS---GSSVFLTAFVATSMQTaskymndiDAAMVEKALDWLASKQHSSGRF 77
Cdd:TIGR01507 467 DDAWPVIERAVEYLKREQEPDGSwFGRWGVNylyGTGAVLSALKAVGIDT--------REPYIQKALAWLESHQNPDGGW 538
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1342569868  78 DETGKVWHKDMQGGLRNGVAL-TSYVLTALLEndiAKVKHAVVIQNGMNYLSNQ 130
Cdd:TIGR01507 539 GEDCRSYEDPAYAGKGASTASqTAWALIALIA---AGRAESEAARRGVQYLVET 589
SQCY cd02889
Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - ...
20-131 4.16e-03

Squalene cyclase (SQCY) domain; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. Bacterial SQCY catalyzes the convertion of squalene to hopene or diplopterol. Eukaryotic OSQCY transforms the 2,3-epoxide of squalene to compounds such as, lanosterol (a metabolic precursor of cholesterol and steroid hormones) in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain. This group also contains SQCY-like archael sequences and some bacterial SQCY's which lack this minor domain.


Pssm-ID: 239219 [Multi-domain]  Cd Length: 348  Bit Score: 37.58  E-value: 4.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  20 QTDGS-FGVWeksGSS-VFLTAFVATSMQTASKYMNDidaAMVEKALDWLASKQHSSGRFDETGK-VWHKDMQGGLRNGV 96
Cdd:cd02889   205 EPDGSwYGRW---GVCfIYGTWFALEALAAAGEDENS---PYVRKACDWLLSKQNPDGGWGESYEsYEDPSYAGGGRSTV 278
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 1342569868  97 ALTSYVLTALLEndiAKVKHAVVIQNGMNYL-SNQL 131
Cdd:cd02889   279 VQTAWALLALMA---AGEPDSEAVKRGVKYLlNTQQ 311
SQCY_1 cd02892
Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an ...
20-131 5.08e-03

Squalene cyclase (SQCY) domain subgroup 1; found in class II terpene cyclases that have an alpha 6 - alpha 6 barrel fold. Squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY) are integral membrane proteins that catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. This group contains bacterial SQCY which catalyzes the convertion of squalene to hopene or diplopterol and eukaryotic OSQCY which transforms the 2,3-epoxide of squalene to compounds such as, lanosterol in mammals and fungi or, cycloartenol in plants. Deletion of a single glycine residue of Alicyclobacillus acidocaldarius SQCY alters its substrate specificity into that of eukaryotic OSQCY. Both enzymes have a second minor domain, which forms an alpha-alpha barrel that is inserted into the major domain.


Pssm-ID: 239222 [Multi-domain]  Cd Length: 634  Bit Score: 37.56  E-value: 5.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  20 QTDGS-FGVWeksGSS-VFLTAFVATSMQTASK-YMNDidaAMVEKALDWLASKQHSSGRFDETGKVWHKDMQGGL-RNG 95
Cdd:cd02892   490 EPDGSwYGRW---GVCyIYGTWFALEALAAAGEdYENS---PYIRKACDFLLSKQNPDGGWGESYLSYEDKSYAGGgRST 563
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 1342569868  96 VALTSYVLTALLEndiAKVKHAVVIQNGMNYL-SNQL 131
Cdd:cd02892   564 VVQTAWALLALMA---AGEPDSEAVERGIKYLlNTQL 597
SqhC COG1657
Terpene cyclase SqhC [Lipid transport and metabolism];
22-132 7.13e-03

Terpene cyclase SqhC [Lipid transport and metabolism];


Pssm-ID: 441263 [Multi-domain]  Cd Length: 644  Bit Score: 37.11  E-value: 7.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1342569868  22 DGS-FGVWeksGSS-VFLTAFVATSMQTASKYMNDidaAMVEKALDWLASKQHSSGRFDETGKVWHKDMQGGLRNGVAL- 98
Cdd:COG1657   497 DGSwFGRW---GVNyIYGTWSVLTGLNAAGVDPDD---PAIRRAVAWLLSIQNADGGWGEDCRSYEDPRYVGLGPSTASq 570
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 1342569868  99 TSYVLTALLEndiAKVKHAVVIQNGMNYL-SNQLA 132
Cdd:COG1657   571 TAWALLALLA---AGEADSPAVARGIAYLlSTQRE 602
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH