Conserved domains on  [gi|1720360732|ref|XP_030100817|]

stabilin-2 isoform X3 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 1.24e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 1.24e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720360732 1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 4.45e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 4.45e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1720360732  613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 7.09e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 7.09e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720360732 1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 7.14e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 7.14e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1720360732  465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 1.70e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.97  E-value: 1.70e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 7.65e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.43  E-value: 7.65e-05
                           10        20
                   ....*....|....*....|....*....
gi 1720360732  844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 5.74e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 5.74e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 6.85e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 6.85e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1720360732  927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 2.14e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 2.14e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 3.99e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 3.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1720360732  254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 6.03e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 6.03e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720360732  881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
 
Name Accession Description Interval E-value
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 1.24e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 1.24e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720360732 1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 4.45e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 4.45e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1720360732  613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 7.09e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 7.09e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720360732 1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 7.14e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 7.14e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1720360732  465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-661 8.01e-22

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 8.01e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  511 AMDKIEPTLESNPQQTIMTMLQ--PRYGKFRSLLEKTNVGQALEKGGidePYTIFVPSNEALSNMTAGVLDYLLSPEGSR 588
Cdd:COG2335     17 ASSAAAEGAAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKA 93
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720360732  589 KLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:COG2335     94 TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1008-1137 4.66e-20

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 88.81  E-value: 4.66e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1008 ELSFLSEAavfyqwINNASLQSMLSATSNLTVLVPSLQAIKDMDQNEKSFWLSRNNIPAL---IKYHTLLGTYRVADLQT 1084
Cdd:COG2335     42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720360732 1085 LPSshmlATSLQGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:COG2335    116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1146-1273 2.07e-17

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 81.11  E-value: 2.07e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1146 PSLLTRLEQMPDYSIFRGYIIHYNLASAIEAADAYTVFVPNNEAIESYIREKKATSLKE-------DILQYHVVLGeKLL 1218
Cdd:COG2335     31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVVPG-KVT 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720360732 1219 RNDLHNGMHRETMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:COG2335    110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
561-662 1.78e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 70.47  E-value: 1.78e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732   561 TIFVPSNEALSNMTAGvLDYLLSPegsrKLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISS-KGQILANNVA 639
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1720360732   640 VDETEVAAKNGRIYTLTGVLIPP 662
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1038-1137 4.57e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 69.31  E-value: 4.57e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  1038 TVLVPSLQAIKDMDQNEKSfwLSRNNIPALIKYHTLLGTYRVADLQtlpsSHMLATSLQGSFLRL--DKADGNITIEGAS 1115
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNS--LLADKLKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSKLRItrSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 1720360732  1116 FVDGDNAATNGVVHIINKVLIP 1137
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
390-510 1.74e-13

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 69.94  E-value: 1.74e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  390 GQLTSFISILDRT-YAWPLSNLGPFTVLLPSDKGLKGVD---VKELLM--DKEAARYFVKLHIIAGQMSTEQMYNLDTFY 463
Cdd:COG2335     41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAFAALPagtLDALLKpeNKATLTKILTYHVVPGKVTAADLKDGKTLT 120
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720360732  464 TLTGKSgeiinkdkdnqLKLKLYGSKIV----QIIQGNIVASNGLVHILDR 510
Cdd:COG2335    121 TLQGQT-----------LTVTVSGGGVTvngaNVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1181-1273 1.30e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 62.38  E-value: 1.30e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  1181 TVFVPNNEAIESYIREKKA--TSLKEDILQYHVVLGeKLLRNDLHNGMHRETMLGFSylLAFFLHND--QLYVNEAPINY 1256
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSllADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSK--LRITRSGGsgTVTVNGARIVE 77
                            90
                    ....*....|....*..
gi 1720360732  1257 TNVATDKGVIHGLEKVL 1273
Cdd:smart00554   78 ADIAATNGVVHVIDRVL 94
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
414-510 1.64e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.99  E-value: 1.64e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732   414 TVLLPSDKGLKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYTLTGKSGEIINKDKDNQLKLklygsKIVQI 493
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV-----NGARI 75
                            90
                    ....*....|....*..
gi 1720360732   494 IQGNIVASNGLVHILDR 510
Cdd:smart00554   76 VEADIAATNGVVHVIDR 92
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 1.70e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.97  E-value: 1.70e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 7.65e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.43  E-value: 7.65e-05
                           10        20
                   ....*....|....*....|....*....
gi 1720360732  844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 5.74e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 5.74e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 6.85e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 6.85e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1720360732  927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 2.14e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 2.14e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 3.99e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 3.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1720360732  254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 6.03e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 6.03e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720360732  881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
 
Name Accession Description Interval E-value
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1018-1137 1.24e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 100.41  E-value: 1.24e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1018 FYQWINNASLQSMLSAT-SNLTVLVPSLQAIKDMDQNEKSFWLS-RNNIPALIKYHTLLGTYRVADLQTLPSshmlATSL 1095
Cdd:pfam02469    6 FVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKNGGT----LATL 81
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720360732 1096 QGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:pfam02469   82 QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
533-661 4.45e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 4.45e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  533 PRYGKFRSLLEKTNVGQALEKGgiDEPYTIFVPSNEALSNMTAGVLDYLLSPegSRKLLELVRYHIVAfTQLEVATLVST 612
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGS--QGPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1720360732  613 LHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:pfam02469   76 GTLATLQGSKLRVNVTG-GSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1156-1273 7.09e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.40  E-value: 7.09e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1156 PDYSIFRGYIIHYNLASAIEAADA-YTVFVPNNEAIESYIREKKATSLK-----EDILQYHVVLGeKLLRNDLHNGMHRE 1229
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGTLNFLLKdkeqlKNLLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720360732 1230 TMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
390-512 7.14e-22

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 92.70  E-value: 7.14e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  390 GQLTSFISILDRT-YAWPLSN-LGPFTVLLPSDKG---LKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYT 464
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLNGsQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1720360732  465 LTGKSGEIINKDKdnqlKLKLYGSKIVQiiqGNIVASNGLVHILDRAM 512
Cdd:pfam02469   81 LQGSKLRVNVTGG----SVTVNGARVVQ---ADIEATNGVIHVIDKVL 121
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-661 8.01e-22

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 8.01e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  511 AMDKIEPTLESNPQQTIMTMLQ--PRYGKFRSLLEKTNVGQALEKGGidePYTIFVPSNEALSNMTAGVLDYLLSPEGSR 588
Cdd:COG2335     17 ASSAAAEGAAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKA 93
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720360732  589 KLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISSkGQILANNVAVDETEVAAKNGRIYTLTGVLIP 661
Cdd:COG2335     94 TLTKILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
1008-1137 4.66e-20

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 88.81  E-value: 4.66e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1008 ELSFLSEAavfyqwINNASLQSMLSATSNLTVLVPSLQAIKDMDQNEKSFWLSRNNIPAL---IKYHTLLGTYRVADLQT 1084
Cdd:COG2335     42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720360732 1085 LPSshmlATSLQGSFLRLDKADGNITIEGASFVDGDNAATNGVVHIINKVLIP 1137
Cdd:COG2335    116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1146-1273 2.07e-17

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 81.11  E-value: 2.07e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732 1146 PSLLTRLEQMPDYSIFRGYIIHYNLASAIEAADAYTVFVPNNEAIESYIREKKATSLKE-------DILQYHVVLGeKLL 1218
Cdd:COG2335     31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVVPG-KVT 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1720360732 1219 RNDLHNGMHRETMLGFSylLAFFLHNDQLYVNEAPINYTNVATDKGVIHGLEKVL 1273
Cdd:COG2335    110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
561-662 1.78e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 70.47  E-value: 1.78e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732   561 TIFVPSNEALSNMTAGvLDYLLSPegsrKLLELVRYHIVAfTQLEVATLVSTLHIRSMANQIITFNISS-KGQILANNVA 639
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1720360732   640 VDETEVAAKNGRIYTLTGVLIPP 662
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1038-1137 4.57e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 69.31  E-value: 4.57e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  1038 TVLVPSLQAIKDMDQNEKSfwLSRNNIPALIKYHTLLGTYRVADLQtlpsSHMLATSLQGSFLRL--DKADGNITIEGAS 1115
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNS--LLADKLKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSKLRItrSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 1720360732  1116 FVDGDNAATNGVVHIINKVLIP 1137
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
390-510 1.74e-13

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 69.94  E-value: 1.74e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  390 GQLTSFISILDRT-YAWPLSNLGPFTVLLPSDKGLKGVD---VKELLM--DKEAARYFVKLHIIAGQMSTEQMYNLDTFY 463
Cdd:COG2335     41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAFAALPagtLDALLKpeNKATLTKILTYHVVPGKVTAADLKDGKTLT 120
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720360732  464 TLTGKSgeiinkdkdnqLKLKLYGSKIV----QIIQGNIVASNGLVHILDR 510
Cdd:COG2335    121 TLQGQT-----------LTVTVSGGGVTvngaNVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1181-1273 1.30e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 62.38  E-value: 1.30e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732  1181 TVFVPNNEAIESYIREKKA--TSLKEDILQYHVVLGeKLLRNDLHNGMHRETMLGFSylLAFFLHND--QLYVNEAPINY 1256
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSllADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSK--LRITRSGGsgTVTVNGARIVE 77
                            90
                    ....*....|....*..
gi 1720360732  1257 TNVATDKGVIHGLEKVL 1273
Cdd:smart00554   78 ADIAATNGVVHVIDRVL 94
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
414-510 1.64e-11

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 61.99  E-value: 1.64e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720360732   414 TVLLPSDKGLKGVDVKELLMDKEAARYFVKLHIIAGQMSTEQMYNLDTFYTLTGKSGEIINKDKDNQLKLklygsKIVQI 493
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV-----NGARI 75
                            90
                    ....*....|....*..
gi 1720360732   494 IQGNIVASNGLVHILDR 510
Cdd:smart00554   76 VEADIAATNGVVHVIDR 92
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1518 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1482 CEISNGGCSAKADCKRTiPGSRVCVCKAGYTGDGIVC 1518
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1524-1560 1.70e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.97  E-value: 1.70e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732 1524 CLENHGGCDRHAECTQTgPNQAVCNCLPKYTGDGKVC 1560
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
844-872 7.65e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.43  E-value: 7.65e-05
                           10        20
                   ....*....|....*....|....*....
gi 1720360732  844 CHIHATCEYSNETASCVCNDGYEGDGTLC 872
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
334-369 5.74e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 5.74e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  334 CESKN-PCHKNANCSTVsPGQTQCTCQKGYVGDGLNC 369
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
927-959 6.85e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 6.85e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1720360732  927 SGGCHDNATCLYVgPGQNECECKKGFRGNGIDC 959
Cdd:pfam12947    5 NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
965-1001 2.14e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 2.14e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1720360732  965 CLEQIEKCHPLATCQYTLSGVwSCVCQEGYEGNGVLC 1001
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
254-283 3.99e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 3.99e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1720360732  254 CHPHASCSYLgPNRHSCVCQKGYQGDGQVC 283
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
881-916 6.03e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 6.03e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720360732  881 STSRGGCSPNAECIQaSTGTYSCVCQRGWTGNGRDC 916
Cdd:pfam12947    2 SDNNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
 
BLAST search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
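
Each hit in the listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search was run with an E-value threshold of 0.01. The following is a minimal Python sketch of how such summary lines could be parsed and checked against that threshold. The names `SUMMARY_RE`, `parse_summary`, and `passes_threshold` are hypothetical helpers and not part of any NCBI tool, and the use of `<=` for the comparison is an assumption, since the page does not say whether the cutoff is inclusive.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this
# listing; the optional bracketed tag covers "[Multi-domain]".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed inclusive comparison against the E-value threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    example = "Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 93.82  E-value: 8.01e-22"
    hit = parse_summary(example)
    print(hit, passes_threshold(hit))
```

Under this sketch, every hit listed on this page would pass, since the largest reported E-value (6.03e-03) is below 0.01.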

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.