Conserved domains on  [gi|530372113|ref|XP_005265031|]

stabilin-1 isoform X2 [Homo sapiens]


List of domain hits

Name Accession Description Interval E-value
Link_Domain super family cl02612
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2208-2300 2.26e-50

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms aggregates with HA that are stabilized by cartilage link protein. These aggregates contribute to the tissue's load-bearing properties. Aggregates in which other CSPGs substitute for aggrecan might contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high-affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain composed of a link module flanked by N- and C-terminal extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) containing link modules 3 and 4, which lack HA-binding activity. HAPLNs contain two contiguous link modules.


The actual alignment was detected with superfamily member cd03515:

Pssm-ID: 470631  Cd Length: 93  Bit Score: 173.42  E-value: 2.26e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2208 GVFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:cd03515     1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                          90
                  ....*....|...
gi 530372113 2288 RKNLSERWDAYCF 2300
Cdd:cd03515    81 RLNLSERWDAYCY 93
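
The alignment block above pairs the query (`gi 530372113`) with the domain model (`Cdd:cd03515`) column by column. As a rough illustration of how such a block can be summarized, the following Python sketch computes percent identity over the ungapped columns. The function name `percent_identity` and the constants `QUERY` and `SUBJECT` are hypothetical names chosen for this example; they are not part of any NCBI tool. The sketch assumes that `-` marks a gap and that case should be ignored when comparing residues.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over columns where neither row has a gap ('-')."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    # Keep only columns where both sequences contribute a residue.
    paired = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    if not paired:
        return 0.0
    # Compare case-insensitively (assumption: case carries no residue identity).
    matches = sum(q.upper() == s.upper() for q, s in paired)
    return 100.0 * matches / len(paired)


# Rows copied from the cd03515 block above, with the two display lines joined.
QUERY = (
    "GVFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA"
    "RKNLSERWDAYCF"
)
SUBJECT = (
    "GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP"
    "RLNLSERWDAYCY"
)

if __name__ == "__main__":
    identity = percent_identity(QUERY, SUBJECT)
    print(f"{identity:.1f}% identity over {len(QUERY)} columns")
```

Identity computed this way is only a descriptive statistic; the Bit Score and E-value reported on the page come from the position-specific scoring matrix and are not derived from it.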
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
496-643 1.63e-28

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];

Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 113.85  E-value: 1.63e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  496 QAPSGTPGDPKRTIGQILASTEAFSRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELV 574
Cdd:COG2335    20 AAAEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKIL 99
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530372113  575 RYHIYNhGQLTVEKLISKGRILTMANQVLAVNISeEGRILLGpeGVPLQRVDVMAANGVIHMLDGILLP 643
Cdd:COG2335   100 TYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
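
Each hit on this page carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, optionally with a `[Multi-domain]` tag. A minimal Python sketch for turning that line into a typed record follows. The names `SUMMARY_RE`, `HitSummary` and `parse_summary` are hypothetical, and the pattern is inferred only from the lines visible in this page dump, so other CD-Search output may need adjustments.

```python
import re
from dataclasses import dataclass

# Pattern inferred from the summary lines shown on this page (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


@dataclass
class HitSummary:
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


def parse_summary(line: str) -> HitSummary:
    """Parse one 'Pssm-ID: ...' summary line into a HitSummary record."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match.group("pssm_id")),
        multi_domain=match.group("tag") == "Multi-domain",
        cd_length=int(match.group("cd_length")),
        bit_score=float(match.group("bit_score")),
        e_value=float(match.group("e_value")),
    )


if __name__ == "__main__":
    line = "Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 113.85  E-value: 1.63e-28"
    print(parse_summary(line))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance, for example to separate the strong link-domain and fasciclin hits from the weaker EGF-like ones.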
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1736-1866 3.60e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.60e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1736 GYKIFSGLLKVAGLLPLLREaSHRPFTMLWPTDAAFRALPPDRQAWLYheDHRDKLAAILRGHMIRNVeALASDLPNLGP 1815
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNG-SQGPFTVFAPTNEAFAKLPAGTLNFLL--KDKEQLKNLLKYHVVPGR-LTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 530372113  1816 LRTMHGTPISFSCSRtraGELMVgeDDARIVQRHLPFEGGLAYGIDQLLEP 1866
Cdd:pfam02469   78 LATLQGSKLRVNVTG---GSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1606-1710 4.94e-26

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 105.03  E-value: 4.94e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1606 KELKG-DGPFTIFVPHADLMSNLSQDELARIRAHRQL---VFRYHVVGCRrLRSEDLLEQGYATALSGHPLRFSEREGSI 1681
Cdd:pfam02469   17 DTLNGsQGPFTVFAPTNEAFAKLPAGTLNFLLKDKEQlknLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGSV 95
                           90       100
                   ....*....|....*....|....*....
gi 530372113  1682 YLNDfARVVSSDHEAVNGILHFIDRVLLP 1710
Cdd:pfam02469   96 TVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
999-1120 7.82e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.86  E-value: 7.82e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   999 AHFSIFYQWLKSAGI--TLP-ADRRVTALVPSEAAVRQLSPEDRAFWLQ-PRTLPNLVRAHFLQGALFEEELARlgGQEV 1074
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKN--GGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 530372113  1075 ATLNPTTrWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLP 1120
Cdd:pfam02469   79 ATLQGSK-LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2367-2462 5.24e-09

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;



Pssm-ID: 214719  Cd Length: 97  Bit Score: 55.45  E-value: 5.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   2367 TLFVPVNEGF------VDNMT--LSGPDLELHASNATLLSANASQGKLLPAHSGLSLIISDAGPdnsswapvaPGTVVV- 2437
Cdd:smart00554    1 TVFAPTDEAFqklppdLNSLLadKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGG---------SGTVTVn 71
                            90       100
                    ....*....|....*....|....*.
gi 530372113   2438 -SRIIVWDIMAFNGIIHALASPLLAP 2462
Cdd:smart00554   72 gARIVEADIAATNGVVHVIDRVLLPP 97
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1459-1495 3.60e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 51.06  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113  1459 CAHGHGGCSPHANCTKVaPGQRTCTCQDGYMGDGELC 1495
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2095-2130 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.52  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 530372113  2095 CQDGHGGCSEHANCSQVGTMVTCTCLPDYEGDGWSC 2130
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
951-987 1.15e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.82  E-value: 1.15e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113   951 CRAGNGGCHGLATCRAVGGgQRVCTCPPGFGGDGFSC 987
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1544-1581 5.59e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 5.59e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1544 CSKNNGGCSPYATCKSTgDGQRTCTCDTAHTvGDGLTC 1581
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
Fasciclin super family cl02663
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
393-490 7.09e-06

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


The actual alignment was detected with superfamily member pfam02469:

Pssm-ID: 470649  Cd Length: 123  Bit Score: 47.63  E-value: 7.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   393 TAGPFTVLVPSVSSFS--------SRTMNASLAQQLCRQHIIAGQHILEDTRTQQTRRwwTLAGQEITVTFnqftkysyk 464
Cdd:pfam02469   22 SQGPFTVFAPTNEAFAklpagtlnFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLA--TLQGSKLRVNV--------- 90
                           90       100
                   ....*....|....*....|....*....
gi 530372113   465 ykDQPQQTFN---IYKANNIAANGVFHVV 490
Cdd:pfam02469   91 --TGGSVTVNgarVVQADIEATNGVIHVI 117
Fasciclin super family cl02663
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1139-1241 1.30e-05

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


The actual alignment was detected with superfamily member pfam02469:

Pssm-ID: 470649  Cd Length: 123  Bit Score: 46.86  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1139 PAFSLFRELLQHHGLVPQIEAATA-YTIFVPTNRSLEA--QGNSSHLDADT------VRHHVVLGeALSMETLRKGGHRN 1209
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKlpAGTLNFLLKDKeqlknlLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110
                   ....*....|....*....|....*....|..
gi 530372113  1210 SLLGPAhwIVFYNHSGQPEVNHVPLEGPMLEA 1241
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEA 109
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1501-1538 3.75e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 3.75e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1501 CLIHHGGCHIHAECIPTGPQqVSCSCREGYSGDGIrTC 1538
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGS-FTCTCNDGYTGDGV-TC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2136-2173 2.45e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.27  E-value: 2.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  2136 CTDGHrGGCSEHANCLSTGlNTRRCECHAGYVGDGLQC 2173
Cdd:pfam12947    1 CSDNN-GGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
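
In the alignment rows on this page, each line gives a label, a start coordinate, the aligned residues and an end coordinate. In the rows shown here, the end coordinate equals the start plus the number of residues (gaps excluded) minus one. The Python sketch below checks that relationship for a single row; it can help catch rows damaged during copy and paste. `ROW_RE`, `parse_row` and `row_is_consistent` are hypothetical names, and the row pattern is an assumption based solely on the layout visible in this dump.

```python
import re

# Row layout assumed from this page: "<label> <start> <aligned residues> <end>".
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def parse_row(line: str) -> tuple[str, int, str, int]:
    """Split one alignment row into (label, start, aligned_sequence, end)."""
    match = ROW_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return (
        match.group("label"),
        int(match.group("start")),
        match.group("seq"),
        int(match.group("end")),
    )


def row_is_consistent(line: str) -> bool:
    """True if end == start + (number of non-gap residues) - 1."""
    _, start, seq, end = parse_row(line)
    residues = sum(1 for char in seq if char != "-")
    return start + residues - 1 == end


if __name__ == "__main__":
    rows = [
        "gi 530372113  2136 CTDGHrGGCSEHANCLSTGlNTRRCECHAGYVGDGLQC 2173",
        "Cdd:pfam12947    1 CSDNN-GGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36",
    ]
    for row in rows:
        print(parse_row(row)[0], row_is_consistent(row))
```

Lowercase residues count toward the coordinates; in the blocks shown on this page they appear opposite gaps in the other row, which suggests they mark positions without a counterpart in the partner sequence.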
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
914-945 4.42e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.42e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 530372113   914 GGCHTDALCSYVgPGQSRCTCKLGFAGDGYQC 945
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-859 7.66e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 7.66e-04
                           10        20
                   ....*....|....*....|....*....
gi 530372113   831 CHLHARCVSQEGVARCRCLDGFEGDGFSC 859
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
865-902 3.19e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.19e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113   865 CSHpDRGGCSENAECVPgSLGTHHCTCHKGWSGDGRVC 902
Cdd:pfam12947    1 CSD-NNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1424-1453 3.59e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.59e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 530372113  1424 CDPNANCVQdSAGASTCACAAGYSGNGIFC 1453
Cdd:pfam12947    8 CHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1992-2021 4.24e-03

Laminin-type epidermal growth factor-like domain;

Pssm-ID: 214543  Cd Length: 46  Bit Score: 37.29  E-value: 4.24e-03
                            10        20        30
                    ....*....|....*....|....*....|
gi 530372113   1992 SGQCLCRSGFAGTACELCAPGAFGPHCQAC 2021
Cdd:smart00180   17 TGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2208-2300 2.26e-50

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low-density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low-density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages, whereas stabilin-2 is not. In macrophages, stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high-affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 173.42  E-value: 2.26e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2208 GVFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:cd03515     1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                          90
                  ....*....|...
gi 530372113 2288 RKNLSERWDAYCF 2300
Cdd:cd03515    81 RLNLSERWDAYCY 93
LINK smart00445
Link (Hyaluronan-binding);
2206-2301 2.74e-36

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 133.24  E-value: 2.74e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   2206 RAGVFHLQATsGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSL 2285
Cdd:smart00445    1 DGGVFHVEKN-GRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGGNLPGVRQY 79
                            90
                    ....*....|....*.
gi 530372113   2286 GARKNLSeRWDAYCFR 2301
Cdd:smart00445   80 GFPDPTS-RYDAYCFN 94
Xlink pfam00193
Extracellular link domain;
2208-2300 2.20e-35

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 130.39  E-value: 2.20e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  2208 GVFHLQAtSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:pfam00193    1 GVFHLES-PGRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGGNMPGVRQYGF 79
                           90
                   ....*....|...
gi 530372113  2288 RKNLSERWDAYCF 2300
Cdd:pfam00193   80 RDPLSERYDAYCY 92
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
496-643 1.63e-28

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 113.85  E-value: 1.63e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  496 QAPSGTPGDPKRTIGQILASTEAFSRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELV 574
Cdd:COG2335    20 AAAEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKIL 99
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530372113  575 RYHIYNhGQLTVEKLISKGRILTMANQVLAVNISeEGRILLGpeGVPLQRVDVMAANGVIHMLDGILLP 643
Cdd:COG2335   100 TYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1736-1866 3.60e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.60e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1736 GYKIFSGLLKVAGLLPLLREaSHRPFTMLWPTDAAFRALPPDRQAWLYheDHRDKLAAILRGHMIRNVeALASDLPNLGP 1815
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNG-SQGPFTVFAPTNEAFAKLPAGTLNFLL--KDKEQLKNLLKYHVVPGR-LTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 530372113  1816 LRTMHGTPISFSCSRtraGELMVgeDDARIVQRHLPFEGGLAYGIDQLLEP 1866
Cdd:pfam02469   78 LATLQGSKLRVNVTG---GSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1606-1710 4.94e-26

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 105.03  E-value: 4.94e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1606 KELKG-DGPFTIFVPHADLMSNLSQDELARIRAHRQL---VFRYHVVGCRrLRSEDLLEQGYATALSGHPLRFSEREGSI 1681
Cdd:pfam02469   17 DTLNGsQGPFTVFAPTNEAFAKLPAGTLNFLLKDKEQlknLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGSV 95
                           90       100
                   ....*....|....*....|....*....
gi 530372113  1682 YLNDfARVVSSDHEAVNGILHFIDRVLLP 1710
Cdd:pfam02469   96 TVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
999-1120 7.82e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.86  E-value: 7.82e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   999 AHFSIFYQWLKSAGI--TLP-ADRRVTALVPSEAAVRQLSPEDRAFWLQ-PRTLPNLVRAHFLQGALFEEELARlgGQEV 1074
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKN--GGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 530372113  1075 ATLNPTTrWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLP 1120
Cdd:pfam02469   79 ATLQGSK-LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
519-643 1.84e-20

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 89.23  E-value: 1.84e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   519 FSRFETILENCGLPSILDGP-GPFTVFAPSNEAVDSLRDGRLIYLFtAGLSKLQELVRYHIYNhGQLTVEKLISKGRILT 597
Cdd:pfam02469    3 FSTFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLL-KDKEQLKNLLKYHVVP-GRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 530372113   598 MANQVLAVNIsEEGRILLgpEGVPLQRVDVMAANGVIHMLDGILLP 643
Cdd:pfam02469   81 LQGSKLRVNV-TGGSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1615-1711 5.28e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 87.03  E-value: 5.28e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1615 TIFVPHADLMSNLSQDELARIRAHRQLVFRYHVVGcRRLRSEDLLEQGYATALSGHPLRFSERE--GSIYLNDfARVVSS 1692
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgsGTVTVNG-ARIVEA 78
                            90
                    ....*....|....*....
gi 530372113   1693 DHEAVNGILHFIDRVLLPP 1711
Cdd:smart00554   79 DIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
542-644 6.29e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 86.65  E-value: 6.29e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113    542 TVFAPSNEAVDSLRDGRliYLFTAglSKLQELVRYHIYNhGQLTVEKLISKGRILTMANQVLAVNISEeGRILLGPEGVP 621
Cdd:smart00554    1 TVFAPTDEAFQKLPPDL--NSLLA--DKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSG-GSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 530372113    622 LQRVDVMAANGVIHMLDGILLPP 644
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1762-1867 1.65e-19

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 85.49  E-value: 1.65e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1762 TMLWPTDAAFRALPPDRQAWLyhedhRDKLAAILRGHMIrNVEALASDLPNLGPLRTMHGTPISFSCSRTRAgelMVGED 1841
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVV-PGRLSSADLLNGGTLPTLAGSKLRITRSGGSG---TVTVN 71
                            90       100
                    ....*....|....*....|....*.
gi 530372113   1842 DARIVQRHLPFEGGLAYGIDQLLEPP 1867
Cdd:smart00554   72 GARIVEADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1608-1710 3.16e-19

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 86.88  E-value: 3.16e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1608 LKGDGPFTIFVPH------------ADLMSNLSQDELARIrahrqlvFRYHVVGcRRLRSEDLLEQGYATALSGHPLRFS 1675
Cdd:COG2335    59 LSGEGPFTVFAPTdaafaalpagtlDALLKPENKATLTKI-------LTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVT 130
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 530372113 1676 EREGSIYLNDfARVVSSDHEAVNGILHFIDRVLLP 1710
Cdd:COG2335   131 VSGGGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
990-1120 2.57e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 78.79  E-value: 2.57e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  990 DIFRELEANAHFSIFYQWLKSAGI--TLPADRRVTALVPSEAAVRQLSPEDRAFWLQP---RTLPNLVRAHFLQGALFEE 1064
Cdd:COG2335    32 NIVETAANNPDFSTLVAALKAAGLvdTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkATLTKILTYHVVPGKVTAA 111
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 530372113 1065 ELARlgGQEVATLNPTTrWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLP 1120
Cdd:COG2335   112 DLKD--GKTLTTLQGQT-LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1716-1866 4.04e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 78.02  E-value: 4.04e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1716 WEPDDAPIPRRNVTAAAQGFG-YKIFSGLLKVAGLLPLLREAshRPFTMLWPTDAAFRALPPD-RQAWLYHEDhRDKLAA 1793
Cdd:COG2335    21 AAEGAAMAPTKNIVETAANNPdFSTLVAALKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGtLDALLKPEN-KATLTK 97
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530372113 1794 ILRGHMIRNvEALASDLPNLGPLRTMHGTPISFSCSrtrAGELMVGedDARIVQRHLPFEGGLAYGIDQLLEP 1866
Cdd:COG2335    98 ILTYHVVPG-KVTAADLKDGKTLTTLQGQTLTVTVS---GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1022-1121 2.66e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 73.55  E-value: 2.66e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1022 TALVPSEAAVRQLSPEDRAFWLQprTLPNLVRAHFLQGALFEEELarLGGQEVATLNPTT-RWEIRNISGRVWVQNASVD 1100
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLAD--KLKNLLLYHVVPGRLSSADL--LNGGTLPTLAGSKlRITRSGGSGTVTVNGARIV 76
                            90       100
                    ....*....|....*....|.
gi 530372113   1101 VADLLATNGVLHILSQVLLPP 1121
Cdd:smart00554   77 EADIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2367-2462 5.24e-09

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 55.45  E-value: 5.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   2367 TLFVPVNEGF------VDNMT--LSGPDLELHASNATLLSANASQGKLLPAHSGLSLIISDAGPdnsswapvaPGTVVV- 2437
Cdd:smart00554    1 TVFAPTDEAFqklppdLNSLLadKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGG---------SGTVTVn 71
                            90       100
                    ....*....|....*....|....*.
gi 530372113   2438 -SRIIVWDIMAFNGIIHALASPLLAP 2462
Cdd:smart00554   72 gARIVEADIAATNGVVHVIDRVLLPP 97
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1459-1495 3.60e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 51.06  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113  1459 CAHGHGGCSPHANCTKVaPGQRTCTCQDGYMGDGELC 1495
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2095-2130 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.52  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 530372113  2095 CQDGHGGCSEHANCSQVGTMVTCTCLPDYEGDGWSC 2130
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
951-987 1.15e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.82  E-value: 1.15e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113   951 CRAGNGGCHGLATCRAVGGgQRVCTCPPGFGGDGFSC 987
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1544-1581 5.59e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 5.59e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1544 CSKNNGGCSPYATCKSTgDGQRTCTCDTAHTvGDGLTC 1581
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
393-490 7.09e-06

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 47.63  E-value: 7.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   393 TAGPFTVLVPSVSSFS--------SRTMNASLAQQLCRQHIIAGQHILEDTRTQQTRRwwTLAGQEITVTFnqftkysyk 464
Cdd:pfam02469   22 SQGPFTVFAPTNEAFAklpagtlnFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLA--TLQGSKLRVNV--------- 90
                           90       100
                   ....*....|....*....|....*....
gi 530372113   465 ykDQPQQTFN---IYKANNIAANGVFHVV 490
Cdd:pfam02469   91 --TGGSVTVNgarVVQADIEATNGVIHVI 117
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1139-1241 1.30e-05

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 46.86  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1139 PAFSLFRELLQHHGLVPQIEAATA-YTIFVPTNRSLEA--QGNSSHLDADT------VRHHVVLGeALSMETLRKGGHRN 1209
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKlpAGTLNFLLKDKeqlknlLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110
                   ....*....|....*....|....*....|..
gi 530372113  1210 SLLGPAhwIVFYNHSGQPEVNHVPLEGPMLEA 1241
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEA 109
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2333-2460 1.76e-05

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 46.48  E-value: 1.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  2333 ANFSTFYGMLlgyaNATqrglDFLDFLDDELTYKTLFVPVNEGF------VDNMTLSGPD-----LELHASNATLLSANA 2401
Cdd:pfam02469    1 PGFSTFVALL----KAA----GLVDTLNGSQGPFTVFAPTNEAFaklpagTLNFLLKDKEqlknlLKYHVVPGRLTSSDL 72
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530372113  2402 SQGKLLPAHSGLSLIISDAGpdnsswapvapGTVVV--SRIIVWDIMAFNGIIHALASPLL 2460
Cdd:pfam02469   73 KNGGTLATLQGSKLRVNVTG-----------GSVTVngARVVQADIEATNGVIHVIDKVLL 122
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
2325-2460 1.82e-05

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 47.21  E-value: 1.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2325 LLDVLAATANFSTFYGMLlgyaNATqrglDFLDFLDDELTYkTLFVPVNEGF-------VDNMTLSGPDLEL------HA 2391
Cdd:COG2335    33 IVETAANNPDFSTLVAAL----KAA----GLVDTLSGEGPF-TVFAPTDAAFaalpagtLDALLKPENKATLtkiltyHV 103
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530372113 2392 SNATLLSANASQGKLLPAHSGLSLIISDAGpdnsswapvapGTVVV--SRIIVWDIMAFNGIIHALASPLL 2460
Cdd:COG2335   104 VPGKVTAADLKDGKTLTTLQGQTLTVTVSG-----------GGVTVngANVITADIEASNGVIHVIDKVLL 163
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1501-1538 3.75e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 3.75e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1501 CLIHHGGCHIHAECIPTGPQqVSCSCREGYSGDGIrTC 1538
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGS-FTCTCNDGYTGDGV-TC 36
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
380-490 7.92e-05

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 45.28  E-value: 7.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  380 VAMMDQ-GCREILTTAGPFTVLVPSVSSFS----------SRTMNASLAQQLCRQHIIAGQHILEDTRTQQTRRwwTLAG 448
Cdd:COG2335    47 VAALKAaGLVDTLSGEGPFTVFAPTDAAFAalpagtldalLKPENKATLTKILTYHVVPGKVTAADLKDGKTLT--TLQG 124
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 530372113  449 QEITVTFNqftkysykyKDQPQ-QTFNIYKANNIAANGVFHVV 490
Cdd:COG2335   125 QTLTVTVS---------GGGVTvNGANVITADIEASNGVIHVI 158
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2136-2173 2.45e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.27  E-value: 2.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  2136 CTDGHrGGCSEHANCLSTGlNTRRCECHAGYVGDGLQC 2173
Cdd:pfam12947    1 CSDNN-GGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
914-945 4.42e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.42e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 530372113   914 GGCHTDALCSYVgPGQSRCTCKLGFAGDGYQC 945
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-859 7.66e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 7.66e-04
                           10        20
                   ....*....|....*....|....*....
gi 530372113   831 CHLHARCVSQEGVARCRCLDGFEGDGFSC 859
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
865-902 3.19e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.19e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113   865 CSHpDRGGCSENAECVPgSLGTHHCTCHKGWSGDGRVC 902
Cdd:pfam12947    1 CSD-NNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1424-1453 3.59e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.59e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 530372113  1424 CDPNANCVQdSAGASTCACAAGYSGNGIFC 1453
Cdd:pfam12947    8 CHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1992-2021 4.24e-03

Laminin-type epidermal growth factor-like domain;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 37.29  E-value: 4.24e-03
                            10        20        30
                    ....*....|....*....|....*....|
gi 530372113   1992 SGQCLCRSGFAGTACELCAPGAFGPHCQAC 2021
Cdd:smart00180   17 TGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1139-1213 5.03e-03

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 40.27  E-value: 5.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1139 PAFSLFRELLQHHGLVPQIEAATAYTIFVPTNRSLEAqgnsshLDADTV----------------RHHVVLGEALSmETL 1202
Cdd:COG2335    41 PDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAA------LPAGTLdallkpenkatltkilTYHVVPGKVTA-ADL 113
                          90
                  ....*....|.
gi 530372113 1203 RKGGHRNSLLG 1213
Cdd:COG2335   114 KDGKTLTTLQG 124
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1164-1253 5.54e-03

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 38.50  E-value: 5.54e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1164 TIFVPTN---RSLEAQGNSSHLDADT--VRHHVVLGeALSMETLRKGGHRNSLLGPAHWIVFYNHSGQPEVNHVPLEGPM 1238
Cdd:smart00554    1 TVFAPTDeafQKLPPDLNSLLADKLKnlLLYHVVPG-RLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTVNGARIVEAD 79
                            90
                    ....*....|....*
gi 530372113   1239 LEAPGRSLIGLSGVL 1253
Cdd:smart00554   80 IAATNGVVHVIDRVL 94
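
Each hit above carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`. As a minimal sketch, assuming the line layout stays exactly as shown in this listing, such a line could be parsed into a small record as follows. The helper name `parse_summary` and the dictionary keys are illustrative only and are not part of any NCBI tool or API.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" tag is optional, so it is matched as an optional group.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match.

    Hypothetical helper for illustration; field names are assumptions.
    """
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Two summary lines copied from the listing above.
multi = parse_summary(
    "Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 40.27  E-value: 5.03e-03"
)
single = parse_summary(
    "Pssm-ID: 214719  Cd Length: 97  Bit Score: 38.50  E-value: 5.54e-03"
)
```

Parsing the E-value as a float makes it straightforward to sort or filter hits, for example to keep only those below a chosen threshold.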
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2208-2300 2.26e-50

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages, whereas stabilin-2 is not. In macrophages, stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leukocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 173.42  E-value: 2.26e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2208 GVFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:cd03515     1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                          90
                  ....*....|...
gi 530372113 2288 RKNLSERWDAYCF 2300
Cdd:cd03515    81 RLNLSERWDAYCY 93
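
The interval, the `Cdd:` coordinates, and the Cd Length together indicate how much of a domain model a hit covers. The following is a minimal sketch of that arithmetic, using coordinates read off this listing; the function names `span_length` and `model_coverage` are hypothetical and chosen only for illustration.

```python
def span_length(interval):
    """Number of residues in an inclusive 'start-end' interval string."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_to - model_from + 1) / cd_length


# cd03515 hit: query interval 2208-2300, model positions 1-93, Cd Length 93.
full_span = span_length("2208-2300")
full_cov = model_coverage(1, 93, 93)

# pfam12947 hit at 831-859: model positions 8-36, Cd Length 36 (partial hit).
partial_span = span_length("831-859")
partial_cov = model_coverage(8, 36, 36)
```

Under these definitions the cd03515 hit spans the whole model, while the pfam12947 hit at 831-859 covers 29 of 36 model positions. This is a span-based figure that ignores gaps inside the alignment.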
LINK smart00445
Link (Hyaluronan-binding);
2206-2301 2.74e-36

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 133.24  E-value: 2.74e-36
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   2206 RAGVFHLQATsGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSL 2285
Cdd:smart00445    1 DGGVFHVEKN-GRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGGNLPGVRQY 79
                            90
                    ....*....|....*.
gi 530372113   2286 GARKNLSeRWDAYCFR 2301
Cdd:smart00445   80 GFPDPTS-RYDAYCFN 94
Xlink pfam00193
Extracellular link domain;
2208-2300 2.20e-35

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 130.39  E-value: 2.20e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  2208 GVFHLQAtSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:pfam00193    1 GVFHLES-PGRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGGNMPGVRQYGF 79
                           90
                   ....*....|...
gi 530372113  2288 RKNLSERWDAYCF 2300
Cdd:pfam00193   80 RDPLSERYDAYCY 92
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
496-643 1.63e-28

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 113.85  E-value: 1.63e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  496 QAPSGTPGDPKRTIGQILASTEAFSRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELV 574
Cdd:COG2335    20 AAAEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKIL 99
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530372113  575 RYHIYNhGQLTVEKLISKGRILTMANQVLAVNISeEGRILLGpeGVPLQRVDVMAANGVIHMLDGILLP 643
Cdd:COG2335   100 TYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1736-1866 3.60e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.60e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1736 GYKIFSGLLKVAGLLPLLREaSHRPFTMLWPTDAAFRALPPDRQAWLYheDHRDKLAAILRGHMIRNVeALASDLPNLGP 1815
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNG-SQGPFTVFAPTNEAFAKLPAGTLNFLL--KDKEQLKNLLKYHVVPGR-LTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 530372113  1816 LRTMHGTPISFSCSRtraGELMVgeDDARIVQRHLPFEGGLAYGIDQLLEP 1866
Cdd:pfam02469   78 LATLQGSKLRVNVTG---GSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
Link_Domain cd01102
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2208-2300 2.17e-26

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates in which other CSPGs substitute for aggrecan might contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain comprised of a link module flanked with N- and C-terminal extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) which contains link modules 3 and 4 which lack HA-binding activity. HAPLNs contain two contiguous link modules.


Pssm-ID: 238534  Cd Length: 92  Bit Score: 104.81  E-value: 2.17e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2208 GVFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGA 2287
Cdd:cd01102     1 VVFHLESQNGRYKLTFAEAALACKARGAHLATPGQLEAAWQDGFDVCTAGWLADGSVRYPIVTSRPNCGGRNPGVRSYGN 80
                          90
                  ....*....|...
gi 530372113 2288 RKNlSERWDAYCF 2300
Cdd:cd01102    81 PAP-SGRYDAYCF 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1606-1710 4.94e-26

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 105.03  E-value: 4.94e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1606 KELKG-DGPFTIFVPHADLMSNLSQDELARIRAHRQL---VFRYHVVGCRrLRSEDLLEQGYATALSGHPLRFSEREGSI 1681
Cdd:pfam02469   17 DTLNGsQGPFTVFAPTNEAFAKLPAGTLNFLLKDKEQlknLLKYHVVPGR-LTSSDLKNGGTLATLQGSKLRVNVTGGSV 95
                           90       100
                   ....*....|....*....|....*....
gi 530372113  1682 YLNDfARVVSSDHEAVNGILHFIDRVLLP 1710
Cdd:pfam02469   96 TVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
999-1120 7.82e-24

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.86  E-value: 7.82e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   999 AHFSIFYQWLKSAGI--TLP-ADRRVTALVPSEAAVRQLSPEDRAFWLQ-PRTLPNLVRAHFLQGALFEEELARlgGQEV 1074
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLKN--GGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 530372113  1075 ATLNPTTrWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLP 1120
Cdd:pfam02469   79 ATLQGSK-LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
519-643 1.84e-20

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 89.23  E-value: 1.84e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   519 FSRFETILENCGLPSILDGP-GPFTVFAPSNEAVDSLRDGRLIYLFtAGLSKLQELVRYHIYNhGQLTVEKLISKGRILT 597
Cdd:pfam02469    3 FSTFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLL-KDKEQLKNLLKYHVVP-GRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 530372113   598 MANQVLAVNIsEEGRILLgpEGVPLQRVDVMAANGVIHMLDGILLP 643
Cdd:pfam02469   81 LQGSKLRVNV-TGGSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1615-1711 5.28e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 87.03  E-value: 5.28e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1615 TIFVPHADLMSNLSQDELARIRAHRQLVFRYHVVGcRRLRSEDLLEQGYATALSGHPLRFSERE--GSIYLNDfARVVSS 1692
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgsGTVTVNG-ARIVEA 78
                            90
                    ....*....|....*....
gi 530372113   1693 DHEAVNGILHFIDRVLLPP 1711
Cdd:smart00554   79 DIAATNGVVHVIDRVLLPP 97
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
542-644 6.29e-20

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 86.65  E-value: 6.29e-20
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113    542 TVFAPSNEAVDSLRDGRliYLFTAglSKLQELVRYHIYNhGQLTVEKLISKGRILTMANQVLAVNISEeGRILLGPEGVP 621
Cdd:smart00554    1 TVFAPTDEAFQKLPPDL--NSLLA--DKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSG-GSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 530372113    622 LQRVDVMAANGVIHMLDGILLPP 644
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
Link_domain_HAPLN_module_1 cd03518
Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins ...
2209-2300 8.78e-20

Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family which includes cartilage link protein. The link domain is a HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, a chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239595  Cd Length: 95  Bit Score: 86.33  E-value: 8.78e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2209 VFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRV--GIVSLG 2286
Cdd:cd03518     2 VFPYQPRLGRYNLNFHEAQQACEEQDATLASFEQLYQAWTEGLDWCNAGWLSDGTVQYPITKPREPCGGKRTvpGLRSYG 81
                          90
                  ....*....|....
gi 530372113 2287 ARKNLSERWDAYCF 2300
Cdd:cd03518    82 ERDKMLSRYDAFCF 95
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1762-1867 1.65e-19

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 85.49  E-value: 1.65e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1762 TMLWPTDAAFRALPPDRQAWLyhedhRDKLAAILRGHMIrNVEALASDLPNLGPLRTMHGTPISFSCSRTRAgelMVGED 1841
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVV-PGRLSSADLLNGGTLPTLAGSKLRITRSGGSG---TVTVN 71
                            90       100
                    ....*....|....*....|....*.
gi 530372113   1842 DARIVQRHLPFEGGLAYGIDQLLEPP 1867
Cdd:smart00554   72 GARIVEADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1608-1710 3.16e-19

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 86.88  E-value: 3.16e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1608 LKGDGPFTIFVPH------------ADLMSNLSQDELARIrahrqlvFRYHVVGcRRLRSEDLLEQGYATALSGHPLRFS 1675
Cdd:COG2335    59 LSGEGPFTVFAPTdaafaalpagtlDALLKPENKATLTKI-------LTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVT 130
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 530372113 1676 EREGSIYLNDfARVVSSDHEAVNGILHFIDRVLLP 1710
Cdd:COG2335   131 VSGGGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Link_domain_CSPGs_modules_2_4 cd03520
Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules ...
2209-2300 2.45e-18

Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan, and in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239597  Cd Length: 96  Bit Score: 81.98  E-value: 2.45e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2209 VFHlqaTSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLGAR 2288
Cdd:cd03520     2 VFY---ATAPEKFTFQEARAECRSLGAVLATTGQLYAAWRQGLDQCDPGWLADGSVRYPISTPRPQCGGGLPGVRTLYRF 78
                          90
                  ....*....|....*...
gi 530372113 2289 KNL------SERWDAYCF 2300
Cdd:cd03520    79 PNQtgfpdpHSRFDAYCF 96
Link_domain_CD44_like cd03516
This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates ...
2207-2304 9.49e-17

This domain is a hyaluronan (HA)-binding domain. It is found in the CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain composed of a single link module flanked by N- and C-terminal extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS-containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marker for the detection and characterization of lymphatic vessels in tumors.


Pssm-ID: 239593  Cd Length: 144  Bit Score: 79.43  E-value: 9.49e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2207 AGVFHLQaTSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLG 2286
Cdd:cd03516     6 MGVFLVE-KNGRYSLNFTEAKEACRALGLTLASKAQVETALKFGFETCRYGWVEDGFVVIPRIDPNPLCGKNGTGVYILN 84
                          90
                  ....*....|....*...
gi 530372113 2287 ArkNLSERWDAYCFRVQD 2304
Cdd:cd03516    85 S--NLSSRYDAYCYNSSD 100
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
990-1120 2.57e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 78.79  E-value: 2.57e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  990 DIFRELEANAHFSIFYQWLKSAGI--TLPADRRVTALVPSEAAVRQLSPEDRAFWLQP---RTLPNLVRAHFLQGALFEE 1064
Cdd:COG2335    32 NIVETAANNPDFSTLVAALKAAGLvdTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkATLTKILTYHVVPGKVTAA 111
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 530372113 1065 ELARlgGQEVATLNPTTrWEIRNISGRVWVQNASVDVADLLATNGVLHILSQVLLP 1120
Cdd:COG2335   112 DLKD--GKTLTTLQGQT-LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1716-1866 4.04e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 78.02  E-value: 4.04e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1716 WEPDDAPIPRRNVTAAAQGFG-YKIFSGLLKVAGLLPLLREAshRPFTMLWPTDAAFRALPPD-RQAWLYHEDhRDKLAA 1793
Cdd:COG2335    21 AAEGAAMAPTKNIVETAANNPdFSTLVAALKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGtLDALLKPEN-KATLTK 97
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530372113 1794 ILRGHMIRNvEALASDLPNLGPLRTMHGTPISFSCSrtrAGELMVGedDARIVQRHLPFEGGLAYGIDQLLEP 1866
Cdd:COG2335    98 ILTYHVVPG-KVTAADLKDGKTLTTLQGQTLTVTVS---GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1022-1121 2.66e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 73.55  E-value: 2.66e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1022 TALVPSEAAVRQLSPEDRAFWLQprTLPNLVRAHFLQGALFEEELarLGGQEVATLNPTT-RWEIRNISGRVWVQNASVD 1100
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLAD--KLKNLLLYHVVPGRLSSADL--LNGGTLPTLAGSKlRITRSGGSGTVTVNGARIV 76
                            90       100
                    ....*....|....*....|.
gi 530372113   1101 VADLLATNGVLHILSQVLLPP 1121
Cdd:smart00554   77 EADIAATNGVVHVIDRVLLPP 97
Link_domain_CSPGs_modules_1_3 cd03517
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third ...
2209-2300 2.89e-14

Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2), which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates in which other CSPGs substitute for aggrecan may contribute to the structural integrity of many different tissues. Genes of the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) family are physically linked adjacent to CSPG genes.


Pssm-ID: 239594  Cd Length: 95  Bit Score: 70.51  E-value: 2.89e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2209 VFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLSAAQQLGFHLCLMGWLANGSTAHPVVFPVADC---GNGRVGIVSL 2285
Cdd:cd03517     2 VFHYRDATARYALTFPRAQRACLDISAQIATPEQLLAAYEDGFEQCDAGWLADQTVRYPIQTPREGCygdMDGFPGVRNY 81
                          90
                  ....*....|....*
gi 530372113 2286 GARkNLSERWDAYCF 2300
Cdd:cd03517    82 GVR-DPDELYDVYCY 95
Link_domain_HAPLN_module_2 cd03519
Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins ...
2208-2300 1.90e-13

Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family, which includes cartilage link protein. The link domain is an HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates in which other CSPGs substitute for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239596  Cd Length: 91  Bit Score: 68.22  E-value: 1.90e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2208 GVFHLQAtsgPYGLNFSEAEAACEAQGAVLASFPQLSAAQQL-GFHLCLMGWLANGSTAHPVVFPVADCGNGRVGIVSLG 2286
Cdd:cd03519     1 GVFYLLH---PGKLTFSEAVAACQRDGAQIAKVGQLFAAWKFhGLDRCDAGWLADGSVRYPISRPRPRCGPLEPGVRSFG 77
                          90
                  ....*....|....
gi 530372113 2287 ARKNLSERWDAYCF 2300
Cdd:cd03519    78 FPDKKHKLYGVYCY 91
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2367-2462 5.24e-09

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 55.45  E-value: 5.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   2367 TLFVPVNEGF------VDNMT--LSGPDLELHASNATLLSANASQGKLLPAHSGLSLIISDAGPdnsswapvaPGTVVV- 2437
Cdd:smart00554    1 TVFAPTDEAFqklppdLNSLLadKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGG---------SGTVTVn 71
                            90       100
                    ....*....|....*....|....*.
gi 530372113   2438 -SRIIVWDIMAFNGIIHALASPLLAP 2462
Cdd:smart00554   72 gARIVEADIAATNGVVHVIDRVLLPP 97
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1459-1495 3.60e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 51.06  E-value: 3.60e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113  1459 CAHGHGGCSPHANCTKVaPGQRTCTCQDGYMGDGELC 1495
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2095-2130 1.27e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.52  E-value: 1.27e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 530372113  2095 CQDGHGGCSEHANCSQVGTMVTCTCLPDYEGDGWSC 2130
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
951-987 1.15e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 46.82  E-value: 1.15e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 530372113   951 CRAGNGGCHGLATCRAVGGgQRVCTCPPGFGGDGFSC 987
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1544-1581 5.59e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 5.59e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1544 CSKNNGGCSPYATCKSTgDGQRTCTCDTAHTvGDGLTC 1581
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
393-490 7.09e-06

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 47.63  E-value: 7.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   393 TAGPFTVLVPSVSSFS--------SRTMNASLAQQLCRQHIIAGQHILEDTRTQQTRRwwTLAGQEITVTFnqftkysyk 464
Cdd:pfam02469   22 SQGPFTVFAPTNEAFAklpagtlnFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLA--TLQGSKLRVNV--------- 90
                           90       100
                   ....*....|....*....|....*....
gi 530372113   465 ykDQPQQTFN---IYKANNIAANGVFHVV 490
Cdd:pfam02469   91 --TGGSVTVNgarVVQADIEATNGVIHVI 117
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1139-1241 1.30e-05

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 46.86  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  1139 PAFSLFRELLQHHGLVPQIEAATA-YTIFVPTNRSLEA--QGNSSHLDADT------VRHHVVLGeALSMETLRKGGHRN 1209
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKlpAGTLNFLLKDKeqlknlLKYHVVPG-RLTSSDLKNGGTLA 79
                           90       100       110
                   ....*....|....*....|....*....|..
gi 530372113  1210 SLLGPAhwIVFYNHSGQPEVNHVPLEGPMLEA 1241
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEA 109
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2333-2460 1.76e-05

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 46.48  E-value: 1.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  2333 ANFSTFYGMLlgyaNATqrglDFLDFLDDELTYKTLFVPVNEGF------VDNMTLSGPD-----LELHASNATLLSANA 2401
Cdd:pfam02469    1 PGFSTFVALL----KAA----GLVDTLNGSQGPFTVFAPTNEAFaklpagTLNFLLKDKEqlknlLKYHVVPGRLTSSDL 72
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530372113  2402 SQGKLLPAHSGLSLIISDAGpdnsswapvapGTVVV--SRIIVWDIMAFNGIIHALASPLL 2460
Cdd:pfam02469   73 KNGGTLATLQGSKLRVNVTG-----------GSVTVngARVVQADIEATNGVIHVIDKVLL 122
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
2325-2460 1.82e-05

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 47.21  E-value: 1.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2325 LLDVLAATANFSTFYGMLlgyaNATqrglDFLDFLDDELTYkTLFVPVNEGF-------VDNMTLSGPDLEL------HA 2391
Cdd:COG2335    33 IVETAANNPDFSTLVAAL----KAA----GLVDTLSGEGPF-TVFAPTDAAFaalpagtLDALLKPENKATLtkiltyHV 103
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530372113 2392 SNATLLSANASQGKLLPAHSGLSLIISDAGpdnsswapvapGTVVV--SRIIVWDIMAFNGIIHALASPLL 2460
Cdd:COG2335   104 VPGKVTAADLKDGKTLTTLQGQTLTVTVSG-----------GGVTVngANVITADIEASNGVIHVIDKVLL 163
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1501-1538 3.75e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 3.75e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  1501 CLIHHGGCHIHAECIPTGPQqVSCSCREGYSGDGIrTC 1538
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGS-FTCTCNDGYTGDGV-TC 36
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
380-490 7.92e-05

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 45.28  E-value: 7.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113  380 VAMMDQ-GCREILTTAGPFTVLVPSVSSFS----------SRTMNASLAQQLCRQHIIAGQHILEDTRTQQTRRwwTLAG 448
Cdd:COG2335    47 VAALKAaGLVDTLSGEGPFTVFAPTDAAFAalpagtldalLKPENKATLTKILTYHVVPGKVTAADLKDGKTLT--TLQG 124
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 530372113  449 QEITVTFNqftkysykyKDQPQ-QTFNIYKANNIAANGVFHVV 490
Cdd:COG2335   125 QTLTVTVS---------GGGVTvNGANVITADIEASNGVIHVI 158
Link_domain_KIAA0527_like cd03521
Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. Sequence-wise, ...
2209-2299 1.48e-04

Link_domain_KIAA0527_like; this domain is found in the human protein KIAA0527. At the sequence level, it is highly similar to the link domain. The link domain is a hyaluronan (HA)-binding domain. KIAA0527 contains a single link module. The KIAA0527 gene was originally cloned from human brain tissue.


Pssm-ID: 239598  Cd Length: 95  Bit Score: 42.99  E-value: 1.48e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 2209 VFHLQATSGPYGLNFSEAEAACEAQGAVLASFPQLS-AAQQLGFHLCLMGWLANGSTAHPVVFP-VADCGNGRVGIVSLG 2286
Cdd:cd03521     2 LFVLELENGSQGLGLRAARQSCASLGARLASAAELRrAVVECFFSACARGWLADGTVGTTVCNPvVAEALKAVDVKVEIE 81
                          90
                  ....*....|...
gi 530372113 2287 ARKNLSERWDAYC 2299
Cdd:cd03521    82 TNPIPFAHYNALC 94
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2136-2173 2.45e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.27  E-value: 2.45e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113  2136 CTDGHrGGCSEHANCLSTGlNTRRCECHAGYVGDGLQC 2173
Cdd:pfam12947    1 CSDNN-GGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
914-945 4.42e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.42e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 530372113   914 GGCHTDALCSYVgPGQSRCTCKLGFAGDGYQC 945
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-859 7.66e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 7.66e-04
                           10        20
                   ....*....|....*....|....*....
gi 530372113   831 CHLHARCVSQEGVARCRCLDGFEGDGFSC 859
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
865-902 3.19e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.19e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 530372113   865 CSHpDRGGCSENAECVPgSLGTHHCTCHKGWSGDGRVC 902
Cdd:pfam12947    1 CSD-NNGGCHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1424-1453 3.59e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.19  E-value: 3.59e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 530372113  1424 CDPNANCVQdSAGASTCACAAGYSGNGIFC 1453
Cdd:pfam12947    8 CHPNATCTN-TGGSFTCTCNDGYTGDGVTC 36
EGF_Lam smart00180
Laminin-type epidermal growth factor-like domain;
1992-2021 4.24e-03

Laminin-type epidermal growth factor-like domain;


Pssm-ID: 214543  Cd Length: 46  Bit Score: 37.29  E-value: 4.24e-03
                            10        20        30
                    ....*....|....*....|....*....|
gi 530372113   1992 SGQCLCRSGFAGTACELCAPGAFGPHCQAC 2021
Cdd:smart00180   17 TGQCECKPNVTGRRCDRCAPGYYGDGPPGC 46
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1139-1213 5.03e-03

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 40.27  E-value: 5.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113 1139 PAFSLFRELLQHHGLVPQIEAATAYTIFVPTNRSLEAqgnsshLDADTV----------------RHHVVLGEALSmETL 1202
Cdd:COG2335    41 PDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAA------LPAGTLdallkpenkatltkilTYHVVPGKVTA-ADL 113
                          90
                  ....*....|.
gi 530372113 1203 RKGGHRNSLLG 1213
Cdd:COG2335   114 KDGKTLTTLQG 124
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1164-1253 5.54e-03

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 38.50  E-value: 5.54e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530372113   1164 TIFVPTN---RSLEAQGNSSHLDADT--VRHHVVLGeALSMETLRKGGHRNSLLGPAHWIVFYNHSGQPEVNHVPLEGPM 1238
Cdd:smart00554    1 TVFAPTDeafQKLPPDLNSLLADKLKnlLLYHVVPG-RLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTVNGARIVEAD 79
                            90
                    ....*....|....*
gi 530372113   1239 LEAPGRSLIGLSGVL 1253
Cdd:smart00554   80 IAATNGVVHVIDRVL 94
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
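
Each hit above carries a statistics line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and the search parameters give an E-value threshold of 0.01. As a rough sketch only (this is not an NCBI tool), such a line could be parsed and checked against that threshold as follows. The names `STATS_RE`, `parse_stats` and `passes_threshold` are hypothetical, the pattern assumes the exact text layout seen on this page, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for the per-hit statistics line as it is laid out on this page.
# The "[Multi-domain]" flag is optional and only present on some hits.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict, or None if no match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    # Statistics line copied from the last FAS1 (COG2335) hit in the list.
    example = "Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 40.27  E-value: 5.03e-03"
    hit = parse_stats(example)
    print(hit, passes_threshold(hit))
```

Every hit listed on this page already satisfies the 0.01 threshold, so the filter matters mainly when re-screening the same records against a stricter cutoff.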
