NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|661903023|ref|NP_001284368|]
View 

hepatocyte growth factor activator isoform 1 preproprotein [Homo sapiens]

Protein Classification

KR and Tryp_SPc domain-containing protein( domain architecture ID 11249608)

protein containing domains FN2, FN1, KR, and Tryp_SPc

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
415-650 7.50e-92

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


:

Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 284.17  E-value: 7.50e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 415 IIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFSHSPPRdSVSVVLGQHFFNRTTDVTQTFGIEKYIPY 492
Cdd:cd00190    1 IVGGSEAKIGSFPWQVSLQYTGGrhFCGGSLISPRWVLTAAHCVYSSAPS-NYTVRLGSHDLSSNEGGGQVIKVKKVIVH 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 493 TLYSVfNPSDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENVSgYSSSLREALVPLVADH 572
Cdd:cd00190   80 PNYNP-STYDNDIALLKLKRP----VTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGP-LPDVLQEVNVPIVSNA 153
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 661903023 573 KCSSPEVYGADISPNMLCAGYFDCKSDACQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWIND 650
Cdd:cd00190  154 ECKRAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
Kringle pfam00051
Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, ...
286-367 3.89e-26

Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.


:

Pssm-ID: 395005  Cd Length: 79  Bit Score: 102.00  E-value: 3.89e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  286 CFLGNGTGYRGVASTSASGLSCLAWNSDLLYQ-ELHVDSVGAAALLGLgphAYCRNPDNDERPWCYvVKDSALSWEYCRL 364
Cdd:pfam00051   1 CYHGNGESYRGTVSTTESGRPCQAWDSQTPHRhSKYTPENFPAKGLGE---NYCRNPDGDERPWCY-TTDPRVRWEYCDI 76

                  ...
gi 661903023  365 EAC 367
Cdd:pfam00051  77 PRC 79
FN2 smart00059
Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, ...
102-148 4.55e-19

Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.


:

Pssm-ID: 128373  Cd Length: 49  Bit Score: 80.81  E-value: 4.55e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 661903023   102 TEDGRPCRFPFRYGGRMLHACTSEGSAHRK-WCATTHNYDRDRAWGYC 148
Cdd:smart00059   2 NSDGEPCVFPFIYNGKKYHDCTSEGRSDGMlWCSTTPNYDRDGKWGFC 49
FN1 cd00061
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. ...
200-240 1.35e-09

Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.


:

Pssm-ID: 238018  Cd Length: 43  Bit Score: 53.88  E-value: 1.35e-09
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|...
gi 661903023 200 EKCFDETRYEYLEGGDRWARVRQGHVEQCECFG--GRTWCEGT 240
Cdd:cd00061    1 EKCFDPQTGVFYRVGETWERPSEGHVLQCTCLGnrGEARCDPV 43
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
162-197 1.77e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 1.77e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 661903023 162 DPCASG-PCLNGGSCSNTqdPQSYHCSCPRAFTGKDC 197
Cdd:cd00054    3 DECASGnPCQNGGTCVNT--VGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
245-278 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


:

Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.69  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 661903023 245 CLS-SPCLNGGTCHLIVatGTTVCACPPGFAGRLC 278
Cdd:cd00054    5 CASgNPCQNGGTCVNTV--GSYRCSCPPGYTGRNC 37
PRK14954 super family cl33039
DNA polymerase III subunits gamma and tau; Provisional
32-100 5.31e-03

DNA polymerase III subunits gamma and tau; Provisional


The actual alignment was detected with superfamily member PRK14954:

Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 39.93  E-value: 5.31e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 661903023  32 GFQPQPGGN---RTESPEPNATATPAIPTILVTSVTSETPATSAPEAEGPQSGGLPPPPRAVPSSSSPQAQA 100
Cdd:PRK14954 379 GVAPSPAGSpdvKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLPPSPQASA 450
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
415-650 7.50e-92

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 284.17  E-value: 7.50e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 415 IIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFSHSPPRdSVSVVLGQHFFNRTTDVTQTFGIEKYIPY 492
Cdd:cd00190    1 IVGGSEAKIGSFPWQVSLQYTGGrhFCGGSLISPRWVLTAAHCVYSSAPS-NYTVRLGSHDLSSNEGGGQVIKVKKVIVH 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 493 TLYSVfNPSDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENVSgYSSSLREALVPLVADH 572
Cdd:cd00190   80 PNYNP-STYDNDIALLKLKRP----VTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGP-LPDVLQEVNVPIVSNA 153
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 661903023 573 KCSSPEVYGADISPNMLCAGYFDCKSDACQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWIND 650
Cdd:cd00190  154 ECKRAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
414-648 1.90e-87

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473 [Multi-domain]  Cd Length: 229  Bit Score: 272.63  E-value: 1.90e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   414 RIIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFsHSPPRDSVSVVLGQHFFNRTTDvTQTFGIEKYIP 491
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYGGGrhFCGGSLISPRWVLTAAHCV-RGSDPSNIRVRLGSHDLSSGEE-GQVIKVSKVII 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   492 YTLYSVfNPSDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENVSGYSSSLREALVPLVAD 571
Cdd:smart00020  79 HPNYNP-STYDNDIALLKLKEP----VTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSN 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 661903023   572 HKCSSPEVYGADISPNMLCAGYFDCKSDACQGDSGGPLACeKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWI 648
Cdd:smart00020 154 ATCRRAYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC-NDGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
415-648 6.37e-73

Trypsin;


Pssm-ID: 395042 [Multi-domain]  Cd Length: 219  Bit Score: 234.26  E-value: 6.37e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  415 IIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFSHsppRDSVSVVLGQHFFNRTTDVTQTFGIEKYIPY 492
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGkhFCGGSLISENWVLTAAHCVSG---ASDVKVVLGAHNIVLREGGEQKFDVEKIIVH 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  493 TLYSVFNPsDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENvsGYSSSLREALVPLVADH 572
Cdd:pfam00089  78 PNYNPDTL-DNDIALLKLESP----VTLGDTVRPICLPDASSDLPVGTTCTVSGWGNTKTL--GPSDTLQEVTVPVVSRE 150
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 661903023  573 KCSSpeVYGADISPNMLCAGYfdCKSDACQGDSGGPLACEKNgvaYLYGIISWGDGCGRLHKPGVYTRVANYVDWI 648
Cdd:pfam00089 151 TCRS--AYGGTVTDTMICAGA--GGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
Kringle pfam00051
Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, ...
286-367 3.89e-26

Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.


Pssm-ID: 395005  Cd Length: 79  Bit Score: 102.00  E-value: 3.89e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  286 CFLGNGTGYRGVASTSASGLSCLAWNSDLLYQ-ELHVDSVGAAALLGLgphAYCRNPDNDERPWCYvVKDSALSWEYCRL 364
Cdd:pfam00051   1 CYHGNGESYRGTVSTTESGRPCQAWDSQTPHRhSKYTPENFPAKGLGE---NYCRNPDGDERPWCY-TTDPRVRWEYCDI 76

                  ...
gi 661903023  365 EAC 367
Cdd:pfam00051  77 PRC 79
KR cd00108
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as ...
283-368 9.33e-24

Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.


Pssm-ID: 238056  Cd Length: 83  Bit Score: 95.14  E-value: 9.33e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 283 DERCFLGNGTGYRGVASTSASGLSCLAWNSDLLYQelHVDSVGAAALLGLGpHAYCRNPDND-ERPWCYvVKDSALSWEY 361
Cdd:cd00108    1 TRDCYWGNGESYRGTVSTTKSGKPCQRWNSQLPHQ--HKFNPERFPEGLLE-ENYCRNPDGDpEGPWCY-TTDPNVRWEY 76

                 ....*..
gi 661903023 362 CRLEACD 368
Cdd:cd00108   77 CDIPRCE 83
KR smart00130
Kringle domain; Named after a Danish pastry. Found in several serine proteases and in ROR-like ...
284-368 6.55e-23

Kringle domain; Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.


Pssm-ID: 214527  Cd Length: 83  Bit Score: 92.84  E-value: 6.55e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   284 ERCFLGNGTGYRGVASTSASGLSCLAWNS--DLLYQELHVDSVGAaallgLGPHAYCRNPDND-ERPWCYvVKDSALSWE 360
Cdd:smart00130   1 RECYAGNGESYRGTVSVTKSGKPCQRWDSqtPHLHRFTPESFPDL-----GLEENYCRNPDGDsEGPWCY-TTDPNVRWE 74

                   ....*...
gi 661903023   361 YCRLEACD 368
Cdd:smart00130  75 YCDIPQCE 82
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
413-650 5.71e-22

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 227927 [Multi-domain]  Cd Length: 413  Bit Score: 98.80  E-value: 5.71e-22
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 413 PRIIGGSSSLPGSHPWLAA--IYIGD----SFCAGSLVHTCWVVSAAHCFSHSPP--RDSVSVVLG---QHFFNRTtDVT 481
Cdd:COG5640   31 SRIIGGSNANAGEYPSLVAlvDRISDyvsgTFCGGSKLGGRYVLTAAHCADASSPisSDVNRVVVDlndSSQAERG-HVR 109
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 482 QTFGIEKYIPYTLysvfnpsDHDLVLIRLKKKGDRCATRSQFVQPiclPEPG-STFPAGHKCQIAGWGHL----DENVSG 556
Cdd:COG5640  110 TIYVHEFYSPGNL-------GNDIAVLELARAASLPRVKITSFDA---SDTFlNSVTTVSPMTNGTFGVTtpsdVPRSSP 179
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 557 YSSSLREALVPLVADHKCS--SPEVYGADISPNM--LCAGYFdcKSDACQGDSGGPLACE-KNGVAYLyGIISWGDG-CG 630
Cdd:COG5640  180 KGTILHEVAVLFVPLSTCAqyKGCANASDGATGLtgFCAGRP--PKDACQGDSGGPIFHKgEEGRVQR-GVVSWGDGgCG 256
                        250       260
                 ....*....|....*....|
gi 661903023 631 RLHKPGVYTRVANYVDWIND 650
Cdd:COG5640  257 GTLIPGVYTNVSNYQDWIAA 276
FN2 smart00059
Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, ...
102-148 4.55e-19

Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.


Pssm-ID: 128373  Cd Length: 49  Bit Score: 80.81  E-value: 4.55e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 661903023   102 TEDGRPCRFPFRYGGRMLHACTSEGSAHRK-WCATTHNYDRDRAWGYC 148
Cdd:smart00059   2 NSDGEPCVFPFIYNGKKYHDCTSEGRSDGMlWCSTTPNYDRDGKWGFC 49
FN2 cd00062
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to ...
102-148 1.89e-18

Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.


Pssm-ID: 238019  Cd Length: 48  Bit Score: 78.89  E-value: 1.89e-18
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 661903023 102 TEDGRPCRFPFRYGGRMLHACTSEGSA-HRKWCATTHNYDRDRAWGYC 148
Cdd:cd00062    1 NSDGAPCVFPFIYRGKWYHDCTTEGSNdGKLWCSTTPNYDRDGKWGYC 48
fn2 pfam00040
Fibronectin type II domain;
108-148 3.73e-17

Fibronectin type II domain;


Pssm-ID: 425438  Cd Length: 42  Bit Score: 75.37  E-value: 3.73e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 661903023  108 CRFPFRYGGRMLHACTSEGSAH-RKWCATTHNYDRDRAWGYC 148
Cdd:pfam00040   1 CVFPFKYNGKWYHSCTTDGRSDgRLWCSTTANYDGDGKWGYC 42
FN1 cd00061
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. ...
200-240 1.35e-09

Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.


Pssm-ID: 238018  Cd Length: 43  Bit Score: 53.88  E-value: 1.35e-09
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|...
gi 661903023 200 EKCFDETRYEYLEGGDRWARVRQGHVEQCECFG--GRTWCEGT 240
Cdd:cd00061    1 EKCFDPQTGVFYRVGETWERPSEGHVLQCTCLGnrGEARCDPV 43
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
162-197 1.77e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 1.77e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 661903023 162 DPCASG-PCLNGGSCSNTqdPQSYHCSCPRAFTGKDC 197
Cdd:cd00054    3 DECASGnPCQNGGTCVNT--VGSYRCSCPPGYTGRNC 37
FN1 smart00058
Fibronectin type 1 domain; One of three types of internal repeat within the plasma protein, ...
202-242 5.17e-05

Fibronectin type 1 domain; One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.


Pssm-ID: 214494  Cd Length: 45  Bit Score: 40.80  E-value: 5.17e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 661903023   202 CFDE-TRYEYLEGgDRWARVRQ-GHVEQCECFG---GRTWCEGTRH 242
Cdd:smart00058   1 CFDEeTGTTYRVG-DTWERPYEgGHVLQCTCLGggrGEWKCDPVPV 45
EGF_CA smart00179
Calcium-binding EGF-like domain;
162-197 6.91e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.61  E-value: 6.91e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 661903023   162 DPCAS-GPCLNGGSCSNTQDpqSYHCSCPRAFT-GKDC 197
Cdd:smart00179   3 DECASgNPCQNGGTCVNTVG--SYRCECPPGYTdGRNC 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
164-196 9.57e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.98  E-value: 9.57e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 661903023  164 CASGPCLNGGSCSNTqdPQSYHCSCPRAFTGKD 196
Cdd:pfam00008   1 CAPNPCSNGGTCVDT--PGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
245-278 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.69  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 661903023 245 CLS-SPCLNGGTCHLIVatGTTVCACPPGFAGRLC 278
Cdd:cd00054    5 CASgNPCQNGGTCVNTV--GSYRCSCPPGYTGRNC 37
fn1 pfam00039
Fibronectin type I domain;
202-232 3.61e-03

Fibronectin type I domain;


Pssm-ID: 425437  Cd Length: 40  Bit Score: 35.36  E-value: 3.61e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 661903023  202 CFDETRYEYLEGGDRWARVRQ-GHVEQCECFG 232
Cdd:pfam00039   1 CYDPQTGRFYQVGETWERPHQrGHVLRCTCLG 32
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
32-100 5.31e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 39.93  E-value: 5.31e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 661903023  32 GFQPQPGGN---RTESPEPNATATPAIPTILVTSVTSETPATSAPEAEGPQSGGLPPPPRAVPSSSSPQAQA 100
Cdd:PRK14954 379 GVAPSPAGSpdvKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLPPSPQASA 450
PHA03378 PHA03378
EBNA-3B; Provisional
40-195 6.16e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 39.67  E-value: 6.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  40 NRTESPEPNATATPAIPTILVTSVTSETPATSAPEAEGPQSGGLPPP-----PRAVPSSSSPQAQALTEDGRPCRFPfry 114
Cdd:PHA03378 665 TWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPgraqrPAAATGRARPPAAAPGRARPPAAAP--- 741
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 115 gGRMLHACTSEGSAHRKWCATTHNYDRDRAWGYCVEATPPPGGPAALDPCASGPclnggscSNTQDPQSYHCS---CPRA 191
Cdd:PHA03378 742 -GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAP-------TPQPPPQAGPTSmqlMPRA 813

                 ....
gi 661903023 192 FTGK 195
Cdd:PHA03378 814 APGQ 817
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
245-276 9.10e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 34.28  E-value: 9.10e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 661903023  245 CLSSPCLNGGTCHLIVatGTTVCACPPGFAGR 276
Cdd:pfam00008   1 CAPNPCSNGGTCVDTP--GGYTCICPEGYTGK 30
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
415-650 7.50e-92

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 284.17  E-value: 7.50e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 415 IIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFSHSPPRdSVSVVLGQHFFNRTTDVTQTFGIEKYIPY 492
Cdd:cd00190    1 IVGGSEAKIGSFPWQVSLQYTGGrhFCGGSLISPRWVLTAAHCVYSSAPS-NYTVRLGSHDLSSNEGGGQVIKVKKVIVH 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 493 TLYSVfNPSDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENVSgYSSSLREALVPLVADH 572
Cdd:cd00190   80 PNYNP-STYDNDIALLKLKRP----VTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGP-LPDVLQEVNVPIVSNA 153
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 661903023 573 KCSSPEVYGADISPNMLCAGYFDCKSDACQGDSGGPLACEKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWIND 650
Cdd:cd00190  154 ECKRAYSYGGTITDNMLCAGGLEGGKDACQGDSGGPLVCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
414-648 1.90e-87

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473 [Multi-domain]  Cd Length: 229  Bit Score: 272.63  E-value: 1.90e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   414 RIIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFsHSPPRDSVSVVLGQHFFNRTTDvTQTFGIEKYIP 491
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYGGGrhFCGGSLISPRWVLTAAHCV-RGSDPSNIRVRLGSHDLSSGEE-GQVIKVSKVII 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   492 YTLYSVfNPSDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENVSGYSSSLREALVPLVAD 571
Cdd:smart00020  79 HPNYNP-STYDNDIALLKLKEP----VTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSN 153
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 661903023   572 HKCSSPEVYGADISPNMLCAGYFDCKSDACQGDSGGPLACeKNGVAYLYGIISWGDGCGRLHKPGVYTRVANYVDWI 648
Cdd:smart00020 154 ATCRRAYSGGGAITDNMLCAGGLEGGKDACQGDSGGPLVC-NDGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
415-648 6.37e-73

Trypsin;


Pssm-ID: 395042 [Multi-domain]  Cd Length: 219  Bit Score: 234.26  E-value: 6.37e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  415 IIGGSSSLPGSHPWLAAIYIGDS--FCAGSLVHTCWVVSAAHCFSHsppRDSVSVVLGQHFFNRTTDVTQTFGIEKYIPY 492
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGkhFCGGSLISENWVLTAAHCVSG---ASDVKVVLGAHNIVLREGGEQKFDVEKIIVH 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  493 TLYSVFNPsDHDLVLIRLKKKgdrcATRSQFVQPICLPEPGSTFPAGHKCQIAGWGHLDENvsGYSSSLREALVPLVADH 572
Cdd:pfam00089  78 PNYNPDTL-DNDIALLKLESP----VTLGDTVRPICLPDASSDLPVGTTCTVSGWGNTKTL--GPSDTLQEVTVPVVSRE 150
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 661903023  573 KCSSpeVYGADISPNMLCAGYfdCKSDACQGDSGGPLACEKNgvaYLYGIISWGDGCGRLHKPGVYTRVANYVDWI 648
Cdd:pfam00089 151 TCRS--AYGGTVTDTMICAGA--GGKDACQGDSGGPLVCSDG---ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
Kringle pfam00051
Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, ...
286-367 3.89e-26

Kringle domain; Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.


Pssm-ID: 395005  Cd Length: 79  Bit Score: 102.00  E-value: 3.89e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  286 CFLGNGTGYRGVASTSASGLSCLAWNSDLLYQ-ELHVDSVGAAALLGLgphAYCRNPDNDERPWCYvVKDSALSWEYCRL 364
Cdd:pfam00051   1 CYHGNGESYRGTVSTTESGRPCQAWDSQTPHRhSKYTPENFPAKGLGE---NYCRNPDGDERPWCY-TTDPRVRWEYCDI 76

                  ...
gi 661903023  365 EAC 367
Cdd:pfam00051  77 PRC 79
KR cd00108
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as ...
283-368 9.33e-24

Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.


Pssm-ID: 238056  Cd Length: 83  Bit Score: 95.14  E-value: 9.33e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 283 DERCFLGNGTGYRGVASTSASGLSCLAWNSDLLYQelHVDSVGAAALLGLGpHAYCRNPDND-ERPWCYvVKDSALSWEY 361
Cdd:cd00108    1 TRDCYWGNGESYRGTVSTTKSGKPCQRWNSQLPHQ--HKFNPERFPEGLLE-ENYCRNPDGDpEGPWCY-TTDPNVRWEY 76

                 ....*..
gi 661903023 362 CRLEACD 368
Cdd:cd00108   77 CDIPRCE 83
KR smart00130
Kringle domain; Named after a Danish pastry. Found in several serine proteases and in ROR-like ...
284-368 6.55e-23

Kringle domain; Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.


Pssm-ID: 214527  Cd Length: 83  Bit Score: 92.84  E-value: 6.55e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023   284 ERCFLGNGTGYRGVASTSASGLSCLAWNS--DLLYQELHVDSVGAaallgLGPHAYCRNPDND-ERPWCYvVKDSALSWE 360
Cdd:smart00130   1 RECYAGNGESYRGTVSVTKSGKPCQRWDSqtPHLHRFTPESFPDL-----GLEENYCRNPDGDsEGPWCY-TTDPNVRWE 74

                   ....*...
gi 661903023   361 YCRLEACD 368
Cdd:smart00130  75 YCDIPQCE 82
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
413-650 5.71e-22

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 227927 [Multi-domain]  Cd Length: 413  Bit Score: 98.80  E-value: 5.71e-22
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 413 PRIIGGSSSLPGSHPWLAA--IYIGD----SFCAGSLVHTCWVVSAAHCFSHSPP--RDSVSVVLG---QHFFNRTtDVT 481
Cdd:COG5640   31 SRIIGGSNANAGEYPSLVAlvDRISDyvsgTFCGGSKLGGRYVLTAAHCADASSPisSDVNRVVVDlndSSQAERG-HVR 109
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 482 QTFGIEKYIPYTLysvfnpsDHDLVLIRLKKKGDRCATRSQFVQPiclPEPG-STFPAGHKCQIAGWGHL----DENVSG 556
Cdd:COG5640  110 TIYVHEFYSPGNL-------GNDIAVLELARAASLPRVKITSFDA---SDTFlNSVTTVSPMTNGTFGVTtpsdVPRSSP 179
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 557 YSSSLREALVPLVADHKCS--SPEVYGADISPNM--LCAGYFdcKSDACQGDSGGPLACE-KNGVAYLyGIISWGDG-CG 630
Cdd:COG5640  180 KGTILHEVAVLFVPLSTCAqyKGCANASDGATGLtgFCAGRP--PKDACQGDSGGPIFHKgEEGRVQR-GVVSWGDGgCG 256
                        250       260
                 ....*....|....*....|
gi 661903023 631 RLHKPGVYTRVANYVDWIND 650
Cdd:COG5640  257 GTLIPGVYTNVSNYQDWIAA 276
FN2 smart00059
Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, ...
102-148 4.55e-19

Fibronectin type 2 domain; One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.


Pssm-ID: 128373  Cd Length: 49  Bit Score: 80.81  E-value: 4.55e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 661903023   102 TEDGRPCRFPFRYGGRMLHACTSEGSAHRK-WCATTHNYDRDRAWGYC 148
Cdd:smart00059   2 NSDGEPCVFPFIYNGKKYHDCTSEGRSDGMlWCSTTPNYDRDGKWGFC 49
FN2 cd00062
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to ...
102-148 1.89e-18

Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.


Pssm-ID: 238019  Cd Length: 48  Bit Score: 78.89  E-value: 1.89e-18
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 661903023 102 TEDGRPCRFPFRYGGRMLHACTSEGSA-HRKWCATTHNYDRDRAWGYC 148
Cdd:cd00062    1 NSDGAPCVFPFIYRGKWYHDCTTEGSNdGKLWCSTTPNYDRDGKWGYC 48
fn2 pfam00040
Fibronectin type II domain;
108-148 3.73e-17

Fibronectin type II domain;


Pssm-ID: 425438  Cd Length: 42  Bit Score: 75.37  E-value: 3.73e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 661903023  108 CRFPFRYGGRMLHACTSEGSAH-RKWCATTHNYDRDRAWGYC 148
Cdd:pfam00040   1 CVFPFKYNGKWYHSCTTDGRSDgRLWCSTTANYDGDGKWGYC 42
FN1 cd00061
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. ...
200-240 1.35e-09

Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.


Pssm-ID: 238018  Cd Length: 43  Bit Score: 53.88  E-value: 1.35e-09
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|...
gi 661903023 200 EKCFDETRYEYLEGGDRWARVRQGHVEQCECFG--GRTWCEGT 240
Cdd:cd00061    1 EKCFDPQTGVFYRVGETWERPSEGHVLQCTCLGnrGEARCDPV 43
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
162-197 1.77e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 1.77e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 661903023 162 DPCASG-PCLNGGSCSNTqdPQSYHCSCPRAFTGKDC 197
Cdd:cd00054    3 DECASGnPCQNGGTCVNT--VGSYRCSCPPGYTGRNC 37
FN1 smart00058
Fibronectin type 1 domain; One of three types of internal repeat within the plasma protein, ...
202-242 5.17e-05

Fibronectin type 1 domain; One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.


Pssm-ID: 214494  Cd Length: 45  Bit Score: 40.80  E-value: 5.17e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 661903023   202 CFDE-TRYEYLEGgDRWARVRQ-GHVEQCECFG---GRTWCEGTRH 242
Cdd:smart00058   1 CFDEeTGTTYRVG-DTWERPYEgGHVLQCTCLGggrGEWKCDPVPV 45
EGF_CA smart00179
Calcium-binding EGF-like domain;
162-197 6.91e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.61  E-value: 6.91e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 661903023   162 DPCAS-GPCLNGGSCSNTQDpqSYHCSCPRAFT-GKDC 197
Cdd:smart00179   3 DECASgNPCQNGGTCVNTVG--SYRCECPPGYTdGRNC 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
164-196 9.57e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.98  E-value: 9.57e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 661903023  164 CASGPCLNGGSCSNTqdPQSYHCSCPRAFTGKD 196
Cdd:pfam00008   1 CAPNPCSNGGTCVDT--PGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
245-278 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 35.69  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 661903023 245 CLS-SPCLNGGTCHLIVatGTTVCACPPGFAGRLC 278
Cdd:cd00054    5 CASgNPCQNGGTCVNTV--GSYRCSCPPGYTGRNC 37
fn1 pfam00039
Fibronectin type I domain;
202-232 3.61e-03

Fibronectin type I domain;


Pssm-ID: 425437  Cd Length: 40  Bit Score: 35.36  E-value: 3.61e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 661903023  202 CFDETRYEYLEGGDRWARVRQ-GHVEQCECFG 232
Cdd:pfam00039   1 CYDPQTGRFYQVGETWERPHQrGHVLRCTCLG 32
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
32-100 5.31e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 39.93  E-value: 5.31e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 661903023  32 GFQPQPGGN---RTESPEPNATATPAIPTILVTSVTSETPATSAPEAEGPQSGGLPPPPRAVPSSSSPQAQA 100
Cdd:PRK14954 379 GVAPSPAGSpdvKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTPEQQPPVARSAPLPPSPQASA 450
PHA03378 PHA03378
EBNA-3B; Provisional
40-195 6.16e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 39.67  E-value: 6.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023  40 NRTESPEPNATATPAIPTILVTSVTSETPATSAPEAEGPQSGGLPPP-----PRAVPSSSSPQAQALTEDGRPCRFPfry 114
Cdd:PHA03378 665 TWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPgraqrPAAATGRARPPAAAPGRARPPAAAP--- 741
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 661903023 115 gGRMLHACTSEGSAHRKWCATTHNYDRDRAWGYCVEATPPPGGPAALDPCASGPclnggscSNTQDPQSYHCS---CPRA 191
Cdd:PHA03378 742 -GRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAP-------TPQPPPQAGPTSmqlMPRA 813

                 ....
gi 661903023 192 FTGK 195
Cdd:PHA03378 814 APGQ 817
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
245-276 9.10e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 34.28  E-value: 9.10e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 661903023  245 CLSSPCLNGGTCHLIVatGTTVCACPPGFAGR 276
Cdd:pfam00008   1 CAPNPCSNGGTCVDTP--GGYTCICPEGYTGK 30
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.20
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH