NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|74188489|dbj|BAE28005|]
View 

unnamed protein product [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-402 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 571.53  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489     36 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    114 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    271 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRHFLFK 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETRHFLFK 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 74188489    351 PG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPSKYCNWK 402
Cdd:pfam06484  315 TGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1286-1627 3.63e-47

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.63e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1286 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1355
Cdd:cd14953   11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1356 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1433
Cdd:cd14953   90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1434 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1511
Cdd:cd14953  156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1512 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1591
Cdd:cd14953  226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                        330       340       350
                 ....*....|....*....|....*....|....*.
gi 74188489 1592 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1627
Cdd:cd14953  288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2749-2826 3.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 74188489   2749 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2826
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1660-2525 1.06e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.97  E-value: 1.06e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1660 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1739
Cdd:COG3209  191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1740 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1817
Cdd:COG3209  271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1818 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1897
Cdd:COG3209  351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1898 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1977
Cdd:COG3209  431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1978 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKDDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2057
Cdd:COG3209  511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2058 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2137
Cdd:COG3209  588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2138 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2217
Cdd:COG3209  668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2218 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2297
Cdd:COG3209  747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2298 LTPL-----RYDLRDRITRLgdvqyrmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2372
Cdd:COG3209  826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2373 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2452
Cdd:COG3209  896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 74188489 2453 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2525
Cdd:COG3209  955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
810-853 4.55e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 43.00  E-value: 4.55e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 74188489    810 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 853
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
Keratin_B2 super family cl37504
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
691-839 1.41e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


The actual alignment was detected with superfamily member pfam01500:

Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    691 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 765
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    766 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 838
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 74188489    839 C 839
Cdd:pfam01500  153 C 153
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-402 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 571.53  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489     36 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    114 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    271 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRHFLFK 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETRHFLFK 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 74188489    351 PG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPSKYCNWK 402
Cdd:pfam06484  315 TGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1286-1627 3.63e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.63e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1286 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1355
Cdd:cd14953   11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1356 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1433
Cdd:cd14953   90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1434 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1511
Cdd:cd14953  156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1512 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1591
Cdd:cd14953  226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                        330       340       350
                 ....*....|....*....|....*....|....*.
gi 74188489 1592 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1627
Cdd:cd14953  288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2749-2826 3.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 74188489   2749 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2826
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1660-2525 1.06e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.97  E-value: 1.06e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1660 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1739
Cdd:COG3209  191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1740 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1817
Cdd:COG3209  271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1818 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1897
Cdd:COG3209  351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1898 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1977
Cdd:COG3209  431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1978 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKDDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2057
Cdd:COG3209  511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2058 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2137
Cdd:COG3209  588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2138 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2217
Cdd:COG3209  668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2218 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2297
Cdd:COG3209  747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2298 LTPL-----RYDLRDRITRLgdvqyrmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2372
Cdd:COG3209  826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2373 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2452
Cdd:COG3209  896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 74188489 2453 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2525
Cdd:COG3209  955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2446-2525 2.74e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.74e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489   2446 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnsivPF------HLYMFKNN 2519
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 74188489   2520 NPISNS 2525
Cdd:TIGR03696   67 NPVNWV 72
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1296-1627 4.01e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.42  E-value: 4.01e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1296 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKVKST 1372
Cdd:COG4257   19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1373 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1449
Cdd:COG4257   89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1450 DLTSARplscdsvmeisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1529
Cdd:COG4257  141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1530 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1609
Cdd:COG4257  186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                        330
                 ....*....|....*...
gi 74188489 1610 ADGELYVADLGNIRIRFI 1627
Cdd:COG4257  240 GDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
1953-2362 2.21e-08

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 60.02  E-value: 2.21e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1953 PGGHIAGIQRGIMSERMEYDQAGRITSRIFADGKMWSYTY----------LEKSMVLHLHSQRQYIFEFDKDDRLSSVTM 2022
Cdd:NF041261  351 PGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYeqdrititdsLNRREVLHTEGEGGLKRVVKKEHADGSVTR 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2023 PN---VARQTLETirSVGYYRNIYQPPEGNASVIQDFTEDGHLLhTFYLGTGRR---VIYKYGKLSK---------LAET 2087
Cdd:NF041261  431 SGydaAGRLTAQT--DAAGRRTEYSLNVVSGDITDITTPDGRET-KFYYNDGNQltsVTSPDGLESRreydepgrlVSET 507
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2088 LYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETpl 2167
Cdd:NF041261  508 SRSGETTRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREE-- 575
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2168 PIDLYR-YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNM 2237
Cdd:NF041261  576 GISTYRrYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAA 648
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2238 GRVVKKELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GD 2314
Cdd:NF041261  649 GRITTLTNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGE 721
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 74188489  2315 V--QYRMDEDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2362
Cdd:NF041261  722 PaeQWQYDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
RHS_core NF041261
RHS element core protein;
2172-2508 2.63e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.47  E-value: 2.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2172 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2233
Cdd:NF041261  367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2234 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2304
Cdd:NF041261  443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2305 lrDRITRLGDVQyrMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2384
Cdd:NF041261  520 --DPHSELPATT--TDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2385 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2464
Cdd:NF041261  587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 74188489  2465 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWKRLSSNSI 2508
Cdd:NF041261  659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1331-1631 1.06e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1331 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkvksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1404
Cdd:PLN02919  556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1405 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmeiSQVrLEWPTDL 1476
Cdd:PLN02919  621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1477 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1524
Cdd:PLN02919  689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1525 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1576
Cdd:PLN02919  769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 74188489  1577 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1631
Cdd:PLN02919  847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
810-853 4.55e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.55e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 74188489    810 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 853
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1744-1776 5.88e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.88e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 74188489   1744 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1776
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
691-839 1.41e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    691 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 765
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    766 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 838
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 74188489    839 C 839
Cdd:pfam01500  153 C 153
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1537-1723 1.85e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 43.02  E-value: 1.85e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1537 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1616
Cdd:cd14957   23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1617 ADLGNIRIRfirknkpFLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1696
Cdd:cd14957   81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                        170       180
                 ....*....|....*....|....*..
gi 74188489 1697 RDSTGmplwlvvpdgqVYWVTMGTNSA 1723
Cdd:cd14957  138 FTSSG-----------TFSYSIGSGGT 153
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
699-722 3.91e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.91e-03
                         10        20
                 ....*....|....*....|....*...
gi 74188489  699 CSSHGTCIMG----TCICNPGYKGESCE 722
Cdd:cd00054   11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-402 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 571.53  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489     36 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    114 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    271 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRHFLFK 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETRHFLFK 314
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 74188489    351 PG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPSKYCNWK 402
Cdd:pfam06484  315 TGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1286-1627 3.63e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.63e-47
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1286 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1355
Cdd:cd14953   11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1356 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1433
Cdd:cd14953   90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1434 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1511
Cdd:cd14953  156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1512 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1591
Cdd:cd14953  226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                        330       340       350
                 ....*....|....*....|....*....|....*.
gi 74188489 1592 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1627
Cdd:cd14953  288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2749-2826 3.21e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.21e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 74188489   2749 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2826
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1660-2525 1.06e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.97  E-value: 1.06e-33
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1660 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1739
Cdd:COG3209  191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1740 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1817
Cdd:COG3209  271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1818 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1897
Cdd:COG3209  351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1898 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1977
Cdd:COG3209  431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1978 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKDDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2057
Cdd:COG3209  511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2058 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2137
Cdd:COG3209  588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2138 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2217
Cdd:COG3209  668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2218 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2297
Cdd:COG3209  747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2298 LTPL-----RYDLRDRITRLgdvqyrmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2372
Cdd:COG3209  826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 2373 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2452
Cdd:COG3209  896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                        810       820       830       840       850       860       870
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 74188489 2453 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2525
Cdd:COG3209  955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1291-1627 5.42e-19

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 89.69  E-value: 5.42e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1291 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHkyyLATDPmSGAVFLSDTNSRRVFK 1368
Cdd:cd05819    5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1369 VKSTTVVKDlvknseVVAGTGDQCLPFDdtrcgdggkateatltNPRGITVDKFGLIYFVDgTM---IRRVDQNGIISTL 1445
Cdd:cd05819   81 FDPDGNFLA------SFGGSGDGDGEFN----------------GPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTT 137
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1446 LGSNDLTSARplscdsvmeisqvrLEWPTDLAINPmDNSLYVLDnnvvlqiSENHQVRIVAgrpmhcqvPGiDHFLL--- 1522
Cdd:cd05819  138 FGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFD--------PD-GNFLTtfg 186
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1523 SKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfsgdDGYAKDAKLNT 1602
Cdd:cd05819  187 STGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNR 244
                        330       340
                 ....*....|....*....|....*
gi 74188489 1603 PSSLAVCADGELYVADLGNIRIRFI 1627
Cdd:cd05819  245 PSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1411-1721 3.00e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 84.29  E-value: 3.00e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1411 LTNPRGITVDKFGLIYFVDGTM--IRRVDQNGIISTLLGSNDltsarplscdsvmeISQVRLEWPTDLAINPmDNSLYVL 1488
Cdd:cd05819    7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNLYVA 71
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1489 D--NNVVLQISENHQVRIVAGRPmhcqvpGIDHFLLSkvaihatleSATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1566
Cdd:cd05819   72 DtgNHRIQKFDPDGNFLASFGGS------GDGDGEFN---------GPRGIAVDSSGNIYVADTGN---HRIQKFDPDGE 133
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1567 ISLVAGAPSGCDckndancdcfsgddgyakdAKLNTPSSLAVCADGELYVADLGNIRIRFIrknkpflntqnmyelsSPI 1646
Cdd:cd05819  134 FLTTFGSGGSGP-------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQVF----------------DPD 178
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1647 DQELYLFDTSGKHLYTQSLPTG------DYLYnFTYTGDGDITHITDN------NGNmVNVRRDSTGMPLWLVV-PDGQV 1713
Cdd:cd05819  179 GNFLTTFGSTGTGPGQFNYPTGiavdsdGNIY-VADSGNNRVQVFDPDgagfggNGN-FLGSDGQFNRPSGLAVdSDGNL 256

                 ....*...
gi 74188489 1714 YWVTMGTN 1721
Cdd:cd05819  257 YVADTGNN 264
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1283-1496 2.40e-14

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 76.80  E-value: 2.40e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1283 SCNGLADGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNilemrnkdfrhshspahkyylatdpmsgavflsd 1360
Cdd:cd14953  176 AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTT---------------------------------- 221
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1361 tnsrrvfkvksttvvkdlvknsevVAGTGDQclPFddtrcGDGGKATEATLTNPRGITVDKFGLIYFVD---GTmIRRVD 1437
Cdd:cd14953  222 ------------------------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADsgnHR-IRKIT 269
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 74188489 1438 QNGIISTLLGSndlTSARPLSCDSVmeiSQVRLEWPTDLAINPmDNSLYVLD--NNVVLQI 1496
Cdd:cd14953  270 PAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1289-1558 1.93e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 73.12  E-value: 1.93e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1289 DGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRV 1366
Cdd:cd05819   50 GDGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLASFGGSGDGDGEFNGP---RGIAVDS-SGNIYVADTGNHRI 125
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1367 FKVKSttvvkdlvkNSEVVAGTGdqclpfddtrcgdGGKATEATLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIIST 1444
Cdd:cd05819  126 QKFDP---------DGEFLTTFG-------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1445 LLGSNDLTSArplscdsvmeisqvRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGrpmhcqvpgidhfll 1522
Cdd:cd05819  184 TFGSTGTGPG--------------QFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------- 233
                        250       260       270
                 ....*....|....*....|....*....|....*.
gi 74188489 1523 SKVAIHATLESATALAVSHNGVLYIAETDEKKINRI 1558
Cdd:cd05819  234 NFLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2446-2525 2.74e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.74e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489   2446 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnsivPF------HLYMFKNN 2519
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 74188489   2520 NPISNS 2525
Cdd:TIGR03696   67 NPVNWV 72
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1347-1624 1.91e-09

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 60.68  E-value: 1.91e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1347 LATDPmSGAVFLSDTNSRRVFKVksttvvkdlvknsevVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLI 1425
Cdd:cd14952   15 VAVDA-AGNVYVADSGNNRVLKL---------------AAGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1426 YFVDGtmirrvDQNGIISTLLGSNDLTsarPLSCDSvmeisqvrLEWPTDLAINPMDNsLYVLD--NNVVLqisenhqvR 1503
Cdd:cd14952   66 YVTDF------GNNRVLKLAAGSTTQT---VLPFTG--------LNDPTGVAVDAAGN-VYVADtgNNRVL--------K 119
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1504 IVAGRPMHCQVPgidhFllskvaihATLESATALAVSHNGVLYIAETDEkkiNRIRQvttsgeisLVAGA------Psgc 1577
Cdd:cd14952  120 LAAGSNTQTVLP----F--------TGLSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSttqtvlP--- 173
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*..
gi 74188489 1578 dckndancdcFSGddgyakdakLNTPSSLAVCADGELYVADLGNIRI 1624
Cdd:cd14952  174 ----------FTG---------LNSPSGVAVDTAGNVYVTDHGNNRV 201
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1296-1627 4.01e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.42  E-value: 4.01e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1296 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKVKST 1372
Cdd:COG4257   19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1373 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1449
Cdd:COG4257   89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1450 DLTSARplscdsvmeisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1529
Cdd:COG4257  141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1530 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1609
Cdd:COG4257  186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                        330
                 ....*....|....*...
gi 74188489 1610 ADGELYVADLGNIRIRFI 1627
Cdd:COG4257  240 GDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
1953-2362 2.21e-08

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 60.02  E-value: 2.21e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1953 PGGHIAGIQRGIMSERMEYDQAGRITSRIFADGKMWSYTY----------LEKSMVLHLHSQRQYIFEFDKDDRLSSVTM 2022
Cdd:NF041261  351 PGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYeqdrititdsLNRREVLHTEGEGGLKRVVKKEHADGSVTR 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2023 PN---VARQTLETirSVGYYRNIYQPPEGNASVIQDFTEDGHLLhTFYLGTGRR---VIYKYGKLSK---------LAET 2087
Cdd:NF041261  431 SGydaAGRLTAQT--DAAGRRTEYSLNVVSGDITDITTPDGRET-KFYYNDGNQltsVTSPDGLESRreydepgrlVSET 507
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2088 LYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETpl 2167
Cdd:NF041261  508 SRSGETTRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREE-- 575
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2168 PIDLYR-YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNM 2237
Cdd:NF041261  576 GISTYRrYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAA 648
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2238 GRVVKKELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GD 2314
Cdd:NF041261  649 GRITTLTNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGE 721
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 74188489  2315 V--QYRMDEDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2362
Cdd:NF041261  722 PaeQWQYDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1413-1624 7.74e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 53.44  E-value: 7.74e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1413 NPRGITVDKFGLIYFVD--GTMIRRVDQNGIISTLLGSndlTSARPLScdsvmeisqvrLEWPTDLAINPmDNSLYVLDn 1490
Cdd:cd14956  108 APRGVAVDADGNLYVADfgNQRIQKFDPDGSFLRQWGG---TGIEPGS-----------FNYPRGVAVDP-DGTLYVAD- 171
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1491 nvvlqiSENHQVrivagrpmhcQVPGIDHFLLSKVAIHAT----LESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1566
Cdd:cd14956  172 ------TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGT 232
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 74188489 1567 ISLVAGAPSGcdckndancdcfsgddgyaKDAKLNTPSSLAVCADGELYVADLGNIRI 1624
Cdd:cd14956  233 FLTSWGSPGT-------------------GPGQFKNPWGVVVDADGTVYVADSNNNRV 271
RHS_core NF041261
RHS element core protein;
2172-2508 2.63e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.47  E-value: 2.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2172 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2233
Cdd:NF041261  367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2234 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2304
Cdd:NF041261  443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2305 lrDRITRLGDVQyrMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2384
Cdd:NF041261  520 --DPHSELPATT--TDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  2385 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2464
Cdd:NF041261  587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 74188489  2465 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWKRLSSNSI 2508
Cdd:NF041261  659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1409-1627 4.55e-06

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 50.79  E-value: 4.55e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1409 ATLTNPRGITVDKFGLIYFVD--GTMIRRVD-QNGIIStllgsndltsarplscdsvmEISQVRLEWPTDLAINPmDNSL 1485
Cdd:COG4257   14 APGSGPRDVAVDPDGAVWFTDqgGGRIGRLDpATGEFT--------------------EYPLGGGSGPHGIAVDP-DGNL 72
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1486 YVLD--NNVVLQIS-ENHQVRIVAGrpmhcqvPGIDHFLlskvaihatlesaTALAVSHNGVLYIAETDekkINRIRQVT 1562
Cdd:COG4257   73 WFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNP-------------HGIAFDPDGNLWFTDQG---GNRIGRLD 129
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1563 T-SGEISLV-----AGAPSGCDCKND---------ANC-DCFSGDDG----YAKDAKLNTPSSLAVCADGELYVADLGNI 1622
Cdd:COG4257  130 PaTGEVTEFplptgGAGPYGIAVDPDgnlwvtdfgANAiGRIDPDTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSG 209

                 ....*
gi 74188489 1623 RIRFI 1627
Cdd:COG4257  210 RIGRF 214
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1331-1631 1.06e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1331 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkvksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1404
Cdd:PLN02919  556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1405 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmeiSQVrLEWPTDL 1476
Cdd:PLN02919  621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1477 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1524
Cdd:PLN02919  689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489  1525 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1576
Cdd:PLN02919  769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 74188489  1577 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1631
Cdd:PLN02919  847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1414-1721 2.46e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.80  E-value: 2.46e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1414 PRGITVDKFGLIYFVD--GTMIRRVDQNGIISTLLGSNDltsarplscdsvmeISQVRLEWPTDLAINPMDNsLYVLDnn 1491
Cdd:cd14957   20 PRGIAVDSAGNIYVADtgNNRIQVFTSSGVYSYSIGSGG--------------TGSGQFNSPYGIAVDSNGN-IYVAD-- 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1492 vvlqiSENHQVRIvagrpmhcqvpgidhFLLSKVAIHA---------TLESATALAVSHNGVLYIAETDEkkiNRIrQVT 1562
Cdd:cd14957   83 -----TDNNRIQV---------------FNSSGVYQYSigtggsgdgQFNGPYGIAVDSNGNIYVADTGN---HRI-QVF 138
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1563 TSgeislvAGAPsgcdckndancdCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRfirknkpflntqnmyel 1642
Cdd:cd14957  139 TS------SGTF------------SYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ----------------- 183
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1643 sspidqelyLFDTSGKHLYT-QSLPTGDYLYNFTY----TGDGDItHITDNNGNMVNVrRDSTGmplwlvvpdgqVYWVT 1717
Cdd:cd14957  184 ---------VFTSSGTFQYTfGSSGSGPGQFSDPYgiavDSDGNI-YVADTGNHRIQV-FTSSG-----------AYQYS 241

                 ....
gi 74188489 1718 MGTN 1721
Cdd:cd14957  242 IGTS 245
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1403-1624 3.96e-05

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 48.05  E-value: 3.96e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1403 GGKATEA-TLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIISTLLGSNdltSARPLSCDSvmeisqvrlewPTDLAIN 1479
Cdd:cd14956   50 GTTGDGPgQFGRPRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSS---GSGPGQFNA-----------PRGVAVD 115
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1480 PmDNSLYVLD--NNVVLQISENHQ-VRIVAGRPmhcQVPGidHFLlskvaihatleSATALAVSHNGVLYIAETdekKIN 1556
Cdd:cd14956  116 A-DGNLYVADfgNQRIQKFDPDGSfLRQWGGTG---IEPG--SFN-----------YPRGVAVDPDGTLYVADT---YND 175
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 74188489 1557 RIRQVTTSGEISLVAGAPSGcdckndancdcFSGDdgyakdakLNTPSSLAVCADGELYVADLGNIRI 1624
Cdd:cd14956  176 RIQVFDNDGAFLRKWGGRGT-----------GPGQ--------FNYPYGIAIDPDGNVFVADFGNNRI 224
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
810-853 4.55e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.55e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 74188489    810 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 853
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1296-1437 6.02e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 47.32  E-value: 6.02e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1296 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRhshsPahkYYLATDPmSGAVFLSDTNSRRVFKVKSTT 1373
Cdd:COG4257  147 PYGIAVDPDGNLWVTDFgaNAIGRIDPDTGTLTEYALPTPGAG----P---RGLAVDP-DGNLWVADTGSGRIGRFDPKT 218
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1374 vvkdlvknsevvagtgdqclpfddtrcgdgGKATEATLTN----PRGITVDKFGLIYFVDGT--MIRRVD 1437
Cdd:COG4257  219 ------------------------------GTVTEYPLPGggarPYGVAVDGDGRVWFAESGanRIVRFD 258
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1567-1627 6.33e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 47.91  E-value: 6.33e-05
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 74188489 1567 ISLVAGAPSGcdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1627
Cdd:cd14953    1 VSTVAGSGTA------------GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKI 49
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1740-1780 2.85e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 40.65  E-value: 2.85e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 74188489   1740 HGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSF 1780
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1538-1628 3.35e-04

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 45.65  E-value: 3.35e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1538 AVSHNGVLYIAETdekKINRIRQV-TTSGEISLVAGapsgcdckndancdcfSGDDGYA-KDAKLNTPSSLAVCADGELY 1615
Cdd:cd14951  202 AALPDGSVYVADT---YNHKIKRVdPATGEVSTLAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLY 262
                         90
                 ....*....|...
gi 74188489 1616 VADLGNIRIRFIR 1628
Cdd:cd14951  263 VADTNNHRIRRLD 275
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1744-1776 5.88e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.88e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 74188489   1744 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1776
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1292-1550 1.31e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 43.43  E-value: 1.31e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1292 KLLAPVALTCGSDGSLYVGDFnYIRRI--F-PSGNVTNILEmRNKDFRHSHSPAHkyyLATDpmSGAVFLSDTNSRRVfk 1368
Cdd:cd14963   54 EFKYPYGIAVDSDGNIYVADL-YNGRIqvFdPDGKFLKYFP-EKKDRVKLISPAG---LAID--DGKLYVSDVKKHKV-- 124
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1369 vksttVVKDLvknsevvagTGDQCLPFddtrcGDGGKAtEATLTNPRGITVDKFGLIYFVDgTMIRRV---DQNG-IIST 1444
Cdd:cd14963  125 -----IVFDL---------EGKLLLEF-----GKPGSE-PGELSYPNGIAVDEDGNIYVAD-SGNGRIqvfDKNGkFIKE 183
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1445 LLGSNDLTSArplscdsvmeisqvrLEWPTDLAINPmDNSLYVLDN--NVVLQISENHQVRIVAGRpmhcqvPGIDhfll 1522
Cdd:cd14963  184 LNGSPDGKSG---------------FVNPRGIAVDP-DGNLYVVDNlsHRVYVFDEQGKELFTFGG------RGKD---- 237
                        250       260
                 ....*....|....*....|....*...
gi 74188489 1523 skvaiHATLESATALAVSHNGVLYIAET 1550
Cdd:cd14963  238 -----DGQFNLPNGLFIDDDGRLYVTDR 260
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
691-839 1.41e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    691 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 765
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489    766 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 838
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 74188489    839 C 839
Cdd:pfam01500  153 C 153
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1537-1723 1.85e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 43.02  E-value: 1.85e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1537 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1616
Cdd:cd14957   23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1617 ADLGNIRIRfirknkpFLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1696
Cdd:cd14957   81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                        170       180
                 ....*....|....*....|....*..
gi 74188489 1697 RDSTGmplwlvvpdgqVYWVTMGTNSA 1723
Cdd:cd14957  138 FTSSG-----------TFSYSIGSGGT 153
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1291-1375 1.94e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 42.70  E-value: 1.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1291 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTNilemrnkdFRHSHSPAHKYYLATDPmSGAVFLSDTNSRRVF 1367
Cdd:COG4257  185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPkTGTVTE--------YPLPGGGARPYGVAVDG-DGRVWFAESGANRIV 255

                 ....*...
gi 74188489 1368 KVKSTTVV 1375
Cdd:COG4257  256 RFDPDTEL 263
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1404-1506 3.60e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 42.18  E-value: 3.60e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1404 GKATEATLTNPRGITVDKFGLIYFVDgTM---IRRVD-QNGIISTLLGSNDLTSArplscdsvmeISQVRLEWPTDLAIN 1479
Cdd:cd14951  188 GPGAEALLQHPLGVAALPDGSVYVAD-TYnhkIKRVDpATGEVSTLAGTGKAGYK----------DLEAQFSEPSGLVVD 256
                         90       100
                 ....*....|....*....|....*..
gi 74188489 1480 PmDNSLYVLDNNvvlqiseNHQVRIVA 1506
Cdd:cd14951  257 G-DGRLYVADTN-------NHRIRRLD 275
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
828-850 3.65e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.65e-03
                           10        20
                   ....*....|....*....|....*
gi 74188489    828 CAEHGTCRD--GKCECSPGWNGEHC 850
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
699-722 3.91e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.91e-03
                         10        20
                 ....*....|....*....|....*...
gi 74188489  699 CSSHGTCIMG----TCICNPGYKGESCE 722
Cdd:cd00054   11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1296-1497 5.73e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.04  E-value: 5.73e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1296 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILemrnkDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRVFKVkstt 1373
Cdd:cd14952   96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVL-----PFTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKL---- 162
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 74188489 1374 vvkdlvknsevVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLIYFVDGtmirrvDQNGIISTLLGSNDLT 1452
Cdd:cd14952  163 -----------AAGSTTQTvLPFTG-------------LNSPSGVAVDTAGNVYVTDH------GNNRVLKLAAGSTTPT 212
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*..
gi 74188489 1453 sARPLScdsvmeisqvRLEWPTDLAINPmDNSLYVLD--NNVVLQIS 1497
Cdd:cd14952  213 -VLPFT----------GLNGPLGVAVDA-AGNVYVADrgNDRVVKLP 247
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
698-721 6.15e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.56  E-value: 6.15e-03
                           10        20
                   ....*....|....*....|....*.
gi 74188489    698 ACSSHGTCIM--GTCICNPGYKGESC 721
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
2335-2364 9.25e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 9.25e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 74188489   2335 YNSAGLLIKAYNrASGWSVRYRYDGLGRRV 2364
Cdd:pfam05593    1 YDAAGRLTSVTD-PDGRVTTYTYDAAGRLT 29
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH