|
Name |
Accession |
Description |
Interval |
E-value |
| trp |
TIGR00870 |
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ... |
16-776 |
0e+00 |
|
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273311 [Multi-domain] Cd Length: 743 Bit Score: 942.59 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 16 RLDYDLMMAEEYILSDVEKNFILSCERGDLPGVKKILEEYQgtdKFNINCTDPMNRSALIS-AIENENFDLMVILLEHN- 93
Cdd:TIGR00870 1 RGPLDIVPAEESPLSDEEKAFLPAAERGDLASVYRDLEEPK---KLNINCPDRLGRSALFVaAIENENLELTELLLNLSc 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 94 -IEVGDALLHAISEEYVEAVEELLQWEETNHKEGQPYsWEAVDRSKSTFTVDITPLILAAHRNNYEILKILLDRGATLPm 172
Cdd:TIGR00870 78 rGAVGDTLLHAISLEYVDAVEAILLHLLAAFRKSGPL-ELANDQYTSEFTPGITALHLAAHRQNYEIVKLLLERGASVP- 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 173 phdVKCGCDECVTSQMTDSLRHSQSRINAYRALSASSLIALSSRDP--VLTAFQLSWELKRLQAMESEFRAEYTEMRQMV 250
Cdd:TIGR00870 156 ---ARACGDFFVKSQGVDSFYHGESPLNAAACLGSPSIVALLSEDPadILTADSLGNTLLHLLVMENEFKAEYEELSCQM 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 251 QDFGTSLLDHARTSMELEVMLNFNHEPSHDIWCLGQRQTLERLKLAIRYKQKTFVAHPNVQQLLAAIWYDGLPGFRRKQA 330
Cdd:TIGR00870 233 YNFALSLLDKLRDSKELEVILNHQGLTPLKLAAKEGRIVLFRLKLAIKYKQKKFVAWPNGQQLLSLYWLEELDGWRRKQS 312
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 331 SQQLMDVVKLGCSFPIYSLKYILAPDSEGAKFMRKPFVKFITHSCSYMFFLMLLGAASLRVVQITFELLafpwmltmled 410
Cdd:TIGR00870 313 VLELIVVFVIGLKFPELSDMYLIAPLSRLGQFKWKPFIKFIFHSASYLYFLYLIIFTSVAYYRPTRTDL----------- 381
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 411 wrkHERGSLPGPIELAIITYIMALIFEELKSLYSDGLFEYIMDLWNIVDYISNMFYVTWILCRATAWVIVHRdlwfrgiD 490
Cdd:TIGR00870 382 ---RVTGLQQTPLEMLIVTWVDGLRLGEEKLIWLGGIFEYIHQLWNILDFGMNSFYLATFLDRPFAILFVTQ-------A 451
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 491 PYFPREHWHPFDPMLLSEGAFAAGMVFSYLKLVHIFSINPHLGPLQVSLGRMIIDIIKF-FFIYTLVLFAFGCGLNQLLW 569
Cdd:TIGR00870 452 FLVLREHWLRFDPTLIEEALFAFALVLSWLNLLYIFRGNQHLGPLQIMIGRMILGDILRfLFIYAVVLFGFACGLNQLYQ 531
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 570 YYAELEKNKCYHLHPdvadfddqEKACTIWRRFSNLFETSQSLFWASFGLVDLvsfdLAGIKSFTRFWALLMFGSYSVIN 649
Cdd:TIGR00870 532 YYDELKLNECSNPHA--------RSCEKQGNAYSTLFETSQELFWAIIGLGDL----LANEHKFTEFVGLLLFGAYNVIM 599
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 650 IIVLLNMLIAMMSNSYQIISERADTEWKFARSQLWMSYFEDGGTIPPPFNLCPN-------MKMLRKTLGRKRPSRTK-- 720
Cdd:TIGR00870 600 YILLLNMLIAMMGNTYQLIADDADEEWKFQRAKLWMSYEREGGTCPPPFNIIPGpksfvglFKRIEKHDGKKRQRWCRrv 679
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 721 --SFMRKSMERAQTL------HDKVMKLLVRRYITAEQRRRDDYGITEDDIIEVRQDISSLRFE 776
Cdd:TIGR00870 680 eeVNWTTWERKAETLiedglhYQRVMKRLIKRYVLAEQRPRDDEGTTEEETKELKQDISSLRFE 743
|
|
| TRP_2 |
pfam08344 |
Transient receptor ion channel II; This domain is found in the transient receptor ion channel ... |
178-236 |
3.60e-20 |
|
Transient receptor ion channel II; This domain is found in the transient receptor ion channel (Trp) family of proteins. There is strong evidence that Trp proteins are structural elements of calcium-ion entry channels activated by G protein-coupled receptors. This domain does not tend to appear with the TRP domain (pfam06011) but is often found to the C-terminus of Ankyrin repeats (pfam00023).
Pssm-ID: 462438 Cd Length: 60 Bit Score: 85.33 E-value: 3.60e-20
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 178 CGCDECVTSQMTDSLRHSQSRINAYRALSASSLIALSSRDPVLTAFQLSWELKRLQAME 236
Cdd:pfam08344 1 CGCDECKAERERDSLRHSLSRLNAYRALASPAYISLTSEDPILTAFELSWELRRLAFVE 59
|
|
| TRP_CaM_bind1 |
cd23650 |
Calmodulin binding site 1 (CBS1) found in Drosophila melanogaster transient receptor potential ... |
797-860 |
6.83e-19 |
|
Calmodulin binding site 1 (CBS1) found in Drosophila melanogaster transient receptor potential protein (TRP) and similar proteins; TRP is a light-sensitive calcium channel that is required for inositide-mediated Ca(2+) entry in the retina during phospholipase C (PLC)-mediated phototransduction. TRP and another retinal-specific protein, TRPL, act together in the light response, and are activated by fatty acids and metabolic stress. TRP is also required for olfactory adaptation and may be involved in olfactory system development. The TRP tail contains two calmodulin (CaM) binding sites (called CBS1 and CBS2), which binds to the CaM N-lobe and C-lobe, respectively. The model corresponds to CaM binding site 1.
Pssm-ID: 467881 Cd Length: 66 Bit Score: 81.61 E-value: 6.83e-19
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 797 GVARTTKGKVMERRILKDFQIGFVENLKQEMSES---ESGRDIFSSLAKVIGRKKtQKGDKDWNAIA 860
Cdd:cd23650 1 QGAVGKKGRQMERRLMKGFNIGLVEGLIKSVFESddkKSPKDRFKRLARLIGRKK-KKKKKDWNALV 66
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
36-168 |
1.77e-12 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 69.60 E-value: 1.77e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 36 FILSCERGDLPGVKKILEeyQGTDkfnINCTDPMNRSALISAIENENFDLMVILLEHNIEV------GDALLH-AISEEY 108
Cdd:COG0666 91 LHAAARNGDLEIVKLLLE--AGAD---VNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVnaqdndGNTPLHlAAANGN 165
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 109 VEAVEELLQweetnhKEGQPyswEAVDRSkstftvDITPLILAAHRNNYEILKILLDRGA 168
Cdd:COG0666 166 LEIVKLLLE------AGADV---NARDND------GETPLHLAAENGHLEIVKLLLEAGA 210
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1156 |
3.41e-08 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 57.86 E-value: 3.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPTPGAGAP--KPQAAGtiSKPGESQKKDAPAPPTKPgdtKPAAPKPGESAKPEaAAKKEE 1092
Cdd:NF033839 337 EKPKPEVKPQLETPKP-EVKPQPEKPKPevKPQPEK--PKPEVKPQPETPKPEVKP---QPEKPKPEVKPQPE-KPKPEV 409
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1093 SSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPdEKKSGPEEPK 1156
Cdd:NF033839 410 KPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP-EVKPQPEKPK 472
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1161 |
2.41e-07 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 55.16 E-value: 2.41e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPG------DTKPAAPKPGESAKPEAAA 1088
Cdd:NF033839 370 EKPKPEVKPQPETPKP-EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpekpkpEVKPQPEKPKPEVKPQPEK 448
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1089 KKEEsSKTEASKPaatngaaksaapsaPSDAKPD-SKLKPGAAGAPEATKATNGASKPDEKKsgPEEPKKAAGD 1161
Cdd:NF033839 449 PKPE-VKPQPETP--------------KPEVKPQpEKPKPEVKPQPEKPKPDNSKPQADDKK--PSTPNNLSKD 505
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1023-1162 |
1.57e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 52.54 E-value: 1.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1023 PSAGGPKPGDQKPTP--GAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPK--------PGESAKPEAAAKKEE 1092
Cdd:PRK07003 362 VTGGGAPGGGVPARVagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAaatraeapPAAPAPPATADRGDD 441
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1093 SSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEEPKKAAGDS 1162
Cdd:PRK07003 442 AADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDAR 511
|
|
| CEND1 |
pfam15677 |
Cell cycle exit and neuronal differentiation protein 1; This family of neuron-specific ... |
1055-1154 |
4.04e-06 |
|
Cell cycle exit and neuronal differentiation protein 1; This family of neuron-specific proteins may have a role in the differentiation of neuroblastoma cells and neuronal precursors. It is involved in development of the cerebellum.
Pssm-ID: 374010 [Multi-domain] Cd Length: 143 Bit Score: 47.67 E-value: 4.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1055 ESQKKDA--PAPPTK-PGDTKPAAPKPGESAKPEaaaKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAg 1131
Cdd:pfam15677 2 ESKGKSAsgPKPDAKaEARGPPAKDGKAPSAKAD---KKETPEKQEQPAAPTTAPAKKTPAKADPALLNNHSNLKPAPA- 77
|
90 100
....*....|....*....|...
gi 17136554 1132 APEATKATNGASKPDEKKSGPEE 1154
Cdd:pfam15677 78 APEAPSSPDATPEPKGPGDGAEE 100
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1156 |
5.63e-06 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 50.54 E-value: 5.63e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPtpgaGAPKPQAagTISKPGESQKKDAPAPPTKPG---DTKPAAPKPG-----ESAKPEA 1086
Cdd:NF033839 304 QPEKKEVKPEPETPKP-EVKP----QLEKPKP--EVKPQPEKPKPEVKPQLETPKpevKPQPEKPKPEvkpqpEKPKPEV 376
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1087 AAKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPD-SKLKPGAAGAPEATK--ATNGASKPD-EKKSGPEEPK 1156
Cdd:NF033839 377 KPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpEKPKPEVKPQPEKPKpeVKPQPEKPKpEVKPQPEKPK 450
|
|
| TRPV1 |
cd22196 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 1; Vanilloid receptor 1 ... |
602-681 |
5.17e-05 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 1; Vanilloid receptor 1 (TRPV1), a capsaicin (vanilloid) receptor, is the founding member of the vanilloid TRP subfamily (TRPV). In humans, it is expressed in the brain, kidney, pancreas, testis, uterus, spleen, stomach, small intestine, lung and liver. TRPV1 has been implicated to have function in thermo-sensation (heat), autonomic thermoregulation, nociception, food intake regulation, and multiple functions in the gastrointestinal (GI) tract. The receptor has also been involved in growth cone guidance, long-term depression, endocannabinoid signaling and osmosensing in the central nervous system. TRPV1 is up regulated in several human pathological conditions including vulvodynia, GI inflammation, Crohn's disease and ulcerative colitis. TRPV1 knock-out mice exhibit impaired sensation to thermal-mechanical acute pain. The structure of TRPV shows the typical topology features of all Transient Receptor Potential (TRP) ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411980 [Multi-domain] Cd Length: 649 Bit Score: 47.49 E-value: 5.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 602 FSNLFETSQSLFWASFGLVDLVSFDLAGIKSFtrFWALLMfgSYSVINIIVLLNMLIAMMSNSYQIISERADTEWKFARS 681
Cdd:cd22196 516 YNSLYSTCLELFKFTIGMGDLEFTENYKFKEV--FIFLLI--SYVILTYILLLNMLIALMGETVSKIAQESKNIWKLQRA 591
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1016-1176 |
4.16e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.76 E-value: 4.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1016 QPPKDSKPSAggPKPGdQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPgdtKPAAPKPGESAKPEAAaKKEESSK 1095
Cdd:NF033839 285 KEPGNKKPSA--PKPG-MQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKP---QPEKPKPEVKPQLETP-KPEVKPQ 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1096 TEASKPAATNGAAKSAAPSAPSDAKPDSKLKPgaagAPEATKATngaSKPDEKKSGPE---EPKKAAGDSKPGDDAKDKD 1172
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKP----QPEKPKPE---VKPQPEKPKPEvkpQPEKPKPEVKPQPEKPKPE 430
|
....
gi 17136554 1173 KKPG 1176
Cdd:NF033839 431 VKPQ 434
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
1040-1160 |
1.10e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 41.21 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1040 GAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPgeSAKPEAAAKkeesSKTEASKPAATNGAAKSAAPSAPSDA 1119
Cdd:cd21975 21 VRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD--SPGLVTAAP----HLLAANVLAPLRGPSVEGSSLESGDA 94
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 17136554 1120 K--PDSKLKPgAAGAPEATKATNGASKPDEKKSGPEEPKKAAG 1160
Cdd:cd21975 95 DmgSDSDVAP-ASGAAASTSPESSSDAASSPSPLSLLHPGEAG 136
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
938-1161 |
1.46e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 42.74 E-value: 1.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 938 ADDEGAPNGEGEKKPLDASGSKKSITSGGTGGGASMLAAAALRASVKNVDEKSGaDGKPGTMGKPTDDKKAGDDKDKQQP 1017
Cdd:COG5180 113 AKTPELAAGALPAPAAAAALPKAKVTREATSASAGVALAAALLQRSDPILAKDP-DGDSASTLPPPAEKLDKVLTEPRDA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1018 PKDSKPSAGGPKPGDQKP--------TPGAGAPKPQAA--GTISKPGESQKKDAPAP------PTKPGDTKPAAPKPGES 1081
Cdd:COG5180 192 LKDSPEKLDRPKVEVKDEaqeeppdlTGGADHPRPEAAssPKVDPPSTSEARSRPATvdaqpeMRPPADAKERRRAAIGD 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1082 AKP------EAAAKKEESSKTEASKPAATNGAAKSAAPSAPSD---AKPDSKLKPGAAGAPEATKATNGASKPDEKKSGP 1152
Cdd:COG5180 272 TPAaeppglPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATrpvRPPGGARDPGTPRPGQPTERPAGVPEAASDAGQP 351
|
....*....
gi 17136554 1153 EEPKKAAGD 1161
Cdd:COG5180 352 PSAYPPAEE 360
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
803-1158 |
1.59e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 43.21 E-value: 1.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 803 KGKVMERRILKDFQIGFVENLKQEMSESESGRDIFSSLAKVIGRK--KTQKGDKD-WNAIARKNTFASDPIGSKRSSMQR 879
Cdd:PTZ00121 1097 FGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKaeEARKAEDAkRVEIARKAEDARKAEEARKAEDAK 1176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 880 HSQRSLRRKIIEQANEGLQMNQTQLIEFNPNLGDVTRATRVAYVKFMRKKMAADEVSLA--DDEGAPNGEGEKKPLDASG 957
Cdd:PTZ00121 1177 KAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAkkDAEEAKKAEEERNNEEIRK 1256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 958 SKKSITSGGTGGGASMLAAAALRA-SVKNVDEKSGAD--------GKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGP 1028
Cdd:PTZ00121 1257 FEEARMAHFARRQAAIKAEEARKAdELKKAEEKKKADeakkaeekKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1029 KPGDQKPTPGAGAPKPQAAgtiSKPGESQKKDAPAPPTKPGDTKPAA----PKPGESAKPEAAAKKEESSKTEAS--KPA 1102
Cdd:PTZ00121 1337 KAEEAKKAAEAAKAEAEAA---ADEAEAAEEKAEAAEKKKEEAKKKAdaakKKAEEKKKADEAKKKAEEDKKKADelKKA 1413
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1103 ATNGAAKSAAPSAPSDAKPDSKLKpgaAGAPEATKATNGASKPDEKKSGPEEPKKA 1158
Cdd:PTZ00121 1414 AAAKKKADEAKKKAEEKKKADEAK---KKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466
|
|
| ANK |
smart00248 |
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ... |
146-170 |
1.80e-03 |
|
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.
Pssm-ID: 197603 [Multi-domain] Cd Length: 30 Bit Score: 36.80 E-value: 1.80e-03
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
1043-1105 |
1.83e-03 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 42.19 E-value: 1.83e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 17136554 1043 KPQAAGTISKPGESQKKDAPAPPTKPgdtkPAAPKPGESAKPEAAAKKEESSKTEASKPAATN 1105
Cdd:TIGR00601 76 KPKTGTGKVAPPAATPTSAPTPTPSP----PASPASGMSAAPASAVEEKSPSEESATATAPES 134
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
891-1102 |
4.76e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 4.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 891 EQAneGLQMNQTQLIEFNPNL--GDVTRATRVAYVKFMRKKMAADEVSLADDEGAPNGEGEKKPLDASGSKKSITSGGTG 968
Cdd:PRK12323 349 EYA--GFTMTLLRMLAFRPGQsgGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARR 426
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 969 GGASMLAAAALRASVKNVDEKSG----------ADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKP--- 1035
Cdd:PRK12323 427 SPAPEALAAARQASARGPGGAPApapapaaapaAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPpef 506
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 1036 -----TPGAGAPKPQAAGTISKPGESQKKDA-----PAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPA 1102
Cdd:PRK12323 507 aspapAQPDAAPAGWVAESIPDPATADPDDAfetlaPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| trp |
TIGR00870 |
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ... |
16-776 |
0e+00 |
|
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273311 [Multi-domain] Cd Length: 743 Bit Score: 942.59 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 16 RLDYDLMMAEEYILSDVEKNFILSCERGDLPGVKKILEEYQgtdKFNINCTDPMNRSALIS-AIENENFDLMVILLEHN- 93
Cdd:TIGR00870 1 RGPLDIVPAEESPLSDEEKAFLPAAERGDLASVYRDLEEPK---KLNINCPDRLGRSALFVaAIENENLELTELLLNLSc 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 94 -IEVGDALLHAISEEYVEAVEELLQWEETNHKEGQPYsWEAVDRSKSTFTVDITPLILAAHRNNYEILKILLDRGATLPm 172
Cdd:TIGR00870 78 rGAVGDTLLHAISLEYVDAVEAILLHLLAAFRKSGPL-ELANDQYTSEFTPGITALHLAAHRQNYEIVKLLLERGASVP- 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 173 phdVKCGCDECVTSQMTDSLRHSQSRINAYRALSASSLIALSSRDP--VLTAFQLSWELKRLQAMESEFRAEYTEMRQMV 250
Cdd:TIGR00870 156 ---ARACGDFFVKSQGVDSFYHGESPLNAAACLGSPSIVALLSEDPadILTADSLGNTLLHLLVMENEFKAEYEELSCQM 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 251 QDFGTSLLDHARTSMELEVMLNFNHEPSHDIWCLGQRQTLERLKLAIRYKQKTFVAHPNVQQLLAAIWYDGLPGFRRKQA 330
Cdd:TIGR00870 233 YNFALSLLDKLRDSKELEVILNHQGLTPLKLAAKEGRIVLFRLKLAIKYKQKKFVAWPNGQQLLSLYWLEELDGWRRKQS 312
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 331 SQQLMDVVKLGCSFPIYSLKYILAPDSEGAKFMRKPFVKFITHSCSYMFFLMLLGAASLRVVQITFELLafpwmltmled 410
Cdd:TIGR00870 313 VLELIVVFVIGLKFPELSDMYLIAPLSRLGQFKWKPFIKFIFHSASYLYFLYLIIFTSVAYYRPTRTDL----------- 381
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 411 wrkHERGSLPGPIELAIITYIMALIFEELKSLYSDGLFEYIMDLWNIVDYISNMFYVTWILCRATAWVIVHRdlwfrgiD 490
Cdd:TIGR00870 382 ---RVTGLQQTPLEMLIVTWVDGLRLGEEKLIWLGGIFEYIHQLWNILDFGMNSFYLATFLDRPFAILFVTQ-------A 451
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 491 PYFPREHWHPFDPMLLSEGAFAAGMVFSYLKLVHIFSINPHLGPLQVSLGRMIIDIIKF-FFIYTLVLFAFGCGLNQLLW 569
Cdd:TIGR00870 452 FLVLREHWLRFDPTLIEEALFAFALVLSWLNLLYIFRGNQHLGPLQIMIGRMILGDILRfLFIYAVVLFGFACGLNQLYQ 531
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 570 YYAELEKNKCYHLHPdvadfddqEKACTIWRRFSNLFETSQSLFWASFGLVDLvsfdLAGIKSFTRFWALLMFGSYSVIN 649
Cdd:TIGR00870 532 YYDELKLNECSNPHA--------RSCEKQGNAYSTLFETSQELFWAIIGLGDL----LANEHKFTEFVGLLLFGAYNVIM 599
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 650 IIVLLNMLIAMMSNSYQIISERADTEWKFARSQLWMSYFEDGGTIPPPFNLCPN-------MKMLRKTLGRKRPSRTK-- 720
Cdd:TIGR00870 600 YILLLNMLIAMMGNTYQLIADDADEEWKFQRAKLWMSYEREGGTCPPPFNIIPGpksfvglFKRIEKHDGKKRQRWCRrv 679
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 721 --SFMRKSMERAQTL------HDKVMKLLVRRYITAEQRRRDDYGITEDDIIEVRQDISSLRFE 776
Cdd:TIGR00870 680 eeVNWTTWERKAETLiedglhYQRVMKRLIKRYVLAEQRPRDDEGTTEEETKELKQDISSLRFE 743
|
|
| TRP_2 |
pfam08344 |
Transient receptor ion channel II; This domain is found in the transient receptor ion channel ... |
178-236 |
3.60e-20 |
|
Transient receptor ion channel II; This domain is found in the transient receptor ion channel (Trp) family of proteins. There is strong evidence that Trp proteins are structural elements of calcium-ion entry channels activated by G protein-coupled receptors. This domain does not tend to appear with the TRP domain (pfam06011) but is often found to the C-terminus of Ankyrin repeats (pfam00023).
Pssm-ID: 462438 Cd Length: 60 Bit Score: 85.33 E-value: 3.60e-20
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 178 CGCDECVTSQMTDSLRHSQSRINAYRALSASSLIALSSRDPVLTAFQLSWELKRLQAME 236
Cdd:pfam08344 1 CGCDECKAERERDSLRHSLSRLNAYRALASPAYISLTSEDPILTAFELSWELRRLAFVE 59
|
|
| Ion_trans |
pfam00520 |
Ion transport protein; This family contains sodium, potassium and calcium ion channels. This ... |
394-673 |
5.96e-19 |
|
Ion transport protein; This family contains sodium, potassium and calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane.
Pssm-ID: 459842 [Multi-domain] Cd Length: 238 Bit Score: 87.71 E-value: 5.96e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 394 ITFELLAFPWMLTMleDWRKHERGSLPGPIELAIITYIMALIFEELKSLYSDGL-FEYIMDLWNIVDYISNMFYVTWILC 472
Cdd:pfam00520 8 ILLLILLNTIFLAL--ETYFQPEEPLTTVLEILDYVFTGIFTLEMLLKIIAAGFkKRYFRSPWNILDFVVVLPSLISLVL 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 473 ratawvivhrdlwfrgidpyfprehwhpfdpmlLSEGAFAAGMVFSYLKLVHIFSINPHLGPLQV---SLGRMIIDIIKF 549
Cdd:pfam00520 86 ---------------------------------SSVGSLSGLRVLRLLRLLRLLRLIRRLEGLRTlvnSLIRSLKSLGNL 132
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 550 FFIYTLVLFAFGCGLNQLLWYYAELEKNkcyhlhPDVADFDdqekactiwrrFSNLFETSQSLFWASF--GLVDLVSfdl 627
Cdd:pfam00520 133 LLLLLLFLFIFAIIGYQLFGGKLKTWEN------PDNGRTN-----------FDNFPNAFLWLFQTMTteGWGDIMY--- 192
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 17136554 628 AGIKSFTRFWALLMFGSYSVINIIVLLNMLIAMMSNSYQIISERAD 673
Cdd:pfam00520 193 DTIDGKGEFWAYIYFVSFIILGGFLLLNLFIAVIIDNFQELTERTE 238
|
|
| TRP_CaM_bind1 |
cd23650 |
Calmodulin binding site 1 (CBS1) found in Drosophila melanogaster transient receptor potential ... |
797-860 |
6.83e-19 |
|
Calmodulin binding site 1 (CBS1) found in Drosophila melanogaster transient receptor potential protein (TRP) and similar proteins; TRP is a light-sensitive calcium channel that is required for inositide-mediated Ca(2+) entry in the retina during phospholipase C (PLC)-mediated phototransduction. TRP and another retinal-specific protein, TRPL, act together in the light response, and are activated by fatty acids and metabolic stress. TRP is also required for olfactory adaptation and may be involved in olfactory system development. The TRP tail contains two calmodulin (CaM) binding sites (called CBS1 and CBS2), which binds to the CaM N-lobe and C-lobe, respectively. The model corresponds to CaM binding site 1.
Pssm-ID: 467881 Cd Length: 66 Bit Score: 81.61 E-value: 6.83e-19
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 797 GVARTTKGKVMERRILKDFQIGFVENLKQEMSES---ESGRDIFSSLAKVIGRKKtQKGDKDWNAIA 860
Cdd:cd23650 1 QGAVGKKGRQMERRLMKGFNIGLVEGLIKSVFESddkKSPKDRFKRLARLIGRKK-KKKKKDWNALV 66
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
36-168 |
1.77e-12 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 69.60 E-value: 1.77e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 36 FILSCERGDLPGVKKILEeyQGTDkfnINCTDPMNRSALISAIENENFDLMVILLEHNIEV------GDALLH-AISEEY 108
Cdd:COG0666 91 LHAAARNGDLEIVKLLLE--AGAD---VNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVnaqdndGNTPLHlAAANGN 165
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 109 VEAVEELLQweetnhKEGQPyswEAVDRSkstftvDITPLILAAHRNNYEILKILLDRGA 168
Cdd:COG0666 166 LEIVKLLLE------AGADV---NARDND------GETPLHLAAENGHLEIVKLLLEAGA 210
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
36-117 |
2.05e-08 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 52.81 E-value: 2.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 36 FILSCERGDLPGVKKILEEyqgtdKFNINCTDPMNRSALISAIENENFDLMVILLEH-NIEVGD----ALLHAISEEYVE 110
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLEN-----GADANLQDKNGRTALHLAAKNGHLEIVKLLLEHaDVNLKDngrtALHYAARSGHLE 75
|
....*..
gi 17136554 111 AVEELLQ 117
Cdd:pfam12796 76 IVKLLLE 82
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
37-168 |
3.08e-08 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 56.50 E-value: 3.08e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 37 ILSCERGDLPGVKKILEeyQGTDkfnINCTDPMNRSALISAIENENFDLMVILLEH--NIEVGD-----ALLHAISEEYV 109
Cdd:COG0666 125 HLAAYNGNLEIVKLLLE--AGAD---VNAQDNDGNTPLHLAAANGNLEIVKLLLEAgaDVNARDndgetPLHLAAENGHL 199
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 110 EAVEELLqweetnhKEGqpysweAVDRSKSTFtvDITPLILAAHRNNYEILKILLDRGA 168
Cdd:COG0666 200 EIVKLLL-------EAG------ADVNAKDND--GKTALDLAAENGNLEIVKLLLEAGA 243
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
74-168 |
3.30e-08 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 52.43 E-value: 3.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 74 LISAIENENFDLMVILLEHNIEVGD-------ALLHAISEEYVEAVEELLqweETNHKEGQPYSWeavdrskstftvdiT 146
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLqdkngrtALHLAAKNGHLEIVKLLL---EHADVNLKDNGR--------------T 63
|
90 100
....*....|....*....|..
gi 17136554 147 PLILAAHRNNYEILKILLDRGA 168
Cdd:pfam12796 64 ALHYAARSGHLEIVKLLLEKGA 85
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1156 |
3.41e-08 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 57.86 E-value: 3.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPTPGAGAP--KPQAAGtiSKPGESQKKDAPAPPTKPgdtKPAAPKPGESAKPEaAAKKEE 1092
Cdd:NF033839 337 EKPKPEVKPQLETPKP-EVKPQPEKPKPevKPQPEK--PKPEVKPQPETPKPEVKP---QPEKPKPEVKPQPE-KPKPEV 409
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1093 SSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPdEKKSGPEEPK 1156
Cdd:NF033839 410 KPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP-EVKPQPEKPK 472
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1161 |
2.41e-07 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 55.16 E-value: 2.41e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPG------DTKPAAPKPGESAKPEAAA 1088
Cdd:NF033839 370 EKPKPEVKPQPETPKP-EVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpekpkpEVKPQPEKPKPEVKPQPEK 448
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1089 KKEEsSKTEASKPaatngaaksaapsaPSDAKPD-SKLKPGAAGAPEATKATNGASKPDEKKsgPEEPKKAAGD 1161
Cdd:NF033839 449 PKPE-VKPQPETP--------------KPEVKPQpEKPKPEVKPQPEKPKPDNSKPQADDKK--PSTPNNLSKD 505
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1023-1162 |
1.57e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 52.54 E-value: 1.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1023 PSAGGPKPGDQKPTP--GAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPK--------PGESAKPEAAAKKEE 1092
Cdd:PRK07003 362 VTGGGAPGGGVPARVagAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAaatraeapPAAPAPPATADRGDD 441
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1093 SSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEEPKKAAGDS 1162
Cdd:PRK07003 442 AADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDAR 511
|
|
| PHA03169 |
PHA03169 |
hypothetical protein; Provisional |
1017-1161 |
1.97e-06 |
|
hypothetical protein; Provisional
Pssm-ID: 223003 [Multi-domain] Cd Length: 413 Bit Score: 51.90 E-value: 1.97e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGPKPGDQKPTP-GAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSK 1095
Cdd:PHA03169 92 PSGSGSESVGSPTPSPSGSAEeLASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSH 171
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1096 TEASKPAAtngaaksaapsapsdaKPDSKLKPGAAGAPEATKATNGaskPDEkKSGPEEPKKAAGD 1161
Cdd:PHA03169 172 EDSPEEPE----------------PPTSEPEPDSPGPPQSETPTSS---PPP-QSPPDEPGEPQSP 217
|
|
| CEND1 |
pfam15677 |
Cell cycle exit and neuronal differentiation protein 1; This family of neuron-specific ... |
1055-1154 |
4.04e-06 |
|
Cell cycle exit and neuronal differentiation protein 1; This family of neuron-specific proteins may have a role in the differentiation of neuroblastoma cells and neuronal precursors. It is involved in development of the cerebellum.
Pssm-ID: 374010 [Multi-domain] Cd Length: 143 Bit Score: 47.67 E-value: 4.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1055 ESQKKDA--PAPPTK-PGDTKPAAPKPGESAKPEaaaKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAg 1131
Cdd:pfam15677 2 ESKGKSAsgPKPDAKaEARGPPAKDGKAPSAKAD---KKETPEKQEQPAAPTTAPAKKTPAKADPALLNNHSNLKPAPA- 77
|
90 100
....*....|....*....|...
gi 17136554 1132 APEATKATNGASKPDEKKSGPEE 1154
Cdd:pfam15677 78 APEAPSSPDATPEPKGPGDGAEE 100
|
|
| PRK12373 |
PRK12373 |
NADH-quinone oxidoreductase subunit E; |
1039-1160 |
5.21e-06 |
|
NADH-quinone oxidoreductase subunit E;
Pssm-ID: 237082 [Multi-domain] Cd Length: 400 Bit Score: 50.57 E-value: 5.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1039 AGAPKPQAAGTISKPGESQKKD-APAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPAATNGAAKSAAPSAPS 1117
Cdd:PRK12373 216 IGDTVKRIDGTEVPLLAPWQGDaAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKEA 295
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 17136554 1118 DAKPDSKLKPGAAGAPEATKAtngaskpdEKKSGPEEPKKAAG 1160
Cdd:PRK12373 296 AKAAAAAAKPALEDKPRPLGI--------ARPGGADDLKLISG 330
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1015-1156 |
5.63e-06 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 50.54 E-value: 5.63e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgDQKPtpgaGAPKPQAagTISKPGESQKKDAPAPPTKPG---DTKPAAPKPG-----ESAKPEA 1086
Cdd:NF033839 304 QPEKKEVKPEPETPKP-EVKP----QLEKPKP--EVKPQPEKPKPEVKPQLETPKpevKPQPEKPKPEvkpqpEKPKPEV 376
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1087 AAKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPD-SKLKPGAAGAPEATK--ATNGASKPD-EKKSGPEEPK 1156
Cdd:NF033839 377 KPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQpEKPKPEVKPQPEKPKpeVKPQPEKPKpEVKPQPEKPK 450
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1015-1155 |
5.83e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.09 E-value: 5.83e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSA---GGPKP-------GDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPK--PGESA 1082
Cdd:PHA03247 2704 PPPTPEPAPHAlvsATPLPpgpaaarQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAagPPRRL 2783
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 17136554 1083 KPEAAAKKEES--SKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEEP 1155
Cdd:PHA03247 2784 TRPAVASLSESreSLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
62-168 |
1.16e-05 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 48.80 E-value: 1.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 62 NINCTDPMNRSALISAIENENFDLMVILLEHNIEV-------GDALLHAISEEYVEAVEELLQweetnhkegQPYSWEAV 134
Cdd:COG0666 46 ALALADALGALLLLAAALAGDLLVALLLLAAGADInakddggNTLLHAAARNGDLEIVKLLLE---------AGADVNAR 116
|
90 100 110
....*....|....*....|....*....|....
gi 17136554 135 DRSkstftvDITPLILAAHRNNYEILKILLDRGA 168
Cdd:COG0666 117 DKD------GETPLHLAAYNGNLEIVKLLLEAGA 144
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
1016-1156 |
1.34e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 49.69 E-value: 1.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1016 QPPKDSKPSAGGPK-PGDQK-------PTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPG-----DTKPAAPKPGESA 1082
Cdd:PTZ00449 509 EPPEGPEASGLPPKaPGDKEgeegeheDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSkiptlSKKPEFPKDPKHP 588
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1083 KPEAAAKKEESSKTeASKPAATNG----------AAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNgASKPDEKKSGP 1152
Cdd:PTZ00449 589 KDPEEPKKPKRPRS-AQRPTRPKSpklpelldipKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK-SPKPPKSPKPP 666
|
....
gi 17136554 1153 EEPK 1156
Cdd:PTZ00449 667 FDPK 670
|
|
| Pro-rich |
pfam15240 |
Proline-rich protein; This family includes several eukaryotic proline-rich proteins. |
994-1084 |
2.01e-05 |
|
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
Pssm-ID: 464580 [Multi-domain] Cd Length: 167 Bit Score: 46.18 E-value: 2.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 994 GKPGTMGKPTDDKKAGDDKDKQQPPKDskpsaGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPaPPTKPGDTKP 1073
Cdd:pfam15240 81 GKQKPQGPPPQGGPRPPPGKPQGPPPQ-----GGNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPP-PPGNPQGPPQ 154
|
90
....*....|.
gi 17136554 1074 AAPKPGESAKP 1084
Cdd:pfam15240 155 RPPQPGNPQGP 165
|
|
| PRK12373 |
PRK12373 |
NADH-quinone oxidoreductase subunit E; |
1028-1104 |
3.06e-05 |
|
NADH-quinone oxidoreductase subunit E;
Pssm-ID: 237082 [Multi-domain] Cd Length: 400 Bit Score: 47.87 E-value: 3.06e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 1028 PKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGeSAKPEAAAKKEESSKTEASKPAAT 1104
Cdd:PRK12373 233 PWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSG-TAAAEPAPKEAAKAAAAAAKPALE 308
|
|
| TRPV1 |
cd22196 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 1; Vanilloid receptor 1 ... |
602-681 |
5.17e-05 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 1; Vanilloid receptor 1 (TRPV1), a capsaicin (vanilloid) receptor, is the founding member of the vanilloid TRP subfamily (TRPV). In humans, it is expressed in the brain, kidney, pancreas, testis, uterus, spleen, stomach, small intestine, lung and liver. TRPV1 has been implicated to have function in thermo-sensation (heat), autonomic thermoregulation, nociception, food intake regulation, and multiple functions in the gastrointestinal (GI) tract. The receptor has also been involved in growth cone guidance, long-term depression, endocannabinoid signaling and osmosensing in the central nervous system. TRPV1 is up regulated in several human pathological conditions including vulvodynia, GI inflammation, Crohn's disease and ulcerative colitis. TRPV1 knock-out mice exhibit impaired sensation to thermal-mechanical acute pain. The structure of TRPV shows the typical topology features of all Transient Receptor Potential (TRP) ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411980 [Multi-domain] Cd Length: 649 Bit Score: 47.49 E-value: 5.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 602 FSNLFETSQSLFWASFGLVDLVSFDLAGIKSFtrFWALLMfgSYSVINIIVLLNMLIAMMSNSYQIISERADTEWKFARS 681
Cdd:cd22196 516 YNSLYSTCLELFKFTIGMGDLEFTENYKFKEV--FIFLLI--SYVILTYILLLNMLIALMGETVSKIAQESKNIWKLQRA 591
|
|
| PRK13808 |
PRK13808 |
adenylate kinase; Provisional |
981-1139 |
5.53e-05 |
|
adenylate kinase; Provisional
Pssm-ID: 172341 [Multi-domain] Cd Length: 333 Bit Score: 46.80 E-value: 5.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 981 ASVKNVDEKSGADGKPGTMGKPTDDKKAgddkdkqqPPKDSKPSAggpkpgdqKPTPGAGAPKPQAAGTISKPGESQKKD 1060
Cdd:PRK13808 192 AAVGAANAKKAAKTPAAKSGAKKASAKA--------KSAAKKVSK--------KKAAKTAVSAKKAAKTAAKAAKKAKKT 255
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 1061 APAPPTKPGDTKPAAPKpgesaKPEAAAKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKAT 1139
Cdd:PRK13808 256 AKKALKKAAKAVKKAAK-----KAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVT 329
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
931-1154 |
6.77e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 47.47 E-value: 6.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 931 AADEVSLADDEGAPNGEGEKKPLDASGSKKSITSGGTGGGASMlaaaalrasvknvdEKSGADGKPGTMGKPTddkkagd 1010
Cdd:PHA03307 58 GAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASP--------------AREGSPTPPGPSSPDP------- 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1011 dkdkqqPPKDSKPSAGGPKPGD--QKPTPGAGAPKPQAAGTiSKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAA 1088
Cdd:PHA03307 117 ------PPPTPPPASPPPSPAPdlSEMLRPVGSPGPPPAAS-PPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSP 189
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1089 KKEESSKTEASKPAATNGAAKSAAPSAPSDakPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEE 1154
Cdd:PHA03307 190 PAEPPPSTPPAAASPRPPRRSSPISASASS--PAPAPGRSAADDAGASSSDSSSSESSGCGWGPEN 253
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1022-1152 |
9.36e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1022 KPSAGGpkpGDQKPTPGAGAPKPQAAgtiskPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEA--- 1098
Cdd:PRK12323 364 RPGQSG---GGAGPATAAAAPVAQPA-----PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaa 435
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1099 SKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKSGP 1152
Cdd:PRK12323 436 ARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAP 489
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1015-1133 |
1.05e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.60 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPK--------PGDQKPTPGAG--APKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKP 1084
Cdd:PHA03378 668 QIGHIPYQPSPTGANtmlpiqwaPGTMQPPPRAPtpMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPP 747
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 17136554 1085 EAA---AKKEESSKTEASKPAATNGAAKSAAPsapSDAKPDSKLKPGAAGAP 1133
Cdd:PHA03378 748 AAApgrARPPAAAPGRARPPAAAPGAPTPQPP---PQAPPAPQQRPRGAPTP 796
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1021-1143 |
1.10e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 46.52 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1021 SKPSAGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASK 1100
Cdd:PRK07764 394 PAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA 473
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 17136554 1101 PAATngaaksaapSAPSDAKPDSKLKPGAAGAPEATKATNGAS 1143
Cdd:PRK07764 474 PEPT---------AAPAPAPPAAPAPAAAPAAPAAPAAPAGAD 507
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1018-1093 |
1.61e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 45.72 E-value: 1.61e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 1018 PKDSKPSAGGPKPGDQKPTPGAGAPKPQAagtiSKPGESQKKDAPAP---PTKPGDTKPAAPKPGESAKPEAAAKKEES 1093
Cdd:PHA03291 205 PATPRPTPRTTASPETTPTPSTTTSPPST----TIPAPSTTIAAPQAgttPEAEGTPAPPTPGGGEAPPANATPAPEAS 279
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
995-1161 |
1.76e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 46.21 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 995 KPGTMGKPTDDKKAGDDKDKQQPPK---DSKPSAGGPKPGdQKPTPGAG-APKPQAAGTISKPGESQKKDAPAPPTKPGD 1070
Cdd:PHA03378 675 QPSPTGANTMLPIQWAPGTMQPPPRaptPMRPPAAPPGRA-QRPAAATGrARPPAAAPGRARPPAAAPGRARPPAAAPGR 753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1071 TKPAAPKPGESAKPEAAAKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPDSK--LKPGAAGAPEATKATNGASKPDEK 1148
Cdd:PHA03378 754 ARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMqlMPRAAPGQQGPTKQILRQLLTGGV 833
|
170
....*....|....
gi 17136554 1149 KSG-PEEPKKAAGD 1161
Cdd:PHA03378 834 KRGrPSLKKPAALE 847
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
1015-1103 |
1.79e-04 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 46.04 E-value: 1.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESS 1094
Cdd:PRK12270 36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
|
....*....
gi 17136554 1095 KTEASKPAA 1103
Cdd:PRK12270 116 EVTPLRGAA 124
|
|
| PHA03419 |
PHA03419 |
E4 protein; Provisional |
1016-1085 |
1.88e-04 |
|
E4 protein; Provisional
Pssm-ID: 223079 [Multi-domain] Cd Length: 200 Bit Score: 44.17 E-value: 1.88e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1016 QPP------KDSKPSAGGPKPGDQKPTPGAGApKPQAAGtiSKPGESQKKDAPAPPTKPGDTKpAAPKPGESAKPE 1085
Cdd:PHA03419 87 QPPpggkkkEKKKKETEKPAQGGEKPDQGPEA-KGEGEG--HEPEDPPPEDTPPPPGGEGEVE-GGPSPGPGPGPL 158
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
988-1159 |
2.56e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 2.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 988 EKSGADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGP---------KPGDQKPTPgAGAPKPQAAGTISKPGESQK 1058
Cdd:PHA03247 2650 ERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvgsltslaDPPPPPPTP-EPAPHALVSATPLPPGPAAA 2728
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1059 KDA-------PAPPTKP-------GDTKPAAPkPGESAKPEAAAKKEESSkteASKPAATNGAAKSAAPsapsdAKPDSK 1124
Cdd:PHA03247 2729 RQAspalpaaPAPPAVPagpatpgGPARPARP-PTTAGPPAPAPPAAPAA---GPPRRLTRPAVASLSE-----SRESLP 2799
|
170 180 190
....*....|....*....|....*....|....*
gi 17136554 1125 LKPGAAGAPEATKATNGASKPDEKKSGPEEPKKAA 1159
Cdd:PHA03247 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1015-1160 |
2.87e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 2.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1015 QQPPKDSKPSAGGPKPgdQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPP--TKPGDTKPAAPKPGESAKPE------- 1085
Cdd:PHA03247 2611 PAPPSPLPPDTHAPDP--PPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrvSRPRRARRLGRAAQASSPPQrprrraa 2688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1086 --------AAAKKEESSKTEASKP-AATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKK--SGPEE 1154
Cdd:PHA03247 2689 rptvgsltSLADPPPPPPTPEPAPhALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPttAGPPA 2768
|
....*.
gi 17136554 1155 PKKAAG 1160
Cdd:PHA03247 2769 PAPPAA 2774
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
984-1143 |
3.17e-04 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 44.54 E-value: 3.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 984 KNVD-EKSGADGKPGTMGKPTDDKKAGDDKDKQQ----PPKDSKPSAGGPK-----------PGD------QKPTPG--- 1038
Cdd:PRK10905 36 KSIDlAGNATDQANGVQPAPGTTSAEQTAGNTQQdvslPPISSTPTQGQTPvatdgqqrvevQGDlnnaltQPQNQQqln 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1039 ---AGAPKPQAAGTISkP--GESQKKDAPAPPTKPgdtKPAAPKPG------ESAKPEAAAKKEESSKTEASK---PAAT 1104
Cdd:PRK10905 116 nvaVNSTLPTEPATVA-PvrNGNASRQTAKTQTAE---RPATTRPArkqaviEPKKPQATAKTEPKPVAQTPKrtePAAP 191
|
170 180 190
....*....|....*....|....*....|....*....
gi 17136554 1105 NGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGAS 1143
Cdd:PRK10905 192 VASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAG 230
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
991-1159 |
3.57e-04 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 44.68 E-value: 3.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 991 GADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPtPGAGAPKPQAAGTISKPGE-SQKKDAPAPPTKPG 1069
Cdd:pfam03546 139 GPSGKGANPAPPGKAGSAAPLVQVGKKEEDSESSSEESDSEGEAP-PAATQAKPSGKILQVRPASgPAKGAAPAPPQKAG 217
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1070 dtkPAAPK-PGESAKpEAAAKKEESSKTEASKPAATngaaksaapsAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEK 1148
Cdd:pfam03546 218 ---PVATQvKAERSK-EDSESSEESSDSEEEAPAAA----------TPAQAKPALKTPQTKASPRKGTPITPTSAKVPPV 283
|
170
....*....|.
gi 17136554 1149 KSGPEEPKKAA 1159
Cdd:pfam03546 284 RVGTPAPWKAG 294
|
|
| PTZ00144 |
PTZ00144 |
dihydrolipoamide succinyltransferase; Provisional |
1032-1101 |
3.94e-04 |
|
dihydrolipoamide succinyltransferase; Provisional
Pssm-ID: 240289 [Multi-domain] Cd Length: 418 Bit Score: 44.29 E-value: 3.94e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1032 DQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKP 1101
Cdd:PTZ00144 119 DTGGAPPAAAPAAAAAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKPPPTPVARADPR 188
|
|
| PRK13808 |
PRK13808 |
adenylate kinase; Provisional |
1038-1162 |
4.08e-04 |
|
adenylate kinase; Provisional
Pssm-ID: 172341 [Multi-domain] Cd Length: 333 Bit Score: 44.11 E-value: 4.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1038 GAGAPKPQAAGTISKPGeSQKKDAPAPPTKPGDTKPAAPKPGESAKPeaAAKKEESSKTEASKPAATNGAAksaapsaps 1117
Cdd:PRK13808 195 GAANAKKAAKTPAAKSG-AKKASAKAKSAAKKVSKKKAAKTAVSAKK--AAKTAAKAAKKAKKTAKKALKK--------- 262
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 17136554 1118 daKPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEEP---KKAAGDS 1162
Cdd:PRK13808 263 --AAKAVKKAAKKAAKAAAKAAKGAAKATKGKAKAKKKagkKAAAGSK 308
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1016-1176 |
4.16e-04 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 44.76 E-value: 4.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1016 QPPKDSKPSAggPKPGdQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPgdtKPAAPKPGESAKPEAAaKKEESSK 1095
Cdd:NF033839 285 KEPGNKKPSA--PKPG-MQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKP---QPEKPKPEVKPQLETP-KPEVKPQ 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1096 TEASKPAATNGAAKSAAPSAPSDAKPDSKLKPgaagAPEATKATngaSKPDEKKSGPE---EPKKAAGDSKPGDDAKDKD 1172
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKP----QPEKPKPE---VKPQPEKPKPEvkpQPEKPKPEVKPQPEKPKPE 430
|
....
gi 17136554 1173 KKPG 1176
Cdd:NF033839 431 VKPQ 434
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1017-1151 |
4.40e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 44.84 E-value: 4.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGPKPGDQKPTPGAGAPKPQAAGTISKPGesqKKDAPAPPTKPgDTKPAAPKPGESAKPEAAAKKEESSKT 1096
Cdd:PRK07003 406 AGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAA---DGDAPVPAKAN-ARASADSRCDERDAQPPADSGSASAPA 481
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 1097 EASKPAATNGAAKSAAPSAPSDAKP------------DSKLKPGAAGAPEATKATNGASKPDEKKSG 1151
Cdd:PRK07003 482 SDAPPDAAFEPAPRAAAPSAATPAAvpdarapaaasrEDAPAAAAPPAPEARPPTPAAAAPAARAGG 548
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
915-1156 |
5.05e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 44.68 E-value: 5.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 915 TRATRVAY-------VKFMRKKMAADEVSLADDEGAP------NGEGEKKPLDASGSKKSiTSGGTGGGASMLAAAALRA 981
Cdd:PTZ00449 474 TRISKIQFtqeikklIKKSKKKLAPIEEEDSDKHDEPpegpeaSGLPPKAPGDKEGEEGE-HEDSKESDEPKEGGKPGET 552
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 982 SVKNVDEKSGA--DGKPGTMgkPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTPGAGAPKPQAAG---TISKPGES 1056
Cdd:PTZ00449 553 KEGEVGKKPGPakEHKPSKI--PTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDipkSPKRPESP 630
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1057 QKKDAPAPPTKP-------GDTKPAAPKPGESAKPEAAAKKEE------SSKTEASKPAATNGAAKSAAPSAPSDAKPDS 1123
Cdd:PTZ00449 631 KSPKRPPPPQRPssperpeGPKIIKSPKPPKSPKPPFDPKFKEkfyddyLDAAAKSKETKTTVVLDESFESILKETLPET 710
|
250 260 270
....*....|....*....|....*....|...
gi 17136554 1124 KLKPGAAGAPEATKATNGASKPDEKKSGPEEPK 1156
Cdd:PTZ00449 711 PGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQ 743
|
|
| PRK14959 |
PRK14959 |
DNA polymerase III subunits gamma and tau; Provisional |
1023-1104 |
5.64e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 184923 [Multi-domain] Cd Length: 624 Bit Score: 44.29 E-value: 5.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1023 PSAGG----PKPGDQKPTpgAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPeAAAKKEESSKTEA 1098
Cdd:PRK14959 389 PASGGaatiPTPGTQGPQ--GTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRP-APRMPEASPVPGA 465
|
....*.
gi 17136554 1099 skPAAT 1104
Cdd:PRK14959 466 --PDSV 469
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
975-1166 |
6.07e-04 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 43.81 E-value: 6.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 975 AAAALRASVKNVDEKsgADGKPGTMGKPTDDKKAgddkdkqqppKDSKPSAGGPKPGDQKPTPGAGAPKPQAAGTISKPG 1054
Cdd:PRK13108 281 APGALRGSEYVVDEA--LEREPAELAAAAVASAA----------SAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAES 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1055 ESQKKDapapptkpgDTKPAAPKPGESAkpEAAAKKEESSKTEASKPAAtngaaksaapsapsdakpDSKLKPGAAGAPE 1134
Cdd:PRK13108 349 VVQVAD---------RDGESTPAVEETS--EADIEREQPGDLAGQAPAA------------------HQVDAEAASAAPE 399
|
170 180 190
....*....|....*....|....*....|....
gi 17136554 1135 ATkatngASKPDEKKSG--PEEPKKAAGDSKPGD 1166
Cdd:PRK13108 400 EP-----AALASEAHDEtePEVPEKAAPIPDPAK 428
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
943-1147 |
7.50e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 7.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 943 APNGEGEKKPLDASGSKKSITSGGTGGGASMLAAAALRASVKNVDEKSGADGKPGTMGKPTDDKKAGDDKDKQQPPKDSK 1022
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSL 2850
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1023 PSAGGPKPG---DQKPTPGAGAPKPQAAGTIskPGESQKKDAPAPPTKPGDTKPAAP-KPGESAKPEAAAKKEESSKTEA 1098
Cdd:PHA03247 2851 PLGGSVAPGgdvRRRPPSRSPAAKPAAPARP--PVRRLARPAVSRSTESFALPPDQPeRPPQPQAPPPPQPQPQPPPPPQ 2928
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 17136554 1099 SKPAatngaaksaapsAPSDAKPDSKLKP--GAAGAPEATKAT----NGASKPDE 1147
Cdd:PHA03247 2929 PQPP------------PPPPPRPQPPLAPttDPAGAGEPSGAVpqpwLGALVPGR 2971
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
995-1144 |
7.71e-04 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 43.52 E-value: 7.71e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 995 KPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTPGAGAPKP-------QAAGTISKpGESQKKDAPAPPTK 1067
Cdd:pfam03546 292 KAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAAAVGQAKSvgkglqgKAASAPTK-GPSGQGTAPVPPGK 370
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 17136554 1068 pgdTKPAAPKPGESAKPEAAAKKEES-SKTEASKPAATNGAAKSaapsapsdakPDSKLKPGAAGAPEATKATNGASK 1144
Cdd:pfam03546 371 ---TGPAVAQVKAEAQEDSESSEEESdSEEAAATPAQVKASGKT----------PQAKANPAPTKASSAKGAASAPGK 435
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
1023-1139 |
8.21e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 43.55 E-value: 8.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1023 PSAGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQkkdAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPA 1102
Cdd:PRK14951 384 PEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPP---AAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETV 460
|
90 100 110
....*....|....*....|....*....|....*..
gi 17136554 1103 ATngAAKSAAPSAPSDAKPDsklkPGAAGAPEATKAT 1139
Cdd:PRK14951 461 AI--PVRVAPEPAVASAAPA----PAAAPAAARLTPT 491
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
991-1155 |
9.37e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.44 E-value: 9.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 991 GADGKPGTMGKPTDDKKAGDDKDKQQPpkDSKPSAGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPtkPGD 1070
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARP--AAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPD--ASD 664
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1071 TKPAAPKPGESAKPEAAAkkeessktEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKS 1150
Cdd:PRK07764 665 GGDGWPAKAGGAAPAAPP--------PAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAAD 736
|
....*....
gi 17136554 1151 G----PEEP 1155
Cdd:PRK07764 737 DpvplPPEP 745
|
|
| motB |
PRK05996 |
MotB family protein; |
984-1098 |
1.01e-03 |
|
MotB family protein;
Pssm-ID: 235665 [Multi-domain] Cd Length: 423 Bit Score: 43.15 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 984 KNVDEKSGADGKPGTMGKptddkkagddkdkQQPPKDSKPSAGGPKPGDQKPTPGAGAPKPqaagtiSKPGESQKKDAPA 1063
Cdd:PRK05996 188 KQVEVTTAGDLLPPGQAR-------------EQAQGAKSATAAPATVPQAAPLPQAQPKKA------ATEEELIADAKKA 248
|
90 100 110
....*....|....*....|....*....|....*
gi 17136554 1064 PPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEA 1098
Cdd:PRK05996 249 ATGEPAANAAKAAKPEPMPDDQQKEAEQLQAAIAQ 283
|
|
| PRK11633 |
PRK11633 |
cell division protein DedD; Provisional |
1028-1103 |
1.07e-03 |
|
cell division protein DedD; Provisional
Pssm-ID: 236940 [Multi-domain] Cd Length: 226 Bit Score: 41.91 E-value: 1.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1028 PKPGDQ---------------KPTPGAG-------APKPQAAGTISKPGESQKKDAPaPPTKPGDTKPAaPKPGESAKPE 1085
Cdd:PRK11633 45 PKPGDRdepdmmpaatqalptQPPEGAAeavragdAAAPSLDPATVAPPNTPVEPEP-APVEPPKPKPV-EKPKPKPKPQ 122
|
90
....*....|....*...
gi 17136554 1086 AAAKKEESSKTEaSKPAA 1103
Cdd:PRK11633 123 QKVEAPPAPKPE-PKPVV 139
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
976-1151 |
1.09e-03 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 43.04 E-value: 1.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 976 AAALRASVKNVDEKSGADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTPgagapkpqaagtisKPGE 1055
Cdd:PRK13108 300 PAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTP--------------AVEE 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1056 SQKKDAPAPptKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPAATNGAAKSAAPSAPSDAKPDsklKPGAAGAPEA 1135
Cdd:PRK13108 366 TSEADIERE--QPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPAKPD---ELAVAGPGDD 440
|
170
....*....|....*.
gi 17136554 1136 TKATNGASKPDEKKSG 1151
Cdd:PRK13108 441 PAEPDGIRRQDDFSSR 456
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
1040-1160 |
1.10e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 41.21 E-value: 1.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1040 GAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPgeSAKPEAAAKkeesSKTEASKPAATNGAAKSAAPSAPSDA 1119
Cdd:cd21975 21 VRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGAD--SPGLVTAAP----HLLAANVLAPLRGPSVEGSSLESGDA 94
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 17136554 1120 K--PDSKLKPgAAGAPEATKATNGASKPDEKKSGPEEPKKAAG 1160
Cdd:cd21975 95 DmgSDSDVAP-ASGAAASTSPESSSDAASSPSPLSLLHPGEAG 136
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
938-1161 |
1.46e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 42.74 E-value: 1.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 938 ADDEGAPNGEGEKKPLDASGSKKSITSGGTGGGASMLAAAALRASVKNVDEKSGaDGKPGTMGKPTDDKKAGDDKDKQQP 1017
Cdd:COG5180 113 AKTPELAAGALPAPAAAAALPKAKVTREATSASAGVALAAALLQRSDPILAKDP-DGDSASTLPPPAEKLDKVLTEPRDA 191
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1018 PKDSKPSAGGPKPGDQKP--------TPGAGAPKPQAA--GTISKPGESQKKDAPAP------PTKPGDTKPAAPKPGES 1081
Cdd:COG5180 192 LKDSPEKLDRPKVEVKDEaqeeppdlTGGADHPRPEAAssPKVDPPSTSEARSRPATvdaqpeMRPPADAKERRRAAIGD 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1082 AKP------EAAAKKEESSKTEASKPAATNGAAKSAAPSAPSD---AKPDSKLKPGAAGAPEATKATNGASKPDEKKSGP 1152
Cdd:COG5180 272 TPAaeppglPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATrpvRPPGGARDPGTPRPGQPTERPAGVPEAASDAGQP 351
|
....*....
gi 17136554 1153 EEPKKAAGD 1161
Cdd:COG5180 352 PSAYPPAEE 360
|
|
| DedD |
COG3147 |
Cell division protein DedD (periplasmic protein involved in septation) [Cell cycle control, ... |
1046-1103 |
1.58e-03 |
|
Cell division protein DedD (periplasmic protein involved in septation) [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442381 [Multi-domain] Cd Length: 140 Bit Score: 40.14 E-value: 1.58e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 17136554 1046 AAGTISKPGESQKKDAPAPPTKPgdtKPAAPKPGESAKPEAAAKKEESSKTEASKPAA 1103
Cdd:COG3147 2 AEEAAAAPAAAAAPAAPAAAAAP---APAAAAAAAAPKPAAKPAAPKPAAAAAAAPAA 56
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
803-1158 |
1.59e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 43.21 E-value: 1.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 803 KGKVMERRILKDFQIGFVENLKQEMSESESGRDIFSSLAKVIGRK--KTQKGDKD-WNAIARKNTFASDPIGSKRSSMQR 879
Cdd:PTZ00121 1097 FGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKaeEARKAEDAkRVEIARKAEDARKAEEARKAEDAK 1176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 880 HSQRSLRRKIIEQANEGLQMNQTQLIEFNPNLGDVTRATRVAYVKFMRKKMAADEVSLA--DDEGAPNGEGEKKPLDASG 957
Cdd:PTZ00121 1177 KAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAkkDAEEAKKAEEERNNEEIRK 1256
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 958 SKKSITSGGTGGGASMLAAAALRA-SVKNVDEKSGAD--------GKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGP 1028
Cdd:PTZ00121 1257 FEEARMAHFARRQAAIKAEEARKAdELKKAEEKKKADeakkaeekKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKK 1336
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1029 KPGDQKPTPGAGAPKPQAAgtiSKPGESQKKDAPAPPTKPGDTKPAA----PKPGESAKPEAAAKKEESSKTEAS--KPA 1102
Cdd:PTZ00121 1337 KAEEAKKAAEAAKAEAEAA---ADEAEAAEEKAEAAEKKKEEAKKKAdaakKKAEEKKKADEAKKKAEEDKKKADelKKA 1413
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1103 ATNGAAKSAAPSAPSDAKPDSKLKpgaAGAPEATKATNGASKPDEKKSGPEEPKKA 1158
Cdd:PTZ00121 1414 AAAKKKADEAKKKAEEKKKADEAK---KKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466
|
|
| TRPV2 |
cd22197 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 2; TRPV2 is closely ... |
553-681 |
1.65e-03 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 2; TRPV2 is closely related to TRPV1, sharing high sequence identity (>50%), but TRPV2 shows a higher temperature threshold and sensitivity for activation than TRPV1. TRPV2 can be stimulated by ligands or lipids, and is involved in osmosensation and mechanosensation. TRPV2 is expressed in both neuronal and non-neuronal tissues, and it has been implicated in diverse physiological and pathophysiological processes, including cardiac-structure maintenance, innate immunity, and cancer. TRPV2 belongs to the vanilloid TRP subfamily (TRPV), named after the founding member vanilloid receptor 1 (TRPV1). The structure of TRPV shows the typical topology features of all Transient Receptor Potential (TRP) ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411981 [Multi-domain] Cd Length: 640 Bit Score: 42.53 E-value: 1.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 553 YTLVLFAFGCGLNQLLWYYAELEKNKCYHLHPDVADFDDQEKACTIWRrfsNLFETSQSLFWASFGLVDLVSFDLAGIKS 632
Cdd:cd22197 474 YLVFLFGFAVALVSLSREAPSPKAPEDNNSTVTEQPTVGQEEEPAPYR---SILDASLELFKFTIGMGELAFQEQLRFRG 550
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 17136554 633 FTRFWALLmfgsYSVINIIVLLNMLIAMMSNSYQIISERADTEWKFARS 681
Cdd:cd22197 551 VVLLLLLA----YVLLTYVLLLNMLIALMSETVNHVADNSWSIWKLQKA 595
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
887-1159 |
1.66e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 42.82 E-value: 1.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 887 RKIIEQANEGLQMNQTQLIEFNPNLGDVTRATRVAYVKfMRKKMAADEVSLADdegapngegEKKPLDASGSKKSITSGG 966
Cdd:PTZ00121 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIK-AEEARKADELKKAE---------EKKKADEAKKAEEKKKAD 1305
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 967 TGGGASMLAAAALRASVKNVDEKSGADG--KPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTPGAGAPKP 1044
Cdd:PTZ00121 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAakKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1045 QA-----AGTISKPGESQKKDAPAPPTKPGDTKPA--APKPGESAKP--EAAAKKEESSKTEASKPAATNGAAKSAAPSA 1115
Cdd:PTZ00121 1386 KAeekkkADEAKKKAEEDKKKADELKKAAAAKKKAdeAKKKAEEKKKadEAKKKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 17136554 1116 PSDAKPDSKLKpgaAGAPEATKATNGASKPDEKKSGPEEPKKAA 1159
Cdd:PTZ00121 1466 AEEAKKADEAK---KKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506
|
|
| ANK |
smart00248 |
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ... |
146-170 |
1.80e-03 |
|
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.
Pssm-ID: 197603 [Multi-domain] Cd Length: 30 Bit Score: 36.80 E-value: 1.80e-03
|
| PRK13808 |
PRK13808 |
adenylate kinase; Provisional |
1025-1150 |
1.82e-03 |
|
adenylate kinase; Provisional
Pssm-ID: 172341 [Multi-domain] Cd Length: 333 Bit Score: 42.18 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1025 AGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKP--EAAAKKEESSKTEASKPA 1102
Cdd:PRK13808 196 AANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTakKALKKAAKAVKKAAKKAA 275
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 17136554 1103 -----ATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKS 1150
Cdd:PRK13808 276 kaaakAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKV 328
|
|
| rad23 |
TIGR00601 |
UV excision repair protein Rad23; All proteins in this family for which functions are known ... |
1043-1105 |
1.83e-03 |
|
UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273167 [Multi-domain] Cd Length: 378 Bit Score: 42.19 E-value: 1.83e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 17136554 1043 KPQAAGTISKPGESQKKDAPAPPTKPgdtkPAAPKPGESAKPEAAAKKEESSKTEASKPAATN 1105
Cdd:TIGR00601 76 KPKTGTGKVAPPAATPTSAPTPTPSP----PASPASGMSAAPASAVEEKSPSEESATATAPES 134
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
979-1160 |
2.12e-03 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 41.86 E-value: 2.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 979 LRASVKNVDEKSGADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTPG-AGAPKPQAAGTISKPGESQ 1057
Cdd:PTZ00436 178 LRKREKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAkAAAAPAKAAAAPAKAAAPP 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1058 KKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKteASKPAATNGAAKSAAPSAPSDAKPDSklKPGAAGAPEATK 1137
Cdd:PTZ00436 258 AKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKA--AAAPAKAAAAPAKAAAAPAKAAAPPA--KAAAPPAKAATP 333
|
170 180
....*....|....*....|...
gi 17136554 1138 ATNGASKPDEKKSGPEEpKKAAG 1160
Cdd:PTZ00436 334 PAKAAAPPAKAAAAPVG-KKAGG 355
|
|
| PRK10905 |
PRK10905 |
cell division protein DamX; Validated |
1025-1132 |
2.29e-03 |
|
cell division protein DamX; Validated
Pssm-ID: 236792 [Multi-domain] Cd Length: 328 Bit Score: 41.85 E-value: 2.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1025 AGGPKPGDQKPTPGAGAPKPQAAGTISKPGESQkkdapAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPAAT 1104
Cdd:PRK10905 137 NASRQTAKTQTAERPATTRPARKQAVIEPKKPQ-----ATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETAT 211
|
90 100
....*....|....*....|....*...
gi 17136554 1105 NGAAKSAAPSAPSDAKPDSKLKPGAAGA 1132
Cdd:PRK10905 212 TAPVQTASPAQTTATPAAGGKTAGNVGS 239
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
940-1097 |
2.49e-03 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 41.69 E-value: 2.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 940 DEGAPNGEGEKKPLDaSGSKKSITSGGTGGGASMLAAAALRASVKNVDEKSgadgkpgtmgKPTDDKKAGDDKDKQQPPK 1019
Cdd:pfam13254 201 KEVTPVGLMRSPAPG-GHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQ----------SPAPTSASEPPPKTKELPK 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1020 DSKPsaggpkpgDQKPTPGAGA-PKPQAAGTISKPGESQKKDAPA---PPTKPGDTKPAAPKPGES----AKPEAAAK-- 1089
Cdd:pfam13254 270 DSEE--------PAAPSKSAEAsTEKKEPDTESSPETSSEKSAPSllsPVSKASIDKPLSSPDRDPlspkPKPQSPPKdf 341
|
170
....*....|....*...
gi 17136554 1090 ----------KEESSKTE 1097
Cdd:pfam13254 342 ranlrsrevpKDKSKKDE 359
|
|
| DUF4045 |
pfam13254 |
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ... |
980-1103 |
2.66e-03 |
|
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.
Pssm-ID: 433066 [Multi-domain] Cd Length: 415 Bit Score: 41.69 E-value: 2.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 980 RASVKNVDEKSGADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKPTP-GAGAPKPQAAGTISKPGESQK 1058
Cdd:pfam13254 189 RASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTdKEQSPAPTSASEPPPKTKELP 268
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 17136554 1059 KDA--PAPPTKPGDTKPAAPKPGESAKPEAAAKK------EESSKTEASKPAA 1103
Cdd:pfam13254 269 KDSeePAAPSKSAEASTEKKEPDTESSPETSSEKsapsllSPVSKASIDKPLS 321
|
|
| DedD |
COG3147 |
Cell division protein DedD (periplasmic protein involved in septation) [Cell cycle control, ... |
1028-1095 |
2.79e-03 |
|
Cell division protein DedD (periplasmic protein involved in septation) [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442381 [Multi-domain] Cd Length: 140 Bit Score: 39.37 E-value: 2.79e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 17136554 1028 PKPGDQKPTPGAGAPKPQAAgTISKPGESQKKDAPAPPTKPGDTKPAApkpGESAKPEAAAKKEESSK 1095
Cdd:COG3147 2 AEEAAAAPAAAAAPAAPAAA-AAPAPAAAAAAAAPKPAAKPAAPKPAA---AAAAAPAAKAAAPAGGG 65
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
1016-1159 |
2.90e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 41.98 E-value: 2.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1016 QPPKDSKPSAGGPKPGDQKpTPGAGAP--------KPQAAGTISKPGESQKKDA-PAPptkPGDTKPAAPKpGESAKPEA 1086
Cdd:pfam03546 19 KPEEDSESSSEEESDSEEE-TPAAKTPlqakpsgkTPQVRAASAPAKESPRKGApPVP---PGKTGPAAAQ-AQAGKPEE 93
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 17136554 1087 AAKKE-ESSKTEASKPAAtngaakSAAPSAPSDAKPDSKlkpgaagAPEATKATNGASKPDEKKSGPEEPKKAA 1159
Cdd:pfam03546 94 DSESSsEESDSDGETPAA------ATLTTSPAQVKPLGK-------NSQVRPASTVGKGPSGKGANPAPPGKAG 154
|
|
| TRPV3 |
cd22194 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 3; TRPV3 is a ... |
553-681 |
2.95e-03 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 3; TRPV3 is a temperature-sensitive Transient Receptor Potential (TRP) ion channel that is activated by warm temperatures, synthetic small-molecule chemicals, and natural compounds from plants. TRPV3 function is regulated by physiological factors such as extracellular divalent cations and acidic pH, intracellular adenosine triphosphate, membrane voltage, and arachidonic acid. It is expressed in both neuronal and non-neuronal tissues including epidermal keratinocytes, epithelial cells in the gut, endothelial cells in blood vessels, and neurons in dorsal root ganglia and CNS. TRPV3 null mice have abnormal hair morphogenesis and compromised skin barrier function. It may play roles in inflammatory skin disorders, such as itch and pain sensation. TRPV3 is also expressed by many neuronal and non-neuronal tissues, showing that TRPV3 might play roles in other unknown cellular and physiological functions. TRPV3 belongs to the vanilloid TRP subfamily (TRPV), named after the founding member vanilloid receptor 1 (TRPV1). The structure of TRPV shows the typical topology features of all TRP ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411978 [Multi-domain] Cd Length: 680 Bit Score: 42.05 E-value: 2.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 553 YTLVLFAFGCGLNQLLwyyaelekNKCyhlhPDvadfddqEKACtiwRRFSNLFETSQSLFWASFGLVDLvsfdlaGIKS 632
Cdd:cd22194 520 YILFLLGFGVALASLI--------EDC----PD-------DSEC---SSYGSFSDAVLELFKLTIGLGDL------EIQQ 571
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 17136554 633 FTRFWALLMF--GSYSVINIIVLLNMLIAMMSNSYQIISERADTEWKFARS 681
Cdd:cd22194 572 NSKYPILFLLllITYVILTFVLLLNMLIALMGETVENVSKESERIWRLQRA 622
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1018-1144 |
3.80e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 41.45 E-value: 3.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1018 PKDSKPSAGGPKPGD----QKPTPGAGAPKPQAAGTI--SKPGESQKKDA----P-------APPTKPGDTKP------- 1073
Cdd:PLN03209 386 PIPTPPSSSPASSKSvdavAKPAEPDVVPSPGSASNVpeVEPAQVEAKKTrplsPyaryedlKPPTSPSPTAPtgvspsv 465
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 17136554 1074 ----AAPKPGESAKPEAAakkeesskTEASKPAATNGaAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASK 1144
Cdd:PLN03209 466 sstsSVPAVPDTAPATAA--------TDAAAPPPANM-RPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVK 531
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1017-1106 |
3.96e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 41.58 E-value: 3.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGP-KPGDQKPTP-----GAGAPKPQAAGTISKPGESQKKDAP--APPTKPGDTKPAAPKPgesaKPEAAA 1088
Cdd:PHA03379 466 PVAQLPPGPLQDlEPGDQLPGVvqdgrPACAPVPAPAGPIVRPWEASLSQVPgvAFAPVMPQPMPVEPVP----VPTVAL 541
|
90 100
....*....|....*....|.
gi 17136554 1089 KKEES---SKTEASKPAATNG 1106
Cdd:PHA03379 542 ERPVCpapPLIAMQGPGETSG 562
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
891-1102 |
4.76e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.40 E-value: 4.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 891 EQAneGLQMNQTQLIEFNPNL--GDVTRATRVAYVKFMRKKMAADEVSLADDEGAPNGEGEKKPLDASGSKKSITSGGTG 968
Cdd:PRK12323 349 EYA--GFTMTLLRMLAFRPGQsgGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARR 426
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 969 GGASMLAAAALRASVKNVDEKSG----------ADGKPGTMGKPTDDKKAGDDKDKQQPPKDSKPSAGGPKPGDQKP--- 1035
Cdd:PRK12323 427 SPAPEALAAARQASARGPGGAPApapapaaapaAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPpef 506
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 17136554 1036 -----TPGAGAPKPQAAGTISKPGESQKKDA-----PAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPA 1102
Cdd:PRK12323 507 aspapAQPDAAPAGWVAESIPDPATADPDDAfetlaPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
1032-1144 |
5.31e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.41 E-value: 5.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1032 DQKPTPGAGAPKPQAAGTiskpgesqkKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPAATngaaksa 1111
Cdd:PRK12270 35 DYGPGSTAAPTAAAAAAA---------AAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAA------- 98
|
90 100 110
....*....|....*....|....*....|...
gi 17136554 1112 apsapsDAKPDSKLKPGAAGAPEATKATNGASK 1144
Cdd:PRK12270 99 ------PPAAAAAAAPAAAAVEDEVTPLRGAAA 125
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1017-1160 |
5.32e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 41.01 E-value: 5.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGPKP-----GDQKPTPGAGAPKPQAAGTISKPGESQKKdAPAPPTKPGDTKPAAPKPGESAKPEAAAKKE 1091
Cdd:PRK12323 417 RAVAAAPARRSPAPealaaARQASARGPGGAPAPAPAPAAAPAAAARP-AAAGPRPVAAAAAAAPARAAPAAAPAPADDD 495
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 17136554 1092 ESSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATKATNGASKPDEKKSGPEEPKKAAG 1160
Cdd:PRK12323 496 PPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
1017-1102 |
5.59e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 40.95 E-value: 5.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGPKPGDQKPTPgAGAPKPQAAGTISkPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKKEESSKT 1096
Cdd:PRK14950 365 APQPAKPTAAAPSPVRPTPAP-STRPKAAAAANIP-PKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKP 442
|
....*.
gi 17136554 1097 EASKPA 1102
Cdd:PRK14950 443 KYTPPA 448
|
|
| valS |
PRK14900 |
valyl-tRNA synthetase; Provisional |
1032-1160 |
6.05e-03 |
|
valyl-tRNA synthetase; Provisional
Pssm-ID: 237855 [Multi-domain] Cd Length: 1052 Bit Score: 41.13 E-value: 6.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1032 DQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTkPGDTKPAAPKPGESAKPEAAAKKEESSKTEASKPAATNGAAKSA 1111
Cdd:PRK14900 921 EQKPTQDGPAAEAQPAQENTVVESAEKAVAAVSEA-AQQAATAVASGIEKVAEAVRKTVRRSVKKAAATRAAMKKKVAKK 999
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 17136554 1112 APSAPSDAKPDSKLKPGAAGAP-EATKATNGASKPDEKKSGPEEPKKAAG 1160
Cdd:PRK14900 1000 APAKKAAAKKAAAKKAAAKKKVaKKAPAKKVARKPAAKKAAKKPARKAAG 1049
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
1022-1159 |
6.61e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.62 E-value: 6.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1022 KPSAGGPKPgDQKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAA-----KKEESSKT 1096
Cdd:PRK07994 360 HPAAPLPEP-EVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAArqqlqRAQGATKA 438
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 17136554 1097 EASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEAT---KATNGASKPDEKKSGPEEPKKAA 1159
Cdd:PRK07994 439 KKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAyrwKATNPVEVKKEPVATPKALKKAL 504
|
|
| TRPV1-4 |
cd22193 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 1-4; TRPV1-4 are ... |
553-681 |
7.45e-03 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 1-4; TRPV1-4 are thermo-sensing channels that function directly in temperature-sensing and nociception; they share substantial structural and functional properties. Transient Receptor Potential (TRP) ion channels activated by temperature (thermo TRPs) are important molecular players in acute, inflammatory, and chronic pain states. So far, 11 TRP channels in mammalian cells have been identified as thermosensitive TRP (thermo-TRP) channels. TRPV1-4 channels are activated by different heat temperatures, for example, TRPV1 and TRPV2 are activated by high temperatures (>43C and >55C, respectively). TRPV1-4 belong to the vanilloid TRP subfamily (TRPV), named after the founding member vanilloid receptor 1 (TRPV1). The structure of TRPV shows the typical topology features of all TRP ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411977 [Multi-domain] Cd Length: 607 Bit Score: 40.55 E-value: 7.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 553 YTLVLFAFGCGLNQLLwyyaelekNKCYHlhpdvadfddQEKACTIWRRFSnlfETSQSLFWASFGLVDLvsfdlaGIKS 632
Cdd:cd22193 459 YLLFLFGFAVALVSLI--------EKCSS----------DKKDCSSYGSFS---DAVLELFKLTIGMGDL------EFQE 511
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 17136554 633 FTRFWA--LLMFGSYSVINIIVLLNMLIAMMSNSYQIISERADTEWKFARS 681
Cdd:cd22193 512 NSTYPAvfLILLLTYVILTFVLLLNMLIALMGETVNNVSKESKRIWKLQRA 562
|
|
| motB |
PRK05996 |
MotB family protein; |
1033-1148 |
7.69e-03 |
|
MotB family protein;
Pssm-ID: 235665 [Multi-domain] Cd Length: 423 Bit Score: 40.45 E-value: 7.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1033 QKPTPGAGAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAakkEESSKTEASKPAATNGAaksaa 1112
Cdd:PRK05996 189 QVEVTTAGDLLPPGQAREQAQGAKSATAAPATVPQAAPLPQAQPKKAATEEELIA---DAKKAATGEPAANAAKA----- 260
|
90 100 110
....*....|....*....|....*....|....*..
gi 17136554 1113 psapsdAKPDSKLKPGAAGAPEAT-KATNGASKPDEK 1148
Cdd:PRK05996 261 ------AKPEPMPDDQQKEAEQLQaAIAQAIGGVAGK 291
|
|
| PRK11901 |
PRK11901 |
hypothetical protein; Reviewed |
1024-1102 |
8.54e-03 |
|
hypothetical protein; Reviewed
Pssm-ID: 237015 [Multi-domain] Cd Length: 327 Bit Score: 40.05 E-value: 8.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1024 SAGGPKPGDQKPTPGA----GAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKPEAAAKkeeSSKTEAS 1099
Cdd:PRK11901 164 SQNAQGNTSTLPTAPAtvapSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATSGKPKSGAA---SARALSS 240
|
...
gi 17136554 1100 KPA 1102
Cdd:PRK11901 241 APA 243
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1017-1159 |
9.94e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 40.31 E-value: 9.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1017 PPKDSKPSAGGPKPGDQKPTPGAgAPKPQAAGTISKPGESQKKDAPAPPTKPGDTKPAAPKPGESAKP-------EAAAK 1089
Cdd:PHA03247 2569 PPPRPAPRPSEPAVTSRARRPDA-PPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAnepdphpPPTVP 2647
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 17136554 1090 KEESSKTEASKPAATNGAAKSAAPSAPSDAKPDSKLKPGAAGAPEATkATNGASKPDEKKSgPEEPKKAA 1159
Cdd:PHA03247 2648 PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS-LTSLADPPPPPPT-PEPAPHAL 2715
|
|
|