|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-254 |
2.88e-42 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold. :
Pssm-ID: 462810 Cd Length: 79 Bit Score: 148.89 E-value: 2.88e-42
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720399534 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 254
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
854-1125 |
5.63e-08 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 5.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 854 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 933
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 934 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1000
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1001 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1080
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399534 1081 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1125
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2 super family |
cl41728 |
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This ... |
1171-1196 |
7.61e-08 |
|
Wiskott-Aldrich Syndrome Homology (WASP) region 2 (WH2 motif), and similar proteins; This family contains the Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2) as well as thymosin-beta (Tbeta; also called beta-thymosin or betaT) domains that are small, widespread intrinsically disordered actin-binding peptides displaying significant sequence variability and different regulations of actin self-assembly in motile and morphogenetic processes. These WH2/betaT peptides are identified by a central consensus actin-binding motif LKKT/V flanked by variable N-terminal and C-terminal extensions; the betaT shares a more extended and conserved C-terminal half than WH2. These single or repeated domains are found in actin-binding proteins (ABPs) such as the hematopoietic-specific protein WASP, its ubiquitously expressed ortholog neural-WASP (N-WASP), WASP-interacting protein (WAS/WASL-interacting protein family members 1 and 2), and WASP-family verprolin homologous protein (WAVE/SCAR) isoforms: WAVE1, WAVE2, and WAVE3. Also included are the WH2 domains found in inverted formin FH2 domain-containing protein (INF2), Cordon bleu (Cobl) protein, vasodilator-stimulated phosphoprotein (VASP) homology protein and actobindin (found in amoebae). These ABPs are commonly multidomain proteins that contain signaling domains and structurally conserved actin-binding motifs, the most important being the WH2 domain motif through which they bind actin in order to direct the location, rate, and timing for actin assembly in the cell into different structures, such as filopodia, lamellipodia, stress fibers, and focal adhesions. The WH2 domain motif is one of the most abundant actin-binding motifs in Wiskott-Aldrich syndrome proteins (WASPs) where they activate Arp2/3-dependent actin nucleation and branching in response to signals mediated by Rho-family GTPases. The thymosin beta (Tbeta) domains in metazoans act in cells as major actin-sequestering peptides; their complex with monomeric ATP-actin (G-ATP-actin) cannot polymerize at either filament (F-actin) end. The actual alignment was detected with superfamily member cd21801:
Pssm-ID: 425359 Cd Length: 26 Bit Score: 49.23 E-value: 7.61e-08
10 20
....*....|....*....|....*.
gi 1720399534 1171 DPEHVRQSLLTAIRSGEAAAKLKRVT 1196
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| RBD super family |
cl46342 |
Raf-like Ras-binding domain; |
92-158 |
9.76e-03 |
|
Raf-like Ras-binding domain; The actual alignment was detected with superfamily member pfam02196:
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.76e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720399534 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-254 |
2.88e-42 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.
Pssm-ID: 462810 Cd Length: 79 Bit Score: 148.89 E-value: 2.88e-42
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720399534 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 254
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
854-1125 |
5.63e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 5.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 854 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 933
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 934 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1000
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1001 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1080
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399534 1081 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1125
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
1171-1196 |
7.61e-08 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 49.23 E-value: 7.61e-08
10 20
....*....|....*....|....*.
gi 1720399534 1171 DPEHVRQSLLTAIRSGEAAAKLKRVT 1196
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
815-1112 |
3.30e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 3.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 815 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 892
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 893 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 972
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 973 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1050
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720399534 1051 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1112
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
973-1053 |
1.48e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 42.58 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 973 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1052
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 1720399534 1053 P 1053
Cdd:NF040983 164 P 164
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
923-1053 |
3.56e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 41.29 E-value: 3.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 923 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1002
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399534 1003 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1053
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
970-1055 |
9.56e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 39.89 E-value: 9.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 970 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1049
Cdd:NF040983 89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167
|
....*.
gi 1720399534 1050 VALPGS 1055
Cdd:NF040983 168 NATPTS 173
|
|
| RBD |
pfam02196 |
Raf-like Ras-binding domain; |
92-158 |
9.76e-03 |
|
Raf-like Ras-binding domain;
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.76e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720399534 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Cobl |
pfam09469 |
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among ... |
176-254 |
2.88e-42 |
|
Cordon-bleu ubiquitin-like domain; The Cordon-bleu protein domain is highly conserved among vertebrates. The sequence contains three repeated lysine, arginine, and proline-rich regions, the KKRAP motif. The exact function of the protein is unknown but it is thought to be involved in mid-brain neural tube closure. It is expressed specifically in the node. This domain has a ubiquitin-like fold.
Pssm-ID: 462810 Cd Length: 79 Bit Score: 148.89 E-value: 2.88e-42
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720399534 176 EKTVRVVINFKKTQKTIVRVSPHAPLQDLAPIICSKCEFDPLHTVLLKDYQAQEPLDLTKSLNDLGLRELYAMDISRES 254
Cdd:pfam09469 1 EKTVRLVVNYKKTQKAVVRVSPHVPLQELLPIICSKCEFDPLHVLLLKDYISQEELDLTKSLNDLGIKELYAMDVNRES 79
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
854-1125 |
5.63e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 57.64 E-value: 5.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 854 PPKAPRVTTDTGTIPFAPNLEDInnilESKFRSRASNPQAKPSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPtik 933
Cdd:PHA03247 2556 PPAAPPAAPDRSVPPPRPAPRPS----EPAVTSRARRPDAPPQS----ARPRAPVDDRGDPRGPAPPSPLPPDTHAP--- 2624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 934 evqRDPQLSPEQHPSSLSErTHSAPLPNISKADDD-------------IIQKPAETSPPPVAPKPMTLraetsPPPVfpK 1000
Cdd:PHA03247 2625 ---DPPPPSPSPAANEPDP-HPPPTVPPPERPRDDpapgrvsrprrarRLGRAAQASSPPQRPRRRAA-----RPTV--G 2693
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1001 PMTLPAETSPPPVFPKPMTLPAETSLPLvfpKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPS 1080
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATPL---PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1720399534 1081 PFALAVVKRSQSFSKACPESASEGSSALPPAATQDEKTHTVNKPT 1125
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPA 2815
|
|
| WH2_Wc_Cobl |
cd21801 |
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in ... |
1171-1196 |
7.61e-08 |
|
third Wiskott Aldrich syndrome homology region 2 (WH2 motif) repeat (called Wc) found in protein Cordon-Bleu (Cobl) and similar proteins; This family contains the third tandem Wiskott-Aldrich syndrome protein (WASP)-homology domain 2 (WH2), called Wc, found in protein Cordon-Bleu (Cobl), a potent actin filament nucleator that plays an important role in the reorganization of the actin cytoskeleton. It regulates neuron morphogenesis and increases branching of axons and dendrites. It also modulates dendrite branching in Purkinje cells. Cobl binds to and sequesters actin monomers (G-actin). Cobl contains three tandem WH2 (or W) domains consisting of an N-terminal alpha helix and a C-terminal LRKV motif. The first two WH2 domains have the highest binding affinity for actin. They are functionally active in actin nucleation and polymerization. The model corresponds to the first WH2 domain.
Pssm-ID: 409199 Cd Length: 26 Bit Score: 49.23 E-value: 7.61e-08
10 20
....*....|....*....|....*.
gi 1720399534 1171 DPEHVRQSLLTAIRSGEAAAKLKRVT 1196
Cdd:cd21801 1 NPEQARQALLEAIRSGEGAARLKKVP 26
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
815-1161 |
6.92e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.17 E-value: 6.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 815 PATSKSSQQPQPdlKPKPSSGTERHLHRTLSSPTgtETNPPKAPRVTTDTGTIPFAPnledinnileSKFRSRASNPQAK 894
Cdd:PHA03247 2562 AAPDRSVPPPRP--APRPSEPAVTSRARRPDAPP--QSARPRAPVDDRGDPRGPAPP----------SPLPPDTHAPDPP 2627
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 895 PSSfflqMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPA 974
Cdd:PHA03247 2628 PPS----PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 975 ETSPPPVAPKPMTlraetSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP-------PVAA 1047
Cdd:PHA03247 2704 PPPTPEPAPHALV-----SATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPapappaaPAAG 2778
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1048 KPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSPFALAVVKRSQSFSK--ACPESASEGSSALPPAATQDekthtvNKPT 1125
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplPPPTSAQPTAPPPPPGPPPP------SLPL 2852
|
330 340 350
....*....|....*....|....*....|....*.
gi 1720399534 1126 VGSQHGDGDKQNNPVQNEHSSQVLTPADGPSFTLKR 1161
Cdd:PHA03247 2853 GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLAR 2888
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
884-1115 |
3.14e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 51.42 E-value: 3.14e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 884 FRSRASNPQAKPSSfflqmqkRASGHYVTSAAAKSVHTAPGPAPKEPTiKEVQRDPQLSPEQHPSSLSERTHSAPLPNIS 963
Cdd:PRK12323 363 FRPGQSGGGAGPAT-------AAAAPVAQPAPAAAAPAAAAPAPAAPP-AAPAAAPAAAAAARAVAAAPARRSPAPEALA 434
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 964 KADDDIIQKPAETSPPPVAPKPMTLrAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETS------LPLVFPKPMTLR 1037
Cdd:PRK12323 435 AARQASARGPGGAPAPAPAPAAAPA-AAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweeLPPEFASPAPAQ 513
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399534 1038 AETSPPPVAAKPVALPGSQGTSlnlktlktfgAPRPYSSSGPSPfALAVVKRSQSFSKACPESASEGSSALPPAATQD 1115
Cdd:PRK12323 514 PDAAPAGWVAESIPDPATADPD----------DAFETLAPAPAA-APAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
815-1112 |
3.30e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 48.22 E-value: 3.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 815 PATSKSSQQPQPDLKPKP--SSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIPFApnledinnILESKFRSRASNPQ 892
Cdd:pfam03154 212 PATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQP--------SLHGQMPPMPHSLQ 283
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 893 AKPSsfFLQMQKRASGHYVTSAAAKSvHTAPGPAPKEPTikEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIiqK 972
Cdd:pfam03154 284 TGPS--HMQHPVPPQPFPLTPQSSQS-QVPPGPSPAAPG--QSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHI--K 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 973 PAETSPPPVAPKPmtlRAETSPPPVF-PKPMTLPAETSPPPVFpKPMTLPAETSLPLVFPKPMTLRAETSP-PPVAAKPV 1050
Cdd:pfam03154 357 PPPTTPIPQLPNP---QSHKHPPHLSgPSPFQMNSNLPPPPAL-KPLSSLSTHHPPSAHPPPLQLMPQSQQlPPPPAQPP 432
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720399534 1051 ALPGSQGTSLNLKTLKTFGAPRPYSSSGP---SPFALAVVKRSQSFSKACPESASEGSSALPPAA 1112
Cdd:pfam03154 433 VLTQSQSLPPPAASHPPTSGLHQVPSQSPfpqHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSS 497
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
815-1110 |
4.99e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 4.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 815 PATSKSSQQPQPDLKPKPSSGTERHLHRTLSSPTGTETNPPKAPRVTTDTGTIpfapnledinnilESKFRSRASNPQAK 894
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-------------SRPRRARRLGRAAQ 2675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 895 PSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQlspeqhPSSLSERTHSAPLPNISKADDDIIQKPA 974
Cdd:PHA03247 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPL------PPGPAAARQASPALPAAPAPPAVPAGPA 2749
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 975 ETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFP------------KPMTLPAETSLPLVFPKPMTLRAETSP 1042
Cdd:PHA03247 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASlsesreslpspwDPADPPAAVLAPAAALPPAASPAGPLP 2829
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720399534 1043 PPVAAKPVALPGSQGTSLNLKTLKTFGAP------RPYSSSGPSPFALAVVKRSQSFSKACPESASEgSSALPP 1110
Cdd:PHA03247 2830 PPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE-SFALPP 2902
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
792-1077 |
5.40e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.75 E-value: 5.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 792 KMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSsgterhlhrTLSSPTGTETNPPK--APR-VTTDTGTIP 868
Cdd:PHA03378 562 QLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPS---------QTPEPPTTQSHIPEtsAPRqWPMPLRPIP 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 869 FAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKepTIKEVQRDP-QLSPEQHP 947
Cdd:PHA03378 633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG--TMQPPPRAPtPMRPPAAP 710
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 948 SSLSERTHSAPLPNISKADDDIIQKPAETSPPPvAPKPMTLRAETSPPPVFPKPMTLPAETSPPPV-FPKPMTLPAETSL 1026
Cdd:PHA03378 711 PGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-ARPPAAAPGRARPPAAAPGRARPPAAAPGAPTpQPPPQAPPAPQQR 789
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1720399534 1027 PLVFPKPMTlRAETSPPPVAAKPVALPGSQG-TSLNLKTLKTFGAPRPYSSS 1077
Cdd:PHA03378 790 PRGAPTPQP-PPQAGPTSMQLMPRAAPGQQGpTKQILRQLLTGGVKRGRPSL 840
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
705-1081 |
6.48e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.63 E-value: 6.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 705 EKEPACTYGNNVPLSPVDGSNKNPAASylKNFPLYRQDSNPKPKPSNEITREYIPKIGMTTYKIVPPKSLEMAKDWESeA 784
Cdd:PHA03247 2586 ARRPDAPPQSARPRAPVDDRGDPRGPA--PPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV-S 2662
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 785 MGRKDDQKMLPVGQRHTIENMTETSMQTEVPATSKSSQQPQPDLKPKPSSgteRHLHRTLSSPTGTETNPPKAPRVTTDT 864
Cdd:PHA03247 2663 RPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAP---HALVSATPLPPGPAAARQASPALPAAP 2739
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 865 GT--IPFAPNLEDINNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLS 942
Cdd:PHA03247 2740 APpaVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 943 PEQHPSSLSerthsAPLPNISKADDDIIQKPAETSPPP---VAPK-PMTLRAETSPPPvfpkpmTLPAETSPPPVfpKPM 1018
Cdd:PHA03247 2820 PAASPAGPL-----PPPTSAQPTAPPPPPGPPPPSLPLggsVAPGgDVRRRPPSRSPA------AKPAAPARPPV--RRL 2886
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720399534 1019 TLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSQGTSLNLKTLKTFGAPRPYSSSGPSP 1081
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDP 2949
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
807-1081 |
1.57e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.23 E-value: 1.57e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 807 ETSMQT-EVPATSKSSQQPQPDLKPKPssgterhlhrtlssptGTETNPPK-APRVTTDTGTIPFAPNLEDINNILESKF 884
Cdd:PRK10263 338 EPVTQTpPVASVDVPPAQPTVAWQPVP----------------GPQTGEPViAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 885 rsrasnPQAKPSSFFLQMQKRASGHYVTSAAAKSVHTAPGPAPKEPTIKEV-QRDPQLSPEQH-PSSLSERTHSAPLPni 962
Cdd:PRK10263 402 ------QPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPqSTYQTEQTYQQPAA-- 473
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 963 skADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVF-----------------------PKPMTLPAETSPPPVFPKPMT 1019
Cdd:PRK10263 474 --QEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYyfeeveekrarereqlaawyqpiPEPVKEPEPIKSSLKAPSVAA 551
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1720399534 1020 LPAETSLPLVFPKPMTLRAETSPPPVAAKpVALPgsqgtslnLKTLKTFGAPRPYSSSGPSP 1081
Cdd:PRK10263 552 VPPVEAAAAVSPLASGVKKATLATGAAAT-VAAP--------VFSLANSGGPRPQVKEGIGP 604
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
939-1058 |
2.55e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 45.15 E-value: 2.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 939 PQLSPEQHPSSLSERTHSAPLPNiSKADDDIIQKPAETSPPPVAPKPMTlRAETSPPPVfPKPMTLPAETSPPPVFPKPM 1018
Cdd:PRK14971 371 GGRGPKQHIKPVFTQPAAAPQPS-AAAAASPSPSQSSAAAQPSAPQSAT-QPAGTPPTV-SVDPPAAVPVNPPSTAPQAV 447
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1720399534 1019 TLPAETSlplvfPKPMTLRAETSPPPVAAKPVALPGSQGT 1058
Cdd:PRK14971 448 RPAQFKE-----EKKIPVSKVSSLGPSTLRPIQEKAEQAT 482
|
|
| PRK14950 |
PRK14950 |
DNA polymerase III subunits gamma and tau; Provisional |
972-1061 |
1.13e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237864 [Multi-domain] Cd Length: 585 Bit Score: 43.26 E-value: 1.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 972 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPmtlPAETSLPLVFPKPMTLRAETSPPPVAAK--P 1049
Cdd:PRK14950 370 KPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR---PVAPPVPHTPESAPKLTRAAIPVDEKPKytP 446
|
90
....*....|..
gi 1720399534 1050 VALPGSQGTSLN 1061
Cdd:PRK14950 447 PAPPKEEEKALI 458
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
842-1043 |
1.28e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.94 E-value: 1.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 842 RTLSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDiNNILESKFRSRASNPQAKPSSFFLQMQKRASGHYVTSAAAKSVHT 921
Cdd:PRK12323 376 TAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAA-PAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAP 454
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 922 APGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKP 1001
Cdd:PRK12323 455 AAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADP 534
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1720399534 1002 mTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPP 1043
Cdd:PRK12323 535 -DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
973-1053 |
1.48e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 42.58 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 973 PAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAETSPPPVaaKPVAL 1052
Cdd:NF040983 86 PNKVPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPSTTTPTPSMHPI--QPTQL 163
|
.
gi 1720399534 1053 P 1053
Cdd:NF040983 164 P 164
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
911-1222 |
1.68e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 42.85 E-value: 1.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 911 VTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSL---SERTHSAPLPNISKADDDI--IQKPAETSPPPVAPKP 985
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLapaSPAREGSPTPPGPSSPDPPppTPPPASPPPSPAPDLS 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 986 MTLRAETSPPPVfPKPMTLPAETSPPPVfpkpmTLPAETSLPLVFPKPMTLRAETSPPPVAAKPVALPGSqgtslnlktl 1065
Cdd:PHA03307 136 EMLRPVGSPGPP-PAASPPAAGASPAAV-----ASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPP---------- 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1066 ktfGAPRPYSSSGPSPFALAVVkrsqsfskaCPESASEGSSALPPAATQDEKTHtvnkpTVGSQHGDGDKQNNPVQNEHS 1145
Cdd:PHA03307 200 ---AAASPRPPRRSSPISASAS---------SPAPAPGRSAADDAGASSSDSSS-----SESSGCGWGPENECPLPRPAP 262
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720399534 1146 SQVLTPADGPSFTLKRqssltfqSSDPEHVRQSLLTAIRSGEAAAKLKRVTVPSNTISVNGKSGLSQSMSIDAQDSR 1222
Cdd:PHA03307 263 ITLPTRIWEASGWNGP-------SSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
924-1085 |
2.22e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 42.36 E-value: 2.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 924 GPAPKEPtikevqrdPQLSPEQHPSSLSERTH--SAPLPNISKADDDIIQKPAETSPPPVAPKPMtlRAETSPPPVFPKP 1001
Cdd:PHA03378 648 FPTPHQP--------PQVEITPYKPTWTQIGHipYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM--RPPAAPPGRAQRP 717
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1002 MTLPAETSPPPVFPKPMTLPAETSLPLVFPKPMTLRAET-------SPPPVAA--KPVALPGSQGTSLNLKTLKTFGAPR 1072
Cdd:PHA03378 718 AAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPpaaapgrARPPAAApgAPTPQPPPQAPPAPQQRPRGAPTPQ 797
|
170
....*....|...
gi 1720399534 1073 PYSSSGPSPFALA 1085
Cdd:PHA03378 798 PPPQAGPTSMQLM 810
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
925-1158 |
2.38e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 925 PAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPNISKADDDIIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTL 1004
Cdd:pfam03154 221 TQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPL 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1005 PAETSPPPVFPKPMT-LPAETSLPLVFPKPMTLRAETSPP---PVAAKPVALPgsqgtslNLKTLKTFGAPR-------- 1072
Cdd:pfam03154 301 TPQSSQSQVPPGPSPaAPGQSQQRIHTPPSQSQLQSQQPPreqPLPPAPLSMP-------HIKPPPTTPIPQlpnpqshk 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 1073 -PYSSSGPSPFAL-------AVVKRSQSFSKACPESA--------SEGSSALPPAATQDEKTHTVNKPTVGSQH-GDGDK 1135
Cdd:pfam03154 374 hPPHLSGPSPFQMnsnlpppPALKPLSSLSTHHPPSAhppplqlmPQSQQLPPPPAQPPVLTQSQSLPPPAASHpPTSGL 453
|
250 260
....*....|....*....|...
gi 1720399534 1136 QNNPVQNEHSSQVLTPADGPSFT 1158
Cdd:pfam03154 454 HQVPSQSPFPQHPFVPGGPPPIT 476
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
815-1075 |
3.01e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.23 E-value: 3.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 815 PATSKSSQQPQPDLKPKPSSGTERHLHRTlSSPTGTETNPPKAPRVTTDTGTIPFAPNLEDINNILESKFRSRASNPQAK 894
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPA-ASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSP 2871
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 895 PSSfflqmqkrasghyVTSAAAKSVHTAPGPAPKEPTIKEVQRDPQLSPEQHPSSLSERTHSAPLPniskadddiiQKPA 974
Cdd:PHA03247 2872 AAK-------------PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP----------PPPQ 2928
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 975 ETSPPPVAPKPmtlraetsPPPVFPKPMTLPAeTSPPPVFPKP---MTLPAETSLPLVF---PKPMTLRAETSPPPVAAK 1048
Cdd:PHA03247 2929 PQPPPPPPPRP--------QPPLAPTTDPAGA-GEPSGAVPQPwlgALVPGRVAVPRFRvpqPAPSREAPASSTPPLTGH 2999
|
250 260
....*....|....*....|....*..
gi 1720399534 1049 PVALPGSQGTSLnlkTLKTFGAPRPYS 1075
Cdd:PHA03247 3000 SLSRVSSWASSL---ALHEETDPPPVS 3023
|
|
| kgd |
PRK12270 |
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ... |
971-1066 |
3.05e-03 |
|
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;
Pssm-ID: 237030 [Multi-domain] Cd Length: 1228 Bit Score: 41.80 E-value: 3.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 971 QKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPlvfpkpmtlrAETSPPPVAAKPV 1050
Cdd:PRK12270 36 YGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAA----------AAAPAAPPAAAAA 105
|
90
....*....|....*.
gi 1720399534 1051 ALPGSQGTSLNLKTLK 1066
Cdd:PRK12270 106 AAPAAAAVEDEVTPLR 121
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
923-1053 |
3.56e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 41.29 E-value: 3.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 923 PGPAPKEPTIKEVQRDPQlsPEQHPSSLSERTHSAPLPNISKADddiIQKPAETSPPPVAPKPMTLRAETSPPPVFPKPM 1002
Cdd:NF033839 345 PQLETPKPEVKPQPEKPK--PEVKPQPEKPKPEVKPQPETPKPE---VKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPE 419
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1720399534 1003 TLPA------ETSPPPVFPKPMTLP-AETSLPLVFPKPMTLRAETSPPPVAAKPVALP 1053
Cdd:NF033839 420 VKPQpekpkpEVKPQPEKPKPEVKPqPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKP 477
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
972-1118 |
7.60e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 40.62 E-value: 7.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 972 KPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPM------TLPAETSLPLVFPKPMTLRAETSPPPV 1045
Cdd:PRK07994 360 HPAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASApqqapaVPLPETTSQLLAARQQLQRAQGATKAK 439
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720399534 1046 AAKPVALPGSQGTSLNLKTLKTFgAPRPYSSSGPSPFALAVVKRSQSfskacPESASEGSSALPPAATQ---DEKT 1118
Cdd:PRK07994 440 KSEPAAASRARPVNSALERLASV-RPAPSALEKAPAKKEAYRWKATN-----PVEVKKEPVATPKALKKaleHEKT 509
|
|
| BimA_second |
NF040983 |
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ... |
970-1055 |
9.56e-03 |
|
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.
Pssm-ID: 468913 [Multi-domain] Cd Length: 382 Bit Score: 39.89 E-value: 9.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720399534 970 IQKPAETSPPPVAPKPMTLRAETSPPPVFPKPMTLPAETSPPPVFPKPMTLPAETSLPLVfPKPMTLRAETSPPPVAAKP 1049
Cdd:NF040983 89 VPPPPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPPPSPPPPTTTPPTRTTPST-TTPTPSMHPIQPTQLPSIP 167
|
....*.
gi 1720399534 1050 VALPGS 1055
Cdd:NF040983 168 NATPTS 173
|
|
| RBD |
pfam02196 |
Raf-like Ras-binding domain; |
92-158 |
9.76e-03 |
|
Raf-like Ras-binding domain;
Pssm-ID: 460485 Cd Length: 69 Bit Score: 35.96 E-value: 9.76e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720399534 92 LSVVLPGDILKSTTVHGSKPMMDLLVFLCAQYHLNPSSHTIDLLSAEENLIkfKPNTPIGMLDVEKV 158
Cdd:pfam02196 2 CRVYLPDGQRTVVQVRPGETVRDALSKLCKKRGLNPEACDVYLVGGDKYPL--DLDTDSSTLEGEEV 66
|
|
|