| Name | Accession | Description | Interval | E-value |
| WAPL | pfam07814 | Wings apart-like protein regulation of heterochromatin; This family contains sequences ... | 643-1016 | 1.67e-39 |
|
Wings apart-like protein regulation of heterochromatin; This family contains sequences expressed in eukaryotic organisms bearing high similarity to the WAPL conserved region of D. melanogaster wings apart-like protein. This protein is involved in the regulation of heterochromatin structure. hWAPL, the human homolog, is found to play a role in cervical carcinogenesis, and is thought to have similar functions to Drosophila wapl protein. Malfunction of the hWAPL pathway is thought to activate an apoptotic pathway that consequently leads to cell death.
Pssm-ID: 462278 Cd Length: 344 Bit Score: 150.53 E-value: 1.67e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 643 NVRQAHECLQHGENLRFRDDVIYLLDSMKTEEPLL-VRCLSTLNLAEQCLNVDFMRQMRSLGLVKKSLLLLKDAQTHPA- 720
Cdd:pfam07814 1 NVRSIHELREAGENQRFEDEVEYILDDIEDSNPSSsTRRSSLLELASKCLDPAFRRQFRAHGLVKRLFKALGDATDDISg 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 721 -LKVALVTLSFVMTQSNVEPSEL-DKKCTELLVMALSVKDQFLvspageeeppplegkaateykKITDKSKALLKKCSPL 798
Cdd:pfam07814 81 sLALCAATLMLVLSPDSLVMDLDrWSGLLELLLKLLSVDSDIS---------------------VLAKDRKTNLSKSAQK 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 799 LIHAKysDSSNTNETVDISLQPSTLSLGELATECFLSLTSSmntsvsslkedLRTLG-AIDVIANRAIALTRQCTGYLTS 877
Cdd:pfam07814 140 SVREL--REQLLKGKIWDDLGPSKLSPQTLALECLESLVSK-----------LRELGgLIDHISDEVLDCLVELLLSSSE 206
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 878 gqeeegkeeeeREGEEREVDVHFMKLFRQLKLLENILCMSKGNQSYLTMFDGsiFISEISKFLvllgdylisKDLSTRHT 957
Cdd:pfam07814 207 -----------RDSWDDLSPEDLRLLELCLSILESVTVLNEENQAYLLWSLG--LLSSLAKLL---------SSSLRRRD 264
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1133447891 958 CIVTETLLGTLRVFLNITHDNALASYRVGE---QEKIIDSICNFTFKLAQ--SVPVIQQFDILV 1016
Cdd:pfam07814 265 DQLRQLQMLALRLLLNLTNNNPSLCEAFSTpelVHGLVEIALSNFLNLSPeyAPDRESSLDLLI 328
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 105-361 | 1.00e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 60.18 E-value: 1.00e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 105 SSGLHVGQKRGRGRPRKTPKEPLDESKDDDSTANESEP---------TSQKPADRGHSRKTLNESTESEPGPNDE----S 171
Cdd:PHA03307 135 SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAalplsspeeTARAPSSPPAEPPPSTPPAAASPRPPRRsspiS 214
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 172 TGELTPVVKRGRGRPRKTPKSTNESTPNISITSE-----------PESIGQKPPVKRGRG----RPRKTPKSTNESTPNV 236
Cdd:PHA03307 215 ASASSPAPAPGRSAADDAGASSSDSSSSESSGCGwgpenecplprPAPITLPTRIWEASGwngpSSRPGPASSSSSPRER 294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 237 SIASEPESIGQKPPVKGGRGRPKKIRKEAMNESTASESELVQESTGTESEPSSQQPPVVKRGRGRPRKTPKPSSAVSTPS 316
Cdd:PHA03307 295 SPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 1133447891 317 SQEPVVKRGRGRPRKVRPPNDSIGTV---TPSSTEQRPSTIETGDDEA 361
Cdd:PHA03307 375 PSSPAASAGRPTRRRARAAVAGRARRrdaTGRFPAGRPRPSPLDAGAA 422
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 106-381 | 2.66e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.53 E-value: 2.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 106 SGLHVGQKRGRGRPRKTPKEPLDESkdddSTANESEPT--SQKPADRGHSRKTLNESTESEPGPNDESTGELTPVV-KRG 182
Cdd:PTZ00449 504 SDKHDEPPEGPEASGLPPKAPGDKE----GEEGEHEDSkeSDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLsKKP 579
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 183 RG--------RPRKTPKSTNESTPNISITSE----PES--IGQKPPVKRGRGRPRKTPKSTNESTPN-----VSIASEPE 243
Cdd:PTZ00449 580 EFpkdpkhpkDPEEPKKPKRPRSAQRPTRPKspklPELldIPKSPKRPESPKSPKRPPPPQRPSSPErpegpKIIKSPKP 659
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 244 SIGQKPPV--------------KGGRGRPKKIRKEAMNESTASESELVQESTGTESEPSSQQPPVVKRGRGRPRKTPKPS 309
Cdd:PTZ00449 660 PKSPKPPFdpkfkekfyddyldAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDP 739
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1133447891 310 SAVSTPSSQ---EPVVKRGRgrpRKVRPPNDSIGTVTPSSTEQRPSTIETGDDEACLKRDATgvgeDMEEEPVPT 381
Cdd:PTZ00449 740 DAEQPDDIEfftPPEEERTF---FHETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDS----PSEHEDKPP 807
|
|
| PHA03379 | PHA03379 | EBNA-3A; Provisional | 149-334 | 1.46e-04 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 46.20 E-value: 1.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 149 DRGHSRKTLNESTESEPGPNDEstgELTPVVKRGRGRPRKTP---------------KSTNESTPNISITSEPESIGQKP 213
Cdd:PHA03379 342 DEGATGETREESEDTESDGDDE---ELPRIVSREGTKRKRPPiflrrlhrlllmragKLTERAREALEKASEPTYGTPRP 418
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 214 PVKRgrgrPRKTPKSTNESTPNVSIASEPESigqkPPVKGGRGRPkkirkeaMNESTASESELVQESTGT---ESEPSSQ 290
Cdd:PHA03379 419 PVEK----PRPEVPQSLETATSHGSAQVPEP----PPVHDLEPGP-------LHDQHSMAPCPVAQLPPGplqDLEPGDQ 483
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1133447891 291 QPPVVKRGRGRPRKTPKPSSAVSTPSSQEPVVKRGRGrPRKVRP 334
Cdd:PHA03379 484 LPGVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVA-FAPVMP 526
|
|
| PTZ00449 | PTZ00449 | 104 kDa microneme/rhoptry antigen; Provisional | 32-316 | 4.95e-04 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 44.68 E-value: 4.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 32 ISPSRTIYRPLPP--SKFKVPSSSVSLKRGTKKKKTSKLDVFD--FHSDDSDYEFDMPIVDYHSPIPVRVTGGTSKTSsg 107
Cdd:PTZ00449 654 IKSPKPPKSPKPPfdPKFKEKFYDDYLDAAAKSKETKTTVVLDesFESILKETLPETPGTPFTTPRPLPPKLPRDEEF-- 731
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 108 lhvgqkrgrgrPRKTPKEPLDESKDDdsTANESEPTSQkpadrghsRKTLNESTESEPGPNDESTGELTPVVKRGRGRPR 187
Cdd:PTZ00449 732 -----------PFEPIGDPDAEQPDD--IEFFTPPEEE--------RTFFHETPADTPLPDILAEEFKEEDIHAETGEPD 790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 188 KTPKStnestPNISITSEPESIGQKPPVKRGRGRPRKTPKSTNEstpnvsIASEPESIGQKPPvkggrGRPKKIRK---- 263
Cdd:PTZ00449 791 EAMKR-----PDSPSEHEDKPPGDHPSLPKKRHRLDGLALSTTD------LESDAGRIAKDAS-----GKIVKLKRsksf 854
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1133447891 264 ------EAMNESTASESELVQESTGTESEPSSQQPPVVKR----GRGRPRKTPKPSSAVSTPS 316
Cdd:PTZ00449 855 ddlttvEEAEEMGAEARKIVVDDDGTEADDEDTHPPEEKHksevRRRRPPKKPSKPKKPSKPK 917
|
|
| PTZ00108 | PTZ00108 | DNA topoisomerase 2-like protein; Provisional | 36-291 | 8.01e-04 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 43.88 E-value: 8.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 36 RTIYRPLPPSKFKVPSS-SVSLKRGTKKKKTSKldvfdfhsddsdyefdmpiVDYHSPIPVRVTGGTSKTSSGLHVGQKR 114
Cdd:PTZ00108 1153 AKEQRLKSKTKGKASKLrKPKLKKKEKKKKKSS-------------------ADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 115 GRGRPRKTPKEPLDESKDDDSTANESEPTSQKPADRGHSRKTLNESTESEPGPNDESTGELTPVVkrgRGRPRKTPKSTN 194
Cdd:PTZ00108 1214 DNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRV---SAVQYSPPPPSK 1290
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 195 ESTPNISITSEPESIGQKPPVKRGRGRPRKTPKSTNESTPNVSIASEPESIGQKPPVKGGRGRPkKIRKEAMNESTASES 274
Cdd:PTZ00108 1291 RPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLR-RPRKKKSDSSSEDDD 1369
|
250
....*....|....*..
gi 1133447891 275 ELVQESTGTESEPSSQQ 291
Cdd:PTZ00108 1370 DSEVDDSEDEDDEDDED 1386
|
|
| PHA03379 | PHA03379 | EBNA-3A; Provisional | 90-321 | 1.56e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 42.74 E-value: 1.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 90 HSPIPVRVTGGTSKTSSGLHVGQKRGRGRPRKTPKEPLDESKDDDSTANESEPTS--QKPADRGHSRKTLNESTESEPGP 167
Cdd:PHA03379 387 HRLLLMRAGKLTERAREALEKASEPTYGTPRPPVEKPRPEVPQSLETATSHGSAQvpEPPPVHDLEPGPLHDQHSMAPCP 466
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 168 ---------NDESTGELTP-VVKRGRGRPRKTPKSTNestpNISITSEPeSIGQKPPVKRGRGRPRKTPkstNESTPNVS 237
Cdd:PHA03379 467 vaqlppgplQDLEPGDQLPgVVQDGRPACAPVPAPAG----PIVRPWEA-SLSQVPGVAFAPVMPQPMP---VEPVPVPT 538
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 238 IASEpESIGQKPP--VKGGRGRPKKIRKEAMNESTASeselvqeSTGTESEPSSQQPPVVKRGRGRPRKTPKPSSAVSTP 315
Cdd:PHA03379 539 VALE-RPVCPAPPliAMQGPGETSGIVRVRERWRPAP-------WTPNPPRSPSQMSVRDRLARLRAEAQPYQASVEVQP 610
|
250
....*....|..
gi 1133447891 316 ------SSQEPV 321
Cdd:PHA03379 611 pqltqvSPQQPM 622
|
|
| PspC_subgroup_2 | NF033839 | pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... | 110-353 | 2.33e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 42.06 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 110 VGQKRGRGRPRKTPKEPLDESKDDDSTANESEPTSQKPADRGHSRKTLNESTESEPGPNDESTGE---LTPVVKRGRGRP 186
Cdd:NF033839 271 VVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQpekPKPEVKPQLETP 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 187 RKTPKSTNEsTPNISITSEPEsiGQKPPVKRGRGRPRKTPKSTNEsTPNVSIASEPESigQKPPVKGGRGRPKKIRKEAm 266
Cdd:NF033839 351 KPEVKPQPE-KPKPEVKPQPE--KPKPEVKPQPETPKPEVKPQPE-KPKPEVKPQPEK--PKPEVKPQPEKPKPEVKPQ- 423
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 267 NESTASESELVQESTGTESEPSSQQP-PVVKRGRGRPRKTPKPSsavstPSSQEPVVKRgrgRPRKVRPPNDSigtvtPS 345
Cdd:NF033839 424 PEKPKPEVKPQPEKPKPEVKPQPEKPkPEVKPQPETPKPEVKPQ-----PEKPKPEVKP---QPEKPKPDNSK-----PQ 490
|
....*...
gi 1133447891 346 STEQRPST 353
Cdd:NF033839 491 ADDKKPST 498
|
|
| Atrophin-1 | pfam03154 | Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... | 150-354 | 4.04e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 41.68 E-value: 4.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 150 RGHSRKTLNESTESEPGPNDESTGELTPVVKRGRGRPRKTPKSTNESTPniSITSEPESIGQKPPVKRGRGRPRKTPKST 229
Cdd:pfam03154 40 RSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSAKRQREKGA--SDTEEPERATAKKSKTQEISRPNSPSEGE 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 230 NESTPNVSI----ASEPESIGQKppvkgGRGRPKKIRKEAMNESTASESELVQESTGTESEPSSQQPPVVKRGRGRPRKT 305
Cdd:pfam03154 118 GESSDGRSVndegSSDPKDIDQD-----NRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTT 192
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 1133447891 306 PKPSsavSTPSSQEPVVKrGRGRPRKVRPPNDSIGTVTPSSTEQRPSTI 354
Cdd:pfam03154 193 QAAT---AGPTPSAPSVP-PQGSPATSQPPNQTQSTAAPHTLIQQTPTL 237
|
|
| PBP1 | COG5180 | PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... | 91-352 | 5.75e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 40.82 E-value: 5.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 91 SPIPVRVTGGTSKTSSGLHVGQKRGRGRPRKTPKEPLDESKDDDstanESEPTSQKPADRGHS-----RKTLNESTESEP 165
Cdd:COG5180 125 APAAAAALPKAKVTREATSASAGVALAAALLQRSDPILAKDPDG----DSASTLPPPAEKLDKvltepRDALKDSPEKLD 200
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 166 GPNDESTGELTPVVKRGRGRPRkTPKSTNESTPNISITSEPESIGQKPPVkrgRGRPRKTPKSTNESTPNVSIASEPESI 245
Cdd:COG5180 201 RPKVEVKDEAQEEPPDLTGGAD-HPRPEAASSPKVDPPSTSEARSRPATV---DAQPEMRPPADAKERRRAAIGDTPAAE 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 246 GQKPPVKGGRGRPKKIRKEAMNESTASESELVQESTGTesepssqqPPVVKRGRGRPRKTPKPSSAVSTPSSQEPVVKRG 325
Cdd:COG5180 277 PPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPPAT--------RPVRPPGGARDPGTPRPGQPTERPAGVPEAASDA 348
|
250 260
....*....|....*....|....*..
gi 1133447891 326 RGRPRKVRPPNDSIGTVTPSSTEQRPS 352
Cdd:COG5180 349 GQPPSAYPPAEEAVPGKPLEQGAPRPG 375
|
|
| PHA03378 | PHA03378 | EBNA-3B; Provisional | 139-335 | 6.49e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 40.82 E-value: 6.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 139 ESEPTSQKP--ADRGHSRKTLNESTESEPgPNDESTGELTPVVKRGRGRPRKTPKSTNESTPNISITSEPESIGQKPPVK 216
Cdd:PHA03378 708 AAPPGRAQRpaAATGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAP 786
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 217 RGRGRPRKTPKSTNESTPNVSIASEPESIGQKPPVK--------GG--RGRPkKIRKEAMNESTASESELVQESTGTESE 286
Cdd:PHA03378 787 QQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKqilrqlltGGvkRGRP-SLKKPAALERQAAAGPTPSPGSGTSDK 865
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1133447891 287 --------PSSQQPPVVKRGRGRPRktPKPSSAVSTPSSQEPVVKRGRGR--PRKVRPP 335
Cdd:PHA03378 866 ivqapvfyPPVLQPIQVMRQLGSVR--AAAASTVTQAPTEYTGERRGVGPmhPTDIPPS 922
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 186-420 | 6.85e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 6.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 186 PRKTPKSTNESTPnisiTSEPESIGQKPPVKRGRGRPRKTPKSTNESTPN----------VSIASEPESIGQKPPVKGGR 255
Cdd:PHA03247 2557 PAAPPAAPDRSVP----PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVddrgdprgpaPPSPLPPDTHAPDPPPPSPS 2632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 256 GRPkkirkeamNESTASESELVQESTGTESEPSsqqPPVVKRGRgRPRKTPKPSSAVSTPSsqepvvkrgRGRPRKVRPP 335
Cdd:PHA03247 2633 PAA--------NEPDPHPPPTVPPPERPRDDPA---PGRVSRPR-RARRLGRAAQASSPPQ---------RPRRRAARPT 2691
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 336 ndsIGTVT--------PSSTEQRPSTIETGDDEACLKRDATGVGEDMEEEPVPtildiidkeekSTPANNPSTKAQDTME 407
Cdd:PHA03247 2692 ---VGSLTsladppppPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-----------PAVPAGPATPGGPARP 2757
|
250
....*....|...
gi 1133447891 408 TTTIVITVPPDPT 420
Cdd:PHA03247 2758 ARPPTTAGPPAPA 2770
|
|
| PLN02967 | PLN02967 | kinase | 154-285 | 7.30e-03 |
|
kinase
Pssm-ID: 215521 [Multi-domain] Cd Length: 581 Bit Score: 40.41 E-value: 7.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 154 RKTLNESTESEPGPNDESTGEL--TPVVKRGRGRPRKTPKSTNESTPNISITSEPESIGQKPPVKRgRGRPRKtpKSTNE 231
Cdd:PLN02967 48 SRKKIESALAVDEEPDENGAVSkkKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKT-PRRTRR--KAAAA 124
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1133447891 232 STPNVSIASEPESigqkppvkggRGRPKKIRKEAMNESTASESEL--VQESTGTES 285
Cdd:PLN02967 125 SSDVEEEKTEKKV----------RKRRKVKKMDEDVEDQGSESEVsdVEESEFVTS 170
|
|
| PHA03307 | PHA03307 | transcriptional regulator ICP4; Provisional | 147-380 | 7.60e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.54 E-value: 7.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 147 PADRGHSRKTLNESTESEPGPNDESTGELTPVVKRGRGRPRKTPKST---NESTPNISITSEPESIGQKPPvkrgrgrpr 223
Cdd:PHA03307 61 ACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTppgPSSPDPPPPTPPPASPPPSPA--------- 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1133447891 224 ktpkstNESTPNVSIASEPESIGQKPPVKGGRGRPKKIRKEAMNESTASESELVQESTGTESEPSSqqPPVVKRGRGRPR 303
Cdd:PHA03307 132 ------PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPA--EPPPSTPPAAAS 203
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1133447891 304 KTPKPSSAVSTPSSQEPVVKRGRGRPRKVRPPNDSigtvtpSSTEQRPSTIETGDDEACLKRDATGVGEDMEEEPVP 380
Cdd:PHA03307 204 PRPPRRSSPISASASSPAPAPGRSAADDAGASSSD------SSSSESSGCGWGPENECPLPRPAPITLPTRIWEASG 274
|
|
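Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and each table row gives a query interval such as 643-1016. For readers who want to work with these values programmatically, the following is a minimal Python sketch that parses one statistics line and computes the length of a query interval. The names `HitStats`, `parse_stats`, and `interval_length` are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the regular expression assumes the line layout seen in this listing.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    """Values carried by one 'Pssm-ID: ...' statistics line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the lines in this listing:
# "Pssm-ID: <int> [Multi-domain] Cd Length: <int> Bit Score: <float> E-value: <float>"
# where the "[Multi-domain]" tag is optional.
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Parse one statistics line; return None if the line does not match."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return HitStats(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["bits"]),
        e_value=float(m["evalue"]),
    )


def interval_length(interval: str) -> int:
    """Number of query residues in an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1
```

For the WAPL hit, `interval_length("643-1016")` gives 374 query residues. The interval is in query coordinates while Cd Length counts positions in the domain model, so the two numbers are not directly comparable without the alignment itself.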
|