|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.40e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. :
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.40e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.97e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain. :
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.97e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PAT1 super family |
cl37801 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
763-951 |
1.91e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division. The actual alignment was detected with superfamily member pfam09770:
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 55.43 E-value: 1.91e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770 248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1720412593 911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770 321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
|
|
| Atrophin-1 super family |
cl38111 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
704-1113 |
1.52e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity. The actual alignment was detected with superfamily member pfam03154:
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.38 E-value: 1.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 784 HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-------IPMTPMPvnqaktyraVPNMPQQRQDQHHQSTMMHPASAA 856
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlpsphPPLQPMT---------QPPPPSQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 857 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMA--PPAHAQPGLVSSSAAQFGAHEQT 934
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQLQSQQPPREQPLPPAPL 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 935 HAMYVSTGSLAQQYAHPNAALHPHTPHpqpsatPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1014
Cdd:pfam03154 351 SMPHIKPPPTTPIPQLPNPQSHKHPPH------LSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 1015 PTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTAHAPMMLMTTQPPggpq 1093
Cdd:pfam03154 425 PPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPTSTSSAMPGIQPP---- 495
|
410 420
....*....|....*....|
gi 1720412593 1094 AALAQSALQPIPVSTTAHFP 1113
Cdd:pfam03154 496 SSASVSSSGPVPAAVSCPLP 515
|
|
| PRK12323 super family |
cl46901 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.33e-05 |
|
DNA polymerase III subunit gamma/tau; The actual alignment was detected with superfamily member PRK12323:
Pssm-ID: 481241 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.40e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.40e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.97e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.97e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
763-951 |
1.91e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 55.43 E-value: 1.91e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770 248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1720412593 911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770 321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1060 |
4.87e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 4.87e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 913 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAPSP 985
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 986 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQS-PQSSFPAAQ-------------QTVFTIHPSHVQPA 1051
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGePSGAVPQPWlgalvpgrvavprFRVPQPAPSREAPA 2990
|
....*....
gi 1720412593 1052 YTTPPHMAH 1060
Cdd:PHA03247 2991 SSTPPLTGH 2999
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-899 |
6.53e-06 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 50.19 E-value: 6.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628 433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
|
170
....*....|..
gi 1720412593 894 Q------HPHVY 899
Cdd:TIGR01628 507 QvlgerlFPLVE 518
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
704-1113 |
1.52e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.38 E-value: 1.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 784 HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-------IPMTPMPvnqaktyraVPNMPQQRQDQHHQSTMMHPASAA 856
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlpsphPPLQPMT---------QPPPPSQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 857 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMA--PPAHAQPGLVSSSAAQFGAHEQT 934
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQLQSQQPPREQPLPPAPL 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 935 HAMYVSTGSLAQQYAHPNAALHPHTPHpqpsatPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1014
Cdd:pfam03154 351 SMPHIKPPPTTPIPQLPNPQSHKHPPH------LSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 1015 PTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTAHAPMMLMTTQPPggpq 1093
Cdd:pfam03154 425 PPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPTSTSSAMPGIQPP---- 495
|
410 420
....*....|....*....|
gi 1720412593 1094 AALAQSALQPIPVSTTAHFP 1113
Cdd:pfam03154 496 SSASVSSSGPVPAAVSCPLP 515
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.33e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
436-940 |
1.83e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 436 PAPVSTMPKRMSSEGPPRMSPKAQ--RHPRNHRVSAGRGSMSSGLEFVSHNPPSEAAAP-PVARTSPAGGTWSSVVSGVP 512
Cdd:PHA03247 2573 PAPRPSEPAVTSRARRPDAPPQSArpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPsPAANEPDPHPPPTVPPPERP 2652
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 513 RLSPKTHRPRSPRQSSIGNSPSGPVlASPQAGIIPA--EAVSMPVPAASPTPASPASNRALTPSIEAKDSRL--QDQRQN 588
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQAS-SPPQRPRRRAarPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPgpAAARQA 2731
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 589 SPAGSKENV-KASETSPSFSKADNKGMSPVVSEHrkqiddlkkfkndfrlQPSSTSESMDQLLSKNREGEKSRDLIKDKT 667
Cdd:PHA03247 2732 SPALPAAPApPAVPAGPATPGGPARPARPPTTAG----------------PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 668 EASAKDSFIDSSSSSSNCTSGSSKTNSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACkqekddreekkdtteqvrkSTL 747
Cdd:PHA03247 2796 ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-------------------GSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 748 NPNAkEFNPRSFSQPKPSTTPTSPRPQ----AQPSPSMVGHQQPAPvytqpvcfapnmmyPVPVSPGVQPLYPIPMTPMP 823
Cdd:PHA03247 2857 APGG-DVRRRPPSRSPAAKPAAPARPPvrrlARPAVSRSTESFALP--------------PDQPERPPQPQAPPPPQPQP 2921
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 824 VNQAKTYRAVPNMPQQRQDqhhqstmmhpasAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVI 903
Cdd:PHA03247 2922 QPPPPPQPQPPPPPPPRPQ------------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSR 2986
|
490 500 510
....*....|....*....|....*....|....*..
gi 1720412593 904 QGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYVS 940
Cdd:PHA03247 2987 EAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVS 3023
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
899-1077 |
1.96e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 899 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 972
Cdd:PRK10263 307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 973 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1044
Cdd:PRK10263 386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
|
170 180 190
....*....|....*....|....*....|...
gi 1720412593 1045 PSHVQPAYTTPPHMAhvPQAHVQSGMVPSHPTA 1077
Cdd:PRK10263 466 QTYQQPAAQEPLYQQ--PQPVEQQPVVEPEPVV 496
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
3.77e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412593 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
4.10e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 4.10e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| SM-ATX |
pfam14438 |
Ataxin 2 SM domain; This SM domain is found in Ataxin-2. |
88-161 |
6.40e-21 |
|
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
Pssm-ID: 464173 Cd Length: 78 Bit Score: 87.99 E-value: 6.40e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438 1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
|
|
| LsmAD |
pfam06741 |
LsmAD domain; This domain is found associated with Lsm domain. |
228-289 |
8.97e-16 |
|
LsmAD domain; This domain is found associated with Lsm domain.
Pssm-ID: 461998 [Multi-domain] Cd Length: 65 Bit Score: 72.60 E-value: 8.97e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741 1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
763-951 |
1.91e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 55.43 E-value: 1.91e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770 169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMM 910
Cdd:pfam09770 248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAARVGYPQNP 320
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1720412593 911 APPAHAQPGlvsssaaqfgahEQTHAMYVSTGSLAQQYAHP 951
Cdd:pfam09770 321 QPGVQPAPA------------HQAHRQQGSFGRQAPIITHP 349
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
740-986 |
2.06e-07 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 55.43 E-value: 2.06e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 740 EQVRKSTLNPNAKefnprsfSQPKPSTTPTSPRPQAQPSPsmvghqQPAPVYTQPVCFA-PNMMYPVPVSP--------G 810
Cdd:pfam09770 98 EQVRFNRQQPAAR-------AAQSSAQPPASSLPQYQYAS------QQSQQPSKPVRTGyEKYKEPEPIPDlqvdaslwG 164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 811 VQPLYPIPMTPMPVNQAktyravpnmpQQRQDQHHQSTMM-------------HPASAAGPPIVATPPAYSTQYVAYSPQ 877
Cdd:pfam09770 165 VAPKKAAAPAPAPQPAA----------QPASLPAPSRKMMsleeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQ 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 878 QFPNQPLVQHVPHYQSQHPhvysPVIQGNARMMA-----PPAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPN 952
Cdd:pfam09770 235 QFPPQIQQQQQPQQQPQQP----QQHPGQGHPVTilqrpQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLS 310
|
250 260 270
....*....|....*....|....*....|....*
gi 1720412593 953 AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 986
Cdd:pfam09770 311 AARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
762-1060 |
4.87e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.56 E-value: 4.87e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247 2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247 2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 913 PAHAQPGLVSSSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPH-------PQPSATPTGQQQSQHGGSHPAPSP 985
Cdd:PHA03247 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPArppvrrlARPAVSRSTESFALPPDQPERPPQ 2910
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 986 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQS-PQSSFPAAQ-------------QTVFTIHPSHVQPA 1051
Cdd:PHA03247 2911 PQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGePSGAVPQPWlgalvpgrvavprFRVPQPAPSREAPA 2990
|
....*....
gi 1720412593 1052 YTTPPHMAH 1060
Cdd:PHA03247 2991 SSTPPLTGH 2999
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
756-1110 |
1.32e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 52.68 E-value: 1.32e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 835
Cdd:PRK07764 400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 836 MPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGN-- 906
Cdd:PRK07764 472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGDtl 548
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 907 ---------ARMMAPPAHAQPgLVSSSAAQFGAHEQTHAmYVSTGSLAQQYAHPNAAL----HPHTPHPQPSATPTGQQQ 973
Cdd:PRK07764 549 vlgfstgglARRFASPGNAEV-LVTALAEELGGDWQVEA-VVGPAPGAAGGEGPPAPAssgpPEEAARPAAPAAPAAPAA 626
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 974 SQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTI 1043
Cdd:PRK07764 627 PAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA 706
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720412593 1044 HPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQPIPVSTTA 1110
Cdd:PRK07764 707 ATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAAAPPPSPPS 780
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
763-1105 |
1.80e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.63 E-value: 1.80e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 842
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 843 qhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPglvS 922
Cdd:PHA03247 2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAP---A 2740
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 923 SSAAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASP 1002
Cdd:PHA03247 2741 PPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 1003 QQQSaiyhAGLAPTPPSMTPASnTQSPQSSFPAAQQTVFTIHP----SHVQPAYTTPPHMAHVPQAHVQSGMVPSHPTAH 1078
Cdd:PHA03247 2821 AASP----AGPLPPPTSAQPTA-PPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRST 2895
|
330 340
....*....|....*....|....*..
gi 1720412593 1079 APMMLMTTQPPGGPQAALAQSALQPIP 1105
Cdd:PHA03247 2896 ESFALPPDQPERPPQPQAPPPPQPQPQ 2922
|
|
| PABP-1234 |
TIGR01628 |
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ... |
734-899 |
6.53e-06 |
|
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.
Pssm-ID: 130689 [Multi-domain] Cd Length: 562 Bit Score: 50.19 E-value: 6.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628 362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628 433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
|
170
....*....|..
gi 1720412593 894 Q------HPHVY 899
Cdd:TIGR01628 507 QvlgerlFPLVE 518
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
704-1113 |
1.52e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 49.38 E-value: 1.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154 127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 784 HQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYP-------IPMTPMPvnqaktyraVPNMPQQRQDQHHQSTMMHPASAA 856
Cdd:pfam03154 207 PPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPqrlpsphPPLQPMT---------QPPPPSQVSPQPLPQPSLHGQMPP 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 857 GPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMA--PPAHAQPGLVSSSAAQFGAHEQT 934
Cdd:pfam03154 278 MPHSLQTGP-------SHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQLQSQQPPREQPLPPAPL 350
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 935 HAMYVSTGSLAQQYAHPNAALHPHTPHpqpsatPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLA 1014
Cdd:pfam03154 351 SMPHIKPPPTTPIPQLPNPQSHKHPPH------LSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 1015 PTPPSMTPASnTQSPQSSFPAAQQ-TVFTIHPSHVQPAYTTPPHMAHVPqahvQSGMVPSHPTAHAPMMLMTTQPPggpq 1093
Cdd:pfam03154 425 PPPPAQPPVL-TQSQSLPPPAASHpPTSGLHQVPSQSPFPQHPFVPGGP----PPITPPSGPPTSTSSAMPGIQPP---- 495
|
410 420
....*....|....*....|
gi 1720412593 1094 AALAQSALQPIPVSTTAHFP 1113
Cdd:pfam03154 496 SSASVSSSGPVPAAVSCPLP 515
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
841-1087 |
5.48e-05 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 47.34 E-value: 5.48e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 841 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNA--RMMAPP 913
Cdd:pfam09770 96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVApkKAAAPA 174
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 914 AHAQPGLVSSSAAQFG----------------AHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHG 977
Cdd:pfam09770 175 PAPQPAAQPASLPAPSrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 978 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQPAyttPPH 1057
Cdd:pfam09770 255 QHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQPA---PAH 330
|
250 260 270
....*....|....*....|....*....|
gi 1720412593 1058 MAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1087
Cdd:pfam09770 331 QAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
399-606 |
9.33e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.79 E-value: 9.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323 367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323 447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323 520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
760-1030 |
2.95e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.46 E-value: 2.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipMTPMPVNQAKTYRAVPNMPQ 838
Cdd:PRK10263 345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL----QQPVQPQQPYYAPAAEQPAQ 417
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 839 QRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgNARMMAPPAHAQP 918
Cdd:PRK10263 418 QPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-EPLYQQPQPVEQQ 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 919 GLVSSSAAQFGAHEQTHAMYVSTgSLAQQYAHPNAALHP-HTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQAL 997
Cdd:PRK10263 488 PVVEPEPVVEETKPARPPLYYFE-EVEEKRAREREQLAAwYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLAS 566
|
250 260 270
....*....|....*....|....*....|....*.
gi 1720412593 998 HLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1030
Cdd:PRK10263 567 GV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
779-1003 |
4.65e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 4.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 779 PSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPgVQPLYPIPMTPMPVNQaktyravPNMPQQRQdqhhqstmmhPASAAGP 858
Cdd:PRK10263 309 PLLNGAPITEPVAVAAAATTATQSWAAPVEP-VTQTPPVASVDVPPAQ-------PTVAWQPV----------PGPQTGE 370
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 859 PIVATPPAystqyvAYSPQQFPNQPLVQHVPHYQSQHPHvyspviqgnarmmAPPAHAQPGLVSSSAAQFGAHEQTHAMY 938
Cdd:PRK10263 371 PVIAPAPE------GYPQQSQYAQPAVQYNEPLQQPVQP-------------QQPYYAPAAEQPAQQPYYAPAPEQPAQQ 431
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720412593 939 vstGSLAQQYAHPNAALHPHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQ 1003
Cdd:PRK10263 432 ---PYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE 493
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
436-940 |
1.83e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 436 PAPVSTMPKRMSSEGPPRMSPKAQ--RHPRNHRVSAGRGSMSSGLEFVSHNPPSEAAAP-PVARTSPAGGTWSSVVSGVP 512
Cdd:PHA03247 2573 PAPRPSEPAVTSRARRPDAPPQSArpRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPsPAANEPDPHPPPTVPPPERP 2652
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 513 RLSPKTHRPRSPRQSSIGNSPSGPVlASPQAGIIPA--EAVSMPVPAASPTPASPASNRALTPSIEAKDSRL--QDQRQN 588
Cdd:PHA03247 2653 RDDPAPGRVSRPRRARRLGRAAQAS-SPPQRPRRRAarPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPgpAAARQA 2731
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 589 SPAGSKENV-KASETSPSFSKADNKGMSPVVSEHrkqiddlkkfkndfrlQPSSTSESMDQLLSKNREGEKSRDLIKDKT 667
Cdd:PHA03247 2732 SPALPAAPApPAVPAGPATPGGPARPARPPTTAG----------------PPAPAPPAAPAAGPPRRLTRPAVASLSESR 2795
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 668 EASAKDSFIDSSSSSSNCTSGSSKTNSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACkqekddreekkdtteqvrkSTL 747
Cdd:PHA03247 2796 ESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLG-------------------GSV 2856
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 748 NPNAkEFNPRSFSQPKPSTTPTSPRPQ----AQPSPSMVGHQQPAPvytqpvcfapnmmyPVPVSPGVQPLYPIPMTPMP 823
Cdd:PHA03247 2857 APGG-DVRRRPPSRSPAAKPAAPARPPvrrlARPAVSRSTESFALP--------------PDQPERPPQPQAPPPPQPQP 2921
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 824 VNQAKTYRAVPNMPQQRQDqhhqstmmhpasAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVI 903
Cdd:PHA03247 2922 QPPPPPQPQPPPPPPPRPQ------------PPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSR 2986
|
490 500 510
....*....|....*....|....*....|....*..
gi 1720412593 904 QGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYVS 940
Cdd:PHA03247 2987 EAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVS 3023
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
899-1077 |
1.96e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.76 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 899 YSPVIQGNArMMAPPAHAQPGLVSSS--AAQFGAHEQTHAMYVSTGSLAQQYAHPNAALHPHTPHP----QPSATPTGQQ 972
Cdd:PRK10263 307 YDPLLNGAP-ITEPVAVAAAATTATQswAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPviapAPEGYPQQSQ 385
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 973 QSQHGGSHPAP--------SPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIH 1044
Cdd:PRK10263 386 YAQPAVQYNEPlqqpvqpqQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTE 465
|
170 180 190
....*....|....*....|....*....|...
gi 1720412593 1045 PSHVQPAYTTPPHMAhvPQAHVQSGMVPSHPTA 1077
Cdd:PRK10263 466 QTYQQPAAQEPLYQQ--PQPVEQQPVVEPEPVV 496
|
|
| DUF3498 |
pfam12004 |
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ... |
468-617 |
3.77e-03 |
|
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.
Pssm-ID: 463427 [Multi-domain] Cd Length: 511 Bit Score: 41.28 E-value: 3.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004 196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004 274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353
|
...
gi 1720412593 615 SPV 617
Cdd:pfam12004 354 SPV 356
|
|
| Sm_like |
cd00600 |
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ... |
92-159 |
4.10e-03 |
|
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.
Pssm-ID: 212462 [Multi-domain] Cd Length: 63 Bit Score: 36.84 E-value: 4.10e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412593 92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600 1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
|
|
| PAM2 |
pfam07145 |
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ... |
743-758 |
4.55e-03 |
|
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.
Pssm-ID: 429316 Cd Length: 17 Bit Score: 35.67 E-value: 4.55e-03
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
693-823 |
5.97e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 40.84 E-value: 5.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263 741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412593 772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
735-845 |
7.26e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 40.53 E-value: 7.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412593 735 KKDTTEQVrkSTLNPNAKEFN---PRSFSQPKPSTTPtSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSpgv 811
Cdd:PRK14971 380 KPVFTQPA--AAPQPSAAAAAspsPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFK--- 453
|
90 100 110
....*....|....*....|....*....|....
gi 1720412593 812 qPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQHH 845
Cdd:PRK14971 454 -EEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIK 486
|
|
|