NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720412588|ref|XP_030110095|]
View 

ataxin-2 isoform X2 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 6.50e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


:

Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 6.50e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 9.11e-16

LsmAD domain; This domain is found associated with Lsm domain.


:

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.11e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-935 9.70e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.12  E-value: 9.70e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQGNA 907
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAArvgYPQNP 320
                          170       180
                   ....*....|....*....|....*...
gi 1720412588  908 RMMAPPAHAQPGLVSSSAAQFGAHEQTH 935
Cdd:pfam09770  321 QPGVQPAPAHQAHRQQGSFGRQAPIITH 348
PHA03247 super family cl33720
large tegument protein UL36; Provisional
763-1151 1.09e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 842
Cdd:PHA03247  2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  843 qhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP---- 918
Cdd:PHA03247  2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  919 ---GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQHG 995
Cdd:PHA03247  2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAALPP 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  996 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihPSH 1065
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------SRS 2894
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588 1066 VQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEACV 1142
Cdd:PHA03247  2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

                   ....*....
gi 1720412588 1143 CRGRRGTPS 1151
Cdd:PHA03247  2975 PRFRVPQPA 2983
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
399-606 9.34e-05

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK12323:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
743-758 4.85e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


:

Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412588  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 6.50e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 6.50e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 9.11e-16

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.11e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-935 9.70e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.12  E-value: 9.70e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQGNA 907
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAArvgYPQNP 320
                          170       180
                   ....*....|....*....|....*...
gi 1720412588  908 RMMAPPAHAQPGLVSSSAAQFGAHEQTH 935
Cdd:pfam09770  321 QPGVQPAPAHQAHRQQGSFGRQAPIITH 348
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-1078 1.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  913 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 974
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  975 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 1054
Cdd:PHA03247  2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
                          330       340
                   ....*....|....*....|....
gi 1720412588 1055 qqtvftihPSHVQPAYTTPPHMAH 1078
Cdd:PHA03247  2984 --------PSREAPASSTPPLTGH 2999
PHA03247 PHA03247
large tegument protein UL36; Provisional
763-1151 1.09e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 842
Cdd:PHA03247  2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  843 qhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP---- 918
Cdd:PHA03247  2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  919 ---GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQHG 995
Cdd:PHA03247  2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAALPP 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  996 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihPSH 1065
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------SRS 2894
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588 1066 VQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEACV 1142
Cdd:PHA03247  2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

                   ....*....
gi 1720412588 1143 CRGRRGTPS 1151
Cdd:PHA03247  2975 PRFRVPQPA 2983
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
734-899 6.59e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 50.19  E-value: 6.59e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628  362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628  433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
                          170
                   ....*....|..
gi 1720412588  894 Q------HPHVY 899
Cdd:TIGR01628  507 QvlgerlFPLVE 518
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
704-1074 2.95e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.61  E-value: 2.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154  127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  784 HQQPAPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 844
Cdd:pfam03154  207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  845 hQSTMMHPASAAGPPIVAT------PPAYSTQyVAYSPQQFPNQPLVQhvPHYQSQHPHVYSPVIQGNARM---MAPPAH 915
Cdd:pfam03154  286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPA-APGQSQQRIHTPPSQ--SQLQSQQPPREQPLPPAPLSMphiKPPPTT 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  916 AQPGLVSSSAAQFGAHEQTHAMYacpKLPYNKETSPSFYFAISTGSLAQQYAHPNA-ALHPHT-PHPQPSATPTGQQQSQ 993
Cdd:pfam03154  362 PIPQLPNPQSHKHPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  994 hggSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTP 1073
Cdd:pfam03154  439 ---SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515

                   .
gi 1720412588 1074 P 1074
Cdd:pfam03154  516 P 516
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
399-606 9.34e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ...
468-617 3.46e-03

Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.


Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412588  615 SPV 617
Cdd:pfam12004  354 SPV 356
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
92-159 4.17e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 4.17e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
743-758 4.85e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412588  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
PRK10263 PRK10263
DNA translocase FtsK; Provisional
693-823 5.54e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 5.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263   741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412588  772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263   821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
88-161 6.50e-21

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 87.99  E-value: 6.50e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588   88 MVHILTSVVGSKCEVQVKNGGIYEGVFKTYSP--KCDLVLDAAHEKSTESSSG--PKREEIMESVLFKCSDFVVVQFK 161
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASLekDFGVVLKMARRIKKSNGSGlnPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
228-289 9.11e-16

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 72.60  E-value: 9.11e-16
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588  228 YGVVSTYDSSLssYTVPLERdNSEEFLKREARANQLAEEIESSAQYKARVALENDDR------SEEEK 289
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERGLDvddsglDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
763-935 9.70e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.12  E-value: 9.70e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSM-----------VGHQQPAPVyTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYR 831
Cdd:pfam09770  169 KAAAPAPAPQPAAQPASLPapsrkmmsleeVEAAMRAQA-KKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  832 AVPNMPQQRQDQHHQ-STMMHPASAAGPPIVATPPaystqyvAYSPQQFPNQPLVQHVPHYQSQHPHVYSPV---IQGNA 907
Cdd:pfam09770  248 QQPQQPQQHPGQGHPvTILQRPQSPQPDPAQPSIQ-------PQAQQFHQQPPPVPVQPTQILQNPNRLSAArvgYPQNP 320
                          170       180
                   ....*....|....*....|....*...
gi 1720412588  908 RMMAPPAHAQPGLVSSSAAQFGAHEQTH 935
Cdd:pfam09770  321 QPGVQPAPAHQAHRQQGSFGRQAPIITH 348
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-1078 1.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  762 PKPSTTPTSPRPQAQPSP--SMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQ------PLYPIPMTPMPVNQAKTYRAV 833
Cdd:PHA03247  2674 AQASSPPQRPRRRAARPTvgSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAaarqasPALPAAPAPPAVPAGPATPGG 2753
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  834 PNMPQQRQDQHHQSTMMHPAS-AAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPhyqSQHPHVYSPVIQGNARMMAP 912
Cdd:PHA03247  2754 PARPARPPTTAGPPAPAPPAApAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA---VLAPAAALPPAASPAGPLPP 2830
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  913 PAHAQPGLVSSSAAQFGAHEQTHAMYA---------CPKLPYNKETSPSF---------YFAISTGSLAQQyahPNAALH 974
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrpPSRSPAAKPAAPARppvrrlarpAVSRSTESFALP---PDQPER 2907
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  975 PHTPHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSfpaa 1054
Cdd:PHA03247  2908 PPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPA---- 2983
                          330       340
                   ....*....|....*....|....
gi 1720412588 1055 qqtvftihPSHVQPAYTTPPHMAH 1078
Cdd:PHA03247  2984 --------PSREAPASSTPPLTGH 2999
PHA03247 PHA03247
large tegument protein UL36; Provisional
763-1151 1.09e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  763 KPSTTPTSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQd 842
Cdd:PHA03247  2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR- 2666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  843 qhhQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQP---- 918
Cdd:PHA03247  2667 ---ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPappa 2743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  919 ---GLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPSfyFAISTGSLAQQYAHPNAALHPHTPHPQPSATPtGQQQSQHG 995
Cdd:PHA03247  2744 vpaGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVL-APAAALPP 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  996 GSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAP----------TPPSMTPASNTQSPQSSFPAAQQtvftihPSH 1065
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPggdvrrrppsRSPAAKPAAPARPPVRRLARPAV------SRS 2894
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588 1066 VQPAYTTPPHMAHVPQAHVQSGMVPSHPTAHAPMMLMTTQPPGGPQAALA---QSALQPIPVSTTAHFPYMTHPSGEACV 1142
Cdd:PHA03247  2895 TESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLApttDPAGAGEPSGAVPQPWLGALVPGRVAV 2974

                   ....*....
gi 1720412588 1143 CRGRRGTPS 1151
Cdd:PHA03247  2975 PRFRVPQPA 2983
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
734-899 6.59e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 50.19  E-value: 6.59e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  734 EKKDTteqvRKSTLNpnaKEFNPRSFSQPKPSTTptSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQP 813
Cdd:TIGR01628  362 QRKEQ----RRAHLQ---DQFMQLQPRMRQLPMG--SPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPL 432
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  814 LypiPMTPMPVNQAktyRAVPNMPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQS 893
Cdd:TIGR01628  433 R---PNGLAPMNAV---RAPSRNAQNAAQKPPMQPVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQK 506
                          170
                   ....*....|..
gi 1720412588  894 Q------HPHVY 899
Cdd:TIGR01628  507 QvlgerlFPLVE 518
PHA03378 PHA03378
EBNA-3B; Provisional
745-1098 1.19e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.68  E-value: 1.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  745 STLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPVytQPVCFAPNMMYPVPVSPGVQPLYPIPMTPMPV 824
Cdd:PHA03378   583 SQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPL--RPIPMRPLRMQPITFNVLVFPTPHQPPQVEIT 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  825 NQAKTYRAVPNMPQQRQDQHHqSTMMHPASAagPPIVATPPAYSTQYvaySPQQFPNQPlvqhvphyqSQHPHvyspviq 904
Cdd:PHA03378   661 PYKPTWTQIGHIPYQPSPTGA-NTMLPIQWA--PGTMQPPPRAPTPM---RPPAAPPGR---------AQRPA------- 718
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  905 gNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYACPKLPynketspsfyfAISTGSLAQQYAHPNAAlhphTPHPQPSA 984
Cdd:PHA03378   719 -AATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP-----------AAAPGRARPPAAAPGAP----TPQPPPQA 782
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  985 TPTGQQQSQHGgshPAPSPVQHHQHQAAQALHLASPQQQSAIYH----------------------------AGLAPTPP 1036
Cdd:PHA03378   783 PPAPQQRPRGA---PTPQPPPQAGPTSMQLMPRAAPGQQGPTKQilrqlltggvkrgrpslkkpaalerqaaAGPTPSPG 859
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720412588 1037 SMTPASNTQSPQSSFPAAQqtvftihPSHV--QPAYTTPPHMAHVPQAHVQ-----SGMVPSHPTAHAP 1098
Cdd:PHA03378   860 SGTSDKIVQAPVFYPPVLQ-------PIQVmrQLGSVRAAAASTVTQAPTEytgerRGVGPMHPTDIPP 921
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
704-1074 2.95e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.61  E-value: 2.95e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  704 NAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQPKPSTTPTSPRPQAQPSPSMVG 783
Cdd:pfam03154  127 NDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSV 206
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  784 HQQPAPVYTQPVCFAPNMMYPV-------------------PVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQh 844
Cdd:pfam03154  207 PPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTG- 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  845 hQSTMMHPASAAGPPIVAT------PPAYSTQyVAYSPQQFPNQPLVQhvPHYQSQHPHVYSPVIQGNARM---MAPPAH 915
Cdd:pfam03154  286 -PSHMQHPVPPQPFPLTPQssqsqvPPGPSPA-APGQSQQRIHTPPSQ--SQLQSQQPPREQPLPPAPLSMphiKPPPTT 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  916 AQPGLVSSSAAQFGAHEQTHAMYacpKLPYNKETSPSFYFAISTGSLAQQYAHPNA-ALHPHT-PHPQPSATPTGQQQSQ 993
Cdd:pfam03154  362 PIPQLPNPQSHKHPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSAHPPPlQLMPQSqQLPPPPAQPPVLTQSQ 438
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  994 hggSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTP 1073
Cdd:pfam03154  439 ---SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLP 515

                   .
gi 1720412588 1074 P 1074
Cdd:pfam03154  516 P 516
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
740-1004 3.21e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 48.49  E-value: 3.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  740 EQVRKSTLNPNAKefnprsfSQPKPSTTPTSPRPQAQPSPsmvghqQPAPVYTQPVCFA-PNMMYPVPVSP--------G 810
Cdd:pfam09770   98 EQVRFNRQQPAAR-------AAQSSAQPPASSLPQYQYAS------QQSQQPSKPVRTGyEKYKEPEPIPDlqvdaslwG 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  811 VQPLYPIPMTPMPVNQAktyravpnmpQQRQDQHHQSTMM-------------HPASAAGPPIVATPPAYSTQYVAYSPQ 877
Cdd:pfam09770  165 VAPKKAAAPAPAPQPAA----------QPASLPAPSRKMMsleeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQ 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  878 QFPNQPLVQHVPHYQSQHPhvysPVIQGNARMMAPPAHAQPGLVSSSAAQFGAHEQTHAMYACPKLPynketSPSfyfai 957
Cdd:pfam09770  235 QFPPQIQQQQQPQQQPQQP----QQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPV-----QPT----- 300
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720412588  958 stgslaQQYAHPN---AALHPHTPHPQPSATPT-GQQQSQHGGSHPAPSPV 1004
Cdd:pfam09770  301 ------QILQNPNrlsAARVGYPQNPQPGVQPApAHQAHRQQGSFGRQAPI 345
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
841-1105 5.91e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 5.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  841 QDQHHQSTMMHPASAAGPPIVATPPAYSTQY-VAYSPQQFPNQPLVQHVPHYQSQHP----HVySPVIQGNARMMAPPAH 915
Cdd:pfam09770   96 EEEQVRFNRQQPAARAAQSSAQPPASSLPQYqYASQQSQQPSKPVRTGYEKYKEPEPipdlQV-DASLWGVAPKKAAAPA 174
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  916 AQPGLVSSSAAQFGAH-------EQTHAMYACPKLPYNKETSPSFYFAistgslaQQYAHPNAALHPHTPHPQPSATPTG 988
Cdd:pfam09770  175 PAPQPAAQPASLPAPSrkmmsleEVEAAMRAQAKKPAQQPAPAPAQPP-------AAPPAQQAQQQQQFPPQIQQQQQPQ 247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  989 QQQSQHGGSHPAPSPVQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPqSSFPAAQQTVFTIHPSHVQP 1068
Cdd:pfam09770  248 QQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNP-NRLSAARVGYPQNPQPGVQP 326
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 1720412588 1069 AyttPPHMAHvPQAHVQSGMVPSHpTAHAPMMLMTTQ 1105
Cdd:pfam09770  327 A---PAHQAH-RQQGSFGRQAPII-THPQQLAQLSEE 358
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
399-606 9.34e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.79  E-value: 9.34e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  399 QSGPNSLPPRAAT----PTRPPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSPKAQRHPRNHRVSAGRGSM 474
Cdd:PRK12323   367 QSGGGAGPATAAAapvaQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  475 SSGlefvshnPPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP--------SGPVLASPQAGII 546
Cdd:PRK12323   447 APA-------PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAPA 519
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  547 PAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQnsPAGSKENVKASETSPSF 606
Cdd:PRK12323   520 GWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPV--VAPRPPRASASGLPDMF 577
PRK10263 PRK10263
DNA translocase FtsK; Provisional
760-1048 1.08e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 1.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  760 SQPKPSTTPTSPRPQAQPSPsmvGHQQPAPVYT-QPVCFAPNMMYPVPVSPGVQPLypipMTPMPVNQAKTYRAVPNMPQ 838
Cdd:PRK10263   345 PVASVDVPPAQPTVAWQPVP---GPQTGEPVIApAPEGYPQQSQYAQPAVQYNEPL----QQPVQPQQPYYAPAAEQPAQ 417
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  839 QRQDQHHQSTmmhPASAAGPPIVATPPAYSTQYVAYspqqfPNQPLVQHVPHYQSQHPHVySPVIQgnarmmaPPAHAQP 918
Cdd:PRK10263   418 QPYYAPAPEQ---PAQQPYYAPAPEQPVAGNAWQAE-----EQQSTFAPQSTYQTEQTYQ-QPAAQ-------EPLYQQP 481
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  919 GLVsssaaqfgahEQTHAMYACPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALhpHTPHPQPSATPTGQQQSQHGGSH 998
Cdd:PRK10263   482 QPV----------EQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAW--YQPIPEPVKEPEPIKSSLKAPSV 549
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720412588  999 PAPSPVQHHQHQAAQALHLaspqqQSAIYHAGLAPTP--PSMTPASN-TQSPQ 1048
Cdd:PRK10263   550 AAVPPVEAAAAVSPLASGV-----KKATLATGAAATVaaPVFSLANSgGPRPQ 597
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
756-1128 1.19e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 1.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  756 PRSFSQPKPSTTPTSPRPQAQPSPSMVGHQQPAPvytqpvcfAPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPN 835
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPA 471
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  836 MPQQRQDQHHQSTMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQpLVQHVPHYQ-------SQHPHVYSpvIQGN-- 906
Cdd:PRK07764   472 AAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPE-ILAAVPKRSrktwailLPEATVLG--VRGDtl 548
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  907 ---------ARMMAPPAHAQpGLVSSSAAQFGAHEQTHAMYACPKLPYNKETSPsfyfaistgslaqqyAHPNAALHPHT 977
Cdd:PRK07764   549 vlgfstgglARRFASPGNAE-VLVTALAEELGGDWQVEAVVGPAPGAAGGEGPP---------------APASSGPPEEA 612
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  978 PHPQPSATPTGQQQSQHGGSHPAPSPVQHHQHQAAQALHL----------ASPQQQSAIYHAGLAPTPPSMTPASNTQSP 1047
Cdd:PRK07764   613 ARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHhpkhvavpdaSDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588 1048 QSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQA-----HVQSGMVPSHPTAHAPMMLMTT--QPPGGPQAALAQSALQ 1120
Cdd:PRK07764   693 PAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDPPDPAGApaQPPPPPAPAPAAAPAA 772

                   ....*...
gi 1720412588 1121 PIPVSTTA 1128
Cdd:PRK07764   773 APPPSPPS 780
PHA03247 PHA03247
large tegument protein UL36; Provisional
398-1004 6.22e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 6.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  398 YQSGPNSLPPRAATPtrpPSRPPSRPSRPPSHPSAHGSPAPVSTMPKRMSSEGPPRMSpkaqrhprnhrvsagrgSMSSG 477
Cdd:PHA03247  2480 YRRPAEARFPFAAGA---APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRML-----------------TWIRG 2539
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  478 LEFVSHN------PPSEAAAPPVARTSPAGGTWSSVVSGVPRLSPKTHRPRSPRQSSIGNSP---SGPVLASPQAGIIPA 548
Cdd:PHA03247  2540 LEELASDdagdppPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPvddRGDPRGPAPPSPLPP 2619
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  549 EAVSMPVPAASPTPAS---PASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGMSPVVSEHRKQI 625
Cdd:PHA03247  2620 DTHAPDPPPPSPSPAAnepDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLA 2699
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  626 DDLKKFKNDFRLQPSSTSESMDQLLSKNREGEKSRDLIKDKTEASAKDSFIDSSSSSSNCTSGSSKTNSPSiSPSMLSNA 705
Cdd:PHA03247  2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA-PPAAPAAG 2778
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  706 EHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEqvRKSTLNPNAKefnPRSFSQPKPSTTPTSPRPQAQPSPSMVG-- 783
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA--PAAALPPAAS---PAGPLPPPTSAQPTAPPPPPGPPPPSLPlg 2853
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  784 --------------HQQPAPVYTQPVCFAPNMMY--PVPVSPGVQPLYPIPMTPMPVNQAKTY-RAVPNMPQQRQDQHHQ 846
Cdd:PHA03247  2854 gsvapggdvrrrppSRSPAAKPAAPARPPVRRLArpAVSRSTESFALPPDQPERPPQPQAPPPpQPQPQPPPPPQPQPPP 2933
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  847 STMMHPASAAgPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPhvySPVIQGNARMMAPPAHAQPGLVSSSAA 926
Cdd:PHA03247  2934 PPPPRPQPPL-APTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQP---APSREAPASSTPPLTGHSLSRVSSWAS 3009
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  927 QFGAHEQTHAMYACPK----LPYNKETSPSFYFAISTGSLAQQYA---HPNAALHPHTPHPQPSATPTGQQQSQHggSHP 999
Cdd:PHA03247  3010 SLALHEETDPPPVSLKqtlwPPDDTEDSDADSLFDSDSERSDLEAldpLPPEPHDPFAHEPDPATPEAGARESPS--SQF 3087

                   ....*
gi 1720412588 1000 APSPV 1004
Cdd:PHA03247  3088 GPPPL 3092
PHA03247 PHA03247
large tegument protein UL36; Provisional
768-1131 9.47e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  768 PTSPRPQAQPSPsmvghqqpAPVYTQPvcfaPNMMYPVPVSPGVQPLYPIPMTPMPVNQAKTYRAVPNMPQQRQdqhhQS 847
Cdd:PHA03247  2551 PPPPLPPAAPPA--------APDRSVP----PPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPA----PP 2614
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  848 TMMHPASAAGPPIVATPPAYSTQYVAYSPQQFPNQPLVQHVPHYQSQHPHVYSPVIQGNARMMAPPAHAQPGLVSSSAAQ 927
Cdd:PHA03247  2615 SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGS 2694
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  928 FGAHEQTHAMyacPKLPYNKETSPSFYFAISTGSLAQQYAHPNAALHPHTP----HPQPSATPTGQQQSQHGGSHPAPSP 1003
Cdd:PHA03247  2695 LTSLADPPPP---PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPavpaGPATPGGPARPARPPTTAGPPAPAP 2771
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588 1004 VQHHQHQAAQALHLASPQQQSAIYHAGLAPTPPSMTPASNTQSPQSSFPAAQQTVFTIHPSHVQPAYTTPPHMAHVPQAH 1083
Cdd:PHA03247  2772 PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1720412588 1084 VQSGMVPSHPTAHAPMMLMTTQPPGGPQAALAQSALQPIPVSTTAHFP 1131
Cdd:PHA03247  2852 LGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA 2899
DUF3498 pfam12004
Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. ...
468-617 3.46e-03

Domain of unknown function (DUF3498); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 433 to 538 amino acids in length. This domain is found associated with pfam00616, pfam00168. This domain has two conserved sequence motifs: DLQ and PLSFQNP.


Pssm-ID: 463427 [Multi-domain]  Cd Length: 511  Bit Score: 41.28  E-value: 3.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  468 SAGRGSMSSGLE-FVSHNPPSEAAAPPVARTSPAGGTWSSVVS--GVPR----------LSPKTHRPRSPRQSSIGnsPS 534
Cdd:pfam12004  196 PRGLGSPDSSSEtHSSFSSHSNSEDLSSAAANKKSGPSNSSYSedFARRsteftrrqlsLTELQHQPAVPRQNSAG--PQ 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  535 GPVLASPQAGIIPAEAVSMPVPAASPTPASPASNRALTPSIEAKDSRLQDQRQNSPAGSKENVKASETSPSFSKADNKGM 614
Cdd:pfam12004  274 RRIDQQGLGGPPLTRGRTPPSLLNSASYPRPSSGSLMSSSPDWPPARLRQQSSSSKGDSPETKQRTQHQQVPSPVNPSTL 353

                   ...
gi 1720412588  615 SPV 617
Cdd:pfam12004  354 SPV 356
Sm_like cd00600
Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to ...
92-159 4.17e-03

Sm and related proteins; The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes.


Pssm-ID: 212462 [Multi-domain]  Cd Length: 63  Bit Score: 36.84  E-value: 4.17e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720412588   92 LTSVVGSKCEVQVKNGGIYEGVFKTYSPKCDLVLDAAHEKSTEsssgpKREEIMESVLFKCSDFVVVQ 159
Cdd:cd00600      1 LKDFIGKTVSVELKDGRVLTGTLVAFDKYMNLVLDDVVETGRD-----GKVRVLGLVLIRGSNIVSIR 63
PAM2 pfam07145
Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various ...
743-758 4.85e-03

Ataxin-2 C-terminal region; The PABP-interacting motif PAM2 has been identified in various eukaryotic proteins as an important binding site for pfam00658. It has been found in a wide range of eukaryotic proteins. Strikingly, this motif appears to occur solely outside of globular domains.


Pssm-ID: 429316  Cd Length: 17  Bit Score: 35.67  E-value: 4.85e-03
                           10
                   ....*....|....*.
gi 1720412588  743 RKSTLNPNAKEFNPRS 758
Cdd:pfam07145    1 SKSKLNPNAKEFVPSF 16
PRK10263 PRK10263
DNA translocase FtsK; Provisional
693-823 5.54e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 5.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  693 NSPSISPSMLSNAEHKRGPEVTSQGVQTSSPACKQEKDDREEKKDTTEQVRKSTLNPNAKEFNPRSFSQP-KPSTTPTSP 771
Cdd:PRK10263   741 HEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPvAPQPQYQQP 820
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720412588  772 RPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSPGVQPLYPIP----MTPMP 823
Cdd:PRK10263   821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPsldlLTPPP 876
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
735-845 7.38e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.53  E-value: 7.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720412588  735 KKDTTEQVrkSTLNPNAKEFN---PRSFSQPKPSTTPtSPRPQAQPSPSMVGHQQPAPVYTQPVCFAPNMMYPVPVSpgv 811
Cdd:PRK14971   380 KPVFTQPA--AAPQPSAAAAAspsPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFK--- 453
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1720412588  812 qPLYPIPMTPMPVNQAKTYRAVPNMPQQRQDQHH 845
Cdd:PRK14971   454 -EEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIK 486
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH