NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1908918706|ref|NP_001374111|]
View 

ataxin-2-like protein isoform 18 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.63e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


:

Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.63e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.55e-17

LsmAD domain; This domain is found associated with Lsm domain.


:

Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.55e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
835-997 5.18e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.89  E-value: 5.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  835 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQL--HAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGS 912
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHpgQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  913 GQPQQNLYHPgaltgtppSLPPGPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphPP 992
Cdd:pfam09770  297 VQPTQILQNP--------NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------PQ 350

                   ....*
gi 1908918706  993 QVMLL 997
Cdd:pfam09770  351 QLAQL 355
PHA03247 super family cl33720
large tegument protein UL36; Provisional
336-948 2.26e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  336 GRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPgpgsearginggPSRMSPKAQRP 415
Cdd:PHA03247  2513 SRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVP------------PPRPAPRPSEP 2580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  416 LRGAKtlsspSNRPSgetsvPPPPAAPPFLPVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTSSasipvtssvsdpgv 495
Cdd:PHA03247  2581 AVTSR-----ARRPD-----APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN-------------- 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  496 gsiSPASPKISLAPTDVKELSTKEPGRTLEPQElARIAGKVPGLQNEQKRFQLEELRKfgaqfklqPSSSPENSLDPFPP 575
Cdd:PHA03247  2637 ---EPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARP--------TVGSLTSLADPPPP 2704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  576 rilkeepkgkekevdglltsepmgspvssktesvsdkEDKPPLAPSGGTEGPEQPPPPCPSQTGSPPVGLikgedkdeGP 655
Cdd:PHA03247  2705 -------------------------------------PPTPEPAPHALVSATPLPPGPAAARQASPALPA--------AP 2739
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  656 VAEQVKKSTLNPNAKEFNPTKPLlsvnksTSTPTSPGPrthstPSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMY 735
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPARPPT------TAGPPAPAP-----PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  736 PYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaaaagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyP 815
Cdd:PHA03247  2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG----------------PPPPSLPLGGSVAPGGDVRRRPP---S 2868
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  816 SQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTPQAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQ 893
Cdd:PHA03247  2869 RSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQPQA---------PPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918706  894 HAAPSPVQHQAGQA-PHLGSGQPQQNLYHPGALTGTPPSLP-PGPSAQSPQSSFPQP 948
Cdd:PHA03247  2940 QPPLAPTTDPAGAGePSGAVPQPWLGALVPGRVAVPRFRVPqPAPSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.63e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.63e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.55e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.55e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
835-997 5.18e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.89  E-value: 5.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  835 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQL--HAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGS 912
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHpgQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  913 GQPQQNLYHPgaltgtppSLPPGPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphPP 992
Cdd:pfam09770  297 VQPTQILQNP--------NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------PQ 350

                   ....*
gi 1908918706  993 QVMLL 997
Cdd:pfam09770  351 QLAQL 355
PHA03247 PHA03247
large tegument protein UL36; Provisional
336-948 2.26e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  336 GRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPgpgsearginggPSRMSPKAQRP 415
Cdd:PHA03247  2513 SRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVP------------PPRPAPRPSEP 2580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  416 LRGAKtlsspSNRPSgetsvPPPPAAPPFLPVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTSSasipvtssvsdpgv 495
Cdd:PHA03247  2581 AVTSR-----ARRPD-----APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN-------------- 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  496 gsiSPASPKISLAPTDVKELSTKEPGRTLEPQElARIAGKVPGLQNEQKRFQLEELRKfgaqfklqPSSSPENSLDPFPP 575
Cdd:PHA03247  2637 ---EPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARP--------TVGSLTSLADPPPP 2704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  576 rilkeepkgkekevdglltsepmgspvssktesvsdkEDKPPLAPSGGTEGPEQPPPPCPSQTGSPPVGLikgedkdeGP 655
Cdd:PHA03247  2705 -------------------------------------PPTPEPAPHALVSATPLPPGPAAARQASPALPA--------AP 2739
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  656 VAEQVKKSTLNPNAKEFNPTKPLlsvnksTSTPTSPGPrthstPSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMY 735
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPARPPT------TAGPPAPAP-----PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  736 PYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaaaagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyP 815
Cdd:PHA03247  2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG----------------PPPPSLPLGGSVAPGGDVRRRPP---S 2868
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  816 SQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTPQAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQ 893
Cdd:PHA03247  2869 RSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQPQA---------PPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918706  894 HAAPSPVQHQAGQA-PHLGSGQPQQNLYHPGALTGTPPSLP-PGPSAQSPQSSFPQP 948
Cdd:PHA03247  2940 QPPLAPTTDPAGAGePSGAVPQPWLGALVPGRVAVPRFRVPqPAPSREAPASSTPPL 2996
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
876-951 1.47e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 1.47e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1908918706  876 AHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAphlgSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSSFPQPAAV 951
Cdd:PRK14971   389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQP----AGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPV 460
Herpes_TAF50 pfam03326
Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 ...
664-922 1.27e-03

Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses.


Pssm-ID: 308764 [Multi-domain]  Cd Length: 568  Bit Score: 42.76  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  664 TLNPNAKEFNPTKPLLSVNKSTSTPTSPGPRTHSTPSIPVLTAGQSGLYSPQYISYipQIHMGPAVQAPQMYPYPvsnsv 743
Cdd:pfam03326  310 TLVPIAGSTGVTEVVSYGHNSTSPSSTPCPSTAVTEADHQTEPEVPWIATAHQESD--QRPIGPGPEKPTFLPPV----- 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  744 pgqqGKYRGAKGsLPPQRSDQHQPAsappmmQAAAAAGPPLVAATPYSSYIPYNPQQFPGQPammqpmahypsqpvfapm 823
Cdd:pfam03326  383 ----GGKQFFQG-LRDSRSTSFLTA------PEATSAISDVFQGTEVCQPKRIRALHPPGSP------------------ 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  824 lqSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYAT--VHQSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQ 901
Cdd:pfam03326  434 --SANRPLPSSLAPTPTGPVHEPGSSLTPATVPQPLDAApvATPEASHELQPPDEETPQPLDEDQALCGQQDASHPPPRG 511
                          250       260
                   ....*....|....*....|.
gi 1908918706  902 HQAGQAPHLGSGQPQQNLYHP 922
Cdd:pfam03326  512 QLDELTTTLESMTEDLNLDSP 532
 
Name Accession Description Interval E-value
SM-ATX pfam14438
Ataxin 2 SM domain; This SM domain is found in Ataxin-2.
123-196 1.63e-19

Ataxin 2 SM domain; This SM domain is found in Ataxin-2.


Pssm-ID: 464173  Cd Length: 78  Bit Score: 83.76  E-value: 1.63e-19
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  123 MLHFLTAVVGSTCDVKVKNGTTYEGIFKTLS--SKFELAVDAVHRKASE--PAGGPRREDIVDTMVFKPSDVMLVHFR 196
Cdd:pfam14438    1 LLFLLTSLVGLVVEVTTKNGEVYEGIFSTASleKDFGVVLKMARRIKKSngSGLNPVRGEIVDTMIFPAKDIVDIEAK 78
LsmAD pfam06741
LsmAD domain; This domain is found associated with Lsm domain.
264-326 1.55e-17

LsmAD domain; This domain is found associated with Lsm domain.


Pssm-ID: 461998 [Multi-domain]  Cd Length: 65  Bit Score: 77.61  E-value: 1.55e-17
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1908918706  264 YGVKTTYDSSLssYTVPLEKdNSEEFRQRELRAAQLAREIESSPQYRLRIAMEN-----DDGRTEEEK 326
Cdd:pfam06741    1 FGVKSTYDENL--YTTKLDR-SSPDYKEREAEAERIAREIEGSASTNAHVAEERgldvdDSGLDEEDK 65
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
835-997 5.18e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 53.89  E-value: 5.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  835 SHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQL--HAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGS 912
Cdd:pfam09770  217 APAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHpgQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVP 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  913 GQPQQNLYHPgaltgtppSLPPGPSAQSPQSSFPQPAAVYAIHHQQLPHGFTNMAHVtQAHvqtgitaappphpgaphPP 992
Cdd:pfam09770  297 VQPTQILQNP--------NRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITH-----------------PQ 350

                   ....*
gi 1908918706  993 QVMLL 997
Cdd:pfam09770  351 QLAQL 355
PHA03247 PHA03247
large tegument protein UL36; Provisional
336-948 2.26e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  336 GRESPSLASREGKYIPLPQRVREGPRGGVRCSSSRGGRPGLSSLPPRGPHHLDNSSPgpgsearginggPSRMSPKAQRP 415
Cdd:PHA03247  2513 SRLAPAILPDEPVGEPVHPRMLTWIRGLEELASDDAGDPPPPLPPAAPPAAPDRSVP------------PPRPAPRPSEP 2580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  416 LRGAKtlsspSNRPSgetsvPPPPAAPPFLPVGRMYPPRSPKSAAPAPISASCPEPPIGSAVPTSSasipvtssvsdpgv 495
Cdd:PHA03247  2581 AVTSR-----ARRPD-----APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAAN-------------- 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  496 gsiSPASPKISLAPTDVKELSTKEPGRTLEPQElARIAGKVPGLQNEQKRFQLEELRKfgaqfklqPSSSPENSLDPFPP 575
Cdd:PHA03247  2637 ---EPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARP--------TVGSLTSLADPPPP 2704
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  576 rilkeepkgkekevdglltsepmgspvssktesvsdkEDKPPLAPSGGTEGPEQPPPPCPSQTGSPPVGLikgedkdeGP 655
Cdd:PHA03247  2705 -------------------------------------PPTPEPAPHALVSATPLPPGPAAARQASPALPA--------AP 2739
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  656 VAEQVKKSTLNPNAKEFNPTKPLlsvnksTSTPTSPGPrthstPSIPVlTAGQSGLYSPQYISYIPQIHMGPAVQAPQMY 735
Cdd:PHA03247  2740 APPAVPAGPATPGGPARPARPPT------TAGPPAPAP-----PAAPA-AGPPRRLTRPAVASLSESRESLPSPWDPADP 2807
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  736 PYPVSNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPmmqaaaaagpplvaatPYSSYIPYNPQQFPGQPAMMQPMahyP 815
Cdd:PHA03247  2808 PAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG----------------PPPPSLPLGGSVAPGGDVRRRPP---S 2868
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  816 SQPVFAPMLQSNP--RMLTSGSHPQAIVSSSTPQYPSAEQPTPQAlyatvhqsyPHHATQLHAHQPQPATTPTGSQPQSQ 893
Cdd:PHA03247  2869 RSPAAKPAAPARPpvRRLARPAVSRSTESFALPPDQPERPPQPQA---------PPPPQPQPQPPPPPQPQPPPPPPPRP 2939
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918706  894 HAAPSPVQHQAGQA-PHLGSGQPQQNLYHPGALTGTPPSLP-PGPSAQSPQSSFPQP 948
Cdd:PHA03247  2940 QPPLAPTTDPAGAGePSGAVPQPWLGALVPGRVAVPRFRVPqPAPSREAPASSTPPL 2996
PHA03247 PHA03247
large tegument protein UL36; Provisional
674-1060 2.72e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 2.72e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  674 PTKPLLSVNKSTSTPTSPGPrTHSTPSIPVLTAGQSGLYSPQYISYIPQIHMGPAVQAPQMYPYPVSNSVPGQ-QGKYRG 752
Cdd:PHA03247  2595 SARPRAPVDDRGDPRGPAPP-SPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRaRRLGRA 2673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  753 AKGSLPPQRSdqHQPASAPPMMQAAAAAGPPLVAATPYssyiPYNPQQFPGQPAMMQPMAHYPSQPVfAPMLQSNPRMLT 832
Cdd:PHA03247  2674 AQASSPPQRP--RRRAARPTVGSLTSLADPPPPPPTPE----PAPHALVSATPLPPGPAAARQASPA-LPAAPAPPAVPA 2746
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  833 SGSHPQAIVSSSTPQYPSA-EQPTPQALYATVhqsyPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPhlG 911
Cdd:PHA03247  2747 GPATPGGPARPARPPTTAGpPAPAPPAAPAAG----PPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--P 2820
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  912 SGQPQQNLYHPGALTGTPPSLPPGPSAQS------------------PQSSFPQPAAVYAIHHQQLPHGFTNMAHVTQAH 973
Cdd:PHA03247  2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSlplggsvapggdvrrrppSRSPAAKPAAPARPPVRRLARPAVSRSTESFAL 2900
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  974 VQTGITAAPPPHPGAPHPPQVMLLHPPQSHGGPPQGAVPQSGVPALSASTPSPYPYIGHPQGEQPGQAPGFPGGADDRIP 1053
Cdd:PHA03247  2901 PPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVP 2980

                   ....*..
gi 1908918706 1054 PLPPPGE 1060
Cdd:PHA03247  2981 QPAPSRE 2987
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
658-922 1.50e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 1.50e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  658 EQVKKSTLNPNAKEFNPTKPLLSVNKSTSTPTSPGPrthSTPSIPVLTagqsGLYSPQYISYIPQIH-------MGPAVQ 730
Cdd:pfam09770   98 EQVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQS---QQPSKPVRT----GYEKYKEPEPIPDLQvdaslwgVAPKKA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  731 APQMYPYPVSNSVPGQQGKYRG--------AKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQQFP 802
Cdd:pfam09770  171 AAPAPAPQPAAQPASLPAPSRKmmsleeveAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQP 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  803 GQPAMMQPMAHYPSQpvfapmLQsnpRMlTSGSHPQAIVSSSTPQYPSAEQPTPQalyatvhqsyPHHATQLHAHQPQPA 882
Cdd:pfam09770  251 QQPQQHPGQGHPVTI------LQ---RP-QSPQPDPAQPSIQPQAQQFHQQPPPV----------PVQPTQILQNPNRLS 310
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 1908918706  883 TTPTGSQPQSQHAAPSPVQHQAGQAPHLGSGQPQQnLYHP 922
Cdd:pfam09770  311 AARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPI-ITHP 349
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
797-1014 5.28e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 5.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  797 NPQQFPGQPAMMQPMAHYPSQPVFAPMLQSNPRMLTSG----SHPQAI----VSSS------TPQYPSAEQPTPQALYAT 862
Cdd:pfam09770  106 QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGyekyKEPEPIpdlqVDASlwgvapKKAAAPAPAPQPAAQPAS 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  863 VHQSYPHHAT------QLHAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLPP-- 934
Cdd:pfam09770  186 LPAPSRKMMSleeveaAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVti 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  935 --GPSAQSPQSSFPQPAAVYAIHHQQLPhgfTNMAHVTQA-------HVQTGITAAPPPHPGAPHPPQVMLLHPPQSHGG 1005
Cdd:pfam09770  266 lqRPQSPQPDPAQPSIQPQAQQFHQQPP---PVPVQPTQIlqnpnrlSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQ 342

                   ....*....
gi 1908918706 1006 PPQGAVPQS 1014
Cdd:pfam09770  343 APIITHPQQ 351
PRK10263 PRK10263
DNA translocase FtsK; Provisional
667-947 5.61e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.39  E-value: 5.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  667 PNAKEFNP-------TKPLLSVNKSTS-TPTSPGPRTHSTPSIPVLTAgQSGLYSPQY-ISYIPQIHMGPAVQAPQMYPY 737
Cdd:PRK10263   302 PEYDEYDPllngapiTEPVAVAAAATTaTQSWAAPVEPVTQTPPVASV-DVPPAQPTVaWQPVPGPQTGEPVIAPAPEGY 380
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  738 PvsnsvPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQqfPGQPAMMQPMAHYPSQ 817
Cdd:PRK10263   381 P-----QQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA--PEQPVAGNAWQAEEQQ 453
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  818 PVFAPmlqsNPRMLTSGSHPQAIVSSSTPQYPSAEQPT-----------------PQALYATVHQSYPHHATQLHA-HQ- 878
Cdd:PRK10263   454 STFAP----QSTYQTEQTYQQPAAQEPLYQQPQPVEQQpvvepepvveetkparpPLYYFEEVEEKRAREREQLAAwYQp 529
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1908918706  879 -PQPATTP---TGSQPQSQHAAPSPVQHQAGQAPhLGSGQPQQNLYHPGALTGTPPSLPP----GPSAQSPQSSFPQ 947
Cdd:PRK10263   530 iPEPVKEPepiKSSLKAPSVAAVPPVEAAAAVSP-LASGVKKATLATGAAATVAAPVFSLansgGPRPQVKEGIGPQ 605
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
876-951 1.47e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 1.47e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1908918706  876 AHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAphlgSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSSFPQPAAV 951
Cdd:PRK14971   389 APQPSAAAAASPSPSQSSAAAQPSAPQSATQP----AGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPV 460
PRK10927 PRK10927
cell division protein FtsN;
847-950 4.68e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 43.52  E-value: 4.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  847 QYPSAEQpTPQALYATVH-QSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGSGQPQQNLYHPGAL 925
Cdd:PRK10927   138 EVPWNEQ-TPEQRQQTLQrQRQAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAH 216
                           90       100
                   ....*....|....*....|....*.
gi 1908918706  926 TGTPPSLP-PGPSAQSPQSsfPQPAA 950
Cdd:PRK10927   217 TTAQSKPQqAAPVTRAADA--PKPTA 240
PRK10263 PRK10263
DNA translocase FtsK; Provisional
799-948 7.28e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 7.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  799 QQFPG-QPAMMQPMAHypSQPVFAPMlqsnPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAH 877
Cdd:PRK10263   709 QRYSGeQPAGANPFSL--DDFEFSPM----KALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQ 782
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1908918706  878 QPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGSGQPQQNLYHPgaltgTPPSLPPGPSAQSPQSSFPQP 948
Cdd:PRK10263   783 QPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQ-----PQYQQPQQPVAPQPQDTLLHP 848
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
740-1058 9.99e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 9.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  740 SNSVPGQQGKYRGAKGSLPPQRSDQHQPASAPPMMQAAAAAGPPLVAATPYSSYIPYNPQQFPGQPAMMQPMAHYPSQPV 819
Cdd:pfam03154  145 SPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQST 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  820 FAPM------LQSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYATVHQSYPHHATQLHAHQPQP------ATTPTG 887
Cdd:pfam03154  225 AAPHtliqqtPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPvppqpfPLTPQS 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  888 SQPQSQHAAPSPVQHQAGQAPHLGSGQPQQNLYHPGALTGTPPSLPPGPSAQSPQSS----FPQPAAVYAIHHQQLPHGF 963
Cdd:pfam03154  305 SQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTpipqLPNPQSHKHPPHLSGPSPF 384
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  964 TNMAHVTQAHVQTGITAAPPPHPGAPHPPQVMLLhpPQSHGGPPQGAVP-----QSGVPALSASTPSPYPYIGHPQGEQP 1038
Cdd:pfam03154  385 QMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLM--PQSQQLPPPPAQPpvltqSQSLPPPAASHPPTSGLHQVPSQSPF 462
                          330       340
                   ....*....|....*....|
gi 1908918706 1039 GQAPGFPGGADDRIPPLPPP 1058
Cdd:pfam03154  463 PQHPFVPGGPPPITPPSGPP 482
Herpes_TAF50 pfam03326
Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 ...
664-922 1.27e-03

Herpesvirus transcription activation factor (transactivator); This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses.


Pssm-ID: 308764 [Multi-domain]  Cd Length: 568  Bit Score: 42.76  E-value: 1.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  664 TLNPNAKEFNPTKPLLSVNKSTSTPTSPGPRTHSTPSIPVLTAGQSGLYSPQYISYipQIHMGPAVQAPQMYPYPvsnsv 743
Cdd:pfam03326  310 TLVPIAGSTGVTEVVSYGHNSTSPSSTPCPSTAVTEADHQTEPEVPWIATAHQESD--QRPIGPGPEKPTFLPPV----- 382
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  744 pgqqGKYRGAKGsLPPQRSDQHQPAsappmmQAAAAAGPPLVAATPYSSYIPYNPQQFPGQPammqpmahypsqpvfapm 823
Cdd:pfam03326  383 ----GGKQFFQG-LRDSRSTSFLTA------PEATSAISDVFQGTEVCQPKRIRALHPPGSP------------------ 433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  824 lqSNPRMLTSGSHPQAIVSSSTPQYPSAEQPTPQALYAT--VHQSYPHHATQLHAHQPQPATTPTGSQPQSQHAAPSPVQ 901
Cdd:pfam03326  434 --SANRPLPSSLAPTPTGPVHEPGSSLTPATVPQPLDAApvATPEASHELQPPDEETPQPLDEDQALCGQQDASHPPPRG 511
                          250       260
                   ....*....|....*....|.
gi 1908918706  902 HQAGQAPHLGSGQPQQNLYHP 922
Cdd:pfam03326  512 QLDELTTTLESMTEDLNLDSP 532
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
876-1060 2.10e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 2.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  876 AHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHLGSGQPQQN--LYHPGALTGTPPSLPPGPSAQSPQSSFPQPAAVYA 953
Cdd:PRK07764   594 AAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPaeASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKA 673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  954 IHHQQlphgfTNMAHVTQAHVQTGITAAPPPHPGAPhppqvmllHPPQSHGGPPQGAVPQSGVPALSASTPSPYPYIGHP 1033
Cdd:PRK07764   674 GGAAP-----AAPPPAPAPAAPAAPAGAAPAQPAPA--------PAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                          170       180
                   ....*....|....*....|....*..
gi 1908918706 1034 QGEQPGQAPGFPGGADDRIPPLPPPGE 1060
Cdd:PRK07764   741 LPPEPDDPPDPAGAPAQPPPPPAPAPA 767
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
873-1059 4.41e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.17  E-value: 4.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  873 QLHAHQPQPATTPTGSQPQSQHAAPSPVQHQAGQAPHlgSGQP--------QQNLYHP-----GALTGTPPSLPPGPSAQ 939
Cdd:pfam09770   99 QVRFNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQ--PSKPvrtgyekyKEPEPIPdlqvdASLWGVAPKKAAAPAPA 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  940 SPQSSFP-------------------------QPAAVYAIHHQQLPHGftnmahvtQAHVQTGITAAPPPHPGAPHPPQV 994
Cdd:pfam09770  177 PQPAAQPaslpapsrkmmsleeveaamraqakKPAQQPAPAPAQPPAA--------PPAQQAQQQQQFPPQIQQQQQPQQ 248
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1908918706  995 MLLHPPQSHGGPPQGAV---PQSGVPALSASTPSPYPYIGH----PQGEQPGQA---PGFPGGADDRIPPLPPPG 1059
Cdd:pfam09770  249 QPQQPQQHPGQGHPVTIlqrPQSPQPDPAQPSIQPQAQQFHqqppPVPVQPTQIlqnPNRLSAARVGYPQNPQPG 323
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
374-702 5.05e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 5.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  374 PGLSSLPPRGPHHLDNSS----PGPGSEARGINGGPSRMSP----KAQRPLRGAKTLSSP--SNRPSGETSVPPPPaapp 443
Cdd:PTZ00449   514 PEASGLPPKAPGDKEGEEgeheDSKESDEPKEGGKPGETKEgevgKKPGPAKEHKPSKIPtlSKKPEFPKDPKHPK---- 589
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  444 flpvgRMYPPRSPKS--AAPAPISASCPEPPIGSAVPTSSASIPVTSSVSDPgVGSISPASPKISLAPTDVKE-LSTKEP 520
Cdd:PTZ00449   590 -----DPEEPKKPKRprSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRP-PPPQRPSSPERPEGPKIIKSpKPPKSP 663
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  521 GRTLEP---QELARIAGKVPGLQNEQKRFQL--EELRKFGAQFKLQPSSSPENSLDPFPPRIlkeepkgkekEVDGLLTS 595
Cdd:PTZ00449   664 KPPFDPkfkEKFYDDYLDAAAKSKETKTTVVldESFESILKETLPETPGTPFTTPRPLPPKL----------PRDEEFPF 733
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1908918706  596 EPMGSPVSsktESVSDKE-DKPPLAPSggtegpeqpPPPCPSQTGSPPVGLIKGEDKDEGPVAEqvkksTLNPNAKEFNP 674
Cdd:PTZ00449   734 EPIGDPDA---EQPDDIEfFTPPEEER---------TFFHETPADTPLPDILAEEFKEEDIHAE-----TGEPDEAMKRP 796
                          330       340
                   ....*....|....*....|....*...
gi 1908918706  675 TKPllsvnkSTSTPTSPGprTHstPSIP 702
Cdd:PTZ00449   797 DSP------SEHEDKPPG--DH--PSLP 814
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH